LOCUS PYAFW.TXT 5215 BP DS-DNA CIRCULAR SYN 10-MAR-1998 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_feature 4..>25 /note="CMV promoter fragment" misc_feature <41..>166 /note="URR fragment" misc_feature <167..>377 /note="E1 fragment" misc_signal 201..320 /note="Zhao 99 Virol packaging signal" enhancer 462..659 /note="EF-1alpha promoter core domain" promoter 462..1640 /note="EF1a promoter" exon 660..692 /note="EF-1a exon 1" intron 693..1632 /note="EF-1a intron A" exon 1632..1640 /note="EF-1a exon 2 leader" CDS 1757..2476 /note="EGFP" mRNA 2560..>3148 /note="WPRE" polyA_signal 3196..>3417 /note="SV40 late polyA" rep_origin 3713..4356 /note="pUC Ori" CDS complement(4363..<4758) /note="BlasticidinR" promoter complement(4777..4832) /note="EM7" misc_feature complement(4867..4867) /note="end homology to SV40" rep_origin complement(4867..5018) /note="SV40 Ori" promoter complement(4880..5189) /note="SV40 early promoter" misc_binding complement(4886..4969) /note="SV40 Ori core domain" misc_feature complement(4960..4966) /note="SV40 early promoter core" repeat_region complement(4978..5041) /note="SV40 21bp repeats" enhancer complement(5046..5188) /note="SV40 enhancer elements" BASE COUNT 1121 A 1412 C 1449 G 1233 T 0 OTHER ORIGIN - 1 CGCGTTGACA TTGATTATTG ACTAGTTAGT TATTACTAGC CGCGTGCAAA GCACCGGCGG 61 CGGTAGATGC GGGGTAAGTA CTGAATTTTA ATTCGACCTA TCCCGGTAAA GCGAAAGCGA 121 CACGCTTTTT TTTCACACAT AGCGGGACCG AACACGTTAT AAGTATCGAA CTCCTAAAGA 181 AGCAGTGTAG TTTTCTGCAG ATGCAAAAAA GATCTCATGA AGGAGGAACT TGTGCAGTTT 241 ACTTAATCTG CTTTAACACA GCTAAAAGCA GAGAAACAGT CCGGAATCTG ATGGCAAACA 301 CGCTAAATGT AAGAGAAGAG TGTTTGATGC TGCAGCCAGC TAAAATTCGA GGACTCAGCG 361 CAGCTCTATT CTGGTTTAGC TTGCAAAGAT GGATAAAGTT TTAAACAGAG AGGAATCTTT 421 GCAGCTAATG GACCTTGTAG GTCTTGAAAG GAGTGGGAAT TGGCTCCGGT GCCCGTCAGT 481 GGGCAGAGCG CACATCGCCC ACAGTCCCCG AGAAGTTGGG GGGAGGGGTC GGCAATTGAA 541 CCGGTGCCTA GAGAAGGTGG CGCGGGGTAA ACTGGGAAAG TGATGTCGTG TACTGGCTCC 601 GCCTTTTTCC CGAGGGTGGG GGAGAACCGT ATATAAGTGC AGTAGTCGCC GTGAACGTTC 661 TTTTTCGCAA CGGGTTTGCC GCCAGAACAC AGGTAAGTGC CGTGTGTGGT TCCCGCGGGC 721 CTGGCCTCTT TACGGGTTAT GGCCCTTGCG TGCCTTGAAT TACTTCCACC TGGCTGCAGT 781 ACGTGATTCT TGATCCCGAG CTTCGGGTTG GAAGTGGGTG GGAGAGTTCG AGGCCTTGCG 841 CTTAAGGAGC CCCTTCGCCT CGTGCTTGAG TTGAGGCCTG GCCTGGGCGC TGGGGCCGCC 901 GCGTGCGAAT CTGGTGGCAC CTTCGCGCCT GTCTCGCTGC TTTCGATAAG TCTCTAGCCA 961 TTTAAAATTT TTGATGACCT GCTGCGACGC TTTTTTTCTG GCAAGATAGT CTTGTAAATG 1021 CGGGCCAAGA TCTGCACACT GGTATTTCGG TTTTTGGGGC CGCGGGCGGC GACGGGGCCC 1081 GTGCGTCCCA GCGCACATGT TCGGCGAGGC GGGGCCTGCG AGCGCGGCCA CCGAGAATCG 1141 GACGGGGGTA GTCTCAAGCT GGCCGGCCTG CTCTGGTGCC TGGCCTCGCG CCGCCGTGTA 1201 TCGCCCCGCC CTGGGCGGCA AGGCTGGCCC GGTCGGCACC AGTTGCGTGA GCGGAAAGAT 1261 GGCCGCTTCC CGGCCCTGCT GCAGGGAGCT CAAAATGGAG GACGCGGCGC TCGGGAGAGC 1321 GGGCGGGTGA GTCACCCACA CAAAGGAAAA GGGCCTTTCC GTCCTCAGCC GTCGCTTCAT 1381 GTGACTCCAC GGAGTACCGG GCGCCGTCCA GGCACCTCGA TTAGTTCTCG AGCTTTTGGA 1441 GTACGTCGTC TTTAGGTTGG GGGGAGGGGT TTTATGCGAT GGAGTTTCCC CACACTGAGT 1501 GGGTGGAGAC TGAAGTTAGG CCAGCTTGGC ACTTGATGTA ATTCTCCTTG GAATTTGCCC 1561 TTTTTGAGTT TGGATCTTGG TTCATTCTCA AGCCTCAGAC AGTGGTTCAA AGTTTTTTTC 1621 TTCCATTTCA GGTGTCGTGA GGAATTCTCT AGAGATCCCT CGACCTCGAG ATCCATTGTG 1681 CTGGATCTGC GATCTAAGTA AGCTTCGAAT TCTGCAGTCG ACGGTACCGC GGGCCCGGGA 1741 TCCACCGGTC GCCACCATGG TGAGCAAGGG CGAGGAGCTG TTCACCGGGG TGGTGCCCAT 1801 CCTGGTCGAG CTGGACGGCG ACGTAAACGG CCACAAGTTC AGCGTGTCCG GCGAGGGCGA 1861 GGGCGATGCC ACCTACGGCA AGCTGACCCT GAAGTTCATC TGCACCACCG GCAAGCTGCC 1921 CGTGCCCTGG CCCACCCTCG TGACCACCCT GACCTACGGC GTGCAGTGCT TCAGCCGCTA 1981 CCCCGACCAC ATGAAGCAGC ACGACTTCTT CAAGTCCGCC ATGCCCGAAG GCTACGTCCA 2041 GGAGCGCACC ATCTTCTTCA AGGACGACGG CAACTACAAG ACCCGCGCCG AGGTGAAGTT 2101 CGAGGGCGAC ACCCTGGTGA ACCGCATCGA GCTGAAGGGC ATCGACTTCA AGGAGGACGG 2161 CAACATCCTG GGGCACAAGC TGGAGTACAA CTACAACAGC CACAACGTCT ATATCATGGC 2221 CGACAAGCAG AAGAACGGCA TCAAGGTGAA CTTCAAGATC CGCCACAACA TCGAGGACGG 2281 CAGCGTGCAG CTCGCCGACC ACTACCAGCA GAACACCCCC ATCGGCGACG GCCCCGTGCT 2341 GCTGCCCGAC AACCACTACC TGAGCACCCA GTCCGCCCTG AGCAAAGACC CCAACGAGAA 2401 GCGCGATCAC ATGGTCCTGC TGGAGTTCGT GACCGCCGCC GGGATCACTC TCGGCATGGA 2461 CGAGCTGTAC AAGTAAAGCG GCCGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA 2521 TCCCCCGGGC TGCAGGAATT CGATATCAAG CTTATCGATA ATCAACCTCT GGATTACAAA 2581 ATTTGTGAAA GATTGACTGG TATTCTTAAC TATGTTGCTC CTTTTACGCT ATGTGGATAC 2641 GCTGCTTTAA TGCCTTTGTA TCATGCTATT GCTTCCCGTA TGGCTTTCAT TTTCTCCTCC 2701 TTGTATAAAT CCTGGTTGCT GTCTCTTTAT GAGGAGTTGT GGCCCGTTGT CAGGCAACGT 2761 GGCGTGGTGT GCACTGTGTT TGCTGACGCA ACCCCCACTG GTTGGGGCAT TGCCACCACC 2821 TGTCAGCTCC TTTCCGGGAC TTTCGCTTTC CCCCTCCCTA TTGCCACGGC GGAACTCATC 2881 GCCGCCTGCC TTGCCCGCTG CTGGACAGGG GCTCGGCTGT TGGGCACTGA CAATTCCGTG 2941 GTGTTGTCGG GGAAATCATC GTCCTTTCCT TGGCTGCTCG CCTGTGTTGC CACCTGGATT 3001 CTGCGCGGGA CGTCCTTCTG CTACGTCCCT TCGGCCCTCA ATCCAGCGGA CCTTCCTTCC 3061 CGCGGCCTGC TGCCGGCTCT GCGGCCTCTT CCGCGTCTTC GCCTTCGCCC TCAGACGAGT 3121 CGGATCTCCC TTTGGGCCGC CTCCCCGCAT CGATACCGTC GGCCGCTTCC CTTTAGTGAG 3181 GGTTAATGCT TCGAGCAGAC ATGATAAGAT ACATTGATGA GTTTGGACAA ACCACAACTA 3241 GAATGCAGTG AAAAAAATGC TTTATTTGTG AAATTTGTGA TGCTATTGCT TTATTTGTAA 3301 CCATTATAAG CTGCAATAAA CAAGTTAACA ACAACAATTG CATTCATTTT ATGTTTCAGG 3361 TTCAGGGGGA GGTGTGGGAG GTTTTTTAAA GCAAGTAAAA CCTCTACAAA TGTGGTAAAA 3421 TCGATAAGGA TCCGTCGACC GATGCCCTTG AGAGCCTTCA ACCCAGTCAG CTCCTTCCGG 3481 TGGGCGCGGG GCATGACTAT CGTCGCCGCA CTTATGACTG TCTTCTTTAT CATGCAACTC 3541 GTAGGACAGG TGCCGGCAGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC TGCGCTCGGT 3601 CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG GTAATACGGT TATCCACAGA 3661 ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 3721 TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG AGCATCACAA 3781 AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT ACCAGGCGTT 3841 TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA CCGGATACCT 3901 GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT 3961 CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 4021 CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA GACACGACTT 4081 ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG TAGGCGGTGC 4141 TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG TATTTGGTAT 4201 CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA 4261 ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 4321 AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCGCCCTCCC ACACATAACC 4381 AGAGGGCAGC AATTCACGAA TCCCAACTGC CGTCGGCTGT CCATCACTGT CCTTCACTAT 4441 GGCTTTGATC CCAGGATGCA GATCGAGAAG CACCTGTCGG CACCGTCCGC AGGGGCTCAA 4501 GATGCCCCTG TTCTCATTTC CGATCGCGAC GATACAAGTC AGGTTGCCAG CTGCCGCAGC 4561 AGCAGCAGTG CCCAGCACCA CGAGTTCTGC ACAAGGTCCC CCAGTAAAAT GATATACATT 4621 GACACCAGTG AAGATGCGGC CGTCGCTAGA GAGAGCTGCG CTGGCGACGC TGTAGTCTTC 4681 AGAGATGGGG ATGCTGTTGA TTGTAGCCGT TGCTCTTTCA ATGAGGGTGG ATTCTTCTTG 4741 AGACAAAGGC TTGGCCATGG TTTAGTTCCT CACCTTGTCG TATTATACTA TGCCGATATA 4801 CTATGCCGAT GATTAATTGT CAACACGTGC TGATCAGATC CGAAAATGGA TATACAAGCT 4861 CCCGGGAGCT TTTTGCAAAA GCCTAGGCCT CCAAAAAAGC CTCCTCACTA CTTCTGGAAT 4921 AGCTCAGAGG CAGAGGCGGC CTCGGCCTCT GCATAAATAA AAAAAATTAG TCAGCCATGG 4981 GGCGGAGAAT GGGCGGAACT GGGCGGAGTT AGGGGCGGGA TGGGCGGAGT TAGGGGCGGG 5041 ACTATGGTTG CTGACTAATT GAGATGCATG CTTTGCATAC TTCTGCCTGC TGGGGAGCCT 5101 GGGGACTTTC CACACCTGGT TGCTGACTAA TTGAGATGCA TGCTTTGCAT ACTTCTGCCT 5161 GCCTGGGGAG CCTGGGGACT TTCCACACCC TAACTGACAC ACATTCCACA GAATT //