LOCUS P5L1W.TXT 6889 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_feature complement(>49..57) /note="SV40 minor late 19S RNA" promoter <49..>393 /note="SV40 promoter (∆SELP)" enhancer 71..213 /note="SV40 enhancer elements" misc_feature 218..281 /note="SV40 21bp repeats" rep_origin 238..>373 /note="SV40 Ori" mRNA 287..293 /note="late-early transcription startpoints" mutation 307..307 /note="G->A kills SELP ATG" mRNA 327..333 /note="early-early transcription start points" CDS 418..1137 /note="EGFP" polyA_signal 1153..>1374 /note="SV40 Late PolyA" promoter <1465..2643 /note="EF1a promoter and UTR" misc_difference 1465..>1656 /note="rhesus-derived" exon 1663..1695 /note="EF-1a exon 1" intron 1696..2635 /note="EF-1a intron A" exon 2635..2643 /note="EF-1a exon 2 leader" insertion_seq 2667..>2691 /note="attB1" CDS 2704..4254 /note="HPV5 L1 (codmod)" insertion_seq 4262..>4286 /note="attB2" mRNA 4348..>4936 /note="WPRE" polyA_signal 4972..5640 /note="hEF1a polyA signal" polyA_site 5270..5270 /note="site of polyA addition" rep_origin 5641..6374 /note="MB1 Ori" promoter 6375..6461 /note="EM7 promoter" CDS 6462..6836 /note="Zeo" terminator 6837..6889 /note="terminator (rpmB/G)" BASE COUNT 1570 A 1881 C 1864 G 1574 T 0 OTHER ORIGIN - 1 CCCTGCAGGG CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT GTGGAATGTG 61 TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 121 CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT 181 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC TCCGCCCATC 241 CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC ATGGCTGACT AATTTTTTTT 301 ATTTATACAG AGGCCGAGGC CGCCTCGGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC 361 TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTTGATTGGG ATCCACCGGT CGCCACCATG 421 GTGAGCAAGG GCGAGGAGCT GTTCACCGGG GTGGTGCCCA TCCTGGTCGA GCTGGACGGC 481 GACGTAAACG GCCACAAGTT CAGCGTGTCC GGCGAGGGCG AGGGCGATGC CACCTACGGC 541 AAGCTGACCC TGAAGTTCAT CTGCACCACC GGCAAGCTGC CCGTGCCCTG GCCCACCCTC 601 GTGACCACCC TGACCTACGG CGTGCAGTGC TTCAGCCGCT ACCCCGACCA CATGAAGCAG 661 CACGACTTCT TCAAGTCCGC CATGCCCGAA GGCTACGTCC AGGAGCGCAC CATCTTCTTC 721 AAGGACGACG GCAACTACAA GACCCGCGCC GAGGTGAAGT TCGAGGGCGA CACCCTGGTG 781 AACCGCATCG AGCTGAAGGG CATCGACTTC AAGGAGGACG GCAACATCCT GGGGCACAAG 841 CTGGAGTACA ACTACAACAG CCACAACGTC TATATCATGG CCGACAAGCA GAAGAACGGC 901 ATCAAGGTGA ACTTCAAGAT CCGCCACAAC ATCGAGGACG GCAGCGTGCA GCTCGCCGAC 961 CACTACCAGC AGAACACCCC CATCGGCGAC GGCCCCGTGC TGCTGCCCGA CAACCACTAC 1021 CTGAGCACCC AGTCCGCCCT GAGCAAAGAC CCCAACGAGA AGCGCGATCA CATGGTCCTG 1081 CTGGAGTTCG TGACCGCCGC CGGGATCACT CTCGGCATGG ACGAGCTGTA CAAGTAAAGC 1141 GGCCGCTTCG AGCAGACATG ATAAGATACA TTGATGAGTT TGGACAAACC ACAACTAGAA 1201 TGCAGTGAAA AAAATGCTTT ATTTGTGAAA TTTGTGATGC TATTGCTTTA TTTGTAACCA 1261 TTATAAGCTG CAATAAACAA GTTAACAACA ACAATTGCAT TCATTTTATG TTTCAGGTTC 1321 AGGGGGAGGT GTGGGAGGTT TTTTAAAGCA AGTAAAACCT CTACAAATGT GGTAAAATCG 1381 ATAAGGATCC GGGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT 1441 TGCGGTGGAG AAGAGCATGC GTGAGGCTCC GGTGCCCGTC AGTGGGCAGA GCGCACATCG 1501 CCCACAGTCC CCGAGAAGTT GGGGGGAGGG GTCGGCAATT GAACCGGTGC CTAGAGAAGG 1561 TGGCGCGGGG TAAACTGGGA AAGTGATGTC GTGTACTGGC TCCGCCCTTT TCCCGAGGGT 1621 GGGGGAGAAC CGTATATAAG TGCAGTAGTC GCTGTGAACG TTCTTTTTCG CAACGGGTTT 1681 GCCGCCAGAA CACAGGTAAG TGCCGTGTGT GGTTCCCGCG GGCCTGGCCT CTTTACGGGT 1741 TATGGCCCTT GCGTGCCTTG AATTACTTCC ACCTGGCTGC AGTACGTGAT TCTTGATCCC 1801 GAGCTTCGGG TTGGAAGTGG GTGGGAGAGT TCGAGGCCTT GCGCTTAAGG AGCCCCTTCG 1861 CCTCGTGCTT GAGTTGAGGC CTGGCCTGGG CGCTGGGGCC GCCGCGTGCG AATCTGGTGG 1921 CACCTTCGCG CCTGTCTCGC TGCTTTCGAT AAGTCTCTAG CCATTTAAAA TTTTTGATGA 1981 CCTGCTGCGA CGCTTTTTTT CTGGCAAGAT AGTCTTGTAA ATGCGGGCCA AGATCTGCAC 2041 ACTGGTATTT CGGTTTTTGG GGCCGCGGGC GGCGACGGGG CCCGTGCGTC CCAGCGCACA 2101 TGTTCGGCGA GGCGGGGCCT GCGAGCGCGG CCACCGAGAA TCGGACGGGG GTAGTCTCAA 2161 GCTGGCCGGC CTGCTCTGGT GCCTGGCCTC GCGCCGCCGT GTATCGCCCC GCCCTGGGCG 2221 GCAAGGCTGG CCCGGTCGGC ACCAGTTGCG TGAGCGGAAA GATGGCCGCT TCCCGGCCCT 2281 GCTGCAGGGA GCTCAAAATG GAGGACGCGG CGCTCGGGAG AGCGGGCGGG TGAGTCACCC 2341 ACACAAAGGA AAAGGGCCTT TCCGTCCTCA GCCGTCGCTT CATGTGACTC CACGGAGTAC 2401 CGGGCGCCGT CCAGGCACCT CGATTAGTTC TCGAGCTTTT GGAGTACGTC GTCTTTAGGT 2461 TGGGGGGAGG GGTTTTATGC GATGGAGTTT CCCCACACTG AGTGGGTGGA GACTGAAGTT 2521 AGGCCAGCTT GGCACTTGAT GTAATTCTCC TTGGAATTTG CCCTTTTTGA GTTTGGATCT 2581 TGGTTCATTC TCAAGCCTCA GACAGTGGTT CAAAGTTTTT TTCTTCCATT TCAGGTGTCG 2641 TGAGGAATTC TCTAGAGCTT GATCAAACAA GTTTGTACAA AAAAGCAGGC TTCTAGAGCC 2701 ACCATGGCCG TCTGGCATAG CGCCAACGGC AAGGTCTACT TGCCCCCGAG CACCCCCGTC 2761 GCACGCGTGC AGTCTACAGA CGAGTATATC CAGCGCACCA ACATCTACTA CCACGCCTTC 2821 TCCGATCGCC TCCTGACCGT GGGCCACCCC TACTTTAACG TGTATAACAT CAACGGCGAC 2881 AAGTTGGAAG TCCCCAAAGT CAGCGGCAAC CAGCATCGCG TGTTCAGGTT GAAGCTGCCC 2941 GACCCCAATC GCTTCGCCCT GGCCGACATG AGCGTCTACA ATCCCGATAA GGAGAGGCTC 3001 GTCTGGGCAT GCCGCGGGCT GGAGATCGGC CGCGGGCAAC CCCTGGGCGT GGGCTCCACC 3061 GGCCATCCCT ACTTTAACAA GGTCAAGGAC ACCGAGAATT CCAACGCCTA TATCACCTTC 3121 AGCAAGGACG ATCGCCAAGA CACCAGCTTC GACCCCAAGC AGATTCAGAT GTTCATCGTG 3181 GGCTGTACCC CCTGTATCGG CGAACACTGG GACAAGGCCG TCCCCTGCGC CGAGAACGAC 3241 CAACAGACCG GGTTGTGCCC CCCCATCGAG TTGAAGAATA CCTACATCGA GGACGGCGAC 3301 ATGGCCGATA TCGGCTTCGG CAATATGAAC TTCAAAGCGT TGCAGGACTC CCGGAGCGAC 3361 GTGTCCCTGG ATATTGTGAA CGAGACCTGC AAGTACCCCG ACTTCCTGAA AATGCAGAAT 3421 GACATCTACG GGGACGCCTG TTTCTTCTAC GCCAGGCGCG AACAGTGCTA CGCACGCCAT 3481 TTCTTCGTCC GCGGCGGCAA GACCGGCGAC GATATCCCCG GCGCCCAGAT CGATAACGGC 3541 ACCTATAAGA ACCAATTCTA TATCCCCGGC GCCGACGGGC AGGCCCAGAA AACCATCGGC 3601 AACAGTATGT ACTTTCCCAC CGTCTCCGGG AGCCTGGTCA GTTCCGACGC CCAGCTCTTC 3661 AATCGCCCAT TTTGGTTGCA GCGCGCACAG GGCCACAACA ACGGGATTCT CTGGGCCAAC 3721 CAGATGTTCA TTACCGTCGT CGATAACACC CGCAACACCA ACTTTTCCAT CAGCGTGTAC 3781 AACCAAGCCG GCGCCTTGAA GGACGTCGCC GATTACAACG CCGACCAGTT CCGCGAGTAC 3841 CAGCGCCACG TGGAGGAGTA CGAGATCAGC CTGATCTTGC AGTTGTGCAA AGTCCCCCTG 3901 AAGGCCGAAG TGCTCGCCCA GATTAACGCC ATGAATAGCA GCCTGCTCGA AGACTGGCAG 3961 CTGGGCTTCG TCCCAACCCC CGACAACCCC ATCCAAGATA CATATCGCTA CATCGATAGC 4021 CTCGCCACCA GATGCCCCGA CAAGAACCCC CCCAAGGAGA AGGAGGATCC CTACAAAGGG 4081 CTGCACTTCT GGGACGTGGA CCTGACCGAG CGCCTCAGCC TGGACCTGGA CCAGTACAGT 4141 CTGGGGCGCA AGTTCCTGTT TCAGGCCGGC CTGCAGCAGA CCACAGTCAA TGGCACCAAG 4201 GCCGTCAGCT ACAAGGGCAG CAACCGCGGC ACCAAGAGGA AGAGGAAGAA CTGAGCCCGG 4261 GACCCAGCTT TCTTGTACAA AGTGGTTCGA TCTAGAATGG CTAGTGGATC CCCCGGGCTG 4321 CAGGAATTCG ATATCAAGCT TATCGATAAT CAACCTCTGG ATTACAAAAT TTGTGAAAGA 4381 TTGACTGGTA TTCTTAACTA TGTTGCTCCT TTTACGCTAT GTGGATACGC TGCTTTAATG 4441 CCTTTGTATC ATGCTATTGC TTCCCGTATG GCTTTCATTT TCTCCTCCTT GTATAAATCC 4501 TGGTTGCTGT CTCTTTATGA GGAGTTGTGG CCCGTTGTCA GGCAACGTGG CGTGGTGTGC 4561 ACTGTGTTTG CTGACGCAAC CCCCACTGGT TGGGGCATTG CCACCACCTG TCAGCTCCTT 4621 TCCGGGACTT TCGCTTTCCC CCTCCCTATT GCCACGGCGG AACTCATCGC CGCCTGCCTT 4681 GCCCGCTGCT GGACAGGGGC TCGGCTGTTG GGCACTGACA ATTCCGTGGT GTTGTCGGGG 4741 AAATCATCGT CCTTTCCTTG GCTGCTCGCC TGTGTTGCCA CCTGGATTCT GCGCGGGACG 4801 TCCTTCTGCT ACGTCCCTTC GGCCCTCAAT CCAGCGGACC TTCCTTCCCG CGGCCTGCTG 4861 CCGGCTCTGC GGCCTCTTCC GCGTCTTCGC CTTCGCCCTC AGACGAGTCG GATCTCCCTT 4921 TGGGCCGCCT CCCCGCATCG ATACCGTCGG CCCACTGCTC CCTAAACCTG AGCTAGCATT 4981 ATCCCTAATA CCTGCCACCC CACTCTTAAT CAGTGGTGGA AGAACGGTCT CAGAACTGTT 5041 TGTTTCAATT GGCCATTTAA GTTTAGTAGT AAAAGACTGG TTAATGATAA CAATGCATCG 5101 TAAAACCTTC AGAAGGAAAG GAGAATGTTT TGTGGACCAC TTTGGTTTTC TTTTTTGCGT 5161 GTGGCAGTTT TAAGTTATTA GTTTTTAAAA TCAGTACTTT TTAATGGAAA CAACTTGACC 5221 AAAAATTTGT CACAGAATTT TGAGACCCAT TAAAAAAGTT AAATGAGAAA CCTGTGTGTT 5281 CCTTTGGTCA ACACCGAGAC ATTTAGGTGA AAGACATCTA ATTCTGGTTT TACGAATCTG 5341 GAAACTTCTT GAAAATGTAA TTCTTGAGTT AACACTTCTG GGTGGAGAAT AGGGTTGTTT 5401 TCCCCCCACA TAATTGGAAG GGGAAGGAAT ATCATTTAAA GCTATGGGAG GGTTTCTTTG 5461 ATTACAACAC TGGAGAGAAA TGCAGCATGT TGCTGATTGC CTGTCACTAA AACAGGCCAA 5521 AAACTGAGTC CTTGGGTTGC ATAGAAAGCT TCATGTTGCT AAACCAATGT TAAGTGAATC 5581 TTTGGAAACA AAATGTTTCC AAATTACTGG GATGTGCATG TTGAAACGTG GGTTAATTAA 5641 CTAGCCATGA CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC AGACCCCGTA 5701 GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG CTGCTTGCAA 5761 ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG ATCAAGAGCT ACCAACTCTT 5821 TTTCCGAAGG TAACTGGCTT CAGCAGAGCG CAGATACCAA ATACTGTTCT TCTAGTGTAG 5881 CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT CGCTCTGCTA 5941 ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG GTTGGACTCA 6001 AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC GTGCACACAG 6061 CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA GCTATGAGAA 6121 AGCGCCACGC TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG CAGGGTCGGA 6181 ACAGGAGAGC GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA TAGTCCTGTC 6241 GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC 6301 CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG CTGGCCTTTT 6361 GCTCACATGT TCTTAATTAA ATTTTTCAAA AGTAGTTGAC AATTAATCAT CGGCATAGTA 6421 TATCGGCATA GTATAATACG ACTCACTATA GGAGGGCCAT CATGGCCAAG TTGACCAGTG 6481 CTGTCCCAGT GCTCACAGCC AGGGATGTGG CTGGAGCTGT TGAGTTCTGG ACTGACAGGT 6541 TGGGGTTCTC CAGAGATTTT GTGGAGGATG ACTTTGCAGG TGTGGTCAGA GATGATGTCA 6601 CCCTGTTCAT CTCAGCAGTC CAGGACCAGG TGGTGCCTGA CAACACCCTG GCTTGGGTGT 6661 GGGTGAGAGG ACTGGATGAG CTGTATGCTG AGTGGAGTGA GGTGGTCTCC ACCAACTTCA 6721 GGGATGCCAG TGGCCCTGCC ATGACAGAGA TTGGAGAGCA GCCCTGGGGG AGAGAGTTTG 6781 CCCTGAGAGA CCCAGCAGGC AACTGTGTGC ACTTTGTGGC AGAGGAGCAG GACTGAGGAT 6841 AAGAATTGTA ACAAAAAACC CCGCCCCGGC GGGGTTTTTT GTTAATTAA //