LOCUS       P5L2W.TXT    6895 BP DS-DNA   CIRCULAR  SYN       17-MAY-2003
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     promoter        <49..>393
                     /note="SV40 promoter (∆SELP)"
     misc_feature    complement(>49..57)
                     /note="SV40 minor late 19S RNA"
     enhancer        71..213
                     /note="SV40 enhancer elements"
     misc_feature    218..281
                     /note="SV40 21bp repeats"
     rep_origin      238..>373
                     /note="SV40 Ori"
     mRNA            287..293
                     /note="late-early transcription startpoints"
     mutation        307..307
                     /note="G->A kills SELP ATG"
     mRNA            327..333
                     /note="early-early transcription start points"
     CDS             418..1137
                     /note="EGFP"
     polyA_signal    1153..>1374
                     /note="SV40 Late PolyA"
     misc_difference 1465..>1656
                     /note="rhesus-derived"
     promoter        <1465..2643
                     /note="EF1a promoter and UTR"
     exon            1663..1695
                     /note="EF-1a exon 1"
     intron          1696..2635
                     /note="EF-1a intron A"
     exon            2635..2643
                     /note="EF-1a exon 2 leader"
     insertion_seq   2667..>2691
                     /note="attB1"
     CDS             2704..4260
                     /note="HPV5 L2 (codmod)"
     insertion_seq   4268..>4292
                     /note="attB2"
     mRNA            4354..>4942
                     /note="WPRE"
     polyA_signal    4978..5646
                     /note="hEF1a polyA signal"
     polyA_site      5276..5276
                     /note="site of polyA addition"
     rep_origin      5647..6380
                     /note="MB1 Ori"
     promoter        6381..6467
                     /note="EM7 promoter"
     CDS             6468..6842
                     /note="Zeo"
     terminator      6843..6895
                     /note="terminator (rpmB/G)"
BASE COUNT     1535 A   1944 C   1884 G   1532 T      0 OTHER
ORIGIN      -
        1 CCCTGCAGGG CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT GTGGAATGTG
       61 TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG
      121 CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT
      181 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC TCCGCCCATC
      241 CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC ATGGCTGACT AATTTTTTTT
      301 ATTTATACAG AGGCCGAGGC CGCCTCGGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC
      361 TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTTGATTGGG ATCCACCGGT CGCCACCATG
      421 GTGAGCAAGG GCGAGGAGCT GTTCACCGGG GTGGTGCCCA TCCTGGTCGA GCTGGACGGC
      481 GACGTAAACG GCCACAAGTT CAGCGTGTCC GGCGAGGGCG AGGGCGATGC CACCTACGGC
      541 AAGCTGACCC TGAAGTTCAT CTGCACCACC GGCAAGCTGC CCGTGCCCTG GCCCACCCTC
      601 GTGACCACCC TGACCTACGG CGTGCAGTGC TTCAGCCGCT ACCCCGACCA CATGAAGCAG
      661 CACGACTTCT TCAAGTCCGC CATGCCCGAA GGCTACGTCC AGGAGCGCAC CATCTTCTTC
      721 AAGGACGACG GCAACTACAA GACCCGCGCC GAGGTGAAGT TCGAGGGCGA CACCCTGGTG
      781 AACCGCATCG AGCTGAAGGG CATCGACTTC AAGGAGGACG GCAACATCCT GGGGCACAAG
      841 CTGGAGTACA ACTACAACAG CCACAACGTC TATATCATGG CCGACAAGCA GAAGAACGGC
      901 ATCAAGGTGA ACTTCAAGAT CCGCCACAAC ATCGAGGACG GCAGCGTGCA GCTCGCCGAC
      961 CACTACCAGC AGAACACCCC CATCGGCGAC GGCCCCGTGC TGCTGCCCGA CAACCACTAC
     1021 CTGAGCACCC AGTCCGCCCT GAGCAAAGAC CCCAACGAGA AGCGCGATCA CATGGTCCTG
     1081 CTGGAGTTCG TGACCGCCGC CGGGATCACT CTCGGCATGG ACGAGCTGTA CAAGTAAAGC
     1141 GGCCGCTTCG AGCAGACATG ATAAGATACA TTGATGAGTT TGGACAAACC ACAACTAGAA
     1201 TGCAGTGAAA AAAATGCTTT ATTTGTGAAA TTTGTGATGC TATTGCTTTA TTTGTAACCA
     1261 TTATAAGCTG CAATAAACAA GTTAACAACA ACAATTGCAT TCATTTTATG TTTCAGGTTC
     1321 AGGGGGAGGT GTGGGAGGTT TTTTAAAGCA AGTAAAACCT CTACAAATGT GGTAAAATCG
     1381 ATAAGGATCC GGGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT
     1441 TGCGGTGGAG AAGAGCATGC GTGAGGCTCC GGTGCCCGTC AGTGGGCAGA GCGCACATCG
     1501 CCCACAGTCC CCGAGAAGTT GGGGGGAGGG GTCGGCAATT GAACCGGTGC CTAGAGAAGG
     1561 TGGCGCGGGG TAAACTGGGA AAGTGATGTC GTGTACTGGC TCCGCCCTTT TCCCGAGGGT
     1621 GGGGGAGAAC CGTATATAAG TGCAGTAGTC GCTGTGAACG TTCTTTTTCG CAACGGGTTT
     1681 GCCGCCAGAA CACAGGTAAG TGCCGTGTGT GGTTCCCGCG GGCCTGGCCT CTTTACGGGT
     1741 TATGGCCCTT GCGTGCCTTG AATTACTTCC ACCTGGCTGC AGTACGTGAT TCTTGATCCC
     1801 GAGCTTCGGG TTGGAAGTGG GTGGGAGAGT TCGAGGCCTT GCGCTTAAGG AGCCCCTTCG
     1861 CCTCGTGCTT GAGTTGAGGC CTGGCCTGGG CGCTGGGGCC GCCGCGTGCG AATCTGGTGG
     1921 CACCTTCGCG CCTGTCTCGC TGCTTTCGAT AAGTCTCTAG CCATTTAAAA TTTTTGATGA
     1981 CCTGCTGCGA CGCTTTTTTT CTGGCAAGAT AGTCTTGTAA ATGCGGGCCA AGATCTGCAC
     2041 ACTGGTATTT CGGTTTTTGG GGCCGCGGGC GGCGACGGGG CCCGTGCGTC CCAGCGCACA
     2101 TGTTCGGCGA GGCGGGGCCT GCGAGCGCGG CCACCGAGAA TCGGACGGGG GTAGTCTCAA
     2161 GCTGGCCGGC CTGCTCTGGT GCCTGGCCTC GCGCCGCCGT GTATCGCCCC GCCCTGGGCG
     2221 GCAAGGCTGG CCCGGTCGGC ACCAGTTGCG TGAGCGGAAA GATGGCCGCT TCCCGGCCCT
     2281 GCTGCAGGGA GCTCAAAATG GAGGACGCGG CGCTCGGGAG AGCGGGCGGG TGAGTCACCC
     2341 ACACAAAGGA AAAGGGCCTT TCCGTCCTCA GCCGTCGCTT CATGTGACTC CACGGAGTAC
     2401 CGGGCGCCGT CCAGGCACCT CGATTAGTTC TCGAGCTTTT GGAGTACGTC GTCTTTAGGT
     2461 TGGGGGGAGG GGTTTTATGC GATGGAGTTT CCCCACACTG AGTGGGTGGA GACTGAAGTT
     2521 AGGCCAGCTT GGCACTTGAT GTAATTCTCC TTGGAATTTG CCCTTTTTGA GTTTGGATCT
     2581 TGGTTCATTC TCAAGCCTCA GACAGTGGTT CAAAGTTTTT TTCTTCCATT TCAGGTGTCG
     2641 TGAGGAATTC TCTAGAGCTT GATCAAACAA GTTTGTACAA AAAAGCAGGC TTCTAGAGCC
     2701 ACCATGGCCA GGGCCAAGCG CGTGAAAAGG GATAGCGTGA CCCACATCTA TCAGACATGC
     2761 AAGCAAGCCG GGACCTGTCC ACCCGACGTC ATCAACAAGG TCGAGCAGAC CACCGTCGCC
     2821 GATAACATCC TGAAGTACGG GTCCGCCGGC GTGTTCTTCG GCGGGTTGGG CATCTCCACC
     2881 GGGAGGGGCA CCGGCGGCGC CACCGGCTAT GTCCCCTTGG GCGAGGGCCC CGGCGTCAGG
     2941 GTGGGCGGCA CACCAACCGT CGTGCGCCCC AGTCTCGTCC CCGAGACCAT TGGCCCAGTC
     3001 GACATCCTCC CAATCGACAC CGTCAATCCA GTCGAGCCCA CCGCCAGCAG TGTCGTGCCC
     3061 TTGACCGAAA GTACCGGGGC CGACCTGTTG CCCGGCGAGG TCGAGACCAT CGCCGAGATT
     3121 CACCCCGTGC CCGAGGGCCC CAGCGTCGAC ACACCCGTGG TCACAACCTC AACCGGCAGT
     3181 TCCGCCGTCC TGGAAGTCGC ACCCGAGCCC ATCCCACCCA CCAGAGTGCG CGTCAGCAGG
     3241 ACCCAATACC ATAACCCCAG CTTCCAGATC ATCACCGAAA GCACCCCCGC CCAGGGCGAG
     3301 AGCAGCTTGG CCGACCATGT CCTCGTCACC AGCGGCAGCG GCGGCCAGAG GATCGGCGGC
     3361 GACATCACCG ATATCATCGA GCTGGAAGAG ATCCCATCCC GCTACACCTT CGAGATCGAG
     3421 GAGCCCACCC CACCGAGGAG GTCATCCACC CCCCTGCCGA GGAACCAGAG CGTGGGGAGG
     3481 CGCCGCGGCT TTAGCCTCAC CAACCGCAGG CTGGTGCAGC AAGTGCAGGT CGATAACCCG
     3541 CTGTTCTTGA CCCAGCCCAG CAAACTGGTC AGGTTCGCCT TCGACAACCC CGTCTTCGAA
     3601 GAGGAGGTCA CCAACATCTT CGAGAACGAC CTCGACGTGT TCGAGGAGCC CCCCGATCGC
     3661 GACTTCTTGG ACGTCCGCGA GCTCGGCAGG CCCCAGTACA GCACCACCCC CGCCGGCTAC
     3721 GTCCGCGTGT CACGCCTCGG CACCAGGGCA ACCATCAGGA CCAGGAGCGG CGCCCAAATC
     3781 GGCAGCCAGG TCCACTTCTA TCGCGACTTG TCTAGCATCA ACACCGAGGA CCCCATCGAG
     3841 CTGCAGCTGC TGGGGCAGCA CAGCGGCGAC GCCACCATCG TGCAGGGCCC CGTCGAGTCA
     3901 ACCTTCATCG ACATGGACAT CAGCGAGAAC CCCCTGAGCG AGTCTATCGA GGCCTACAGC
     3961 CACGACCTGC TGCTGGACGA GACCGTCGAG GACTTTTCCG GCAGCCAACT CGTCATCGGC
     4021 AACAGGCGCT CAACCAATAG CTATACCGTC CCCCGCTTCG AGACCACCCG CAACGGCAGC
     4081 TACTACACCC AGGATACCAA AGGCTACTAC GTCGCCTACC CCGAAAGCAG GAACAACGCC
     4141 GAGATCATCT ACCCCACCCC CGACATCCCC GTCGTGATCA TCCATCCCCA CGATTCCACC
     4201 GGCGATTTCT ACCTGCACCC ATCCTTGAGG CGCAGGAAGA GGAAGCGCAA GTACCTCTGA
     4261 GCCCGGGACC CAGCTTTCTT GTACAAAGTG GTTCGATCTA GAATGGCTAG TGGATCCCCC
     4321 GGGCTGCAGG AATTCGATAT CAAGCTTATC GATAATCAAC CTCTGGATTA CAAAATTTGT
     4381 GAAAGATTGA CTGGTATTCT TAACTATGTT GCTCCTTTTA CGCTATGTGG ATACGCTGCT
     4441 TTAATGCCTT TGTATCATGC TATTGCTTCC CGTATGGCTT TCATTTTCTC CTCCTTGTAT
     4501 AAATCCTGGT TGCTGTCTCT TTATGAGGAG TTGTGGCCCG TTGTCAGGCA ACGTGGCGTG
     4561 GTGTGCACTG TGTTTGCTGA CGCAACCCCC ACTGGTTGGG GCATTGCCAC CACCTGTCAG
     4621 CTCCTTTCCG GGACTTTCGC TTTCCCCCTC CCTATTGCCA CGGCGGAACT CATCGCCGCC
     4681 TGCCTTGCCC GCTGCTGGAC AGGGGCTCGG CTGTTGGGCA CTGACAATTC CGTGGTGTTG
     4741 TCGGGGAAAT CATCGTCCTT TCCTTGGCTG CTCGCCTGTG TTGCCACCTG GATTCTGCGC
     4801 GGGACGTCCT TCTGCTACGT CCCTTCGGCC CTCAATCCAG CGGACCTTCC TTCCCGCGGC
     4861 CTGCTGCCGG CTCTGCGGCC TCTTCCGCGT CTTCGCCTTC GCCCTCAGAC GAGTCGGATC
     4921 TCCCTTTGGG CCGCCTCCCC GCATCGATAC CGTCGGCCCA CTGCTCCCTA AACCTGAGCT
     4981 AGCATTATCC CTAATACCTG CCACCCCACT CTTAATCAGT GGTGGAAGAA CGGTCTCAGA
     5041 ACTGTTTGTT TCAATTGGCC ATTTAAGTTT AGTAGTAAAA GACTGGTTAA TGATAACAAT
     5101 GCATCGTAAA ACCTTCAGAA GGAAAGGAGA ATGTTTTGTG GACCACTTTG GTTTTCTTTT
     5161 TTGCGTGTGG CAGTTTTAAG TTATTAGTTT TTAAAATCAG TACTTTTTAA TGGAAACAAC
     5221 TTGACCAAAA ATTTGTCACA GAATTTTGAG ACCCATTAAA AAAGTTAAAT GAGAAACCTG
     5281 TGTGTTCCTT TGGTCAACAC CGAGACATTT AGGTGAAAGA CATCTAATTC TGGTTTTACG
     5341 AATCTGGAAA CTTCTTGAAA ATGTAATTCT TGAGTTAACA CTTCTGGGTG GAGAATAGGG
     5401 TTGTTTTCCC CCCACATAAT TGGAAGGGGA AGGAATATCA TTTAAAGCTA TGGGAGGGTT
     5461 TCTTTGATTA CAACACTGGA GAGAAATGCA GCATGTTGCT GATTGCCTGT CACTAAAACA
     5521 GGCCAAAAAC TGAGTCCTTG GGTTGCATAG AAAGCTTCAT GTTGCTAAAC CAATGTTAAG
     5581 TGAATCTTTG GAAACAAAAT GTTTCCAAAT TACTGGGATG TGCATGTTGA AACGTGGGTT
     5641 AATTAACTAG CCATGACCAA AATCCCTTAA CGTGAGTTTT CGTTCCACTG AGCGTCAGAC
     5701 CCCGTAGAAA AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT AATCTGCTGC
     5761 TTGCAAACAA AAAAACCACC GCTACCAGCG GTGGTTTGTT TGCCGGATCA AGAGCTACCA
     5821 ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA TACCAAATAC TGTTCTTCTA
     5881 GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC ATACCTCGCT
     5941 CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT TACCGGGTTG
     6001 GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG GGGTTCGTGC
     6061 ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA GCGTGAGCTA
     6121 TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT AAGCGGCAGG
     6181 GTCGGAACAG GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT
     6241 CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC GTCAGGGGGG
     6301 CGGAGCCTAT GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG
     6361 CCTTTTGCTC ACATGTTCTT AATTAAATTT TTCAAAAGTA GTTGACAATT AATCATCGGC
     6421 ATAGTATATC GGCATAGTAT AATACGACTC ACTATAGGAG GGCCATCATG GCCAAGTTGA
     6481 CCAGTGCTGT CCCAGTGCTC ACAGCCAGGG ATGTGGCTGG AGCTGTTGAG TTCTGGACTG
     6541 ACAGGTTGGG GTTCTCCAGA GATTTTGTGG AGGATGACTT TGCAGGTGTG GTCAGAGATG
     6601 ATGTCACCCT GTTCATCTCA GCAGTCCAGG ACCAGGTGGT GCCTGACAAC ACCCTGGCTT
     6661 GGGTGTGGGT GAGAGGACTG GATGAGCTGT ATGCTGAGTG GAGTGAGGTG GTCTCCACCA
     6721 ACTTCAGGGA TGCCAGTGGC CCTGCCATGA CAGAGATTGG AGAGCAGCCC TGGGGGAGAG
     6781 AGTTTGCCCT GAGAGACCCA GCAGGCAACT GTGTGCACTT TGTGGCAGAG GAGCAGGACT
     6841 GAGGATAAGA ATTGTAACAA AAAACCCCGC CCCGGCGGGG TTTTTTGTTA ATTAA
//