LOCUS PWM.GB MAY 6610 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_feature complement(>49..57) /note="SV40 minor late 19S RNA" promoter <49..>393 /note="SV40 promoter (∆SELP)" enhancer 71..213 /note="SV40 enhancer elements" misc_feature 218..281 /note="SV40 21bp repeats" rep_origin 238..>373 /note="SV40 Ori" mRNA 287..293 /note="late-early transcription startpoints" mutation 307..307 /note="G->A kills SELP ATG" mRNA 327..333 /note="early-early transcription start points" CDS 418..1137 /note="EGFP" polyA_signal 1153..>1374 /note="SV40 Late PolyA" promoter <1465..2643 /note="EF1a promoter and UTR" misc_difference 1465..>1656 /note="rhesus-derived" exon 1663..1695 /note="EF-1a exon 1" intron 1696..2635 /note="EF-1a intron A" exon 2635..2643 /note="EF-1a exon 2 leader" insertion_seq 2667..>2691 /note="attB1" CDS 2704..3975 /note="Merkel VP1" mutation 3565..3567 /note="D288H in MCV350" mutation 3649..3651 /note="R316I in MCV350" mutation 3799..3801 /note="D366N in MCV350" misc_recomb 3983..>4007 /note="attB2" mRNA 4069..>4657 /note="WPRE" polyA_signal 4693..5361 /note="hEF1a polyA signal" polyA_site 4991..4991 /note="site of polyA addition" rep_origin 5362..6095 /note="MB1 Ori" promoter 6096..6182 /note="EM7 promoter" CDS 6183..6557 /note="Zeocin resistance" terminator 6558..6610 /note="terminator (rpmB/G)" BASE COUNT 1519 A 1793 C 1805 G 1493 T 0 OTHER ORIGIN - 1 CCCTGCAGGG CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT GTGGAATGTG 61 TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 121 CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT 181 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC TCCGCCCATC 241 CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC ATGGCTGACT AATTTTTTTT 301 ATTTATACAG AGGCCGAGGC CGCCTCGGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC 361 TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTTGATTGGG ATCCACCGGT CGCCACCATG 421 GTGAGCAAGG GCGAGGAGCT GTTCACCGGG GTGGTGCCCA TCCTGGTCGA GCTGGACGGC 481 GACGTAAACG GCCACAAGTT CAGCGTGTCC GGCGAGGGCG AGGGCGATGC CACCTACGGC 541 AAGCTGACCC TGAAGTTCAT CTGCACCACC GGCAAGCTGC CCGTGCCCTG GCCCACCCTC 601 GTGACCACCC TGACCTACGG CGTGCAGTGC TTCAGCCGCT ACCCCGACCA CATGAAGCAG 661 CACGACTTCT TCAAGTCCGC CATGCCCGAA GGCTACGTCC AGGAGCGCAC CATCTTCTTC 721 AAGGACGACG GCAACTACAA GACCCGCGCC GAGGTGAAGT TCGAGGGCGA CACCCTGGTG 781 AACCGCATCG AGCTGAAGGG CATCGACTTC AAGGAGGACG GCAACATCCT GGGGCACAAG 841 CTGGAGTACA ACTACAACAG CCACAACGTC TATATCATGG CCGACAAGCA GAAGAACGGC 901 ATCAAGGTGA ACTTCAAGAT CCGCCACAAC ATCGAGGACG GCAGCGTGCA GCTCGCCGAC 961 CACTACCAGC AGAACACCCC CATCGGCGAC GGCCCCGTGC TGCTGCCCGA CAACCACTAC 1021 CTGAGCACCC AGTCCGCCCT GAGCAAAGAC CCCAACGAGA AGCGCGATCA CATGGTCCTG 1081 CTGGAGTTCG TGACCGCCGC CGGGATCACT CTCGGCATGG ACGAGCTGTA CAAGTAAAGC 1141 GGCCGCTTCG AGCAGACATG ATAAGATACA TTGATGAGTT TGGACAAACC ACAACTAGAA 1201 TGCAGTGAAA AAAATGCTTT ATTTGTGAAA TTTGTGATGC TATTGCTTTA TTTGTAACCA 1261 TTATAAGCTG CAATAAACAA GTTAACAACA ACAATTGCAT TCATTTTATG TTTCAGGTTC 1321 AGGGGGAGGT GTGGGAGGTT TTTTAAAGCA AGTAAAACCT CTACAAATGT GGTAAAATCG 1381 ATAAGGATCC GGGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT 1441 TGCGGTGGAG AAGAGCATGC GTGAGGCTCC GGTGCCCGTC AGTGGGCAGA GCGCACATCG 1501 CCCACAGTCC CCGAGAAGTT GGGGGGAGGG GTCGGCAATT GAACCGGTGC CTAGAGAAGG 1561 TGGCGCGGGG TAAACTGGGA AAGTGATGTC GTGTACTGGC TCCGCCCTTT TCCCGAGGGT 1621 GGGGGAGAAC CGTATATAAG TGCAGTAGTC GCTGTGAACG TTCTTTTTCG CAACGGGTTT 1681 GCCGCCAGAA CACAGGTAAG TGCCGTGTGT GGTTCCCGCG GGCCTGGCCT CTTTACGGGT 1741 TATGGCCCTT GCGTGCCTTG AATTACTTCC ACCTGGCTGC AGTACGTGAT TCTTGATCCC 1801 GAGCTTCGGG TTGGAAGTGG GTGGGAGAGT TCGAGGCCTT GCGCTTAAGG AGCCCCTTCG 1861 CCTCGTGCTT GAGTTGAGGC CTGGCCTGGG CGCTGGGGCC GCCGCGTGCG AATCTGGTGG 1921 CACCTTCGCG CCTGTCTCGC TGCTTTCGAT AAGTCTCTAG CCATTTAAAA TTTTTGATGA 1981 CCTGCTGCGA CGCTTTTTTT CTGGCAAGAT AGTCTTGTAA ATGCGGGCCA AGATCTGCAC 2041 ACTGGTATTT CGGTTTTTGG GGCCGCGGGC GGCGACGGGG CCCGTGCGTC CCAGCGCACA 2101 TGTTCGGCGA GGCGGGGCCT GCGAGCGCGG CCACCGAGAA TCGGACGGGG GTAGTCTCAA 2161 GCTGGCCGGC CTGCTCTGGT GCCTGGCCTC GCGCCGCCGT GTATCGCCCC GCCCTGGGCG 2221 GCAAGGCTGG CCCGGTCGGC ACCAGTTGCG TGAGCGGAAA GATGGCCGCT TCCCGGCCCT 2281 GCTGCAGGGA GCTCAAAATG GAGGACGCGG CGCTCGGGAG AGCGGGCGGG TGAGTCACCC 2341 ACACAAAGGA AAAGGGCCTT TCCGTCCTCA GCCGTCGCTT CATGTGACTC CACGGAGTAC 2401 CGGGCGCCGT CCAGGCACCT CGATTAGTTC TCGAGCTTTT GGAGTACGTC GTCTTTAGGT 2461 TGGGGGGAGG GGTTTTATGC GATGGAGTTT CCCCACACTG AGTGGGTGGA GACTGAAGTT 2521 AGGCCAGCTT GGCACTTGAT GTAATTCTCC TTGGAATTTG CCCTTTTTGA GTTTGGATCT 2581 TGGTTCATTC TCAAGCCTCA GACAGTGGTT CAAAGTTTTT TTCTTCCATT TCAGGTGTCG 2641 TGAGGAATTC TCTAGAGCTT GATCAAACAA GTTTGTACAA AAAAGCAGGC TTCCGGAGCC 2701 ACCATGGCCC CGAAGCGCAA GGCCAGCAGT ACATGCAAGA CCCCAAAGCG CCAGTGCATC 2761 CCGAAACCCG GCTGTTGTCC CAACGTCGCA AGCGTCCCGA AGCTGTTGGT CAAGGGCGGC 2821 GTCGAGGTGC TGAGCGTCGT CACCGGCGAG GACTCAATCA CACAAATCGA ACTCTACCTC 2881 AACCCACGCA TGGGCGTCAA CAGTCCCGAC TTGCCCACCA CCAGCAATTG GTACACCTAC 2941 ACCTACGATC TGCAACCCAA AGGCAGCAGC CCGGACCAGC CAATCAAAGA GAACCTGCCC 3001 GCCTATTCCG TCGCCCGCGT CAGCCTGCCC ATGTTGAACG AAGACATCAC ATGCGATACC 3061 CTGCAAATGT GGGAAGCCAT CAGCGTCAAG ACCGAGGTGG TCGGCATCAG CAGCCTGATC 3121 AACGTCCACT ACTGGGATAT GAAGCGCGTC CACGACTACG GCGCCGGCAT CCCGGTCAGC 3181 GGCGTCAACT ATCACATGTT CGCAATCGGC GGCGAGCCGT TGGACCTGCA AGGGCTGGTC 3241 CTGGACTATC AGACCCAGTA CCCCAAGACC ACCAACGGCG GCCCCATCAC CATCGAGACC 3301 GTGCTCGGCC GCAAGATGAC CCCGAAGAAC CAGGGGTTGG ACCCACAAGC CAAGGCCAAG 3361 CTGGACAAGG ACGGCAACTA CCCCATCGAG GTCTGGTGCC CCGACCCGAG CAAGAACGAG 3421 AATTCCCGCT ACTACGGCAG CATCCAAACC GGGAGCCAGA CCCCGACCGT GTTGCAGTTC 3481 AGCAACACCC TGACCACCGT GCTGCTGGAC GAAAACGGCG TCGGCCCACT GTGCAAGGGC 3541 GATGGCCTGT TCATCTCATG CGCCGATATC GTCGGATTCC TGTTCAAGAC ATCCGGCAAG 3601 ATGGCCTTGC ACGGCCTGCC ACGCTACTTC AACGTCACCC TGCGAAAGCG CTGGGTCAAG 3661 AATCCATATC CCGTGGTCAA CCTGATCAAT AGCTTGTTCA GCAATCTGAT GCCCAAGGTC 3721 AGCGGCCAGC CCATGGAGGG CAAGGACAAC CAAGTGGAGG AAGTCCGCAT CTACGAAGGC 3781 AGCGAGCAGC TGCCCGGCGA CCCCGACATC GTGCGGTTCC TGGACAAGTT CGGCCAAGAA 3841 AAGACCGTCT ATCCCAAACC AAGCGTCGCA CCCGCCGCCG TGACATTCCA GTCCAACCAG 3901 CAAGACAAAG GCAAGGCCCC GCTGAAGGGC CCGCAAAAGG CAAGCCAGAA GGAGTCACAG 3961 ACGCAGGAGC TATGAGCCTA GGACCCAGCT TTCTTGTACA AAGTGGTTCG ATCTAGAATG 4021 GCTAGTGGAT CCCCCGGGCT GCAGGAATTC GATATCAAGC TTATCGATAA TCAACCTCTG 4081 GATTACAAAA TTTGTGAAAG ATTGACTGGT ATTCTTAACT ATGTTGCTCC TTTTACGCTA 4141 TGTGGATACG CTGCTTTAAT GCCTTTGTAT CATGCTATTG CTTCCCGTAT GGCTTTCATT 4201 TTCTCCTCCT TGTATAAATC CTGGTTGCTG TCTCTTTATG AGGAGTTGTG GCCCGTTGTC 4261 AGGCAACGTG GCGTGGTGTG CACTGTGTTT GCTGACGCAA CCCCCACTGG TTGGGGCATT 4321 GCCACCACCT GTCAGCTCCT TTCCGGGACT TTCGCTTTCC CCCTCCCTAT TGCCACGGCG 4381 GAACTCATCG CCGCCTGCCT TGCCCGCTGC TGGACAGGGG CTCGGCTGTT GGGCACTGAC 4441 AATTCCGTGG TGTTGTCGGG GAAATCATCG TCCTTTCCTT GGCTGCTCGC CTGTGTTGCC 4501 ACCTGGATTC TGCGCGGGAC GTCCTTCTGC TACGTCCCTT CGGCCCTCAA TCCAGCGGAC 4561 CTTCCTTCCC GCGGCCTGCT GCCGGCTCTG CGGCCTCTTC CGCGTCTTCG CCTTCGCCCT 4621 CAGACGAGTC GGATCTCCCT TTGGGCCGCC TCCCCGCATC GATACCGTCG GCCCACTGCT 4681 CCCTAAACCT GAGCTAGCAT TATCCCTAAT ACCTGCCACC CCACTCTTAA TCAGTGGTGG 4741 AAGAACGGTC TCAGAACTGT TTGTTTCAAT TGGCCATTTA AGTTTAGTAG TAAAAGACTG 4801 GTTAATGATA ACAATGCATC GTAAAACCTT CAGAAGGAAA GGAGAATGTT TTGTGGACCA 4861 CTTTGGTTTT CTTTTTTGCG TGTGGCAGTT TTAAGTTATT AGTTTTTAAA ATCAGTACTT 4921 TTTAATGGAA ACAACTTGAC CAAAAATTTG TCACAGAATT TTGAGACCCA TTAAAAAAGT 4981 TAAATGAGAA ACCTGTGTGT TCCTTTGGTC AACACCGAGA CATTTAGGTG AAAGACATCT 5041 AATTCTGGTT TTACGAATCT GGAAACTTCT TGAAAATGTA ATTCTTGAGT TAACACTTCT 5101 GGGTGGAGAA TAGGGTTGTT TTCCCCCCAC ATAATTGGAA GGGGAAGGAA TATCATTTAA 5161 AGCTATGGGA GGGTTTCTTT GATTACAACA CTGGAGAGAA ATGCAGCATG TTGCTGATTG 5221 CCTGTCACTA AAACAGGCCA AAAACTGAGT CCTTGGGTTG CATAGAAAGC TTCATGTTGC 5281 TAAACCAATG TTAAGTGAAT CTTTGGAAAC AAAATGTTTC CAAATTACTG GGATGTGCAT 5341 GTTGAAACGT GGGTTAATTA ACTAGCCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC 5401 CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG 5461 CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG 5521 GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA 5581 AATACTGTTC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG 5641 CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG 5701 TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA 5761 ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC 5821 CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT 5881 CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC 5941 TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA 6001 TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC 6061 CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTAATTA AATTTTTCAA AAGTAGTTGA 6121 CAATTAATCA TCGGCATAGT ATATCGGCAT AGTATAATAC GACTCACTAT AGGAGGGCCA 6181 TCATGGCCAA GTTGACCAGT GCTGTCCCAG TGCTCACAGC CAGGGATGTG GCTGGAGCTG 6241 TTGAGTTCTG GACTGACAGG TTGGGGTTCT CCAGAGATTT TGTGGAGGAT GACTTTGCAG 6301 GTGTGGTCAG AGATGATGTC ACCCTGTTCA TCTCAGCAGT CCAGGACCAG GTGGTGCCTG 6361 ACAACACCCT GGCTTGGGTG TGGGTGAGAG GACTGGATGA GCTGTATGCT GAGTGGAGTG 6421 AGGTGGTCTC CACCAACTTC AGGGATGCCA GTGGCCCTGC CATGACAGAG ATTGGAGAGC 6481 AGCCCTGGGG GAGAGAGTTT GCCCTGAGAG ACCCAGCAGG CAACTGTGTG CACTTTGTGG 6541 CAGAGGAGCA GGACTGAGGA TAAGAATTGT AACAAAAAAC CCCGCCCCGG CGGGGTTTTT 6601 TGTTAATTAA //