LOCUS pe1VP1 6502 bp ss-DNA circular SYN 17-JUN-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers enhancer 71..213 /note=SV40 enhancer elements rep_origin 238..>373 /note=SV40 Ori mutation 307..307 /note=G->A kills SELP ATG CDS 418..1137 /note=EGFP polyA_signal 1153..>1374 /note=SV40 Late PolyA promoter <1465..2643 /note=EF1a promoter and UTR exon 1663..1695 /note=EF-1a exon 1 intron 1696..2635 /note=EF-1a intron A exon 2635..2643 /note=EF-1a exon 2 leader misc_recomb 2667..>3691 /note=attB1 CDS 2704..>3867 /note=HPyV6 VP1 (codmod) misc_recomb 3875..>3899 /note=attB2 mRNA 3961..>4549 /note=WPRE polyA_signal 4585..5253 /note=hEF1a polyA signal rep_origin 5254..5987 /note=MB1 Ori promoter 5988..6074 /note=EM7 promoter CDS 6075..6449 /note=Zeo resistance terminator 6450..6502 /note=terminator (rpmB/G) BASE COUNT 1463 A 1776 C 1763 G 1500 T 0 OTHER ORIGIN ? 1 CCCTGCAGGG CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT GTGGAATGTG 61 TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 121 CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT 181 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC TCCGCCCATC 241 CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC ATGGCTGACT AATTTTTTTT 301 ATTTATACAG AGGCCGAGGC CGCCTCGGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC 361 TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTTGATTGGG ATCCACCGGT CGCCACCATG 421 GTGAGCAAGG GCGAGGAGCT GTTCACCGGG GTGGTGCCCA TCCTGGTCGA GCTGGACGGC 481 GACGTAAACG GCCACAAGTT CAGCGTGTCC GGCGAGGGCG AGGGCGATGC CACCTACGGC 541 AAGCTGACCC TGAAGTTCAT CTGCACCACC GGCAAGCTGC CCGTGCCCTG GCCCACCCTC 601 GTGACCACCC TGACCTACGG CGTGCAGTGC TTCAGCCGCT ACCCCGACCA CATGAAGCAG 661 CACGACTTCT TCAAGTCCGC CATGCCCGAA GGCTACGTCC AGGAGCGCAC CATCTTCTTC 721 AAGGACGACG GCAACTACAA GACCCGCGCC GAGGTGAAGT TCGAGGGCGA CACCCTGGTG 781 AACCGCATCG AGCTGAAGGG CATCGACTTC AAGGAGGACG GCAACATCCT GGGGCACAAG 841 CTGGAGTACA ACTACAACAG CCACAACGTC TATATCATGG CCGACAAGCA GAAGAACGGC 901 ATCAAGGTGA ACTTCAAGAT CCGCCACAAC ATCGAGGACG GCAGCGTGCA GCTCGCCGAC 961 CACTACCAGC AGAACACCCC CATCGGCGAC GGCCCCGTGC TGCTGCCCGA CAACCACTAC 1021 CTGAGCACCC AGTCCGCCCT GAGCAAAGAC CCCAACGAGA AGCGCGATCA CATGGTCCTG 1081 CTGGAGTTCG TGACCGCCGC CGGGATCACT CTCGGCATGG ACGAGCTGTA CAAGTAAAGC 1141 GGCCGCTTCG AGCAGACATG ATAAGATACA TTGATGAGTT TGGACAAACC ACAACTAGAA 1201 TGCAGTGAAA AAAATGCTTT ATTTGTGAAA TTTGTGATGC TATTGCTTTA TTTGTAACCA 1261 TTATAAGCTG CAATAAACAA GTTAACAACA ACAATTGCAT TCATTTTATG TTTCAGGTTC 1321 AGGGGGAGGT GTGGGAGGTT TTTTAAAGCA AGTAAAACCT CTACAAATGT GGTAAAATCG 1381 ATAAGGATCC GGGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT 1441 TGCGGTGGAG AAGAGCATGC GTGAGGCTCC GGTGCCCGTC AGTGGGCAGA GCGCACATCG 1501 CCCACAGTCC CCGAGAAGTT GGGGGGAGGG GTCGGCAATT GAACCGGTGC CTAGAGAAGG 1561 TGGCGCGGGG TAAACTGGGA AAGTGATGTC GTGTACTGGC TCCGCCCTTT TCCCGAGGGT 1621 GGGGGAGAAC CGTATATAAG TGCAGTAGTC GCTGTGAACG TTCTTTTTCG CAACGGGTTT 1681 GCCGCCAGAA CACAGGTAAG TGCCGTGTGT GGTTCCCGCG GGCCTGGCCT CTTTACGGGT 1741 TATGGCCCTT GCGTGCCTTG AATTACTTCC ACCTGGCTGC AGTACGTGAT TCTTGATCCC 1801 GAGCTTCGGG TTGGAAGTGG GTGGGAGAGT TCGAGGCCTT GCGCTTAAGG AGCCCCTTCG 1861 CCTCGTGCTT GAGTTGAGGC CTGGCCTGGG CGCTGGGGCC GCCGCGTGCG AATCTGGTGG 1921 CACCTTCGCG CCTGTCTCGC TGCTTTCGAT AAGTCTCTAG CCATTTAAAA TTTTTGATGA 1981 CCTGCTGCGA CGCTTTTTTT CTGGCAAGAT AGTCTTGTAA ATGCGGGCCA AGATCTGCAC 2041 ACTGGTATTT CGGTTTTTGG GGCCGCGGGC GGCGACGGGG CCCGTGCGTC CCAGCGCACA 2101 TGTTCGGCGA GGCGGGGCCT GCGAGCGCGG CCACCGAGAA TCGGACGGGG GTAGTCTCAA 2161 GCTGGCCGGC CTGCTCTGGT GCCTGGCCTC GCGCCGCCGT GTATCGCCCC GCCCTGGGCG 2221 GCAAGGCTGG CCCGGTCGGC ACCAGTTGCG TGAGCGGAAA GATGGCCGCT TCCCGGCCCT 2281 GCTGCAGGGA GCTCAAAATG GAGGACGCGG CGCTCGGGAG AGCGGGCGGG TGAGTCACCC 2341 ACACAAAGGA AAAGGGCCTT TCCGTCCTCA GCCGTCGCTT CATGTGACTC CACGGAGTAC 2401 CGGGCGCCGT CCAGGCACCT CGATTAGTTC TCGAGCTTTT GGAGTACGTC GTCTTTAGGT 2461 TGGGGGGAGG GGTTTTATGC GATGGAGTTT CCCCACACTG AGTGGGTGGA GACTGAAGTT 2521 AGGCCAGCTT GGCACTTGAT GTAATTCTCC TTGGAATTTG CCCTTTTTGA GTTTGGATCT 2581 TGGTTCATTC TCAAGCCTCA GACAGTGGTT CAAAGTTTTT TTCTTCCATT TCAGGTGTCG 2641 TGAGGAATTC TCTAGAGCTT GATCAAACAA GTTTGTACAA AAAAGCAGGC TTCCGGAGCC 2701 ACCATGCCCT GCCATCGCAA AGGCAACGGC CCAATCCAGA AGCTGCCACG CGTGATCAAG 2761 AAGGGAGGCG TCGAAGTCAT GGAGACCGTC CCGTTGTCCG AGGATACCAT CTACAAGGTC 2821 GAAGCCATCC TGCTGCCCAA CTTCGCCAGC GGCAGTAACA CCGCCGTGTA TCAAAGCAGA 2881 GGCGCACCAT ACACATTCAC CGATACCCTC GACGCTGGCA GCAGCCTCTG CTACACACTC 2941 GCCGTCGTCA ACCTGCCAGA GATCCCCGAA GCACTCTGCG ACGACACCCT GCTGGTCTGG 3001 GAAGCATTCC GCGTCGAGAC CGAGCTCATC TTCACACCGC AAGTCGGATC CGCTGGCTAC 3061 ATCCGCGCAC AAGGCACACC AGCTGGAGTG GAAGGCAGTC AGATGTACTT CTGGGCATGC 3121 GGAGGGTCCC CACTCGACGT GATCGGCATC AATCCCGACC CAGAGCGCAT GAACGTTGCT 3181 GCCGGGCTCG AAGGCCCAAG CAAGGAGAAC CAGCCCAGCG TCGCTGGCAT CAAGGCCACA 3241 CGCAAGCAGG TCACCGCTGC AAACTTCCCA ATCGAGATCT GGTCCGCCGA CCCTACACGC 3301 AACGAGAACT GCCGCTACTT CGGACGCATC GTCGGCGGAT CCGTCACTCC ACCTGTCGTG 3361 TCCTTCGGCA ACCAATCCAC AACCCCACTG GTCGACGAGA ACGGCGTCGG CATCTTGTGC 3421 CTGTTCGGCG CCATCTACCT GACCAGCGCC GACATGCTCG GCATGGTCGG CTACGCTGGC 3481 AATCCAACCT TGTCCGACGC ATACAGCCAG CAGCGCTCCG TCCAAGCCGC ATTCGGCCGC 3541 TTCTTCCGCG TCCATTTCCG CCAGCGCCGC GTCAAGCACC CCTACACCGT CGACATGATG 3601 TTCCGCCAGT TCCTGCAACC CCAGAAGCCT CAGGTCCAGG GAACCCAGCC AAACGCTGTG 3661 CAGGAAGTGG TCATGGAGCA GATGCAGCCA TCTATCTTGC CCACTACCCT CGAAGGAGCC 3721 ATCGGCTATA GCCCATCCAC CAAGTTCATC CTGCAAAACG GAGAGCTGAT CTATCCAAGC 3781 TCCACCGTGG CAGCCGGAGC CGCTAACCTG TTCGGACCGC CTGTGGAAAA GCAGACCTCC 3841 AAGGAGCCCA GCAAGGGCGA ACTCTGAGAC TAGTACCCAG CTTTCTTGTA CAAAGTGGTT 3901 CGATCTAGAA TGGCTAGTGG ATCCCCCGGG CTGCAGGAAT TCGATATCAA GCTTATCGAT 3961 AATCAACCTC TGGATTACAA AATTTGTGAA AGATTGACTG GTATTCTTAA CTATGTTGCT 4021 CCTTTTACGC TATGTGGATA CGCTGCTTTA ATGCCTTTGT ATCATGCTAT TGCTTCCCGT 4081 ATGGCTTTCA TTTTCTCCTC CTTGTATAAA TCCTGGTTGC TGTCTCTTTA TGAGGAGTTG 4141 TGGCCCGTTG TCAGGCAACG TGGCGTGGTG TGCACTGTGT TTGCTGACGC AACCCCCACT 4201 GGTTGGGGCA TTGCCACCAC CTGTCAGCTC CTTTCCGGGA CTTTCGCTTT CCCCCTCCCT 4261 ATTGCCACGG CGGAACTCAT CGCCGCCTGC CTTGCCCGCT GCTGGACAGG GGCTCGGCTG 4321 TTGGGCACTG ACAATTCCGT GGTGTTGTCG GGGAAATCAT CGTCCTTTCC TTGGCTGCTC 4381 GCCTGTGTTG CCACCTGGAT TCTGCGCGGG ACGTCCTTCT GCTACGTCCC TTCGGCCCTC 4441 AATCCAGCGG ACCTTCCTTC CCGCGGCCTG CTGCCGGCTC TGCGGCCTCT TCCGCGTCTT 4501 CGCCTTCGCC CTCAGACGAG TCGGATCTCC CTTTGGGCCG CCTCCCCGCA TCGATACCGT 4561 CGGCCCACTG CTCCCTAAAC CTGAGCTAGC ATTATCCCTA ATACCTGCCA CCCCACTCTT 4621 AATCAGTGGT GGAAGAACGG TCTCAGAACT GTTTGTTTCA ATTGGCCATT TAAGTTTAGT 4681 AGTAAAAGAC TGGTTAATGA TAACAATGCA TCGTAAAACC TTCAGAAGGA AAGGAGAATG 4741 TTTTGTGGAC CACTTTGGTT TTCTTTTTTG CGTGTGGCAG TTTTAAGTTA TTAGTTTTTA 4801 AAATCAGTAC TTTTTAATGG AAACAACTTG ACCAAAAATT TGTCACAGAA TTTTGAGACC 4861 CATTAAAAAA GTTAAATGAG AAACCTGTGT GTTCCTTTGG TCAACACCGA GACATTTAGG 4921 TGAAAGACAT CTAATTCTGG TTTTACGAAT CTGGAAACTT CTTGAAAATG TAATTCTTGA 4981 GTTAACACTT CTGGGTGGAG AATAGGGTTG TTTTCCCCCC ACATAATTGG AAGGGGAAGG 5041 AATATCATTT AAAGCTATGG GAGGGTTTCT TTGATTACAA CACTGGAGAG AAATGCAGCA 5101 TGTTGCTGAT TGCCTGTCAC TAAAACAGGC CAAAAACTGA GTCCTTGGGT TGCATAGAAA 5161 GCTTCATGTT GCTAAACCAA TGTTAAGTGA ATCTTTGGAA ACAAAATGTT TCCAAATTAC 5221 TGGGATGTGC ATGTTGAAAC GTGGGTTAAT TAACTAGCCA TGACCAAAAT CCCTTAACGT 5281 GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT 5341 CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG 5401 GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA 5461 GCGCAGATAC CAAATACTGT TCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 5521 TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT 5581 GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG 5641 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC 5701 GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG 5761 GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA 5821 GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT 5881 CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC 5941 TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTAAT TAAATTTTTC 6001 AAAAGTAGTT GACAATTAAT CATCGGCATA GTATATCGGC ATAGTATAAT ACGACTCACT 6061 ATAGGAGGGC CATCATGGCC AAGTTGACCA GTGCTGTCCC AGTGCTCACA GCCAGGGATG 6121 TGGCTGGAGC TGTTGAGTTC TGGACTGACA GGTTGGGGTT CTCCAGAGAT TTTGTGGAGG 6181 ATGACTTTGC AGGTGTGGTC AGAGATGATG TCACCCTGTT CATCTCAGCA GTCCAGGACC 6241 AGGTGGTGCC TGACAACACC CTGGCTTGGG TGTGGGTGAG AGGACTGGAT GAGCTGTATG 6301 CTGAGTGGAG TGAGGTGGTC TCCACCAACT TCAGGGATGC CAGTGGCCCT GCCATGACAG 6361 AGATTGGAGA GCAGCCCTGG GGGAGAGAGT TTGCCCTGAG AGACCCAGCA GGCAACTGTG 6421 TGCACTTTGT GGCAGAGGAG CAGGACTGAG GATAAGAATT GTAACAAAAA ACCCCGCCCC 6481 GGCGGGGTTT TTTGTTAATT AA //