LOCUS POL2.TXT 4919 BP DS-DNA CIRCULAR SYN 10-NOV-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers promoter 14..544 /note="CMV promoter" 5'UTR 545..828 /note="HTLV1 R-U5" CDS 830..>2353 /note="HPV1 L2 (prototype, codon modified)" polyA_signal 2364..2597 /note="SV40 Late polyA" rep_origin 2615..3336 /note="MB1 Ori" misc_feature 3358..3701 /note="SV40 homology" promoter 3527..3701 /note="SV40 early promoter" rep_origin 3550..3701 /note="SV40 Ori" misc_feature 3599..3648 /note="SV40 transcription start points" CDS 3614..3685 /note="SELP" promoter 3736..3791 /note="EM7" CDS 3811..4605 /note="NeoR" polyA_signal 4653..4919 /note="HSV TK pA" BASE COUNT 1122 A 1471 C 1277 G 1049 T 0 OTHER ORIGIN - 1 CCTGCAGGGC CCACTAGTCC GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC 61 CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG 121 GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA AACTGCCCAC TTGGCAGTAC 181 ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG 241 CCTGGCATTA TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG 301 TATTAGTCAT CGCTATTACC ATGATGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT 361 AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT 421 TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC 481 AAATGAGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG CAGAGCTCGT TTAGTGAACC 541 GTAAGCTTCG AGGGGCTCGC ATCTCTCCTT CACGCGCCCG CCGCCCTACC TGAGGCCGCC 601 ATCCACGCCG GTTGAGTCGC GTTCTGCCGC CTCCCGCCTG TGGTGCCTCC TGAACTGCGT 661 CCGCCGTCTA GGTAAGTTTA AAGCTCAGGT CGAGACCGGG CCTTTGTCCG GCGCTCCCTT 721 GGAGCCTACC TAGACTCAGC CGGCTCTCCA CGCTTTGCCT GACCCTGCTT GCTCAACTCT 781 ACGTCTTTGT TTCGTTTTCT GTTCTGCGCC GTTACAGATC CAAGCCACCA TGTACAGATT 841 GAGGCGCAAG AGGGCCGCAC CAAAGGACAT CTATCCAAGC TGCAAGATCA GCAATACATG 901 CCCACCCGAT ATCCAGAACA AGATCGAACA CACCACCATC GCCGACAAGA TCCTGCAGTA 961 CGGGTCCCTC GGCGTGTTCC TCGGCGGCCT CGGGATCGGC ACCGCACGCG GGAGCGGCGG 1021 GCGCATCGGC TACACCCCAT TGGGCGAAGG CGGCGGCGTC CGCGTCGCCA CAAGACCCAC 1081 CCCCGTCCGC CCCACCATCC CCGTCGAGAC CGTCGGGCCA TCCGAGATCT TTCCAATCGA 1141 CGTCGTGGAC CCCACCGGGC CCGCCGTCAT CCCATTGCAG GACCTGGGCC GCGATTTTCC 1201 AATCCCCACC GTCCAAGTCA TCGCCGAGAT CCATCCCATC AGCGATATCC CCAATATCGT 1261 CGCCAGCAGC ACCAACGAGG GCGAGAGCGC AATCCTGGAC GTCCTGCAAG GCTCCGCCAC 1321 AATCAGAACC GTCAGCCGCA CCCAGTACAA CAATCCAAGC TTTACCGTCG CCAGCACCAG 1381 CAACATCTCC GCCGGCGAGG CCAGCACCAG CGACATCGTG TTCGTCTCAA ACGGCAGCGG 1441 CGATCGCGTC GTCGGGGAAG ACATTCCACT CGTGGAGCTG AATCTGGGGT TGGAGACCGA 1501 TACCAGCAGC GTCGTGCAGG AGACCGCCTT CAGTTCATCA ACCCCCATCG CCGAGCGCCC 1561 AAGCTTCCGC CCAAGCCGCT TTTACAACCG CAGACTCTAC GAGCAAGTCC AGGTCCAGGA 1621 TCCCCGCTTT GTCGAACAGC CCCAAAGCAT GGTCACCTTC GACAACCCCG CCTTCGAACC 1681 CGAATTGGAC GAAGTCAGCA TCATCTTTCA GCGCGATCTG GACGCCTTGG CCCAAACCCC 1741 CGTCCCCGAG TTCCGCGACG TGGTCTACCT CTCAAAACCA ACCTTCAGCA GAGAGCCCGG 1801 CGGCAGACTG CGCGTCTCAA GATTGGGGAA GTCCAGCACC ATCAGAACCA GACTCGGGAC 1861 CGCCATCGGG GCACGCACAC ATTTCTTTTA CGACCTGTCC AGCATCGCCC CCGAGGATAG 1921 CATCGAGCTG CTGCCCCTGG GCGAACACTC CCAGACCACC GTCATCTCCA GCAATCTGGG 1981 CGATACCGCC TTCATCCAGG GCGAAACCGC CGAAGACGAT CTGGAGGTCA TTAGCCTGGA 2041 GACCCCACAG CTGTATAGCG AGGAGGAATT GCTGGATACC AATGAGTCCG TCGGGGAGAA 2101 CCTCCAGTTG ACCATCACCA ATAGCGAAGG CGAAGTCAGC ATCTTGGACC TGACCCAGTC 2161 ACGCGTCCGC CCACCCTTCG GGACCGAGGA CACCTCACTC CACGTCTACT ATCCCAACAG 2221 CAGCAAGGGC ACCCCCATCA TCAACCCCGA GGAGAGCTTC ACCCCACTCG TCATCATCGC 2281 CTTGAACAAT AGCACCGGCG ACTTCGAACT GCACCCATCC TTGCGCAAAA GGAGGAAGCG 2341 CGCCTACGTC TGAGGCTCGA GGCTAGCTGG CCAGACATGA TAAGATACAT TGATGAGTTT 2401 GGACAAACCA CAACTAGAAT GCAGTGAAAA AAATGCTTTA TTTGTGAAAT TTGTGATGCT 2461 ATTGCTTTAT TTGTAACCAT TATAAGCTGC AATAAACAAG TTAACAACAA CAATTGCATT 2521 CATTTTATGT TTCAGGTTCA GGGGGAGGTG TGGGAGGTTT TTTAAAGCAA GTAAAACCTC 2581 TACAAATGTG GTATGGAATT CTTAATTAAC TAGCCATGAC CAAAATCCCT TAACGTGAGT 2641 TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT 2701 TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT 2761 GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC 2821 AGATACCAAA TACTGTTCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG 2881 TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG 2941 ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT 3001 CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC 3061 TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG 3121 ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG 3181 GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT 3241 TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT 3301 TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTAATTAAG CTGTACACTG 3361 TGGAATGTGT GTCAGTTAGG GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG CAGAAGTATG 3421 CAAAGCATGC ATCTCAATTA GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA 3481 GGCAGAAGTA TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 3541 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA TGACTGACTA 3601 ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCTGCCT CTGAGCTATT CCAGAAGTAG 3661 TGAGGAGGCT TTTTTGGAGG CCTAGGCTTT TGCAAAAAGC TCCCGGGAGC TTGTATATCC 3721 ATTTTCGGAT CTGATCAGCA CGTGTTGACA ATTAATCATC GGCATAGTAT ATCGGCATAG 3781 TATAATACGA CTCACTATAG GAGGGCCACC ATGATTGAAC AAGATGGATT GCACGCAGGT 3841 TCTCCGGCCG CTTGGGTGGA GAGGCTATTC GGCTATGACT GGGCACAACA GACAATCGGC 3901 TGCTCTGATG CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT TTTTGTCAAG 3961 ACCGACCTGT CCGGTGCCCT GAATGAACTG CAAGACGAGG CAGCGCGGCT ATCGTGGCTG 4021 GCCACGACGG GCGTTCCTTG CGCAGCTGTG CTCGACGTTG TCACTGAAGC GGGAAGGGAC 4081 TGGCTGCTAT TGGGCGAAGT GCCGGGGCAG GATCTCCTGT CATCTCACCT TGCTCCTGCC 4141 GAGAAAGTAT CCATCATGGC TGATGCAATG CGGCGGCTGC ATACGCTTGA TCCGGCTACC 4201 TGCCCATTCG ACCACCAAGC GAAACATCGC ATCGAGCGAG CACGTACTCG GATGGAAGCC 4261 GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG GGCTCGCGCC AGCCGAACTG 4321 TTCGCCAGGC TCAAGGCGAG CATGCCCGAC GGCGAGGATC TCGTCGTGAC ACATGGCGAT 4381 GCCTGCTTGC CGAATATCAT GGTGGAAAAT GGCCGCTTTT CTGGATTCAT CGACTGTGGC 4441 CGGCTGGGTG TGGCGGACCG CTATCAGGAC ATAGCGTTGG CTACCCGTGA TATTGCTGAA 4501 GAGCTTGGCG GCGAATGGGC TGACCGCTTC CTCGTGCTTT ACGGTATCGC CGCTCCCGAT 4561 TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT TCTGAGCGGG ACTCTGGGGT 4621 TCGAAATGAC CGACCAAGCG AATTCGCTAG AGGGAGGCTA ACTGAAACAC GGAAGGAGAC 4681 AATACCGGAA GGAACCCGCG CTATGACGGC AATAAAAAGA CAGAATAAAA CGCACGGTGT 4741 TGGGTCGTTT GTTCATAAAC GCGGGGTTCG GTCCCAGGGC TGGCACTCTG TCGATACCCC 4801 ACCGAGACCC CATTGGGGCC AATACGCCCG CGTTTCTTCC TTTTCCCCAC CCCACCCCCC 4861 AAGTTCGGGT GAAGGCCCAG GGCTCGCAGC CAACGTCGGG GCGGCAGGCC CTGCCATAG //