LOCUS       PE1-E2 (LA   8253 BP DS-DNA   CIRCULAR  SYN       10-JAN-1997
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     promoter        1..795
                     /note="CMV promoter"
     CAAT_signal     674..678
                     /note="CAAT box"
     TATA_signal     707..713
                     /note="TATA box"
     intron          857..989
                     /note="intron"
     promoter        1034..1052
                     /note="T7 promoter"
     CDS             1087..>2904
                     /note="E1"
     mutation        1443..1443
                     /note="N in Genbank sequence verified as T"
     CDS             2846..>2904
                     /note="E2 leader"
     mRNA            2958..>3533
                     /note="EMCV IRES"
     CDS             3534..>4766
                     /note="E2-TA (weighs 45.4kD)"
     CDS             <3534..>3592
                     /note="E1 tail"
     mRNA            4006..4006
                     /note="p3080 first transcribed base"
     protein_bind    4014..4025
                     /note="E2 BS"
     CDS             4017..4766
                     /note="E2-TR (weighs 26.9kD)"
     mutation        4018..4018
                     /note="T->C mutation kills E2TR ATG"
     CDS             4117..>4480
                     /note="E4"
     misc_feature    4140..4163
                     /note="putative splice acceptor"
     misc_feature    4149..4151
                     /note="first E2 residue in E8-E2"
     mutation        4370..4370
                     /note="Unknown base missed in original publications"
     mutation        4434..4436
                     /note="S301A kills PEST (Penrose 01)"
     mRNA            4800..>5305
                     /note="HBV PRE"
     polyA_signal    5356..5577
                     /note="SV40 polyA"
     rep_origin      5667..6122
                     /note="phage f1"
     promoter        6491..6526
                     /note="AmpR's promoter"
     CDS             6561..7421
                     /note="ampR"
     rep_origin      7570..8240
                     /note="ColE1 Ori"
BASE COUNT     2142 A   1928 C   2055 G   2126 T      2 OTHER
ORIGIN      -
        1 TCAATATTGG CCATTAGCCA TATTATTCAT TGGTTATATA GCATAAATCA ATATTGGCTA
       61 TTGGCCATTG CATACGTTGT ATCTATATCA TAATATGTAC ATTTATATTG GCTCATGTCC
      121 AATATGACCG CCATGTTGGC ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG
      181 GTCATTAGTT CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC
      241 GCCTGGCTGA CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT
      301 AGTAACGCCA ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC
      361 CCACTTGGCA GTACATCAAG TGTATCATAT GCCAAGTCCG CCCCCTATTG ACGTCAATGA
      421 CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTACGGGACT TTCCTACTTG
      481 GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAC
      541 CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT
      601 CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAATAACCC
      661 CGCCCCGTTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC
      721 TCGTTTAGTG AACCGTCAGA TCACTAGAAG CTTTATTGCG GTAGTTTATC ACAGTTAAAT
      781 TGCTAACGCA GTCAGTGCTT CTGACACAAC AGTCTCGAAC TTAAGCTGCA GAAGTTGGTC
      841 GTGAGGCACT GGGCAGGTAA GTATCAAGGT TACAAGACAG GTTTAAGGAG ACCAATAGAA
      901 ACTGGGCTTG TCGAGACAGA GAAGACTCTT GCGTTTCTGA TAGGCACCTA TTGGTCTTAC
      961 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGTTCAATT ACAGCTCTTA
     1021 AGGCTAGAGT ACTTAATACG ACTCACTATA GGCTAGCCTC GAGAATTCAC GCGTGGTACC
     1081 CGCACCATGG CAAACGATAA AGGTAGCAAT TGGGATTCGG GCTTGGGATG CTCATATCTG
     1141 CTGACTGAGG CAGAATGTGA AAGTGACAAA GAGAATGAGG AACCCGGGGC AGGTGTAGAA
     1201 CTGTCTGTGG AATCTGATCG GTATGATAGC CAGGATGAGG ATTTTGTTGA CAATGCATCA
     1261 GTCTTTCAGG GAAATCACCT GGAGGTCTTC CAGGCATTAG AGAAAAAGGC GGGTGAGGAG
     1321 CAGATTTTAA ATTTGAAAAG AAAAGTATTG GGGAGTTCGC AAAACAGCAG CGGTTCCGAA
     1381 GCATCTGAAA CTCCAGTTAA AAGACGGAAA TCAGGAGCAA AGCGAAGATT ATTTGCTGAA
     1441 AATGAAGCTA ACCGTGTTCT TACGCCCCTC CAGGTACAGG GGGAGGGGGA GGGGAGGCAA
     1501 GAACTTAATG AGGAGCAGGC AATTAGTCAT CTACATCTGC AGCTTGTTAA ATCTAAAAAT
     1561 GCTACAGTTT TTAAGCTGGG GCTCTTTAAA TCTTTGTTCC TTTGTAGCTT CCATGATATT
     1621 ACGAGGTTGT TTAAGAATGA TAAGACCACT AATCAGCAAT GGGTGCTGGC TGTGTTTGGC
     1681 CTTGCAGAGG TGTTTTTTGA GGCGAGTTTC GAACTCCTAA AGAAGCAGTG TAGTTTTCTG
     1741 CAGATGCAAA AAAGATCTCA TGAAGGAGGA ACTTGTGCAG TTTACTTAAT CTGCTTTAAC
     1801 ACAGCTAAAA GCAGAGAAAC AGTCCGGAAT CTGATGGCAA ACACGCTAAA TGTAAGAGAA
     1861 GAGTGTTTGA TGCTGCAGCC AGCTAAAATT CGAGGACTCA GCGCAGCTCT ATTCTGGTTT
     1921 AAAAGTAGTT TGTCACCCGC TACACTTAAA CATGGTGCTT TACCTGAGTG GATACGGGCG
     1981 CAAACTACTC TGAACGAGAG CTTGCAGACC GAGAAATTCG ACTTCGGAAC TATGGTGCAA
     2041 TGGGCCTATG ATCACAAATA TGCTGAGGAG TCTAAAATAG CCTATGAATA TGCTTTGGCT
     2101 GCAGGATCTG ATAGCAATGC ACGGGCTTTT TTAGCAACTA ACAGCCAAGC TAAGCATGTG
     2161 AAGGACTGTG CAACTATGGT AAGACACTAT CTAAGAGCTG AAACACAAGC ATTAAGCATG
     2221 CCTGCATATA TTAAAGCTAG GTGCAAGCTG GCAACTGGGG AAGGAAGCTG GAAGTCTATC
     2281 CTAACTTTTT TTAACTATCA GAATATTGAA TTAATTACCT TTATTAATGC TTTAAAGCTC
     2341 TGGCTAAAAG GAATTCCAAA AAAAAACTGT TTAGCATTTA TTGGCCCTCC AAACACAGGC
     2401 AAGTCTATGC TCTGCAACTC ATTAATTCAT TTTTTGGGTG GTAGTGTTTT ATCTTTTGCC
     2461 AACCATAAAA GTCACTTTTG GCTTGCTTCC CTAGCAGATA CTAGAGCTGC TTTAGTAGAT
     2521 GATGCTACTC ATGCTTGCTG GAGGTACTTT GACACATACC TCAGAAATGC ATTGGATGGC
     2581 TACCCTGTCA GTATTGATAG AAAACACAAA GCAGCGGTTC AAATTAAAGC TCCACCCCTC
     2641 CTGGTAACCA GTAATATTGA TGTGCAGGCA GAGGACAGAT ATTTGTACTT GCATAGTCGG
     2701 GTGCAAACCT TTCGCTTTGA GCAGCCATGC ACAGATGAAT CGGGTGAGCA ACCTTTTAAT
     2761 ATTACTGATG CAGATTGGAA ATCTTTTTTT GTAAGGTTAT GGGGGCGTTT AGACCTGATT
     2821 GACGAGGAGG AGGATAGTGA AGAGGATGGA GACAGCATGC GAACGTTTAC ATGTAGCGCA
     2881 AGAAACACAA ATGCAGTTGA TTGAGAAAAG TAGTGATAAG TTGCAAGATC GGATCTAGAG
     2941 CGGCCGCCAC GGGATCCGCC CCTCTCCCTC CCCCCCCCCT AACGTTACTG GCCGAAGCCG
     3001 CTTGGAATAA GGCCGGTGTG CGTTTGTCTA TATGTTATTT TCCACCATAT TGCCGTCTTT
     3061 TGGCAATGTG AGGGCCCGGA AACCTGGCCC TGTCTTCTTG ACGAGCATTC CTAGGGGTCT
     3121 TTCCCCTCTC GCCAAAGGAA TGCAAGGTCT GTTGAATGTC GTGAAGGAAG CAGTTCCTCT
     3181 GGAAGCTTCT TGAAGACAAA CAACGTCTGT AGCGACCCTT TGCAGGCAGC GGAACCCCCC
     3241 ACCTGGCGAC AGGTGCCTCT GCGGCCAAAA GCCACGTGTA TAAGATACAC CTGCAAAGGC
     3301 GGCACAACCC CAGTGCCACG TTGTGAGTTG GATAGTTGTG GAAAGAGTCA AATGGCTCTC
     3361 CTCAAGCGTA TTCAACAAGG GGCTGAAGGA TGCCCAGAAG GTACCCCATT GTATGGGATC
     3421 TGATCTGGGG CCTCGGTGCA CATGCTTTAC ATGTGTTTAG TCGAGGTTAA AAAAACGTCT
     3481 AGGCCCCCCG AACCACGGGG ACGTGGTTTT CCTTTGAAAA ACACGATGAT AATATGGAGA
     3541 CAGCATGCGA ACGTTTACAT GTAGCGCAAG AAACACAAAT GCAGTTGATT GAGAAAAGTA
     3601 GTGATAAGTT GCAAGATCAT ATACTGTACT GGACTGCTGT TAGAACTGAG AACACACTGC
     3661 TTTATGCTGC AAGGAAAAAA GGGGTGACTG TCCTAGGACA CTGCAGAGTA CCACACTCTG
     3721 TAGTTTGTCA AGAGAGAGCC AAGCAGGCCA TTGAAATGCA GTTGTCTTTG CAGGAGTTAA
     3781 GCAAAACTGA GTTTGGGGAT GAACCATGGT CTTTGCTTGA CACAAGCTGG GACCGATATA
     3841 TGTCAGAACC TAAACGGTGC TTTAAGAAAG GCGCCAGGGT GGTAGAGGTG GAGTTTGATG
     3901 GAAATGCAAG CAATACAAAC TGGTACACTG TCTACAGCAA TTTGTACATG CGCACAGAGG
     3961 ACGGCTGGCA GCTTGCGAAG GCTGGGGCTG ACGGAACTGG GCTCTACTAC TGCACCATGG
     4021 CCGGTGCTGG ACGCATTTAC TATTCTCGCT TTGGTGACGA GGCAGCCAGA TTTAGTACAA
     4081 CAGGGCATTA CTCTGTAAGA GATCAGGACA GAGTGTATGC TGGTGTCTCA TCCACCTCTT
     4141 CTGATTTTAG AGATCGCCCA GACGGAGTCT GGGTCGCATC CGAAGGACCT GAAGGAGACC
     4201 CTGCAGGAAA AGAAGCCGAG CCAGCCCAGC CTGTCTCTTC TTTGCTCGGC TCCCCCGCCT
     4261 GCGGTCCCAT CAGAGCAGGC CTCGGTTGGG TACGGGACGG TCCTCGCTCG CACCCCTACA
     4321 ATTTTCCTGC AGGCTCGGGG GGCTCTATTC TCCGCTCTTC CTCCACCCCN GTGCAGGGCA
     4381 CGGTACCGGT GGACTTGGCA TCAAGGCAGG AAGAAGAGGA GCAGTCGCCC GACTCCACAG
     4441 AGGAAGAACC AGTGACTCTC CCAAGGCGCA CCACCAATGA TGGATTCCAC CTGTTAAAGG
     4501 CAGGAGGGTC ATGCTTTGCT CTAATTTCAG GAACTGCTAA CCAGGTAAAG TGCTATCGCT
     4561 TTCGGGTGAA AAAGAACCAT AGACATCGCT ACGAGAACTG CACCACCACC TGGTTCACAG
     4621 TTGCTGACAA CGGTGCTGAA AGACAAGGAC AAGCACAAAT ACTGATCACC TTTGGATCGC
     4681 CAAGTCAAAG GCAAGACTTT CTGAAACATG TACCACTACC TCCTGGAATG AACATTTCCG
     4741 GCTTTACAGC CAGCTTGGAC TTCTGATCAC TTCGAGTCTA GAGGGCCCGT TTGTACCCCC
     4801 GTGGAACCTT TGTGGCTCCT CTGCCGATCC ATACTGCGGA ACTCCTAGCC GCTTGTTTTG
     4861 CTCGCAGCCC GTTTCCATGG CTGCTAGGCT GTACTGCCAA CTGGATCCTT CGCGGGACGT
     4921 CCTTTGTTTA CGTCCCGTCG GCGCTGAATC CCGCGGACGA CCCCTCGCGG GGCCGCTTGG
     4981 GACTCTCTCG TCCCCTTCTC CGTCTGCCGT TCCAGCCGAC CACGGGGCGC ACCTCTCTTT
     5041 ACGCGGTCTC CCCGTCTGTG CCTTCTCATC TGCCGGTCCG TGTGCACTTC GCTTCACCTC
     5101 TGCACGTTGC ATGGAGACCA CCGTGAACGC CCATCAGATC CTGCCCAAGG TCTTACATAA
     5161 GAGGACTCTT GGACTCCCAG CAATGTCAAC GACCGACCTT GAGGCCTACT TCAAAGACTG
     5221 TGTGTTTAAG GACTGGGAGG AGCTGGGGGA GGAGATTAGG TTAAAGGTCT TTGTATTAGG
     5281 AGGCTGTAGG DCATAAATTG GTCTGCGGGT TCGAAATCGA TAAGCTGATC CGTCGACCCG
     5341 GGCGGCCGCT TCGAGCAGAC ATGATAAGAT ACATTGATGA GTTTGGACAA ACCACAACTA
     5401 GAATGCAGTG AAAAAAATGC TTTATTTGTG AAATTTGTGA TGCTATTGCT TTATTTGTAA
     5461 CCATTATAAG CTGCAATAAA CAAGTTAACA ACAACAATTG CATTCATTTT ATGTTTCAGG
     5521 TTCAGGGGGA GATGTGGGAG GTTTTTTAAA GCAAGTAAAA CCTCTACAAA TGTGGTAAAA
     5581 TCGATAAGGA TCCGGGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC
     5641 AGTTGCGCAG CCTGAATGGC GAATGGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG
     5701 TGTGGTGGTT ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT
     5761 CGCTTTCTTC CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG
     5821 GGGGCTCCCT TTAGGGTTCC GATTTAGAGC TTTACGGCAC CTCGACCGCA AAAAACTTGA
     5881 TTTGGGTGAT GGTTCACGTA GTGGGCCATC GCCCTGATAG ACGGTTTTTC GCCCTTTGAC
     5941 GTTGGAGTCC ACGTTCTTTA ATAGTGGACT CTTGTTCCAA ACTGGAACAA CACTCAACCC
     6001 TATCTCGGTC TATTCTTTTG ATTTATAAGG GATTTTGCCG ATTTCGGCCT ATTGGTTAAA
     6061 AAATGAGCTG ATTTAACAAA TATTTAACGC GAATTTTAAC AAAATATTAA CGTTTACAAT
     6121 TTCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG TATTTCACAC CGCATATGGT
     6181 GCACTCTCAG TACAATCTGC TCTGATGCCG CATAGTTAAG CCAGCCCCGA CACCCGCCAA
     6241 CACCCGCTGA CGCGCCCTGA CGGGCTTGTC TGCTCCCGGC ATCCGCTTAC AGACAAGCTG
     6301 TGACCGTCTC CGGGAGCTGC ATGTGTCAGA GGTTTTCACC GTCATCACCG AAACGCGCGA
     6361 GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT
     6421 CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT
     6481 TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT
     6541 AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT
     6601 TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG
     6661 CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA
     6721 TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC
     6781 TATGTGGCGC GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC
     6841 ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG
     6901 GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA
     6961 ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG
     7021 GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG
     7081 ACGAGCGTGA CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG
     7141 GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG
     7201 TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG
     7261 GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT
     7321 CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC
     7381 AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT
     7441 CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA
     7501 TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT
     7561 CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT
     7621 GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC
     7681 TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC
     7741 TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC
     7801 TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG
     7861 GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT
     7921 CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG
     7981 AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG
     8041 GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT
     8101 ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG
     8161 GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT
     8221 GCTGGCCTTT TGCTCACATG GCTCGACAGA TCT
//