LOCUS PE1 (LABNA 6434 BP DS-DNA CIRCULAR SYN 10-JAN-1997 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers promoter 1..795 /note="CMV promoter" CAAT_signal 674..678 /note="CAAT box" TATA_signal 707..713 /note="TATA box" intron 857..989 /note="intron" promoter 1034..1052 /note="T7 promoter" CDS 1087..>2904 /note="E1" mutation 1443..1443 /note="N in Genbank sequence verified as T" mutation 1501..1501 /note="this base is possibly an A - not clear by 9/17 sequencing" misc_signal 1744..1863 /note="Zhao's packaging signal" CDS 2846..>2904 /note="E2 leader" frag 2906..2934 /note="discovered by sequencing" mRNA 2981..>3486 /note="HBV PRE" polyA_signal 3537..3758 /note="SV40 polyA" rep_origin 3848..4303 /note="phage f1" promoter 4672..4707 /note="AmpR's promoter" CDS 4742..5602 /note="ampR" rep_origin 5751..6421 /note="ColE1 Ori" BASE COUNT 1676 A 1467 C 1577 G 1713 T 1 OTHER ORIGIN - 1 TCAATATTGG CCATTAGCCA TATTATTCAT TGGTTATATA GCATAAATCA ATATTGGCTA 61 TTGGCCATTG CATACGTTGT ATCTATATCA TAATATGTAC ATTTATATTG GCTCATGTCC 121 AATATGACCG CCATGTTGGC ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG 181 GTCATTAGTT CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC 241 GCCTGGCTGA CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT 301 AGTAACGCCA ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC 361 CCACTTGGCA GTACATCAAG TGTATCATAT GCCAAGTCCG CCCCCTATTG ACGTCAATGA 421 CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTACGGGACT TTCCTACTTG 481 GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAC 541 CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT 601 CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAATAACCC 661 CGCCCCGTTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC 721 TCGTTTAGTG AACCGTCAGA TCACTAGAAG CTTTATTGCG GTAGTTTATC ACAGTTAAAT 781 TGCTAACGCA GTCAGTGCTT CTGACACAAC AGTCTCGAAC TTAAGCTGCA GAAGTTGGTC 841 GTGAGGCACT GGGCAGGTAA GTATCAAGGT TACAAGACAG GTTTAAGGAG ACCAATAGAA 901 ACTGGGCTTG TCGAGACAGA GAAGACTCTT GCGTTTCTGA TAGGCACCTA TTGGTCTTAC 961 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGTTCAATT ACAGCTCTTA 1021 AGGCTAGAGT ACTTAATACG ACTCACTATA GGCTAGCCTC GAGAATTCAC GCGTGGTACC 1081 CGCACCATGG CAAACGATAA AGGTAGCAAT TGGGATTCGG GCTTGGGATG CTCATATCTG 1141 CTGACTGAGG CAGAATGTGA AAGTGACAAA GAGAATGAGG AACCCGGGGC AGGTGTAGAA 1201 CTGTCTGTGG AATCTGATCG GTATGATAGC CAGGATGAGG ATTTTGTTGA CAATGCATCA 1261 GTCTTTCAGG GAAATCACCT GGAGGTCTTC CAGGCATTAG AGAAAAAGGC GGGTGAGGAG 1321 CAGATTTTAA ATTTGAAAAG AAAAGTATTG GGGAGTTCGC AAAACAGCAG CGGTTCCGAA 1381 GCATCTGAAA CTCCAGTTAA AAGACGGAAA TCAGGAGCAA AGCGAAGATT ATTTGCTGAA 1441 AATGAAGCTA ACCGTGTTCT TACGCCCCTC CAGGTACAGG GGGAGGGGGA GGGGAGGCAA 1501 GAACTTAATG AGGAGCAGGC AATTAGTCAT CTACATCTGC AGCTTGTTAA ATCTAAAAAT 1561 GCTACAGTTT TTAAGCTGGG GCTCTTTAAA TCTTTGTTCC TTTGTAGCTT CCATGATATT 1621 ACGAGGTTGT TTAAGAATGA TAAGACCACT AATCAGCAAT GGGTGCTGGC TGTGTTTGGC 1681 CTTGCAGAGG TGTTTTTTGA GGCGAGTTTC GAACTCCTAA AGAAGCAGTG TAGTTTTCTG 1741 CAGATGCAAA AAAGATCTCA TGAAGGAGGA ACTTGTGCAG TTTACTTAAT CTGCTTTAAC 1801 ACAGCTAAAA GCAGAGAAAC AGTCCGGAAT CTGATGGCAA ACACGCTAAA TGTAAGAGAA 1861 GAGTGTTTGA TGCTGCAGCC AGCTAAAATT CGAGGACTCA GCGCAGCTCT ATTCTGGTTT 1921 AAAAGTAGTT TGTCACCCGC TACACTTAAA CATGGTGCTT TACCTGAGTG GATACGGGCG 1981 CAAACTACTC TGAACGAGAG CTTGCAGACC GAGAAATTCG ACTTCGGAAC TATGGTGCAA 2041 TGGGCCTATG ATCACAAATA TGCTGAGGAG TCTAAAATAG CCTATGAATA TGCTTTGGCT 2101 GCAGGATCTG ATAGCAATGC ACGGGCTTTT TTAGCAACTA ACAGCCAAGC TAAGCATGTG 2161 AAGGACTGTG CAACTATGGT AAGACACTAT CTAAGAGCTG AAACACAAGC ATTAAGCATG 2221 CCTGCATATA TTAAAGCTAG GTGCAAGCTG GCAACTGGGG AAGGAAGCTG GAAGTCTATC 2281 CTAACTTTTT TTAACTATCA GAATATTGAA TTAATTACCT TTATTAATGC TTTAAAGCTC 2341 TGGCTAAAAG GAATTCCAAA AAAAAACTGT TTAGCATTTA TTGGCCCTCC AAACACAGGC 2401 AAGTCTATGC TCTGCAACTC ATTAATTCAT TTTTTGGGTG GTAGTGTTTT ATCTTTTGCC 2461 AACCATAAAA GTCACTTTTG GCTTGCTTCC CTAGCAGATA CTAGAGCTGC TTTAGTAGAT 2521 GATGCTACTC ATGCTTGCTG GAGGTACTTT GACACATACC TCAGAAATGC ATTGGATGGC 2581 TACCCTGTCA GTATTGATAG AAAACACAAA GCAGCGGTTC AAATTAAAGC TCCACCCCTC 2641 CTGGTAACCA GTAATATTGA TGTGCAGGCA GAGGACAGAT ATTTGTACTT GCATAGTCGG 2701 GTGCAAACCT TTCGCTTTGA GCAGCCATGC ACAGATGAAT CGGGTGAGCA ACCTTTTAAT 2761 ATTACTGATG CAGATTGGAA ATCTTTTTTT GTAAGGTTAT GGGGGCGTTT AGACCTGATT 2821 GACGAGGAGG AGGATAGTGA AGAGGATGGA GACAGCATGC GAACGTTTAC ATGTAGCGCA 2881 AGAAACACAA ATGCAGTTGA TTGAGAAAAG TAGTGATAAG TTGCAAGATC GGATCTAGAG 2941 CGGCCGCCAC CGCGGTGGCT AGAGGGCCCG TTTGTACCCC CGTGGAACCT TTGTGGCTCC 3001 TCTGCCGATC CATACTGCGG AACTCCTAGC CGCTTGTTTT GCTCGCAGCC CGTTTCCATG 3061 GCTGCTAGGC TGTACTGCCA ACTGGATCCT TCGCGGGACG TCCTTTGTTT ACGTCCCGTC 3121 GGCGCTGAAT CCCGCGGACG ACCCCTCGCG GGGCCGCTTG GGACTCTCTC GTCCCCTTCT 3181 CCGTCTGCCG TTCCAGCCGA CCACGGGGCG CACCTCTCTT TACGCGGTCT CCCCGTCTGT 3241 GCCTTCTCAT CTGCCGGTCC GTGTGCACTT CGCTTCACCT CTGCACGTTG CATGGAGACC 3301 ACCGTGAACG CCCATCAGAT CCTGCCCAAG GTCTTACATA AGAGGACTCT TGGACTCCCA 3361 GCAATGTCAA CGACCGACCT TGAGGCCTAC TTCAAAGACT GTGTGTTTAA GGACTGGGAG 3421 GAGCTGGGGG AGGAGATTAG GTTAAAGGTC TTTGTATTAG GAGGCTGTAG GDCATAAATT 3481 GGTCTGCGGG TTCGAAATCG ATAAGCTGAT CCGTCGACCC GGGCGGCCGC TTCGAGCAGA 3541 CATGATAAGA TACATTGATG AGTTTGGACA AACCACAACT AGAATGCAGT GAAAAAAATG 3601 CTTTATTTGT GAAATTTGTG ATGCTATTGC TTTATTTGTA ACCATTATAA GCTGCAATAA 3661 ACAAGTTAAC AACAACAATT GCATTCATTT TATGTTTCAG GTTCAGGGGG AGATGTGGGA 3721 GGTTTTTTAA AGCAAGTAAA ACCTCTACAA ATGTGGTAAA ATCGATAAGG ATCCGGGCTG 3781 GCGTAATAGC GAAGAGGCCC GCACCGATCG CCCTTCCCAA CAGTTGCGCA GCCTGAATGG 3841 CGAATGGACG CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC 3901 GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT 3961 CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC TTTAGGGTTC 4021 CGATTTAGAG CTTTACGGCA CCTCGACCGC AAAAAACTTG ATTTGGGTGA TGGTTCACGT 4081 AGTGGGCCAT CGCCCTGATA GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT 4141 AATAGTGGAC TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT 4201 GATTTATAAG GGATTTTGCC GATTTCGGCC TATTGGTTAA AAAATGAGCT GATTTAACAA 4261 ATATTTAACG CGAATTTTAA CAAAATATTA ACGTTTACAA TTTCGCCTGA TGCGGTATTT 4321 TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATATGG TGCACTCTCA GTACAATCTG 4381 CTCTGATGCC GCATAGTTAA GCCAGCCCCG ACACCCGCCA ACACCCGCTG ACGCGCCCTG 4441 ACGGGCTTGT CTGCTCCCGG CATCCGCTTA CAGACAAGCT GTGACCGTCT CCGGGAGCTG 4501 CATGTGTCAG AGGTTTTCAC CGTCATCACC GAAACGCGCG AGACGAAAGG GCCTCGTGAT 4561 ACGCCTATTT TTATAGGTTA ATGTCATGAT AATAATGGTT TCTTAGACGT CAGGTGGCAC 4621 TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT 4681 GTATCCGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG 4741 TATGAGTATT CAACATTTCC GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC 4801 TGTTTTTGCT CACCCAGAAA CGCTGGTGAA AGTAAAAGAT GCTGAAGATC AGTTGGGTGC 4861 ACGAGTGGGT TACATCGAAC TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC 4921 CGAAGAACGT TTTCCAATGA TGAGCACTTT TAAAGTTCTG CTATGTGGCG CGGTATTATC 4981 CCGTATTGAC GCCGGGCAAG AGCAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT 5041 GGTTGAGTAC TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAG TAAGAGAATT 5101 ATGCAGTGCT GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT 5161 CGGAGGACCG AAGGAGCTAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT 5221 TGATCGTTGG GAACCGGAGC TGAATGAAGC CATACCAAAC GACGAGCGTG ACACCACGAT 5281 GCCTGTAGCA ATGGCAACAA CGTTGCGCAA ACTATTAACT GGCGAACTAC TTACTCTAGC 5341 TTCCCGGCAA CAATTAATAG ACTGGATGGA GGCGGATAAA GTTGCAGGAC CACTTCTGCG 5401 CTCGGCCCTT CCGGCTGGCT GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTC 5461 TCGCGGTATC ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA 5521 CACGACGGGG AGTCAGGCAA CTATGGATGA ACGAAATAGA CAGATCGCTG AGATAGGTGC 5581 CTCACTGATT AAGCATTGGT AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA 5641 TTTAAAACTT CATTTTTAAT TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT 5701 GACCAAAATC CCTTAACGTG AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT 5761 CAAAGGATCT TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA 5821 ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA 5881 GGTAACTGGC TTCAGCAGAG CGCAGATACC AAATACTGTC CTTCTAGTGT AGCCGTAGTT 5941 AGGCCACCAC TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT 6001 ACCAGTGGCT GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA 6061 GTTACCGGAT AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT 6121 GGAGCGAACG ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC 6181 GCTTCCCGAA GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA 6241 GCGCACGAGG GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG 6301 CCACCTCTGA CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA 6361 AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT 6421 GGCTCGACAG ATCT //