LOCUS PUP-7665 ( 7665 BP DS-DNA CIRCULAR SYN 10-MAR-1998 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers CDS <233..>1951 /note="E1 [Split]" frag 233..2055 /note="948 to 2770 of pML2d" CDS 489..767 /note="E8" misc_signal 791..910 /note="Zhao 99 Virol minimal packaging signal" protein_bind 1681..1692 /note="E2 BS #15" CDS 1893..>2055 /note="E2-TA [Split]" promoter <2070..>2753 /note="URR fragment [Split]" rep_origin 2614..2691 /note="Minimal Ori (Fields p2211)" misc_RNA 2751..>2757 /note="P89 cap site (Type 1)" polyA_signal 2787..3056 /note="HSV TK pA signal" polyA_site 2845..2863 /note="HSV TK pA sites" promoter 3169..4347 /note="EF1a promoter" enhancer 3169..3366 /note="EF-1alpha promoter core domain" exon 3367..3399 /note="EF-1a exon 1" intron 3400..4339 /note="EF-1a intron A" exon 4339..4347 /note="EF-1a exon 2 leader" CDS 4408..5124 /note="EGFP (weighs 27kD)" polyA_signal 5139..5416 /note="BGH polyA" rep_origin 5846..6519 /note="Coli Ori" CDS complement(6671..7529) /note="AmpR" promoter complement(7564..7599) /note="AmpR's promoter" BASE COUNT 1898 A 1860 C 2039 G 1868 T 0 OTHER ORIGIN - 1 GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 61 CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 121 CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 181 TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGGGGGCAGG 241 TGTAGAACTG TCTGTGGAAT CTGATCGGTA TGATAGCCAG GATGAGGATT TTGTTGACAA 301 TGCATCAGTC TTTCAGGGAA ATCACCTGGA GGTCTTCCAG GCATTAGAGA AAAAGGCGGG 361 TGAGGAGCAG ATTTTAAATT TGAAAAGAAA AGTATTGGGG AGTTCGCAAA ACAGCAGCGG 421 TTCCGAAGCA TCTGAAACTC CAGTTAAAAG ACGGAAATCA GGAGCAAAGC GAAGATTATT 481 TGCTGAAAAT GAAGCTAACC GTGTTCTTAC GCCCCTCCAG GTACAGGGGG AGGGGGAGGG 541 GAGGCAAGAA CTTAATGAGG AGCAGGCAAT TAGTCATCTA CATCTGCAGC TTGTTAAATC 601 TAAAAATGCT ACAGTTTTTA AGCTGGGGCT CTTTAAATCT TTGTTCCTTT GTAGCTTCCA 661 TGATATTACG AGGTTGTTTA AGAATGATAA GACCACTAAT CAGCAATGGG TGCTGGCTGT 721 GTTTGGCCTT GCAGAGGTGT TTTTTGAGGC GAGTTTCGAA CTCCTAAAGA AGCAGTGTAG 781 TTTTCTGCAG ATGCAAAAAA GATCTCATGA AGGAGGAACT TGTGCAGTTT ACTTAATCTG 841 CTTTAACACA GCTAAAAGCA GAGAAACAGT CCGGAATCTG ATGGCAAACA CGCTAAATGT 901 AAGAGAAGAG TGTTTGATGC TGCAGCCAGC TAAAATTCGA GGACTCAGCG CAGCTCTATT 961 CTGGTTTAAA AGTAGTTTGT CACCCGCTAC ACTTAAACAT GGTGCTTTAC CTGAGTGGAT 1021 ACGGGCGCAA ACTACTCTGA ACGAGAGCTT GCAGACCGAG AAATTCGACT TCGGAACTAT 1081 GGTGCAATGG GCCTATGATC ACAAATATGC TGAGGAGTCT AAAATAGCCT ATGAATATGC 1141 TTTGGCTGCA GGATCTGATA GCAATGCACG GGCTTTTTTA GCAACTAACA GCCAAGCTAA 1201 GCATGTGAAG GACTGTGCAA CTATGGTAAG ACACTATCTA AGAGCTGAAA CACAAGCATT 1261 AAGCATGCCT GCATATATTA AAGCTAGGTG CAAGCTGGCA ACTGGGGAAG GAAGCTGGAA 1321 GTCTATCCTA ACTTTTTTTA ACTATCAGAA TATTGAATTA ATTACCTTTA TTAATGCTTT 1381 AAAGCTCTGG CTAAAAGGAA TTCCAAAAAA AAACTGTTTA GCATTTATTG GCCCTCCAAA 1441 CACAGGCAAG TCTATGCTCT GCAACTCATT AATTCATTTT TTGGGTGGTA GTGTTTTATC 1501 TTTTGCCAAC CATAAAAGTC ACTTTTGGCT TGCTTCCCTA GCAGATACTA GAGCTGCTTT 1561 AGTAGATGAT GCTACTCATG CTTGCTGGAG GTACTTTGAC ACATACCTCA GAAATGCATT 1621 GGATGGCTAC CCTGTCAGTA TTGATAGAAA ACACAAAGCA GCGGTTCAAA TTAAAGCTCC 1681 ACCCCTCCTG GTAACCAGTA ATATTGATGT GCAGGCAGAG GACAGATATT TGTACTTGCA 1741 TAGTCGGGTG CAAACCTTTC GCTTTGAGCA GCCATGCACA GATGAATCGG GTGAGCAACC 1801 TTTTAATATT ACTGATGCAG ATTGGAAATC TTTTTTTGTA AGGTTATGGG GGCGTTTAGA 1861 CCTGATTGAC GAGGAGGAGG ATAGTGAAGA GGATGGAGAC AGCATGCGAA CGTTTACATG 1921 TAGCGCAAGA AACACAAATG CAGTTGATTG AGAAAAGTAG TGATAAGTTG CAAGATCATA 1981 TACTGTACTG GACTGCTGTT AGAACTGAGA ACACACTGCT TTATGCTGCA AGGAAAAAAG 2041 GGGTGACTGT CCTAGTAGTT ATTACTAGCC GCGTGCAAAG CACCGGCGGC GGTAGATGCG 2101 GGGTAAGTAC TGAATTTTAA TTCGACCTAT CCCGGTAAAG CGAAAGCGAC ACGCTTTTTT 2161 TTCACACATA GCGGGACCGA ACACGTTATA AGTATCGATT AGGTCTATTT TTGTCTCTCT 2221 GTCGGAACCA GAACTGGTAA AAGTTTCCAT TGCGTCTGGG CTTGTCTATC ATTGCGTCTC 2281 TATGGTTTTT GGAGGATTAG ACCGGGCCAC CAGTAATGGT GCATAGCGGA TGTCTGTACC 2341 GCCATCGGTG CACCGATATA GGTTTGGGGC TCCCCAAGGG ACTGCTGGGA TGACAGCTTC 2401 ATATTATATT GAATGGGCGC ATAATCAGCT TAATTGGTGA GGACAAGCTA CAAGTTGTAA 2461 CCTGATCTCC ACAAAGTACC GTTGCCGGTC GGGGTCAAAC CGTCTTCGGT GCTCGAAACC 2521 GCCTTAAACT ACAGACAGGT CCCAGCCAAG TAGGCGGATC AAAACCTCAA AAAGGCGGGA 2581 GCCAATCAAA ATGCAGCATT ATATTTTAAG CTCACCGAAA CCGGTAAGTA AAGACTATGT 2641 ATTTTTTCCC AGTGAATAAT TGTTGTTAAC AATAATCACA CCATCACCGT TTTTTCAAGC 2701 GGGAAAAAAT AGCCAGCTAA CTATAAAAAG CTGCTGACAG ACCCCGGTTT TCATTAAGCT 2761 TGGTACCGAG CTCGGATCCA CTAGGGGGAG GCTAACTGAA ACACGGAAGG AGACAATACC 2821 GGAAGGAACC CGCGCTATGA CGGCAATAAA AAGACAGAAT AAAACGCACG GTGTTGGGTC 2881 GTTTGTTCAT AAACGCGGGG TTCGGTCCCA GGGCTGGCAC TCTGTCGATA CCCCACCGAG 2941 ACCCCATTGG GGCCAATACG CCCGCGTTTC TTCCTTTTCC CCACCCCACC CCCCAAGTTC 3001 GGGTGAAGGC CCAGGGCTCG CAGCCAACGT CGGGGCGGCA GGCCCTGCCA TAGCCTCAGC 3061 TACCGGACTC AGATCTCGAG CTCAAGCTTG CAAAGATGGA TAAAGTTTTA AACAGAGAGG 3121 AATCTTTGCA GCTAATGGAC CTTGTAGGTC TTGAAAGGAG TGGGAATTGG CTCCGGTGCC 3181 CGTCAGTGGG CAGAGCGCAC ATCGCCCACA GTCCCCGAGA AGTTGGGGGG AGGGGTCGGC 3241 AATTGAACCG GTGCCTAGAG AAGGTGGCGC GGGGTAAACT GGGAAAGTGA TGTCGTGTAC 3301 TGGCTCCGCC TTTTTCCCGA GGGTGGGGGA GAACCGTATA TAAGTGCAGT AGTCGCCGTG 3361 AACGTTCTTT TTCGCAACGG GTTTGCCGCC AGAACACAGG TAAGTGCCGT GTGTGGTTCC 3421 CGCGGGCCTG GCCTCTTTAC GGGTTATGGC CCTTGCGTGC CTTGAATTAC TTCCACCTGG 3481 CTGCAGTACG TGATTCTTGA TCCCGAGCTT CGGGTTGGAA GTGGGTGGGA GAGTTCGAGG 3541 CCTTGCGCTT AAGGAGCCCC TTCGCCTCGT GCTTGAGTTG AGGCCTGGCC TGGGCGCTGG 3601 GGCCGCCGCG TGCGAATCTG GTGGCACCTT CGCGCCTGTC TCGCTGCTTT CGATAAGTCT 3661 CTAGCCATTT AAAATTTTTG ATGACCTGCT GCGACGCTTT TTTTCTGGCA AGATAGTCTT 3721 GTAAATGCGG GCCAAGATCT GCACACTGGT ATTTCGGTTT TTGGGGCCGC GGGCGGCGAC 3781 GGGGCCCGTG CGTCCCAGCG CACATGTTCG GCGAGGCGGG GCCTGCGAGC GCGGCCACCG 3841 AGAATCGGAC GGGGGTAGTC TCAAGCTGGC CGGCCTGCTC TGGTGCCTGG CCTCGCGCCG 3901 CCGTGTATCG CCCCGCCCTG GGCGGCAAGG CTGGCCCGGT CGGCACCAGT TGCGTGAGCG 3961 GAAAGATGGC CGCTTCCCGG CCCTGCTGCA GGGAGCTCAA AATGGAGGAC GCGGCGCTCG 4021 GGAGAGCGGG CGGGTGAGTC ACCCACACAA AGGAAAAGGG CCTTTCCGTC CTCAGCCGTC 4081 GCTTCATGTG ACTCCACGGA GTACCGGGCG CCGTCCAGGC ACCTCGATTA GTTCTCGAGC 4141 TTTTGGAGTA CGTCGTCTTT AGGTTGGGGG GAGGGGTTTT ATGCGATGGA GTTTCCCCAC 4201 ACTGAGTGGG TGGAGACTGA AGTTAGGCCA GCTTGGCACT TGATGTAATT CTCCTTGGAA 4261 TTTGCCCTTT TTGAGTTTGG ATCTTGGTTC ATTCTCAAGC CTCAGACAGT GGTTCAAAGT 4321 TTTTTTCTTC CATTTCAGGT GTCGTGAGAA TTCTGTAGAG ATCCCTCGAC CTCGAGATCC 4381 ATTGTGCTGG ATCCACCGGT CGCCACCATG GTGAGCAAGG GCGAGGAGCT GTTCACCGGG 4441 GTGGTGCCCA TCCTGGTCGA GCTGGACGGC GACGTAAACG GCCACAAGTT CAGCGTGTCC 4501 GGCGAGGGCG AGGGCGATGC CACCTACGGC AAGCTGACCC TGAAGTTCAT CTGCACCACC 4561 GGCAAGCTGC CCGTGCCCTG GCCCACCCTC GTGACCACCC TGACCTACGG CGTGCAGTGC 4621 TTCAGCCGCT ACCCCGACCA CATGAAGCAG CACGACTTCT TCAAGTCCGC CATGCCCGAA 4681 GGCTACGTCC AGGAGCGCAC CATCTTCTTC AAGGACGACG GCAACTACAA GACCCGCGCC 4741 GAGGTGAAGT TCGAGGGCGA CACCCTGGTG AACCGCATCG AGCTGAAGGG CATCGACTTC 4801 AAGGAGGACG GCAACATCCT GGGGCACAAG CTGGAGTACA ACTACAACAG CCACAACGTC 4861 TATATCATGG CCGACAAGCA GAAGAACGGC ATCAAGGTGA ACTTCAAGAT CCGCCACAAC 4921 ATCGAGGACG GCAGCGTGCA GCTCGCCGAC CACTACCAGC AGAACACCCC CATCGGCGAC 4981 GGCCCCGTGC TGCTGCCCGA CAACCACTAC CTGAGCACCC AGTCCGCCCT GAGCAAAGAC 5041 CCCAACGAGA AGCGCGATCA CATGGTCCTG CTGGAGTTCG TGACCGCCGC CGGGATCACT 5101 CTCGGCATGG ACGAGCTGTA CAAGTAAAGC GGCCCTAGAG CTCGCTGATC AGCCTCGACT 5161 GTGCCTCTAG TTGCCAGCCA TCTGTTGTTT GCCCCTCCCC CGTGCCTTCC TTGACCCTGG 5221 AAGGTGCCAC TCCCACTGTC CTTTCCTAAT AAAATGAGGA AATTGCATCG CATTGTCTGA 5281 GTAGGTGTCA TTCTATTCTG GGGGGTGGGG TGGGGCAGGA CAGCAAGGGG GAGGATTGGG 5341 AAGACAATAG CAGGCATGCT GGGGATGCGG TGGGCTCTAT GGCTTCTGAG GCGGAAAGAA 5401 CCAGCTGGGG CTCGAGTGCA TTCTAGTTGT GGTTTGTCCA AACTCATCAA TGTATCTTAT 5461 CATGTCTGTA TACCGTCGAC CTCTAGCTAG AGCTTGGCGT AATCATGGTC ATAGCTGTTT 5521 CCTGTGTGAA ATTGTTATCC GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG 5581 TGTAAAGCCT GGGGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG 5641 CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG 5701 GGGAGAGGCG GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC 5761 TCGGTCGTTC GGCTGCGGCG AGCGGTATCA GCTCACTCAA AGGCGGTAAT ACGGTTATCC 5821 ACAGAATCAG GGGATAACGC AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG 5881 AACCGTAAAA AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT 5941 CACAAAAATC GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG 6001 GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA 6061 TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG 6121 TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT 6181 CAGCCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC 6241 GACTTATCGC CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC 6301 GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT 6361 GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC 6421 GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC 6481 AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG 6541 AACGAAAACT CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG 6601 ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG 6661 TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT 6721 TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA 6781 TCTGGCCCCA GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA 6841 GCAATAAACC AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC 6901 TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT 6961 TTGCGCAACG TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG 7021 GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC 7081 AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG 7141 TTATCACTCA TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA 7201 TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA 7261 CCGAGTTGCT CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA 7321 AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG 7381 TTGAGATCCA GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT 7441 TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA 7501 AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT 7561 TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA 7621 ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG ACGTC //