LOCUS       ph2j         5027 bp ds-DNA   circular  SYN       17-Jun-2003
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     misc_RNA        complement(<4..12)
                     /note="SV40 minor late 19S RNA"
     promoter        <4..>348
                     /note="SV40 promoter"
     enhancer        26..168
                     /note="SV40 enhancer elements"
     misc_feature    173..236
                     /note="SV40 21bp repeats"
     rep_origin      193..>328
                     /note="SV40 Ori"
     mRNA            242..248
                     /note="early-late transcription startpoints"
     mutation        262
                     /note="G->A kills SELP ATG"
     mRNA            282..288
                     /note="early-early transcription start points"
     CDS             373..1092
                     /note="EGFP"
     polyA_signal    1108..>1329
                     /note="SV40 Late PolyA"
     promoter        <1420..1657
                     /note="hEF1alpha promoter core"
     5'UTR           1665..1952
                     /note="HTLV R-U5 leader"
     insertion_seq   1973..1997
                     /note="attB1"
     misc_recomb     2003..2008
                     /note="Kpn1 for making VP3 construct"
     CDS             2009..2029
                     /note="Tev recognition (ENLYFQ//G)(untranslated)"
     CDS             2030..3064
                     /note="JCV VP2 (consensus)"
     misc_recomb     2322..2327
                     /note="Kpn1 for making VP3 construct"
     CDS             2387..3064
                     /note="VP3"
     misc_recomb     3066..3071
                     /note="Avr2"
     misc_recomb     3072..3096
                     /note="attB2"
     polyA_signal    <3116..3778
                     /note="hEF1a polyA signal"
     rep_origin      3779..4512
                     /note="MB1 Ori"
     promoter        4513..4599
                     /note="EM7 promoter"
     CDS             4600..4974
                     /note="ShBle (ZeoR)"
     terminator      4975..5027
                     /note="terminator (rpmB/G)"
BASE COUNT     1228 A   1361 C   1332 G   1104 T      0 OTHER
ORIGIN      ?
        1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA
       61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC
      121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC
      181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC
      241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG
      301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA
      361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG
      421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC
      481 GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG
      541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC
      601 GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG
      661 CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG
      721 GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC
      781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC
      841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC
      901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG
      961 CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC
     1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG
     1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC
     1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG
     1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT
     1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA
     1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC
     1381 GCCCTTCCCA ACAGTTGCGG TGGAGAAGAG CATGCGTGAG GCTCCGGTGC CCGTCAGTGG
     1441 GCAGAGCGCA CATCGCCCAC AGTCCCCGAG AAGTTGGGGG GAGGGGTCGG CAATTGAACC
     1501 GGTGCCTAGA GAAGGTGGCG CGGGGTAAAC TGGGAAAGTG ATGTCGTGTA CTGGCTCCGC
     1561 CTTTTTCCCG AGGGTGGGGG AGAACCGTAT ATAAGTGCAG TAGTCGCCGT GAACGTTCTT
     1621 TTTCGCAACG GGTTTGCCGC CAGAACACAG CTGAAGCTTC GAGGGGCTCG CATCTCTCCT
     1681 TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC GGTTGAGTCG CGTTCTGCCG
     1741 CCTCCCGCCT GTGGTGCCTC CTGAACTGCG TCCGCCGTCT AGGTAAGTTT AAAGCTCAGG
     1801 TCGAGACCGG GCCTTTGTCC GGCGCTCCCT TGGAGCCTAC CTAGACTCAG CCGGCTCTCC
     1861 ACGCTTTGCC TGACCCTGCT TGCTCAACTC TACGTCTTTG TTTCGTTTTC TGTTCTGCGC
     1921 CGTTACAGAT CCAAGCTGTG ACCGGCGCCT ACGTAAGTGA TAGCTTGATC AAACAAGTTT
     1981 GTACAAAAAA GCAGGCTCCG GAGGTACCGA AAACCTGTAT TTTCAGGGCA TGGGCGCAGC
     2041 TCTGGCCTTG CTCGGCGATC TGGTCGCCAC CGTCAGCGAA GCCGCAGCCG CAACCGGATT
     2101 CAGCGTGGCC GAGATCGCAG CCGGAGAAGC TGCAGCCACA ATCGAGGTCG AGATCGCCAG
     2161 TTTGGCCACT GTGGAAGGCA TCACCTCCAC AAGCGAAGCC ATCGCTGCCA TCGGACTGAC
     2221 ACCCGAGACC TACGCCGTGA TCACTGGCGC ACCCGGAGCC GTGGCAGGAT TCGCCGCACT
     2281 CGTCCAAACC GTGACCGGCG GCTCCGCCAT CGCCCAACTC GGGTACCGGT TCTTCGCCGA
     2341 TTGGGACCAC AAGGTCAGCA CCGTCGGCTT GTTCCAACAA CCCGCAATGG CACTGCAGCT
     2401 GTTCAACCCC GAGGACTACT ACGACATCCT GTTCCCGGGC GTCAACGCAT TCGTCAACAA
     2461 CATCCATTAC CTGGACCCAC GCCACTGGGG GCCCAGCCTG TTTTCGACCA TTAGTCAAGC
     2521 ATTCTGGAAC TTGGTCCGCG ACGACCTGCC AAGCCTGACC AGCCAAGAGA TCCAGCGCCG
     2581 CACACAGAAG TTGTTCGTCG AGTCCCTGGC CCGCTTCCTG GAAGAGACAA CCTGGGCCAT
     2641 CGTCAACAGC CCCGTCAATC TGTACAACTA CATCAGCGAT TACTACTCGC GACTCAGCCC
     2701 CGTCCGCCCA AGCATGGTCC GCCAAGTCGC ACAGCGCGAA GGCACATACA TCAGTTTCGG
     2761 GCATAGCTAT ACACAGTCCA TCGACGACGC CGATTCAATC CAGGAGGTCA CACAGCGCTT
     2821 GGACCTGAAG ACACCCAACG TCCAGAGCGG CGAGTTCATC GAAAAGTCCA TCGCCCCTGG
     2881 CGGAGCAAAC CAGCGCAGCG CACCACAATG GATGCTGCCA CTGTTGCTGG GCCTCTATGG
     2941 CACCGTGACC CCAGCACTGG AGGCCTACGA GGACGGACCA AACAAGAAAA AGCGCCGCAA
     3001 GGAAGGCCCA AGGGCATCCA GCAAGACCAG CTACAAACGT CGGTCCAGGT CCAGCCGCTC
     3061 CTGAGCCTAG GACCCAGCTT TCTTGTACAA AGTGGTTCGA TCTAGAATGG CTAGCATTAT
     3121 CCCTAATACC TGCCACCCCA CTCTTAATCA GTGGTGGAAG AACGGTCTCA GAACTGTTTG
     3181 TTTCAATTGG CCATTTAAGT TTAGTAGTAA AAGACTGGTT AATGATAACA ATGCATCGTA
     3241 AAACCTTCAG AAGGAAAGGA GAATGTTTTG TGGACCACTT TGGTTTTCTT TTTTGCGTGT
     3301 GGCAGTTTTA AGTTATTAGT TTTTAAAATC AGTACTTTTT AATGGAAACA ACTTGACCAA
     3361 AAATTTGTCA CAGAATTTTG AGACCCATTA AAAAAGTTAA ATGAGAAACC TGTGTGTTCC
     3421 TTTGGTCAAC ACCGAGACAT TTAGGTGAAA GACATCTAAT TCTGGTTTTA CGAATCTGGA
     3481 AACTTCTTGA AAATGTAATT CTTGAGTTAA CACTTCTGGG TGGAGAATAG GGTTGTTTTC
     3541 CCCCCACATA ATTGGAAGGG GAAGGAATAT CATTTAAAGC TATGGGAGGG TTTCTTTGAT
     3601 TACAACACTG GAGAGAAATG CAGCATGTTG CTGATTGCCT GTCACTAAAA CAGGCCAAAA
     3661 ACTGAGTCCT TGGGTTGCAT AGAAAGCTTC ATGTTGCTAA ACCAATGTTA AGTGAATCTT
     3721 TGGAAACAAA ATGTTTCCAA ATTACTGGGA TGTGCATGTT GAAACGTGGG TTAATTAACT
     3781 AGCCATGACC AAAATCCCTT AACGTGAGTT TTCGTTCCAC TGAGCGTCAG ACCCCGTAGA
     3841 AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC GTAATCTGCT GCTTGCAAAC
     3901 AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT CAAGAGCTAC CAACTCTTTT
     3961 TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT ACTGTTCTTC TAGTGTAGCC
     4021 GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT ACATACCTCG CTCTGCTAAT
     4081 CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT CTTACCGGGT TGGACTCAAG
     4141 ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG GGGGGTTCGT GCACACAGCC
     4201 CAGCTTGGAG CGAACGACCT ACACCGAACT GAGATACCTA CAGCGTGAGC TATGAGAAAG
     4261 CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG GTAAGCGGCA GGGTCGGAAC
     4321 AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG TATCTTTATA GTCCTGTCGG
     4381 GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC TCGTCAGGGG GGCGGAGCCT
     4441 ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG GCCTTTTGCT GGCCTTTTGC
     4501 TCACATGTTC TTAATTAAAT TTTTCAAAAG TAGTTGACAA TTAATCATCG GCATAGTATA
     4561 TCGGCATAGT ATAATACGAC TCACTATAGG AGGGCCATCA TGGCCAAGTT GACCAGTGCT
     4621 GTCCCAGTGC TCACAGCCAG GGATGTGGCT GGAGCTGTTG AGTTCTGGAC TGACAGGTTG
     4681 GGGTTCTCCA GAGATTTTGT GGAGGATGAC TTTGCAGGTG TGGTCAGAGA TGATGTCACC
     4741 CTGTTCATCT CAGCAGTCCA GGACCAGGTG GTGCCTGACA ACACCCTGGC TTGGGTGTGG
     4801 GTGAGAGGAC TGGATGAGCT GTATGCTGAG TGGAGTGAGG TGGTCTCCAC CAACTTCAGG
     4861 GATGCCAGTG GCCCTGCCAT GACAGAGATT GGAGAGCAGC CCTGGGGGAG AGAGTTTGCC
     4921 CTGAGAGACC CAGCAGGCAA CTGTGTGCAC TTTGTGGCAG AGGAGCAGGA CTGAGGATAA
     4981 GAATTGTAAC AAAAAACCCC GCCCCGGCGG GGTTTTTTGT TAATTAA
//