LOCUS PH2M.GB MAY 4700 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_RNA complement(>4..12) /note="SV40 minor late 19S RNA" promoter <4..>348 /note="SV40 promoter (∆SELP)" enhancer 26..168 /note="SV40 enhancer elements" misc_feature 173..236 /note="SV40 21bp repeats" rep_origin 193..>328 /note="SV40 Ori" mRNA 242..248 /note="early-late transcription startpoints" mutation 262..262 /note="G->A kills SELP ATG" mRNA 282..288 /note="early-early transcription start points" CDS 373..1092 /note="EGFP" polyA_signal 1108..>1329 /note="SV40 Late PolyA" promoter <1420..1657 /note="hEF1alpha promoter core" 5'UTR 1665..1952 /note="HTLV R-U5 leader" insertion_seq 1973..>1997 /note="attB1" CDS 2011..2736 /note="Merkel VP2" CDS 2146..2148 /note="unlikely VP3 start" CDS 2395..2397 /note="unlikely VP3 start" misc_recomb 2745..>2769 /note="attB2" polyA_signal <2789..3451 /note="hEF1a polyA signal" rep_origin 3452..4185 /note="MB1 Ori" promoter 4186..4272 /note="EM7 promoter" CDS 4273..4647 /note="ShBle (ZeoR)" terminator 4648..4700 /note="terminator (rpmB/G)" BASE COUNT 1147 A 1242 C 1257 G 1054 T 0 OTHER ORIGIN - 1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA 61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC 121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC 181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC 241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG 301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA 361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG 421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC 481 GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG 541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC 601 GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG 661 CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG 721 GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC 781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC 841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC 901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG 961 CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC 1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG 1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC 1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG 1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT 1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA 1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC 1381 GCCCTTCCCA ACAGTTGCGG TGGAGAAGAG CATGCGTGAG GCTCCGGTGC CCGTCAGTGG 1441 GCAGAGCGCA CATCGCCCAC AGTCCCCGAG AAGTTGGGGG GAGGGGTCGG CAATTGAACC 1501 GGTGCCTAGA GAAGGTGGCG CGGGGTAAAC TGGGAAAGTG ATGTCGTGTA CTGGCTCCGC 1561 CTTTTTCCCG AGGGTGGGGG AGAACCGTAT ATAAGTGCAG TAGTCGCCGT GAACGTTCTT 1621 TTTCGCAACG GGTTTGCCGC CAGAACACAG CTGAAGCTTC GAGGGGCTCG CATCTCTCCT 1681 TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC GGTTGAGTCG CGTTCTGCCG 1741 CCTCCCGCCT GTGGTGCCTC CTGAACTGCG TCCGCCGTCT AGGTAAGTTT AAAGCTCAGG 1801 TCGAGACCGG GCCTTTGTCC GGCGCTCCCT TGGAGCCTAC CTAGACTCAG CCGGCTCTCC 1861 ACGCTTTGCC TGACCCTGCT TGCTCAACTC TACGTCTTTG TTTCGTTTTC TGTTCTGCGC 1921 CGTTACAGAT CCAAGCTGTG ACCGGCGCCT ACGTAAGTGA TAGCTTGATC AAACAAGTTT 1981 GTACAAAAAA GCAGGCTGCG GCCGCAACAA ATGGGCGGGA TCATTACCCT GCTCGCAAAC 2041 ATCGGCGAGA TCGCCACCGA GCTGTCCGCA ACAACCGGCG TGACACTGGA AGCCATCTTG 2101 ACCGGCGAGG CCCTGGCCGC ACTCGAGGCC GAGATTAGTA GCCTGATGAC CATCGAAGGC 2161 ATCAGCGGGA TCGAAGCCCT GGCACAGCTG GGCTTTACCG CCGAGCAATT CAGCAACTTC 2221 AGCCTGGTCG CCAGCCTCGT CAATCAGGGC CTGACCTACG GGTTCATCTT GCAAACCGTG 2281 TCCGGCATCG GGAGCCTGAT CACCGTCGGC GTCCGCCTGA GCAGGGAACA GGTCAGCCTG 2341 GTCAACCGCG ACGTCAGCTG GGTCGGCTCC AACGAAGTCC TGCGCCACGC CTTGATGGCA 2401 TTCTCACTGG ACCCGCTCCA ATGGGAGAAC AGCCTGCTGC ACAGCGTCGG CCAGGACATC 2461 TTCAACAGCC TGAGCCCCAC AAGCCGCCTG CAAATCCAGA GCAACTTGGT CAACCTCATC 2521 CTGAACTCAA GATGGGTGTT CCAAACCACC GCCAGCCAGA ACCAGGGGCT GCTGAGCGGC 2581 GAAGCCATCC TGATCCCCGA ACACATCGGC GGCACCCTGC AACAGCAGAC ACCCGACTGG 2641 CTGCTGCCAC TGGTGCTGGG GTTGTCCGGC TACATCAGCC CCGAGCTGCA AGTCATCGAG 2701 GACGGGACAA AGAAGAAGTC AATCATTCAT CTCTGAGTGC TAGCACCCAG CTTTCTTGTA 2761 CAAAGTGGTT CGATCTAGAA TGGCTAGCAT TATCCCTAAT ACCTGCCACC CCACTCTTAA 2821 TCAGTGGTGG AAGAACGGTC TCAGAACTGT TTGTTTCAAT TGGCCATTTA AGTTTAGTAG 2881 TAAAAGACTG GTTAATGATA ACAATGCATC GTAAAACCTT CAGAAGGAAA GGAGAATGTT 2941 TTGTGGACCA CTTTGGTTTT CTTTTTTGCG TGTGGCAGTT TTAAGTTATT AGTTTTTAAA 3001 ATCAGTACTT TTTAATGGAA ACAACTTGAC CAAAAATTTG TCACAGAATT TTGAGACCCA 3061 TTAAAAAAGT TAAATGAGAA ACCTGTGTGT TCCTTTGGTC AACACCGAGA CATTTAGGTG 3121 AAAGACATCT AATTCTGGTT TTACGAATCT GGAAACTTCT TGAAAATGTA ATTCTTGAGT 3181 TAACACTTCT GGGTGGAGAA TAGGGTTGTT TTCCCCCCAC ATAATTGGAA GGGGAAGGAA 3241 TATCATTTAA AGCTATGGGA GGGTTTCTTT GATTACAACA CTGGAGAGAA ATGCAGCATG 3301 TTGCTGATTG CCTGTCACTA AAACAGGCCA AAAACTGAGT CCTTGGGTTG CATAGAAAGC 3361 TTCATGTTGC TAAACCAATG TTAAGTGAAT CTTTGGAAAC AAAATGTTTC CAAATTACTG 3421 GGATGTGCAT GTTGAAACGT GGGTTAATTA ACTAGCCATG ACCAAAATCC CTTAACGTGA 3481 GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC 3541 TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 3601 TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC 3661 GCAGATACCA AATACTGTTC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC 3721 TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG 3781 CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG 3841 GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA 3901 ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC 3961 GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG 4021 GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG 4081 ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT 4141 TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTAATTA AATTTTTCAA 4201 AAGTAGTTGA CAATTAATCA TCGGCATAGT ATATCGGCAT AGTATAATAC GACTCACTAT 4261 AGGAGGGCCA TCATGGCCAA GTTGACCAGT GCTGTCCCAG TGCTCACAGC CAGGGATGTG 4321 GCTGGAGCTG TTGAGTTCTG GACTGACAGG TTGGGGTTCT CCAGAGATTT TGTGGAGGAT 4381 GACTTTGCAG GTGTGGTCAG AGATGATGTC ACCCTGTTCA TCTCAGCAGT CCAGGACCAG 4441 GTGGTGCCTG ACAACACCCT GGCTTGGGTG TGGGTGAGAG GACTGGATGA GCTGTATGCT 4501 GAGTGGAGTG AGGTGGTCTC CACCAACTTC AGGGATGCCA GTGGCCCTGC CATGACAGAG 4561 ATTGGAGAGC AGCCCTGGGG GAGAGAGTTT GCCCTGAGAG ACCCAGCAGG CAACTGTGTG 4621 CACTTTGTGG CAGAGGAGCA GGACTGAGGA TAAGAATTGT AACAAAAAAC CCCGCCCCGG 4681 CGGGGTTTTT TGTTAATTAA //