LOCUS PH3P.GB MAY 4568 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_RNA complement(>4..12) /note="SV40 minor late 19S RNA" promoter <4..>348 /note="SV40 promoter (∆SELP)" enhancer 26..168 /note="SV40 enhancer elements" misc_feature 173..236 /note="SV40 21bp repeats" rep_origin 193..>328 /note="SV40 Ori" mRNA 242..248 /note="early-late transcription startpoints" mutation 262..262 /note="G->A kills SELP ATG" mRNA 282..288 /note="early-early transcription start points" CDS 373..1092 /note="EGFP" polyA_signal 1108..>1329 /note="SV40 Late PolyA" promoter <1420..1657 /note="hEF1alpha promoter core" 5'UTR 1665..1952 /note="HTLV R-U5 leader" 5'UTR <1953..>1990 /note="VP2 code" CDS 1991..2605 /note="VP3 start" misc_recomb 2613..>2637 /note="attB2" polyA_signal <2657..3319 /note="hEF1a polyA signal" rep_origin 3320..4053 /note="MB1 Ori" promoter 4054..4140 /note="EM7 promoter" CDS 4141..4515 /note="ShBle (ZeoR)" terminator 4516..4568 /note="terminator (rpmB/G)" BASE COUNT 1113 A 1212 C 1221 G 1022 T 0 OTHER ORIGIN - 1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA 61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC 121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC 181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC 241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG 301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA 361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG 421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC 481 GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG 541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC 601 GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG 661 CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG 721 GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC 781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC 841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC 901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG 961 CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC 1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG 1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC 1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG 1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT 1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA 1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC 1381 GCCCTTCCCA ACAGTTGCGG TGGAGAAGAG CATGCGTGAG GCTCCGGTGC CCGTCAGTGG 1441 GCAGAGCGCA CATCGCCCAC AGTCCCCGAG AAGTTGGGGG GAGGGGTCGG CAATTGAACC 1501 GGTGCCTAGA GAAGGTGGCG CGGGGTAAAC TGGGAAAGTG ATGTCGTGTA CTGGCTCCGC 1561 CTTTTTCCCG AGGGTGGGGG AGAACCGTAT ATAAGTGCAG TAGTCGCCGT GAACGTTCTT 1621 TTTCGCAACG GGTTTGCCGC CAGAACACAG CTGAAGCTTC GAGGGGCTCG CATCTCTCCT 1681 TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC GGTTGAGTCG CGTTCTGCCG 1741 CCTCCCGCCT GTGGTGCCTC CTGAACTGCG TCCGCCGTCT AGGTAAGTTT AAAGCTCAGG 1801 TCGAGACCGG GCCTTTGTCC GGCGCTCCCT TGGAGCCTAC CTAGACTCAG CCGGCTCTCC 1861 ACGCTTTGCC TGACCCTGCT TGCTCAACTC TACGTCTTTG TTTCGTTTTC TGTTCTGCGC 1921 CGTTACAGAT CCAAGCTGTG ACCGGCGCCT ACATCTGCAT AACGAGGAAG TGCCCACCGT 1981 GAACCGCAAC ATGGCCCTCA TCCCATGGCG CGACCCCGCA CTGCTGGACA TCTACTTTCC 2041 GGGCGTCAAC CAATTCGCCC ACGCCTTGAA CGTGGTGCAC GACTGGGGGC ACGGCCTGCT 2101 GCACTCCGTC GGACGCTACG TCTGGCAGAT GGTCGTCCAA GAGACCCAGC ATCGCCTCGA 2161 GGGCGCCGTC CGCGAGTTGA CCGTCCGCCA AACCCACACA TTTCTGGACG GGCTGGCCCG 2221 CCTGCTGGAG AATACAAGAT GGGTCGTCAG CAACGCCCCA CAAAGCGCAA TCGACGCCAT 2281 CAACCGCGGC GCAAGCAGCG TCTCAAGCGG ATACTCGAGC TTGTCTGATT ACTACCGCCA 2341 GCTGGGACTG AACCCACCCC AACGGAGAGC ACTGTTCAAC AGGATCGAGG GATCAATGGG 2401 CAACGGAGGC CCAACACCCG CCGCCCACAT CCAAGACGAA AGCGGCGAAG TCATCAAGTT 2461 CTACCAAGCA CCCGGCGGCG CACATCAGCG CGTGACCCCC GATTGGATGT TGCCCCTGAT 2521 CCTGGGCCTC TATGGCGACA TTACCCCCAC CTGGGCCACC GTGATCGAAG AGGACGGGCC 2581 ACAGAAGAAG AAACGCAGGT TGTGATGCTA GCACCCAGCT TTCTTGTACA AAGTGGTTCG 2641 ATCTAGAATG GCTAGCATTA TCCCTAATAC CTGCCACCCC ACTCTTAATC AGTGGTGGAA 2701 GAACGGTCTC AGAACTGTTT GTTTCAATTG GCCATTTAAG TTTAGTAGTA AAAGACTGGT 2761 TAATGATAAC AATGCATCGT AAAACCTTCA GAAGGAAAGG AGAATGTTTT GTGGACCACT 2821 TTGGTTTTCT TTTTTGCGTG TGGCAGTTTT AAGTTATTAG TTTTTAAAAT CAGTACTTTT 2881 TAATGGAAAC AACTTGACCA AAAATTTGTC ACAGAATTTT GAGACCCATT AAAAAAGTTA 2941 AATGAGAAAC CTGTGTGTTC CTTTGGTCAA CACCGAGACA TTTAGGTGAA AGACATCTAA 3001 TTCTGGTTTT ACGAATCTGG AAACTTCTTG AAAATGTAAT TCTTGAGTTA ACACTTCTGG 3061 GTGGAGAATA GGGTTGTTTT CCCCCCACAT AATTGGAAGG GGAAGGAATA TCATTTAAAG 3121 CTATGGGAGG GTTTCTTTGA TTACAACACT GGAGAGAAAT GCAGCATGTT GCTGATTGCC 3181 TGTCACTAAA ACAGGCCAAA AACTGAGTCC TTGGGTTGCA TAGAAAGCTT CATGTTGCTA 3241 AACCAATGTT AAGTGAATCT TTGGAAACAA AATGTTTCCA AATTACTGGG ATGTGCATGT 3301 TGAAACGTGG GTTAATTAAC TAGCCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA 3361 CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG 3421 CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA 3481 TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA 3541 TACTGTTCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC 3601 TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG 3661 TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC 3721 GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT 3781 ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC 3841 GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG 3901 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG 3961 CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT 4021 GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTAATTAAA TTTTTCAAAA GTAGTTGACA 4081 ATTAATCATC GGCATAGTAT ATCGGCATAG TATAATACGA CTCACTATAG GAGGGCCATC 4141 ATGGCCAAGT TGACCAGTGC TGTCCCAGTG CTCACAGCCA GGGATGTGGC TGGAGCTGTT 4201 GAGTTCTGGA CTGACAGGTT GGGGTTCTCC AGAGATTTTG TGGAGGATGA CTTTGCAGGT 4261 GTGGTCAGAG ATGATGTCAC CCTGTTCATC TCAGCAGTCC AGGACCAGGT GGTGCCTGAC 4321 AACACCCTGG CTTGGGTGTG GGTGAGAGGA CTGGATGAGC TGTATGCTGA GTGGAGTGAG 4381 GTGGTCTCCA CCAACTTCAG GGATGCCAGT GGCCCTGCCA TGACAGAGAT TGGAGAGCAG 4441 CCCTGGGGGA GAGAGTTTGC CCTGAGAGAC CCAGCAGGCA ACTGTGTGCA CTTTGTGGCA 4501 GAGGAGCAGG ACTGAGGATA AGAATTGTAA CAAAAAACCC CGCCCCGGCG GGGTTTTTTG 4561 TTAATTAA //