LOCUS       PH2P.GB MAY  4932 BP DS-DNA   CIRCULAR  SYN       17-MAY-2003
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     misc_RNA        complement(>4..12)
                     /note="SV40 minor late 19S RNA"
     promoter        <4..>348
                     /note="SV40 promoter (∆SELP)"
     enhancer        26..168
                     /note="SV40 enhancer elements"
     misc_feature    173..236
                     /note="SV40 21bp repeats"
     rep_origin      193..>328
                     /note="SV40 Ori"
     mRNA            242..248
                     /note="early-late transcription startpoints"
     mutation        262..262
                     /note="G->A kills SELP ATG"
     mRNA            282..288
                     /note="early-early transcription start points"
     CDS             373..1092
                     /note="EGFP"
     polyA_signal    1108..>1329
                     /note="SV40 Late PolyA"
     promoter        <1420..1657
                     /note="hEF1alpha promoter core"
     5'UTR           1665..1952
                     /note="HTLV R-U5 leader"
     insertion_seq   1973..>1997
                     /note="attB1"
     CDS             2010..>2969
                     /note="MPyV VP2 (codmod)"
     CDS             2268..2270
                     /note="next ATG back"
     misc_recomb     2314..2319
                     /note="EcoRV for VP3 cloning"
     CDS             2355..2357
                     /note="VP3 start"
     misc_recomb     <2977..>3001
                     /note="attB2"
     polyA_signal    <3021..3683
                     /note="hEF1a polyA signal"
     rep_origin      3684..4417
                     /note="MB1 Ori"
     promoter        4418..4504
                     /note="EM7 promoter"
     CDS             4505..4879
                     /note="ShBle (ZeoR)"
     terminator      4880..4932
                     /note="terminator (rpmB/G)"
BASE COUNT     1200 A   1318 C   1328 G   1086 T      0 OTHER
ORIGIN      -
        1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA
       61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC
      121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC
      181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC
      241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG
      301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA
      361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG
      421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC
      481 GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG
      541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC
      601 GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG
      661 CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG
      721 GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC
      781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC
      841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC
      901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG
      961 CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC
     1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG
     1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC
     1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG
     1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT
     1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA
     1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC
     1381 GCCCTTCCCA ACAGTTGCGG TGGAGAAGAG CATGCGTGAG GCTCCGGTGC CCGTCAGTGG
     1441 GCAGAGCGCA CATCGCCCAC AGTCCCCGAG AAGTTGGGGG GAGGGGTCGG CAATTGAACC
     1501 GGTGCCTAGA GAAGGTGGCG CGGGGTAAAC TGGGAAAGTG ATGTCGTGTA CTGGCTCCGC
     1561 CTTTTTCCCG AGGGTGGGGG AGAACCGTAT ATAAGTGCAG TAGTCGCCGT GAACGTTCTT
     1621 TTTCGCAACG GGTTTGCCGC CAGAACACAG CTGAAGCTTC GAGGGGCTCG CATCTCTCCT
     1681 TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC GGTTGAGTCG CGTTCTGCCG
     1741 CCTCCCGCCT GTGGTGCCTC CTGAACTGCG TCCGCCGTCT AGGTAAGTTT AAAGCTCAGG
     1801 TCGAGACCGG GCCTTTGTCC GGCGCTCCCT TGGAGCCTAC CTAGACTCAG CCGGCTCTCC
     1861 ACGCTTTGCC TGACCCTGCT TGCTCAACTC TACGTCTTTG TTTCGTTTTC TGTTCTGCGC
     1921 CGTTACAGAT CCAAGCTGTG ACCGGCGCCT ACGTAAGTGA TAGCTTGATC AAACAAGTTT
     1981 GTACAAAAAA GCAGGCTGCG GCCGCAATCA TGGGAGCAGC CCTCACCATC CTGGTGGATC
     2041 TGATTGAAGG ACTGGCTGAG GTCAGCACAT TGACTGGACT GAGCGCCGAG GCCATCCTGA
     2101 GCGGCGAGGC ACTGGCCGCA CTGGACGGGG AGATCACCGC CCTCACCCTC GAAGGCGTGA
     2161 TGTCCAGCGA AACCGCATTG GCCACCATGG GCATCAGCGA AGAAGTCTAC GGCTTCGTGT
     2221 CCACCGTCCC CGTCTTCGTG AACAGGACCG CCGGCGCCAT CTGGCTCATG CAAACAGTCC
     2281 AGGGCGCAAG CACCATCAGT TTGGGAATCC AGAGATATCT GCATAACGAG GAAGTGCCCA
     2341 CCGTGAACCG CAACATGGCC CTCATCCCAT GGCGCGACCC CGCACTGCTG GACATCTACT
     2401 TTCCGGGCGT CAACCAATTC GCCCACGCCT TGAACGTGGT GCACGACTGG GGGCACGGCC
     2461 TGCTGCACTC CGTCGGACGC TACGTCTGGC AGATGGTCGT CCAAGAGACC CAGCATCGCC
     2521 TCGAGGGCGC CGTCCGCGAG TTGACCGTCC GCCAAACCCA CACATTTCTG GACGGGCTGG
     2581 CCCGCCTGCT GGAGAATACA AGATGGGTCG TCAGCAACGC CCCACAAAGC GCAATCGACG
     2641 CCATCAACCG CGGCGCAAGC AGCGTCTCAA GCGGATACTC GAGCTTGTCT GATTACTACC
     2701 GCCAGCTGGG ACTGAACCCA CCCCAACGGA GAGCACTGTT CAACAGGATC GAGGGATCAA
     2761 TGGGCAACGG AGGCCCAACA CCCGCCGCCC ACATCCAAGA CGAAAGCGGC GAAGTCATCA
     2821 AGTTCTACCA AGCACCCGGC GGCGCACATC AGCGCGTGAC CCCCGATTGG ATGTTGCCCC
     2881 TGATCCTGGG CCTCTATGGC GACATTACCC CCACCTGGGC CACCGTGATC GAAGAGGACG
     2941 GGCCACAGAA GAAGAAACGC AGGTTGTGAT GCTAGCACCC AGCTTTCTTG TACAAAGTGG
     3001 TTCGATCTAG AATGGCTAGC ATTATCCCTA ATACCTGCCA CCCCACTCTT AATCAGTGGT
     3061 GGAAGAACGG TCTCAGAACT GTTTGTTTCA ATTGGCCATT TAAGTTTAGT AGTAAAAGAC
     3121 TGGTTAATGA TAACAATGCA TCGTAAAACC TTCAGAAGGA AAGGAGAATG TTTTGTGGAC
     3181 CACTTTGGTT TTCTTTTTTG CGTGTGGCAG TTTTAAGTTA TTAGTTTTTA AAATCAGTAC
     3241 TTTTTAATGG AAACAACTTG ACCAAAAATT TGTCACAGAA TTTTGAGACC CATTAAAAAA
     3301 GTTAAATGAG AAACCTGTGT GTTCCTTTGG TCAACACCGA GACATTTAGG TGAAAGACAT
     3361 CTAATTCTGG TTTTACGAAT CTGGAAACTT CTTGAAAATG TAATTCTTGA GTTAACACTT
     3421 CTGGGTGGAG AATAGGGTTG TTTTCCCCCC ACATAATTGG AAGGGGAAGG AATATCATTT
     3481 AAAGCTATGG GAGGGTTTCT TTGATTACAA CACTGGAGAG AAATGCAGCA TGTTGCTGAT
     3541 TGCCTGTCAC TAAAACAGGC CAAAAACTGA GTCCTTGGGT TGCATAGAAA GCTTCATGTT
     3601 GCTAAACCAA TGTTAAGTGA ATCTTTGGAA ACAAAATGTT TCCAAATTAC TGGGATGTGC
     3661 ATGTTGAAAC GTGGGTTAAT TAACTAGCCA TGACCAAAAT CCCTTAACGT GAGTTTTCGT
     3721 TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT CCTTTTTTTC
     3781 TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC
     3841 CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC
     3901 CAAATACTGT TCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC TCTGTAGCAC
     3961 CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT GGCGATAAGT
     4021 CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG CGGTCGGGCT
     4081 GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT
     4141 ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT
     4201 ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA GGGGGAAACG
     4261 CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT
     4321 GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC TTTTTACGGT
     4381 TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTAAT TAAATTTTTC AAAAGTAGTT
     4441 GACAATTAAT CATCGGCATA GTATATCGGC ATAGTATAAT ACGACTCACT ATAGGAGGGC
     4501 CATCATGGCC AAGTTGACCA GTGCTGTCCC AGTGCTCACA GCCAGGGATG TGGCTGGAGC
     4561 TGTTGAGTTC TGGACTGACA GGTTGGGGTT CTCCAGAGAT TTTGTGGAGG ATGACTTTGC
     4621 AGGTGTGGTC AGAGATGATG TCACCCTGTT CATCTCAGCA GTCCAGGACC AGGTGGTGCC
     4681 TGACAACACC CTGGCTTGGG TGTGGGTGAG AGGACTGGAT GAGCTGTATG CTGAGTGGAG
     4741 TGAGGTGGTC TCCACCAACT TCAGGGATGC CAGTGGCCCT GCCATGACAG AGATTGGAGA
     4801 GCAGCCCTGG GGGAGAGAGT TTGCCCTGAG AGACCCAGCA GGCAACTGTG TGCACTTTGT
     4861 GGCAGAGGAG CAGGACTGAG GATAAGAATT GTAACAAAAA ACCCCGCCCC GGCGGGGTTT
     4921 TTTGTTAATT AA
//