LOCUS       P18L1-GFP.   5444 BP DS-DNA   CIRCULAR  SYN       17-MAY-2003
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     misc_RNA        complement(>4..12)
                     /note="SV40 minor late 19S RNA"
     promoter        <4..>348
                     /note="SV40 promoter (∆SELP)"
     enhancer        26..168
                     /note="SV40 enhancer elements"
     misc_feature    173..236
                     /note="SV40 21bp repeats"
     rep_origin      193..>328
                     /note="SV40 Ori"
     mRNA            242..248
                     /note="early-late transcription startpoints"
     mutation        262..262
                     /note="G->A kills SELP ATG"
     mRNA            282..288
                     /note="early-early transcription start points"
     CDS             373..1092
                     /note="EGFP"
     polyA_signal    1108..>1329
                     /note="SV40 Late PolyA"
     promoter        <1421..1658
                     /note="hEF1alpha promoter core"
     5'UTR           1666..1953
                     /note="HTLV R-U5 leader"
     CDS             2003..3526
                     /note="HPV-18 L1"
     polyA_signal    <3533..4195
                     /note="hEF1a polyA signal"
     rep_origin      4196..4929
                     /note="MB1 Ori"
     promoter        4930..5016
                     /note="EM7 promoter"
     CDS             5017..5391
                     /note="ShBle (ZeoR)"
     terminator      5392..5444
                     /note="terminator (rpmB/G)"
BASE COUNT     1346 A   1516 C   1427 G   1155 T      0 OTHER
ORIGIN      -
        1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA
       61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC
      121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC
      181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC
      241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG
      301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA
      361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG
      421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC
      481 GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG
      541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC
      601 GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG
      661 CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG
      721 GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC
      781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC
      841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC
      901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG
      961 CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC
     1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG
     1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC
     1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG
     1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT
     1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA
     1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC
     1381 GCCCTTCCCA ACAGTTGCAG GTGGAGAAGA GCATGCGTGA GGCTCCGGTG CCCGTCAGTG
     1441 GGCAGAGCGC ACATCGCCCA CAGTCCCCGA GAAGTTGGGG GGAGGGGTCG GCAATTGAAC
     1501 CGGTGCCTAG AGAAGGTGGC GCGGGGTAAA CTGGGAAAGT GATGTCGTGT ACTGGCTCCG
     1561 CCTTTTTCCC GAGGGTGGGG GAGAACCGTA TATAAGTGCA GTAGTCGCCG TGAACGTTCT
     1621 TTTTCGCAAC GGGTTTGCCG CCAGAACACA GCTGAAGCTT CGAGGGGCTC GCATCTCTCC
     1681 TTCACGCGCC CGCCGCCCTA CCTGAGGCCG CCATCCACGC CGGTTGAGTC GCGTTCTGCC
     1741 GCCTCCCGCC TGTGGTGCCT CCTGAACTGC GTCCGCCGTC TAGGTAAGTT TAAAGCTCAG
     1801 GTCGAGACCG GGCCTTTGTC CGGCGCTCCC TTGGAGCCTA CCTAGACTCA GCCGGCTCTC
     1861 CACGCTTTGC CTGACCCTGC TTGCTCAACT CTACGTCTTT GTTTCGTTTT CTGTTCTGCG
     1921 CCGTTACAGA TCCAAGCTGT GACCGGCGCC TACGTAAGTG ATATCTACTA GATTTATCAA
     1981 AAAGAGTGTT GACTTGTGAG CCATGGCCCT CTGGAGACCA TCCGATAACA CAGTGTACTT
     2041 GCCCCCACCC AGCGTCGCCC GGGTGGTGAA CACAGACGAC TACGTCACCA GAACCTCAAT
     2101 CTTCTACCAC GCCGGGTCCA GCCGGCTGCT GACCGTGGGC AACCCCTACT TCCGCGTGCC
     2161 CGCCGGCGGC GGAAACAAAC AAGACATCCC CAAAGTCAGC GCCTATCAGT ACCGGGTGTT
     2221 CCGCGTCCAA CTGCCCGATC CCAACAAGTT CGGCCTGCCC GACACCTCCA TCTACAACCC
     2281 CGAGACCCAG AGGCTGGTCT GGGCTTGCGC CGGCGTCGAG ATCGGGAGGG GCCAACCCCT
     2341 GGGCGTGGGG TTGTCCGGCC ACCCCTTCTA CAACAAGCTG GACGATACCG AGTCCAGCCA
     2401 CGCAGCAACC AGCAACGTCT CCGAAGATGT GCGCGATAAC GTCAGCGTGG ACTACAAACA
     2461 AACCCAACTG TGCATCCTGG GATGCGCACC CGCCATCGGC GAGCATTGGG CCAAGGGGAC
     2521 CGCCTGCAAG AGCAGGCCCC TGAGCCAAGG GGACTGTCCA CCCCTGGAGT TGAAGAATAC
     2581 CGTGCTCGAG GACGGCGACA TGGTGGACAC CGGCTACGGC GCTATGGATT TCTCCACCCT
     2641 CCAGGACACC AAGTGCGAAG TGCCCCTCGA CATCTGCCAA AGCATCTGCA AGTACCCCGA
     2701 CTACCTCCAG ATGAGCGCCG ACCCCTACGG CGACAGCATG TTCTTCTGTC TCAGAAGGGA
     2761 ACAATTGTTC GCCCGCCACT TCTGGAACCG GGCCGGCACA ATGGGAGATA CAGTCCCCCA
     2821 GAGCCTGTAC ATCAAGGGGA CCGGAATGAG GGCCAGCCCC GGGTCCTGCG TCTACAGCCC
     2881 AAGCCCCTCC GGGAGCATCG TCACAAGCGA TAGCCAACTC TTCAACAAGC CCTACTGGCT
     2941 CCACAAAGCC CAAGGCCACA ATAACGGGGT GTGTTGGCAC AACCAGCTGT TCGTGACCGT
     3001 CGTGGACACA ACCAGGTCCA CAAACCTGAC CATCTGCGCC AGCACCCAAA GCCCCGTGCC
     3061 CGGCCAGTAC GACGCCACAA AGTTCAAACA ATACTCTCGG CACGTGGAAG AGTACGACCT
     3121 CCAATTCATC TTCCAACTCT GCACCATCAC CCTCACCGCC GACGTGATGA GCTACATCCA
     3181 CTCCATGAAC TCCTCCATCC TGGAAGACTG GAATTTCGGC GTGCCACCAC CCCCTACCAC
     3241 CTCCCTCGTC GACACCTACA GATTCGTGCA GAGCGTGGCC ATCACATGCC AGAAAGACGC
     3301 CGCCCCCGCC GAGAACAAAG ACCCATACGA CAAACTGAAA TTCTGGAACG TCGACCTGAA
     3361 AGAGAAATTC AGCCTGGATC TGGACCAGTA CCCATTGGGC AGGAAGTTCC TCGTGCAAGC
     3421 CGGCCTCAGG AGAAAACCAA CAATCGGGCC CAGGAAGAGG AGCGCCCCCA GCGCAACCAC
     3481 CAGCAGCAAG CCCGCAAAAA GGGTCAGAGT GAGGGCACGC AAATGAGCTA GCATTATCCC
     3541 TAATACCTGC CACCCCACTC TTAATCAGTG GTGGAAGAAC GGTCTCAGAA CTGTTTGTTT
     3601 CAATTGGCCA TTTAAGTTTA GTAGTAAAAG ACTGGTTAAT GATAACAATG CATCGTAAAA
     3661 CCTTCAGAAG GAAAGGAGAA TGTTTTGTGG ACCACTTTGG TTTTCTTTTT TGCGTGTGGC
     3721 AGTTTTAAGT TATTAGTTTT TAAAATCAGT ACTTTTTAAT GGAAACAACT TGACCAAAAA
     3781 TTTGTCACAG AATTTTGAGA CCCATTAAAA AAGTTAAATG AGAAACCTGT GTGTTCCTTT
     3841 GGTCAACACC GAGACATTTA GGTGAAAGAC ATCTAATTCT GGTTTTACGA ATCTGGAAAC
     3901 TTCTTGAAAA TGTAATTCTT GAGTTAACAC TTCTGGGTGG AGAATAGGGT TGTTTTCCCC
     3961 CCACATAATT GGAAGGGGAA GGAATATCAT TTAAAGCTAT GGGAGGGTTT CTTTGATTAC
     4021 AACACTGGAG AGAAATGCAG CATGTTGCTG ATTGCCTGTC ACTAAAACAG GCCAAAAACT
     4081 GAGTCCTTGG GTTGCATAGA AAGCTTCATG TTGCTAAACC AATGTTAAGT GAATCTTTGG
     4141 AAACAAAATG TTTCCAAATT ACTGGGATGT GCATGTTGAA ACGTGGGTTA ATTAACTAGC
     4201 CATGACCAAA ATCCCTTAAC GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA
     4261 GATCAAAGGA TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA
     4321 AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC
     4381 GAAGGTAACT GGCTTCAGCA GAGCGCAGAT ACCAAATACT GTTCTTCTAG TGTAGCCGTA
     4441 GTTAGGCCAC CACTTCAAGA ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT
     4501 GTTACCAGTG GCTGCTGCCA GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG
     4561 ATAGTTACCG GATAAGGCGC AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG
     4621 CTTGGAGCGA ACGACCTACA CCGAACTGAG ATACCTACAG CGTGAGCTAT GAGAAAGCGC
     4681 CACGCTTCCC GAAGGGAGAA AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG
     4741 AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT CTTTATAGTC CTGTCGGGTT
     4801 TCGCCACCTC TGACTTGAGC GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG
     4861 GAAAAACGCC AGCAACGCGG CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA
     4921 CATGTTCTTA ATTAAATTTT TCAAAAGTAG TTGACAATTA ATCATCGGCA TAGTATATCG
     4981 GCATAGTATA ATACGACTCA CTATAGGAGG GCCATCATGG CCAAGTTGAC CAGTGCTGTC
     5041 CCAGTGCTCA CAGCCAGGGA TGTGGCTGGA GCTGTTGAGT TCTGGACTGA CAGGTTGGGG
     5101 TTCTCCAGAG ATTTTGTGGA GGATGACTTT GCAGGTGTGG TCAGAGATGA TGTCACCCTG
     5161 TTCATCTCAG CAGTCCAGGA CCAGGTGGTG CCTGACAACA CCCTGGCTTG GGTGTGGGTG
     5221 AGAGGACTGG ATGAGCTGTA TGCTGAGTGG AGTGAGGTGG TCTCCACCAA CTTCAGGGAT
     5281 GCCAGTGGCC CTGCCATGAC AGAGATTGGA GAGCAGCCCT GGGGGAGAGA GTTTGCCCTG
     5341 AGAGACCCAG CAGGCAACTG TGTGCACTTT GTGGCAGAGG AGCAGGACTG AGGATAAGAA
     5401 TTGTAACAAA AAACCCCGCC CCGGCGGGGT TTTTTGTTAA TTAA
//