LOCUS P5L1H.TXT 5523 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_RNA complement(>4..12) /note="SV40 minor late 19S RNA" promoter <4..>348 /note="SV40 promoter" enhancer 26..168 /note="SV40 enhancer elements" misc_feature 173..236 /note="SV40 21bp repeats" rep_origin 193..>328 /note="SV40 Ori" mRNA 242..248 /note="early-late transcription startpoints" mutation 262..262 /note="G->A kills SELP ATG" mRNA 282..288 /note="early-early transcription start points" CDS 373..1092 /note="EGFP" polyA_signal 1108..>1329 /note="SV40 late polyA" promoter <1420..1657 /note="hEF1alpha promoter core" 5'UTR 1665..1952 /note="HTLV R-U5 leader" insertion_seq 1973..>1997 /note="attB1" CDS 2010..3560 /note="HPV5 L1 (codmod)" insertion_seq 3568..>3592 /note="attB2" polyA_signal <3612..4274 /note="hEF1a polyA signal" rep_origin 4275..5008 /note="MB1 Ori" promoter 5009..5095 /note="EM7 promoter" CDS 5096..5470 /note="ShBle (Zeocin resistance)" terminator 5471..5523 /note="terminator (rpmB/G)" BASE COUNT 1355 A 1516 C 1452 G 1200 T 0 OTHER ORIGIN - 1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA 61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC 121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC 181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC 241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG 301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA 361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG 421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC 481 GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG 541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC 601 GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG 661 CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG 721 GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC 781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC 841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC 901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG 961 CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC 1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG 1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC 1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG 1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT 1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA 1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC 1381 GCCCTTCCCA ACAGTTGCGG TGGAGAAGAG CATGCGTGAG GCTCCGGTGC CCGTCAGTGG 1441 GCAGAGCGCA CATCGCCCAC AGTCCCCGAG AAGTTGGGGG GAGGGGTCGG CAATTGAACC 1501 GGTGCCTAGA GAAGGTGGCG CGGGGTAAAC TGGGAAAGTG ATGTCGTGTA CTGGCTCCGC 1561 CTTTTTCCCG AGGGTGGGGG AGAACCGTAT ATAAGTGCAG TAGTCGCCGT GAACGTTCTT 1621 TTTCGCAACG GGTTTGCCGC CAGAACACAG CTGAAGCTTC GAGGGGCTCG CATCTCTCCT 1681 TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC GGTTGAGTCG CGTTCTGCCG 1741 CCTCCCGCCT GTGGTGCCTC CTGAACTGCG TCCGCCGTCT AGGTAAGTTT AAAGCTCAGG 1801 TCGAGACCGG GCCTTTGTCC GGCGCTCCCT TGGAGCCTAC CTAGACTCAG CCGGCTCTCC 1861 ACGCTTTGCC TGACCCTGCT TGCTCAACTC TACGTCTTTG TTTCGTTTTC TGTTCTGCGC 1921 CGTTACAGAT CCAAGCTGTG ACCGGCGCCT ACGTAAGTGA TAGCTTGATC AAACAAGTTT 1981 GTACAAAAAA GCAGGCTTCT AGAGCCACCA TGGCCGTCTG GCATAGCGCC AACGGCAAGG 2041 TCTACTTGCC CCCGAGCACC CCCGTCGCAC GCGTGCAGTC TACAGACGAG TATATCCAGC 2101 GCACCAACAT CTACTACCAC GCCTTCTCCG ATCGCCTCCT GACCGTGGGC CACCCCTACT 2161 TTAACGTGTA TAACATCAAC GGCGACAAGT TGGAAGTCCC CAAAGTCAGC GGCAACCAGC 2221 ATCGCGTGTT CAGGTTGAAG CTGCCCGACC CCAATCGCTT CGCCCTGGCC GACATGAGCG 2281 TCTACAATCC CGATAAGGAG AGGCTCGTCT GGGCATGCCG CGGGCTGGAG ATCGGCCGCG 2341 GGCAACCCCT GGGCGTGGGC TCCACCGGCC ATCCCTACTT TAACAAGGTC AAGGACACCG 2401 AGAATTCCAA CGCCTATATC ACCTTCAGCA AGGACGATCG CCAAGACACC AGCTTCGACC 2461 CCAAGCAGAT TCAGATGTTC ATCGTGGGCT GTACCCCCTG TATCGGCGAA CACTGGGACA 2521 AGGCCGTCCC CTGCGCCGAG AACGACCAAC AGACCGGGTT GTGCCCCCCC ATCGAGTTGA 2581 AGAATACCTA CATCGAGGAC GGCGACATGG CCGATATCGG CTTCGGCAAT ATGAACTTCA 2641 AAGCGTTGCA GGACTCCCGG AGCGACGTGT CCCTGGATAT TGTGAACGAG ACCTGCAAGT 2701 ACCCCGACTT CCTGAAAATG CAGAATGACA TCTACGGGGA CGCCTGTTTC TTCTACGCCA 2761 GGCGCGAACA GTGCTACGCA CGCCATTTCT TCGTCCGCGG CGGCAAGACC GGCGACGATA 2821 TCCCCGGCGC CCAGATCGAT AACGGCACCT ATAAGAACCA ATTCTATATC CCCGGCGCCG 2881 ACGGGCAGGC CCAGAAAACC ATCGGCAACA GTATGTACTT TCCCACCGTC TCCGGGAGCC 2941 TGGTCAGTTC CGACGCCCAG CTCTTCAATC GCCCATTTTG GTTGCAGCGC GCACAGGGCC 3001 ACAACAACGG GATTCTCTGG GCCAACCAGA TGTTCATTAC CGTCGTCGAT AACACCCGCA 3061 ACACCAACTT TTCCATCAGC GTGTACAACC AAGCCGGCGC CTTGAAGGAC GTCGCCGATT 3121 ACAACGCCGA CCAGTTCCGC GAGTACCAGC GCCACGTGGA GGAGTACGAG ATCAGCCTGA 3181 TCTTGCAGTT GTGCAAAGTC CCCCTGAAGG CCGAAGTGCT CGCCCAGATT AACGCCATGA 3241 ATAGCAGCCT GCTCGAAGAC TGGCAGCTGG GCTTCGTCCC AACCCCCGAC AACCCCATCC 3301 AAGATACATA TCGCTACATC GATAGCCTCG CCACCAGATG CCCCGACAAG AACCCCCCCA 3361 AGGAGAAGGA GGATCCCTAC AAAGGGCTGC ACTTCTGGGA CGTGGACCTG ACCGAGCGCC 3421 TCAGCCTGGA CCTGGACCAG TACAGTCTGG GGCGCAAGTT CCTGTTTCAG GCCGGCCTGC 3481 AGCAGACCAC AGTCAATGGC ACCAAGGCCG TCAGCTACAA GGGCAGCAAC CGCGGCACCA 3541 AGAGGAAGAG GAAGAACTGA GCCCGGGACC CAGCTTTCTT GTACAAAGTG GTTCGATCTA 3601 GAATGGCTAG CATTATCCCT AATACCTGCC ACCCCACTCT TAATCAGTGG TGGAAGAACG 3661 GTCTCAGAAC TGTTTGTTTC AATTGGCCAT TTAAGTTTAG TAGTAAAAGA CTGGTTAATG 3721 ATAACAATGC ATCGTAAAAC CTTCAGAAGG AAAGGAGAAT GTTTTGTGGA CCACTTTGGT 3781 TTTCTTTTTT GCGTGTGGCA GTTTTAAGTT ATTAGTTTTT AAAATCAGTA CTTTTTAATG 3841 GAAACAACTT GACCAAAAAT TTGTCACAGA ATTTTGAGAC CCATTAAAAA AGTTAAATGA 3901 GAAACCTGTG TGTTCCTTTG GTCAACACCG AGACATTTAG GTGAAAGACA TCTAATTCTG 3961 GTTTTACGAA TCTGGAAACT TCTTGAAAAT GTAATTCTTG AGTTAACACT TCTGGGTGGA 4021 GAATAGGGTT GTTTTCCCCC CACATAATTG GAAGGGGAAG GAATATCATT TAAAGCTATG 4081 GGAGGGTTTC TTTGATTACA ACACTGGAGA GAAATGCAGC ATGTTGCTGA TTGCCTGTCA 4141 CTAAAACAGG CCAAAAACTG AGTCCTTGGG TTGCATAGAA AGCTTCATGT TGCTAAACCA 4201 ATGTTAAGTG AATCTTTGGA AACAAAATGT TTCCAAATTA CTGGGATGTG CATGTTGAAA 4261 CGTGGGTTAA TTAACTAGCC ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG 4321 CGTCAGACCC CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA 4381 TCTGCTGCTT GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG 4441 AGCTACCAAC TCTTTTTCCG AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG 4501 TTCTTCTAGT GTAGCCGTAG TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT 4561 ACCTCGCTCT GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA 4621 CCGGGTTGGA CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG 4681 GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC 4741 GTGAGCTATG AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA GGCGGACAGG TATCCGGTAA 4801 GCGGCAGGGT CGGAACAGGA GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC 4861 TTTATAGTCC TGTCGGGTTT CGCCACCTCT GACTTGAGCG TCGATTTTTG TGATGCTCGT 4921 CAGGGGGGCG GAGCCTATGG AAAAACGCCA GCAACGCGGC CTTTTTACGG TTCCTGGCCT 4981 TTTGCTGGCC TTTTGCTCAC ATGTTCTTAA TTAAATTTTT CAAAAGTAGT TGACAATTAA 5041 TCATCGGCAT AGTATATCGG CATAGTATAA TACGACTCAC TATAGGAGGG CCATCATGGC 5101 CAAGTTGACC AGTGCTGTCC CAGTGCTCAC AGCCAGGGAT GTGGCTGGAG CTGTTGAGTT 5161 CTGGACTGAC AGGTTGGGGT TCTCCAGAGA TTTTGTGGAG GATGACTTTG CAGGTGTGGT 5221 CAGAGATGAT GTCACCCTGT TCATCTCAGC AGTCCAGGAC CAGGTGGTGC CTGACAACAC 5281 CCTGGCTTGG GTGTGGGTGA GAGGACTGGA TGAGCTGTAT GCTGAGTGGA GTGAGGTGGT 5341 CTCCACCAAC TTCAGGGATG CCAGTGGCCC TGCCATGACA GAGATTGGAG AGCAGCCCTG 5401 GGGGAGAGAG TTTGCCCTGA GAGACCCAGC AGGCAACTGT GTGCACTTTG TGGCAGAGGA 5461 GCAGGACTGA GGATAAGAAT TGTAACAAAA AACCCCGCCC CGGCGGGGTT TTTTGTTAAT 5521 TAA //