LOCUS PC175S.TXT 5440 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_RNA complement(>4..12) /note="SV40 minor late 19S RNA" promoter <4..>348 /note="SV40 promoter (∆SELP)" enhancer 26..168 /note="SV40 enhancer elements" misc_feature 173..236 /note="SV40 21bp repeats" rep_origin 193..>328 /note="SV40 Ori" mRNA 242..248 /note="early-late transcription startpoints" mutation 262..262 /note="G->A kills SELP ATG" mRNA 282..288 /note="early-early transcription start points" CDS 373..1092 /note="EGFP" polyA_signal 1108..>1329 /note="SV40 Late PolyA" promoter <1420..>1657 /note="hEF1alpha promoter core" 5'UTR 1665..1931 /note="HTLV-1 R-U5" CDS 1963..>3480 /note="Müller's HPV16 L1" mutation 2485..2485 /note="T->A makes Cys175->Ser" polyA_signal <3529..4191 /note="hEF1a polyA signal" rep_origin 4192..4925 /note="MB1 Ori" promoter 4926..5012 /note="EM7 promoter" CDS 5013..5387 /note="ShBle (ZeoR)" terminator 5388..5440 /note="terminator (rpmB/G)" BASE COUNT 1312 A 1538 C 1479 G 1111 T 0 OTHER ORIGIN - 1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA 61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC 121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC 181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC 241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG 301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA 361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG 421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC 481 GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG 541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC 601 GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG 661 CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG 721 GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC 781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC 841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC 901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG 961 CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC 1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG 1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC 1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG 1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT 1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA 1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC 1381 GCCCTTCCCA ACAGTTGCGG TGGAGAAGAG CATGCGTGAG GCTCCGGTGC CCGTCAGTGG 1441 GCAGAGCGCA CATCGCCCAC AGTCCCCGAG AAGTTGGGGG GAGGGGTCGG CAATTGAACC 1501 GGTGCCTAGA GAAGGTGGCG CGGGGTAAAC TGGGAAAGTG ATGTCGTGTA CTGGCTCCGC 1561 CTTTTTCCCG AGGGTGGGGG AGAACCGTAT ATAAGTGCAG TAGTCGCCGT GAACGTTCTT 1621 TTTCGCAACG GGTTTGCCGC CAGAACACAG CTGAAGCTTC GAGGGGCTCG CATCTCTCCT 1681 TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC GGTTGAGTCG CGTTCTGCCG 1741 CCTCCCGCCT GTGGTGCCTC CTGAACTGCG TCCGCCGTCT AGGTAAGTTT AAAGCTCAGG 1801 TCGAGACCGG GCCTTTGTCC GGCGCTCCCT TGGAGCCTAC CTAGACTCAG CCGGCTCTCC 1861 ACGCTTTGCC TGACCCTGCT TGCTCAACTC TACGTCTTTG TTTCGTTTTC TGTTCTGCGC 1921 CGTTACAGAT CCAAGCTGTG ACCGGCCCGC TCTAGAGCCA CCATGAGCCT GTGGCTGCCC 1981 AGCGAGGCCA CCGTGTACCT GCCCCCCGTG CCCGTGAGCA AGGTGGTGAG CACCGACGAG 2041 TACGTGGCCA GGACCAACAT CTACTACCAC GCCGGCACCA GCAGGCTGCT GGCCGTGGGC 2101 CACCCCTACT TCCCCATCAA GAAGCCCAAC AACAACAAGA TCCTGGTGCC CAAGGTGAGC 2161 GGCCTGCAGT ACAGGGTGTT CAGGATCCAC CTGCCCGACC CCAACAAGTT CGGCTTCCCC 2221 GACACCAGCT TCTACAACCC CGACACCCAG AGGCTGGTGT GGGCCTGCGT GGGCGTGGAG 2281 GTGGGCAGGG GCCAGCCCCT GGGCGTGGGC ATCAGCGGCC ACCCCCTGCT GAACAAGCTG 2341 GACGACACCG AGAACGCCAG CGCCTACGCC GCCAACGCCG GCGTGGACAA CAGGGAGTGC 2401 ATCAGCATGG ACTACAAGCA GACCCAGCTG TGCCTGATCG GCTGCAAGCC CCCCATCGGC 2461 GAGCACTGGG GCAAGGGCAG CCCCAGCACC AACGTGGCCG TGAACCCCGG CGACTGCCCC 2521 CCCCTGGAGC TGATCAACAC CGTGATCCAG GACGGCGACA TGGTGGACAC CGGCTTCGGC 2581 GCCATGGACT TCACCACCCT GCAGGCCAAC AAGAGCGAGG TGCCCCTGGA CATCTGCACC 2641 AGCATCTGCA AGTACCCCGA CTACATCAAG ATGGTGAGCG AGCCCTACGG CGACAGCCTG 2701 TTCTTCTACC TGAGGAGGGA GCAGATGTTC GTGAGGCACC TGTTCAACAG GGCCGGCGCC 2761 GTGGGCGAGA ACGTGCCCGA CGACCTGTAC ATCAAGGGCA GCGGCAGCAC CGCCAACCTG 2821 GCCAGCAGCA ACTACTTCCC CACCCCCAGC GGCAGCATGG TGACCAGCGA CGCCCAGATC 2881 TTCAACAAGC CCTACTGGCT GCAGAGGGCC CAGGGCCACA ACAACGGCAT CTGCTGGGGC 2941 AACCAGCTGT TCGTGACCGT GGTGGACACC ACCAGGAGCA CCAACATGAG CCTGTGCGCC 3001 GCCATCAGCA CCAGCGAGAC CACCTACAAG AACACCAACT TCAAGGAGTA CCTGAGGCAC 3061 GGCGAGGAGT ACGACCTGCA GTTCATCTTC CAGCTGTGCA AGATCACCCT GACCGCCGAC 3121 GTGATGACCT ACATCCACAG CATGAACAGC ACCATCCTGG AGGACTGGAA CTTCGGCCTG 3181 CAGCCCCCCC CCGGCGGCAC CCTGGAGGAC ACCTACAGGT TCGTGACCAG CCAGGCCATC 3241 GCCTGCCAGA AGCACACCCC CCCCGCCCCC AAGGAGGACC CCCTGAAGAA GTACACCTTC 3301 TGGGAGGTGA ACCTGAAGGA GAAGTTCAGC GCCGACCTGG ACCAGTTCCC CCTGGGCAGG 3361 AAGTTCCTGC TGCAGGCCGG CCTGAAGGCC AAGCCCAAGT TCACCCTGGG CAAGAGGAAG 3421 GCCACCCCCA CCACCAGCAG CACCAGCACC ACCGCCAAGA GGAAGAAGAG GAAGCTGTGA 3481 AAGCTACCCA CGGCCGAATA GCCGTGAGCC GGAATCCTGC ACGCTAGCAT TATCCCTAAT 3541 ACCTGCCACC CCACTCTTAA TCAGTGGTGG AAGAACGGTC TCAGAACTGT TTGTTTCAAT 3601 TGGCCATTTA AGTTTAGTAG TAAAAGACTG GTTAATGATA ACAATGCATC GTAAAACCTT 3661 CAGAAGGAAA GGAGAATGTT TTGTGGACCA CTTTGGTTTT CTTTTTTGCG TGTGGCAGTT 3721 TTAAGTTATT AGTTTTTAAA ATCAGTACTT TTTAATGGAA ACAACTTGAC CAAAAATTTG 3781 TCACAGAATT TTGAGACCCA TTAAAAAAGT TAAATGAGAA ACCTGTGTGT TCCTTTGGTC 3841 AACACCGAGA CATTTAGGTG AAAGACATCT AATTCTGGTT TTACGAATCT GGAAACTTCT 3901 TGAAAATGTA ATTCTTGAGT TAACACTTCT GGGTGGAGAA TAGGGTTGTT TTCCCCCCAC 3961 ATAATTGGAA GGGGAAGGAA TATCATTTAA AGCTATGGGA GGGTTTCTTT GATTACAACA 4021 CTGGAGAGAA ATGCAGCATG TTGCTGATTG CCTGTCACTA AAACAGGCCA AAAACTGAGT 4081 CCTTGGGTTG CATAGAAAGC TTCATGTTGC TAAACCAATG TTAAGTGAAT CTTTGGAAAC 4141 AAAATGTTTC CAAATTACTG GGATGTGCAT GTTGAAACGT GGGTTAATTA ACTAGCCATG 4201 ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC 4261 AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA 4321 CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG 4381 GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTTC TTCTAGTGTA GCCGTAGTTA 4441 GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA 4501 CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG 4561 TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG 4621 GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG 4681 CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 4741 CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC 4801 CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA 4861 AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG 4921 TTCTTAATTA AATTTTTCAA AAGTAGTTGA CAATTAATCA TCGGCATAGT ATATCGGCAT 4981 AGTATAATAC GACTCACTAT AGGAGGGCCA TCATGGCCAA GTTGACCAGT GCTGTCCCAG 5041 TGCTCACAGC CAGGGATGTG GCTGGAGCTG TTGAGTTCTG GACTGACAGG TTGGGGTTCT 5101 CCAGAGATTT TGTGGAGGAT GACTTTGCAG GTGTGGTCAG AGATGATGTC ACCCTGTTCA 5161 TCTCAGCAGT CCAGGACCAG GTGGTGCCTG ACAACACCCT GGCTTGGGTG TGGGTGAGAG 5221 GACTGGATGA GCTGTATGCT GAGTGGAGTG AGGTGGTCTC CACCAACTTC AGGGATGCCA 5281 GTGGCCCTGC CATGACAGAG ATTGGAGAGC AGCCCTGGGG GAGAGAGTTT GCCCTGAGAG 5341 ACCCAGCAGG CAACTGTGTG CACTTTGTGG CAGAGGAGCA GGACTGAGGA TAAGAATTGT 5401 AACAAAAAAC CCCGCCCCGG CGGGGTTTTT TGTTAATTAA //