LOCUS       PHELL.GB     6118 BP DS-DNA   CIRCULAR  SYN       17-MAY-2003
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     misc_RNA        complement(>4..12)
                     /note="SV40 minor late 19S RNA"
     promoter        <4..>348
                     /note="SV40 promoter (∆SELP)"
     enhancer        26..168
                     /note="SV40 enhancer elements"
     misc_feature    173..236
                     /note="SV40 21bp repeats"
     rep_origin      193..>328
                     /note="SV40 Ori"
     mRNA            242..248
                     /note="early-late transcription startpoints"
     mutation        262..262
                     /note="G->A kills SELP ATG"
     mRNA            282..288
                     /note="early-early transcription start points"
     CDS             373..1761
                     /note="HPV18 L2 (codon modified)"
     polyA_signal    1783..>2004
                     /note="SV40 Late PolyA"
     promoter        <2095..2332
                     /note="hEF1alpha promoter core"
     5'UTR           2340..2627
                     /note="HTLV R-U5 leader"
     CDS             2677..4200
                     /note="HPV18 L1 (codon modified)"
     polyA_signal    <4207..4869
                     /note="hEF1a polyA signal"
     rep_origin      4870..5603
                     /note="MB1 Ori"
     promoter        5604..5690
                     /note="EM7 promoter"
     CDS             5691..6065
                     /note="ShBle (ZeoR)"
     terminator      6066..6118
                     /note="terminator (rpmB/G)"
BASE COUNT     1462 A   1817 C   1572 G   1267 T      0 OTHER
ORIGIN      -
        1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA
       61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC
      121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC
      181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC
      241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG
      301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA
      361 CCGGTCGCCA CCATGGTCAG CCATAGGGCT GCCAGGAGGA AGAGAGCCAG CGTGACCGAT
      421 CTGTACAAGA CCTGCAAGCA GAGCGGCACC TGCCCCCCCG ACGTGGTGCC CAAAGTCGAA
      481 GGGACAACCC TGGCCGACAA GATCCTCCAG TGGAGCTCTT TGGGCATCTT CCTCGGCGGC
      541 TTGGGGATCG GCACCGGGTC CGGCACCGGC GGCAGGACCG GCTATATCCC CCTCGGGGGC
      601 AGGAGCAACA CCGTCGTCGA CGTGGGCCCC ACCAGGCCTC CCGTCGTGAT CGAGCCCGTC
      661 GGGCCTACCG ATCCCAGCAT CGTGACCCTG ATCGAAGATA GCTCCGTCGT GACCAGCGGC
      721 GCCCCCCGCC CCACCTTCAC CGGGACCAGC GGCTTCGACA TCACCAGCGC CGGCACCACC
      781 ACCCCCGCCG TGCTCGACAT TACCCCCAGC AGCACAAGCG TCAGCATCAG TACCACAAAC
      841 TTCACAAACC CCGCCTTCAG CGACCCCAGC ATCATCGAGG TGCCCCAGAC CGGCGAAGTC
      901 GCCGGCAACG TGTTCGTGGG CACACCCACC AGCGGCACCC ACGGCTACGA AGAGATCCCC
      961 CTGCAGACCT TCGCCAGCAG CGGCACCGGC GAAGAGCCAA TCTCCTCCAC ACCCCTCCCC
     1021 ACCGTCAGAA GGGTGGCCGG CCCAAGGTTG TATTCCCGCG CTTATCAGCA AGTCAGCGTC
     1081 GCCAATCCCG AATTCCTGAC CAGGCCCAGC AGCCTGATCA CCTACGATAA CCCCGCTTTC
     1141 GAACCCGTCG ATACCACCCT GACCTTCGAC CCCAGGTCCG ACGTGCCCGA CAGCGACTTC
     1201 ATGGACATCA TTAGGTTGCA CCGCCCCGCC CTGACCAGCC GCAGGGGCAC CGTGAGGTTC
     1261 TCCCGGCTGG GCCAGAGAGC CACCATGTTC ACAAGGTCTG GCACCCAGAT CGGCGCCCGC
     1321 GTGCATTTCT ATCACGACAT CTCCCCCATC GCCCCCAGCC CCGAGTACAT CGAGCTCCAA
     1381 CCCCTCGTGA GCGCTACCGA AGATAACGAT CTCTTCGACA TCTACGCCGA CGATATGGAT
     1441 CCCGCCGTCC CCGTGCCCAG CAGGAGCACC ACAAGCTTCG CCTTCTTCAA GTACAGCCCA
     1501 ACCATCAGCA GCGCTAGCAG TTACTCCAAC GTGACCGTGC CCCTGACAAG TAGCTGGGAC
     1561 GTCCCCGTGT ATACCGGCCC CGACATCACC CTGCCCAGCA CCACAAGCGT GTGGCCAATC
     1621 GTGAGCCCAA CCGCTCCCGC TAGCACCCAA TACATCGGCA TCCACGGCAC CCACTACTAC
     1681 CTCTGGCCCC TGTACTACTT CATCCCCAAG AAGAGGAAGA GGGTGCCATA CTTCTTCGCC
     1741 GACGGGTTCG TCGCCGCTTG AGACGCTCGA GGCCGCTTCG AGCAGACATG ATAAGATACA
     1801 TTGATGAGTT TGGACAAACC ACAACTAGAA TGCAGTGAAA AAAATGCTTT ATTTGTGAAA
     1861 TTTGTGATGC TATTGCTTTA TTTGTAACCA TTATAAGCTG CAATAAACAA GTTAACAACA
     1921 ACAATTGCAT TCATTTTATG TTTCAGGTTC AGGGGGAGGT GTGGGAGGTT TTTTAAAGCA
     1981 AGTAAAACCT CTACAAATGT GGTAAAATCG ATAAGGATCC GGGCTGGCGT AATAGCGAAG
     2041 AGGCCCGCAC CGATCGCCCT TCCCAACAGT TGCGGTGGAG AAGAGCATGC GTGAGGCTCC
     2101 GGTGCCCGTC AGTGGGCAGA GCGCACATCG CCCACAGTCC CCGAGAAGTT GGGGGGAGGG
     2161 GTCGGCAATT GAACCGGTGC CTAGAGAAGG TGGCGCGGGG TAAACTGGGA AAGTGATGTC
     2221 GTGTACTGGC TCCGCCTTTT TCCCGAGGGT GGGGGAGAAC CGTATATAAG TGCAGTAGTC
     2281 GCCGTGAACG TTCTTTTTCG CAACGGGTTT GCCGCCAGAA CACAGCTGAA GCTTCGAGGG
     2341 GCTCGCATCT CTCCTTCACG CGCCCGCCGC CCTACCTGAG GCCGCCATCC ACGCCGGTTG
     2401 AGTCGCGTTC TGCCGCCTCC CGCCTGTGGT GCCTCCTGAA CTGCGTCCGC CGTCTAGGTA
     2461 AGTTTAAAGC TCAGGTCGAG ACCGGGCCTT TGTCCGGCGC TCCCTTGGAG CCTACCTAGA
     2521 CTCAGCCGGC TCTCCACGCT TTGCCTGACC CTGCTTGCTC AACTCTACGT CTTTGTTTCG
     2581 TTTTCTGTTC TGCGCCGTTA CAGATCCAAG CTGTGACCGG CGCCTACGTA AGTGATATCT
     2641 ACTAGATTTA TCAAAAAGAG TGTTGACTTG TGAGCCATGG CCCTCTGGAG ACCATCCGAT
     2701 AACACAGTGT ACTTGCCCCC ACCCAGCGTC GCCCGGGTGG TGAACACAGA CGACTACGTC
     2761 ACCAGAACCT CAATCTTCTA CCACGCCGGG TCCAGCCGGC TGCTGACCGT GGGCAACCCC
     2821 TACTTCCGCG TGCCCGCCGG CGGCGGAAAC AAACAAGACA TCCCCAAAGT CAGCGCCTAT
     2881 CAGTACCGGG TGTTCCGCGT CCAACTGCCC GATCCCAACA AGTTCGGCCT GCCCGACACC
     2941 TCCATCTACA ACCCCGAGAC CCAGAGGCTG GTCTGGGCTT GCGCCGGCGT CGAGATCGGG
     3001 AGGGGCCAAC CCCTGGGCGT GGGGTTGTCC GGCCACCCCT TCTACAACAA GCTGGACGAT
     3061 ACCGAGTCCA GCCACGCAGC AACCAGCAAC GTCTCCGAAG ATGTGCGCGA TAACGTCAGC
     3121 GTGGACTACA AACAAACCCA ACTGTGCATC CTGGGATGCG CACCCGCCAT CGGCGAGCAT
     3181 TGGGCCAAGG GGACCGCCTG CAAGAGCAGG CCCCTGAGCC AAGGGGACTG TCCACCCCTG
     3241 GAGTTGAAGA ATACCGTGCT CGAGGACGGC GACATGGTGG ACACCGGCTA CGGCGCTATG
     3301 GATTTCTCCA CCCTCCAGGA CACCAAGTGC GAAGTGCCCC TCGACATCTG CCAAAGCATC
     3361 TGCAAGTACC CCGACTACCT CCAGATGAGC GCCGACCCCT ACGGCGACAG CATGTTCTTC
     3421 TGTCTCAGAA GGGAACAATT GTTCGCCCGC CACTTCTGGA ACCGGGCCGG CACAATGGGA
     3481 GATACAGTCC CCCAGAGCCT GTACATCAAG GGGACCGGAA TGAGGGCCAG CCCCGGGTCC
     3541 TGCGTCTACA GCCCAAGCCC CTCCGGGAGC ATCGTCACAA GCGATAGCCA ACTCTTCAAC
     3601 AAGCCCTACT GGCTCCACAA AGCCCAAGGC CACAATAACG GGGTGTGTTG GCACAACCAG
     3661 CTGTTCGTGA CCGTCGTGGA CACAACCAGG TCCACAAACC TGACCATCTG CGCCAGCACC
     3721 CAAAGCCCCG TGCCCGGCCA GTACGACGCC ACAAAGTTCA AACAATACTC TCGGCACGTG
     3781 GAAGAGTACG ACCTCCAATT CATCTTCCAA CTCTGCACCA TCACCCTCAC CGCCGACGTG
     3841 ATGAGCTACA TCCACTCCAT GAACTCCTCC ATCCTGGAAG ACTGGAATTT CGGCGTGCCA
     3901 CCACCCCCTA CCACCTCCCT CGTCGACACC TACAGATTCG TGCAGAGCGT GGCCATCACA
     3961 TGCCAGAAAG ACGCCGCCCC CGCCGAGAAC AAAGACCCAT ACGACAAACT GAAATTCTGG
     4021 AACGTCGACC TGAAAGAGAA ATTCAGCCTG GATCTGGACC AGTACCCATT GGGCAGGAAG
     4081 TTCCTCGTGC AAGCCGGCCT CAGGAGAAAA CCAACAATCG GGCCCAGGAA GAGGAGCGCC
     4141 CCCAGCGCAA CCACCAGCAG CAAGCCCGCA AAAAGGGTCA GAGTGAGGGC ACGCAAATGA
     4201 GCTAGCATTA TCCCTAATAC CTGCCACCCC ACTCTTAATC AGTGGTGGAA GAACGGTCTC
     4261 AGAACTGTTT GTTTCAATTG GCCATTTAAG TTTAGTAGTA AAAGACTGGT TAATGATAAC
     4321 AATGCATCGT AAAACCTTCA GAAGGAAAGG AGAATGTTTT GTGGACCACT TTGGTTTTCT
     4381 TTTTTGCGTG TGGCAGTTTT AAGTTATTAG TTTTTAAAAT CAGTACTTTT TAATGGAAAC
     4441 AACTTGACCA AAAATTTGTC ACAGAATTTT GAGACCCATT AAAAAAGTTA AATGAGAAAC
     4501 CTGTGTGTTC CTTTGGTCAA CACCGAGACA TTTAGGTGAA AGACATCTAA TTCTGGTTTT
     4561 ACGAATCTGG AAACTTCTTG AAAATGTAAT TCTTGAGTTA ACACTTCTGG GTGGAGAATA
     4621 GGGTTGTTTT CCCCCCACAT AATTGGAAGG GGAAGGAATA TCATTTAAAG CTATGGGAGG
     4681 GTTTCTTTGA TTACAACACT GGAGAGAAAT GCAGCATGTT GCTGATTGCC TGTCACTAAA
     4741 ACAGGCCAAA AACTGAGTCC TTGGGTTGCA TAGAAAGCTT CATGTTGCTA AACCAATGTT
     4801 AAGTGAATCT TTGGAAACAA AATGTTTCCA AATTACTGGG ATGTGCATGT TGAAACGTGG
     4861 GTTAATTAAC TAGCCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA
     4921 GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC
     4981 TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA
     5041 CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTTCTT
     5101 CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC
     5161 GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG
     5221 TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG
     5281 TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG
     5341 CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC
     5401 AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT
     5461 AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG
     5521 GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC
     5581 TGGCCTTTTG CTCACATGTT CTTAATTAAA TTTTTCAAAA GTAGTTGACA ATTAATCATC
     5641 GGCATAGTAT ATCGGCATAG TATAATACGA CTCACTATAG GAGGGCCATC ATGGCCAAGT
     5701 TGACCAGTGC TGTCCCAGTG CTCACAGCCA GGGATGTGGC TGGAGCTGTT GAGTTCTGGA
     5761 CTGACAGGTT GGGGTTCTCC AGAGATTTTG TGGAGGATGA CTTTGCAGGT GTGGTCAGAG
     5821 ATGATGTCAC CCTGTTCATC TCAGCAGTCC AGGACCAGGT GGTGCCTGAC AACACCCTGG
     5881 CTTGGGTGTG GGTGAGAGGA CTGGATGAGC TGTATGCTGA GTGGAGTGAG GTGGTCTCCA
     5941 CCAACTTCAG GGATGCCAGT GGCCCTGCCA TGACAGAGAT TGGAGAGCAG CCCTGGGGGA
     6001 GAGAGTTTGC CCTGAGAGAC CCAGCAGGCA ACTGTGTGCA CTTTGTGGCA GAGGAGCAGG
     6061 ACTGAGGATA AGAATTGTAA CAAAAAACCC CGCCCCGGCG GGGTTTTTTG TTAATTAA
//