LOCUS POHUL1.TXT 4603 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers unsure 11..86 /note="monkey DNA picked up during passaging" repeat_unit 84..91 /note="3' end of second 21bp repeat" protein_bind 84..>134 /note="T antigen binding sites" rep_origin 84..>220 /note="SV40 Ori core" insertion_seq 94..100 /note="sequence duplication relative to prototype SV40" mRNA 115..121 /note="late-early transcription start points" TATA_signal 118..134 /note="AT tract" mutation 135..135 /note="SELP ATG" misc_structure 136..162 /note="early palindrome" mRNA 155..161 /note="early-early transcription start points" polyA_signal 275..>496 /note="SV40 Late PolyA" promoter <587..>824 /note="hEF1alpha promoter core" 5'UTR 832..1098 /note="HTLV-1 R-U5" CDS 1130..>2647 /note="HPV16 L1" polyA_signal <2697..3359 /note="hEF1a polyA signal" rep_origin 3360..4093 /note="MB1 Ori" promoter 4094..4180 /note="EM7 promoter" CDS 4181..4555 /note="ShBle (ZeoR)" terminator 4556..>4603 /note="terminator (rpmB/G)" BASE COUNT 1106 A 1247 C 1253 G 997 T 0 OTHER ORIGIN - 1 ATCGGGTACC TGACTCATCT CAGGGGCAAC CGCTGCTGAG TTCCAGATGG TTGTTGCCAG 61 GCAGAAAAGT GATGAGTCAC AGTTCCGCCC ATTCTCCGCC TCCGCCCCAT GGCTGACTAA 121 TTTTTTTTAT TTATGCAGAG GCCGAGGCCG CCTCGGCCTC TGAGCTATTC CAGAAGTAGT 181 GAGGAGGCTT TTTTGGAGGC CTAGGCTTTT GCAAAAAGCT CAAGCTTTAA TGCGGTAGTT 241 TATCACAGTT AAATTGATAT CCGGCCGCTT CGAGCAGACA TGATAAGATA CATTGATGAG 301 TTTGGACAAA CCACAACTAG AATGCAGTGA AAAAAATGCT TTATTTGTGA AATTTGTGAT 361 GCTATTGCTT TATTTGTAAC CATTATAAGC TGCAATAAAC AAGTTAACAA CAACAATTGC 421 ATTCATTTTA TGTTTCAGGT TCAGGGGGAG GTGTGGGAGG TTTTTTAAAG CAAGTAAAAC 481 CTCTACAAAT GTGGTAAAAT CGATAAGGAT CCGGGCTGGC GTAATAGCGA AGAGGCCCGC 541 ACCGATCGCC CTTCCCAACA GTTGCGGTGG AGAAGAGCAT GCGTGAGGCT CCGGTGCCCG 601 TCAGTGGGCA GAGCGCACAT CGCCCACAGT CCCCGAGAAG TTGGGGGGAG GGGTCGGCAA 661 TTGAACCGGT GCCTAGAGAA GGTGGCGCGG GGTAAACTGG GAAAGTGATG TCGTGTACTG 721 GCTCCGCCTT TTTCCCGAGG GTGGGGGAGA ACCGTATATA AGTGCAGTAG TCGCCGTGAA 781 CGTTCTTTTT CGCAACGGGT TTGCCGCCAG AACACAGCTG AAGCTTCGAG GGGCTCGCAT 841 CTCTCCTTCA CGCGCCCGCC GCCCTACCTG AGGCCGCCAT CCACGCCGGT TGAGTCGCGT 901 TCTGCCGCCT CCCGCCTGTG GTGCCTCCTG AACTGCGTCC GCCGTCTAGG TAAGTTTAAA 961 GCTCAGGTCG AGACCGGGCC TTTGTCCGGC GCTCCCTTGG AGCCTACCTA GACTCAGCCG 1021 GCTCTCCACG CTTTGCCTGA CCCTGCTTGC TCAACTCTAC GTCTTTGTTT CGTTTTCTGT 1081 TCTGCGCCGT TACAGATCCA AGCTGTGACC GGCCCGCTCT AGAGCCACCA TGAGCCTGTG 1141 GCTGCCCAGC GAGGCCACCG TGTACCTGCC CCCCGTGCCC GTGAGCAAGG TGGTGAGCAC 1201 CGACGAGTAC GTGGCCAGGA CCAACATCTA CTACCACGCC GGCACCAGCA GGCTGCTGGC 1261 CGTGGGCCAC CCCTACTTCC CCATCAAGAA GCCCAACAAC AACAAGATCC TGGTGCCCAA 1321 GGTGAGCGGC CTGCAGTACA GGGTGTTCAG GATCCACCTG CCCGACCCCA ACAAGTTCGG 1381 CTTCCCCGAC ACCAGCTTCT ACAACCCCGA CACCCAGAGG CTGGTGTGGG CCTGCGTGGG 1441 CGTGGAGGTG GGCAGGGGCC AGCCCCTGGG CGTGGGCATC AGCGGCCACC CCCTGCTGAA 1501 CAAGCTGGAC GACACCGAGA ACGCCAGCGC CTACGCCGCC AACGCCGGCG TGGACAACAG 1561 GGAGTGCATC AGCATGGACT ACAAGCAGAC CCAGCTGTGC CTGATCGGCT GCAAGCCCCC 1621 CATCGGCGAG CACTGGGGCA AGGGCAGCCC CTGCACCAAC GTGGCCGTGA ACCCCGGCGA 1681 CTGCCCCCCC CTGGAGCTGA TCAACACCGT GATCCAGGAC GGCGACATGG TGGACACCGG 1741 CTTCGGCGCC ATGGACTTCA CCACCCTGCA GGCCAACAAG AGCGAGGTGC CCCTGGACAT 1801 CTGCACCAGC ATCTGCAAGT ACCCCGACTA CATCAAGATG GTGAGCGAGC CCTACGGCGA 1861 CAGCCTGTTC TTCTACCTGA GGAGGGAGCA GATGTTCGTG AGGCACCTGT TCAACAGGGC 1921 CGGCGCCGTG GGCGAGAACG TGCCCGACGA CCTGTACATC AAGGGCAGCG GCAGCACCGC 1981 CAACCTGGCC AGCAGCAACT ACTTCCCCAC CCCCAGCGGC AGCATGGTGA CCAGCGACGC 2041 CCAGATCTTC AACAAGCCCT ACTGGCTGCA GAGGGCCCAG GGCCACAACA ACGGCATCTG 2101 CTGGGGCAAC CAGCTGTTCG TGACCGTGGT GGACACCACC AGGAGCACCA ACATGAGCCT 2161 GTGCGCCGCC ATCAGCACCA GCGAGACCAC CTACAAGAAC ACCAACTTCA AGGAGTACCT 2221 GAGGCACGGC GAGGAGTACG ACCTGCAGTT CATCTTCCAG CTGTGCAAGA TCACCCTGAC 2281 CGCCGACGTG ATGACCTACA TCCACAGCAT GAACAGCACC ATCCTGGAGG ACTGGAACTT 2341 CGGCCTGCAG CCCCCCCCCG GCGGCACCCT GGAGGACACC TACAGGTTCG TGACCAGCCA 2401 GGCCATCGCC TGCCAGAAGC ACACCCCCCC CGCCCCCAAG GAGGACCCCC TGAAGAAGTA 2461 CACCTTCTGG GAGGTGAACC TGAAGGAGAA GTTCAGCGCC GACCTGGACC AGTTCCCCCT 2521 GGGCAGGAAG TTCCTGCTGC AGGCCGGCCT GAAGGCCAAG CCCAAGTTCA CCCTGGGCAA 2581 GAGGAAGGCC ACCCCCACCA CCAGCAGCAC CAGCACCACC GCCAAGAGGA AGAAGAGGAA 2641 GCTGTGAAAG CTGACCCACG GCCGAATAGC CGTGAGCCGG AATCCTGCAC GCTAGCATTA 2701 TCCCTAATAC CTGCCACCCC ACTCTTAATC AGTGGTGGAA GAACGGTCTC AGAACTGTTT 2761 GTTTCAATTG GCCATTTAAG TTTAGTAGTA AAAGACTGGT TAATGATAAC AATGCATCGT 2821 AAAACCTTCA GAAGGAAAGG AGAATGTTTT GTGGACCACT TTGGTTTTCT TTTTTGCGTG 2881 TGGCAGTTTT AAGTTATTAG TTTTTAAAAT CAGTACTTTT TAATGGAAAC AACTTGACCA 2941 AAAATTTGTC ACAGAATTTT GAGACCCATT AAAAAAGTTA AATGAGAAAC CTGTGTGTTC 3001 CTTTGGTCAA CACCGAGACA TTTAGGTGAA AGACATCTAA TTCTGGTTTT ACGAATCTGG 3061 AAACTTCTTG AAAATGTAAT TCTTGAGTTA ACACTTCTGG GTGGAGAATA GGGTTGTTTT 3121 CCCCCCACAT AATTGGAAGG GGAAGGAATA TCATTTAAAG CTATGGGAGG GTTTCTTTGA 3181 TTACAACACT GGAGAGAAAT GCAGCATGTT GCTGATTGCC TGTCACTAAA ACAGGCCAAA 3241 AACTGAGTCC TTGGGTTGCA TAGAAAGCTT CATGTTGCTA AACCAATGTT AAGTGAATCT 3301 TTGGAAACAA AATGTTTCCA AATTACTGGG ATGTGCATGT TGAAACGTGG GTTAATTAAC 3361 TAGCCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG 3421 AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA 3481 CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT 3541 TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTTCTT CTAGTGTAGC 3601 CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA 3661 TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA 3721 GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC 3781 CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA 3841 GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA 3901 CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG 3961 GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC 4021 TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 4081 CTCACATGTT CTTAATTAAA TTTTTCAAAA GTAGTTGACA ATTAATCATC GGCATAGTAT 4141 ATCGGCATAG TATAATACGA CTCACTATAG GAGGGCCATC ATGGCCAAGT TGACCAGTGC 4201 TGTCCCAGTG CTCACAGCCA GGGATGTGGC TGGAGCTGTT GAGTTCTGGA CTGACAGGTT 4261 GGGGTTCTCC AGAGATTTTG TGGAGGATGA CTTTGCAGGT GTGGTCAGAG ATGATGTCAC 4321 CCTGTTCATC TCAGCAGTCC AGGACCAGGT GGTGCCTGAC AACACCCTGG CTTGGGTGTG 4381 GGTGAGAGGA CTGGATGAGC TGTATGCTGA GTGGAGTGAG GTGGTCTCCA CCAACTTCAG 4441 GGATGCCAGT GGCCCTGCCA TGACAGAGAT TGGAGAGCAG CCCTGGGGGA GAGAGTTTGC 4501 CCTGAGAGAC CCAGCAGGCA ACTGTGTGCA CTTTGTGGCA GAGGAGCAGG ACTGAGGATA 4561 AGAATTGTAA CAAAAAACCC CGCCCCGGCG GGGTTTTTTG TTA //