LOCUS PXULL.TXT 8589 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_RNA complement(>4..12) /note="SV40 minor late 19S RNA" promoter <4..>348 /note="SV40 promoter" enhancer 26..168 /note="SV40 enhancer elements" misc_feature 173..236 /note="SV40 21bp repeats" rep_origin 193..>328 /note="SV40 Ori" mRNA 242..248 /note="early-late transcription startpoints" mutation 262..262 /note="G->A kills SELP ATG" mRNA 282..288 /note="early-early transcription start points" misc_feature 390..449 /note="SV40 homology" intron 417..511 /note="SV40 late intron" misc_feature 460..546 /note="SV40 homology" CDS 581..>2002 /note="Müller's HPV16 L2" polyA_signal 2056..>2277 /note="SV40 Late PolyA" misc_feature 2290..>4277 /note="2kb EcoR1-Bgl frag with SAR in it" promoter <4569..>4806 /note="hEF1alpha promoter core" 5'UTR 4814..5080 /note="HTLV-1 R-U5" CDS 5112..6629 /note="Müller's HPV16 L1" polyA_signal <6678..7340 /note="hEF1a polyA signal" rep_origin 7341..8074 /note="MB1 Ori" promoter 8075..8161 /note="EM7 promoter" CDS 8162..8536 /note="ShBle (ZeoR)" terminator 8537..8589 /note="terminator (rpmB/G)" BASE COUNT 2175 A 2330 C 2052 G 2032 T 0 OTHER ORIGIN - 1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA 61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC 121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC 181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC 241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG 301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA 361 CCGGACTCTA GAGGATCCGG TACTCGAGGA ACTGAAAAAC CAGAAAGTTA ACTGGTAAGT 421 TTAGTCTTTT TGTCTTTTAT TTCAGGTCCC GGATCCGGTG GTGGTGCAAA TCAAAGAACT 481 GCTCCTCAGT GGATGTTGCC TTTACTTCTA GGCCTGTACG GAAGTGTTAC TTCTGCTCTA 541 AAAGCTGCGG AATTGTACCC GCGGCCGCTC TAGAGCCACC ATGAGGCACA AGAGGAGCGC 601 CAAGAGGACC AAGAGGGCCA GCGCCACCCA GCTGTACAAG ACCTGCAAGC AGGCCGGCAC 661 CTGCCCCCCC GACATCATCC CCAAGGTGGA GGGCAAGACC ATCGCCGACC AGATCCTGCA 721 GTACGGCAGC ATGGGCGTGT TCTTCGGCGG CCTGGGCATC GGCACCGGCA GCGGCACCGG 781 CGGCAGGACC GGCTACATCC CCCTGGGCAC CAGGCCCCCC ACCGCCACCG ACACCCTGGC 841 CCCCGTGAGG CCCCCCCTGA CCGTGGACCC CGTGGGCCCC AGCGACCCCA GCATCGTGAG 901 CCTGGTGGAG GAGACCAGCT TCATCGACGC CGGCGCCCCC ACCAGCGTGC CCAGCATCCC 961 CCCCGACGTG AGCGGCTTCA GCATCACCAC CAGCACCGAC ACCACCCCCG CCATCCTGGA 1021 CATCAACAAC ACCGTGACCA CCGTGACCAC CCACAACAAC CCCACCTTCA CCGACCCCAG 1081 CGTGCTGCAG CCCCCCACCC CCGCCGAGAC CGGCGGCCAC TTCACCCTGA GCAGCAGCAC 1141 CATCAGCACC CACAACTACG AGGAGATCCC CATGGACACC TTCATCGTGA GCACCAACCC 1201 CAACACCGTG ACCAGCAGCA CCCCCATCCC CGGCAGCAGG CCCGTGGCCA GGCTGGGCCT 1261 GTACAGCAGG ACCACCCAGC AGGTGAAGGT GGTGGACCCC GCCTTCGTGA CCACCCCCAC 1321 CAAGCTGATC ACCTACGACA ACCCCGCCTA CGAGGGCATC GACGTGGACA ACACCCTGTA 1381 CTTCAGCAGC AACGACAACA GCATCAACAT CGCCCCCGAC CCCGACTTCC TGGACATCGT 1441 GGCCCTGCAC AGGCCCGCCC TGACCAGCAG GAGGACCGGC ATCAGGTACA GCAGGATCGG 1501 CAACAAGCAG ACCCTGAGGA CCAGGAGCGG CAAGAGCATC GGCGCCAAGG TGCACTACTA 1561 CTACGACCTG AGCACCATCG ACCCCGCCGA GGAGATCGAG CTGCAGACCA TCACCCCCAG 1621 CACCTACACC ACCACCAGCC ACGCCGCCAG CCCCACCAGC ATCAACAACG GCCTGTACGA 1681 CATCTACGCC GACGACTTCA TCACCGACAC CAGCACCACC CCCGTGCCCA GCGTGCCCAG 1741 CACCAGCCTG AGCGGCTACA TCCCCGCCAA CACCACCATC CCCTTCGGTG GCGCCTACAA 1801 CATCCCCCTG GTGAGCGGCC CCGACATCCC CATCAACATC ACCGACCAGG CCCCCAGCCT 1861 GATCCCCATC GTGCCCGGCA GCCCCCAGTA CACCATCATC GCCGACGCCG GCGACTTCTA 1921 CCTGCACCCC AGCTACTACA TGCTGAGGAA GAGGAGGAAG AGGCTGCCCT ACTTCTTCAG 1981 CGACGTGAGC CTGGCCGCCT GAAAGCTTTT TGAATTCTTT GGATCCACTA GTGTCGACTA 2041 GAGGGCCGCT TCGAGCAGAC ATGATAAGAT ACATTGATGA GTTTGGACAA ACCACAACTA 2101 GAATGCAGTG AAAAAAATGC TTTATTTGTG AAATTTGTGA TGCTATTGCT TTATTTGTAA 2161 CCATTATAAG CTGCAATAAA CAAGTTAACA ACAACAATTG CATTCATTTT ATGTTTCAGG 2221 TTCAGGGGGA GGTGTGGGAG GTTTTTTAAA GCAAGTAAAA CCTCTACAAA TGTGGTAAAA 2281 TCGACTGCAG AATTCTATCA AATATTTAAA GAAAAAAAAA TTGTATCAAC TTTCTACAAT 2341 CTCTTTCAGA AGACAGAAGC AGAGGGAATA CTTCCTAAAT CATTCAACTA GGCCAGCATT 2401 ACCTTAATAC CGGAACTAGA AAATGACATT ACAAGAAAAG AAAACAACAG ACCAATATCT 2461 CTCATGAACA AAGATACAAA CATTTTCAAC AAAATATTAG CAAAAAGAAT CCAAGAATGT 2521 ATCAAAAAAT ATACACCACA ACCAAGTAGA ATTTATTCCA GATATGTAAG GGTGGTTCAA 2581 CGTTTGAAAA TCAATTAACG TAATTTGTCC CATCAACAGG TTAAAGAAGA AAATCACATG 2641 GTCATATTGA TAGACACAGA AAAAGCATTT GACAAAATTT AACACCCATT CATGATGCAA 2701 TCTCTCAGTA AACTAGGAAT AGAGGAAAAC TTCCTCAGCT TGAATGTACC TTCCTCTCAA 2761 TTTTGCTATG AACCTGAAAC TCCTCTTAAA AAATAAAGTT TTTCATTTAA AAAGAAAACA 2821 AAAAACATGG AGGAGCGTTG ATGTATCTCA TTTTAGACCA ATCAGCTATG GATAGTTAGG 2881 CGACAGCACA GATAGCTGCT GTACTTCTGT TTCTGGCAAT GTTCCAGACT ACATTTAAAA 2941 AATTTTTAAT TATAGACTTG TACTTAATGT TCAAGAAAAA TATGAAAATG GCTTTGCCGT 3001 GTTAATGCTA CTCTTTTTTA AAAAAAACTA AAGTTCAAAC TTTATTTATA TTTCATTAGT 3061 TTTTTAGCTA CTGTTCTTTT TCTGTTCTGG GATCTCATTC AGAATGCCAC ATTACATATA 3121 ATTCTCATGT CTCCTTGGGT TCCTCTTAGT TTTGACAGTT CCTCAGACTT TTCTTATTTT 3181 TGATGACCTT GACAGTTTTG AGGAGTACTG GTTAGATATA GGGTAATGGT TTTTAAAGTA 3241 TATTTGTCAT GATTTATACT GGGGTAAGGG TTTGGGGAGG AAGCCCATGG GGTAAAGTAC 3301 TGTTCTCATC ACATCATATC AAGGTTATAT ACCATCAATA TTGCCACAGA TGTTACTTAG 3361 CCTTTTAATA TTTCTCTAAT TTAGTGTATA TGCAATGATA GTTCTCTGAT TTCTGAGATT 3421 GAGTTTCTCA TGTGTAATGA TTATTTAGAG TTTCTCTTTC ATCTGTTCAA ATTTTTGTCT 3481 AGTTTTATTT TTTACTGATT TGTAAGACTT CTTTTTATAA TCTGCATATT ACAATTCTCT 3541 TTACTGGGGT GTTGCAAATA TTTTCTGTCA TTCTATGGCC TGACTTTTCT TAATGGTTTT 3601 TTAATTTTAA AAATAAGTCT TAATATTCAT GCAATCTAAT TAACAATCTT TTCTTTGTGG 3661 TTAGGACTTT GAGTCATAAG AAATTTTTCT CTACACTGAA GTCATGATGG CATGCTTCTA 3721 TATTATTTTC TAAAAGATTT AAAGTTTTGC CTTCTCCATT TAGACTTATA ATTCACTGGA 3781 ATTTTTTTGT GTGTATGGTA TGACATATGG GTTCCCTTTT ATTTTTTACA TATAAATATA 3841 TTTCCCTGTT TTTCTAAAAA AGAAAAAGAT CATCATTTTC CCATTGTAAA ATGCCATATT 3901 TTTTTCATAG GTCACTTACA TATATCAATG GGTCTGTTTC TGAGCTCTAC TCTATTTTAT 3961 CAGCCTCACT GTCTATCCCC ACACATCTCA TGCTTTGCTC TAAATCTTGA TATTTAGTGG 4021 AACATTCTTT CCCATTTTGT TCTACAAGAA TATTTTTGTT ATTGTCTTTT GGGCTTCTAT 4081 ATACATTTTA GAATGAGGTT GGCAAGTTAA CAAACAGCTT TTTTGGGGTG AACATATTGA 4141 CTACAAATTT ATGTGGAAAG AAAGTATACC TTCACAATAT TAAGTCTTTT AGTTCATGAA 4201 TATAGTATGT CTCTCCGTTT CTGCATTAAC TTAGACATTC ATTAATTTCT CTCACAATTT 4261 ATAAGTTTAT TTAGATCCGA GCTCGGTACA GCTCGTCCAT GCCGAGAGTG ATCCCGGCGG 4321 CGGTCACGAA CTCCAGCAGG ACCATGTGAT CGCGCTTCTC GTTGGGGTCT TTGCTCAGGG 4381 CGGACTGGGT GCTCAGGTAG TGGTTGTCGG GCAGCAGCAC GGGGCCGTCG CCGATGGGGG 4441 TGTTCTGCTG GTAGTGGTCG GCGAGCTGCA CGCTGCCGTC CTCGATAAGG ATCCGGGCTG 4501 GCGTAATAGC GAAGAGGCCC GCACCGATCG CCCTTCCCAA CAGTTGCGGT GGAGAAGAGC 4561 ATGCGTGAGG CTCCGGTGCC CGTCAGTGGG CAGAGCGCAC ATCGCCCACA GTCCCCGAGA 4621 AGTTGGGGGG AGGGGTCGGC AATTGAACCG GTGCCTAGAG AAGGTGGCGC GGGGTAAACT 4681 GGGAAAGTGA TGTCGTGTAC TGGCTCCGCC TTTTTCCCGA GGGTGGGGGA GAACCGTATA 4741 TAAGTGCAGT AGTCGCCGTG AACGTTCTTT TTCGCAACGG GTTTGCCGCC AGAACACAGC 4801 TGAAGCTTCG AGGGGCTCGC ATCTCTCCTT CACGCGCCCG CCGCCCTACC TGAGGCCGCC 4861 ATCCACGCCG GTTGAGTCGC GTTCTGCCGC CTCCCGCCTG TGGTGCCTCC TGAACTGCGT 4921 CCGCCGTCTA GGTAAGTTTA AAGCTCAGGT CGAGACCGGG CCTTTGTCCG GCGCTCCCTT 4981 GGAGCCTACC TAGACTCAGC CGGCTCTCCA CGCTTTGCCT GACCCTGCTT GCTCAACTCT 5041 ACGTCTTTGT TTCGTTTTCT GTTCTGCGCC GTTACAGATC CAAGCTGTGA CCGGCCCGCT 5101 CTAGAGCCAC CATGAGCCTG TGGCTGCCCA GCGAGGCCAC CGTGTACCTG CCCCCCGTGC 5161 CCGTGAGCAA GGTGGTGAGC ACCGACGAGT ACGTGGCCAG GACCAACATC TACTACCACG 5221 CCGGCACCAG CAGGCTGCTG GCCGTGGGCC ACCCCTACTT CCCCATCAAG AAGCCCAACA 5281 ACAACAAGAT CCTGGTGCCC AAGGTGAGCG GCCTGCAGTA CAGGGTGTTC AGGATCCACC 5341 TGCCCGACCC CAACAAGTTC GGCTTCCCCG ACACCAGCTT CTACAACCCC GACACCCAGA 5401 GGCTGGTGTG GGCCTGCGTG GGCGTGGAGG TGGGCAGGGG CCAGCCCCTG GGCGTGGGCA 5461 TCAGCGGCCA CCCCCTGCTG AACAAGCTGG ACGACACCGA GAACGCCAGC GCCTACGCCG 5521 CCAACGCCGG CGTGGACAAC AGGGAGTGCA TCAGCATGGA CTACAAGCAG ACCCAGCTGT 5581 GCCTGATCGG CTGCAAGCCC CCCATCGGCG AGCACTGGGG CAAGGGCAGC CCCTGCACCA 5641 ACGTGGCCGT GAACCCCGGC GACTGCCCCC CCCTGGAGCT GATCAACACC GTGATCCAGG 5701 ACGGCGACAT GGTGGACACC GGCTTCGGCG CCATGGACTT CACCACCCTG CAGGCCAACA 5761 AGAGCGAGGT GCCCCTGGAC ATCTGCACCA GCATCTGCAA GTACCCCGAC TACATCAAGA 5821 TGGTGAGCGA GCCCTACGGC GACAGCCTGT TCTTCTACCT GAGGAGGGAG CAGATGTTCG 5881 TGAGGCACCT GTTCAACAGG GCCGGCGCCG TGGGCGAGAA CGTGCCCGAC GACCTGTACA 5941 TCAAGGGCAG CGGCAGCACC GCCAACCTGG CCAGCAGCAA CTACTTCCCC ACCCCCAGCG 6001 GCAGCATGGT GACCAGCGAC GCCCAGATCT TCAACAAGCC CTACTGGCTG CAGAGGGCCC 6061 AGGGCCACAA CAACGGCATC TGCTGGGGCA ACCAGCTGTT CGTGACCGTG GTGGACACCA 6121 CCAGGAGCAC CAACATGAGC CTGTGCGCCG CCATCAGCAC CAGCGAGACC ACCTACAAGA 6181 ACACCAACTT CAAGGAGTAC CTGAGGCACG GCGAGGAGTA CGACCTGCAG TTCATCTTCC 6241 AGCTGTGCAA GATCACCCTG ACCGCCGACG TGATGACCTA CATCCACAGC ATGAACAGCA 6301 CCATCCTGGA GGACTGGAAC TTCGGCCTGC AGCCCCCCCC CGGCGGCACC CTGGAGGACA 6361 CCTACAGGTT CGTGACCAGC CAGGCCATCG CCTGCCAGAA GCACACCCCC CCCGCCCCCA 6421 AGGAGGACCC CCTGAAGAAG TACACCTTCT GGGAGGTGAA CCTGAAGGAG AAGTTCAGCG 6481 CCGACCTGGA CCAGTTCCCC CTGGGCAGGA AGTTCCTGCT GCAGGCCGGC CTGAAGGCCA 6541 AGCCCAAGTT CACCCTGGGC AAGAGGAAGG CCACCCCCAC CACCAGCAGC ACCAGCACCA 6601 CCGCCAAGAG GAAGAAGAGG AAGCTGTGAA AGCTACCCAC GGCCGAATAG CCGTGAGCCG 6661 GAATCCTGCA CGCTAGCATT ATCCCTAATA CCTGCCACCC CACTCTTAAT CAGTGGTGGA 6721 AGAACGGTCT CAGAACTGTT TGTTTCAATT GGCCATTTAA GTTTAGTAGT AAAAGACTGG 6781 TTAATGATAA CAATGCATCG TAAAACCTTC AGAAGGAAAG GAGAATGTTT TGTGGACCAC 6841 TTTGGTTTTC TTTTTTGCGT GTGGCAGTTT TAAGTTATTA GTTTTTAAAA TCAGTACTTT 6901 TTAATGGAAA CAACTTGACC AAAAATTTGT CACAGAATTT TGAGACCCAT TAAAAAAGTT 6961 AAATGAGAAA CCTGTGTGTT CCTTTGGTCA ACACCGAGAC ATTTAGGTGA AAGACATCTA 7021 ATTCTGGTTT TACGAATCTG GAAACTTCTT GAAAATGTAA TTCTTGAGTT AACACTTCTG 7081 GGTGGAGAAT AGGGTTGTTT TCCCCCCACA TAATTGGAAG GGGAAGGAAT ATCATTTAAA 7141 GCTATGGGAG GGTTTCTTTG ATTACAACAC TGGAGAGAAA TGCAGCATGT TGCTGATTGC 7201 CTGTCACTAA AACAGGCCAA AAACTGAGTC CTTGGGTTGC ATAGAAAGCT TCATGTTGCT 7261 AAACCAATGT TAAGTGAATC TTTGGAAACA AAATGTTTCC AAATTACTGG GATGTGCATG 7321 TTGAAACGTG GGTTAATTAA CTAGCCATGA CCAAAATCCC TTAACGTGAG TTTTCGTTCC 7381 ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC 7441 GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG 7501 ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG CAGATACCAA 7561 ATACTGTTCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC 7621 CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT 7681 GTCTTACCGG GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTGAA 7741 CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC 7801 TACAGCGTGA GCTATGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG GACAGGTATC 7861 CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG GGAAACGCCT 7921 GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT 7981 GCTCGTCAGG GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC 8041 TGGCCTTTTG CTGGCCTTTT GCTCACATGT TCTTAATTAA ATTTTTCAAA AGTAGTTGAC 8101 AATTAATCAT CGGCATAGTA TATCGGCATA GTATAATACG ACTCACTATA GGAGGGCCAT 8161 CATGGCCAAG TTGACCAGTG CTGTCCCAGT GCTCACAGCC AGGGATGTGG CTGGAGCTGT 8221 TGAGTTCTGG ACTGACAGGT TGGGGTTCTC CAGAGATTTT GTGGAGGATG ACTTTGCAGG 8281 TGTGGTCAGA GATGATGTCA CCCTGTTCAT CTCAGCAGTC CAGGACCAGG TGGTGCCTGA 8341 CAACACCCTG GCTTGGGTGT GGGTGAGAGG ACTGGATGAG CTGTATGCTG AGTGGAGTGA 8401 GGTGGTCTCC ACCAACTTCA GGGATGCCAG TGGCCCTGCC ATGACAGAGA TTGGAGAGCA 8461 GCCCTGGGGG AGAGAGTTTG CCCTGAGAGA CCCAGCAGGC AACTGTGTGC ACTTTGTGGC 8521 AGAGGAGCAG GACTGAGGAT AAGAATTGTA ACAAAAAACC CCGCCCCGGC GGGGTTTTTT 8581 GTTAATTAA //