LOCUS       pMCV-L       8683 bp    ss-DNA     circular  SYN       11-Nov-1999
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     iDNA            complement(123..796)
                     /note=ColE1 Ori
     promoter        1044..1093
                     /note=KanR's promoter
     CDS             1182..1976
                     /note=KanR
     misc_feature    complement(1991..2403)
                     /note=F1 ori
     CDS             complement(2428..2730)
                     /note=ccdB ORF
     CDS             complement(2731..2739)
                     /note=fusion joint
     CDS             complement(2740..>2967)
                     /note=LacZ ORF [Split]
     primer          complement(2891..2910)
                     /note=T7 promoter
     source          2968..8353
                     /note=Merkel cell polyomavirus isolate R17a, accession# HM011555
     misc_recomb     2968..2973
                     /note=EcoR1
     CDS             join(complement(<2968..4939),complement(5371..5604))
                     /note=Large T antigen [Split]
     CDS             join(complement(<2968..3022),complement(4178..4939),
                     complement(5371..5604))
                     /note=57kT (Shuda09) [Split]
     intron          complement(3023..4177)
                     /note=57kT intron
     C_region        complement(4259..4276)
                     /note=conserved among PyVs
     promoter        4321..4330
                     /note=potential NFKB binding site
     misc_signal     complement(4334..4345)
                     /note=NLS (Nakamura10)
     promoter        4472..4481
                     /note=potential NFKB binding site (GGGRNNYYCC)
     misc_feature    complement(4526..4543)
                     /note=Rb binding (DLXCXE) CR2 (Pipas92)
     misc_RNA        4564..4585
                     /note=microRNA mir-M1 (Seo09)
     promoter        4785..4794
                     /note=potential NFKB binding site
     misc_feature    complement(4787..4828)
                     /note=CM2B4 mAb epitope (Shuda09)
     intron          complement(4940..5024)
                     /note=sta intron (Shuda09)
     intron          complement(4940..5370)
                     /note=LTA intron (Shuda09)
     CDS             complement(5044..5604)
                     /note=small t antigen
     misc_feature    complement(5149..5166)
                     /note=PP2A binding motif (CXCXXC)(Sariyer09)
     misc_feature    complement(5233..5250)
                     /note=PP2A binding motif (CXCXXC)(Sariyer09)
     misc_feature    complement(5464..5481)
                     /note=DnaJ (HPDKGG)(Whalen05)
     misc_feature    complement(5548..5580)
                     /note=CR1 (EXXXLXXLXXI)(Pipas92)
     promoter        complement(5690..5697)
                     /note=JCV repeat (GGGNGGRR)(Martin85)
     rep_origin      5738..5791
                     /note=T Ag binding sites (GAGGC)(Kwun09)
     promoter        5951..5958
                     /note=JCV repeat (GGGNGGRR)(Martin85)
     promoter        6022..6029
                     /note=JCV repeat (GGGNGGRR)(Martin85)
     CDS             6068..6793
                     /note=VP2
     CDS             6759..8030
                     /note=VP1
     CDS             complement(8106..>8359)
                     /note=Large T antigen [Split]
     CDS             complement(8106..>8359)
                     /note=57kT (Shuda09) [Split]
     misc_recomb     8354..8359
                     /note=EcoR1
     CDS             complement(<8360..8467)
                     /note=LacZ ORF [Split]
     primer          complement(8428..8445)
                     /note=Sp6 primer
     primer          complement(8463..8479)
                     /note=M13 reverse primer
     promoter        complement(8468..8589)
                     /note=Lac promoter
BASE COUNT     2332 A   1911 C   2066 G   2374 T      0 OTHER
ORIGIN      ?
        1 CTTCCGCTTC CTCGCTCACT GACTCGCTGC GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT
       61 CAGCTCACTC AAAGGCGGTA ATACGGTTAT CCACAGAATC AGGGGATAAC GCAGGAAAGA
      121 ACATGTGAGC AAAAGGCCAG CAAAAGCCCA GGAACCGTAA AAAGGCCGCG TTGCTGGCGT
      181 TTTTCCATAG GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT
      241 GGCGAAACCC GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC
      301 GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC CCTTCGGGAA
      361 GCGTGGCGCT TTCTCATAGC TCACGCTGTA GGTATCTCAG TTCGGTGTAG GTCGTTCGCT
      421 CCAAGCTGGG CTGTGTGCAC GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA
      481 ACTATCGTCT TGAGTCCAAC CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG
      541 GTAACAGGAT TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC
      601 CTAACTACGG CTACACTAGA AGGACAGTAT TTGGTATCTG CGCTCTGCTG AAGCCAGTTA
      661 CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT CCGGCAAACA AACCACCGCT GGTAGCGGTG
      721 GTTTTTTTGT TTGCAAGCAG CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT
      781 TGATCTTTTC TACGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG
      841 TCATGAGATT ATCAAAAAGG ATCTTCACCT AGATCCTTTT CACGTAGAAA GCCAGTCCGC
      901 AGAAACGGTG CTGACCCCGG ATGAATGTCA GCTACTGGGC TATCTGGACA AGGGAAAACG
      961 CAAGCGCAAA GAGAAAGCAG GTAGCTTGCA GTGGGCTTAC ATGGCGATAG CTAGACTGGG
     1021 CGGTTTTATG GACAGCAAGC GAACCGGAAT TGCCAGCTGG GGCGCCCTCT GGTAAGGTTG
     1081 GGAAGCCCTG CAAAGTAAAC TGGATGGCTT TCTCGCCGCC AAGGATCTGA TGGCGCAGGG
     1141 GATCAAGATC TGATCAAGAG ACAGGATGAG GATCGTTTCG CATGATTGAA CAAGATGGAT
     1201 TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC
     1261 AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC
     1321 TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAAGACGAG GCAGCGCGGC
     1381 TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG
     1441 CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC
     1501 TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG
     1561 ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC
     1621 GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC
     1681 CAGCCGAACT GTTCGCCAGG CTCAAGGCGA GCATGCCCGA CGGCGAGGAT CTCGTCGTGA
     1741 CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA
     1801 TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG
     1861 ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG
     1921 CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGAATTT
     1981 TGTTAAAATT TTTGTTAAAT CAGCTCATTT TTTAACCAAT AGGCCGAAAT CGGCAACATC
     2041 CCTTATAAAT CAAAAGAATA GACCGCGATA GGGTTGAGTG TTGTTCCAGT TTGGAACAAG
     2101 AGTCCACTAT TAAAGAACGT GGACTCCAAC GTCAAAGGGC GAAAAACCGT CTATCAGGGC
     2161 GATGGCCCAC TACGTGAACC ATCACCCAAA TCAAGTTTTT TGCGGTCGAG GTGCCGTAAA
     2221 GCTCTAAATC GGAACCCTAA AGGGAGCCCC CGATTTAGAG CTTGACGGGG AAAGCCGGCG
     2281 AACGTGGCGA GAAAGGAAGG GAAGAAAGCG AAAGGAGCGG GCGCTAGGGC GCTGGCAAGT
     2341 GTAGCGGTCA CGCTGCGCGT AACCACCACA CCCGCGCGCT TAATGCGCCG CTACAGGGCG
     2401 CGTCCATTCG CCATTCAGGC CTGACATTTA TATTCCCCAG AACATCAGGT TAATGGCGTT
     2461 TTTGATGTCA TTTTCGCGGT GGCTGAGATC AGCCACTTCT TCCCCGATAA CGGAGACCGG
     2521 CACACTGGCC ATATCGGTGG TCATCATGCG CCAGCTTTCA TCCCCGATAT GCACCACCGG
     2581 GTAAAGTTCA CGGGAGACTT TATCTGACAG CAGACGTGCA CTGGCCAGGG GGATCACCAT
     2641 CCGTCGCCCC GGCGTGTCAA TAATATCACT CTGTACATCC ACAAACAGAC GATAACGGCT
     2701 CTCTCTTTTA TAGGTGTAAA CCTTAAACTG CCGTACGTAT AGGCTGCGCA ACTGTTGGGA
     2761 AGGGCGATCG GTGCGGGCCT CTTCGCTATT ACGCCAGCTG GCGAAAGGGG GATGTGCTGC
     2821 AAGGCGATTA AGTTGGGTAA CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC
     2881 CAGTGAATTG TAATACGACT CACTATAGGG CGAATTGGGC CCTCTAGATG CATGCTCGAG
     2941 CGGCCGCCAG TGTGATGGAT ATCTGCAGAA TTCTTCTTTT TCTTATTTCC ATGTTCTGAT
     3001 CCAGGGAATC TCTTAGATTT GCCTTTGGGG AAAAGTGTAA AGTATAACTA AATCTTGCTA
     3061 TTAATGTTTT GGGAATAAAA TAATCATTAG CAGTAACAAT ACAAGGAGGA AAAATCTGAT
     3121 GCTTTTTATT CACATGCTTC TTCTCTAAGC TTACAGCTAC AGCACCATCT AGATGATCTC
     3181 TTAAGTTATC AAGGTTATTT ATTCCTTGCC CTGGTTGCAG ATCTTTATTT AGGCTATTTT
     3241 GCCCTTTCAC ATCCTCAAAA ACAACCATAA ATTTATCCAA AGCACATCCT AGTTCAAAAG
     3301 GCAGTTTATC AGATGGACAG TTTATATTCA AGGCCTTCCC TTCTAGCAAA TCTATTAAGG
     3361 CTGCAGCAAA GCTTGTTTTT CCACTGTTAA TAGGCCCTTT AAACCAAATG TTTCTATACT
     3421 TAGGTATATT CTCTGTTAAT AATTGAATAA TTTTCTGCAG CTTCTTTTCA AACTCTTCAA
     3481 ATAAGCAGCA GTACCAGGCC ACACCACCCA TATAATACAG TAGATCTATT GTATCTAAAT
     3541 CTCTTAATCT CTCTAGGTGC TTCTTAAACT TCTTACATAG CATTTCTGTC CTGGTCATTT
     3601 CCAGCATCTC TAACCTCCTT TTGGCTAGAA CAGTGTCTGC GGCTTGTTGG CAAATGGTTT
     3661 TCTGAGATTT AGATTCATAA AATAGCTTAG CATTAGAATG ATGAGCCTCA TGAGCCTTGT
     3721 GAGGTTTGAG GCGAGATCTG TTTTCACACT TTTGGCAAGG AAATGGTTTT GCAAAGTCTA
     3781 GATAATGGGC TAAGATAATA AAGTGGTCGT CTAGCTCATA TTCACAAGCA AATTCAGCAA
     3841 CTAAATTCCA ATTACAGCTG GCCTCTTTTT CTTTTTCTTG AAATTCATAA TTGAGCAGTG
     3901 GCTTATTCTC TTGCAGTAAT TTGTAAGGGG GCTTGCATAA ATTATTATAC ATTTCAGGCA
     3961 TCTTATTCAC TCCTTTACAA ATTAAAAAGC TTATAGTGCA GAAGGTAGAG CAAAAATTCT
     4021 TAATAGCAGA TACTCTATGC TTTGATAAAG TTATAAACAA TAAAATACAT CCTAATTCAC
     4081 AGGCATGCCT GCTTTTAAAA TCAACTTTAA ATTTCTCAAT CTTATCATAT AACTCTATAG
     4141 CTTTATCAGA AGTAGTATAA ATGGCAAAAC AACTTACTGT TTTATTACTA TATACAGCAT
     4201 GGCTAAGATA ATCAGAAAGA TCAATAGGAA AATCAGTAGG AACAGGAGTT TCTCTGTTCT
     4261 TTTTTGGCTT TGGTGGAGTG CTTGTAAAAC TTGCTGAACT AGCAGAGCTT GCAGAGCTTC
     4321 GGGACCCCCC AAATTTTCGC TTTCTTGAGA ATGGAGGAGG GGTCTTCGGG GTGGTGAAGG
     4381 AGGAGGATCT GTATTCCTCA TCTGTAAACT GAGATGACGA GGCCTCCTCG GCAGAGGAAG
     4441 ACGGGGGCTG CCGGGGCGAG CTTCTTGAGG AGGGGGGCTC CTCAGGCTCC TCAGAGGACG
     4501 AGGGAGGCTC AGGGGAGGAA AGTGATTCAT CGCAGAAGAG ATCCTCCCAG GTGCCATCCG
     4561 TTCTGGAAGA ATTTCTAGGT ACACTGGTTC CATTGGGTGT GCTGGATTCT CTTCCTGAAT
     4621 TGGTGGTCTC CTCTCTGCTA CTGGATCCAG AGGATGAGGT GGGTTCCTCA TGGTGTTCGG
     4681 GAGGTATATC GGGTCCTCTG GACTGGGAGT CTGAAGCCTG GGACGCTGAG AAGGACCCAT
     4741 ACCCAGAGGA AGAGCTCTGG CTGTGGGGTG GTGAGCTTCC ACTGGGGGCT CCCCTGGATG
     4801 CATTGGAGGA AGGCTTTCTG GATCTTGAGT TGGTCCCGTG TGGATTGGGC CCATATTCGT
     4861 ATGCCTTCCC GAAGCTGAAT CCTCCTGATC TCCACCATTC TTTGAATTTA GTGGTCCCAT
     4921 ATATAGGGGC CTCGTCAACC TAGATGGGAA AGTACAGAAA ATCTGTCATA AATAACCTTT
     4981 CTTTGATATT TTGCCTTATA GACTTTTCCA TATCTAATAC TTACAGAGGA AGGAAGTAGG
     5041 AGTCTAGAAA AGGTGCAGAT GCAGTAAGCA GTAGTCAGTT TCTTCTAAAG TTTTTTGCCA
     5101 CCAGTCAAAA CTTTCCCAAG TAGGAGGAAA TCCAAACCAA AGAATAAAGC ACTGATAGCA
     5161 AAAACACTCT CCCCACGTCA GACAGTTTTT TTGCTTTAAA GTTTTTAGAC TACAATGCTG
     5221 GCGAGACAAC TTACAGCTAA TACAAGCGCA CTTAGAATCT CTAAGTTGCT TAAGCATGCA
     5281 CCCAGGACCT CTGCAAAATC TAGCATTATA TCCACTTTGC ATATAATCCT TTAAAGTTCC
     5341 ATATTCTTCC CAAGGAAATT TTGTACTGAC CTCATCAAAC ATAGAGAAGT CACTTCTGAG
     5401 CTTGTGGATA TTTTGCTGGA ATTTGCTCCA AAGGGTGTTC AATTCCATCA TTATAACAGG
     5461 ATTTCCCCCT TTATCAGGGT GATGCTTTAA GCAGCTTCTT TTGAAAGCAG CTTTCATCAG
     5521 AGGGATGTTG CCATAACAAT TAGGAGCAAT CTCTAAAAGC TTGCAGAGAG CCTCTCTTTC
     5581 TTTCCTATTT AGGACTAAAT CCATCTTGTC TATATGCAGA AGGAGTTTGC AGAAAGAGCA
     5641 GAGGAGCAAA TGAGCTACCT CACTAAGGAG TGGTTTTTAT ACTGCAGTTT CCCGCCCTTG
     5701 GGATCTGCCC TTAGATACTG CCTTTTTTGC TAATTAAGCC TCTTAAGCCT CAGAGGCCTC
     5761 TCTCTTTTTT TCCAGAGGCC TCGGAGGCTA GGAGCCCCAA GCCTCTGCCA ACTTGAAAAA
     5821 AAAAAGTCAC CTAGGCAGCC AAGTTGTGGT TACATGATTG AACTTTTATT GCTGCAGGGT
     5881 TTCTGGCATT GACTCATTTC CTGGAGAGGC GGAGTTTGAC TGATAAACAA AACTTTTTTT
     5941 CTTTCTGTTT GGGAGGGAGA CGGAAGACTC TTAACTTTTT TTCAACAAGG GAGGCCCGGA
     6001 GGCTTTTTTT TCTCTTACAA AGGGAGGAGG ACATTAAAAG AGTAAGTATC CTTATTTATT
     6061 TTTCAGGATG GGGGGCATCA TCACACTGCT GGCCAATATT GGTGAAATTG CTACTGAACT
     6121 AAGTGCCACC ACAGGAGTAA CTTTGGAAGC TATTCTTACA GGAGAAGCTT TAGCAGCTTT
     6181 GGAAGCAGAG ATCTCCAGTT TAATGACAAT TGAGGGTATT TCTGGCATTG AGGCTTTAGC
     6241 TCAACTTGGG TTCACAGCTG AACAGTTTTC AAATTTCTCA TTAGTGGCTT CTTTGGTTAA
     6301 CCAAGGTTTA ACTTATGGCT TCATTCTCCA AACTGTTAGT GGTATAGGCT CTCTAATAAC
     6361 TGTGGGGGTG AGGTTGTCAC GCGAGCAAGT GTCACTTGTA AAGAGGGATG TTTCGTGGGT
     6421 AGGTAGTAAT GAGGTTTTGA GGCATGCACT TATGGCCTTT AGCCTAGATC CTCTGCAGTG
     6481 GGAAAATAGC TTGCTGCATT CTGTGGGGCA AGATATTTTT AATTCTTTAT CTCCTACCTC
     6541 TAGGCTGCAG ATACAATCAA ACCTAGTGAA TCTGATACTA AATAGCCGGT GGGTCTTTCA
     6601 GACAACTGCT TCTCAGAATC AGGGCCTTTT ATCAGGAGAG GCTATATTAA TTCCTGAACA
     6661 TATAGGAGGA ACTCTGCAGC AGCAAACTCC AGATTGGCTT CTTCCTCTGG TACTAGGCCT
     6721 TAGTGGATAT ATTTCTCCTG AATTACAAGT AATTGAAGAT GGCACCAAAA AGAAAAGCAT
     6781 CATCCACCTG TAAAACACCC AAAAGGCAAT GTATACCTAA GCCGGGATGC TGCCCTAATG
     6841 TTGCCTCAGT TCCAAAACTG CTTGTTAAAG GAGGAGTGGA AGTATTATCT GTGGTTACTG
     6901 GAGAAGATAG CATTACCCAA ATTGAGTTGT ATTTGAATCC AAGAATGGGA GTTAATTCCC
     6961 CTGATCTTCC TACTACTTCA AACTGGTATA CTTATACTTA TGACCTGCAG CCAAAGGGAT
     7021 CATCTCCAGA TCAGCCCATC AAGGAAAATT TGCCAGCTTA CAGTGTGGCA AGAGTGTCTC
     7081 TGCCAATGCT AAATGAGGAT ATTACCTGTG ACACATTGCA GATGTGGGAG GCAATATCTG
     7141 TTAAAACAGA AGTAGTTGGA ATAAGTTCTT TAATTAATGT TCATTATTGG GACATGAAAA
     7201 GAGTTCATGA TTATGGTGCT GGTATTCCTG TGTCAGGGGT AAATTACCAT ATGTTTGCCA
     7261 TTGGGGGAGA ACCTCTGGAT TTGCAAGGCC TAGTTTTAGA TTACCAGACT GAGTATCCAA
     7321 AAACTACAAA TGGTGGGCCT ATTACAATTG AAACTGTATT GGGAAGAAAA ATGACACCTA
     7381 AAAATCAGGG CCTAGATCCA CAAGCTAAAG CAAAATTAGA TAAAGATGGA AATTATCCTA
     7441 TAGAAGTATG GTGTCCTGAT CCTTCTAAAA ATGAAAACAG TAGATACTAT GGGTCTATTC
     7501 AGACAGGCTC TCAGACTCCT ACAGTTCTTC AATTTAGTAA TACTCTAACT ACTGTCCTTT
     7561 TAGATGAGAA TGGAGTGGGC CCTCTATGCA AAGGAGATGG CCTATTTATT AGCTGTGCAG
     7621 ACATAGTGGG GTTTCTGTTT AAAACCAGTG GAAAAATGGC TCTTCATGGG TTGCCTAGAT
     7681 ATTTTAATGT TACTTTGAGA AAAAGATGGG TGAAAAACCC CTACCCAGTA GTTAATTTAA
     7741 TAAACTCACT CTTCAGCAAC TTAATGCCAA AAGTGTCAGG CCAACCTATG GAAGGAAAAG
     7801 ATAATCAGGT AGAAGAGGTT AGAATATATG AGGGGTCAGA ACAATTACCT GGTGATCCTG
     7861 ATATTGTCAG ATTTTTAGAT AAATTTGGGC AGGAGAAAAC TGTTTACCCA AAGCCCTCTG
     7921 TTGCCCCAGC AGCAGTAACA TTCCAAAGTA ATCAGCAGGA TAAGGGCAAG GCGCCACTGA
     7981 AAGGACCTCA AAAGGCCTCT CAAAAAGAAA GCCAAACACA AGAATTATGA GAATTATTTC
     8041 ATGCATTCCT ATTCAGTTAA GTAGGCCCCA GAAAAACAAA CACAGGAAAT ATGAAGCAGA
     8101 TGCCTTTATT GAGAAAAAGT ACCAGAATCT TGGGTTTCTT CAGTTTCCTC AGGGCCCTCT
     8161 TCCTCAATAA GAATATTGAG CAGAGGGTCC TGACCAGCTT CTACATTTTC TATCATTTGA
     8221 CAAAATTTAC CATATGATAT TTCACTCTGT AAAATTTGCT TCCAGTTTTT AATTTCTTCT
     8281 TGTAAGCAAG GCTTAAAGGT TGTATCAGGC AAGCACCAAA TAAGACAAAG CAATAAAGTG
     8341 GTTCCACTTT GAAGAATTCC AGCACACTGG CGGCCGTTAC TAGTGGATCC GAGCTCGGTA
     8401 CCAAGCTTGA TGCATAGCTT GAGTATTCTA TAGTGTCACC TAAATAGCTT GGCGTAATCA
     8461 TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA CAACATACGA
     8521 GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG TGAGCTAACT CACATTAATT
     8581 GCGTTGCGCT CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT GCATTAATGA
     8641 ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCT
//