LOCUS PDTH.TXT 7834 BP DS-DNA CIRCULAR SYN 18-MAR-1998 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers promoter 232..>892 /note="CMV promoter" CAAT_signal 771..775 /note="CAAT box" TATA_signal 804..810 /note="TATA box" misc_RNA 833..833 /note="transcription start" intron 954..1086 /note="intron" promoter 1131..1149 /note="T7 promoter" CDS 1191..3317 /note="de-fanged T antigen (Cooper PNAS 94:6450)" mRNA 3381..3967 /note="EMCV IRES" CDS 3980..5014 /note="Hygro R" polyA_signal 5303..5579 /note="BHG pA" CDS 6838..7695 /note="Amp R" BASE COUNT 2057 A 1788 C 1983 G 2006 T 0 OTHER ORIGIN - 1 GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 61 CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 121 CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 181 TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 241 GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 301 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 361 CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC 421 ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 481 ATCATATGCC AAGTCCGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT 541 ATGCCCAGTA CATGACCTTA CGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 601 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACACCAA TGGGCGTGGA TAGCGGTTTG 661 ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 721 AAAATCAACG GGACTTTCCA AAATGTCGTA ATAACCCCGC CCCGTTGACG CAAATGGGCG 781 GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCG TTTAGTGAAC CGTCAGATCA 841 CTAGAAGCTT TATTGCGGTA GTTTATCACA GTTAAATTGC TAACGCAGTC AGTGCTTCTG 901 ACACAACAGT CTCGAACTTA AGCTGCAGAA GTTGGTCGTG AGGCACTGGG CAGGTAAGTA 961 TCAAGGTTAC AAGACAGGTT TAAGGAGACC AATAGAAACT GGGCTTGTCG AGACAGAGAA 1021 GACTCTTGCG TTTCTGATAG GCACCTATTG GTCTTACTGA CATCCACTTT GCCTTTCTCT 1081 CCACAGGTGT CCACTCCCAG TTCAATTACA GCTCTTAAGG CTAGAGTACT TAATACGACT 1141 CACTATAGGC TAGACTAGGA TCCCGCGACC GGTCGACCAG CTTTGCAAAG ATGGATAAAG 1201 TTTTAAACAG AGAGGAATCT TTGCAGCTAA TGGACCTTCT AGGTCTTGAA AGGAGTGCCT 1261 GGGGGAATAT TCCTCTGATG AGAAAGGCAT ATTTAAAAAA ATGCAAAGAG TTTCATCCTG 1321 ATAAAGGAGG AGATGAAGAA AAAATGAAGA AAATGAATAC TCTGTACAAG AAAATGGAAG 1381 ATGGAGTAAA ATATGCTCAT CAACCTGACT TTGGAGGCTT CTGGGATGCA ACTGAGATTC 1441 CAACCTATGG AACTGATGAA TGGGAGCAGT GGTGGAATGC CTTTAATGAG GAAAACCTGT 1501 TTTGCTCAAA AGAAATGCCA TCTAGTGATG ATGAGGCTAC TGCTGACTCT CAACATTCTA 1561 CTCCTCCAAA AAAGAAGAGA AAGGTAGAAG ACCCCAAGGA CTTTCCTTCA GAATTGCTAA 1621 GTTTTTTGAG TCATGCTGTG TTTAGTAATA GAACTCTTGC TTGCTTTGCT ATTTACACCA 1681 CAAAGGAAAA AGCTGCACTG CTATACAAGA AAATTATGGA AAAATATTCT GTAACCTTTA 1741 TAAGTAGGCA TAACAGTTAT AATCATAACA TACTGTTTTT TCTTACTCCA CACAGGCATA 1801 GAGTGTCTGC TATTAATAAC TATGCTCAAA AATTGTGTAC CTTTAGCTTT TTAATTTGTA 1861 AAGGGGTTAA TAAGGAATAT TTGATGTATA GTGCCTTGAC TAGAGATCCA TTTTCTGTTA 1921 TTGAGGAAAG TTTGCCAGGT GGGTTAAAGG AGCATGATTT TAATCCAGAA GAAGCAGAGG 1981 AAACTAAACA AGTGTCCTGG AAGCTTGTAA CAGAGTATGC AATGGAAACA AAATGTGATG 2041 ATGTGTTGTT ATTGCTTGGG ATGTACTTGG AATTTCAGTA CAGTTTTGAA ATGTGTTTAA 2101 AATGTATTAA AAAAGAACAG CCCAGCCACT ATAAGTACCA TGAAAAGCAT TATGCAAATG 2161 CTGCTATATT TGCTGACAGC AAAAACCAAA AAACCATATG CCAACAGGCT GTTGATACTG 2221 TTTTAGCTAA AAAGCGGGTT GATAGCCTAC AATTAACTAG AGAACAAATG TTAACAAACA 2281 GATTTAATGA TCTTTTGGAT AGGATGGATA TAATGTTTGG TTCTACAGGC TCTGCTGACA 2341 TAGAAGAATG GATGGCTGGA GTTGCTTGGC TACACTGTTT GTTGCCCAAA ATGGAATCAG 2401 TGGTGTATGA CTTTTTAAAA TGCATGGTGT ACAACATTCC TAAAAAAAGA TACTGGCTGT 2461 TTAAAGGACC AATTGATAGT GGTAAAACTA CATTAGCAGC TGCTTTGCTT GAATTATGTG 2521 GGGGGAAAGC TTTAAATGTT AATTTGCCCT TGGACAGGCT GAACTTTGAG CTAGGAGTAG 2581 CTATTGACCA GTTTTTAGTA GTTTTTGAGG ATGTAAAGGG CACTGGAGGG GAGTCCAGAG 2641 ATTTGCCTTC AGGTCAGGGA ATTAATAACC TGGACAATTT AAGGGATTAT TTGGATGGCA 2701 GTGTTAAGGT AAACTTAGAA AAGAAACACC TAAATAAAAG AACTCAAATA TTTCCCCCTG 2761 GAATAGTCAC CATGAATGAG TACAGTGTGC CTAAAACACT GCAGGCCAGA TTTGTAAAAC 2821 AAATAGATTT TAGGCCCAAA GATTATTTAA AGCATTGCCT GGAACGCAGT GAGTTTTTGT 2881 TAGAAAAGAG AATAATTCAA AGTGGCATTG CTTTGCTTCT TATGTTAATT TGGTACAGAC 2941 CTGTGGCTGA GTTTGCTCAA AGTATTCAGA GCAGAATTGT GGAGTGGAAA GAGAGATTGG 3001 ACAAAGAGTT TAGTTTGTCA GTGTATCAAA AAATGAAGTT TAATGTGGCT ATGGGAATTG 3061 GAGTTTTAGA TTGGCTAAGA AACAGTGATG ATGATGATGA AGACAGCCAG GAAAATGCTG 3121 ATAAAAATGA AGATGGTGGG GAGAAGAACA TGGAAGACTC AGGGCATGAA ACAGGCATTG 3181 ATTCACAGTC CCAAGGCTCA TTTCAGGCCC CTCAGTCCTC ACAGTCTGTT CATGATCATA 3241 ATCAGCCATA CCACATTTGT AGAGGTTTTA CTTGCTTTAA AAAACCTCCC ACACCTCCCC 3301 CTGAACCTGA AACATAAAAT GAATGCAATT GTTGTTGTTA ACGGGGATCC TAGTCTAGAG 3361 GGCCGCACTA GAGGAATTCC GCCCCTCTCC CTCCCCCCCC CCTAACGTTA CTGGCCGAAG 3421 CCGCTTGGAA TAAGGCCGGT GTGTGTTTGT CTATATGTGA TTTTCCACCA TATTGCCGTC 3481 TTTTGGCAAT GTGAGGGCCC GGAAACCTGG CCCTGTCTTC TTGACGAGCA TTCCTAGGGG 3541 TCTTTCCCCT CTCGCCAAAG GAATGCAAGG TCTGTTGAAT GTCGTGAAGG AAGCAGTTCC 3601 TCTGGAAGCT TCTTGAAGAC AAACAACGTC TGTAGCGACC CTTTGCAGGC AGCGGAACCC 3661 CCCACCTGGC GACAGGTGCC TCTGCGGCCA AAAGCCACGT GTATAAGATA CACCTGCAAA 3721 GGCGGCACAA CCCCAGTGCC ACGTTGTGAG TTGGATAGTT GTGGAAAGAG TCAAATGGCT 3781 CTCCTCAAGC GTAGTCAACA AGGGGCTGAA GGATGCCCAG AAGGTACCCC ATTGTATGGG 3841 AATCTGATCT GGGGCCTCGG TGCACATGCT TTACATGTGT TTAGTCGAGG TTAAAAAAAC 3901 GTCTAGGCCC CCCGAACCAC GGGGACGTGG TTTTCCTTTG AAAAACACGA TGATAAGCTT 3961 GCCACAACCC GTACCAAAGA TGGATAGATC CGGAAAGCCT GAACTCACCG CGACGTCTGT 4021 CGAGAAGTTT CTGATCGAAA AGTTCGACAG CGTCTCCGAC CTGATGCAGC TCTCGGAGGG 4081 CGAAGAATCT CGTGCTTTCA GCTTCGATGT AGGAGGGCGT GGATATGTCC TGCGGGTAAA 4141 TAGCTGCGCC GATGGTTTCT ACAAAGATCG TTATGTTTAT CGGCACTTTG CATCGGCCGC 4201 GCTCCCGATT CCGGAAGTGC TTGACATTGG GGAATTCAGC GAGAGCCTGA CCTATTGCAT 4261 CTCCCGCCGT GCACAGGGTG TCACGTTGCA AGACCTGCCT GAAACCGAAC TGCCCGCTGT 4321 TCTGCAGCCG GTCGCGGAGG CCATGGATGC GATCGCTGCG GCCGATCTTA GCCAGACGAG 4381 CGGGTTCGGC CCATTCGGAC CGCAAGGAAT CGGTCAATAC ACTACATGGC GTGATTTCAT 4441 ATGCGCGATT GCTGATCCCC ATGTGTATCA CTGGCAAACT GTGATGGACG ACACCGTCAG 4501 TGCGTCCGTC GCGCAGGCTC TCGATGAGCT GATGCTTTGG GCCGAGGACT GCCCCGAAGT 4561 CCGGCACCTC GTGCACGCGG ATTTCGGCTC CAACAATGTC CTGACGGACA ATGGCCGCAT 4621 AACAGCGGTC ATTGACTGGA GCGAGGCGAT GTTCGGGGAT TCCCAATACG AGGTCGCCAA 4681 CATCTTCTTC TGGAGGCCGT GGTTGGCTTG TATGGAGCAG CAGACGCGCT ACTTCGAGCG 4741 GAGGCATCCG GAGCTTGCAG GATCGCCGCG GCTCCGGGCG TATATGCTCC GCATTGGTCT 4801 TGACCAACTC TATCAGAGCT TGGTTGACGG CAATTTCGAT GATGCAGCTT GGGCGCAGGG 4861 TCGATGCGAC GCAATCGTCC GATCCGGAGC CGGGACTGTC GGGCGTACAC AAATCGCCCG 4921 CAGAAGCGCG GCCGTCTGGA CCGATGGCTG TGTAGAAGTA CTCGCCGATA GTGGAAACCG 4981 ACGCCCCAGC ACTCGTCCGA GGGCAAAGGA ATAGAGTAGA TGCCGACCGA ACAAGAGCTG 5041 ATTTCGAGAA CGCCTCAGCC AGCAACTCGC GCGAGCCTAG CAAGGCAAAT GCGAGAGAAC 5101 GGCCTTACGC TTGGTGGCAC AGTTCTCGTC CACAGTTCGC TAAGCTCGCT CGGCTGGGTC 5161 GCGGGAGGGC CGGTCGCAGT GATTCAGGCC CTTCTGGATT GTGTTGGTCC CCAGGGCACG 5221 ATTGTCATGC CCACGCACTC GGGTGATCTG ACTGATCCCG CAGATTGGAG ATCGCCGCCC 5281 GTGCCTGCCG ATTGGGTGCA GATCTAGAGC TCGCTGATCA GCCTCGACTG TGCCTCTAGT 5341 TGCCAGCCAT CTGTTGTTTG CCCCTCCCCC GTGCCTTCCT TGACCCTGGA AGGTGCCACT 5401 CCCACTGTCC TTTCCTAATA AAATGAGGAA ATTGCATCGC ATTGTCTGAG TAGGTGTCAT 5461 TCTATTCTGG GGGGTGGGGT GGGGCAGGAC AGCAAGGGGG AGGATTGGGA AGACAATAGC 5521 AGGCATGCTG GGGATGCGGT GGGCTCTATG GCTTCTGAGG CGGAAAGAAC CAGCTGGGGC 5581 TCGAGTGCAT TCTAGTTGTG GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGTAT 5641 ACCGTCGACC TCTAGCTAGA GCTTGGCGTA ATCATGGTCA TAGCTGTTTC CTGTGTGAAA 5701 TTGTTATCCG CTCACAATTC CACACAACAT ACGAGCCGGA AGCATAAAGT GTAAAGCCTG 5761 GGGTGCCTAA TGAGTGAGCT AACTCACATT AATTGCGTTG CGCTCACTGC CCGCTTTCCA 5821 GTCGGGAAAC CTGTCGTGCC AGCTGCATTA ATGAATCGGC CAACGCGCGG GGAGAGGCGG 5881 TTTGCGTATT GGGCGCTCTT CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG 5941 GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA CAGAATCAGG 6001 GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA 6061 GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG 6121 ACGCTCAAGT CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC 6181 TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC 6241 CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCAATGCTCA CGCTGTAGGT ATCTCAGTTC 6301 GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG 6361 CTGCGCCTTA TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC 6421 ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA 6481 GTTCTTGAAG TGGTGGCCTA ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC 6541 TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC 6601 CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG 6661 ATCTCAAGAA GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC 6721 ACGTTAAGGG ATTTTGGTCA TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA 6781 TTAAAAATGA AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA 6841 CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT CATCCATAGT 6901 TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG GGCTTACCAT CTGGCCCCAG 6961 TGCTGCAATG ATACCGCGAG ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA 7021 GCCAGCCGGA AGGGCCGAGC GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC 7081 TATTAATTGT TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT 7141 TGTTGCCATT GCTACAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG CTTCATTCAG 7201 CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT 7261 TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT 7321 GGTTATGGCA GCACTGCATA ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT 7381 GACTGGTGAG TACTCAACCA AGTCATTCTG AGAATAGTGT ATGCGGCGAC CGAGTTGCTC 7441 TTGCCCGGCG TCAATACGGG ATAATACCGC GCCACATAGC AGAACTTTAA AAGTGCTCAT 7501 CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC TTACCGCTGT TGAGATCCAG 7561 TTCGATGTAA CCCACTCGTG CACCCAACTG ATCTTCAGCA TCTTTTACTT TCACCAGCGT 7621 TTCTGGGTGA GCAAAAACAG GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG 7681 GAAATGTTGA ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA 7741 TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA TAGGGGTTCC 7801 GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTC //