LOCUS PAL2.TXT 4801 BP DS-DNA CIRCULAR SYN 10-NOV-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers promoter 14..544 /note="CMV promoter" 5'UTR 545..828 /note="HTLV R-U5" CDS 830..2236 /note="HPV2a L2 (codon modified)" mutation 956..958 /note="Ile->Leu in Sally Roberts HPV2a clone" polyA_signal 2246..2479 /note="SV40 Late polyA" rep_origin 2497..3218 /note="MB1 Ori" misc_feature 3240..3583 /note="SV40 homology" promoter 3409..3583 /note="SV40 early promoter" rep_origin 3432..3583 /note="SV40 Ori" misc_feature 3481..3530 /note="SV40 transcription start points" CDS 3496..3567 /note="SELP" promoter 3618..3673 /note="EM7" CDS 3693..4487 /note="Kan/Neo resistance" polyA_signal 4535..4801 /note="HSV TK pA" BASE COUNT 1063 A 1462 C 1259 G 1017 T 0 OTHER ORIGIN - 1 CCTGCAGGGC CCACTAGTCC GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC 61 CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG 121 GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA AACTGCCCAC TTGGCAGTAC 181 ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG 241 CCTGGCATTA TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG 301 TATTAGTCAT CGCTATTACC ATGATGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT 361 AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT 421 TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC 481 AAATGAGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG CAGAGCTCGT TTAGTGAACC 541 GTAAGCTTCG AGGGGCTCGC ATCTCTCCTT CACGCGCCCG CCGCCCTACC TGAGGCCGCC 601 ATCCACGCCG GTTGAGTCGC GTTCTGCCGC CTCCCGCCTG TGGTGCCTCC TGAACTGCGT 661 CCGCCGTCTA GGTAAGTTTA AAGCTCAGGT CGAGACCGGG CCTTTGTCCG GCGCTCCCTT 721 GGAGCCTACC TAGACTCAGC CGGCTCTCCA CGCTTTGCCT GACCCTGCTT GCTCAACTCT 781 ACGTCTTTGT TTCGTTTTCT GTTCTGCGCC GTTACAGATC CAAGCCACCA TGAGCATCCG 841 CGCAAAGAGG CGCAAAAGGG CAAGTCCAAC CGATCTGTAC AGGACATGCA AGCAGGCCGG 901 CACATGTCCA CCCGATATCA TTCCCCGCGT CGAGCAAAAC ACCCTGGCCG ACAAGATCTT 961 GAAATGGGGG TCCCTGGGCG TGTTCTTCGG CGGCCTGGGC ATCGGGACAG GGTCCGGGAC 1021 CGGCGGCAGG ACCGGCTACA TCCCCGTGGG CAGCAGGCCA ACAACCGTGG TCGATATCGG 1081 CCCCACCCCA CGCCCACCCG TCATCATCGA GCCCGTCGGC GCAAGCGAGC CAAGCATCGT 1141 GACCCTCGTC GAAGATAGCT CCATCATCAA TGCCGGCGCC AGCCACCCAA CATTCACCGG 1201 CACCGGCGGG TTTGAGGTCA CCACAAGTAC AGTCACCGAT CCAGCAGTGC TCGACATTAC 1261 ACCAAGCGGC ACATCCGTCC AGGTCTCCAG CTCCAGCTTC CTGAATCCCT TGTATACCGA 1321 ACCCGCCATC GTCGAAGCCC CACAGACCGG CGAGGTCAGC GGGCACGTGC TGGTCTCCAC 1381 CGCAACAAGC GGCAGCCACG GCTACGAAGA GATCCCCATG CAAACCTTCG CAACCAGCGG 1441 CGGGTCTGGC ACCGAACCCA TTTCCAGCAC CCCATTGCCC GGGGTCAGAC GCGTCGCCGG 1501 CCCAAGGCTC TATTCCCGCG CAAACCAGCA GGTCCAGGTC CGCGACCCCG CCTTCCTGGC 1561 CCGCCCCGCC GACTTGGTCA CCTTCGATAA CCCCGTCTAC GATCCCGAAG AGACCATCAT 1621 CTTCCAACAC CCCGATCTCC ACGAACCCCC CGACCCCGAC TTCCTGGATA TCGTCGCCCT 1681 CCACCGCCCA GCACTGACCA GTCGCCGCGG CACCGTGCGC TTCAGCCGCC TCGGCAGGCG 1741 CGCCACCTTG AGGACAAGGT CCGGCAAGCA GATCGGCGCC AGAGTCCATT TCTACCACGA 1801 CATCAGCCCC ATCGGCACCG AAGAACTCGA AATGGAACCC CTGCTCCCAC CCGCCAGCAC 1861 CGACAATACC GACATGCTGT ACGACGTCTA CGCCGACAGC GACGTGCTGC AACCCCTGTT 1921 GGACGAACTG CCAGCAGCAC CCAGGGGCAG CCTGAGCCTC GCCGATACCG CCGTCAGCGC 1981 AACAAGTGCC AGCACCCTGA GAGGCAGTAC CACCGTGCCC CTGAGCTCCG GCATCGACGT 2041 CCCCGTCTAT ACAGGCCCCG ATATCGAGCC CCCAAACGTC CCCGGGATGG GCCCCCTGAT 2101 CCCCGTCGCC CCGAGTCTGC CCAGCAGCGT CTACATCTTC GGCGGCGACT ACTACCTCAT 2161 GCCCAGCTAC GTGCTCTGGC CCAAGAGGAG GAAGCGCGTG CATTACTTCT TCGCCGACGG 2221 GTTCGTCGCC GCATGAGCTC GAGGCTAGCT GGCCAGACAT GATAAGATAC ATTGATGAGT 2281 TTGGACAAAC CACAACTAGA ATGCAGTGAA AAAAATGCTT TATTTGTGAA ATTTGTGATG 2341 CTATTGCTTT ATTTGTAACC ATTATAAGCT GCAATAAACA AGTTAACAAC AACAATTGCA 2401 TTCATTTTAT GTTTCAGGTT CAGGGGGAGG TGTGGGAGGT TTTTTAAAGC AAGTAAAACC 2461 TCTACAAATG TGGTATGGAA TTCTTAATTA ACTAGCCATG ACCAAAATCC CTTAACGTGA 2521 GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC 2581 TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 2641 TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC 2701 GCAGATACCA AATACTGTTC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC 2761 TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG 2821 CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG 2881 GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA 2941 ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC 3001 GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG 3061 GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG 3121 ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT 3181 TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTAATTA AGCTGTACAC 3241 TGTGGAATGT GTGTCAGTTA GGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 3301 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCAGGTGTGG AAAGTCCCCA GGCTCCCCAG 3361 CAGGCAGAAG TATGCAAAGC ATGCATCTCA ATTAGTCAGC AACCATAGTC CCGCCCCTAA 3421 CTCCGCCCAT CCCGCCCCTA ACTCCGCCCA GTTCCGCCCA TTCTCCGCCC CATGACTGAC 3481 TAATTTTTTT TATTTATGCA GAGGCCGAGG CCGCCTCTGC CTCTGAGCTA TTCCAGAAGT 3541 AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT TTTGCAAAAA GCTCCCGGGA GCTTGTATAT 3601 CCATTTTCGG ATCTGATCAG CACGTGTTGA CAATTAATCA TCGGCATAGT ATATCGGCAT 3661 AGTATAATAC GACTCACTAT AGGAGGGCCA CCATGATTGA ACAAGATGGA TTGCACGCAG 3721 GTTCTCCGGC CGCTTGGGTG GAGAGGCTAT TCGGCTATGA CTGGGCACAA CAGACAATCG 3781 GCTGCTCTGA TGCCGCCGTG TTCCGGCTGT CAGCGCAGGG GCGCCCGGTT CTTTTTGTCA 3841 AGACCGACCT GTCCGGTGCC CTGAATGAAC TGCAAGACGA GGCAGCGCGG CTATCGTGGC 3901 TGGCCACGAC GGGCGTTCCT TGCGCAGCTG TGCTCGACGT TGTCACTGAA GCGGGAAGGG 3961 ACTGGCTGCT ATTGGGCGAA GTGCCGGGGC AGGATCTCCT GTCATCTCAC CTTGCTCCTG 4021 CCGAGAAAGT ATCCATCATG GCTGATGCAA TGCGGCGGCT GCATACGCTT GATCCGGCTA 4081 CCTGCCCATT CGACCACCAA GCGAAACATC GCATCGAGCG AGCACGTACT CGGATGGAAG 4141 CCGGTCTTGT CGATCAGGAT GATCTGGACG AAGAGCATCA GGGGCTCGCG CCAGCCGAAC 4201 TGTTCGCCAG GCTCAAGGCG AGCATGCCCG ACGGCGAGGA TCTCGTCGTG ACACATGGCG 4261 ATGCCTGCTT GCCGAATATC ATGGTGGAAA ATGGCCGCTT TTCTGGATTC ATCGACTGTG 4321 GCCGGCTGGG TGTGGCGGAC CGCTATCAGG ACATAGCGTT GGCTACCCGT GATATTGCTG 4381 AAGAGCTTGG CGGCGAATGG GCTGACCGCT TCCTCGTGCT TTACGGTATC GCCGCTCCCG 4441 ATTCGCAGCG CATCGCCTTC TATCGCCTTC TTGACGAGTT CTTCTGAGCG GGACTCTGGG 4501 GTTCGAAATG ACCGACCAAG CGAATTCGCT AGAGGGAGGC TAACTGAAAC ACGGAAGGAG 4561 ACAATACCGG AAGGAACCCG CGCTATGACG GCAATAAAAA GACAGAATAA AACGCACGGT 4621 GTTGGGTCGT TTGTTCATAA ACGCGGGGTT CGGTCCCAGG GCTGGCACTC TGTCGATACC 4681 CCACCGAGAC CCCATTGGGG CCAATACGCC CGCGTTTCTT CCTTTTCCCC ACCCCACCCC 4741 CCAAGTTCGG GTGAAGGCCC AGGGCTCGCA GCCAACGTCG GGGCGGCAGG CCCTGCCATA 4801 G //