LOCUS       P6L1B.TXT    4398 BP DS-DNA   CIRCULAR  SYN       10-MAR-2003
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     promoter        13..>245
                     /note="hEF1alpha promoter core"
     5'UTR           248..531
                     /note="HTLV1 5' UTR"
     CDS             533..2035
                     /note="HPV6a L1 (codon modified)"
     polyA_signal    2051..2299
                     /note="SV40 Late polyA"
     rep_origin      2325..3047
                     /note="MB1 Ori"
     promoter        3069..3412
                     /note="SV40 early promoter"
     rep_origin      3261..3412
                     /note="SV40 Ori"
     misc_feature    3412..3412
                     /note="end homology to SV40"
     promoter        3447..3501
                     /note="EM7 promoter"
     CDS             3521..3943
                     /note="Blasticidin R"
     polyA_signal    3997..4214
                     /note="BGH polyA"
     terminator      4242..4295
                     /note="transcription pause (a2 globin)"
     terminator      4298..4391
                     /note="transcription terminator"
BASE COUNT     1059 A   1264 C   1122 G    953 T      0 OTHER
ORIGIN      -
        1 GGATCTGCGA TCGCTCCGGT GCCCGTCAGT GGGCAGAGCG CACATCGCCC ACAGTCCCCG
       61 AGAAGTTGGG GGGAGGGGTC GGCAATTGAA CCGGTGCCTA GAGAAGGTGG CGCGGGGTAA
      121 ACTGGGAAAG TGATGTCGTG TACTGGCTCC GCCTTTTTCC CGAGGGTGGG GGAGAACCGT
      181 ATATAAGTGC AGTAGTCGCC GTGAACGTTC TTTTTCGCAA CGGGTTTGCC GCCAGAACAC
      241 AGCTGAAGCT TCGAGGGGCT CGCATCTCTC CTTCACGCGC CCGCCGCCCT ACCTGAGGCC
      301 GCCATCCACG CCGGTTGAGT CGCGTTCTGC CGCCTCCCGC CTGTGGTGCC TCCTGAACTG
      361 CGTCCGCCGT CTAGGTAAGT TTAAAGCTCA GGTCGAGACC GGGCCTTTGT CCGGCGCTCC
      421 CTTGGAGCCT ACCTAGACTC AGCCGGCTCT CCACGCTTTG CCTGACCCTG CTTGCTCAAC
      481 TCTACGTCTT TGTTTCGTTT TCTGTTCTGC GCCGTTACAG ATCCAAGCCA CCATGTGGCG
      541 CCCATCTGAT TCCACCGTGT ACGTCCCGCC ACCCAATCCC GTCAGCAAGG TCGTCGCAAC
      601 CGACGCCTAC GTCACCAGGA CAAACATCTT CTACCACGCA TCATCCAGCC GCCTGTTGGC
      661 CGTCGGCCAC CCCTACTTCA GTATCAAGAG AGCCAACAAG ACCGTCGTCC CCAAGGTCAG
      721 CGGCTACCAG TATCGCGTGT TCAAAGTCGT CCTGCCCGAC CCCAACAAGT TCGCCCTGCC
      781 CGATAGCAGC TTGTTCGACC CAACCACCCA GAGGCTCGTG TGGGCCTGTA CCGGGCTGGA
      841 AGTCGGGAGA GGCCAACCCC TGGGCGTCGG CGTGTCCGGC CACCCGTTCT TGAACAAGTA
      901 CGACGACGTC GAGAACAGCG GCTCCGGCGG CAATCCAGGC CAGGACAATC GCGTCAACGT
      961 GGGCATGGAC TACAAGCAGA CCCAGCTGTG TATGGTCGGC TGCGCACCAC CCCTCGGGGA
     1021 ACACTGGGGC AAGGGCAAGC AATGCACCAA CACCCCAGTG CAGGCCGGCG ATTGTCCCCC
     1081 ACTGGAGTTG ATCACATCCG TCATCCAGGA CGGGGACATG GTCGATACCG GGTTCGGCGC
     1141 CATGAACTTC GCCGACCTCC AAACGAACAA GAGCGACGTC CCCATCGATA TCTGCGGGAC
     1201 CACCTGCAAG TACCCCGACT ACCTGCAAAT GGCCGCCGAT CCCTACGGCG ACCGCCTGTT
     1261 CTTCTTCTTG AGAAAAGAGC AGATGTTCGC ACGCCACTTC TTCAATCGCG CCGGGGAAGT
     1321 CGGCGAGCCC GTCCCCGACA CCCTGATCAT CAAAGGCTCC GGCAACAGGA CCAGCGTGGG
     1381 CTCCAGCATC TACGTCAATA CACCGTCTGG GAGCCTCGTC AGTAGCGAAG CCCAGCTCTT
     1441 CAACAAACCC TACTGGTTGC AGAAGGCACA AGGCCACAAC AACGGCATCT GCTGGGGCAA
     1501 CCAGCTCTTC GTCACCGTCG TGGACACAAC CAGGTCCACA AACATGACCC TGTGCGCCAG
     1561 TGTCACCACC AGCAGTACCT ACACGAACAG CGACTACAAG GAGTACATGA GGCACGTCGA
     1621 GGAATACGAC CTGCAGTTCA TCTTCCAACT GTGCTCAATC ACCCTGTCCG CCGAAGTCAT
     1681 GGCATACATC CATACCATGA ACCCAAGCGT CCTGGAGGAT TGGAACTTCG GCCTGAGCCC
     1741 CCCACCCAAC GGCACCCTGG AGGACACATA CCGCTACGTC CAAAGCCAGG CAATCACATG
     1801 CCAGAAGCCA ACCCCCGAGA AAGAGAAGCC CGACCCATAC AAGAACTTGA GCTTCTGGGA
     1861 AGTCAACCTG AAGGAGAAGT TCAGCTCCGA GCTCGACCAG TACCCACTCG GCAGGAAGTT
     1921 CCTGCTGCAA TCCGGCTACC GCGGCAGAAG TAGCATCCGC ACAGGCGTCA AGAGGCCCGC
     1981 CGTCAGCAAG GCAAGCGCCG CACCCAAGCG CAAAAGGGCA AAGACCAAGC GCTGAGCTCG
     2041 AGGCTAGCTC GACATGATAA GATACATTGA TGAGTTTGGA CAAACCACAA CTAGAATGCA
     2101 GTGAAAAAAA TGCTTTATTT GTGAAATTTG TGATGCTATT GCTTTATTTG TGAAATTTGT
     2161 GATGCTATTG CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT
     2221 TGCATTCATT TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA
     2281 AACCTCTACA AATGTGGTAG ATCATTTAAA TGTTAATTAA GAACATGTGA GCAAAAGGCC
     2341 AGCAAAAGGC CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC
     2401 CCCCTGACGA GCATCACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC
     2461 TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC
     2521 TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT
     2581 GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC
     2641 ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA
     2701 ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG
     2761 CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA
     2821 GAAGAACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG
     2881 GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC
     2941 AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT
     3001 CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGGCT AGTTAATTAA
     3061 GCTGTACACT GTGGAATGTG TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG
     3121 GCAGAAGTAT GCAAAGCATG CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG
     3181 GCTCCCCAGC AGGCAGAAGT ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC
     3241 CGCCCCTAAC TCCGCCCATC CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC
     3301 ATGGCTGACT AATTTTTTTT ATTTATGCAG AGGCCGAGGC CGCCTCTGCC TCTGAGCTAT
     3361 TCCAGAAGTA GTGAGGAGGC TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTCCCGGGAG
     3421 CTTGTATATC CATTTTCGGA TCTGATCAGC ACGTGTTGAC AATTAATCAT CGGCATAGTA
     3481 TATCGGCATA GTATAATACG ACAAGGTGAG GAACTAAATC ATGAAGACCT TCAACATCTC
     3541 TCAGCAGGAT CTGGAGCTGG TGGAGGTCGC CACTGAGAAG ATCACCATGC TCTATGAGGA
     3601 CAACAAGCAC CATGTCGGGG CGGCCATCAG GACCAAGACT GGGGAGATCA TCTCTGCTGT
     3661 CCACATTGAG GCCTACATTG GCAGGGTCAC TGTCTGTGCT GAAGCCATTG CCATTGGGTC
     3721 TGCTGTGAGC AACGGGCAGA AGGACTTTGA CACCATTGTG GCTGTCAGGC ACCCCTACTC
     3781 TGATGAGGTG GACAGATCCA TCAGGGTGGT CAGCCCCTGT GGCATGTGCA GAGAGCTCAT
     3841 CTCTGACTAT GCTCCTGACT GCTTTGTGCT CATTGAGATG AATGGCAAGC TGGTCAAAAC
     3901 CACCATTGAG GAACTCATCC CCCTCAAGTA CACCAGGAAC TAAACCTGAA TTCGCTAGAG
     3961 GGCCCTATTC TATAGTGTCA CCTAAATGCT AGAGCTCGCT GATCAGCCTC GACTGTGCCT
     4021 TCTAGTTGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC CCTGGAAGGT
     4081 GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG CATCGCATTG TCTGAGTAGG
     4141 TGTCATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA AGGGGGAGGA TTGGGAAGAC
     4201 AATAGCAGGC ATGCGCAGGG CCCAATTGCT CGAGCGGCCG CAATAAAATA TCTTTATTTT
     4261 CATTACATCT GTGTGTTGGT TTTTTGTGTG AATCGTAACT AACATACGCT CTCCATCAAA
     4321 ACAAAACGAA ACAAAACAAA CTAGCAAAAT AGGCTGTCCC CAGTGCAAGT GCAGGTGCCA
     4381 GAACATTTCT CTATCGAA
//