LOCUS       PYSEAP.TXT   5297 BP DS-DNA   CIRCULAR  SYN       10-MAR-1998
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     misc_feature    <41..>166
                     /note="URR fragment"
     misc_feature    <167..>377
                     /note="E1 fragment"
     promoter        462..1640
                     /note="EF1a promoter"
     enhancer        462..659
                     /note="EF-1alpha promoter core domain"
     exon            660..692
                     /note="EF-1a exon 1"
     intron          693..1632
                     /note="EF-1a intron A"
     exon            1632..1640
                     /note="EF-1a exon 2 leader"
     CDS             1727..3283
                     /note="SEAP"
     misc_signal     1727..1777
                     /note="Signal peptide"
     mat_peptide     1778..3283
                     /note="mature SEAP"
     misc_difference 3245..3283
                     /note="C-term extension"
     polyA_signal    3278..3499
                     /note="SV40 polyA"
     polyA_site      3397..3402
                     /note="SV40 late pA signal"
     rep_origin      3795..4438
                     /note="pUC Ori"
     CDS             complement(4445..<4840)
                     /note="BlasticidinR"
     promoter        complement(4859..4914)
                     /note="EM7"
     rep_origin      complement(4949..5100)
                     /note="SV40 Ori"
     promoter        complement(4962..5271)
                     /note="SV40 early promoter"
     repeat_region   complement(5060..5123)
                     /note="SV40 21bp repeats"
     enhancer        complement(5127..5198)
                     /note="72bp enhancer repeat A"
     enhancer        complement(5199..5271)
                     /note="72bp enhancer repeat B"
BASE COUNT     1166 A   1431 C   1535 G   1165 T      0 OTHER
ORIGIN      -
        1 CGCGTTGACA TTGATTATTG ACTAGTTAGT TATTACTAGC CGCGTGCAAA GCACCGGCGG
       61 CGGTAGATGC GGGGTAAGTA CTGAATTTTA ATTCGACCTA TCCCGGTAAA GCGAAAGCGA
      121 CACGCTTTTT TTTCACACAT AGCGGGACCG AACACGTTAT AAGTATCGAA CTCCTAAAGA
      181 AGCAGTGTAG TTTTCTGCAG ATGCAAAAAA GATCTCATGA AGGAGGAACT TGTGCAGTTT
      241 ACTTAATCTG CTTTAACACA GCTAAAAGCA GAGAAACAGT CCGGAATCTG ATGGCAAACA
      301 CGCTAAATGT AAGAGAAGAG TGTTTGATGC TGCAGCCAGC TAAAATTCGA GGACTCAGCG
      361 CAGCTCTATT CTGGTTTAGC TTGCAAAGAT GGATAAAGTT TTAAACAGAG AGGAATCTTT
      421 GCAGCTAATG GACCTTGTAG GTCTTGAAAG GAGTGGGAAT TGGCTCCGGT GCCCGTCAGT
      481 GGGCAGAGCG CACATCGCCC ACAGTCCCCG AGAAGTTGGG GGGAGGGGTC GGCAATTGAA
      541 CCGGTGCCTA GAGAAGGTGG CGCGGGGTAA ACTGGGAAAG TGATGTCGTG TACTGGCTCC
      601 GCCTTTTTCC CGAGGGTGGG GGAGAACCGT ATATAAGTGC AGTAGTCGCC GTGAACGTTC
      661 TTTTTCGCAA CGGGTTTGCC GCCAGAACAC AGGTAAGTGC CGTGTGTGGT TCCCGCGGGC
      721 CTGGCCTCTT TACGGGTTAT GGCCCTTGCG TGCCTTGAAT TACTTCCACC TGGCTGCAGT
      781 ACGTGATTCT TGATCCCGAG CTTCGGGTTG GAAGTGGGTG GGAGAGTTCG AGGCCTTGCG
      841 CTTAAGGAGC CCCTTCGCCT CGTGCTTGAG TTGAGGCCTG GCCTGGGCGC TGGGGCCGCC
      901 GCGTGCGAAT CTGGTGGCAC CTTCGCGCCT GTCTCGCTGC TTTCGATAAG TCTCTAGCCA
      961 TTTAAAATTT TTGATGACCT GCTGCGACGC TTTTTTTCTG GCAAGATAGT CTTGTAAATG
     1021 CGGGCCAAGA TCTGCACACT GGTATTTCGG TTTTTGGGGC CGCGGGCGGC GACGGGGCCC
     1081 GTGCGTCCCA GCGCACATGT TCGGCGAGGC GGGGCCTGCG AGCGCGGCCA CCGAGAATCG
     1141 GACGGGGGTA GTCTCAAGCT GGCCGGCCTG CTCTGGTGCC TGGCCTCGCG CCGCCGTGTA
     1201 TCGCCCCGCC CTGGGCGGCA AGGCTGGCCC GGTCGGCACC AGTTGCGTGA GCGGAAAGAT
     1261 GGCCGCTTCC CGGCCCTGCT GCAGGGAGCT CAAAATGGAG GACGCGGCGC TCGGGAGAGC
     1321 GGGCGGGTGA GTCACCCACA CAAAGGAAAA GGGCCTTTCC GTCCTCAGCC GTCGCTTCAT
     1381 GTGACTCCAC GGAGTACCGG GCGCCGTCCA GGCACCTCGA TTAGTTCTCG AGCTTTTGGA
     1441 GTACGTCGTC TTTAGGTTGG GGGGAGGGGT TTTATGCGAT GGAGTTTCCC CACACTGAGT
     1501 GGGTGGAGAC TGAAGTTAGG CCAGCTTGGC ACTTGATGTA ATTCTCCTTG GAATTTGCCC
     1561 TTTTTGAGTT TGGATCTTGG TTCATTCTCA AGCCTCAGAC AGTGGTTCAA AGTTTTTTTC
     1621 TTCCATTTCA GGTGTCGTGA GGAATTCTCT AGAGATCCCT CGACCTCGAG ATCCATTGTG
     1681 CTGGATCTGC GATCTAAGTA AGCTTCGAAT CGCGAATTCG CCCACCATGC TGCTGCTGCT
     1741 GCTGCTGCTG GGCCTGAGGC TACAGCTCTC CCTGGGCATC ATCCCAGTTG AGGAGGAGAA
     1801 CCCGGACTTC TGGAACCGCG AGGCAGCCGA GGCCCTGGGT GCCGCCAAGA AGCTGCAGCC
     1861 TGCACAGACA GCCGCCAAGA ACCTCATCAT CTTCCTGGGC GATGGGATGG GGGTGTCTAC
     1921 GGTGACAGCT GCCAGGATCC TAAAAGGGCA GAAGAAGGAC AAACTGGGGC CTGAGATACC
     1981 CCTGGCCATG GACCGCTTCC CATATGTGGC TCTGTCCAAG ACATACAATG TAGACAAACA
     2041 TGTGCCAGAC AGTGGAGCCA CAGCCACGGC CTACCTGTGC GGGGTCAAGG GCAACTTCCA
     2101 GACCATTGGC TTGAGTGCAG CCGCCCGCTT TAACCAGTGC AACACGACAC GCGGCAACGA
     2161 GGTCATCTCC GTGATGAATC GGGCCAAGAA AGCAGGGAAG TCAGTGGGAG TGGTAACCAC
     2221 CACACGAGTG CAGCACGCCT CGCCAGCCGG CACCTACGCC CACACGGTGA ACCGCAACTG
     2281 GTACTCGGAC GCCGACGTGC CTGCCTCGGC CCGCCAGGAG GGGTGCCAGG ACATCGCTAC
     2341 GCAGCTCATC TCCAACATGG ACATTGACGT GATCCTAGGT GGAGGCCGAA AGTACATGTT
     2401 TCGCATGGGA ACCCCAGACC CTGAGTACCC AGATGACTAC AGCCAAGGTG GGACCAGGCT
     2461 GGACGGGAAG AATCTGGTGC AGGAATGGCT GGCGAAGCGC CAGGGTGCCC GGTATGTGTG
     2521 GAACCGCACT GAGCTCATGC AGGCTTCCCT GGACCCGTCT GTGACCCATC TCATGGGTCT
     2581 CTTTGAGCCT GGAGACATGA AATACGAGAT CCACCGAGAC TCCACACTGG ACCCCTCCCT
     2641 GATGGAGATG ACAGAGGCTG CCCTGCGCCT GCTGAGCAGG AACCCCCGCG GCTTCTTCCT
     2701 CTTCGTGGAG GGTGGTCGCA TCGACCATGG TCATCATGAA AGCAGGGCTT ACCGGGCACT
     2761 GACTGAGACG ATCATGTTCG ACGACGCCAT TGAGAGGGCG GGCCAGCTCA CCAGCGAGGA
     2821 GGACACGCTG AGCCTCGTCA CTGCCGACCA CTCCCACGTC TTCTCCTTCG GAGGCTACCC
     2881 CCTGCGAGGG AGCTCCATCT TCGGGCTGGC CCCTGGCAAG GCCCGGGACA GGAAGGCCTA
     2941 CACGGTCCTC CTATACGGAA ACGGTCCAGG CTATGTGCTC AAGGACGGCG CCCGGCCGGA
     3001 TGTTACCGAG AGCGAGAGCG GGAGCCCCGA GTATCGGCAG CAGTCAGCAG TGCCCCTGGA
     3061 CGAAGAGACC CACGCAGGCG AGGACGTGGC GGTGTTCGCG CGCGGCCCGC AGGCGCACCT
     3121 GGTTCACGGC GTGCAGGAGC AGACCTTCAT AGCGCACGTC ATGGCCTTCG CCGCCTGCCT
     3181 GGAGCCCTAC ACCGCCTGCG ACCTGGCGCC CCCCGCCGGC ACCACCGACG CCGCGCACCC
     3241 GGGTTACTCT AGAGTCGGGG CGGCCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT
     3301 GAGTTTGGAC AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT
     3361 GATGCTATTG CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT
     3421 TGCATTCATT TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA
     3481 AACCTCTACA AATGTGGTAA AATCGATAAG GATCCGTCGA CCGATGCCCT TGAGAGCCTT
     3541 CAACCCAGTC AGCTCCTTCC GGTGGGCGCG GGGCATGACT ATCGTCGCCG CACTTATGAC
     3601 TGTCTTCTTT ATCATGCAAC TCGTAGGACA GGTGCCGGCA GCGCTCTTCC GCTTCCTCGC
     3661 TCACTGACTC GCTGCGCTCG GTCGTTCGGC TGCGGCGAGC GGTATCAGCT CACTCAAAGG
     3721 CGGTAATACG GTTATCCACA GAATCAGGGG ATAACGCAGG AAAGAACATG TGAGCAAAAG
     3781 GCCAGCAAAA GGCCAGGAAC CGTAAAAAGG CCGCGTTGCT GGCGTTTTTC CATAGGCTCC
     3841 GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA GAGGTGGCGA AACCCGACAG
     3901 GACTATAAAG ATACCAGGCG TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA
     3961 CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC GGGAAGCGTG GCGCTTTCTC
     4021 ATAGCTCACG CTGTAGGTAT CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG
     4081 TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC CGGTAACTAT CGTCTTGAGT
     4141 CCAACCCGGT AAGACACGAC TTATCGCCAC TGGCAGCAGC CACTGGTAAC AGGATTAGCA
     4201 GAGCGAGGTA TGTAGGCGGT GCTACAGAGT TCTTGAAGTG GTGGCCTAAC TACGGCTACA
     4261 CTAGAAGGAC AGTATTTGGT ATCTGCGCTC TGCTGAAGCC AGTTACCTTC GGAAAAAGAG
     4321 TTGGTAGCTC TTGATCCGGC AAACAAACCA CCGCTGGTAG CGGTGGTTTT TTTGTTTGCA
     4381 AGCAGCAGAT TACGCGCAGA AAAAAAGGAT CTCAAGAAGA TCCTTTGATC TTTTCTACGG
     4441 GGTCGCCCTC CCACACATAA CCAGAGGGCA GCAATTCACG AATCCCAACT GCCGTCGGCT
     4501 GTCCATCACT GTCCTTCACT ATGGCTTTGA TCCCAGGATG CAGATCGAGA AGCACCTGTC
     4561 GGCACCGTCC GCAGGGGCTC AAGATGCCCC TGTTCTCATT TCCGATCGCG ACGATACAAG
     4621 TCAGGTTGCC AGCTGCCGCA GCAGCAGCAG TGCCCAGCAC CACGAGTTCT GCACAAGGTC
     4681 CCCCAGTAAA ATGATATACA TTGACACCAG TGAAGATGCG GCCGTCGCTA GAGAGAGCTG
     4741 CGCTGGCGAC GCTGTAGTCT TCAGAGATGG GGATGCTGTT GATTGTAGCC GTTGCTCTTT
     4801 CAATGAGGGT GGATTCTTCT TGAGACAAAG GCTTGGCCAT GGTTTAGTTC CTCACCTTGT
     4861 CGTATTATAC TATGCCGATA TACTATGCCG ATGATTAATT GTCAACACGT GCTGATCAGA
     4921 TCCGAAAATG GATATACAAG CTCCCGGGAG CTTTTTGCAA AAGCCTAGGC CTCCAAAAAA
     4981 GCCTCCTCAC TACTTCTGGA ATAGCTCAGA GGCAGAGGCG GCCTCGGCCT CTGCATAAAT
     5041 AAAAAAAATT AGTCAGCCAT GGGGCGGAGA ATGGGCGGAA CTGGGCGGAG TTAGGGGCGG
     5101 GATGGGCGGA GTTAGGGGCG GGACTATGGT TGCTGACTAA TTGAGATGCA TGCTTTGCAT
     5161 ACTTCTGCCT GCTGGGGAGC CTGGGGACTT TCCACACCTG GTTGCTGACT AATTGAGATG
     5221 CATGCTTTGC ATACTTCTGC CTGCCTGGGG AGCCTGGGGA CTTTCCACAC CCTAACTGAC
     5281 ACACATTCCA CAGAATT
//