LOCUS       PIND-E2 (L   7192 BP DS-DNA   CIRCULAR  SYN       03-SEP-2001
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     protein_bind    1..174
                     /note="Ecdysone/glucocorticoid response elements (5x)"
     promoter        181..475
                     /note="minimal heat shock promoter"
     precursor_RNA   <492..539
                     /note="3' fragment of CMV promoter"
     intron          601..733
                     /note="intron"
     precursor_RNA   778..796
                     /note="T7 promoter"
     mutation        847..847
                     /note="discovered by sequencing"
     CDS             848..>2080
                     /note="E2-TA (weighs 45.4)"
     misc_feature    <848..>906
                     /note="E1"
     mRNA            1320..1320
                     /note="p3080 first transcribed base"
     mutation        1331..1333
                     /note="ATG -> ATC mutation kills E2-TR"
     misc_feature    <1334..>2080
                     /note="E2-TR"
     misc_feature    1431..>1794
                     /note="E4"
     misc_feature    1454..1477
                     /note="putative splice acceptor"
     misc_feature    1463..1465
                     /note="first E2 residue in E8-E2"
     mutation        1684..1684
                     /note="Unknown base missed in original publications"
     mutation        1748..1750
                     /note="S301A kills PEST (Penrose 01)"
     mRNA            2208..>2712
                     /note="HBV PRE"
     polyA_signal    2770..2984
                     /note="BGH pA"
     misc_feature    3047..3461
                     /note="F1 Ori"
     promoter        3525..3850
                     /note="SV40 promoter/Ori"
     CDS             3886..4680
                     /note="NeoR"
     polyA_signal    4696..4935
                     /note="SV40 pA"
     rep_origin      5367..6040
                     /note="pUC Ori"
     CDS             6185..7045
                     /note="AmpR"
BASE COUNT     1711 A   1870 C   1844 G   1766 T      1 OTHER
ORIGIN      -
        1 AGATCTCGGC CGCATATTAA GTGCATTGTT CTCGATACCG CTAAGTGCAT TGTTCTCGTT
       61 AGCTCGATGG ACAAGTGCAT TGTTCTCTTG CTGAAAGCTC GATGGACAAG TGCATTGTTC
      121 TCTTGCTGAA AGCTCGATGG ACAAGTGCAT TGTTCTCTTG CTGAAAGCTC AGTACCCGGG
      181 AGTACCCTCG ACCGCCGGAG TATAAATAGA GGCGCTTCGT CTACGGAGCG ACAATTCAAT
      241 TCAAACAAGC AAAGTGAACA CGTCGCTAAG CGAAAGCTAA GCAAATAAAC AAGCGCAGCT
      301 GAACAAGCTA AACAATCTGC AGTAAAGTGC AAGTTAAAGT GAATCAATTA AAAGTAACCA
      361 GCAACCAAGT AAATCAACTG CAACTACTGA AATCTGCCAA GAAGTAATTA TTGAATACAA
      421 GAAGAGAACT CTGAATACTT TCAACAAGTT ACCGAGAAAG AAGAACTCAC ACACAGCTAG
      481 CGTTTAAACT TAAGCTTTAT TGCGGTAGTT TATCACAGTT AAATTGCTAA CGCAGTCAGT
      541 GCTTCTGACA CAACAGTCTC GAACTTAAGC TGCAGAAGTT GGTCGTGAGG CACTGGGCAG
      601 GTAAGTATCA AGGTTACAAG ACAGGTTTAA GGAGACCAAT AGAAACTGGG CTTGTCGAGA
      661 CAGAGAAGAC TCTTGCGTTT CTGATAGGCA CCTATTGGTC TTACTGACAT CCACTTTGCC
      721 TTTCTCTCCA CAGGTGTCCA CTCCCAGTTC AATTACAGCT CTTAAGGCTA GAGTACTTAA
      781 TACGACTCAC TATAGGCTAG CCTCGAGAAT TCACGCGTGG TACAGCTTGG TACCGAGCTC
      841 GGATCCCATG GAGACAGCAT GCGAACGTTT ACATGTAGCG CAAGAAACAC AAATGCAGTT
      901 GATTGAGAAA AGTAGTGATA AGTTGCAAGA TCATATACTG TACTGGACTG CTGTTAGAAC
      961 TGAGAACACA CTGCTTTATG CTGCAAGGAA AAAAGGGGTG ACTGTCCTAG GACACTGCAG
     1021 AGTACCACAC TCTGTAGTTT GTCAAGAGAG AGCCAAGCAG GCCATTGAAA TGCAGTTGTC
     1081 TTTGCAGGAG TTAAGCAAAA CTGAGTTTGG GGATGAACCA TGGTCTTTGC TTGACACAAG
     1141 CTGGGACCGA TATATGTCAG AACCTAAACG GTGCTTTAAG AAAGGCGCCA GGGTGGTAGA
     1201 GGTGGAGTTT GATGGAAATG CAAGCAATAC AAACTGGTAC ACTGTCTACA GCAATTTGTA
     1261 CATGCGCACA GAGGACGGCT GGCAGCTTGC GAAGGCTGGG GCTGACGGAA CTGGGCTCTA
     1321 CTACTGCACC ATCGCCGGTG CTGGACGCAT TTACTATTCT CGCTTTGGTG ACGAGGCAGC
     1381 CAGATTTAGT ACAACAGGGC ATTACTCTGT AAGAGATCAG GACAGAGTGT ATGCTGGTGT
     1441 CTCATCCACC TCTTCTGATT TTAGAGATCG CCCAGACGGA GTCTGGGTCG CATCCGAAGG
     1501 ACCTGAAGGA GACCCTGCAG GAAAAGAAGC CGAGCCAGCC CAGCCTGTCT CTTCTTTGCT
     1561 CGGCTCCCCC GCCTGCGGTC CCATCAGAGC AGGCCTCGGT TGGGTACGGG ACGGTCCTCG
     1621 CTCGCACCCC TACAATTTTC CTGCAGGCTC GGGGGGCTCT ATTCTCCGCT CTTCCTCCAC
     1681 CCCNGTGCAG GGCACGGTAC CGGTGGACTT GGCATCAAGG CAGGAAGAAG AGGAGCAGTC
     1741 GCCCGACTCC ACAGAGGAAG AACCAGTGAC TCTCCCAAGG CGCACCACCA ATGATGGATT
     1801 CCACCTGTTA AAGGCAGGAG GGTCATGCTT TGCTCTAATT TCAGGAACTG CTAACCAGGT
     1861 AAAGTGCTAT CGCTTTCGGG TGAAAAAGAA CCATAGACAT CGCTACGAGA ACTGCACCAC
     1921 CACCTGGTTC ACAGTTGCTG ACAACGGTGC TGAAAGACAA GGACAAGCAC AAATACTGAT
     1981 CACCTTTGGA TCGCCAAGTC AAAGGCAAGA CTTTCTGAAA CATGTACCAC TACCTCCTGG
     2041 AATGAACATT TCCGGCTTTA CAGCCAGCTT GGACTTCTGA TCACTTCGAG TCTAGTAACG
     2101 GCCGCCAGTG TGCTGGAATT CTGCAGATAT CCATCACACT GGCGGCCGCT CGAGCATGCA
     2161 TCTAGAGCGG CCGCCACCGC GGTGGCTAGA GGGCCCGTTT GTACCCCCGT GGAACCTTTG
     2221 TGGCTCCTCT GCCGATCCAT ACTGCGGAAC TCCTAGCCGC TTGTTTTGCT CGCAGCCCGT
     2281 TTCCATGGCT GCTAGGCTGT ACTGCCAACT GGATCCTTCG CGGGACGTCC TTTGTTTACG
     2341 TCCCGTCGGC GCTGAATCCC GCGGACGACC CCTCGCGGGG CCGCTTGGGA CTCTCTCGTC
     2401 CCCTTCTCCG TCTGCCGTTC CAGCCGACCA CGGGGCGCAC CTCTCTTTAC GCGGTCTCCC
     2461 CGTCTGTGCC TTCTCATCTG CCGGTCCGTG TGCACTTCGC TTCACCTCTG CACGTTGCAT
     2521 GGAGACCACC GTGAACGCCC ATCAGATCCT GCCCAAGGTC TTACATAAGA GGACTCTTGG
     2581 ACTCCCAGCA ATGTCAACGA CCGACCTTGA GGCCTACTTC AAAGACTGTG TGTTTAAGGA
     2641 CTGGGAGGAG CTGGGGGAGG AGATTAGGTT AAAGGTCTTT GTATTAGGAG GCTGTAGGCA
     2701 TAAATTGGTC TGCGGGTTCG AAATCGATAA GCTGATCCGT CGACCCGGCC CGTTTAAACC
     2761 CGCTGATCAG CCTCGACTGT GCCTTCTAGT TGCCAGCCAT CTGTTGTTTG CCCCTCCCCC
     2821 GTGCCTTCCT TGACCCTGGA AGGTGCCACT CCCACTGTCC TTTCCTAATA AAATGAGGAA
     2881 ATTGCATCGC ATTGTCTGAG TAGGTGTCAT TCTATTCTGG GGGGTGGGGT GGGGCAGGAC
     2941 AGCAAGGGGG AGGATTGGGA AGACAATAGC AGGCATGCTG GGGATGCGGT GGGCTCTATG
     3001 GCTTCTGAGG CGGAAAGAAC CAGCTGGGGC TCTAGGGGGT ATCCCCACGC GCCCTGTAGC
     3061 GGCGCATTAA GCGCGGCGGG TGTGGTGGTT ACGCGCAGCG TGACCGCTAC ACTTGCCAGC
     3121 GCCCTAGCGC CCGCTCCTTT CGCTTTCTTC CCTTCCTTTC TCGCCACGTT CGCCGGCTTT
     3181 CCCCGTCAAG CTCTAAATCG GGGCATCCCT TTAGGGTTCC GATTTAGTGC TTTACGGCAC
     3241 CTCGACCCCA AAAAACTTGA TTAGGGTGAT GGTTCACGTA GTGGGCCATC GCCCTGATAG
     3301 ACGGTTTTTC GCCCTTTGAC GTTGGAGTCC ACGTTCTTTA ATAGTGGACT CTTGTTCCAA
     3361 ACTGGAACAA CACTCAACCC TATCTCGGTC TATTCTTTTG ATTTATAAGG GATTTTGGGG
     3421 ATTTCGGCCT ATTGGTTAAA AAATGAGCTG ATTTAACAAA AATTTAACGC GAATTAATTC
     3481 TGTGGAATGT GTGTCAGTTA GGGTGTGGAA AGTCCCCAGG CTCCCCAGGC AGGCAGAAGT
     3541 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCAGGTGTG GAAAGTCCCC AGGCTCCCCA
     3601 GCAGGCAGAA GTATGCAAAG CATGCATCTC AATTAGTCAG CAACCATAGT CCCGCCCCTA
     3661 ACTCCGCCCA TCCCGCCCCT AACTCCGCCC AGTTCCGCCC ATTCTCCGCC CCATGGCTGA
     3721 CTAATTTTTT TTATTTATGC AGAGGCCGAG GCCGCCTCTG CCTCTGAGCT ATTCCAGAAG
     3781 TAGTGAGGAG GCTTTTTTGG AGGCCTAGGC TTTTGCAAAA AGCTCCCGGG AGCTTGTATA
     3841 TCCATTTTCG GATCTGATCA AGAGACAGGA TGAGGATCGT TTCGCATGAT TGAACAAGAT
     3901 GGATTGCACG CAGGTTCTCC GGCCGCTTGG GTGGAGAGGC TATTCGGCTA TGACTGGGCA
     3961 CAACAGACAA TCGGCTGCTC TGATGCCGCC GTGTTCCGGC TGTCAGCGCA GGGGCGCCCG
     4021 GTTCTTTTTG TCAAGACCGA CCTGTCCGGT GCCCTGAATG AACTGCAGGA CGAGGCAGCG
     4081 CGGCTATCGT GGCTGGCCAC GACGGGCGTT CCTTGCGCAG CTGTGCTCGA CGTTGTCACT
     4141 GAAGCGGGAA GGGACTGGCT GCTATTGGGC GAAGTGCCGG GGCAGGATCT CCTGTCATCT
     4201 CACCTTGCTC CTGCCGAGAA AGTATCCATC ATGGCTGATG CAATGCGGCG GCTGCATACG
     4261 CTTGATCCGG CTACCTGCCC ATTCGACCAC CAAGCGAAAC ATCGCATCGA GCGAGCACGT
     4321 ACTCGGATGG AAGCCGGTCT TGTCGATCAG GATGATCTGG ACGAAGAGCA TCAGGGGCTC
     4381 GCGCCAGCCG AACTGTTCGC CAGGCTCAAG GCGCGCATGC CCGACGGCGA GGATCTCGTC
     4441 GTGACCCATG GCGATGCCTG CTTGCCGAAT ATCATGGTGG AAAATGGCCG CTTTTCTGGA
     4501 TTCATCGACT GTGGCCGGCT GGGTGTGGCG GACCGCTATC AGGACATAGC GTTGGCTACC
     4561 CGTGATATTG CTGAAGAGCT TGGCGGCGAA TGGGCTGACC GCTTCCTCGT GCTTTACGGT
     4621 ATCGCCGCTC CCGATTCGCA GCGCATCGCC TTCTATCGCC TTCTTGACGA GTTCTTCTGA
     4681 GCGGGACTCT GGGGTTCGAA ATGACCGACC AAGCGACGCC CAACCTGCCA TCACGAGATT
     4741 TCGATTCCAC CGCCGCCTTC TATGAAAGGT TGGGCTTCGG AATCGTTTTC CGGGACGCCG
     4801 GCTGGATGAT CCTCCAGCGC GGGGATCTCA TGCTGGAGTT CTTCGCCCAC CCCAACTTGT
     4861 TTATTGCAGC TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG
     4921 CATTTTTTTC ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG
     4981 TCTGTATACC GTCGACCTCT AGCTAGAGCT TGGCGTAATC ATGGTCATAG CTGTTTCCTG
     5041 TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA
     5101 AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG
     5161 CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA CGCGCGGGGA
     5221 GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG
     5281 TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG
     5341 AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC
     5401 GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA
     5461 AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT
     5521 TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC
     5581 TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA ATGCTCACGC TGTAGGTATC
     5641 TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC
     5701 CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT
     5761 TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG
     5821 CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA
     5881 TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA
     5941 AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA
     6001 AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG
     6061 AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC
     6121 TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG
     6181 ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT
     6241 CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG
     6301 GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA
     6361 TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA
     6421 TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC
     6481 GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT
     6541 CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA
     6601 AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT
     6661 CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT
     6721 TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA
     6781 GTTGCTCTTG CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG
     6841 TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA
     6901 GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA
     6961 CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG
     7021 CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC
     7081 AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG
     7141 GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT CGACGGATCG GG
//