LOCUS       pEGFP-N1     4733 bp    ss-DNA   circular  SYN   30-JUL-1997
DEFINITION  -
ACCESSION   -
KEYWORDS    -
SOURCE      -
FEATURES             Location/Qualifiers
     promoter        1..589
                     /note=CMV promoter
     enhancer        59..465
                     /note=enhancer
     TATA_signal     554..560
                     /note=TATA
     mRNA            583..583
                     /note=transcription start
     misc_feature    591..671
                     /note=MCS
     CDS             679..1398
                     /note=EGFP (27 kDa)
     polyA_site      complement(1527..1532)
                     /note=SV40 late polyA
     polyA_signal    1551..1602
                     /note=SV40 early polyA core
     misc_feature    1552..1557
                     /note=pA signal
     misc_feature    1581..1586
                     /note=pA signal
     polyA_site      1590..1590
                     /note=mRNA 3' end
     polyA_site      1602..1602
                     /note=mRNA 3' end
     mutation        1644..1644
                     /note=discovered by sequencing
     rep_origin      1649..2104
                     /note=F1 Ori
     -35_signal      2166..2171
                     /note=-35 region
     promoter        2166..2201
                     /note=bacterial promoter
     -10_signal      2189..2194
                     /note=-10 region
     misc_RNA        2201..2201
                     /note=transcription start
     enhancer        2278..2421
                     /note=SV40 enhancer elements
     misc_feature    2425..2488
                     /note=SV40 21bp repeats
     rep_origin      2445..2580
                     /note=SV40 Ori
     mRNA            2497..2546
                     /note=SV40 transcription start points
     promoter        2501..2507
                     /note=SV40 promoter
     CDS             2629..3423
                     /note=Kan/Neo
     polyA_signal    3601..3870
                     /note=HSV TK pA signal
     polyA_site      3659..3677
                     /note=HSV TK pA sites
     rep_origin      4008..4651
                     /note=pUC Ori
BASE COUNT     1140 A   1286 C   1245 G   1062 T      0 OTHER
ORIGIN      ?
1 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 61 CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC CCCGCCCATT 121 GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC ATTGACGTCA 181 ATGGGTGGAG TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC 241 AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA 301 CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 361 CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG ACTCACGGGG 421 ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC AAAATCAACG 481 GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT 541 ACGGTGGGAG GTCTATATAA GCAGAGCTGG TTTAGTGAAC CGTCAGATCC GCTAGCGCTA 601 CCGGACTCAG ATCTCGAGCT CAAGCTTCGA ATTCTGCAGT CGACGGTACC GCGGGCCCGG 661 GATCCACCGG TCGCCACCAT GGTGAGCAAG GGCGAGGAGC TGTTCACCGG GGTGGTGCCC 721 ATCCTGGTCG AGCTGGACGG CGACGTAAAC GGCCACAAGT TCAGCGTGTC CGGCGAGGGC 781 GAGGGCGATG CCACCTACGG CAAGCTGACC CTGAAGTTCA TCTGCACCAC CGGCAAGCTG 841 CCCGTGCCCT GGCCCACCCT CGTGACCACC CTGACCTACG GCGTGCAGTG CTTCAGCCGC 901 TACCCCGACC ACATGAAGCA GCACGACTTC TTCAAGTCCG CCATGCCCGA AGGCTACGTC 961 CAGGAGCGCA CCATCTTCTT CAAGGACGAC GGCAACTACA AGACCCGCGC CGAGGTGAAG 1021 TTCGAGGGCG ACACCCTGGT GAACCGCATC GAGCTGAAGG GCATCGACTT CAAGGAGGAC 1081 GGCAACATCC TGGGGCACAA GCTGGAGTAC AACTACAACA GCCACAACGT CTATATCATG 1141 GCCGACAAGC AGAAGAACGG CATCAAGGTG AACTTCAAGA TCCGCCACAA CATCGAGGAC 1201 GGCAGCGTGC AGCTCGCCGA CCACTACCAG CAGAACACCC CCATCGGCGA CGGCCCCGTG 1261 CTGCTGCCCG ACAACCACTA CCTGAGCACC CAGTCCGCCC TGAGCAAAGA CCCCAACGAG 1321 AAGCGCGATC ACATGGTCCT GCTGGAGTTC GTGACCGCCG CCGGGATCAC TCTCGGCATG 1381 GACGAGCTGT ACAAGTAAAG CGGCCGCGAC TCTAGATCAT AATCAGCCAT ACCACATTTG 1441 TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA 1501 TGAATGCAAT TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA 1561 ATAGCATCAC AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT 1621 CCAAACTCAT CAATGTATCT TAACGCGTAA ATTGTAAGCG TTAATATTTT GTTAAAATTC 1681 GCGTTAAATT TTTGTTAAAT 
CAGCTCATTT TTTAACCAAT AGGCCGAAAT CGGCAAAATC 1741 CCTTATAAAT CAAAAGAATA GACCGAGATA GGGTTGAGTG TTGTTCCAGT TTGGAACAAG 1801 AGTCCACTAT TAAAGAACGT GGACTCCAAC GTCAAAGGGC GAAAAACCGT CTATCAGGGC 1861 GATGGCCCAC TACGTGAACC ATCACCCTAA TCAAGTTTTT TGGGGTCGAG GTGCCGTAAA 1921 GCACTAAATC GGAACCCTAA AGGGAGCCCC CGATTTAGAG CTTGACGGGG AAAGCCGGCG 1981 AACGTGGCGA GAAAGGAAGG GAAGAAAGCG AAAGGAGCGG GCGCTAGGGC GCTGGCAAGT 2041 GTAGCGGTCA CGCTGCGCGT AACCACCACA CCCGCCGCGC TTAATGCGCC GCTACAGGGC 2101 GCGTCAGGTG GCACTTTTCG GGGAAATGTG CGCGGAACCC CTATTTGTTT ATTTTTCTAA 2161 ATACATTCAA ATATGTATCC GCTCATGAGA CAATAACCCT GATAAATGCT TCAATAATAT 2221 TGAAAAAGGA AGAGTCCTGA GGCGGAAAGA ACCAGCTGTG GAATGTGTGT CAGTTAGGGT 2281 GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT 2341 CAGCAACCAG GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC 2401 ATCTCAATTA GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 2461 CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT TATGCAGAGG 2521 CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG AGGAGGCTTT TTTGGAGGCC 2581 TAGGCTTTTG CAAAGATCGA TCAAGAGACA GGATGAGGAT CGTTTCGCAT GATTGAACAA 2641 GATGGATTGC ACGCAGGTTC TCCGGCCGCT TGGGTGGAGA GGCTATTCGG CTATGACTGG 2701 GCACAACAGA CAATCGGCTG CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC 2761 CCGGTTCTTT TTGTCAAGAC CGACCTGTCC GGTGCCCTGA ATGAACTGCA AGACGAGGCA 2821 GCGCGGCTAT CGTGGCTGGC CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC 2881 ACTGAAGCGG GAAGGGACTG GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA 2941 TCTCACCTTG CTCCTGCCGA GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT 3001 ACGCTTGATC CGGCTACCTG CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA 3061 CGTACTCGGA TGGAAGCCGG TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG 3121 CTCGCGCCAG CCGAACTGTT CGCCAGGCTC AAGGCGAGCA TGCCCGACGG CGAGGATCTC 3181 GTCGTGACCC ATGGCGATGC CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT 3241 GGATTCATCG ACTGTGGCCG GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT 3301 ACCCGTGATA TTGCTGAAGA GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC 3361 GGTATCGCCG CTCCCGATTC GCAGCGCATC 
GCCTTCTATC GCCTTCTTGA CGAGTTCTTC 3421 TGAGCGGGAC TCTGGGGTTC GAAATGACCG ACCAAGCGAC GCCCAACCTG CCATCACGAG 3481 ATTTCGATTC CACCGCCGCC TTCTATGAAA GGTTGGGCTT CGGAATCGTT TTCCGGGACG 3541 CCGGCTGGAT GATCCTCCAG CGCGGGGATC TCATGCTGGA GTTCTTCGCC CACCCTAGGG 3601 GGAGGCTAAC TGAAACACGG AAGGAGACAA TACCGGAAGG AACCCGCGCT ATGACGGCAA 3661 TAAAAAGACA GAATAAAACG CACGGTGTTG GGTCGTTTGT TCATAAACGC GGGGTTCGGT 3721 CCCAGGGCTG GCACTCTGTC GATACCCCAC CGAGACCCCA TTGGGGCCAA TACGCCCGCG 3781 TTTCTTCCTT TTCCCCACCC CACCCCCCAA GTTCGGGTGA AGGCCCAGGG CTCGCAGCCA 3841 ACGTCGGGGC GGCAGGCCCT GCCATAGCCT CAGGTTACTC ATATATACTT TAGATTGATT 3901 TAAAACTTCA TTTTTAATTT AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA 3961 CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA 4021 AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC 4081 CACCGCTACC AGCGGTGGTT TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG 4141 TAACTGGCTT CAGCAGAGCG CAGATACCAA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG 4201 GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC 4261 CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT 4321 TACCGGATAA GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG 4381 AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA GCTATGAGAA AGCGCCACGC 4441 TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC 4501 GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC 4561 ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC CTATGGAAAA 4621 ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG CTGGCCTTTT GCTCACATGT 4681 TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT TACCGCCATG CAT //