LOCUS PGWF.TXT 6974 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_feature complement(>49..57) /note="SV40 minor late 19S RNA" promoter <49..>393 /note="SV40 promoter (∆SELP)" enhancer 71..213 /note="SV40 enhancer elements" misc_feature 218..281 /note="SV40 21bp repeats" rep_origin 238..>373 /note="SV40 Ori" mRNA 287..293 /note="late-early transcription startpoints" mutation 307..307 /note="G->A kills SELP ATG" mRNA 327..333 /note="early-early transcription start points" CDS 418..1137 /note="EGFP" polyA_signal 1153..>1374 /note="SV40 Late PolyA" promoter <1465..2643 /note="EF1a promoter and UTR" exon 1663..1695 /note="EF-1a exon 1" intron 1696..2635 /note="EF-1a intron A" exon 2635..2643 /note="EF-1a exon 2 leader" insertion_seq 2667..2791 /note="attR1" CDS 2900..3559 /note="CAT" CDS 3901..4206 /note="ccdB" insertion_seq 4247..4371 /note="attR2" mRNA 4433..>5021 /note="WPRE" polyA_signal 5057..5725 /note="hEF1a polyA signal" polyA_site 5355..5355 /note="site of polyA addition" rep_origin 5726..6459 /note="MB1 Ori" promoter 6460..6546 /note="EM7 promoter" CDS 6547..6921 /note="Zeo" terminator 6922..6974 /note="terminator (rpmB/G)" BASE COUNT 1657 A 1695 C 1853 G 1769 T 0 OTHER ORIGIN - 1 CCCTGCAGGG CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT GTGGAATGTG 61 TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 121 CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT 181 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC TCCGCCCATC 241 CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC ATGGCTGACT AATTTTTTTT 301 ATTTATACAG AGGCCGAGGC CGCCTCGGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC 361 TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTTGATTGGG ATCCACCGGT CGCCACCATG 421 GTGAGCAAGG GCGAGGAGCT GTTCACCGGG GTGGTGCCCA TCCTGGTCGA GCTGGACGGC 481 GACGTAAACG GCCACAAGTT CAGCGTGTCC GGCGAGGGCG AGGGCGATGC CACCTACGGC 541 AAGCTGACCC TGAAGTTCAT CTGCACCACC GGCAAGCTGC CCGTGCCCTG GCCCACCCTC 601 GTGACCACCC TGACCTACGG CGTGCAGTGC TTCAGCCGCT ACCCCGACCA CATGAAGCAG 661 CACGACTTCT TCAAGTCCGC CATGCCCGAA GGCTACGTCC AGGAGCGCAC CATCTTCTTC 721 AAGGACGACG GCAACTACAA GACCCGCGCC GAGGTGAAGT TCGAGGGCGA CACCCTGGTG 781 AACCGCATCG AGCTGAAGGG CATCGACTTC AAGGAGGACG GCAACATCCT GGGGCACAAG 841 CTGGAGTACA ACTACAACAG CCACAACGTC TATATCATGG CCGACAAGCA GAAGAACGGC 901 ATCAAGGTGA ACTTCAAGAT CCGCCACAAC ATCGAGGACG GCAGCGTGCA GCTCGCCGAC 961 CACTACCAGC AGAACACCCC CATCGGCGAC GGCCCCGTGC TGCTGCCCGA CAACCACTAC 1021 CTGAGCACCC AGTCCGCCCT GAGCAAAGAC CCCAACGAGA AGCGCGATCA CATGGTCCTG 1081 CTGGAGTTCG TGACCGCCGC CGGGATCACT CTCGGCATGG ACGAGCTGTA CAAGTAAAGC 1141 GGCCGCTTCG AGCAGACATG ATAAGATACA TTGATGAGTT TGGACAAACC ACAACTAGAA 1201 TGCAGTGAAA AAAATGCTTT ATTTGTGAAA TTTGTGATGC TATTGCTTTA TTTGTAACCA 1261 TTATAAGCTG CAATAAACAA GTTAACAACA ACAATTGCAT TCATTTTATG TTTCAGGTTC 1321 AGGGGGAGGT GTGGGAGGTT TTTTAAAGCA AGTAAAACCT CTACAAATGT GGTAAAATCG 1381 ATAAGGATCC GGGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT 1441 TGCGGTGGAG AAGAGCATGC GTGAGGCTCC GGTGCCCGTC AGTGGGCAGA GCGCACATCG 1501 CCCACAGTCC CCGAGAAGTT GGGGGGAGGG GTCGGCAATT GAACCGGTGC CTAGAGAAGG 1561 TGGCGCGGGG TAAACTGGGA AAGTGATGTC GTGTACTGGC TCCGCCCTTT TCCCGAGGGT 1621 GGGGGAGAAC CGTATATAAG TGCAGTAGTC GCTGTGAACG TTCTTTTTCG CAACGGGTTT 1681 GCCGCCAGAA CACAGGTAAG TGCCGTGTGT GGTTCCCGCG GGCCTGGCCT CTTTACGGGT 1741 TATGGCCCTT GCGTGCCTTG AATTACTTCC ACCTGGCTGC AGTACGTGAT TCTTGATCCC 1801 GAGCTTCGGG TTGGAAGTGG GTGGGAGAGT TCGAGGCCTT GCGCTTAAGG AGCCCCTTCG 1861 CCTCGTGCTT GAGTTGAGGC CTGGCCTGGG CGCTGGGGCC GCCGCGTGCG AATCTGGTGG 1921 CACCTTCGCG CCTGTCTCGC TGCTTTCGAT AAGTCTCTAG CCATTTAAAA TTTTTGATGA 1981 CCTGCTGCGA CGCTTTTTTT CTGGCAAGAT AGTCTTGTAA ATGCGGGCCA AGATCTGCAC 2041 ACTGGTATTT CGGTTTTTGG GGCCGCGGGC GGCGACGGGG CCCGTGCGTC CCAGCGCACA 2101 TGTTCGGCGA GGCGGGGCCT GCGAGCGCGG CCACCGAGAA TCGGACGGGG GTAGTCTCAA 2161 GCTGGCCGGC CTGCTCTGGT GCCTGGCCTC GCGCCGCCGT GTATCGCCCC GCCCTGGGCG 2221 GCAAGGCTGG CCCGGTCGGC ACCAGTTGCG TGAGCGGAAA GATGGCCGCT TCCCGGCCCT 2281 GCTGCAGGGA GCTCAAAATG GAGGACGCGG CGCTCGGGAG AGCGGGCGGG TGAGTCACCC 2341 ACACAAAGGA AAAGGGCCTT TCCGTCCTCA GCCGTCGCTT CATGTGACTC CACGGAGTAC 2401 CGGGCGCCGT CCAGGCACCT CGATTAGTTC TCGAGCTTTT GGAGTACGTC GTCTTTAGGT 2461 TGGGGGGAGG GGTTTTATGC GATGGAGTTT CCCCACACTG AGTGGGTGGA GACTGAAGTT 2521 AGGCCAGCTT GGCACTTGAT GTAATTCTCC TTGGAATTTG CCCTTTTTGA GTTTGGATCT 2581 TGGTTCATTC TCAAGCCTCA GACAGTGGTT CAAAGTTTTT TTCTTCCATT TCAGGTGTCG 2641 TGAGGAATTC TCTAGAGCTT GATCAAACAA GTTTGTACAA AAAAGCTGAA CGAGAAACGT 2701 AAAATGATAT AAATATCAAT ATATTAAATT AGATTTTGCA TAAAAAACAG ACTACATAAT 2761 ACTGTAAAAC ACAACATATC CAGTCACTAT GGCGGCCGCA TTAGGCACCC CAGGCTTTAC 2821 ACTTTATGCT TCCGGCTCGT ATAATGTGTG GATTTTGAGT TAGGATCCGT CGAGATTTTC 2881 AGGAGCTAAG GAAGCTAAAA TGGAGAAAAA AATCACTGGA TATACCACCG TTGATATATC 2941 CCAATGGCAT CGTAAAGAAC ATTTTGAGGC ATTTCAGTCA GTTGCTCAAT GTACCTATAA 3001 CCAGACCGTT CAGCTGGATA TTACGGCCTT TTTAAAGACC GTAAAGAAAA ATAAGCACAA 3061 GTTTTATCCG GCCTTTATTC ACATTCTTGC CCGCCTGATG AATGCTCATC CGGAATTCCG 3121 TATGGCAATG AAAGACGGTG AGCTGGTGAT ATGGGATAGT GTTCACCCTT GTTACACCGT 3181 TTTCCATGAG CAAACTGAAA CGTTTTCATC GCTCTGGAGT GAATACCACG ACGATTTCCG 3241 GCAGTTTCTA CACATATATT CGCAAGATGT GGCGTGTTAC GGTGAAAACC TGGCCTATTT 3301 CCCTAAAGGG TTTATTGAGA ATATGTTTTT CGTCTCAGCC AATCCCTGGG TGAGTTTCAC 3361 CAGTTTTGAT TTAAACGTGG CCAATATGGA CAACTTCTTC GCCCCCGTTT TCACCATGGG 3421 CAAATATTAT ACGCAAGGCG ACAAGGTGCT GATGCCGCTG GCGATTCAGG TTCATCATGC 3481 CGTCTGTGAT GGCTTCCATG TCGGCAGAAT GCTTAATGAA TTACAACAGT ACTGCGATGA 3541 GTGGCAGGGC GGGGCGTAAT CTAGAGGATC CGGCTTACTA AAAGCCAGAT AACAGTATGC 3601 GTATTTGCGC GCTGATTTTT GCGGTATAAG AATATATACT GATATGTATA CCCGAAGTAT 3661 GTCAAAAAGA GGTGTGCTAT GAAGCAGCGT ATTACAGTGA CAGTTGACAG CGACAGCTAT 3721 CAGTTGCTCA AGGCATATAT GATGTCAATA TCTCCGGTCT GGTAAGCACA ACCATGCAGA 3781 ATGAAGCCCG TCGTCTGCGT GCCGAACGCT GGAAAGCGGA AAATCAGGAA GGGATGGCTG 3841 AGGTCGCCCG GTTTATTGAA ATGAACGGCT CTTTTGCTGA CGAGAACAGG GACTGGTGAA 3901 ATGCAGTTTA AGGTTTACAC CTATAAAAGA GAGAGCCGTT ATCGTCTGTT TGTGGATGTA 3961 CAGAGTGATA TTATTGACAC GCCCGGGCGA CGGATGGTGA TCCCCCTGGC CAGTGCACGT 4021 CTGCTGTCAG ATAAAGTCTC CCGTGAACTT TACCCGGTGG TGCATATCGG GGATGAAAGC 4081 TGGCGCATGA TGACCACCGA TATGGCCAGT GTGCCGGTCT CCGTTATCGG GGAAGAAGTG 4141 GCTGATCTCA GCCACCGCGA AAATGACATC AAAAACGCCA TTAACCTGAT GTTCTGGGGA 4201 ATATAAATGT CAGGCTCCGT TATACACAGC CAGTCTGCAG GTCGACCATA GTGACTGGAT 4261 ATGTTGTGTT TTACAGTATT ATGTAGTCTG TTTTTTATGC AAAATCTAAT TTAATATATT 4321 GATATTTATA TCATTTTACG TTTCTCGTTC AGCTTTCTTG TACAAAGTGG TTCGATCTAG 4381 AATGGCTAGT GGATCCCCCG GGCTGCAGGA ATTCGATATC AAGCTTATCG ATAATCAACC 4441 TCTGGATTAC AAAATTTGTG AAAGATTGAC TGGTATTCTT AACTATGTTG CTCCTTTTAC 4501 GCTATGTGGA TACGCTGCTT TAATGCCTTT GTATCATGCT ATTGCTTCCC GTATGGCTTT 4561 CATTTTCTCC TCCTTGTATA AATCCTGGTT GCTGTCTCTT TATGAGGAGT TGTGGCCCGT 4621 TGTCAGGCAA CGTGGCGTGG TGTGCACTGT GTTTGCTGAC GCAACCCCCA CTGGTTGGGG 4681 CATTGCCACC ACCTGTCAGC TCCTTTCCGG GACTTTCGCT TTCCCCCTCC CTATTGCCAC 4741 GGCGGAACTC ATCGCCGCCT GCCTTGCCCG CTGCTGGACA GGGGCTCGGC TGTTGGGCAC 4801 TGACAATTCC GTGGTGTTGT CGGGGAAATC ATCGTCCTTT CCTTGGCTGC TCGCCTGTGT 4861 TGCCACCTGG ATTCTGCGCG GGACGTCCTT CTGCTACGTC CCTTCGGCCC TCAATCCAGC 4921 GGACCTTCCT TCCCGCGGCC TGCTGCCGGC TCTGCGGCCT CTTCCGCGTC TTCGCCTTCG 4981 CCCTCAGACG AGTCGGATCT CCCTTTGGGC CGCCTCCCCG CATCGATACC GTCGGCCCAC 5041 TGCTCCCTAA ACCTGAGCTA GCATTATCCC TAATACCTGC CACCCCACTC TTAATCAGTG 5101 GTGGAAGAAC GGTCTCAGAA CTGTTTGTTT CAATTGGCCA TTTAAGTTTA GTAGTAAAAG 5161 ACTGGTTAAT GATAACAATG CATCGTAAAA CCTTCAGAAG GAAAGGAGAA TGTTTTGTGG 5221 ACCACTTTGG TTTTCTTTTT TGCGTGTGGC AGTTTTAAGT TATTAGTTTT TAAAATCAGT 5281 ACTTTTTAAT GGAAACAACT TGACCAAAAA TTTGTCACAG AATTTTGAGA CCCATTAAAA 5341 AAGTTAAATG AGAAACCTGT GTGTTCCTTT GGTCAACACC GAGACATTTA GGTGAAAGAC 5401 ATCTAATTCT GGTTTTACGA ATCTGGAAAC TTCTTGAAAA TGTAATTCTT GAGTTAACAC 5461 TTCTGGGTGG AGAATAGGGT TGTTTTCCCC CCACATAATT GGAAGGGGAA GGAATATCAT 5521 TTAAAGCTAT GGGAGGGTTT CTTTGATTAC AACACTGGAG AGAAATGCAG CATGTTGCTG 5581 ATTGCCTGTC ACTAAAACAG GCCAAAAACT GAGTCCTTGG GTTGCATAGA AAGCTTCATG 5641 TTGCTAAACC AATGTTAAGT GAATCTTTGG AAACAAAATG TTTCCAAATT ACTGGGATGT 5701 GCATGTTGAA ACGTGGGTTA ATTAACTAGC CATGACCAAA ATCCCTTAAC GTGAGTTTTC 5761 GTTCCACTGA GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG ATCCTTTTTT 5821 TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG TGGTTTGTTT 5881 GCCGGATCAA GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA GAGCGCAGAT 5941 ACCAAATACT GTTCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA ACTCTGTAGC 6001 ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA GTGGCGATAA 6061 GTCGTGTCTT ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC AGCGGTCGGG 6121 CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA CCGAACTGAG 6181 ATACCTACAG CGTGAGCTAT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA AGGCGGACAG 6241 GTATCCGGTA AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC CAGGGGGAAA 6301 CGCCTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC GTCGATTTTT 6361 GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG CCTTTTTACG 6421 GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTA ATTAAATTTT TCAAAAGTAG 6481 TTGACAATTA ATCATCGGCA TAGTATATCG GCATAGTATA ATACGACTCA CTATAGGAGG 6541 GCCATCATGG CCAAGTTGAC CAGTGCTGTC CCAGTGCTCA CAGCCAGGGA TGTGGCTGGA 6601 GCTGTTGAGT TCTGGACTGA CAGGTTGGGG TTCTCCAGAG ATTTTGTGGA GGATGACTTT 6661 GCAGGTGTGG TCAGAGATGA TGTCACCCTG TTCATCTCAG CAGTCCAGGA CCAGGTGGTG 6721 CCTGACAACA CCCTGGCTTG GGTGTGGGTG AGAGGACTGG ATGAGCTGTA TGCTGAGTGG 6781 AGTGAGGTGG TCTCCACCAA CTTCAGGGAT GCCAGTGGCC CTGCCATGAC AGAGATTGGA 6841 GAGCAGCCCT GGGGGAGAGA GTTTGCCCTG AGAGACCCAG CAGGCAACTG TGTGCACTTT 6901 GTGGCAGAGG AGCAGGACTG AGGATAAGAA TTGTAACAAA AAACCCCGCC CCGGCGGGGT 6961 TTTTTGTTAA TTAA //