LOCUS PLUCF.TXT 6936 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_feature complement(>49..57) /note="SV40 minor late 19S RNA" promoter <49..>393 /note="SV40 promoter (∆SELP)" enhancer 71..213 /note="SV40 enhancer elements" misc_feature 218..281 /note="SV40 21bp repeats" rep_origin 238..>373 /note="SV40 Ori" mRNA 287..293 /note="late-early transcription startpoints" mutation 307..307 /note="G->A kills SELP ATG" mRNA 327..333 /note="early-early transcription start points" CDS 418..1137 /note="EGFP" polyA_signal 1153..>1374 /note="SV40 Late PolyA" mutation 1444..1444 /note="missing A?" promoter <1465..2643 /note="EF1a promoter and UTR" misc_difference 1465..>1656 /note="rhesus-derived" exon 1663..1695 /note="EF-1a exon 1" intron 1696..2635 /note="EF-1a intron A" exon 2635..2643 /note="EF-1a exon 2 leader" CDS 2690..4342 /note="Luc2 (codmod, weighs 60.6kD)" mRNA 4395..>4983 /note="WPRE" polyA_signal 5019..5687 /note="hEF1a polyA signal" polyA_site 5317..5317 /note="site of polyA addition" rep_origin 5688..6421 /note="MB1 Ori" promoter 6422..6508 /note="EM7 promoter" CDS 6509..6883 /note="Zeo" terminator 6884..6936 /note="terminator (rpmB/G)" BASE COUNT 1570 A 1852 C 1916 G 1598 T 0 OTHER ORIGIN - 1 CCCTGCAGGG CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT GTGGAATGTG 61 TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 121 CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT 181 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC TCCGCCCATC 241 CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC ATGGCTGACT AATTTTTTTT 301 ATTTATACAG AGGCCGAGGC CGCCTCGGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC 361 TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTTGATTGGG ATCCACCGGT CGCCACCATG 421 GTGAGCAAGG GCGAGGAGCT GTTCACCGGG GTGGTGCCCA TCCTGGTCGA GCTGGACGGC 481 GACGTAAACG GCCACAAGTT CAGCGTGTCC GGCGAGGGCG AGGGCGATGC CACCTACGGC 541 AAGCTGACCC TGAAGTTCAT CTGCACCACC GGCAAGCTGC CCGTGCCCTG GCCCACCCTC 601 GTGACCACCC TGACCTACGG CGTGCAGTGC TTCAGCCGCT ACCCCGACCA CATGAAGCAG 661 CACGACTTCT TCAAGTCCGC CATGCCCGAA GGCTACGTCC AGGAGCGCAC CATCTTCTTC 721 AAGGACGACG GCAACTACAA GACCCGCGCC GAGGTGAAGT TCGAGGGCGA CACCCTGGTG 781 AACCGCATCG AGCTGAAGGG CATCGACTTC AAGGAGGACG GCAACATCCT GGGGCACAAG 841 CTGGAGTACA ACTACAACAG CCACAACGTC TATATCATGG CCGACAAGCA GAAGAACGGC 901 ATCAAGGTGA ACTTCAAGAT CCGCCACAAC ATCGAGGACG GCAGCGTGCA GCTCGCCGAC 961 CACTACCAGC AGAACACCCC CATCGGCGAC GGCCCCGTGC TGCTGCCCGA CAACCACTAC 1021 CTGAGCACCC AGTCCGCCCT GAGCAAAGAC CCCAACGAGA AGCGCGATCA CATGGTCCTG 1081 CTGGAGTTCG TGACCGCCGC CGGGATCACT CTCGGCATGG ACGAGCTGTA CAAGTAAAGC 1141 GGCCGCTTCG AGCAGACATG ATAAGATACA TTGATGAGTT TGGACAAACC ACAACTAGAA 1201 TGCAGTGAAA AAAATGCTTT ATTTGTGAAA TTTGTGATGC TATTGCTTTA TTTGTAACCA 1261 TTATAAGCTG CAATAAACAA GTTAACAACA ACAATTGCAT TCATTTTATG TTTCAGGTTC 1321 AGGGGGAGGT GTGGGAGGTT TTTTAAAGCA AGTAAAACCT CTACAAATGT GGTAAAATCG 1381 ATAAGGATCC GGGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT 1441 TGCGGTGGAG AAGAGCATGC GTGAGGCTCC GGTGCCCGTC AGTGGGCAGA GCGCACATCG 1501 CCCACAGTCC CCGAGAAGTT GGGGGGAGGG GTCGGCAATT GAACCGGTGC CTAGAGAAGG 1561 TGGCGCGGGG TAAACTGGGA AAGTGATGTC GTGTACTGGC TCCGCCCTTT TCCCGAGGGT 1621 GGGGGAGAAC CGTATATAAG TGCAGTAGTC GCTGTGAACG TTCTTTTTCG CAACGGGTTT 1681 GCCGCCAGAA CACAGGTAAG TGCCGTGTGT GGTTCCCGCG GGCCTGGCCT CTTTACGGGT 1741 TATGGCCCTT GCGTGCCTTG AATTACTTCC ACCTGGCTGC AGTACGTGAT TCTTGATCCC 1801 GAGCTTCGGG TTGGAAGTGG GTGGGAGAGT TCGAGGCCTT GCGCTTAAGG AGCCCCTTCG 1861 CCTCGTGCTT GAGTTGAGGC CTGGCCTGGG CGCTGGGGCC GCCGCGTGCG AATCTGGTGG 1921 CACCTTCGCG CCTGTCTCGC TGCTTTCGAT AAGTCTCTAG CCATTTAAAA TTTTTGATGA 1981 CCTGCTGCGA CGCTTTTTTT CTGGCAAGAT AGTCTTGTAA ATGCGGGCCA AGATCTGCAC 2041 ACTGGTATTT CGGTTTTTGG GGCCGCGGGC GGCGACGGGG CCCGTGCGTC CCAGCGCACA 2101 TGTTCGGCGA GGCGGGGCCT GCGAGCGCGG CCACCGAGAA TCGGACGGGG GTAGTCTCAA 2161 GCTGGCCGGC CTGCTCTGGT GCCTGGCCTC GCGCCGCCGT GTATCGCCCC GCCCTGGGCG 2221 GCAAGGCTGG CCCGGTCGGC ACCAGTTGCG TGAGCGGAAA GATGGCCGCT TCCCGGCCCT 2281 GCTGCAGGGA GCTCAAAATG GAGGACGCGG CGCTCGGGAG AGCGGGCGGG TGAGTCACCC 2341 ACACAAAGGA AAAGGGCCTT TCCGTCCTCA GCCGTCGCTT CATGTGACTC CACGGAGTAC 2401 CGGGCGCCGT CCAGGCACCT CGATTAGTTC TCGAGCTTTT GGAGTACGTC GTCTTTAGGT 2461 TGGGGGGAGG GGTTTTATGC GATGGAGTTT CCCCACACTG AGTGGGTGGA GACTGAAGTT 2521 AGGCCAGCTT GGCACTTGAT GTAATTCTCC TTGGAATTTG CCCTTTTTGA GTTTGGATCT 2581 TGGTTCATTC TCAAGCCTCA GACAGTGGTT CAAAGTTTTT TTCTTCCATT TCAGGTGTCG 2641 TGAGGAATTC TCTAGAGCTT GGCAATCCGG TACTGTTGGT AAAGCCACCA TGGAAGATGC 2701 CAAAAACATT AAGAAGGGCC CAGCGCCATT CTACCCACTC GAAGACGGGA CCGCCGGCGA 2761 GCAGCTGCAC AAAGCCATGA AGCGCTACGC CCTGGTGCCC GGCACCATCG CCTTTACCGA 2821 CGCACATATC GAGGTGGACA TTACCTACGC CGAGTACTTC GAGATGAGCG TTCGGCTGGC 2881 AGAAGCTATG AAGCGCTATG GGCTGAATAC AAACCATCGG ATCGTGGTGT GCAGCGAGAA 2941 TAGCTTGCAG TTCTTCATGC CCGTGTTGGG TGCCCTGTTC ATCGGTGTGG CTGTGGCCCC 3001 AGCTAACGAC ATCTACAACG AGCGCGAGCT GCTGAACAGC ATGGGCATCA GCCAGCCCAC 3061 CGTCGTATTC GTGAGCAAGA AAGGGCTGCA AAAGATCCTC AACGTGCAAA AGAAGCTACC 3121 GATCATACAA AAGATCATCA TCATGGATAG CAAGACCGAC TACCAGGGCT TCCAAAGCAT 3181 GTACACCTTC GTGACTTCCC ATTTGCCACC CGGCTTCAAC GAGTACGACT TCGTGCCCGA 3241 GAGCTTCGAC CGGGACAAAA CCATCGCCCT GATCATGAAC AGTAGTGGCA GTACCGGATT 3301 GCCCAAGGGC GTAGCCCTAC CGCACCGCAC CGCTTGTGTC CGATTCAGTC ATGCCCGCGA 3361 CCCCATCTTC GGCAACCAGA TCATCCCCGA CACCGCTATC CTCAGCGTGG TGCCATTTCA 3421 CCACGGCTTC GGCATGTTCA CCACGCTGGG CTACTTGATC TGCGGCTTTC GGGTCGTGCT 3481 CATGTACCGC TTCGAGGAGG AGCTATTCTT GCGCAGCTTG CAAGACTATA AGATTCAATC 3541 TGCCCTGCTG GTGCCCACAC TATTTAGCTT CTTCGCTAAG AGCACTCTCA TCGACAAGTA 3601 CGACCTAAGC AACTTGCACG AGATCGCCAG CGGCGGGGCG CCGCTCAGCA AGGAGGTAGG 3661 TGAGGCCGTG GCCAAACGCT TCCACCTACC AGGCATCCGC CAGGGCTACG GCCTGACAGA 3721 AACAACCAGC GCCATTCTGA TCACCCCCGA AGGGGACGAC AAGCCTGGCG CAGTAGGCAA 3781 GGTGGTGCCC TTCTTCGAGG CTAAGGTGGT GGACTTGGAC ACCGGTAAGA CACTGGGTGT 3841 GAACCAGCGC GGCGAGCTGT GCGTCCGTGG CCCCATGATC ATGAGCGGCT ACGTTAACAA 3901 CCCCGAGGCT ACAAACGCTC TCATCGACAA GGACGGCTGG CTGCACAGCG GCGACATCGC 3961 CTACTGGGAC GAGGACGAGC ACTTCTTCAT CGTGGACCGG CTGAAGAGCC TGATCAAATA 4021 CAAGGGCTAC CAGGTAGCCC CAGCCGAACT GGAGAGCATC CTGCTGCAAC ACCCCAACAT 4081 CTTCGACGCC GGGGTCGCCG GCCTGCCCGA CGACGATGCC GGCGAGCTGC CCGCCGCAGT 4141 CGTCGTGCTG GAACACGGTA AAACCATGAC CGAGAAGGAG ATCGTGGACT ATGTGGCCAG 4201 CCAGGTTACA ACCGCCAAGA AGCTGCGCGG TGGTGTTGTG TTCGTGGACG AGGTGCCTAA 4261 AGGACTGACC GGCAAGTTGG ACGCCCGCAA GATCCGCGAG ATTCTCATTA AGGCCAAGAA 4321 GGGCGGCAAG ATCGCCGTGT AATAATTCTA GTGGATCCCC CGGGCTGCAG GAATTCGATA 4381 TCAAGCTTAT CGATAATCAA CCTCTGGATT ACAAAATTTG TGAAAGATTG ACTGGTATTC 4441 TTAACTATGT TGCTCCTTTT ACGCTATGTG GATACGCTGC TTTAATGCCT TTGTATCATG 4501 CTATTGCTTC CCGTATGGCT TTCATTTTCT CCTCCTTGTA TAAATCCTGG TTGCTGTCTC 4561 TTTATGAGGA GTTGTGGCCC GTTGTCAGGC AACGTGGCGT GGTGTGCACT GTGTTTGCTG 4621 ACGCAACCCC CACTGGTTGG GGCATTGCCA CCACCTGTCA GCTCCTTTCC GGGACTTTCG 4681 CTTTCCCCCT CCCTATTGCC ACGGCGGAAC TCATCGCCGC CTGCCTTGCC CGCTGCTGGA 4741 CAGGGGCTCG GCTGTTGGGC ACTGACAATT CCGTGGTGTT GTCGGGGAAA TCATCGTCCT 4801 TTCCTTGGCT GCTCGCCTGT GTTGCCACCT GGATTCTGCG CGGGACGTCC TTCTGCTACG 4861 TCCCTTCGGC CCTCAATCCA GCGGACCTTC CTTCCCGCGG CCTGCTGCCG GCTCTGCGGC 4921 CTCTTCCGCG TCTTCGCCTT CGCCCTCAGA CGAGTCGGAT CTCCCTTTGG GCCGCCTCCC 4981 CGCATCGATA CCGTCGGCCC ACTGCTCCCT AAACCTGAGC TAGCATTATC CCTAATACCT 5041 GCCACCCCAC TCTTAATCAG TGGTGGAAGA ACGGTCTCAG AACTGTTTGT TTCAATTGGC 5101 CATTTAAGTT TAGTAGTAAA AGACTGGTTA ATGATAACAA TGCATCGTAA AACCTTCAGA 5161 AGGAAAGGAG AATGTTTTGT GGACCACTTT GGTTTTCTTT TTTGCGTGTG GCAGTTTTAA 5221 GTTATTAGTT TTTAAAATCA GTACTTTTTA ATGGAAACAA CTTGACCAAA AATTTGTCAC 5281 AGAATTTTGA GACCCATTAA AAAAGTTAAA TGAGAAACCT GTGTGTTCCT TTGGTCAACA 5341 CCGAGACATT TAGGTGAAAG ACATCTAATT CTGGTTTTAC GAATCTGGAA ACTTCTTGAA 5401 AATGTAATTC TTGAGTTAAC ACTTCTGGGT GGAGAATAGG GTTGTTTTCC CCCCACATAA 5461 TTGGAAGGGG AAGGAATATC ATTTAAAGCT ATGGGAGGGT TTCTTTGATT ACAACACTGG 5521 AGAGAAATGC AGCATGTTGC TGATTGCCTG TCACTAAAAC AGGCCAAAAA CTGAGTCCTT 5581 GGGTTGCATA GAAAGCTTCA TGTTGCTAAA CCAATGTTAA GTGAATCTTT GGAAACAAAA 5641 TGTTTCCAAA TTACTGGGAT GTGCATGTTG AAACGTGGGT TAATTAACTA GCCATGACCA 5701 AAATCCCTTA ACGTGAGTTT TCGTTCCACT GAGCGTCAGA CCCCGTAGAA AAGATCAAAG 5761 GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG CTTGCAAACA AAAAAACCAC 5821 CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC AACTCTTTTT CCGAAGGTAA 5881 CTGGCTTCAG CAGAGCGCAG ATACCAAATA CTGTTCTTCT AGTGTAGCCG TAGTTAGGCC 5941 ACCACTTCAA GAACTCTGTA GCACCGCCTA CATACCTCGC TCTGCTAATC CTGTTACCAG 6001 TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC TTACCGGGTT GGACTCAAGA CGATAGTTAC 6061 CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG CACACAGCCC AGCTTGGAGC 6121 GAACGACCTA CACCGAACTG AGATACCTAC AGCGTGAGCT ATGAGAAAGC GCCACGCTTC 6181 CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCAG GGTCGGAACA GGAGAGCGCA 6241 CGAGGGAGCT TCCAGGGGGA AACGCCTGGT ATCTTTATAG TCCTGTCGGG TTTCGCCACC 6301 TCTGACTTGA GCGTCGATTT TTGTGATGCT CGTCAGGGGG GCGGAGCCTA TGGAAAAACG 6361 CCAGCAACGC GGCCTTTTTA CGGTTCCTGG CCTTTTGCTG GCCTTTTGCT CACATGTTCT 6421 TAATTAAATT TTTCAAAAGT AGTTGACAAT TAATCATCGG CATAGTATAT CGGCATAGTA 6481 TAATACGACT CACTATAGGA GGGCCATCAT GGCCAAGTTG ACCAGTGCTG TCCCAGTGCT 6541 CACAGCCAGG GATGTGGCTG GAGCTGTTGA GTTCTGGACT GACAGGTTGG GGTTCTCCAG 6601 AGATTTTGTG GAGGATGACT TTGCAGGTGT GGTCAGAGAT GATGTCACCC TGTTCATCTC 6661 AGCAGTCCAG GACCAGGTGG TGCCTGACAA CACCCTGGCT TGGGTGTGGG TGAGAGGACT 6721 GGATGAGCTG TATGCTGAGT GGAGTGAGGT GGTCTCCACC AACTTCAGGG ATGCCAGTGG 6781 CCCTGCCATG ACAGAGATTG GAGAGCAGCC CTGGGGGAGA GAGTTTGCCC TGAGAGACCC 6841 AGCAGGCAAC TGTGTGCACT TTGTGGCAGA GGAGCAGGAC TGAGGATAAG AATTGTAACA 6901 AAAAACCCCG CCCCGGCGGG GTTTTTTGTT AATTAA //