LOCUS PTWB.TXT 6519 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_feature complement(>49..57) /note="SV40 minor late 19S RNA" promoter <49..>392 /note="SV40 promoter (∆SELP)" enhancer 71..213 /note="SV40 enhancer elements" misc_feature 218..281 /note="SV40 21bp repeats" rep_origin 238..>392 /note="SV40 Ori" mRNA 287..293 /note="late-early transcription start points" mutation 307..307 /note="G->A kills SELP ATG" mRNA 327..333 /note="early-early transcription start points" misc_feature 392..392 /note="end homology to SV40" promoter 435..481 /note="EM7 promoter" CDS 501..923 /note="Blasticidin R" polyA_signal 953..>1174 /note="SV40 Late PolyA" mutation 1244..1244 /note="missing A?" enhancer <1265..1462 /note="EF-1alpha promoter (rhesus) core domain" promoter <1265..>2443 /note="EF1a promoter and UTR" exon 1463..1495 /note="EF-1a exon 1" intron 1496..2435 /note="EF-1a intron A" exon 2435..2443 /note="EF-1a exon 2 leader" STS <2452..2477 /note="T7 gene 10 [Split]" CDS <2457..>2519 /note="leader" STS 2484..2507 /note="Anti-Xpress epitope" RBS 2513..2519 /note="Kozak" CDS 2520..3950 /note="tdTomato" mat_peptide 2534..3197 /note="dsRed repeat A" mat_peptide 3267..3923 /note="dsRed repeat B" mRNA 3978..>4566 /note="WPRE" polyA_signal 4602..5270 /note="hEF1a polyA signal" rep_origin 5271..6004 /note="MB1 Ori" promoter 6025..6071 /note="EM7 promoter" CDS 6092..6466 /note="Zeo R" terminator 6467..6519 /note="terminator (rpmB/G)" BASE COUNT 1462 A 1717 C 1821 G 1519 T 0 OTHER ORIGIN - 1 CCCTGCAGGG CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT GTGGAATGTG 61 TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 121 CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT 181 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC TCCGCCCATC 241 CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC ATGGCTGACT AATTTTTTTT 301 ATTTATACAG AGGCCGAGGC CGCCTCGGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC 361 TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTCCCGGGAG CTTGTATATC CATTTTCGGA 421 TCTGATCAGC ACGTGTTGAC AATTAATCAT CGGCATAGTA TATCGGCATA GTATAATACG 481 ACAAGGTGAG GAACTAAATC ATGAAGACCT TCAACATCTC TCAGCAGGAT CTGGAGCTGG 541 TGGAGGTCGC CACTGAGAAG ATCACCATGC TCTATGAGGA CAACAAGCAC CATGTCGGGG 601 CGGCCATCAG GACCAAGACT GGGGAGATCA TCTCTGCTGT CCACATTGAG GCCTACATTG 661 GCAGGGTCAC TGTCTGTGCT GAAGCCATTG CCATTGGGTC TGCTGTGAGC AACGGGCAGA 721 AGGACTTTGA CACCATTGTG GCTGTCAGGC ACCCCTACTC TGATGAGGTG GACAGATCCA 781 TCAGGGTGGT CAGCCCCTGT GGCATGTGCA GAGAGCTCAT CTCTGACTAT GCTCCTGACT 841 GCTTTGTGCT CATTGAGATG AATGGCAAGC TGGTCAAAAC CACCATTGAG GAACTCATCC 901 CCCTCAAGTA CACCAGGAAC TAAACCTGAA TTCGCTAGAG GGCCGCTTCG AGCAGACATG 961 ATAAGATACA TTGATGAGTT TGGACAAACC ACAACTAGAA TGCAGTGAAA AAAATGCTTT 1021 ATTTGTGAAA TTTGTGATGC TATTGCTTTA TTTGTAACCA TTATAAGCTG CAATAAACAA 1081 GTTAACAACA ACAATTGCAT TCATTTTATG TTTCAGGTTC AGGGGGAGGT GTGGGAGGTT 1141 TTTTAAAGCA AGTAAAACCT CTACAAATGT GGTAAAATCG ATAAGGATCC GGGCTGGCGT 1201 AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT TGCGGTGGAG AAGAGCATGC 1261 GTGAGGCTCC GGTGCCCGTC AGTGGGCAGA GCGCACATCG CCCACAGTCC CCGAGAAGTT 1321 GGGGGGAGGG GTCGGCAATT GAACCGGTGC CTAGAGAAGG TGGCGCGGGG TAAACTGGGA 1381 AAGTGATGTC GTGTACTGGC TCCGCCCTTT TCCCGAGGGT GGGGGAGAAC CGTATATAAG 1441 TGCAGTAGTC GCTGTGAACG TTCTTTTTCG CAACGGGTTT GCCGCCAGAA CACAGGTAAG 1501 TGCCGTGTGT GGTTCCCGCG GGCCTGGCCT CTTTACGGGT TATGGCCCTT GCGTGCCTTG 1561 AATTACTTCC ACCTGGCTGC AGTACGTGAT TCTTGATCCC GAGCTTCGGG TTGGAAGTGG 1621 GTGGGAGAGT TCGAGGCCTT GCGCTTAAGG AGCCCCTTCG CCTCGTGCTT GAGTTGAGGC 1681 CTGGCCTGGG CGCTGGGGCC GCCGCGTGCG AATCTGGTGG CACCTTCGCG CCTGTCTCGC 1741 TGCTTTCGAT AAGTCTCTAG CCATTTAAAA TTTTTGATGA CCTGCTGCGA CGCTTTTTTT 1801 CTGGCAAGAT AGTCTTGTAA ATGCGGGCCA AGATCTGCAC ACTGGTATTT CGGTTTTTGG 1861 GGCCGCGGGC GGCGACGGGG CCCGTGCGTC CCAGCGCACA TGTTCGGCGA GGCGGGGCCT 1921 GCGAGCGCGG CCACCGAGAA TCGGACGGGG GTAGTCTCAA GCTGGCCGGC CTGCTCTGGT 1981 GCCTGGCCTC GCGCCGCCGT GTATCGCCCC GCCCTGGGCG GCAAGGCTGG CCCGGTCGGC 2041 ACCAGTTGCG TGAGCGGAAA GATGGCCGCT TCCCGGCCCT GCTGCAGGGA GCTCAAAATG 2101 GAGGACGCGG CGCTCGGGAG AGCGGGCGGG TGAGTCACCC ACACAAAGGA AAAGGGCCTT 2161 TCCGTCCTCA GCCGTCGCTT CATGTGACTC CACGGAGTAC CGGGCGCCGT CCAGGCACCT 2221 CGATTAGTTC TCGAGCTTTT GGAGTACGTC GTCTTTAGGT TGGGGGGAGG GGTTTTATGC 2281 GATGGAGTTT CCCCACACTG AGTGGGTGGA GACTGAAGTT AGGCCAGCTT GGCACTTGAT 2341 GTAATTCTCC TTGGAATTTG CCCTTTTTGA GTTTGGATCT TGGTTCATTC TCAAGCCTCA 2401 GACAGTGGTT CAAAGTTTTT TTCTTCCATT TCAGGTGTCG TGAGGAATTC TCTAGCATGA 2461 CTGGTGGACA GCAAATGGGT CGGGATCTGT ACGACGATGA CGATAAGGAT CCCGCCACCA 2521 TGGTGAGCAA GGGCGAGGAG GTCATCAAAG AGTTCATGCG CTTCAAGGTG CGCATGGAGG 2581 GCTCCATGAA CGGCCACGAG TTCGAGATCG AGGGCGAGGG CGAGGGCCGC CCCTACGAGG 2641 GCACCCAGAC CGCCAAGCTG AAGGTGACCA AGGGCGGCCC CCTGCCCTTC GCCTGGGACA 2701 TCCTGTCCCC CCAGTTCATG TACGGCTCCA AGGCGTACGT GAAGCACCCC GCCGACATCC 2761 CCGATTACAA GAAGCTGTCC TTCCCCGAGG GCTTCAAGTG GGAGCGCGTG ATGAACTTCG 2821 AGGACGGCGG TCTGGTGACC GTGACCCAGG ACTCCTCCCT GCAGGACGGC ACGCTGATCT 2881 ACAAGGTGAA GATGCGCGGC ACCAACTTCC CCCCCGACGG CCCCGTAATG CAGAAGAAGA 2941 CCATGGGCTG GGAGGCCTCC ACCGAGCGCC TGTACCCCCG CGACGGCGTG CTGAAGGGCG 3001 AGATCCACCA GGCCCTGAAG CTGAAGGACG GCGGCCACTA CCTGGTGGAG TTCAAGACCA 3061 TCTACATGGC CAAGAAGCCC GTGCAACTGC CCGGCTACTA CTACGTGGAC ACCAAGCTGG 3121 ACATCACCTC CCACAACGAG GACTACACCA TCGTGGAACA GTACGAGCGC TCCGAGGGCC 3181 GCCACCACCT GTTCCTGGGG CATGGCACCG GCAGCACCGG CAGCGGCAGC TCCGGCACCG 3241 CCTCCTCCGA GGACAACAAC ATGGCCGTCA TCAAAGAGTT CATGCGCTTC AAGGTGCGCA 3301 TGGAGGGCTC CATGAACGGC CACGAGTTCG AGATCGAGGG CGAGGGCGAG GGCCGCCCCT 3361 ACGAGGGCAC CCAGACCGCC AAGCTGAAGG TGACCAAGGG CGGCCCCCTG CCCTTCGCCT 3421 GGGACATCCT GTCCCCCCAG TTCATGTACG GCTCCAAGGC GTACGTGAAG CACCCCGCCG 3481 ACATCCCCGA TTACAAGAAG CTGTCCTTCC CCGAGGGCTT CAAGTGGGAG CGCGTGATGA 3541 ACTTCGAGGA CGGCGGTCTG GTGACCGTGA CCCAGGACTC CTCCCTGCAG GACGGCACGC 3601 TGATCTACAA GGTGAAGATG CGCGGCACCA ACTTCCCCCC CGACGGCCCC GTAATGCAGA 3661 AGAAGACCAT GGGCTGGGAG GCCTCCACCG AGCGCCTGTA CCCCCGCGAC GGCGTGCTGA 3721 AGGGCGAGAT CCACCAGGCC CTGAAGCTGA AGGACGGCGG CCACTACCTG GTGGAGTTCA 3781 AGACCATCTA CATGGCCAAG AAGCCCGTGC AACTGCCCGG CTACTACTAC GTGGACACCA 3841 AGCTGGACAT CACCTCCCAC AACGAGGACT ACACCATCGT GGAACAGTAC GAGCGCTCCG 3901 AGGGCCGCCA CCACCTGTTC CTGTACGGCA TGGACGAGCT GTACAAGTAA GAATTCGAAG 3961 CTATCAAGCT TATCGATAAT CAACCTCTGG ATTACAAAAT TTGTGAAAGA TTGACTGGTA 4021 TTCTTAACTA TGTTGCTCCT TTTACGCTAT GTGGATACGC TGCTTTAATG CCTTTGTATC 4081 ATGCTATTGC TTCCCGTATG GCTTTCATTT TCTCCTCCTT GTATAAATCC TGGTTGCTGT 4141 CTCTTTATGA GGAGTTGTGG CCCGTTGTCA GGCAACGTGG CGTGGTGTGC ACTGTGTTTG 4201 CTGACGCAAC CCCCACTGGT TGGGGCATTG CCACCACCTG TCAGCTCCTT TCCGGGACTT 4261 TCGCTTTCCC CCTCCCTATT GCCACGGCGG AACTCATCGC CGCCTGCCTT GCCCGCTGCT 4321 GGACAGGGGC TCGGCTGTTG GGCACTGACA ATTCCGTGGT GTTGTCGGGG AAATCATCGT 4381 CCTTTCCTTG GCTGCTCGCC TGTGTTGCCA CCTGGATTCT GCGCGGGACG TCCTTCTGCT 4441 ACGTCCCTTC GGCCCTCAAT CCAGCGGACC TTCCTTCCCG CGGCCTGCTG CCGGCTCTGC 4501 GGCCTCTTCC GCGTCTTCGC CTTCGCCCTC AGACGAGTCG GATCTCCCTT TGGGCCGCCT 4561 CCCCGCATCG ATACCGTCGG CCCACTGCTC CCTAAACCTG AGCTAGCATT ATCCCTAATA 4621 CCTGCCACCC CACTCTTAAT CAGTGGTGGA AGAACGGTCT CAGAACTGTT TGTTTCAATT 4681 GGCCATTTAA GTTTAGTAGT AAAAGACTGG TTAATGATAA CAATGCATCG TAAAACCTTC 4741 AGAAGGAAAG GAGAATGTTT TGTGGACCAC TTTGGTTTTC TTTTTTGCGT GTGGCAGTTT 4801 TAAGTTATTA GTTTTTAAAA TCAGTACTTT TTAATGGAAA CAACTTGACC AAAAATTTGT 4861 CACAGAATTT TGAGACCCAT TAAAAAAGTT AAATGAGAAA CCTGTGTGTT CCTTTGGTCA 4921 ACACCGAGAC ATTTAGGTGA AAGACATCTA ATTCTGGTTT TACGAATCTG GAAACTTCTT 4981 GAAAATGTAA TTCTTGAGTT AACACTTCTG GGTGGAGAAT AGGGTTGTTT TCCCCCCACA 5041 TAATTGGAAG GGGAAGGAAT ATCATTTAAA GCTATGGGAG GGTTTCTTTG ATTACAACAC 5101 TGGAGAGAAA TGCAGCATGT TGCTGATTGC CTGTCACTAA AACAGGCCAA AAACTGAGTC 5161 CTTGGGTTGC ATAGAAAGCT TCATGTTGCT AAACCAATGT TAAGTGAATC TTTGGAAACA 5221 AAATGTTTCC AAATTACTGG GATGTGCATG TTGAAACGTG GGTTAATTAA CTAGCCATGA 5281 CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA 5341 AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC 5401 CACCGCTACC AGCGGTGGTT TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG 5461 TAACTGGCTT CAGCAGAGCG CAGATACCAA ATACTGTTCT TCTAGTGTAG CCGTAGTTAG 5521 GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC 5581 CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT 5641 TACCGGATAA GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG 5701 AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA GCTATGAGAA AGCGCCACGC 5761 TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC 5821 GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC 5881 ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC CTATGGAAAA 5941 ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG CTGGCCTTTT GCTCACATGT 6001 TCTTAATTAA ATTTTTCAAA AGTAGTTGAC AATTAATCAT CGGCATAGTA TATCGGCATA 6061 GTATAATACG ACTCACTATA GGAGGGCCAT CATGGCCAAG TTGACCAGTG CTGTCCCAGT 6121 GCTCACAGCC AGGGATGTGG CTGGAGCTGT TGAGTTCTGG ACTGACAGGT TGGGGTTCTC 6181 CAGAGATTTT GTGGAGGATG ACTTTGCAGG TGTGGTCAGA GATGATGTCA CCCTGTTCAT 6241 CTCAGCAGTC CAGGACCAGG TGGTGCCTGA CAACACCCTG GCTTGGGTGT GGGTGAGAGG 6301 ACTGGATGAG CTGTATGCTG AGTGGAGTGA GGTGGTCTCC ACCAACTTCA GGGATGCCAG 6361 TGGCCCTGCC ATGACAGAGA TTGGAGAGCA GCCCTGGGGG AGAGAGTTTG CCCTGAGAGA 6421 CCCAGCAGGC AACTGTGTGC ACTTTGTGGC AGAGGAGCAG GACTGAGGAT AAGAATTGTA 6481 ACAAAAAACC CCGCCCCGGC GGGGTTTTTT GTTAATTAA //