LOCUS P45L2W.TXT 6730 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_feature complement(>49..57) /note="SV40 minor late 19S RNA" promoter <49..>393 /note="SV40 promoter" enhancer 71..213 /note="SV40 enhancer elements" misc_feature 218..281 /note="SV40 21bp repeats" rep_origin 238..>373 /note="SV40 Ori" mRNA 287..293 /note="late-early transcription startpoints" mutation 307..307 /note="G->A kills SELP ATG" mRNA 327..333 /note="early-early transcription start points" CDS 418..1137 /note="EGFP" polyA_signal 1153..>1374 /note="SV40 Late PolyA" promoter <1465..2643 /note="EF1a promoter and UTR" misc_difference 1465..>1656 /note="rhesus-derived" exon 1663..1695 /note="EF-1a exon 1" intron 1696..2635 /note="EF-1a intron A" exon 2635..2643 /note="EF-1a exon 2 leader" insertion_seq 2667..>2691 /note="attB1" CDS 2704..4095 /note="HPV45 L2 (codmod)" insertion_seq 4103..>4127 /note="attL2" mRNA 4189..>4777 /note="WPRE" polyA_signal 4813..5481 /note="hEF1a polyA signal" polyA_site 5111..5111 /note="site of polyA addition" rep_origin 5482..6215 /note="MB1 Ori" promoter 6216..6302 /note="EM7 promoter" CDS 6303..6677 /note="Zeocin resistance" terminator 6678..6730 /note="terminator (rpmB/G)" BASE COUNT 1504 A 1914 C 1793 G 1519 T 0 OTHER ORIGIN - 1 CCCTGCAGGG CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT GTGGAATGTG 61 TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 121 CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT 181 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC TCCGCCCATC 241 CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC ATGGCTGACT AATTTTTTTT 301 ATTTATACAG AGGCCGAGGC CGCCTCGGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC 361 TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTTGATTGGG ATCCACCGGT CGCCACCATG 421 GTGAGCAAGG GCGAGGAGCT GTTCACCGGG GTGGTGCCCA TCCTGGTCGA GCTGGACGGC 481 GACGTAAACG GCCACAAGTT CAGCGTGTCC GGCGAGGGCG AGGGCGATGC CACCTACGGC 541 AAGCTGACCC TGAAGTTCAT CTGCACCACC GGCAAGCTGC CCGTGCCCTG GCCCACCCTC 601 GTGACCACCC TGACCTACGG CGTGCAGTGC TTCAGCCGCT ACCCCGACCA CATGAAGCAG 661 CACGACTTCT TCAAGTCCGC CATGCCCGAA GGCTACGTCC AGGAGCGCAC CATCTTCTTC 721 AAGGACGACG GCAACTACAA GACCCGCGCC GAGGTGAAGT TCGAGGGCGA CACCCTGGTG 781 AACCGCATCG AGCTGAAGGG CATCGACTTC AAGGAGGACG GCAACATCCT GGGGCACAAG 841 CTGGAGTACA ACTACAACAG CCACAACGTC TATATCATGG CCGACAAGCA GAAGAACGGC 901 ATCAAGGTGA ACTTCAAGAT CCGCCACAAC ATCGAGGACG GCAGCGTGCA GCTCGCCGAC 961 CACTACCAGC AGAACACCCC CATCGGCGAC GGCCCCGTGC TGCTGCCCGA CAACCACTAC 1021 CTGAGCACCC AGTCCGCCCT GAGCAAAGAC CCCAACGAGA AGCGCGATCA CATGGTCCTG 1081 CTGGAGTTCG TGACCGCCGC CGGGATCACT CTCGGCATGG ACGAGCTGTA CAAGTAAAGC 1141 GGCCGCTTCG AGCAGACATG ATAAGATACA TTGATGAGTT TGGACAAACC ACAACTAGAA 1201 TGCAGTGAAA AAAATGCTTT ATTTGTGAAA TTTGTGATGC TATTGCTTTA TTTGTAACCA 1261 TTATAAGCTG CAATAAACAA GTTAACAACA ACAATTGCAT TCATTTTATG TTTCAGGTTC 1321 AGGGGGAGGT GTGGGAGGTT TTTTAAAGCA AGTAAAACCT CTACAAATGT GGTAAAATCG 1381 ATAAGGATCC GGGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT 1441 TGCGGTGGAG AAGAGCATGC GTGAGGCTCC GGTGCCCGTC AGTGGGCAGA GCGCACATCG 1501 CCCACAGTCC CCGAGAAGTT GGGGGGAGGG GTCGGCAATT GAACCGGTGC CTAGAGAAGG 1561 TGGCGCGGGG TAAACTGGGA AAGTGATGTC GTGTACTGGC TCCGCCCTTT TCCCGAGGGT 1621 GGGGGAGAAC CGTATATAAG TGCAGTAGTC GCTGTGAACG TTCTTTTTCG CAACGGGTTT 1681 GCCGCCAGAA CACAGGTAAG TGCCGTGTGT GGTTCCCGCG GGCCTGGCCT CTTTACGGGT 1741 TATGGCCCTT GCGTGCCTTG AATTACTTCC ACCTGGCTGC AGTACGTGAT TCTTGATCCC 1801 GAGCTTCGGG TTGGAAGTGG GTGGGAGAGT TCGAGGCCTT GCGCTTAAGG AGCCCCTTCG 1861 CCTCGTGCTT GAGTTGAGGC CTGGCCTGGG CGCTGGGGCC GCCGCGTGCG AATCTGGTGG 1921 CACCTTCGCG CCTGTCTCGC TGCTTTCGAT AAGTCTCTAG CCATTTAAAA TTTTTGATGA 1981 CCTGCTGCGA CGCTTTTTTT CTGGCAAGAT AGTCTTGTAA ATGCGGGCCA AGATCTGCAC 2041 ACTGGTATTT CGGTTTTTGG GGCCGCGGGC GGCGACGGGG CCCGTGCGTC CCAGCGCACA 2101 TGTTCGGCGA GGCGGGGCCT GCGAGCGCGG CCACCGAGAA TCGGACGGGG GTAGTCTCAA 2161 GCTGGCCGGC CTGCTCTGGT GCCTGGCCTC GCGCCGCCGT GTATCGCCCC GCCCTGGGCG 2221 GCAAGGCTGG CCCGGTCGGC ACCAGTTGCG TGAGCGGAAA GATGGCCGCT TCCCGGCCCT 2281 GCTGCAGGGA GCTCAAAATG GAGGACGCGG CGCTCGGGAG AGCGGGCGGG TGAGTCACCC 2341 ACACAAAGGA AAAGGGCCTT TCCGTCCTCA GCCGTCGCTT CATGTGACTC CACGGAGTAC 2401 CGGGCGCCGT CCAGGCACCT CGATTAGTTC TCGAGCTTTT GGAGTACGTC GTCTTTAGGT 2461 TGGGGGGAGG GGTTTTATGC GATGGAGTTT CCCCACACTG AGTGGGTGGA GACTGAAGTT 2521 AGGCCAGCTT GGCACTTGAT GTAATTCTCC TTGGAATTTG CCCTTTTTGA GTTTGGATCT 2581 TGGTTCATTC TCAAGCCTCA GACAGTGGTT CAAAGTTTTT TTCTTCCATT TCAGGTGTCG 2641 TGAGGAATTC TCTAGAGCTT GATCAAACAA GTTTGTACAA AAAAGCAGGC TTCTAGAGCC 2701 ACCATGGTCA GTCATAGGGC CGCCAGGAGG AAGAGAGCAA GCGCCACCGA TCTGTACCGC 2761 ACCTGCAAAC AGAGTGGCAC CTGTCCACCC GACGTCATCA ATAAGGTCGA GGGGACCACA 2821 CTGGCCGACA AGATCCTGCA ATGGAGCTCA TTGGGCATCT TCCTCGGCGG GTTGGGGATC 2881 GGCACAGGGT CCGGCAGCGG CGGGAGGACC GGATACGTGC CACTGGGCGG GCGCAGCAAC 2941 ACCGTCGTCG ACGTCGGGCC AACCCGCCCC CCCGTCGTCA TCGAGCCCGT GGGCCCCACC 3001 GACCCCAGCA TCGTCACCCT CGTGGAAGAC AGTTCCGTCG TCGCAAGCGG CGCCCCCGTC 3061 CCAACCTTCA CCGGCACAAG CGGCTTCGAG ATCACCAGCA GCGGCACCAC AACCCCCGCC 3121 GTCCTCGATA TTACCCCCAC CGTCGATAGC GTCAGCATCA GCAGCACCTC CTTCACCAAC 3181 CCAGCCTTCA GCGACCCAAG CATCATCGAG GTCCCACAGA CCGGCGAAGT CAGCGGCAAC 3241 ATCTTCGTCG GCACCCCCAC CAGCGGGTCT CACGGCTACG AAGAGATCCC ACTGCAGACC 3301 TTCGCCAGCA GCGGCAGCGG CACCGAGCCA ATCTCCTCCA CACCATTGCC CACCGTCAGA 3361 AGAGTGGCCG GCCCAAGGCT CTACTCCCGC GCCAACCAGC AAGTCAGGGT CAGTACAAGC 3421 CAGTTCCTGA CCCACCCAAG CAGCCTCGTC ACCTTCGACA ACCCCGCCTA CGAGCCACTC 3481 GATACAACCT TGAGTTTCGA ACCCACATCC AACGTCCCCG ACAGTGACTT CATGGACATC 3541 ATCAGGCTCC ACCGCCCCGC CCTGAGTAGC CGCAGGGGGA CCGTCCGCTT CTCCCGCCTC 3601 GGCCAGCGCG CCACAATGTT CACCAGGTCC GGCAAGCAGA TCGGCGGCCG CGTGCACTTC 3661 TATCACGACA TCTCTCCAAT CGCCGCCACC GAAGAGATCG AGCTCCAACC CCTGATCTCC 3721 GCCACCAACG ACTCCGATCT CTTCGACGTG TACGCCGATT TTCCGCCACC CGCCAGTACC 3781 ACCCCCTCAA CCATCCATAA GAGCTTCACC TACCCCAAAT ACAGTCTCAC AATGCCCAGC 3841 ACCGCCGCCA GTAGCTATTC CAACGTCACC GTGCCCCTGA CCAGCGCCTG GGACGTGCCC 3901 ATCTACACCG GGCCCGATAT CATCCTCCCG AGTCACACCC CCATGTGGCC CTCCACCAGC 3961 CCCACAAACG CCAGTACAAC AACATACATC GGCATCCACG GGACCCAGTA CTACCTGTGG 4021 CCCTGGTACT ACTACTTCCC CAAGAAGAGG AAGAGGATCC CATACTTCTT CGCCGACGGG 4081 TTCGTCGCCG CATGAGCCCG GGACCCAGCT TTCTTGTACA AAGTGGTTCG ATCTAGAATG 4141 GCTAGTGGAT CCCCCGGGCT GCAGGAATTC GATATCAAGC TTATCGATAA TCAACCTCTG 4201 GATTACAAAA TTTGTGAAAG ATTGACTGGT ATTCTTAACT ATGTTGCTCC TTTTACGCTA 4261 TGTGGATACG CTGCTTTAAT GCCTTTGTAT CATGCTATTG CTTCCCGTAT GGCTTTCATT 4321 TTCTCCTCCT TGTATAAATC CTGGTTGCTG TCTCTTTATG AGGAGTTGTG GCCCGTTGTC 4381 AGGCAACGTG GCGTGGTGTG CACTGTGTTT GCTGACGCAA CCCCCACTGG TTGGGGCATT 4441 GCCACCACCT GTCAGCTCCT TTCCGGGACT TTCGCTTTCC CCCTCCCTAT TGCCACGGCG 4501 GAACTCATCG CCGCCTGCCT TGCCCGCTGC TGGACAGGGG CTCGGCTGTT GGGCACTGAC 4561 AATTCCGTGG TGTTGTCGGG GAAATCATCG TCCTTTCCTT GGCTGCTCGC CTGTGTTGCC 4621 ACCTGGATTC TGCGCGGGAC GTCCTTCTGC TACGTCCCTT CGGCCCTCAA TCCAGCGGAC 4681 CTTCCTTCCC GCGGCCTGCT GCCGGCTCTG CGGCCTCTTC CGCGTCTTCG CCTTCGCCCT 4741 CAGACGAGTC GGATCTCCCT TTGGGCCGCC TCCCCGCATC GATACCGTCG GCCCACTGCT 4801 CCCTAAACCT GAGCTAGCAT TATCCCTAAT ACCTGCCACC CCACTCTTAA TCAGTGGTGG 4861 AAGAACGGTC TCAGAACTGT TTGTTTCAAT TGGCCATTTA AGTTTAGTAG TAAAAGACTG 4921 GTTAATGATA ACAATGCATC GTAAAACCTT CAGAAGGAAA GGAGAATGTT TTGTGGACCA 4981 CTTTGGTTTT CTTTTTTGCG TGTGGCAGTT TTAAGTTATT AGTTTTTAAA ATCAGTACTT 5041 TTTAATGGAA ACAACTTGAC CAAAAATTTG TCACAGAATT TTGAGACCCA TTAAAAAAGT 5101 TAAATGAGAA ACCTGTGTGT TCCTTTGGTC AACACCGAGA CATTTAGGTG AAAGACATCT 5161 AATTCTGGTT TTACGAATCT GGAAACTTCT TGAAAATGTA ATTCTTGAGT TAACACTTCT 5221 GGGTGGAGAA TAGGGTTGTT TTCCCCCCAC ATAATTGGAA GGGGAAGGAA TATCATTTAA 5281 AGCTATGGGA GGGTTTCTTT GATTACAACA CTGGAGAGAA ATGCAGCATG TTGCTGATTG 5341 CCTGTCACTA AAACAGGCCA AAAACTGAGT CCTTGGGTTG CATAGAAAGC TTCATGTTGC 5401 TAAACCAATG TTAAGTGAAT CTTTGGAAAC AAAATGTTTC CAAATTACTG GGATGTGCAT 5461 GTTGAAACGT GGGTTAATTA ACTAGCCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC 5521 CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG 5581 CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG 5641 GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA 5701 AATACTGTTC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG 5761 CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG 5821 TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA 5881 ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC 5941 CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT 6001 CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC 6061 TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA 6121 TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC 6181 CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTAATTA AATTTTTCAA AAGTAGTTGA 6241 CAATTAATCA TCGGCATAGT ATATCGGCAT AGTATAATAC GACTCACTAT AGGAGGGCCA 6301 TCATGGCCAA GTTGACCAGT GCTGTCCCAG TGCTCACAGC CAGGGATGTG GCTGGAGCTG 6361 TTGAGTTCTG GACTGACAGG TTGGGGTTCT CCAGAGATTT TGTGGAGGAT GACTTTGCAG 6421 GTGTGGTCAG AGATGATGTC ACCCTGTTCA TCTCAGCAGT CCAGGACCAG GTGGTGCCTG 6481 ACAACACCCT GGCTTGGGTG TGGGTGAGAG GACTGGATGA GCTGTATGCT GAGTGGAGTG 6541 AGGTGGTCTC CACCAACTTC AGGGATGCCA GTGGCCCTGC CATGACAGAG ATTGGAGAGC 6601 AGCCCTGGGG GAGAGAGTTT GCCCTGAGAG ACCCAGCAGG CAACTGTGTG CACTTTGTGG 6661 CAGAGGAGCA GGACTGAGGA TAAGAATTGT AACAAAAAAC CCCGCCCCGG CGGGGTTTTT 6721 TGTTAATTAA //