LOCUS P45L1W.TXT 6880 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_feature complement(>49..57) /note="SV40 minor late 19S RNA" promoter <49..>393 /note="SV40 promoter" enhancer 71..213 /note="SV40 enhancer elements" misc_feature 218..281 /note="SV40 21bp repeats" rep_origin 238..>373 /note="SV40 Ori" mRNA 287..293 /note="late-early transcription startpoints" mutation 307..307 /note="G->A kills SELP ATG" mRNA 327..333 /note="early-early transcription start points" CDS 418..1137 /note="EGFP" polyA_signal 1153..>1374 /note="SV40 Late PolyA" promoter <1465..2643 /note="EF1a promoter and UTR" misc_difference 1465..>1656 /note="rhesus-derived" exon 1663..1695 /note="EF-1a exon 1" intron 1696..2635 /note="EF-1a intron A" exon 2635..2643 /note="EF-1a exon 2 leader" insertion_seq 2667..>2691 /note="attB1" CDS 2704..4245 /note="HPV45 L1 (codmod)" insertion_seq 4253..>4277 /note="attB2" mRNA 4339..>4927 /note="WPRE" polyA_signal 4963..5631 /note="hEF1a polyA signal" polyA_site 5261..5261 /note="site of polyA addition" rep_origin 5632..6365 /note="MB1 Ori" promoter 6366..6452 /note="EM7 promoter" CDS 6453..6827 /note="Zeocin resistance" terminator 6828..6880 /note="terminator (rpmB/G)" BASE COUNT 1541 A 1943 C 1844 G 1552 T 0 OTHER ORIGIN - 1 CCCTGCAGGG CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT GTGGAATGTG 61 TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 121 CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT 181 ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC TCCGCCCATC 241 CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC ATGGCTGACT AATTTTTTTT 301 ATTTATACAG AGGCCGAGGC CGCCTCGGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC 361 TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTTGATTGGG ATCCACCGGT CGCCACCATG 421 GTGAGCAAGG GCGAGGAGCT GTTCACCGGG GTGGTGCCCA TCCTGGTCGA GCTGGACGGC 481 GACGTAAACG GCCACAAGTT CAGCGTGTCC GGCGAGGGCG AGGGCGATGC CACCTACGGC 541 AAGCTGACCC TGAAGTTCAT CTGCACCACC GGCAAGCTGC CCGTGCCCTG GCCCACCCTC 601 GTGACCACCC TGACCTACGG CGTGCAGTGC TTCAGCCGCT ACCCCGACCA CATGAAGCAG 661 CACGACTTCT TCAAGTCCGC CATGCCCGAA GGCTACGTCC AGGAGCGCAC CATCTTCTTC 721 AAGGACGACG GCAACTACAA GACCCGCGCC GAGGTGAAGT TCGAGGGCGA CACCCTGGTG 781 AACCGCATCG AGCTGAAGGG CATCGACTTC AAGGAGGACG GCAACATCCT GGGGCACAAG 841 CTGGAGTACA ACTACAACAG CCACAACGTC TATATCATGG CCGACAAGCA GAAGAACGGC 901 ATCAAGGTGA ACTTCAAGAT CCGCCACAAC ATCGAGGACG GCAGCGTGCA GCTCGCCGAC 961 CACTACCAGC AGAACACCCC CATCGGCGAC GGCCCCGTGC TGCTGCCCGA CAACCACTAC 1021 CTGAGCACCC AGTCCGCCCT GAGCAAAGAC CCCAACGAGA AGCGCGATCA CATGGTCCTG 1081 CTGGAGTTCG TGACCGCCGC CGGGATCACT CTCGGCATGG ACGAGCTGTA CAAGTAAAGC 1141 GGCCGCTTCG AGCAGACATG ATAAGATACA TTGATGAGTT TGGACAAACC ACAACTAGAA 1201 TGCAGTGAAA AAAATGCTTT ATTTGTGAAA TTTGTGATGC TATTGCTTTA TTTGTAACCA 1261 TTATAAGCTG CAATAAACAA GTTAACAACA ACAATTGCAT TCATTTTATG TTTCAGGTTC 1321 AGGGGGAGGT GTGGGAGGTT TTTTAAAGCA AGTAAAACCT CTACAAATGT GGTAAAATCG 1381 ATAAGGATCC GGGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT 1441 TGCGGTGGAG AAGAGCATGC GTGAGGCTCC GGTGCCCGTC AGTGGGCAGA GCGCACATCG 1501 CCCACAGTCC CCGAGAAGTT GGGGGGAGGG GTCGGCAATT GAACCGGTGC CTAGAGAAGG 1561 TGGCGCGGGG TAAACTGGGA AAGTGATGTC GTGTACTGGC TCCGCCCTTT TCCCGAGGGT 1621 GGGGGAGAAC CGTATATAAG TGCAGTAGTC GCTGTGAACG TTCTTTTTCG CAACGGGTTT 1681 GCCGCCAGAA CACAGGTAAG TGCCGTGTGT GGTTCCCGCG GGCCTGGCCT CTTTACGGGT 1741 TATGGCCCTT GCGTGCCTTG AATTACTTCC ACCTGGCTGC AGTACGTGAT TCTTGATCCC 1801 GAGCTTCGGG TTGGAAGTGG GTGGGAGAGT TCGAGGCCTT GCGCTTAAGG AGCCCCTTCG 1861 CCTCGTGCTT GAGTTGAGGC CTGGCCTGGG CGCTGGGGCC GCCGCGTGCG AATCTGGTGG 1921 CACCTTCGCG CCTGTCTCGC TGCTTTCGAT AAGTCTCTAG CCATTTAAAA TTTTTGATGA 1981 CCTGCTGCGA CGCTTTTTTT CTGGCAAGAT AGTCTTGTAA ATGCGGGCCA AGATCTGCAC 2041 ACTGGTATTT CGGTTTTTGG GGCCGCGGGC GGCGACGGGG CCCGTGCGTC CCAGCGCACA 2101 TGTTCGGCGA GGCGGGGCCT GCGAGCGCGG CCACCGAGAA TCGGACGGGG GTAGTCTCAA 2161 GCTGGCCGGC CTGCTCTGGT GCCTGGCCTC GCGCCGCCGT GTATCGCCCC GCCCTGGGCG 2221 GCAAGGCTGG CCCGGTCGGC ACCAGTTGCG TGAGCGGAAA GATGGCCGCT TCCCGGCCCT 2281 GCTGCAGGGA GCTCAAAATG GAGGACGCGG CGCTCGGGAG AGCGGGCGGG TGAGTCACCC 2341 ACACAAAGGA AAAGGGCCTT TCCGTCCTCA GCCGTCGCTT CATGTGACTC CACGGAGTAC 2401 CGGGCGCCGT CCAGGCACCT CGATTAGTTC TCGAGCTTTT GGAGTACGTC GTCTTTAGGT 2461 TGGGGGGAGG GGTTTTATGC GATGGAGTTT CCCCACACTG AGTGGGTGGA GACTGAAGTT 2521 AGGCCAGCTT GGCACTTGAT GTAATTCTCC TTGGAATTTG CCCTTTTTGA GTTTGGATCT 2581 TGGTTCATTC TCAAGCCTCA GACAGTGGTT CAAAGTTTTT TTCTTCCATT TCAGGTGTCG 2641 TGAGGAATTC TCTAGAGCTT GATCAAACAA GTTTGTACAA AAAAGCAGGC TTCTAGAGCC 2701 ACCATGGCCC TCTGGAGACC CTCCGATTCC ACCGTGTACT TGCCCCCCCC CAGCGTCGCA 2761 CGCGTCGTGT CTACCGACGA CTACGTCAGC AGGACCTCAA TCTTCTACCA CGCCGGGTCC 2821 AGTAGGCTGC TGACCGTGGG GAACCCCTAC TTCCGCGTCG TGCCCAACGG CGCCGGCAAC 2881 AAGCAAGCCG TCCCCAAAGT CAGTGCCTAC CAGTACCGCG TCTTCCGCGT GGCCCTGCCA 2941 GACCCCAACA AGTTCGGCCT GCCCGACAGC ACCATCTACA ACCCCGAGAC CCAGAGGCTC 3001 GTCTGGGCCT GCGTGGGCAT GGAGATCGGC AGGGGCCAAC CCCTGGGCAT CGGGTTGTCC 3061 GGGCACCCCT TCTACAACAA GCTCGACGAC ACCGAGTCCG CCCACGCCGC CACCGCCGTC 3121 ATCACCCAGG ACGTCCGCGA CAACGTCAGC GTCGACTACA AACAGACCCA ACTCTGCATC 3181 CTGGGCTGCG TGCCCGCCAT CGGCGAACAT TGGGCAAAGG GGACCTTGTG CAAGCCCGCC 3241 CAGCTCCAGC CCGGCGATTG CCCCCCCCTC GAGTTGAAGA ATACAATCAT CGAGGACGGC 3301 GACATGGTCG ACACCGGCTA CGGCGCCATG GACTTCTCCA CCCTCCAAGA CACCAAATGT 3361 GAAGTCCCCC TGGATATCTG CCAGAGTATT TGCAAGTACC CCGACTACCT CCAGATGAGC 3421 GCCGACCCAT ACGGCGACAG CATGTTCTTC TGTTTGAGGA GGGAGCAGCT CTTCGCCCGC 3481 CACTTCTGGA ACCGCGCCGG CGTCATGGGC GATACCGTGC CCACCGATTT GTACATCAAG 3541 GGGACCTCAG CCAACATGAG GGAGACACCG GGGTCCTGCG TCTACAGTCC CAGCCCATCC 3601 GGGAGCATCA TCACCAGCGA CAGCCAGCTG TTCAACAAGC CCTACTGGCT GCACAAAGCA 3661 CAGGGGCACA ATAACGGCAT CTGCTGGCAC AACCAACTCT TCGTCACCGT GGTCGATACC 3721 ACAAGGTCCA CCAACCTGAC CCTGTGCGCA AGCACCCAGA ACCCCGTCCC CTCCACCTAC 3781 GATCCCACCA AGTTCAAACA GTACTCCCGC CACGTCGAAG AGTACGACCT GCAGTTCATC 3841 TTCCAACTCT GTACCATCAC CCTGACCGCC GAGGTCATGA GCTACATTCA CTCCATGAAC 3901 TCCTCCATCC TGGAGAACTG GAACTTCGGC GTGCCCCCCC CCCCCACCAC CTCCCTCGTC 3961 GACACCTACA GGTTCGTCCA GAGCGTCGCC GTCACATGCC AGAAGGACAC CACCCCCCCC 4021 GAGAAACAGG ACCCCTACGA CAAGCTGAAG TTCTGGACCG TCGATTTGAA GGAGAAGTTC 4081 AGTAGTGACC TCGACCAGTA CCCATTGGGC AGGAAATTCC TGGTCCAAGC CGGCCTGAGG 4141 AGGCGCCCCA CAATCGGCCC CAGGAAGAGG CCCGCCGCCA GTACCAGCAC CGCCAGCACC 4201 GCCAGCCGCC CCGCAAAGCG CGTCAGGATC AGGTCCAAGA AATGAGCCCG GGACCCAGCT 4261 TTCTTGTACA AAGTGGTTCG ATCTAGAATG GCTAGTGGAT CCCCCGGGCT GCAGGAATTC 4321 GATATCAAGC TTATCGATAA TCAACCTCTG GATTACAAAA TTTGTGAAAG ATTGACTGGT 4381 ATTCTTAACT ATGTTGCTCC TTTTACGCTA TGTGGATACG CTGCTTTAAT GCCTTTGTAT 4441 CATGCTATTG CTTCCCGTAT GGCTTTCATT TTCTCCTCCT TGTATAAATC CTGGTTGCTG 4501 TCTCTTTATG AGGAGTTGTG GCCCGTTGTC AGGCAACGTG GCGTGGTGTG CACTGTGTTT 4561 GCTGACGCAA CCCCCACTGG TTGGGGCATT GCCACCACCT GTCAGCTCCT TTCCGGGACT 4621 TTCGCTTTCC CCCTCCCTAT TGCCACGGCG GAACTCATCG CCGCCTGCCT TGCCCGCTGC 4681 TGGACAGGGG CTCGGCTGTT GGGCACTGAC AATTCCGTGG TGTTGTCGGG GAAATCATCG 4741 TCCTTTCCTT GGCTGCTCGC CTGTGTTGCC ACCTGGATTC TGCGCGGGAC GTCCTTCTGC 4801 TACGTCCCTT CGGCCCTCAA TCCAGCGGAC CTTCCTTCCC GCGGCCTGCT GCCGGCTCTG 4861 CGGCCTCTTC CGCGTCTTCG CCTTCGCCCT CAGACGAGTC GGATCTCCCT TTGGGCCGCC 4921 TCCCCGCATC GATACCGTCG GCCCACTGCT CCCTAAACCT GAGCTAGCAT TATCCCTAAT 4981 ACCTGCCACC CCACTCTTAA TCAGTGGTGG AAGAACGGTC TCAGAACTGT TTGTTTCAAT 5041 TGGCCATTTA AGTTTAGTAG TAAAAGACTG GTTAATGATA ACAATGCATC GTAAAACCTT 5101 CAGAAGGAAA GGAGAATGTT TTGTGGACCA CTTTGGTTTT CTTTTTTGCG TGTGGCAGTT 5161 TTAAGTTATT AGTTTTTAAA ATCAGTACTT TTTAATGGAA ACAACTTGAC CAAAAATTTG 5221 TCACAGAATT TTGAGACCCA TTAAAAAAGT TAAATGAGAA ACCTGTGTGT TCCTTTGGTC 5281 AACACCGAGA CATTTAGGTG AAAGACATCT AATTCTGGTT TTACGAATCT GGAAACTTCT 5341 TGAAAATGTA ATTCTTGAGT TAACACTTCT GGGTGGAGAA TAGGGTTGTT TTCCCCCCAC 5401 ATAATTGGAA GGGGAAGGAA TATCATTTAA AGCTATGGGA GGGTTTCTTT GATTACAACA 5461 CTGGAGAGAA ATGCAGCATG TTGCTGATTG CCTGTCACTA AAACAGGCCA AAAACTGAGT 5521 CCTTGGGTTG CATAGAAAGC TTCATGTTGC TAAACCAATG TTAAGTGAAT CTTTGGAAAC 5581 AAAATGTTTC CAAATTACTG GGATGTGCAT GTTGAAACGT GGGTTAATTA ACTAGCCATG 5641 ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC 5701 AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA 5761 CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG 5821 GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTTC TTCTAGTGTA GCCGTAGTTA 5881 GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA 5941 CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG 6001 TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG 6061 GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG 6121 CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 6181 CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC 6241 CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA 6301 AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG 6361 TTCTTAATTA AATTTTTCAA AAGTAGTTGA CAATTAATCA TCGGCATAGT ATATCGGCAT 6421 AGTATAATAC GACTCACTAT AGGAGGGCCA TCATGGCCAA GTTGACCAGT GCTGTCCCAG 6481 TGCTCACAGC CAGGGATGTG GCTGGAGCTG TTGAGTTCTG GACTGACAGG TTGGGGTTCT 6541 CCAGAGATTT TGTGGAGGAT GACTTTGCAG GTGTGGTCAG AGATGATGTC ACCCTGTTCA 6601 TCTCAGCAGT CCAGGACCAG GTGGTGCCTG ACAACACCCT GGCTTGGGTG TGGGTGAGAG 6661 GACTGGATGA GCTGTATGCT GAGTGGAGTG AGGTGGTCTC CACCAACTTC AGGGATGCCA 6721 GTGGCCCTGC CATGACAGAG ATTGGAGAGC AGCCCTGGGG GAGAGAGTTT GCCCTGAGAG 6781 ACCCAGCAGG CAACTGTGTG CACTTTGTGG CAGAGGAGCA GGACTGAGGA TAAGAATTGT 6841 AACAAAAAAC CCCGCCCCGG CGGGGTTTTT TGTTAATTAA //