LOCUS PHGF.TXT 5608 BP DS-DNA CIRCULAR SYN 17-MAY-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_RNA complement(>4..12) /note="SV40 minor late 19S RNA" promoter <4..>348 /note="SV40 promoter (∆SELP)" enhancer 26..168 /note="SV40 enhancer elements" misc_feature 173..236 /note="SV40 21bp repeats" rep_origin 193..>328 /note="SV40 Ori" mRNA 242..248 /note="early-late transcription startpoints" mutation 262..262 /note="G->A kills SELP ATG" mRNA 282..288 /note="early-early transcription start points" CDS 373..1092 /note="EGFP" polyA_signal 1108..>1329 /note="SV40 Late PolyA" promoter <1420..1657 /note="hEF1alpha promoter core" 5'UTR 1665..1952 /note="HTLV R-U5 leader" insertion_seq 1973..2097 /note="attR1" CDS 2206..2865 /note="CAT" CDS 3207..3512 /note="ccdB" insertion_seq 3553..3677 /note="attR2" polyA_signal <3697..4359 /note="hEF1a polyA signal" rep_origin 4360..5093 /note="MB1 Ori" promoter 5094..5180 /note="EM7 promoter" CDS 5181..5555 /note="ShBle (ZeoR)" terminator 5556..5608 /note="terminator (rpmB/G)" BASE COUNT 1442 A 1330 C 1441 G 1395 T 0 OTHER ORIGIN - 1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA 61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC 121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC 181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC 241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG 301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA 361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG 421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC 481 GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG 541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC 601 GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG 661 CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG 721 GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC 781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC 841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC 901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG 961 CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC 1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG 1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC 1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG 1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT 1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA 1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC 1381 GCCCTTCCCA ACAGTTGCGG TGGAGAAGAG CATGCGTGAG GCTCCGGTGC CCGTCAGTGG 1441 GCAGAGCGCA CATCGCCCAC AGTCCCCGAG AAGTTGGGGG GAGGGGTCGG CAATTGAACC 1501 GGTGCCTAGA GAAGGTGGCG CGGGGTAAAC TGGGAAAGTG ATGTCGTGTA CTGGCTCCGC 1561 CTTTTTCCCG AGGGTGGGGG AGAACCGTAT ATAAGTGCAG TAGTCGCCGT GAACGTTCTT 1621 TTTCGCAACG GGTTTGCCGC CAGAACACAG CTGAAGCTTC GAGGGGCTCG CATCTCTCCT 1681 TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC GGTTGAGTCG CGTTCTGCCG 1741 CCTCCCGCCT GTGGTGCCTC CTGAACTGCG TCCGCCGTCT AGGTAAGTTT AAAGCTCAGG 1801 TCGAGACCGG GCCTTTGTCC GGCGCTCCCT TGGAGCCTAC CTAGACTCAG CCGGCTCTCC 1861 ACGCTTTGCC TGACCCTGCT TGCTCAACTC TACGTCTTTG TTTCGTTTTC TGTTCTGCGC 1921 CGTTACAGAT CCAAGCTGTG ACCGGCGCCT ACGTAAGTGA TAGCTTGATC AAACAAGTTT 1981 GTACAAAAAA GCTGAACGAG AAACGTAAAA TGATATAAAT ATCAATATAT TAAATTAGAT 2041 TTTGCATAAA AAACAGACTA CATAATACTG TAAAACACAA CATATCCAGT CACTATGGCG 2101 GCCGCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATAA TGTGTGGATT 2161 TTGAGTTAGG ATCCGTCGAG ATTTTCAGGA GCTAAGGAAG CTAAAATGGA GAAAAAAATC 2221 ACTGGATATA CCACCGTTGA TATATCCCAA TGGCATCGTA AAGAACATTT TGAGGCATTT 2281 CAGTCAGTTG CTCAATGTAC CTATAACCAG ACCGTTCAGC TGGATATTAC GGCCTTTTTA 2341 AAGACCGTAA AGAAAAATAA GCACAAGTTT TATCCGGCCT TTATTCACAT TCTTGCCCGC 2401 CTGATGAATG CTCATCCGGA ATTCCGTATG GCAATGAAAG ACGGTGAGCT GGTGATATGG 2461 GATAGTGTTC ACCCTTGTTA CACCGTTTTC CATGAGCAAA CTGAAACGTT TTCATCGCTC 2521 TGGAGTGAAT ACCACGACGA TTTCCGGCAG TTTCTACACA TATATTCGCA AGATGTGGCG 2581 TGTTACGGTG AAAACCTGGC CTATTTCCCT AAAGGGTTTA TTGAGAATAT GTTTTTCGTC 2641 TCAGCCAATC CCTGGGTGAG TTTCACCAGT TTTGATTTAA ACGTGGCCAA TATGGACAAC 2701 TTCTTCGCCC CCGTTTTCAC CATGGGCAAA TATTATACGC AAGGCGACAA GGTGCTGATG 2761 CCGCTGGCGA TTCAGGTTCA TCATGCCGTC TGTGATGGCT TCCATGTCGG CAGAATGCTT 2821 AATGAATTAC AACAGTACTG CGATGAGTGG CAGGGCGGGG CGTAATCTAG AGGATCCGGC 2881 TTACTAAAAG CCAGATAACA GTATGCGTAT TTGCGCGCTG ATTTTTGCGG TATAAGAATA 2941 TATACTGATA TGTATACCCG AAGTATGTCA AAAAGAGGTG TGCTATGAAG CAGCGTATTA 3001 CAGTGACAGT TGACAGCGAC AGCTATCAGT TGCTCAAGGC ATATATGATG TCAATATCTC 3061 CGGTCTGGTA AGCACAACCA TGCAGAATGA AGCCCGTCGT CTGCGTGCCG AACGCTGGAA 3121 AGCGGAAAAT CAGGAAGGGA TGGCTGAGGT CGCCCGGTTT ATTGAAATGA ACGGCTCTTT 3181 TGCTGACGAG AACAGGGACT GGTGAAATGC AGTTTAAGGT TTACACCTAT AAAAGAGAGA 3241 GCCGTTATCG TCTGTTTGTG GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA 3301 TGGTGATCCC CCTGGCCAGT GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC 3361 CGGTGGTGCA TATCGGGGAT GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC 3421 CGGTCTCCGT TATCGGGGAA GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT GACATCAAAA 3481 ACGCCATTAA CCTGATGTTC TGGGGAATAT AAATGTCAGG CTCCGTTATA CACAGCCAGT 3541 CTGCAGGTCG ACCATAGTGA CTGGATATGT TGTGTTTTAC AGTATTATGT AGTCTGTTTT 3601 TTATGCAAAA TCTAATTTAA TATATTGATA TTTATATCAT TTTACGTTTC TCGTTCAGCT 3661 TTCTTGTACA AAGTGGTTCG ATCTAGAATG GCTAGCATTA TCCCTAATAC CTGCCACCCC 3721 ACTCTTAATC AGTGGTGGAA GAACGGTCTC AGAACTGTTT GTTTCAATTG GCCATTTAAG 3781 TTTAGTAGTA AAAGACTGGT TAATGATAAC AATGCATCGT AAAACCTTCA GAAGGAAAGG 3841 AGAATGTTTT GTGGACCACT TTGGTTTTCT TTTTTGCGTG TGGCAGTTTT AAGTTATTAG 3901 TTTTTAAAAT CAGTACTTTT TAATGGAAAC AACTTGACCA AAAATTTGTC ACAGAATTTT 3961 GAGACCCATT AAAAAAGTTA AATGAGAAAC CTGTGTGTTC CTTTGGTCAA CACCGAGACA 4021 TTTAGGTGAA AGACATCTAA TTCTGGTTTT ACGAATCTGG AAACTTCTTG AAAATGTAAT 4081 TCTTGAGTTA ACACTTCTGG GTGGAGAATA GGGTTGTTTT CCCCCCACAT AATTGGAAGG 4141 GGAAGGAATA TCATTTAAAG CTATGGGAGG GTTTCTTTGA TTACAACACT GGAGAGAAAT 4201 GCAGCATGTT GCTGATTGCC TGTCACTAAA ACAGGCCAAA AACTGAGTCC TTGGGTTGCA 4261 TAGAAAGCTT CATGTTGCTA AACCAATGTT AAGTGAATCT TTGGAAACAA AATGTTTCCA 4321 AATTACTGGG ATGTGCATGT TGAAACGTGG GTTAATTAAC TAGCCATGAC CAAAATCCCT 4381 TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT 4441 TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA 4501 GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC 4561 AGCAGAGCGC AGATACCAAA TACTGTTCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC 4621 AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT 4681 GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG 4741 GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC 4801 TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG 4861 AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG 4921 CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT 4981 GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC 5041 GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTAATTAAA 5101 TTTTTCAAAA GTAGTTGACA ATTAATCATC GGCATAGTAT ATCGGCATAG TATAATACGA 5161 CTCACTATAG GAGGGCCATC ATGGCCAAGT TGACCAGTGC TGTCCCAGTG CTCACAGCCA 5221 GGGATGTGGC TGGAGCTGTT GAGTTCTGGA CTGACAGGTT GGGGTTCTCC AGAGATTTTG 5281 TGGAGGATGA CTTTGCAGGT GTGGTCAGAG ATGATGTCAC CCTGTTCATC TCAGCAGTCC 5341 AGGACCAGGT GGTGCCTGAC AACACCCTGG CTTGGGTGTG GGTGAGAGGA CTGGATGAGC 5401 TGTATGCTGA GTGGAGTGAG GTGGTCTCCA CCAACTTCAG GGATGCCAGT GGCCCTGCCA 5461 TGACAGAGAT TGGAGAGCAG CCCTGGGGGA GAGAGTTTGC CCTGAGAGAC CCAGCAGGCA 5521 ACTGTGTGCA CTTTGTGGCA GAGGAGCAGG ACTGAGGATA AGAATTGTAA CAAAAAACCC 5581 CGCCCCGGCG GGGTTTTTTG TTAATTAA //