LOCUS phu____Lsf 4550 bp DNA circular SYN 17-Jun-2003 DEFINITION - ACCESSION - KEYWORDS - SOURCE - FEATURES Location/Qualifiers misc_RNA complement(<4..12) /note="SV40 minor late 19S RNA" promoter <4..>348 /note="SV40 promoter ( SELP)" enhancer 26..168 /note="SV40 enhancer elements" misc_feature 173..236 /note="SV40 21bp repeats" rep_origin 193..>328 /note="SV40 Ori" mRNA 242..248 /note="early-late transcription startpoints" mutation 262 /note="G->A kills SELP ATG" mRNA 282..288 /note="early-early transcription start points" CDS 373..1092 /note="EGFP" polyA_signal 1108..>1329 /note="SV40 Late PolyA" mutation 1399 /note="missing A (huLsf sequencing)" promoter <1420..>1657 /note="hEF1alpha promoter core" 5'UTR 1665..1931 /note="HTLV-1 R-U5" mutation 1947 /note="missing GG (discovered by sequencing)" CDS 2024..2623 /note="Nanoluc luciferase with IL-6 secretion signal" polyA_signal <2639..3301 /note="hEF1a polyA signal" rep_origin 3302..4035 /note="MB1 Ori" promoter 4036..4122 /note="EM7 promoter" CDS 4123..4497 /note="ShBle (ZeoR)" terminator 4498..4550 /note="terminator (rpmB/G)" BASE COUNT 1103 A 1168 C 1216 G 1063 T 0 OTHER ORIGIN ? 1 CCCCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA 61 AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC 121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC 181 CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC 241 TGACTAATTT TTTTTATTTA TACAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG 301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA TTGGGATCCA 361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG 421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC 481 GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG 541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC 601 GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG 661 CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG 721 GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC 781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC 841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC 901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG 961 CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC 1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG 1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC 1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG 1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT 1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA 1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC 1381 GCCCTTCCCA ACAGTTGCGG TGGAGAAGAG CATGCGTGAG GCTCCGGTGC CCGTCAGTGG 1441 GCAGAGCGCA CATCGCCCAC AGTCCCCGAG AAGTTGGGGG GAGGGGTCGG CAATTGAACC 1501 GGTGCCTAGA GAAGGTGGCG CGGGGTAAAC TGGGAAAGTG ATGTCGTGTA CTGGCTCCGC 1561 CTTTTTCCCG AGGGTGGGGG AGAACCGTAT ATAAGTGCAG TAGTCGCCGT GAACGTTCTT 1621 TTTCGCAACG GGTTTGCCGC CAGAACACAG CTGAAGCTTC GAGGGGCTCG CATCTCTCCT 1681 TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC GGTTGAGTCG CGTTCTGCCG 1741 CCTCCCGCCT GTGGTGCCTC CTGAACTGCG TCCGCCGTCT AGGTAAGTTT AAAGCTCAGG 1801 TCGAGACCGG GCCTTTGTCC GGCGCTCCCT TGGAGCCTAC CTAGACTCAG CCGGCTCTCC 1861 ACGCTTTGCC TGACCCTGCT TGCTCAACTC TACGTCTTTG TTTCGTTTTC TGTTCTGCGC 1921 CGTTACAGAT CCAAGCTGTG ACCGGCCCGC TCTAGCCTCG AGGATATCAA GATCTGGCCT 1981 CGGCGGCCAA GCTTGGCAAT CCGGTACTGT TGGTAAAGCC ACCATGAACT CCTTCTCCAC 2041 AAGCGCCTTC GGTCCAGTTG CCTTCTCCCT GGGCCTGCTC CTGGTGTTGC CTGCTGCCTT 2101 CCCTGCCCCA GTCTTCACAC TCGAAGATTT CGTTGGGGAC TGGCGACAGA CAGCCGGCTA 2161 CAACCTGGAC CAAGTCCTTG AACAGGGAGG TGTGTCCAGT TTGTTTCAGA ATCTCGGGGT 2221 GTCCGTAACT CCGATCCAAA GGATTGTCCT GAGCGGTGAA AATGGGCTGA AGATCGACAT 2281 CCATGTCATC ATCCCGTATG AAGGTCTGAG CGGCGACCAA ATGGGCCAGA TCGAAAAAAT 2341 TTTTAAGGTG GTGTACCCTG TGGATGATCA TCACTTTAAG GTGATCCTGC ACTATGGCAC 2401 ACTGGTAATC GACGGGGTTA CGCCGAACAT GATCGACTAT TTCGGACGGC CGTATGAAGG 2461 CATCGCCGTG TTCGACGGCA AAAAGATCAC TGTAACAGGG ACCCTGTGGA ACGGCAACAA 2521 AATTATCGAC GAGCGCCTGA TCAACCCCGA CGGCTCCCTG CTGTTCCGAG TAACCATCAA 2581 CGGAGTGACC GGCTGGCGGC TGTGCGAACG CATTCTGGCG TAAGGCCGCG ACTCTAGCAT 2641 TATCCCTAAT ACCTGCCACC CCACTCTTAA TCAGTGGTGG AAGAACGGTC TCAGAACTGT 2701 TTGTTTCAAT TGGCCATTTA AGTTTAGTAG TAAAAGACTG GTTAATGATA ACAATGCATC 2761 GTAAAACCTT CAGAAGGAAA GGAGAATGTT TTGTGGACCA CTTTGGTTTT CTTTTTTGCG 2821 TGTGGCAGTT TTAAGTTATT AGTTTTTAAA ATCAGTACTT TTTAATGGAA ACAACTTGAC 2881 CAAAAATTTG TCACAGAATT TTGAGACCCA TTAAAAAAGT TAAATGAGAA ACCTGTGTGT 2941 TCCTTTGGTC AACACCGAGA CATTTAGGTG AAAGACATCT AATTCTGGTT TTACGAATCT 3001 GGAAACTTCT TGAAAATGTA ATTCTTGAGT TAACACTTCT GGGTGGAGAA TAGGGTTGTT 3061 TTCCCCCCAC ATAATTGGAA GGGGAAGGAA TATCATTTAA AGCTATGGGA GGGTTTCTTT 3121 GATTACAACA CTGGAGAGAA ATGCAGCATG TTGCTGATTG CCTGTCACTA AAACAGGCCA 3181 AAAACTGAGT CCTTGGGTTG CATAGAAAGC TTCATGTTGC TAAACCAATG TTAAGTGAAT 3241 CTTTGGAAAC AAAATGTTTC CAAATTACTG GGATGTGCAT GTTGAAACGT GGGTTAATTA 3301 ACTAGCCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT 3361 AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA 3421 AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT 3481 TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTTC TTCTAGTGTA 3541 GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT 3601 AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC 3661 AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA 3721 GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA 3781 AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG 3841 AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT 3901 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG 3961 CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT 4021 TGCTCACATG TTCTTAATTA AATTTTTCAA AAGTAGTTGA CAATTAATCA TCGGCATAGT 4081 ATATCGGCAT AGTATAATAC GACTCACTAT AGGAGGGCCA TCATGGCCAA GTTGACCAGT 4141 GCTGTCCCAG TGCTCACAGC CAGGGATGTG GCTGGAGCTG TTGAGTTCTG GACTGACAGG 4201 TTGGGGTTCT CCAGAGATTT TGTGGAGGAT GACTTTGCAG GTGTGGTCAG AGATGATGTC 4261 ACCCTGTTCA TCTCAGCAGT CCAGGACCAG GTGGTGCCTG ACAACACCCT GGCTTGGGTG 4321 TGGGTGAGAG GACTGGATGA GCTGTATGCT GAGTGGAGTG AGGTGGTCTC CACCAACTTC 4381 AGGGATGCCA GTGGCCCTGC CATGACAGAG ATTGGAGAGC AGCCCTGGGG GAGAGAGTTT 4441 GCCCTGAGAG ACCCAGCAGG CAACTGTGTG CACTTTGTGG CAGAGGAGCA GGACTGAGGA 4501 TAAGAATTGT AACAAAAAAC CCCGCCCCGG CGGGGTTTTT TGTTAATTAA //