LOCUS       SPHISH34     2506 bp    DNA             INV       12-SEP-1993
DEFINITION  Sea urchin (S. purpuratus) late embryonic H3 and H4 histone genes.
ACCESSION   X03952
NID         g10256
KEYWORDS    histone; histone H3; histone H4; inverted repeat.
SOURCE      purple urchin.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Echinodermata;
            Echinozoa; Echinoidea; Euechinoidea; Echinacea; Echinoida;
            Strongylocentrotidae; Strongylocentrotus.
REFERENCE   1  (bases 1 to 2506)
  AUTHORS   Kaumeyer,J.F. and Weinberg,E.S.
  TITLE     Sequence, organization and expression of late embryonic H3 and H4
            histone genes from the sea urchin, Strongylocentrotus purpuratus
  JOURNAL   Nucleic Acids Res. 14 (11), 4557-4576 (1986)
  MEDLINE   86232591
FEATURES             Location/Qualifiers
     source          1..2506
                     /organism="Strongylocentrotus purpuratus"
                     /db_xref="taxon:7668"
     repeat_unit     609..618
                     /note="inverted repeat A"
     repeat_unit     625..647
                     /note="inverted repeat B"
     CDS             complement(678..989)
                     /note="histone H4 (aa 1-103)"
                     /codon_start=1
                     /db_xref="PID:g10257"
                     /db_xref="SWISS-PROT:P02306"
                     /translation="MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGV
                     KRISGLIYEETRGVLKVFLENVIRDAVTYCEHAKRKTVTAMDVVYALKRQGRTLYGFG
                     G"
     repeat_region   1034..1038
                     /note="direct repeat 1"
     promoter        complement(1048..1055)
                     /note="TATA-like sequence"
     repeat_region   1071..1075
                     /note="direct repeat 1"
     promoter        1865..1871
                     /note="CAAT-like sequence"
     promoter        1898..1904
                     /note="CAAT-like sequence"
     promoter        1924..1930
                     /note="CAAT-like sequence"
     promoter        1947..1954
                     /note="TATA-like sequence"
     CDS             2006..2416
                     /note="histone H3 (aa 1-136)"
                     /codon_start=1
                     /db_xref="PID:g10258"
                     /db_xref="SWISS-PROT:P06352"
                     /translation="MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRP
                     GTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTELRFQSSAVMALQEASEAYLV
                     RLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA"
     repeat_unit     2444..2466
                     /note="inverted repeat B'"
     repeat_unit     2473..2481
                     /note="inverted repeat A'"
BASE COUNT      720 a    592 c    486 g    708 t
ORIGIN
        1 aattcttagg ggagggggcc caattttttt ttttttttac cccctacgac gaccacccca
       61 tgggacactt accccccgga caactaggct aggcctagtg ggtagttatc atttatgaca
      121 tgaaattgta catattatat agactataaa taattcataa aaatagtgtg tggcaaaaat
      181 agtcaactgt gaccgattgc caaacaaaaa caaatcaaga agaagaatat cctaggccta
      241 ttttgagaat gagaaatcca agggttaggc ctactgttag tgagcactaa ttttttgttt
      301 cttccctcaa tcaccccccc cccccccttt tgtttaaaaa aaatggcatg taacaggttc
      361 cctggccctg cctgccttac gagtgagaac attggggaaa gttgggtaat tatcaggcct
      421 ggaaatcaca gatcatttct tatcactctc cagagtagtc tagctctaga cctagactag
      481 atctagtttt agactcactt tcgaattaat actttcaaaa gccccgcatt cttcggtcac
      541 cacacacact tcatttatac tagactctag atcatgtgct gtcaaactta gttagagatt
      601 cttattgatt ctttcttgga tatttggtgg ctctaaaaag agccgtttga tatgccgagt
      661 taagaggtat ttccttctta accgccgaat ccgtacaggg tacgtccttg gcgcttcaga
      721 gcgtagacga cgtccatggc agtgacggtc tttctcttgg cgtgctcgca gtaagtgacg
      781 gcatcacgga tgacgttctc aaggaagacc ttgaggaccc cacgggtctc ttcgtagatg
      841 agaccgctga tacgcttgac acctccacgt cgggcaagac ggcgaatggc gggcttggtg
      901 attccttgga tgttatcacg caaaaccttg cgatgacgct tggctcctcc ctttccgagt
      961 ccttttcctc ctttaccacg tccagacata ttgacttgta gatttgacaa ttgaaaaaat
     1021 ctgtactgaa aatcgagttt cggcgggtat atataactcc tttgcggacg cgagtgtact
     1081 aaataatttc tattgagagt gagccgccta cggtcaaggg gactaaaatc tcgtcgcttc
     1141 gtcgatgcaa tatttgcata aactatcgca cgttcgttat gaacaaatcg tctttactca
     1201 gatcggtata aaaaatcgtc aagaacagag ttccggttat caaatactat ttaaacactt
     1261 taattcatct caatacagta tttaagcgta ttagtttaca tattgcatag tagacaagag
     1321 aacattaata tattttactt gaccaaaatc gtttacaaca ggtcgccagc accttatgaa
     1381 taattcatca ggactccttg aagtcgtttg ctcgccaaaa tagaaaacaa cgtggaaata
     1441 ttctttcaat ttctactttt gtttggtgaa aaaatagtcc atattatttt ctattcaata
     1501 tcgatcattt cattttcaat tttgatggat gcttttattg ataaatgata acatacttct
     1561 caaaaagcca aaatgtcgac gacaggaaag ccggtttgtt aatgaattat tcatttttac
     1621 agcgatctcg aaaatccaat ccagtacgat ctttcttctc ttaaaaccta atgaatatta
     1681 cgtattagcg tataaatttc tgtaatacat ttacaaatac tactttacag cgataacgat
     1741 gcaatttagg atgtaattaa gtttaatatt tcataatctt ataacgttta ctacaatgac
     1801 catgtacaaa atcacacgac gaggcccgaa gaaatcatga atatattaag aaagcggaca
     1861 gtacccaatc acatactgtg cttgatatag cgaatggcca atcactgctt gtcgcacgac
     1921 taaccaatca tcttcgtcaa ttttgatata aatacgagtg cgggattttt gaaacatcag
     1981 ttgatatcac attcagcaaa tcaaaatggc ccgtaccaag cagaccgctc gcaagtccac
     2041 cggaggaaag gctcctcgca agcagctggc caccaaggca gctcgcaagt ccgccccagc
     2101 cactggcgga gtcaagaagc cccatcgtta caggcccggt accgtcgctc tccgtgagat
     2161 ccgtcgctac cagaagagta ccgagctgct catccgcaag ctccccttcc agcgtctggt
     2221 ccgtgagatc gctcaggact tcaagaccga gctccgtttc cagagctctg ccgtcatggc
     2281 tcttcaggag gccagcgaag cctacttggt ccgtcttttc gaggacacca acctttgcgc
     2341 catccacgcc aagcgtgtta ccatcatgcc aaaggacatc cagctggccc gccgcatccg
     2401 tggcgagcga gcttagattg tcagcttgac atctaaataa accaacggct ctttttagag
     2461 ccaccacatt tccaagaaag atcaaattct aaactctgcg tagatc
//