LOCUS SPHISH34 2506 bp DNA INV 12-SEP-1993
DEFINITION Sea urchin (S. purpuratus) late embryonic H3 and H4 histone genes.
ACCESSION X03952
NID g10256
KEYWORDS histone; histone H3; histone H4; inverted repeat.
SOURCE purple urchin.
ORGANISM Strongylocentrotus purpuratus
Eukaryotae; mitochondrial eukaryotes; Metazoa; Echinodermata;
Echinozoa; Echinoidea; Euechinoidea; Echinacea; Echinoida;
Strongylocentrotidae; Strongylocentrotus.
REFERENCE 1 (bases 1 to 2506)
AUTHORS Kaumeyer,J.F. and Weinberg,E.S.
TITLE Sequence, organization and expression of late embryonic H3 and H4
histone genes from the sea urchin, Strongylocentrotus purpuratus
JOURNAL Nucleic Acids Res. 14 (11), 4557-4576 (1986)
MEDLINE 86232591
FEATURES Location/Qualifiers
source 1..2506
/organism="Strongylocentrotus purpuratus"
/db_xref="taxon:7668"
repeat_unit 609..618
/note="inverted repeat A"
repeat_unit 625..647
/note="inverted repeat B"
CDS complement(678..989)
/note="histone H4 (aa 1-103)"
/codon_start=1
/db_xref="PID:g10257"
/db_xref="SWISS-PROT:P02306"
/translation="MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGV
KRISGLIYEETRGVLKVFLENVIRDAVTYCEHAKRKTVTAMDVVYALKRQGRTLYGFG
G"
repeat_region 1034..1038
/note="direct repeat 1"
promoter complement(1048..1055)
/note="TATA-like sequence"
repeat_region 1071..1075
/note="direct repeat 1"
promoter 1865..1871
/note="CAAT-like sequence"
promoter 1898..1904
/note="CAAT-like sequence"
promoter 1924..1930
/note="CAAT-like sequence"
promoter 1947..1954
/note="TATA-like sequence"
CDS 2006..2416
/note="histone H3 (aa 1-136)"
/codon_start=1
/db_xref="PID:g10258"
/db_xref="SWISS-PROT:P06352"
/translation="MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRP
GTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTELRFQSSAVMALQEASEAYLV
RLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA"
repeat_unit 2444..2466
/note="inverted repeat B'"
repeat_unit 2473..2481
/note="inverted repeat A'"
BASE COUNT 720 a 592 c 486 g 708 t
ORIGIN
1 aattcttagg ggagggggcc caattttttt ttttttttac cccctacgac gaccacccca
61 tgggacactt accccccgga caactaggct aggcctagtg ggtagttatc atttatgaca
121 tgaaattgta catattatat agactataaa taattcataa aaatagtgtg tggcaaaaat
181 agtcaactgt gaccgattgc caaacaaaaa caaatcaaga agaagaatat cctaggccta
241 ttttgagaat gagaaatcca agggttaggc ctactgttag tgagcactaa ttttttgttt
301 cttccctcaa tcaccccccc cccccccttt tgtttaaaaa aaatggcatg taacaggttc
361 cctggccctg cctgccttac gagtgagaac attggggaaa gttgggtaat tatcaggcct
421 ggaaatcaca gatcatttct tatcactctc cagagtagtc tagctctaga cctagactag
481 atctagtttt agactcactt tcgaattaat actttcaaaa gccccgcatt cttcggtcac
541 cacacacact tcatttatac tagactctag atcatgtgct gtcaaactta gttagagatt
601 cttattgatt ctttcttgga tatttggtgg ctctaaaaag agccgtttga tatgccgagt
661 taagaggtat ttccttctta accgccgaat ccgtacaggg tacgtccttg gcgcttcaga
721 gcgtagacga cgtccatggc agtgacggtc tttctcttgg cgtgctcgca gtaagtgacg
781 gcatcacgga tgacgttctc aaggaagacc ttgaggaccc cacgggtctc ttcgtagatg
841 agaccgctga tacgcttgac acctccacgt cgggcaagac ggcgaatggc gggcttggtg
901 attccttgga tgttatcacg caaaaccttg cgatgacgct tggctcctcc ctttccgagt
961 ccttttcctc ctttaccacg tccagacata ttgacttgta gatttgacaa ttgaaaaaat
1021 ctgtactgaa aatcgagttt cggcgggtat atataactcc tttgcggacg cgagtgtact
1081 aaataatttc tattgagagt gagccgccta cggtcaaggg gactaaaatc tcgtcgcttc
1141 gtcgatgcaa tatttgcata aactatcgca cgttcgttat gaacaaatcg tctttactca
1201 gatcggtata aaaaatcgtc aagaacagag ttccggttat caaatactat ttaaacactt
1261 taattcatct caatacagta tttaagcgta ttagtttaca tattgcatag tagacaagag
1321 aacattaata tattttactt gaccaaaatc gtttacaaca ggtcgccagc accttatgaa
1381 taattcatca ggactccttg aagtcgtttg ctcgccaaaa tagaaaacaa cgtggaaata
1441 ttctttcaat ttctactttt gtttggtgaa aaaatagtcc atattatttt ctattcaata
1501 tcgatcattt cattttcaat tttgatggat gcttttattg ataaatgata acatacttct
1561 caaaaagcca aaatgtcgac gacaggaaag ccggtttgtt aatgaattat tcatttttac
1621 agcgatctcg aaaatccaat ccagtacgat ctttcttctc ttaaaaccta atgaatatta
1681 cgtattagcg tataaatttc tgtaatacat ttacaaatac tactttacag cgataacgat
1741 gcaatttagg atgtaattaa gtttaatatt tcataatctt ataacgttta ctacaatgac
1801 catgtacaaa atcacacgac gaggcccgaa gaaatcatga atatattaag aaagcggaca
1861 gtacccaatc acatactgtg cttgatatag cgaatggcca atcactgctt gtcgcacgac
1921 taaccaatca tcttcgtcaa ttttgatata aatacgagtg cgggattttt gaaacatcag
1981 ttgatatcac attcagcaaa tcaaaatggc ccgtaccaag cagaccgctc gcaagtccac
2041 cggaggaaag gctcctcgca agcagctggc caccaaggca gctcgcaagt ccgccccagc
2101 cactggcgga gtcaagaagc cccatcgtta caggcccggt accgtcgctc tccgtgagat
2161 ccgtcgctac cagaagagta ccgagctgct catccgcaag ctccccttcc agcgtctggt
2221 ccgtgagatc gctcaggact tcaagaccga gctccgtttc cagagctctg ccgtcatggc
2281 tcttcaggag gccagcgaag cctacttggt ccgtcttttc gaggacacca acctttgcgc
2341 catccacgcc aagcgtgtta ccatcatgcc aaaggacatc cagctggccc gccgcatccg
2401 tggcgagcga gcttagattg tcagcttgac atctaaataa accaacggct ctttttagag
2461 ccaccacatt tccaagaaag atcaaattct aaactctgcg tagatc
//