LOCUS SUSEGFI 3362 bp DNA INV 09-FEB-1995
DEFINITION Strongylocentrotus purpuratus fibropellin Ia and alternatively
spliced fibropellin Ib (EGFI) mRNA, complete cds.
ACCESSION L08692
NID g161465
KEYWORDS epidermal growth factor (EGF) repeat-containing protein;
extracellular matrix protein; fibropellin.
SOURCE Strongylocentrotus purpuratus (tissue library: lambda gt11/lambda
EMBL3) gastrula DNA.
ORGANISM Strongylocentrotus purpuratus
Eukaryotae; mitochondrial eukaryotes; Metazoa; Echinodermata;
Echinozoa; Echinoidea; Euechinoidea; Echinacea; Echinoida;
Strongylocentrotidae; Strongylocentrotus.
REFERENCE 1 (bases 1 to 3362)
AUTHORS Delgadillo-Reynoso,M.G., Rollo,D.R., Hursh,D.A. and Raff,R.A.
TITLE Structural analysis of the uEGF gene in the sea urchin
strongylocentrotus purpuratus reveals more similarity to vertebrate
than to invertebrate genes with EGF-like repeats
JOURNAL J. Mol. Evol. 29 (4), 314-327 (1989)
MEDLINE 90112459
FEATURES Location/Qualifiers
source 1..3362
/organism="Strongylocentrotus purpuratus"
/db_xref="taxon:7668"
/dev_stage="gastrula"
/tissue_lib="lambda gt11/lambda EMBL3"
gene join(133..1570,2483..3327)
/gene="EGFI"
sig_peptide 133..186
/gene="EGFI"
CDS 133..3327
/gene="EGFI"
/note="domains: C1s-like (aa: 57..175), avidin-like (aa.
980..1108); glycosylation sites: (aa. 74..76, 180..182, &
895..897); EGF-like repeats: #1 (aa. 19..56) #2-21 (aa.
176..976)"
/codon_start=1
/product="fibropellin Ia"
/db_xref="PID:g161467"
/translation="MRTWLLAVLLLSVIAVTYGQGECDSDPCENGSTCQEGEGSYICQ
CPMGYDGQNCDRFTGSNCGYNVFDANGMIDSPNYPAMYNNRADCLYLVRITKARSITF
TIEDFMTEVFKDVVEYGIGPEADFNQALGSFEGNLTQDDVIPAPFTVQGDQAWFIFST
DRNIVNRGFRITFSSDGDDCDPNLCQNGAACTDLVNDYACTCPPGFTGRNCEIDIDEC
ASDPCQNGGACVDGVNGYVCNCVPGFDGDECENNINECASSPCLNGGICVDGVNMFEC
TCLAGFTGVRCEVNIDECASAPCQNGGICIDGINGYTCSCPLGFSGDNCENNDDECSS
IPCLNGGTCVDLVNAYMCVCAPGWTGPTCADNIDECASAPCQNGGVCIDGVNGYMCDC
QPGYTGTHCETDIDECARPPCQNGGDCVDGVNGYVCICAPGFDGLNCENNIDECASRP
CQNGAVCVDGVNGFVCTCSAGYTGVLCETDINECASMPCLNGGVCTDLVNGYICTCAA
GFEGTNCETDTDECASFPCQNGATCTDQVNGYVCTCVPGYTGVLCETDINECASFPCL
NGGTCNDQVNGYVCVCAQDTSVSTCETDRDECASAPCLNGGACMDVVNGFVCTCLPGW
EGTNCEINTDECASSPCMNGGLCVDQVNSYVCFCLPGFTGIHCGTEIDECASSPCLNG
GQCIDRVDSYECVCAAGYTAVRCQINIDECASAPCQNGGVCVDGVNGYVCNCAPGYTG
DNCETEIDECASMPCLNGGACIEMVNGYTCQCVAGYTGVICETDIDECASAPCQNGGV
CTDTINGYICACVPGFTGSNCETNIDECASDPCLNGGICVDGVNGFVCQCPPNYSGTY
CEISLDACRSMPCQNGATCVNVGADYVCECVPGYAGQNCEIDINECASLPCQNGGLCI
DGIAGYTCQCRLGYIGVNCEEVGFCDLEGMWYNECNDQVTITKTSTGMMLGDYMTYNE
RALGYAAPTVVVGYASNNYDFPSFGFTVVRDNGQSTTSWTGQCHLCDGEEVLYTTWIN
TNMVSTCQDIKKSNMVGQDKWTRYEQSIAPQPDA"
CDS join(133..1570,2483..3327)
/gene="EGFI"
/note="alternatively spliced product not containing
EGF-like repeats 10-17."
/codon_start=1
/product="fibropellin Ib"
/db_xref="PID:g161466"
/translation="MRTWLLAVLLLSVIAVTYGQGECDSDPCENGSTCQEGEGSYICQ
CPMGYDGQNCDRFTGSNCGYNVFDANGMIDSPNYPAMYNNRADCLYLVRITKARSITF
TIEDFMTEVFKDVVEYGIGPEADFNQALGSFEGNLTQDDVIPAPFTVQGDQAWFIFST
DRNIVNRGFRITFSSDGDDCDPNLCQNGAACTDLVNDYACTCPPGFTGRNCEIDIDEC
ASDPCQNGGACVDGVNGYVCNCVPGFDGDECENNINECASSPCLNGGICVDGVNMFEC
TCLAGFTGVRCEVNIDECASAPCQNGGICIDGINGYTCSCPLGFSGDNCENNDDECSS
IPCLNGGTCVDLVNAYMCVCAPGWTGPTCADNIDECASAPCQNGGVCIDGVNGYMCDC
QPGYTGTHCETDIDECARPPCQNGGDCVDGVNGYVCICAPGFDGLNCENNIDECASRP
CQNGAVCVDGVNGFVCTCSAGYTGVLCETDIDECASAPCQNGGVCTDTINGYICACVP
GFTGSNCETNIDECASDPCLNGGICVDGVNGFVCQCPPNYSGTYCEISLDACRSMPCQ
NGATCVNVGADYVCECVPGYAGQNCEIDINECASLPCQNGGLCIDGIAGYTCQCRLGY
IGVNCEEVGFCDLEGMWYNECNDQVTITKTSTGMMLGDYMTYNERALGYAAPTVVVGY
ASNNYDFPSFGFTVVRDNGQSTTSWTGQCHLCDGEEVLYTTWINTNMVSTCQDIKKSN
MVGQDKWTRYEQSIAPQPDA"
BASE COUNT 822 a 770 c 891 g 879 t
ORIGIN
1 ccttggtata ttgtggacta cagcgcttga agcaggttcg tttgttggac attgttcagg
61 tgccggtttc ctcatccatc acgctcctta ccagtgactt ttgttttctt cgctggaaaa
121 gaacgcttca aaatgaggac gtggttacta gctgtattgc ttctcagcgt gatagctgtt
181 acatacgggc aaggtgaatg tgacagcgat ccctgtgaaa atggatcaac ctgtcaggag
241 ggtgaagggt cgtatatctg ccagtgtccc atgggatacg atggacaaaa ctgcgaccgt
301 ttcacaggtt caaactgcgg atacaatgtc ttcgatgcca acggtatgat cgattcacct
361 aactacccgg ccatgtacaa caaccgtgcc gattgtcttt atcttgttcg tatcaccaag
421 gctcgcagca tcactttcac aatcgaagac ttcatgactg aggtcttcaa agacgttgtc
481 gagtatggta ttgggccaga ggcagacttc aaccaggctc tcggttcgtt cgaaggtaac
541 ctgacacaag acgacgtcat cccagctcct ttcactgtcc agggcgatca ggcttggttc
601 attttcagta ctgatcgtaa tatcgtcaac aggggattca gaattacatt ctcatcagat
661 ggagacgatt gtgatcccaa cctttgtcag aatggcgctg cctgtactga cctcgtgaat
721 gattatgctt gtacctgccc tccaggattc acgggtagaa actgcgaaat cgatattgac
781 gaatgtgcca gtgatccctg tcagaatggt ggcgcctgtg tcgatggagt caacggctat
841 gtctgtaact gtgtcccagg attcgacgga gatgaatgtg aaaacaatat caatgagtgt
901 gcaagcagcc cttgtcttaa cggaggaatc tgtgttgatg gcgttaacat gttcgagtgt
961 acctgtttag ccggcttcac tggcgtacga tgtgaagtca acattgatga atgtgcaagt
1021 gccccttgtc agaatggtgg tatctgtatt gatggtatca atggatacac ctgctcatgt
1081 ccgctcggct tctctggaga taactgtgaa aacaatgatg atgaatgctc cagcatccct
1141 tgtttaaatg gtggaacctg tgtggatctt gttaacgcct acatgtgtgt ctgtgccccc
1201 ggctggaccg gccctacctg cgctgacaac attgacgagt gtgctagtgc cccttgccag
1261 aacggaggtg tgtgcattga cggtgtgaac ggatacatgt gtgactgtca acctggatac
1321 accggaaccc attgcgaaac tgatatcgac gagtgcgcaa ggcccccttg ccaaaatgga
1381 ggtgactgtg tggatggagt caacggatac gtctgcatct gcgctcctgg attcgacgga
1441 ctcaactgcg agaacaatat tgacgaatgc gccagccgtc cctgccagaa cggagctgtc
1501 tgcgttgatg gtgtaaacgg gttcgtctgc acctgctctg ctggctacac aggagtcctt
1561 tgtgaaaccg atatcaacga atgtgctagc atgccttgtc tgaatggtgg tgtttgcacg
1621 gacctagtga acgggtacat ctgcacatgc gcagcaggct tcgagggaac taattgcgag
1681 acagacaccg acgaatgtgc ttcattccca tgtcaaaacg gagccacgtg tacagaccag
1741 gttaatggat acgtgtgcac atgtgttcca ggatacacgg gagtcctctg cgaaacagat
1801 attaacgaat gtgcctcatt tccttgtctg aatggaggta cttgtaacga tcaagtcaat
1861 ggatacgtgt gcgtgtgcgc acaggatact tcggtgtcaa cctgtgaaac agatcgtgac
1921 gagtgtgcat ctgccccatg tttgaatggt ggagcttgta tggacgtagt gaatggattt
1981 gtatgtactt gcttacctgg atgggaggga accaattgtg aaatcaacac ggacgagtgt
2041 gcaagctctc catgcatgaa tggtggtctc tgtgttgacc aggtcaatag ctacgtctgc
2101 ttctgtctcc ctggtttcac tggcattcat tgcggaaccg aaattgacga gtgtgcaagc
2161 agcccatgtc taaacggagg acagtgtatc gaccgagttg actcgtacga gtgcgtttgc
2221 gctgctggct acactgctgt cagatgccaa atcaatatcg acgaatgtgc ttctgcccct
2281 tgtcaaaatg gcggagtgtg tgttgatgga gttaatggtt acgtgtgtaa ttgtgcacca
2341 ggctacactg gcgataactg tgaaactgaa atcgacgaat gtgcttccat gccttgtttg
2401 aacggaggag cgtgcattga aatggttaac ggatacacct gtcagtgtgt agctggctac
2461 actggggtta tttgcgagac tgatattgac gagtgtgcca gtgccccttg ccagaatggt
2521 ggtgtgtgta ctgataccat taacggatat atctgtgcct gtgtgccagg attcaccgga
2581 agcaactgcg agactaacat cgacgagtgt gctagcgacc cctgtctaaa tggaggtatc
2641 tgtgtggatg gagtcaatgg tttcgtctgc cagtgccctc ccaactactc tggaacttat
2701 tgtgaaatct cacttgatgc atgcaggagt atgccatgcc agaatggcgc cacgtgcgta
2761 aacgttggag ccgactacgt ctgcgaatgc gtaccaggat atgctggaca aaactgtgaa
2821 attgacatca acgagtgtgc tagtcttcca tgccaaaacg gcggtctatg tattgatggt
2881 attgctggat acacctgtca gtgccgtcta ggatacatcg gtgtcaactg cgaggaagtt
2941 ggtttctgcg acttggaggg tatgtggtac aacgagtgca atgatcaggt caccatcacc
3001 aagacctcta caggaatgat gcttggagat tacatgactt acaatgaacg tgccctcgga
3061 tacgcagccc caaccgtcgt ggtcggttac gccagcaaca actatgactt cccatctttc
3121 ggtttcacgg tggtccgtga caatggtcag tctactacca gttggaccgg tcagtgccat
3181 ctatgtgacg gtgaagaggt tctctacacc acctggatca acaccaacat ggtcagcacc
3241 tgccaggaca tcaagaaatc aaacatggtt ggccaggaca aatggacacg ttatgaacag
3301 agcatcgcac ctcagcccga tgcataggca atttaactac attaatattg taacatgaat
3361 ac
//