LOCUS       SUSEGFI      3362 bp    DNA             INV       09-FEB-1995
DEFINITION  Strongylocentrotus purpuratus fibropellin Ia and alternatively
            spliced fibropellin Ib (EGFI) mRNA, complete cds.
ACCESSION   L08692
NID         g161465
KEYWORDS    epidermal growth factor (EGF) repeat-containing protein;
            extracellular matrix protein; fibropellin.
SOURCE      Strongylocentrotus purpuratus (tissue library: lambda gt11/lambda
            EMBL3) gastrula DNA.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Echinodermata;
            Echinozoa; Echinoidea; Euechinoidea; Echinacea; Echinoida;
            Strongylocentrotidae; Strongylocentrotus.
REFERENCE   1  (bases 1 to 3362)
  AUTHORS   Delgadillo-Reynoso,M.G., Rollo,D.R., Hursh,D.A. and Raff,R.A.
  TITLE     Structural analysis of the uEGF gene in the sea urchin
            strongylocentrotus purpuratus reveals more similarity to
            vertebrate than to invertebrate genes with EGF-like repeats
  JOURNAL   J. Mol. Evol. 29 (4), 314-327 (1989)
  MEDLINE   90112459
FEATURES             Location/Qualifiers
     source          1..3362
                     /organism="Strongylocentrotus purpuratus"
                     /db_xref="taxon:7668"
                     /dev_stage="gastrula"
                     /tissue_lib="lambda gt11/lambda EMBL3"
     gene            join(133..1570,2483..3327)
                     /gene="EGFI"
     sig_peptide     133..186
                     /gene="EGFI"
     CDS             133..3327
                     /gene="EGFI"
                     /note="domains: C1s-like (aa: 57..175), avidin-like (aa.
                     980..1108); glycosylation sites: (aa. 74..76, 180..182, &
                     895..897); EGF-like repeats: #1 (aa. 19..56) #2-21 (aa.
                     176..976)"
                     /codon_start=1
                     /product="fibropellin Ia"
                     /db_xref="PID:g161467"
                     /translation="MRTWLLAVLLLSVIAVTYGQGECDSDPCENGSTCQEGEGSYICQ
                     CPMGYDGQNCDRFTGSNCGYNVFDANGMIDSPNYPAMYNNRADCLYLVRITKARSITF
                     TIEDFMTEVFKDVVEYGIGPEADFNQALGSFEGNLTQDDVIPAPFTVQGDQAWFIFST
                     DRNIVNRGFRITFSSDGDDCDPNLCQNGAACTDLVNDYACTCPPGFTGRNCEIDIDEC
                     ASDPCQNGGACVDGVNGYVCNCVPGFDGDECENNINECASSPCLNGGICVDGVNMFEC
                     TCLAGFTGVRCEVNIDECASAPCQNGGICIDGINGYTCSCPLGFSGDNCENNDDECSS
                     IPCLNGGTCVDLVNAYMCVCAPGWTGPTCADNIDECASAPCQNGGVCIDGVNGYMCDC
                     QPGYTGTHCETDIDECARPPCQNGGDCVDGVNGYVCICAPGFDGLNCENNIDECASRP
                     CQNGAVCVDGVNGFVCTCSAGYTGVLCETDINECASMPCLNGGVCTDLVNGYICTCAA
                     GFEGTNCETDTDECASFPCQNGATCTDQVNGYVCTCVPGYTGVLCETDINECASFPCL
                     NGGTCNDQVNGYVCVCAQDTSVSTCETDRDECASAPCLNGGACMDVVNGFVCTCLPGW
                     EGTNCEINTDECASSPCMNGGLCVDQVNSYVCFCLPGFTGIHCGTEIDECASSPCLNG
                     GQCIDRVDSYECVCAAGYTAVRCQINIDECASAPCQNGGVCVDGVNGYVCNCAPGYTG
                     DNCETEIDECASMPCLNGGACIEMVNGYTCQCVAGYTGVICETDIDECASAPCQNGGV
                     CTDTINGYICACVPGFTGSNCETNIDECASDPCLNGGICVDGVNGFVCQCPPNYSGTY
                     CEISLDACRSMPCQNGATCVNVGADYVCECVPGYAGQNCEIDINECASLPCQNGGLCI
                     DGIAGYTCQCRLGYIGVNCEEVGFCDLEGMWYNECNDQVTITKTSTGMMLGDYMTYNE
                     RALGYAAPTVVVGYASNNYDFPSFGFTVVRDNGQSTTSWTGQCHLCDGEEVLYTTWIN
                     TNMVSTCQDIKKSNMVGQDKWTRYEQSIAPQPDA"
     CDS             join(133..1570,2483..3327)
                     /gene="EGFI"
                     /note="alternatively spliced product not containing
                     EGF-like repeats 10-17."
                     /codon_start=1
                     /product="fibropellin Ib"
                     /db_xref="PID:g161466"
                     /translation="MRTWLLAVLLLSVIAVTYGQGECDSDPCENGSTCQEGEGSYICQ
                     CPMGYDGQNCDRFTGSNCGYNVFDANGMIDSPNYPAMYNNRADCLYLVRITKARSITF
                     TIEDFMTEVFKDVVEYGIGPEADFNQALGSFEGNLTQDDVIPAPFTVQGDQAWFIFST
                     DRNIVNRGFRITFSSDGDDCDPNLCQNGAACTDLVNDYACTCPPGFTGRNCEIDIDEC
                     ASDPCQNGGACVDGVNGYVCNCVPGFDGDECENNINECASSPCLNGGICVDGVNMFEC
                     TCLAGFTGVRCEVNIDECASAPCQNGGICIDGINGYTCSCPLGFSGDNCENNDDECSS
                     IPCLNGGTCVDLVNAYMCVCAPGWTGPTCADNIDECASAPCQNGGVCIDGVNGYMCDC
                     QPGYTGTHCETDIDECARPPCQNGGDCVDGVNGYVCICAPGFDGLNCENNIDECASRP
                     CQNGAVCVDGVNGFVCTCSAGYTGVLCETDIDECASAPCQNGGVCTDTINGYICACVP
                     GFTGSNCETNIDECASDPCLNGGICVDGVNGFVCQCPPNYSGTYCEISLDACRSMPCQ
                     NGATCVNVGADYVCECVPGYAGQNCEIDINECASLPCQNGGLCIDGIAGYTCQCRLGY
                     IGVNCEEVGFCDLEGMWYNECNDQVTITKTSTGMMLGDYMTYNERALGYAAPTVVVGY
                     ASNNYDFPSFGFTVVRDNGQSTTSWTGQCHLCDGEEVLYTTWINTNMVSTCQDIKKSN
                     MVGQDKWTRYEQSIAPQPDA"
BASE COUNT      822 a    770 c    891 g    879 t
ORIGIN
        1 ccttggtata ttgtggacta cagcgcttga agcaggttcg tttgttggac attgttcagg
       61 tgccggtttc ctcatccatc acgctcctta ccagtgactt ttgttttctt cgctggaaaa
      121 gaacgcttca aaatgaggac gtggttacta gctgtattgc ttctcagcgt gatagctgtt
      181 acatacgggc aaggtgaatg tgacagcgat ccctgtgaaa atggatcaac ctgtcaggag
      241 ggtgaagggt cgtatatctg ccagtgtccc atgggatacg atggacaaaa ctgcgaccgt
      301 ttcacaggtt caaactgcgg atacaatgtc ttcgatgcca acggtatgat cgattcacct
      361 aactacccgg ccatgtacaa caaccgtgcc gattgtcttt atcttgttcg tatcaccaag
      421 gctcgcagca tcactttcac aatcgaagac ttcatgactg aggtcttcaa agacgttgtc
      481 gagtatggta ttgggccaga ggcagacttc aaccaggctc tcggttcgtt cgaaggtaac
      541 ctgacacaag acgacgtcat cccagctcct ttcactgtcc agggcgatca ggcttggttc
      601 attttcagta ctgatcgtaa tatcgtcaac aggggattca gaattacatt ctcatcagat
      661 ggagacgatt gtgatcccaa cctttgtcag aatggcgctg cctgtactga cctcgtgaat
      721 gattatgctt gtacctgccc tccaggattc acgggtagaa actgcgaaat cgatattgac
      781 gaatgtgcca gtgatccctg tcagaatggt ggcgcctgtg tcgatggagt caacggctat
      841 gtctgtaact gtgtcccagg attcgacgga gatgaatgtg aaaacaatat caatgagtgt
      901 gcaagcagcc cttgtcttaa cggaggaatc tgtgttgatg gcgttaacat gttcgagtgt
      961 acctgtttag ccggcttcac tggcgtacga tgtgaagtca acattgatga atgtgcaagt
     1021 gccccttgtc agaatggtgg tatctgtatt gatggtatca atggatacac ctgctcatgt
     1081 ccgctcggct tctctggaga taactgtgaa aacaatgatg atgaatgctc cagcatccct
     1141 tgtttaaatg gtggaacctg tgtggatctt gttaacgcct acatgtgtgt ctgtgccccc
     1201 ggctggaccg gccctacctg cgctgacaac attgacgagt gtgctagtgc cccttgccag
     1261 aacggaggtg tgtgcattga cggtgtgaac ggatacatgt gtgactgtca acctggatac
     1321 accggaaccc attgcgaaac tgatatcgac gagtgcgcaa ggcccccttg ccaaaatgga
     1381 ggtgactgtg tggatggagt caacggatac gtctgcatct gcgctcctgg attcgacgga
     1441 ctcaactgcg agaacaatat tgacgaatgc gccagccgtc cctgccagaa cggagctgtc
     1501 tgcgttgatg gtgtaaacgg gttcgtctgc acctgctctg ctggctacac aggagtcctt
     1561 tgtgaaaccg atatcaacga atgtgctagc atgccttgtc tgaatggtgg tgtttgcacg
     1621 gacctagtga acgggtacat ctgcacatgc gcagcaggct tcgagggaac taattgcgag
     1681 acagacaccg acgaatgtgc ttcattccca tgtcaaaacg gagccacgtg tacagaccag
     1741 gttaatggat acgtgtgcac atgtgttcca ggatacacgg gagtcctctg cgaaacagat
     1801 attaacgaat gtgcctcatt tccttgtctg aatggaggta cttgtaacga tcaagtcaat
     1861 ggatacgtgt gcgtgtgcgc acaggatact tcggtgtcaa cctgtgaaac agatcgtgac
     1921 gagtgtgcat ctgccccatg tttgaatggt ggagcttgta tggacgtagt gaatggattt
     1981 gtatgtactt gcttacctgg atgggaggga accaattgtg aaatcaacac ggacgagtgt
     2041 gcaagctctc catgcatgaa tggtggtctc tgtgttgacc aggtcaatag ctacgtctgc
     2101 ttctgtctcc ctggtttcac tggcattcat tgcggaaccg aaattgacga gtgtgcaagc
     2161 agcccatgtc taaacggagg acagtgtatc gaccgagttg actcgtacga gtgcgtttgc
     2221 gctgctggct acactgctgt cagatgccaa atcaatatcg acgaatgtgc ttctgcccct
     2281 tgtcaaaatg gcggagtgtg tgttgatgga gttaatggtt acgtgtgtaa ttgtgcacca
     2341 ggctacactg gcgataactg tgaaactgaa atcgacgaat gtgcttccat gccttgtttg
     2401 aacggaggag cgtgcattga aatggttaac ggatacacct gtcagtgtgt agctggctac
     2461 actggggtta tttgcgagac tgatattgac gagtgtgcca gtgccccttg ccagaatggt
     2521 ggtgtgtgta ctgataccat taacggatat atctgtgcct gtgtgccagg attcaccgga
     2581 agcaactgcg agactaacat cgacgagtgt gctagcgacc cctgtctaaa tggaggtatc
     2641 tgtgtggatg gagtcaatgg tttcgtctgc cagtgccctc ccaactactc tggaacttat
     2701 tgtgaaatct cacttgatgc atgcaggagt atgccatgcc agaatggcgc cacgtgcgta
     2761 aacgttggag ccgactacgt ctgcgaatgc gtaccaggat atgctggaca aaactgtgaa
     2821 attgacatca acgagtgtgc tagtcttcca tgccaaaacg gcggtctatg tattgatggt
     2881 attgctggat acacctgtca gtgccgtcta ggatacatcg gtgtcaactg cgaggaagtt
     2941 ggtttctgcg acttggaggg tatgtggtac aacgagtgca atgatcaggt caccatcacc
     3001 aagacctcta caggaatgat gcttggagat tacatgactt acaatgaacg tgccctcgga
     3061 tacgcagccc caaccgtcgt ggtcggttac gccagcaaca actatgactt cccatctttc
     3121 ggtttcacgg tggtccgtga caatggtcag tctactacca gttggaccgg tcagtgccat
     3181 ctatgtgacg gtgaagaggt tctctacacc acctggatca acaccaacat ggtcagcacc
     3241 tgccaggaca tcaagaaatc aaacatggtt ggccaggaca aatggacacg ttatgaacag
     3301 agcatcgcac ctcagcccga tgcataggca atttaactac attaatattg taacatgaat
     3361 ac
//