This web page was produced as an assignment for an undergraduate course at Davidson College.
Amyloid Precursor Protein Orthologs
Recall that Amyloid Precursor Protein (APP) is transmarine protein. Its normal function in humans has only been speculated upon, but mutations in this protein have been linked to the development of neuronal plaques and predisposal towards Alzheimer's diesease. Humans produce three different isoforms of this protein: A, B, and C. The primary structures for these isoforms appear below with the single letter abbreviation for individual amino acids. For a detailed review of the structure of APP and the differences between the isoforms, please visit "my favorite protein page."
Homo Sapiens (Human)
APP Isoform-A Sequence of Amino Acids (courtesy of Entrez):
mlpglallll aawtaralev ptdgnaglla epqiamfcgr lnmhmnvqng kwdsdpsgtk 60
tcidtkegil qycqevypel qitnvveanq pvtiqnwckr grkqckthph fvipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdnvdsad aeeddsdvww ggadtdyadg sedkvvevae eeevaeveee 240
eadddedded gdeveeeaee pyeeatertt siattttttt esveevvrev cseqaetgpc 300
ramisrwyfd vtegkcapff yggcggnrnn fdteeycmav cgsamsqsll kttqeplard 360
knlpkadkka viqhfqekve sleqeaaner qqlvethmar veamlndrrr lalenyital 480
qavpprprhv fnmlkkyvra eqkdrqhtlk hfehvrmvdp kkaaqirsqv mthlrviyer 540
mnqslsllyn vpavaeeiqd evdellqkeq nysddvlanm iseprisygn dalmpsltet 600
kttvellpvn gefslddlqp whsfgadsvp antenevepv darpaadrgl ttrpgsgltn 660
ikteeisevk mdaefrhdsg yevhhqklvf faedvgsnkg aiiglmvggv viatvivitl 720
vmlkkkqyts ihhgvvevda avtpeerhls kmqqngyenp tykffeqmqn 770APP Isoform-B Sequence of Amino Acids (courtesy of Entrez):
mlpglallll aawtaralev ptdgnaglla epqiamfcgr lnmhmnvqng kwdsdpsgtk 60
tcidtkegil qycqevypel qitnvveanq pvtiqnwckr grkqckthph fvipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdnvdsad aeeddsdvww ggadtdyadg sedkvvevae eeevaeveee 240
eadddedded gdeveeeaee pyeeatertt siattttttt esveevvrev cseqaetgpc 300
ramisrwyfd vtegkcapff yggcggnrnn fdteeycmav cgsaipttaa stpdavdkyl 360
etpgdeneha hfqkakerle akhrermsqv mreweeaerq aknlpkadkk aviqhfqekv 420
esleqeaane rqqlvethma rveamlndrr rlalenyita lqavpprprh vfnmlkkyvr 480
aeqkdrqhtl khfehvrmvd pkkaaqirsq vmthlrviye rmnqslslly nvpavaeeiq 540
devdellqke qnysddvlan miseprisyg ndalmpslte tkttvellpv ngefslddlq 600
pwhsfgadsv pantenevep vdarpaadrg lttrpgsglt nikteeisev kmdaefrhds 660
gyevhhqklv ffaedvgsnk gaiiglmvgg vviatvivit lvmlkkkqyt sihhgvvevd 720
aavtpeerhl skmqqngyen ptykffeqmq n 751APP Isoform-C Sequence of Amino Acids (courtesy of Entrez):
mlpglallll aawtaralev ptdgnaglla epqiamfcgr lnmhmnvqng kwdsdpsgtk 60
tcidtkegil qycqevypel qitnvveanq pvtiqnwckr grkqckthph fvipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdnvdsad aeeddsdvww ggadtdyadg sedkvvevae eeevaeveee 240
eadddedded gdeveeeaee pyeeatertt siattttttt esveevvrvp ttaastpdav 300
dkyletpgde nehahfqkak erleakhrer msqvmrewee aerqaknlpk adkkaviqhf 360
qekvesleqe aanerqqlve thmarveaml ndrrrlalen yitalqavpp rprhvfnmlk 420
kyvraeqkdr qhtlkhfehv rmvdpkkaaq irsqvmthlr viyermnqsl sllynvpava 480
eeiqdevdel lqkeqnysdd vlanmisepr isygndalmp sltetkttve llpvngefsl 540
ddlqpwhsfg adsvpanten evepvdarpa adrglttrpg sgltniktee isevkmdaef 600
rhdsgyevhh qklvffaedv gsnkgaiigl mvggvviatv ivitlvmlkk kqytsihhgv 660
vevdaavtpe erhlskmqqn gyenptykff eqmqn 695
Orthologs are genes that have similar functions in species that share a common ancestor/evolution. The nucleotide sequences or primary structures of proteins made by these related genes can be compared to identify key parts. The primary structures of nine orthologs of APP appear below. Only multicellular eukaryotes orthologs could be found for APP. Select orthologs have more detailed information regarding their homology to Homo sapiens APP.
Ortholog 1: Sus scrofa (Pig)
APP Amino Acid Sequence (courtesy of Entrez):
mlpglalvll aawtaralev ptdgnaglla epqvamfcgk lnmhmnvqng kwesdpsgtk 60
tcigtkegil qycqevypel qitnvveanq pvtiqnwckr srkqckthth ivipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdnidsad aeeddsdvww ggadtdyadg sedkvvevae eeevadveee 240
eaeddedded gdeveeeaee pyeeatertt siattttttt esveevvrev cseqaetgpc 300
ramisrwyfd vtegkcapff yggcggnrnn fdteeycmav cgsvmsqsll kttqehlpqd 360
pvklpttaas tpdavdkyle tpgdenehah fqkakerlea khrermsqvm reweeaerqa 420
knlpkadkka viqhfqekve sleqeaaner qqlvethmar veamlndrrr lalenyital 480
qavpprprhv fnmlkkyvra eqkdrqhtlk hfehvrmvdp kkaaqirsqv mthlrviyer 540
mnqslsllyn vpavaeeiqd evdellqkeq nysddvlanm iseprisygn dalmpsltet 600
kttvellpvn gefslddlqp whpfgvdsvp antenevepv darpaadrgl ttrpgsgltn 660
ikteeisevk mdaefrhdsg yevhhqklvf faedvgsnkg aiiglmvggv viatvivitl 720
vmlkkkqyts ihhgvvevda avtpeerhls kmqqngyenp tykffeqmqn 770The amino acids in Sus scrofa that are different from the corresponding amino acid in Homo sapiens 770-APP (Isoform-A) appear in bold. Indeed, the primary structures of the two proteins appear very similar. This suggests that APP underwent little change/evolution from pig to human. See the ß-Amyloid domain section below for comparison between this domain in both pig and human APP.
Ortholog 2: Gallus gallus (Chicken)
APP Amino Acid Sequence (courtesy of Entrez):
mlphlallll aagaaralev padgnaglla epqiamfcgk lnmhmnvqng kwesdpsgtk 60
tcidtkegil qycqevypel qitnvveanq pvtiqnwckr gwkqcnghph ivvpyrclvg 120
efvsdallvp dkckllhqer mdvcethlhw htvakescse ksmnlhdygm llpcgidkfr 180
gvefvccpla eesdnldsad aedddsdvww ggadadyadg sddkvveeqP Eedeeltvve 240
dedadddddd -dgdeieetee eyeeatertt siattttttt esveevvrev cseqaetgpc 300
ramisrwyfd vaegkcapff yggcggnrnn fdseeycmav cgs\vlpttaa stpdavdkyl 360
etpgdeneha hfqkakerle akhrermsqv mreweeaerq a\knlpkadkk aviqhfqekv 420
esleqeaane rqqlvethma rveamlndrr rialenyita lqtvpprprh vfnmlkkyvr 480
aeqkdrqhtl khfehvrmvd pkkaaqirsq vmthlrviye rmnqslsfly nvpavaeeiq 540
devdellqke qnysddvlan miseprisyg ndalmpslte tkttvellpv dgefslddlq 600
pwhpfgvdsv pantenevep vdarpaadrg lttrpgsglt nvkteevsev kmdaefrhds 660
gyevhhqklv ffaedvgsnk gaiiglmvgg vviatvivit lvmlkkkqyt sihhgvvevd 720
aavtpeerhl skmqqngyen ptykffeqmq n 751Above, the amino acid sequence of Gallus gallus APP is compared to the amino acid sequence of Homo sapiens 770-APP (Isoform- A). Again, the primary structures of these two proteins are very similar. The bold represent amino acids that are amino acids present in Gallus gallus APP that are different from the corresponding amino acids in Homo sapiens APP. Capitalized, bolded letters represent amino acids in the Gallus gallus APP removed from the corresponding Homo sapiens APP sequence. The dash represents amino acids missing in the Gallus gallus sequence when compared to the corresponding Homo sapiens sequence. The sequence between the slashes could not be related to the corresponding Homo sapiens sequence. Notice that there are more differences between chicken and human APP than between pig and human APP. APP underwent several changes as birds and mammals diverged.
Ortholog 3: Canis familiaris (Dog)
APP Isoform-695 Amino Acid Sequence (courtesy of Entrez):
mlpalalvll aswtaralev ptdgnaglla epqvamlcgk lrmhmnvqng kwesdplgtk 60
tcigskedil qycqevypel qitnvveanq pvtiqnwckk grkqckthah ivipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdnidsad aeeddsdvww ggadtdyadg sedkvvevae eeevadveee 240
eaeddedded gdeveeeaee pyeeatertt siattttttt esveevvrvp ttaastpdav 300
dkyletpgde nehahfqkak erleakhrer msqvmrewee aerqaknlpk adkkaviqhf 360
qekvesleqe aanerqqlve thmarveaml ndrrrlalen yitalqavpp rprhvfnmlk 420
kyvraeqkdr qhtlkhfehv rmvdpkkaaq irsqvmthlr viyermnqsl sllynvpava 480
eeiqdevdel lqkeqnysdd ilanmisepr isygndalmp sltetkttve llpvngefsl 540
ddlqpwhpfg vdsvpanten evepvdarpa adrglttrpg sgltniktee isevkmdaef 600
rhdsgyevhh qklvffaedv gsnkgaiigl mvggvviatv ivitlvmlkk kqytsihhgv 660
vevdaavtpe erhlskmqqn gyenptykff eqmqn 695APP Isoform-751 Amino Acid Sequence (courtesy of Entrez):
mlpalalvll aswtaralev ptdgnaglla epqvamlcgk lrmhmnvqng kwesdplgtk 60
tcigskedil qycqevypel qitnvveanq pvtiqnwckk grkqckthah ivipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdnidsad aeeddsdvww ggadtdyadg sedkvvevae eeevadveee 240
eaeddedded gdeveeeaee pyeeatertt siattttttt esveevvrev cseqaetgpc 300
ramisrwyfd vtegkcapff yggcggnrnn fdteeycmav cgsvipttaa stpdavdkyl 360
etpgdeneha hfqkakerle akhrermsqv mreweeaerq aknlpkadkk aviqhfqekv 420
esleqeaane rqqlvethma rveamlndrr rlalenyita lqavpprprh vfnmlkkyvr 480
aeqkdrqhtl khfehvrmvd pkkaaqirsq vmthlrviye rmnqslslly nvpavaeeiq 540
devdellqke qnysddilan miseprisyg ndalmpslte tkttvellpv ngefslddlq 600
pwhpfgvdsv pantenevep vdarpaadrg lttrpgsglt nikteeisev kmdaefrhds 660
gyevhhqklv ffaedvgsnk gaiiglmvgg vviatvivit lvmlkkkqyt sihhgvvevd 720
aavtpeerhl skmqqngyen ptykffeqmq n 751APP Isoform-770 Amino Acid Sequence (courtesy of Entrez):
mlpalalvll aswtaralev ptdgnaglla epqvamlcgk lrmhmnvqng kwesdplgtk 60
tcigskedil qycqevypel qitnvveanq pvtiqnwckk grkqckthah ivipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdnidsad aeeddsdvww ggadtdyadg sedkvvevae eeevadveee 240
eaeddedded gdeveeeaee pyeeatertt siattttttt esveevvrev cseqaetgpc 300
ramisrwyfd vtegkcapff yggcggnrnn fdteeycmav cgsvmsqsll kttqeplpqd 360
avklpttaas tpdavdkyle tpgdenehah fqkakerlea khrermsqvm reweeaerqa 420
knlpkadkka viqhfqekve sleqeaaner qqlvethmar veamlndrrr lalenyital 480
qavpprprhv fnmlkkyvra eqkdrqhtlk hfehvrmvdp kkaaqirsqv mthlrviyer 540
mnqslsllyn vpavaeeiqd evdellqkeq nysddilanm iseprisygn dalmpsltet 600
kttvellpvn gefslddlqp whpfgvdsvp antenevepv darpaadrgl ttrpgsgltn 660
ikteeisevk mdaefrhdsg yevhhqklvf faedvgsnkg aiiglmvggv viatvivitl 720
vmlkkkqyts ihhgvvevda avtpeerhls kmqqngyenp tykffeqmqn 770
Comparison between Homo sapiens 770-APP (Hs/1-770) and Canis familiaris 770-APP (Cf/1-770) (courtesy of Ensembl):
(N.B. Asterisks represent corresponding conserved amino acids between these two species (identity match), blank spaces represent very different corresponding amino acids (no match), periods and colons represent corresponding similar amino acids (partial match))
Hs/1-770 MLPGLALLLLAAWTARALEVPTDGNAGLLAEPQIAMFCGRLNMHMNVQNGKWDSDPSGTK
Cf/1-770 MLPALALVLLASWTARALEVPTDGNAGLLAEPQVAMLCGKLRMHMNVQNGKWESDPLGTK
Compare: ***.***:***:*********************:**:**:*.**********:*** ***
Hs/1-770 TCIDTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGRKQCKTHPHFVIPYRCLVG
Cf/1-770 TCIGSKEDILQYCQEVYPELQITNVVEANQPVTIQNWCKKGRKQCKTHAHIVIPYRCLVG
Compare: ***.:**.*******************************:********.*:*********
Hs/1-770 EFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSEKSTNLHDYGMLLPCGIDKFR
Cf/1-770 EFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSEKSTNLHDYGMLLPCGIDKFR
Compare: ************************************************************
Hs/1-770 GVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYADGSEDKVVEVAEEEEVAEVEEE
Cf/1-770 GVEFVCCPLAEESDNIDSADAEEDDSDVWWGGADTDYADGSEDKVVEVAEEEEVADVEEE
Compare: ***************:***************************************:****
Hs/1-770 EADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTTTTTTESVEEVVREVCSEQAETGPC
Cf/1-770 EAEDDEDDEDGDEVEEEAEEPYEEATERTTSIATTTTTTTESVEEVVREVCSEQAETGPC
Compare: **:*********************************************************
Hs/1-770 RAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSAMSQSLLKTTQEPLARD
Cf/1-770 RAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSVMSQSLLKTTQEPLPQD
Compare: *******************************************.*************.:*
Hs/1-770 PVKLPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQVMREWEEAERQA
Cf/1-770 AVKLPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQVMREWEEAERQA
Compare: .***********************************************************
Hs/1-770 KNLPKADKKAVIQHFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRLALENYITAL
Cf/1-770 KNLPKADKKAVIQHFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRLALENYITAL
Compare: ************************************************************
Hs/1-770 QAVPPRPRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQVMTHLRVIYER
Cf/1-770 QAVPPRPRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQVMTHLRVIYER
Compare: ************************************************************
Hs/1-770 MNQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISEPRISYGNDALMPSLTET
Cf/1-770 MNQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISEPRISYGNDALMPSLTET
Compare: ************************************************************
Hs/1-770 KTTVELLPVNGEFSLDDLQPWHSFGADSVPANTENEVEPVDARPAADRGLTTRPGSGLTN
Cf/1-770 KTTVELLPVNGEFSLDDLQPWHPFGVDSVPANTENEVEPVDARPAADRGLTTRPGSGLTN
Compare: **********************.**.**********************************
Hs/1-770 IKTEEISEVKMDAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIATVIVITL
Cs/1-770 IKTEEISEVKMDAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIATVIVITL
Compare: ************************************************************
Hs/1-770 VMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN
Cf/1-770 VMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN
Compare: **************************************************These amino acid sequences for Canis familiaris and Homo sapiens 770-APP are very similar. There is only pair of one amino acids that does not match at all between the species. See the ß-Amyloid domain section below for comparison between this domain in both dog and human APP.
Ortholog 4: Mus musculus (Mouse)
APP Amino Acid Sequence (courtesy of Entrez):
mlpslallll aawtvralev ptdgnaglla epqiamfcgk lnmhmnvqng kwesdpsgtk 60
tcigtkegil qycqevypel qitnvveanq pvtiqnwckr grkqckthth ivipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdsvdsad aeeddsdvww ggadtdyadg gedkvvevae eeevadveee 240
eadddedved gdeveeeaee pyeeatertt stattttttt esveevvrvp ttaastpdav 300
dkyletpgde nehahfqkak erleakhrer msqvmrewee aerqaknlpk adkkaviqhf 360
qekvesleqe aanerqqlve thmarveaml ndrrrlalen yitalqavpp rphhvfnmlk 420
kyvraeqkdr qhtlkhfehv rmvdpkkaaq irsqvmthlr viyermnqsl sllynvpava 480
eeiqdevdel lqkeqnysdd vlanmisepr isygndalmp sltetkttve llpvngefsl 540
ddlqpwhpfg vdsvpanten evepvdarpa adrglttrpg sgltniktee isevkmdaef 600
ghdsgfevrh qklvffaedv gsnkgaiigl mvggvviatv ivitlvmlkk kqytsihhgv 660
vevdaavtpe erhlskmqqn gyenptykff eqmqn 695
Compare this amino acid sequence to the Homo sapiens 695-APP sequence above, as illustrated in the previous three orthologs. Again, the two sequences are very similar. How are they different? See the ß-Amyloid domain section below for comparison between this domain in both mouse and human APP.
Ortholog 5: Rattus norvegicus (Rat)
APP Amino Acid Sequence (courtesy of Entrez):
mlpslallll aawtvralev ptdgnaglla epqiamfcgk lnmhmnvqng kwesdpsgtk 60
tcigtkegil qycqevypel qitnvveanq pvtiqnwckr grkqckthth ivipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdsidsad aeeddsdvww ggadtdyadg gedkvvevae eeevadveee 240
eaeddedved gdeveeeaee pyeeatertt siattttttt esveevvrev cseqaetgpc 300
ramisrwyfd vtegkcapff yggcggnrnn fdteeycmav cgsvssqsll kttseplpqd 360
pvklpttaas tpdavdkyle tpgdenehah fqkakerlea khrermsqvm reweeaerqa 420
knlpkadkka viqhfqekve sleqeaaner qqlvethmar veamlndrrr lalenyital 480
qavpprphhv fnmlkkyvra eqkdrqhtlk hfehvrmvdp kkaaqirsqv mthlrviyer 540
mnqslsllyn vpavaeeiqd evdellqkeq nysddvlanm iseprisygn dalmpsltet 600
kttvellpvn gefslddlqp whpfgvdsvp antenevepv darpaadrgl ttrpgsgltn 660
ikteeisevk mdaefghdsg fevrhqklvf faedvgsnkg aiiglmvggv viatvivitl 720
vmlkkkqyts ihhgvvevda avtpeerhls kmqqngyenp tykffeqmqn 770
Compare this amino acid sequence to the Homo sapiens 770-APP sequence above. Also, compare the rat sequence to the mouse sequence above. Again, the two are very similar. How are they different? See the ß-Amyloid domain section below for comparison between this domain in both rat, mouse, and human APP.
Ortholog 6: Macaca fascicularis (Crab-eating Macaque)
APP Amino Acid Sequence (courtesy of Entrez):
mlpglallll aawtaralev ptdgnaglla epqiamfcgr lnmhmnvqng kwdsdpsgtk 60
tcidtkegil qycqevypel qitnvveanq pvtiqnwckr grkqckthph fvipyrclvg 120
efvsdallvp dkckflhqer mdvcethlhw htvaketcse kstnlhdygm llpcgidkfr 180
gvefvccpla eesdnvdsad aeeddsdvww ggadtdyadg sedkvvevae eeevaeveee 240
eadddedded gdeveeeaee pyeeatertt siattttttt esveevvrev cseqaetgpc 300
ramisrwyfd vtegkcapff yggcggnrnn fdteeycmav cgsvipttaa stpdavdkyl 360
etpgdeneha hfqkakerle akhrermsqv mreweeaerq aknlpkadkk aviqhfqekv 420
esleqeaane rqqlvethma rveatlndrr rlalenyita lqavpprprh vfnmlkkyvr 480
aeqkdrqhtl khfehvrmvd pkkaaqirsq vmthlrviye rmnqslslly nvpavaeeiq 540
devdellqke qnysddvlan miseprisyg ndalmpslte tkttvellpv ngefslddlq 600
pwhsfgadsv pantenevep vdarpaadrg lttrpgsglt nikteeisev kmdaefrhds 660
gyevhhqklv ffaedvgsnk gaiiglmvgg vviatv 696Compare this amino acid sequence to the Homo sapiens 695-APP sequence above. How are they different?
Ortholog 7: Danio rerio (Zebra fish)
APP Isoform-A Amino Acid Sequence (courtesy of Entrez) :
mrsrelfill mavastlave vpsdsgtgll aepqiamfcg klnmhiniqs gkwepdpsgs 60
kscignkegi lqycqevype lqitnvvean qpvsiwdwck ksrkqcrshm hivvpyrclv 120
gefvsdallv pdkckflhqe rmdmceshlh whtvakescg drsmnlhdyg mllpcgidrf 180
rgvefvccpa dagkesesaa veeddsdvww ggaeadyten smtrdaaaep avleddedad 240
eeededqdgd gdrdekieee eeeeertqst saaltstttt ttesveevvr evcfasaetg 300
pcramlsrwy yvreerrcap fiyggcggnr nnfeseeycl svcsgvlptp sssppdavdr 360
yletpadene hahflqakes letkhrerms qvmreweeae rqakslprnd kkaviqhfqe 420
kvealeqesa serqqlveth marveallnd rrrlalesyl salqadpprp rhvfsllkky 480
vraeqkdrqh tlkhfehvrm vdpkkaaqir pqvlthlrvi eermnqslgl lykvpgvadd 540
iqdqvellqr eqqemsaqla nlqsdarvsy gndalmpdst aglellpaed tqgfgfihpe 600
sfnqpnthnq vepvdarpvp dldlatrpvs glkpddipel rmeaeerhse vyhqklvffa 660
edvssnkgai iglmvggvvi atiivitlvm lrkkqytsih hgiievdaav tpeerhlskm 720
qqngyenpty kffeqmhn 738APP Isoform-B Amino Acid Sequence (courtesy of Entrez) :
mgidrtvfll lmlttlslai evpsddsvgl laepqvamfc gklnmhinvq sgkwepdptg 60
tkscistkeg ilkycqevyp dlqitnvvea nqpvsiqnwc kmgrrqcrsh thivvpyrcl 120
vgefvsdall vpdkckflhq ermdmceshl hwhtvakesc gdrsmnlhdy gmllpcgidr 180
frgvefvccp meeqkdldse eqeeansdvw wggaeteytd asvlkeqvta kpdpavtedd 240
edlnneeeev wdndedgdge ddedeeddde diideqdtse qtsniamttt tttttesiee 300
vvrvptmaps padavdryle apgdmnehmr fqkakeslea khrekmsevm reweeaerqa 361
knlpradkkt iiqrfqekve slekeaager qqlvethmar veallndrrr qalesylssl 420
qsdqprprqv lnllkkyira eqkdrqhtlk hfehvrevdp kkasqirpfv mthlrvieer 480
mnqslgylyk vpqvandiqd qvavlvqrdq aevtqqlssl qskmrvsygn dalmpdlpds 540
ttpldnlppe qdglgfihpe sfnqantdnh vepvdarpip erglptrpei pkvrldieer 600
hnagydvrdk rlmflaedmg snkgaiiglm vggvviatvi vitlvmlrkk qytsihhgvi 660
evdaavtpee rhlakmqqng yenptykffe qmqn 694It is important to note that the APP sequences for this fish species are correspond less frequently to Homo sapiens APP isoforms than the other vertebrate orthologs. This suggests that the gene experienced more changes as evolution occured from fish to birds and mammals.
Ortholog 8: Drosophila melanogaster (Fruit Fly)
Fruit flys produce Amyloid Precursor-like Proteins coded by their genome.
APP-like Amino Acid Sequence (courtesy of Ensembl):
MCAALRRNLL LRSLWVVLAI GTAQVQAASP RWEPQIAVLC EAGQIYQPQY LSEEGRWVTD 60
LSKKTTGPTC LRDKMDLLDY CKKAYPNRDI TNIVESSHYQ KIGGWCRQGA LNAAKCKGSH 120
RWIKPFRCLG PFQSDALLVP EGCLFDHIHN ASRCWPFVRW NQTGAAACQE RGMQMRSFAM 180
LLPCGISVFS GVEFVCCPKH FKTDEIHVKK TDLPVMPAAQ INSANDELVM NDEDDSNDSN 240
YSKDANEDDL DDEDDLMGDD EEDDMVADEA ATAGGSPNTG SSGDSNSGSL DDINAEYDSG 300
EEGDNYEEDG AGSESEAEVE ASWDQSGGAK VVSLKSDSSS PSSAPVAPAP EKAPVKSESV 360
TSTPQLSASA AAFVAANSGN SGTGAGAPPS TAQPTSDPYF THFDPHYEHQ SYKVSQKRLE 420
ESHREKVTRV MKDWSDLEEK YQDMRLADPK AAQSFKQRMT ARFQTSVQAL EEEGNAEKHQ 480
LAAMHQQRVL AHINQRKREA MTCYTQALTE QPPNAHHVEK CLQKLLRALH KDRAHALAHY 540
RHLLNSGGPG GLEAAASERP RTLERLIDID RAVNQSMTML KRYPELSAKI AQLMNDYILA 600
LRSKDDIPGS SLGMSEEAEA GILDKYRVEI ERKVAEKERL RLAEKQRKEQ RAAEREKLRE 660
EKLRLEAKKV DDMLKSQVAE QQSQPTQSST QSQAQQQQQE KSLPGKELGP DAALVTAANP 720
NLETTKSEKD LSDTEYGEAT VSSTKVQTVL PTVDDDAVQR AVEDVAAAVA HQEAEPQVQH 780
FMTHDLGHRE SSFSLRREFA QHAHAAKEGR NVYFTLSFAG IALMAAVFVG VAVAKWRTSR 840
SPHAQGFIEV DQNVTTHHPI VREEKIVPNM QINGYENPTY KYFEVKE 887
Comparison between Homo sapiens 695-APP and Drosophila melangaster 886-APP (Rosen DR, et al., 1988):
The amino acid sequences of APP in these two species markedly similar, even though Homo sapiens is classified as vertebrate and Drosophila melangaster is classified as invertebrate (their common ancestor appears further back in history). The amino acid comparision is illustrated in Figure 1 below.
Figure 1: APP Amino acid alignment between two different species, Homo sapiens (Hu) and Drosophila melangaster (Dr). The alignments appear in groups of three lines and are divided by regions of the protein (extracellular 1, extracellular 2, and cytoplasmic). Drosophila APP primary structure appears on the top line, followed by a comparison table in between, and the Homo sapiens APP primary structure on the bottom. Numbers represent order of amino acids. The vertical lines in the comparison line and the capital letters in the amino acid sequences represent identical matches in the two species. Dots in the comparison line represent similar corresponding amino acids (partial match). Triangles represent conserved cytosine amino acids in both species. An especially long stretch of homolgy is underlined. Image courtesy of (Rosen DR, et al., 1989), permission granted by Dr. Kalpana White: 3/9/05.
An alternative representation of the comparison between these two proteins is depicted below in Figure 2.
Figure 2. Cartoon of Drosophila and Human APP homology. The numbers represent order of amino acids. The E1, E2, and C represent extracellular region 1, extracellular region 2, and cytoplasmic region, respectively. The loops in the Drosophila model represent additional amino acids in the sequence. The black box and the arrow in the human model indicate the location of the ß-Amyloid domain present only in the human amino acid sequence. The dots represent glycosylation sites. Image courtesy of (Rosen DR, et al., 1989), permission granted by Dr. Kalpana White: 3/9/05.
Indeed, the amnio acid sequences and overall structure of human and fruit fly APP contain several conserved regions even though they are more distantly related. In particular, see the N-terminal domain and E2 domain sections below for implications of particular regions of homology. Important differences appear as there is an additional glycosylation site in human APP and a ß-Amyloid domain of human APP. The amino acid sequence of Drosophila APP is also much longer than human APP.
Ortholog 9: Caenorhabditis elegans (Nematode)
APP-like Isoform-A1 Amino Acid Sequence (courtesy of Ensembl):
MTVGKLMIGL LIPILVATVY AEGSPAGSKR HEKFIPMVAF SCGYRNQYMT EEGSWKTDDE 60
RYATCFSGKL DILKYCRKAY PSMNITNIVE YSHEVSISDW CREEGSPCKW THSVRPYHCI 120
DGEFHSEALQ VPHDCQFSHV NSRDQCNDYQ HWKDEAGKQC KTKKSKGNKD MIVRSFAVLE 180
PCALDMFTGV EFVCCPNDQT NKTDVQKTKE DEDDDDDEDD AYEDDYSEES DEKDEEEPSS 240
QDPYFKIANW TNEHDDFKKA EMRMDEKHRK KVDKVMKEWG DLETRYNEQK AKDPKGAEKF 300
KSQMNARFQK TVSSLEEEHK RMRKEIEAVH EERVQAMLNE KKRDATHDYR QALATHVNKP 360
NKHSVLQSLK AYIRAEEKDR MHTLNRYRHL LKADSKEAAA YKPTVIHRLR YIDLRINGTL 420
AMLRDFPDLE KYVRPIAVTY WKDYRDEVSP DISVEDSELT PIIHDDEFSK NAKLDVKAPT 480
TTAKPVKETD NAKVLPTEAS DSEEEADEYY EDEDDEQVKK TPDMKKKVKV VDIKPKEIKV 540
TIEEEKKAPK LVETSVQTDD EDDDEDSSSS TSSESDEDED KNIKELRVDI EPIIDEPASF 600
YRHDKLIQSP EVERSASSVF QPYVLASAMF ITAICIIAFA ITNARRRRAM RGFIEVDVYT 660
PEERHVAGMQ VNGYENPTYS FFDSKA 686APP-like Isoform-B1 Amino Acid Sequence (courtesty of Ensembl):
MTVGKLMIGL LIPILVATVY AEGSPAGSKR HEKFIPMVAF SCGYRNQYMT EEGSWKTDDE 60
RYATCFSGKL DILKYCRKAY PSMNITNIVE YSHEVSISDW CREEGSPCKW THSVRPYHCI 120
DGEFHSEALQ VPHDCQFSHV NSRDQCNDYQ HWKDEAGKQC KTKKSKGNKD MIVRSFAVLE 180
PCALDMFTGV EFVCCPNDQT NKTDVQKTKE DEDDDDDEDD AYEDDYSEES DEKDEEEPSS 240
QDPYFKIANW TNEHDDFKKA EMRMDEKHRK KVDKVMKEWG DLETRYNEQK AKDPKGAEKF 300
KSQMNARFQK TVSSLEEEHK RMRKEIEAVH EERVQAMLNE KKRDATHDYR QALATHVNKP 360
NKHSVLQSLK AYIRAEEKDR MHTLNRYRHL LKADSKEAAA YKPTVIHRLR YIDLRINGTL 420
AMLRDFPDLE KYVRPIAVTY WKDYRDEVSP DISVEDSELT PIIHDDEFSK NAKLDVKAPT 480
TTAKPVKETD NAKVLPTEAS DSEEEADEYY EDEDDEQVKK TPDMKKKVKV VDIKPKEVTI 540
EEEKKAPKLV ETSVQTDDED DDEDSSSSTS SESDEDEDKN IKELRVDIEP IIDEPASFYR 600
HDKLIQSPEV ERSASSVFQP YVLASAMFIT AICIIAFAIT NARRRRAMRG FIEVDVYTPE 660
ERHVAGMQVN GYENPTYSFF DSKA 684Compare the amino acid sequences of the C. elegans and Homo sapiens APP. Like the fruit fly, the amnio acid sequences of human and C. elegans APP contain several conserved regions even though they are more distantly related. In particular, see the N-terminal domain and E2 domain sections below for implications of particular regions of homology. How are the two sequences different?
The alignment of specific domains of APP of various species can be compared to determine amino acids that are especially important for protein function or amino acids critical in dysfunction in humans.
N-Terminal Domain
Scientists have found that the N-terminal domain of human, electric ray, C. elegans, and Drosophila APPs are between 36% and 84% conserved across species. In particular, all of the cytoseine residues and the hydrophobic core are fully conserved among species. These regions of this domain play critical roles in the folding of the protein and their conservation suggests that the overall shape of the APP N-terminal domain was maintained throughout evolution (Rossjohn J, et. al., 1999). See Figure 3 below for a visual representaion of the alignment.
Figure 3. Alignment of amino acid sequences for N-Terminal, growth factor like domain of APPs from various species. APP 1 represents Homo sapiens APP, APP 2 represents electric ray APP, APP 3 represents C. elegans APP, APP 4 represents Drosophila APP. APLP 1, 2 represent human APP-like proteins. The numbers above the column letters represent order of amino acids. The numbered ßs, alphas, and arrows above the amino acid code indicate the location of secondary structure of the protein. The small dots below the amino acid code represent hydrophobic amino acids, while the +s indicate conserved positively charged amino acids. The large dot below the amino acids represent conserved cytosine residues. The shaded boxes represent amino acids conserved throughout all species/proteins. Image courtesy of (Rossjohn J, et al., 1999), permission granted by Dr. Michael Parker: 2/24/05.
E2 Domain
Scientists have found that amino acids critical to the folding and dimerization of the E2 domain of APP are highly conserved across species. The overall conservation of the domain is 33%, while the conservation of the amino acids involved in the helical packing is 64%. Thus, the folding and dimerization ability of this domain likely critical to APP function (Wa Y, Ha Y, 2004). See Figure 4 below for a visual representation of the alignment.
Figure 4. Alignment of amino acid sequences of E2 domain of APPs from various species. HsAPP represents Homo sapiens APP, while HsAPLP1 and 2 represent Homo sapiens AP-like proteins. DmAPPL and CeAPL represent Drosophila and C. elegans APP. The numbers above the single letter amino acid code represent number of amino acids. The asterisks represents amino acids connecting dimer interfaces, while the +s indicate amino acids involved in helical coiling of the domain. The alpha A-F indicate the positions of the six alpha helices of the secondary structure. The shaded boxes represent amino acid conserved across all species. Image courtesy of (Wang Y, Ha Y, 2004), permission granted by Dr. Ya Ha: 2/16/05.
ß-Amyloid Domain
Scientists have found that the 42 amino acid ß-Amyloid domains of Human, Dog, Pig, Polar Bear, Rabbit, Sheep, Cow, and Guinea Pig APPs are identical. Their corresponding nucleotide sequences only have silent differences. All of these species exhibit the ß-Amyloid neuronal plaques characteristic of Alzheimer's Disease in old age. However, the corresponding amino acid sequences of ß-Amyloid domains of rat and mouse species contain three amino acid changes. The alignment of the divergent amino acid sequences appears below adapted from an in image in (Johnstone EM, et al., 1991). The differing amino acids appear in bold.
DAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIA Human,Dog, Pig etc. ß-Amyloid Domain
DAEFGHDSGFEVRHQKLVFFAEDVGSNKGAIIGLMVGGVVIA Rat,Mouse ß-Amyloid Domain
It is interesting to note that normal mice and rats only rarely exhibit ß-Amyloid neuronal plaques characteristic of Alzheimer's disease. These findings indicate the these three differing amino acids may play an important role in the development or lack of development of degenerative neuronal plaques (Johnstone EM, et al., 1991).
For more information about APP orthologs and their significance: Visit the APP Genecard or the Ensembl Gene Report.References:
Johnstone EM, et al. 1991. Conservation of the Alzheimer's disease amyloid peptide in dog, polar bear and five other mammals by cross-species polymerase chain reaction analysis. Molecular Brain Research 10: 299-305.
Rosen DR, et al. 1989 Apr. A Drosophila gene encoding a protein resembling the human ß-amyloid protein precursor. Proc Natl Acad Sci USA 86: 2478-2482.
Rossjohn J, et al. 1999 Apr. Crystal structure of the N-terminal, growth factor-like domain of Alzheimer amyloid precursor protein. Nature Structural Biology 6 (4): 327-331.
Wang Y, Ha Y. 2004 Aug 13. The X-Ray Structure of an Antiparallel Dimer of the Human Amyloid Precursor Protein E2 Domain. Molecular Cell 15: 343-353