This page contains a real cDNA and the protein it encodes, but neither has been submitted to GenBank. Can you deduce what this protein does?
This is the challenge investigators face every time they characterize a new protein.
>Sequence 1 ORF:183..3260 Frame +3
MAVACAVAVRPLVQVAVASAVSTAAPASSKPAVKLAASAVSAVALTTVSVSAGLLATTAVEDPRFHAADC
QSRSADASASCEDLQPSTSTCTSAVRDANRPTRRVRRSGSKAQRRGSTTLTASVPSMAAAVVLPPKIALR
RRHRLRLRAGHSATAAATDKTPREQPDKPAALPEDLLPADATSTSSTGKISSAAVCCGLLAHCSAAQLHA
ILCGLVQAVASSSVKGNNRKLLLGSKLRKLLEGVGVAPANGKAYTAADVAALSGPKLERLRATLKSQPGL
LLWFLLFTAPAKLQALQAALLPGGAGDRSFEEWRAAIDAVAGSGHEQLAAAQEVRGRQSACVEGSTAGNT
ATTATITTTNNNPASHGGVYTALTGTEVTGKKPAALPEDLLPADATSTSSTGKISSAAVCCGLLAHCSAA
QLHAILCGLVQAVASSSVKGNNRKLLLGSKLRKLLEGVGVAPANGKAYTAADVAALSGPKLERLRATLKS
QPGLLLWFLLFTAPAKLQALQAALLPGGAGDRSFEEWRAAIDAVAGSGHEQLAAAQEVRGRQSACVEGST
AGNTATTATITTTNNNPASHGGVYTALTGTEVTGKAAANKDLSRTRTTSHRNRCVSESGSTRNKSRSSSS
RSSSTHSVEYAEPKAGCSQPAATVPGCVPEIISAAIPPLAPLALHIRRAIVKELLEARPPGWNTFLYSWL
QAAGLSEFLPANGTCRMYMADRKQLVLRVGAMREEQVDAFLTCMCKAHGHSTWLARYLHMLGPEVSQLLS
QGRYSDELLAALRAAGQKTLADAVMEHFWGRDPDPEDSEAGEMDVKPWAERLGLLRFDMLAEQLRLPPNA
DGSVKNFSNGLVFKVDPLEVWSKYTDGEPSAGALSGMRATDKEARDKQVKQLRGVPLLYLWRIGGRVVYV
GMSGGWVKGRRIARYLAEGPGFSESSKMLPWLTAIDEGKEIELRVITLEGLKALEGMSEGMSEEEVQKKV
QKKVKELEKHFLCHVDCPCNKVNNGSYRVETPRQASWTNSRRSTR
Length: 4305 February 2, 1996
1 CAACAAGCCT GGGACGCAAT ACAACCGAAA CCAGCCCGCT GTTAGTTGAC
51 CCTTTGACAA CGTCTTGCGA ACGGCATCCT GCTTGCTCTC CTATGTATTC
101 GGCGTCACAA CAACAAGTGA AATAGCACAC TTTTACTCAT CGTAATCGCG
151 CCAAGgcgtg cgtgctccag Gctaccgcct acatggctgt tgcctgcgcg
201 gttgccgtcc ggcccctggt ccaagtGgcc gttgcctctg ccgtttccac
251 cgccgcccct gcctcctcga agcctgccgt Caagctcgcc gcctccgccg
301 tgtccgccgt cgcgctcacc accgtctctg tatccgccgg cctgttggca
351 accaccgctg ttgaagatcc acgttttcat gccgcggact gccagagccg
401 ttctgctgat gcgagtgcga gctgtgagga ccttcagccc tccacatcca
451 catgcacatc agCAGtacgt gacgccaaca ggcccacccg ccgtgtcagg
501 cgcagcggct ccaaggccca acgccgagga tccaccacgc tgacagcttc
551 cgTtccatcc atggcggcgg ccgtcgtgct cccacccaag atcgccctgc
601 ggcggcgcca tcggctgcgc cttcgggcgg gacacagcgc cactgctgct
651 gccactgaca agacccctcg ggagcagccg gataagcccg ctgcattgcc
701 ggaggatctg ctcccggccg acgccacctC cacctccagc acgggcaaaa
751 tctcctccgc cgccgtgtgc tgcggcctgc tggcgcaCTG CAGCGCTGCC
801 CAGCTGCACG CCATCCTGTG CGGGCTAGTG CAAGCCGTGG CATCCAGCAG
851 CGTCAAGGGC AACAATCGGA AGCTGCTGCT GGGCTCCAAG CTGCGCAAGC
901 TGCTGGAGGG CGTGGGCGTG GCGCCGGCCA ACGGCAAAGC CTACACCGCC
951 GCCGACGTGG CGGCACTCTC CGGCCCCAAG CTGGAGCGGC TGCGGGCCAC
1001 GCTGAAGTCG CAGCCGGGAC TGCTGCTGTG GTTCCTGCTG TTTACCGCGC
1051 CCGCTAAGCT GCAGGCGCTG CAGgcggcgc tgctgccggg cggcgcgggc
1101 gacaggagct tcgaggagtg gcgcgccgcg attgacgctg tggccggcag
1151 cggccacgag cagctggcgg cggcgcagga aGTGAGGGGC CGCCAGTCGG
1201 CGTGCGTTGA GGGCAGTACG GCCGGCAACA CCGCCACCAC CGCCACCATC
1251 ACCACCACCA ACAACAACCC CGCCAGTCAT GGTGGCGTCT ACACGGCGCT
1301 CACGGGCACC GAGGTTACCG GCAAGaagCc cgctgcattg ccggaggatc
1351 tgctcccggc cgacgccacc tccacctcca gcacgggcaa aatctcctcc
1401 gccgccgtgt gctgcggcct GctggcgcaC TGCAGCGCTG CCCAGCTGCA
1451 CGCCATCCTG TGCGGGCTAG TGCAAGCCGT GGCATCCAGC AGCGTCAAGG
1501 GCAACAATCG GAAGCTGCTG CTGGGCTCCA AGCTGCGCAA GCTGCTGGAG
1551 GGCGTGGGCG TGGCGCCGGC CAACGGCAAA GCCTACACCG CCGCCGACGT
1601 GGCGGCACTC TCCGGCCCCA AGCTGGAGCG GCTGCGGGCC ACGCtgaagt
1651 cgcagccggg actgctgctg tggttcctgc tgtttaccgc gcccgctAag
1701 ctgcaggcgc tgcaggcggc gctgctgccg ggcggcgcgg gcgacaggag
1751 cttcgaggag tggcgcgccg cgatCgacgc tgtggccggc agcggccacg
1801 agcagctggc ggcggcgcag gaagtgaggg gcCgccagtc ggcgtgcgtt
1851 gagggcagTa cggccggcaa caccgccacc accgccacca tcaccaccac
1901 caacaacaac cccgccagtc atggtggcgt ctacacggcg ctcacgggca
1951 ccgaggttac cggcaaggcg gcggccaaca aggatctgtc ccgcacccgc
2001 accaccagcc acaggaaccg gtGcgtgtcc gagagtggca gcacgcgcaa
2051 caagagcagg agtagcagca gcaggagcag cagtactcac agcgtggagt
2101 acgcggagcc gaaggcgggc tgctcccagc ccgccgccac cgtcccgggc
2151 tgcgtgcctg agatcatcag cgcagctata ccgccgctgg ctccgctagc
2201 tctgcacatc cggcgcgcca tagtgaagga gctgctggag gccaggccgc
2251 cgggctggaa cacctttctg tactcctggc tgcaggcggc ggggctgtcc
2301 gagttcctgc cggccaatgg cacctgccgc atgtacatgg cagacaggaa
2351 gcagctcgtg ctccgcgtgg gcgcgatgcg cgaggagcag gtggacgcgt
2401 tcctgacgtg catgtgcaag gcgcacgggc acagcacgtg gctcgcgcgc
2451 tacctgcaca tgctggggcc ggaggtttcg cagctgctgt cccagggtcg
2501 gtacagcgac gaactgctgg ccgcgctgcg ggccgccggg cagaaaacgc
2551 tggccgatgc aGTCATGGAG CaCTTCTGGG GGcgggaCCC GGATCCGGAG
2601 GACAGTGAGG CCGGCGAGAT GGATGTCAAG CCGTGGGCGG AGCGCCTGGG
2651 ACTGCTGAGG TTTGACATGC TCGCGGAGCA GCTGAGGCTG CCGCCCAACG
2701 CCGACGgcTc cgtgaagaac ttcagcaacg ggctggtctt caaggtcgac
2751 ccgctagagg tgtggagcaa gtacactgac ggagagccgt cagcaggcgc
2801 gctgagtggc atgcgtgcca ccgacaagga ggcccgcgac aagcaggtca
2851 agcagcttcg cggcgtcccg ttgctgtacc tgtggcggat aggcggccgg
2901 gtggtgtacg tgggcatgtc tgggggctgg gtgaagggcc gtcggatagc
2951 acgctatttg gcagagggcc cgggAttcag cgAgtcgtcg aagatgctgc
3001 cgtggctgac agccattgat gagggcaagg aGatcgaGCt ccgggtcatc
3051 acgctggagg gattgaaggc attggagggt atgtcggagg gcatgtccga
3101 ggaggaggtg cagaaaaagg tgcagaagaa ggtgaaggag ctcgagaagc
3151 acttcctgtg ccatgtgGac tgcccgtgca acaaagttaa caacgggtct
3201 taccgggttg agacgccccg ccaggcctcc tggacaaact caaggagaag
3251 tacaagataa aagtacggca acaaagcgca gGccggcaac cccgctgagc
3301 atcagtccaa ggaacagccc gagtagcctg CTGaggtctc gcttcagatg
3351 cgacaatcca attgcggcgg gtcgCcAttg caccacgggc cgttgcggtg
3401 gcgtgagcgg gctgcatgtg cacaagccgg ggctttcagg cAggtcagat
3451 agaatcatgc acccgcggca tgcggtcagc gcacaagcgg agctgcggcc
3501 agtttgcatg caggggcgcT gccgcacagg gccaggcgcg tgtacggttg
3551 ctgcagttgt gtgcttttgc gcagctcatc acgtagatta ggtgcgtgtg
3601 tgcatttgcg ggctgaacgc ggttggaacg atgtgcgggg tagtgcatga
3651 tAgctgtgga cagttgcatc ttacgtttct tttatttAtt tgtacaactg
3701 tgggcctacc tcttaccgag atgagcgaaa cgggatgcAt aggatgtgcg
3751 tggcaagctg atggcgggtg gcgggtgtgc aaccagtatg tgcgggcaag
3801 gagagcgtgc tgcccttcac atggatgttc tcctggtgcc aagttgtgta
3851 tcgtctattg tgggccattg ccgtgtggag gcgcttatcc ctagtaaatg
3901 ttgtctacct gctcagctcc cacattgggt taggactgat atgcaccagc
3951 agcactcatt gatcgaacag catgtagatt catgcccaat aggttgccgg
4001 cgtaaTTGgT TGCAGgCTGG CATgCGTGTG TTGTACCGgT AACTTGTAAg
4051 GCACAGACAg GCTGCATGCA TAGCGGGCAG gTtTAGCATG GCACGAAGAC
4101 GCATCCCGTT TGGTGTAGCC TTTGCTCACT GTGTTGAGGG GGTTAGTCGT
4151 CTGGTCGaAC GTGCCGATCC CGATTGAGAT GAGGCAAGCG CAGGCTTGGG
4201 CAGACGTACC GTATCCGCCT TTAGAGAGAG TCTAATGAAG GCGTGGAAGC
4251 ATGAGCGGCT GCAGTCTGAC CAAGCACTTT CCCATCAGAC AACAAGCaAA
4301 CCCAG
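
The header "ORF:183..3260 Frame +3" ties the protein to the nucleotide listing: translation starts at base 183 and the last base of the open reading frame is 3260. As a minimal sketch of how that annotation can be checked, the Python below strips the position numbers from a listing line, works out the reading frame from the start coordinate, and translates with the standard genetic code. The function names (`clean_sequence`, `reading_frame`, `translate_orf`) are illustrative and not taken from any particular package, and the use of the standard (rather than an organism-specific) codon table is an assumption.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(numbered_text):
    """Drop position numbers and spaces from a numbered listing; uppercase the bases."""
    return "".join(ch for ch in numbered_text if ch.isalpha()).upper()


def reading_frame(start):
    """Forward frame (1, 2 or 3) of an ORF whose first base is at 1-based `start`."""
    return (start - 1) % 3 + 1


def translate_orf(sequence, start, end):
    """Translate 1-based inclusive coordinates start..end, stopping at the first stop codon."""
    orf = sequence[start - 1:end]
    protein = []
    for i in range(0, len(orf) - 2, 3):
        residue = CODON_TABLE.get(orf[i:i + 3], "X")  # X marks codons not in the table
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The listing line covering positions 151-200, copied from this page.
fragment = clean_sequence(
    "151 CCAAGgcgtg cgtgctccag Gctaccgcct acatggctgt tgcctgcgcg"
)
offset = 150  # the fragment begins at position 151 of the full cDNA

print(reading_frame(183))                                   # 3
print(translate_orf(fragment, 183 - offset, 200 - offset))  # MAVACA
print((3260 - 183 + 1) // 3)                                # 1026 codons
```

The fragment translates to MAVACA, the first six residues of the protein shown above. The ORF spans 3078 bases, which is 1026 codons: 1025 amino acids plus one stop codon. Passing the whole cleaned 4305-base listing to `translate_orf` with coordinates 183 and 3260 would translate the full reading frame in the same way.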