Unknown Protein Sequence

This page contains a real cDNA and protein but they have not been submitted to Genbank. Can you deduce what this protein does?

This is the challenge investigators face every time they characterize a new portein.

>Sequence 1 ORF:183..3260 Frame +3

MAVACAVAVRPLVQVAVASAVSTAAPASSKPAVKLAASAVSAVALTTVSVSAGLLATTAVEDPRFHAADC

QSRSADASASCEDLQPSTSTCTSAVRDANRPTRRVRRSGSKAQRRGSTTLTASVPSMAAAVVLPPKIALR

RRHRLRLRAGHSATAAATDKTPREQPDKPAALPEDLLPADATSTSSTGKISSAAVCCGLLAHCSAAQLHA

ILCGLVQAVASSSVKGNNRKLLLGSKLRKLLEGVGVAPANGKAYTAADVAALSGPKLERLRATLKSQPGL

LLWFLLFTAPAKLQALQAALLPGGAGDRSFEEWRAAIDAVAGSGHEQLAAAQEVRGRQSACVEGSTAGNT

ATTATITTTNNNPASHGGVYTALTGTEVTGKKPAALPEDLLPADATSTSSTGKISSAAVCCGLLAHCSAA

QLHAILCGLVQAVASSSVKGNNRKLLLGSKLRKLLEGVGVAPANGKAYTAADVAALSGPKLERLRATLKS

QPGLLLWFLLFTAPAKLQALQAALLPGGAGDRSFEEWRAAIDAVAGSGHEQLAAAQEVRGRQSACVEGST

AGNTATTATITTTNNNPASHGGVYTALTGTEVTGKAAANKDLSRTRTTSHRNRCVSESGSTRNKSRSSSS

RSSSTHSVEYAEPKAGCSQPAATVPGCVPEIISAAIPPLAPLALHIRRAIVKELLEARPPGWNTFLYSWL

QAAGLSEFLPANGTCRMYMADRKQLVLRVGAMREEQVDAFLTCMCKAHGHSTWLARYLHMLGPEVSQLLS

QGRYSDELLAALRAAGQKTLADAVMEHFWGRDPDPEDSEAGEMDVKPWAERLGLLRFDMLAEQLRLPPNA

DGSVKNFSNGLVFKVDPLEVWSKYTDGEPSAGALSGMRATDKEARDKQVKQLRGVPLLYLWRIGGRVVYV

GMSGGWVKGRRIARYLAEGPGFSESSKMLPWLTAIDEGKEIELRVITLEGLKALEGMSEGMSEEEVQKKV

QKKVKELEKHFLCHVDCPCNKVNNGSYRVETPRQASWTNSRRSTR

 


 

Length: 4305  February  2, 1996

       1  CAACAAGCCT GGGACGCAAT ACAACCGAAA CCAGCCCGCT GTTAGTTGAC

      51  CCTTTGACAA CGTCTTGCGA ACGGCATCCT GCTTGCTCTC CTATGTATTC

     101  GGCGTCACAA CAACAAGTGA AATAGCACAC TTTTACTCAT CGTAATCGCG

     151  CCAAGgcgtg cgtgctccag Gctaccgcct acatggctgt tgcctgcgcg

     201  gttgccgtcc ggcccctggt ccaagtGgcc gttgcctctg ccgtttccac

     251  cgccgcccct gcctcctcga agcctgccgt Caagctcgcc gcctccgccg

     301  tgtccgccgt cgcgctcacc accgtctctg tatccgccgg cctgttggca

     351  accaccgctg ttgaagatcc acgttttcat gccgcggact gccagagccg

     401  ttctgctgat gcgagtgcga gctgtgagga ccttcagccc tccacatcca

     451  catgcacatc agCAGtacgt gacgccaaca ggcccacccg ccgtgtcagg

     501  cgcagcggct ccaaggccca acgccgagga tccaccacgc tgacagcttc

     551  cgTtccatcc atggcggcgg ccgtcgtgct cccacccaag atcgccctgc

     601  ggcggcgcca tcggctgcgc cttcgggcgg gacacagcgc cactgctgct

     651  gccactgaca agacccctcg ggagcagccg gataagcccg ctgcattgcc

     701  ggaggatctg ctcccggccg acgccacctC cacctccagc acgggcaaaa

     751  tctcctccgc cgccgtgtgc tgcggcctgc tggcgcaCTG CAGCGCTGCC

     801  CAGCTGCACG CCATCCTGTG CGGGCTAGTG CAAGCCGTGG CATCCAGCAG

     851  CGTCAAGGGC AACAATCGGA AGCTGCTGCT GGGCTCCAAG CTGCGCAAGC

     901  TGCTGGAGGG CGTGGGCGTG GCGCCGGCCA ACGGCAAAGC CTACACCGCC

     951  GCCGACGTGG CGGCACTCTC CGGCCCCAAG CTGGAGCGGC TGCGGGCCAC

    1001  GCTGAAGTCG CAGCCGGGAC TGCTGCTGTG GTTCCTGCTG TTTACCGCGC

    1051  CCGCTAAGCT GCAGGCGCTG CAGgcggcgc tgctgccggg cggcgcgggc

    1101  gacaggagct tcgaggagtg gcgcgccgcg attgacgctg tggccggcag

    1151  cggccacgag cagctggcgg cggcgcagga aGTGAGGGGC CGCCAGTCGG

    1201  CGTGCGTTGA GGGCAGTACG GCCGGCAACA CCGCCACCAC CGCCACCATC

    1251  ACCACCACCA ACAACAACCC CGCCAGTCAT GGTGGCGTCT ACACGGCGCT

    1301  CACGGGCACC GAGGTTACCG GCAAGaagCc cgctgcattg ccggaggatc

    1351  tgctcccggc cgacgccacc tccacctcca gcacgggcaa aatctcctcc

    1401  gccgccgtgt gctgcggcct GctggcgcaC TGCAGCGCTG CCCAGCTGCA

    1451  CGCCATCCTG TGCGGGCTAG TGCAAGCCGT GGCATCCAGC AGCGTCAAGG

    1501  GCAACAATCG GAAGCTGCTG CTGGGCTCCA AGCTGCGCAA GCTGCTGGAG

    1551  GGCGTGGGCG TGGCGCCGGC CAACGGCAAA GCCTACACCG CCGCCGACGT

    1601  GGCGGCACTC TCCGGCCCCA AGCTGGAGCG GCTGCGGGCC ACGCtgaagt

    1651  cgcagccggg actgctgctg tggttcctgc tgtttaccgc gcccgctAag

    1701  ctgcaggcgc tgcaggcggc gctgctgccg ggcggcgcgg gcgacaggag

    1751  cttcgaggag tggcgcgccg cgatCgacgc tgtggccggc agcggccacg

    1801  agcagctggc ggcggcgcag gaagtgaggg gcCgccagtc ggcgtgcgtt

    1851  gagggcagTa cggccggcaa caccgccacc accgccacca tcaccaccac

    1901  caacaacaac cccgccagtc atggtggcgt ctacacggcg ctcacgggca

    1951  ccgaggttac cggcaaggcg gcggccaaca aggatctgtc ccgcacccgc

    2001  accaccagcc acaggaaccg gtGcgtgtcc gagagtggca gcacgcgcaa

    2051  caagagcagg agtagcagca gcaggagcag cagtactcac agcgtggagt

    2101  acgcggagcc gaaggcgggc tgctcccagc ccgccgccac cgtcccgggc

    2151  tgcgtgcctg agatcatcag cgcagctata ccgccgctgg ctccgctagc

    2201  tctgcacatc cggcgcgcca tagtgaagga gctgctggag gccaggccgc

    2251  cgggctggaa cacctttctg tactcctggc tgcaggcggc ggggctgtcc

    2301  gagttcctgc cggccaatgg cacctgccgc atgtacatgg cagacaggaa

    2351  gcagctcgtg ctccgcgtgg gcgcgatgcg cgaggagcag gtggacgcgt

    2401  tcctgacgtg catgtgcaag gcgcacgggc acagcacgtg gctcgcgcgc

    2451  tacctgcaca tgctggggcc ggaggtttcg cagctgctgt cccagggtcg

    2501  gtacagcgac gaactgctgg ccgcgctgcg ggccgccggg cagaaaacgc

    2551  tggccgatgc aGTCATGGAG CaCTTCTGGG GGcgggaCCC GGATCCGGAG

    2601  GACAGTGAGG CCGGCGAGAT GGATGTCAAG CCGTGGGCGG AGCGCCTGGG

    2651  ACTGCTGAGG TTTGACATGC TCGCGGAGCA GCTGAGGCTG CCGCCCAACG

    2701  CCGACGgcTc cgtgaagaac ttcagcaacg ggctggtctt caaggtcgac

    2751  ccgctagagg tgtggagcaa gtacactgac ggagagccgt cagcaggcgc

    2801  gctgagtggc atgcgtgcca ccgacaagga ggcccgcgac aagcaggtca

    2851  agcagcttcg cggcgtcccg ttgctgtacc tgtggcggat aggcggccgg

    2901  gtggtgtacg tgggcatgtc tgggggctgg gtgaagggcc gtcggatagc

    2951  acgctatttg gcagagggcc cgggAttcag cgAgtcgtcg aagatgctgc

    3001  cgtggctgac agccattgat gagggcaagg aGatcgaGCt ccgggtcatc

    3051  acgctggagg gattgaaggc attggagggt atgtcggagg gcatgtccga

    3101  ggaggaggtg cagaaaaagg tgcagaagaa ggtgaaggag ctcgagaagc

    3151  acttcctgtg ccatgtgGac tgcccgtgca acaaagttaa caacgggtct

    3201  taccgggttg agacgccccg ccaggcctcc tggacaaact caaggagaag

    3251  tacaagataa aagtacggca acaaagcgca gGccggcaac cccgctgagc

    3301  atcagtccaa ggaacagccc gagtagcctg CTGaggtctc gcttcagatg

    3351  cgacaatcca attgcggcgg gtcgCcAttg caccacgggc cgttgcggtg

    3401  gcgtgagcgg gctgcatgtg cacaagccgg ggctttcagg cAggtcagat

    3451  agaatcatgc acccgcggca tgcggtcagc gcacaagcgg agctgcggcc

    3501  agtttgcatg caggggcgcT gccgcacagg gccaggcgcg tgtacggttg

    3551  ctgcagttgt gtgcttttgc gcagctcatc acgtagatta ggtgcgtgtg

    3601  tgcatttgcg ggctgaacgc ggttggaacg atgtgcgggg tagtgcatga

    3651  tAgctgtgga cagttgcatc ttacgtttct tttatttAtt tgtacaactg

    3701  tgggcctacc tcttaccgag atgagcgaaa cgggatgcAt aggatgtgcg

    3751  tggcaagctg atggcgggtg gcgggtgtgc aaccagtatg tgcgggcaag

    3801  gagagcgtgc tgcccttcac atggatgttc tcctggtgcc aagttgtgta

    3851  tcgtctattg tgggccattg ccgtgtggag gcgcttatcc ctagtaaatg

    3901  ttgtctacct gctcagctcc cacattgggt taggactgat atgcaccagc

    3951  agcactcatt gatcgaacag catgtagatt catgcccaat aggttgccgg

    4001  cgtaaTTGgT TGCAGgCTGG CATgCGTGTG TTGTACCGgT AACTTGTAAg

    4051  GCACAGACAg GCTGCATGCA TAGCGGGCAG gTtTAGCATG GCACGAAGAC

    4101  GCATCCCGTT TGGTGTAGCC TTTGCTCACT GTGTTGAGGG GGTTAGTCGT

    4151  CTGGTCGaAC GTGCCGATCC CGATTGAGAT GAGGCAAGCG CAGGCTTGGG

    4201  CAGACGTACC GTATCCGCCT TTAGAGAGAG TCTAATGAAG GCGTGGAAGC

    4251  ATGAGCGGCT GCAGTCTGAC CAAGCACTTT CCCATCAGAC AACAAGCaAA

    4301  CCCAG