Z54142 . S.pombe chromosome...[gi:984697]
Protein, Related Sequences
LOCUS SPAC24H6 36833
bp DNA
PLN 07-MAR-2000
DEFINITION S.pombe chromosome I cosmid c24H6.
ACCESSION Z54142
VERSION Z54142.1 GI:984697
KEYWORDS 40S ribosomal protein S9; 5S rRNA; cdc25; cullin
3; DAHP synthetase
family; guanine nucleotide exchange factor; hexokinase; Homol D
box; Homol E box; hsk1; M-phase inducer phosphatase; major
facilitator family; phospo-2-dehydro-3-deoxyheptonate aldolase;
Rhodanese; rps9; ubiquitin activating enzyme; ubiquitin--protein
ligase.
SOURCE fission yeast.
ORGANISM Schizosaccharomyces pombe
Eukaryota; Fungi; Ascomycota; Schizosaccharomycetales;
Schizosaccharomycetaceae; Schizosaccharomyces.
REFERENCE 1 (bases 1 to 36833)
AUTHORS Skelton,J. and Churcher,C.M.
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 36833)
AUTHORS Barrell,B.G., Rajandream,M.A., Walsh,S.V. and Wood,V.
TITLE Direct Submission
JOURNAL Submitted (10-SEP-1995) Schizosaccharomyces pombe chromosome
I
sequencing project, Sanger Centre, Hinxton Hall, Hinxton, Cambridge
CB10 1RQ E-mail: barrell@sanger.ac.uk
COMMENT Notes:
Details of yeast sequencing at the Sanger Centre are available on
the World Wide Web.
(URL, http://www.sanger.ac.uk/Projects/S_pombe/)
Protein coding regions (CDS) have been predicted with the help of
computer analysis using the Genefinder program in PomBase (an ACEDB
database) with additional predictions for the branch-acceptor sites
supplied by the program Sp3splice. CAUTION: It is possible that for
any individual CDS we may have underestimated or overestimated the
number of introns/exons or we may not have chosen the correct
splice donor/acceptor sites.
CDS are numbered using the following system eg SPAc5H10.01c. SP (S.
pombe), A (chromosome 1), c5H10 (cosmid name), .01 (first CDS), c
(complementary strand).
The more significant matches with motifs in the PROSITE database
are also included but some of these may be fortuitous.
The length in codons is given for each CDS.
IMPORTANT: This sequence MAY NOT be the entire insert of the
sequenced clone. It may be shorter because we only sequence
overlapping sections once, or longer, because we arrange for a
small overlap between neighbouring submissions.
Cosmid c24H6 is overlapped at the 5' end by cosmid c23E2.
FEATURES
Location/Qualifiers
source 1..36833
/organism="Schizosaccharomyces pombe"
/strain="972h-"
/db_xref="taxon:4896"
/chromosome="I"
/map="1L"
/clone="cosmid c24H6"
gene
complement(1..800)
/gene="SPAC24H6.01c"
misc_feature 1
/note="single base overlap with cosmid SPAC23E2, EM:Z68887
S.pombe chromosome 1"
CDS
complement(join(1..4,45..478,554..800))
/partial
/gene="SPAC24H6.01c"
/note="SPAC24H6.01c, len:>228, SIMILARITY:Saccharomyces
cerevisiae, YGI4_YEAST, hypothetical 65.3 kd protein in
mad1-scy1 intergenic region., (560 aa), fasta scores: opt:
300, E():1.3e-12, (33.3% identity in 183 aa)"
/codon_start=1
/label=SPAC24H6.01c
/product="hypothetical major facilitator family protein"
/protein_id="CAA90845.2"
/db_xref="GI:6066742"
/db_xref="SWISS-PROT:Q09758"
/translation="MLRLFRFDVLETSTKDTERPNSKSSRLSSTSGSSHPSSSSRLTV
RSAVPEKSAFGSIEFIFYFSVILSILTIACFKIHYVSSPKHPNYKNIEKYLKPGWLFG
QKVDSADFQYSAFRENMPILLLVIIVYNFLWRLVKLVFTKNTNDELAIKNNYRLCFSL
LFALLVYGTGVIYVLTIALINYLISKSLKNSIFNPLLTWTLDISVVFFKEYFAYCKFS
SLHPGLGFLD"
misc_feature complement(5..19)
/gene="SPAC24H6.01c"
/note="ttaatctttttacag, splice branch and acceptor"
misc_feature complement(39..44)
/gene="SPAC24H6.01c"
/note="gtacgc, splice donor sequence"
misc_feature complement(479..493)
/gene="SPAC24H6.01c"
/note="ctaactaggcgtcag, splice branch and acceptor"
misc_feature complement(548..553)
/gene="SPAC24H6.01c"
/note="gtaagt, splice donor sequence"
gene
complement(1466..2037)
/gene="SPAC24H6.02c"
CDS
complement(join(1466..1742,1787..2037))
/gene="SPAC24H6.02c"
/note="SPAC24H6.02c, len:175, SIMILARITY:Saccharomyces
cerevisiae, YN50_YEAST, hypothetical 23.5 kd protein, (205
aa), fasta scores: opt: 308, E():5.9e-14, (45.4% identity
in 108 aa)"
/codon_start=1
/label=SPAC24H6.02c
/product="hypothetical protein"
/protein_id="CAA90846.1"
/db_xref="GI:984699"
/db_xref="SWISS-PROT:Q09759"
/translation="MWSFARYTQKFCLRNLQLKNFSNKVKVSSLLFCSPNVRNISNWQ
CKRFTQSDAKDLASGVPSVKSDADQLQPKPTYNVSFTCTVCNTRSNHNFSKQAYHNGT
VLVQCPKCKNRHLMADHLKIFSEERVTIEDILAKKGETFKKGYGQVINGNVVEFKPPQ
FKIRPAKSSSSNSSK"
misc_feature complement(1743..1757)
/gene="SPAC24H6.02c"
/note="ctaaccaaatgaaag, splice branch and acceptor"
misc_feature complement(1781..1786)
/gene="SPAC24H6.02c"
/note="gtaagt, splice donor sequence"
gene
2201..4856
/gene="SPAC24H6.03"
CDS
join(2201..2242,2299..3683,3723..3832,3885..3949,
3994..4167,4219..4628,4685..4856)
/gene="SPAC24H6.03"
/note="SPAC24H6.03, len:785"
/codon_start=1
/label=pcu3
/product="cullin 3 homolog"
/protein_id="CAA90847.1"
/db_xref="GI:3336937"
/db_xref="SWISS-PROT:Q09760"
/translation="MQRSAKLKIRAPRKFSANQVDFATHWEVLQRAIGDIFQKSTSQL
SFEELYRNAYILVLHKYGEKLYNHVQDVIRSRLKEETVPAIYKNYDASLLGNALLDIR
KNDSYSTSWSRSLEAAHRFLSSLVNSWKDHIVSMQMISSVLKYLDKVYSKSADKVPVN
ENGIYIFREVVLLNSFEIGEKCVETILILVYLERKGNTINRPLINDCLDMLNSLPSEN
KKETLYDVLFAPKFLSYTRNFYEIESSTVIGVFGVVEYLKKAEKRFEEEKERSKNYLF
TKIASPLLSVVEDELLSKHLDDLLENQSTGFFSMIDSSNFEGLQLVYESFSRVELGVK
SLKKYLAKYVAHHGKLINETTSQALEGKMAVGRLSSNATMATLWVQKVLALWDRLNTI
ISTTMDADRSILNSLSDAFVTFVDGYTRAPEYISLFIDDNLKKDARKAIEGSIEATLQ
NSVTLFRFISEKDVFEKYYKTHLAKRLLNNRSISSDAELGMISRLKQEAGNVFTQKLE
GMFNDMNLSQELLQEYKHNSALQSAKPALDLNVSILASTFWPIDLSPHKIKCNFPKVL
LAQIDQFTDFYLSKHTGRKLLWYPSMGSADVRVNFKDRKYDLNVSTIASVILLLFQDL
KENQCLIFEEILEKTNIEVGDLKRNLQSLACAKYKILLKDPKGREVNAGDKFYFNENF
VSNLARIKISTVAQTRVEDDSERKRTLEKVDESRKHQADACIVRVMKDRKVCEHNQLM
AEVTRQLNPRFHPSPMMIKRRIEALIEREYLQRQADNGRIYEYLA"
misc_feature 2243..2248
/gene="SPAC24H6.03"
/note="gtaagt, splice donor sequence"
misc_feature 2288..2298
/gene="SPAC24H6.03"
/note="ctgacatttag, splice branch and acceptor"
misc_feature join(2314..3683,3723..3832,3885..3949)
/gene="SPAC24H6.03"
/note="Match to PF00888 Cullin, Cullin family Score
749.66"
misc_feature 3684..3689
/gene="SPAC24H6.03"
/note="gtatgc, splice donor sequence"
misc_feature 3710..3722
/gene="SPAC24H6.03"
/note="ttaacacttttag, splice branch and acceptor"
misc_feature 3833..3838
/gene="SPAC24H6.03"
/note="gtaagt, splice donor sequence"
misc_feature 3874..3884
/gene="SPAC24H6.03"
/note="ttaatgcttag, splice branch and acceptor"
misc_feature 3950..3955
/gene="SPAC24H6.03"
/note="gttagt, splice donor sequence"
misc_feature 3983..3993
/gene="SPAC24H6.03"
/note="ctaacagttag, splice branch and acceptor"
misc_feature 3997..4167
/gene="SPAC24H6.03"
/note="Match to PF00888 Cullin, Cullin family Score 80.72"
misc_feature 4168..4173
/gene="SPAC24H6.03"
/note="gtatag, splice donor sequence"
misc_feature 4202..4218
/gene="SPAC24H6.03"
/note="ctaacaatcctatttag, splice branch and acceptor"
misc_feature join(4219..4628,4685..4853)
/gene="SPAC24H6.03"
/note="Match to PF00888 Cullin, Cullin family Score
311.50"
misc_feature 4629..4634
/gene="SPAC24H6.03"
/note="gtatgt, splice donor sequence"
misc_feature 4669..4684
/gene="SPAC24H6.03"
/note="ctaacagtaaattaag, splice branch and acceptor"
gene
5915..7369
/gene="hsk1"
CDS
5915..7369
/gene="hxk1"
/note="SPAC24H6.04, len:484"
/codon_start=1
/label=hsk1
/product="hexokinase 1"
/protein_id="CAA90848.1"
/db_xref="GI:984701"
/db_xref="SWISS-PROT:Q09756"
/translation="MSLHDAYHWPSRTPSRKGSNIKLNKTLQDHLDELEEQFTIPTEL
LHRVTDRFVSELYKGLTTNPGDVPMVPTWIIGTPDGNEHGSYLALDLGGTNLRVCAVE
VQGNGKFDITQSKYRLPQELKVGTREALFDYIADCIKKFVEEVHPGKSQNLEIGFTFS
YPCVQRSINDASLVAWTKGFDIDGVEGESVGPLLSAALKRVGCNNVRLNAILSDTTGT
LVASNYASPGTEIGVIFGTGCNACYIEKFSEIPKLHKYDFPEDMNMIINCEWCDFDNQ
HVVLPRTKYDVAIDEESPRPGLQTYEKMIAGCYLGDILRRILLDLYEQGALFNGQDVT
KIRDPLAMDTSVLSAIEVDPFENLDETQTLFEETYGLKTTEEERQFIRRACELIGTRS
ARLSACGVCALVRKMNKPSMIVGTDGSVYNLYPRFKDRLAQAFKDILGEEIGSKVVTI
PAEDGSGVGAALVSALEAKGKALTSDILAEHLKN"
misc_feature 5987..7324
/gene="hsk1"
/note="Match to PF00349 hexokinase, Hexokinase Score
959.14"
misc_feature 6377..6454
/gene="hsk1"
/note="PS00378 Hexokinases signature"
gene
9276..11066
/gene="cdc25"
CDS
9276..11066
/gene="cdc25"
/note="SPAC24H6.05, len:596aa; conflict with published
sequ ence; SPAC24H6.05, len:596"
/codon_start=1
/label=cdc25
/product="M-phase inducer phosphatase"
/protein_id="CAA90849.1"
/db_xref="GI:984702"
/db_xref="SWISS-PROT:P06652"
/translation="MDSPLSSLSFTNTLSGKRNVLRPAARELKLMSDRNANQELDFFF
PKSKHIASTLVDPFGKTCSTASPASSLAADMSMNMHIDESPALPTPRRTLFRSLSCTV
ETPLANKTIVSPLPESPSNDALTESYFFRQPASKYSITQDSPRVSSTIAYSFKPKASI
ALNTTKSEATRSSLSSSSFDSYLRPNVSRSRSSGNAPPFLRSRSSSSYSINKKKGTSG
GQATRHLTYALSRTCSQSSNTTSLLESCLTDDTDDFELMSDHEDTFTMGKVADLPESS
VELVEDAASIQRPNSDFGACNDNSLDDLFQASPIKPIDMLPKINKDIAFPSLKVRSPS
PMAFAMQEDAEYDEQDTPVLRRTQSMFLNSTRLGLFKSQDLVCVTPKQSTKESERFIS
SHVEDLSLPCFAVKEDSLKRITQETLLGLLDGKFKDIFDKCIIIDCRFEYEYLGGHIS
TAVNLNTKQAIVDAFLSKPLTHRVALVFHCEHSAHRAPHLALHFRNTDRRMNSHRYPF
LYYPEVYILHGGYKSFYENHKNRCDPINYVPMNDASHVMTCTKAMNNFKRNATFMRTK
SYTFGQSVLASPDVNDSPTAMHSLSTLRRF"
misc_feature 10539..10856
/gene="cdc25"
/note="Match to PF00581 Rhodanese, Rhodanese-like domain
Score 116.58"
gene
12120..14267
/gene="SPAC24H6.06"
CDS
join(12120..12161,12210..13180,13274..14267)
/gene="SPAC24H6.06"
/note="SPAC24H6.06, len:668, SIMILARITY: LOW to Sus
scrofa, Q29070, gastric mucin, (317 aa), fasta scores:
opt: 138, E():0.13, (25.9% identity in 216 aa)"
/codon_start=1
/label=SPAC24H6.06
/product="hypothetical coiled coil protein"
/protein_id="CAA90850.2"
/db_xref="GI:6066743"
/db_xref="SWISS-PROT:Q09761"
/translation="MNNDHASKKSFCIKAPSNWEKSYLEVWPLVTVPRQCICLRWCIS
KEYHEFTCSSLQFIVIRPAGTSVLLGRVKSSKANQLVGIEHVEGSRYALIFLSEKLDF
KSLKVIANHQLTKSSKSLSNVSNKPLGDQLFRSNSLMSPSLLKKELHRIQSDASQANE
RESQAPHSFVTHDLISSSKDGNSLTHEFANDSVTEMVQDYTPSCSRDVKSLLDHLYNS
YFYQLLMTKTPVVFYVKQMVGKTRQLAVEVHNHVEEKALVDELLKFLDNLKSVDDRKS
RLLQCFESHLNYKAWHLEFENEAHQYEIKGYRLWLQNILNRENCQITKLDFEREFSQL
KLKDLLDSRDSGKRKSRKKNAKTLNPFETAQLKLEFTFDGLCIRRTIEQNATERSEDL
LLKFCKETIVPYYSSKFPRITRNLLEKCNGLDLLPERSHKHRHSAPPRSKLISSKSEA
GRALPGNTSGASISNTSSPHSEASISKDYEILKRRRSNSGVHSLTRSDSSFNGFERDT
RRRSSDIARIKNREINLPSSSLSKQRNSMHDISTNFPRRNLSFTEKLTMASLQGQSEE
SVQPKTTSSLSRSKTLSILEGSVSKRSEPSMDSILVQATPRKSSSVITELPDTPIKMN
SLDKASACTVENHIVTESPAHKSNKAQLFVCVPTTPVKKKSASP"
intron 12162..12209
/gene="SPAC24H6.06"
/note="confirmed intron"
misc_feature 12162..12167
/gene="SPAC24H6.06"
/note="gtactt, splice donor sequence"
misc_feature 12196..12209
/gene="SPAC24H6.06"
/note="ttgacatcttttag, splice branch and acceptor"
misc_feature 13181..13186
/gene="SPAC24H6.06"
/note="gtatga, splice donor sequence"
misc_feature 13261..13273
/gene="SPAC24H6.06"
/note="ctaatgataacag, splice branch and acceptor"
promoter 14624..14633
/note="Homol E box"
promoter 14645..14652
/note="Homol D box"
gene
14703..15323
/gene="rps9"
CDS
join(14703..14717,14763..15323)
/gene="rps9a"
/note="SPAC24H6.07, len:191"
/codon_start=1
/label=rps9
/product="40s ribosomal protein S9"
/protein_id="CAA90851.1"
/db_xref="GI:984704"
/db_xref="SWISS-PROT:Q09757"
/translation="MPSAPRKQSKTYKVPRRPFESARLDAELKLAGEYGLRNKHEIWR
VALTLSKIRRAARELLTLDEKDPKRLFEGNAIIRRLVRLGILDETRMKLDYVLALRIE
DFLERRLQTQVFKLGLAKSIHHARVLIFQRHIRVGKQIVNVPSFVVRLDTQKHIDFAL
SSPYGGGRPGRCKRKRLRSQEGGEGEEAEEE"
misc_feature 14718..14723
/gene="rps9"
/note="gtaagt, splice donor sequence"
misc_feature 14748..14762
/gene="rps9"
/note="ctaacattaaattag, splice branch and acceptor"
misc_feature 14763..15320
/gene="rps9"
/note="Match to PF00163 Ribosomal_S4, Ribosomal protein
S4/S9 N-terminal domain Score 316.39"
misc_feature 15060..15134
/gene="rps9"
/note="PS00632 Ribosomal protein S4 signature"
gene
16119..16781
/gene="SPAC24H6.08"
CDS
16119..16781
/gene="SPAC24H6.08"
/note="SPAC24H6.08, len:220aa"
/codon_start=1
/label=SPAC24H6.08
/product="hypothetical protein"
/protein_id="CAA90852.1"
/db_xref="GI:984705"
/db_xref="SWISS-PROT:Q09762"
/translation="MTSHQTGKIEVHWNDPSSGIFLSQTQRTSSTSSLKKSASSRRLV
YGDDMLAPKPLAGQVLSKSPLPPFSPSKIMNRSISVPPTNISVPQISSNPLNLMKKSS
DNDIFTTFNDTTNDCMNEASCEDVRHSLLQIIESKSNLSDSVHKMLGERVKTNLLLQG
QLENMEGFWLQKLSQTCRLALTGNISEAKAFIVEIMCAGVVTNCVRWCPVLKTLIENL
AI"
gene
17917..20233
/gene="SPAC24H6.09"
CDS
join(17917..17958,18014..20233)
/gene="SPAC24H6.09"
/note="SPAC24H6.09, len:753, SIMILARITY:Dictyostelium
discoideum, AAD47903, unconventional myosin heavy chain
myom., (1737 aa), fasta scores: opt: 302, E():6.2e-10,
(24.4% identity in 488 aa)"
/codon_start=1
/label=SPAC24H6.09
/product="putative guanine nucleotide exchange factor with
coiled-coil regions"
/protein_id="CAA90853.1"
/db_xref="GI:984706"
/db_xref="SWISS-PROT:Q09763"
/translation="METLKADLSDMSDSPEYFETLANRDLPRLPGTSKLHRAACIKRK
SINLSLPSNSYSLSYRQSDDTDGDVSESQSEYRLSSGRRSRASFARALQDPQTPNTPP
VSSQHRRFFSEGSFNLPNSNMSHSLNGDSTASNSSTLTPNRIYGDRNRQDYAQSSRYT
LPSLPSSPSYNCPTTLRKIHTNTSSNGTSRRVSGLGSFMAQNSSETSSNRTSAYLPGS
STDEQEKRSSVTLASMPSSHSSTASLLSPLDTTSFSTKLDSTILALETDESLSRTISY
ATTSLPSTPGKRLSKESLAMSEASSINPEYKRIKKRANLIKELVTTEAAYLNDLIAIQ
QSYGLRVKECSALNPVDAQTVFGDIESLLTFTVEFHSRLYQAGEGSWRVNLDTQLIDP
LPCNLGLIFLESLSEIGQIYTGYCNRQDSVFKIITKWREKPATASWIMEGDKIVQKYT
NAWDLGSLIIKPLQRLLKYPLLLQKIIDVTPESSSERPDLVLSYQLLQELISGINQKQ
KPSHKRGSLSASHKRDAAWSLLYKATSNKSRPTTTSTELKTDARLNFQRQVLQDFRQR
FAILKALHATLETWYVTVHRGFSVFEKVLAELEGLSALEPEDKPVDTWRKYHLLAHMM
TANLPSQIQTSLNSSILNPITNILRVIQKVIQFIISAEFVIPAQKLEAISTLVEKEFH
SVVYHFIGIQRSLYENYAQGFLFLIPQDMRDSILEETQDYAELVRAFEPMHYDDEVLL
EELMKSVSLAARV"
misc_feature 17959..17964
/gene="SPAC24H6.09"
/note="gtatgt, splice donor sequence"
misc_feature 17997..18013
/gene="SPAC24H6.09"
/note="ttaacaatgttgatcag, splice branch and acceptor"
misc_feature 18914..19489
/gene="SPAC24H6.09"
/note="Match to PF00621 RhoGEF, RhoGEF domain Score
288.04"
gene
complement(20990..22096)
/gene="SPAC24H6.10c"
CDS
complement(20990..22096)
/gene="SPAC24H6.10c"
/note="SPAC24H6.10c, len:368, SIMILARITY:Saccharomyces
cerevisiae, AROG_YEAST, phospho-2-dehydro-3-deoxyheptonate
aldolase, tyrosine-inhibited(ec 4.1.2.15), (370 aa), fasta
scores: opt: 1447, E():0, (62.5% identity in 347 aa)"
/codon_start=1
/label=SPAC24H6.10c
/product="phospho-2-dehydro-3-deoxyheptonate aldolase"
/protein_id="CAA90854.1"
/db_xref="GI:995560"
/db_xref="SWISS-PROT:Q09755"
/translation="MDKHTPLLPGDSVFSRCKTEDSRIKGYDPVISPALIQSELAASD
ETLAFVSDQRRQAADIIAGRDDRLLLIVGPCSLHDPVAAKEYAIRLQKEAIKHKKDLH
IIMRAYLEKPRTTVGWKGLINDPDLDGSYNINKGIRVARRIFLELLETGVGIASEMLD
TISPQYLADLICWGAIGARTTESQLHRELASGLSFPIGFKNATDGNIGIAIDAMNSSA
NPHHFLSVTKQGVVAIVTTTGNPDTHIILRGGKSGTNFDADSVAGAKAKLEECNKLPS
IMIDCSHGNSSKNHKNQPKVAACIAEQVANGQKAITGVMIESHLNEGKQAIPEDDLSS
MKYGVSVTDACIGWDDTTAVFEQLAAAVRSRRSH"
misc_feature complement(21008..22045)
/gene="SPAC24H6.10c"
/note="Match to PF00793 DAHP_synth_1, DAHP synthetase
family Score 903.18"
gene
complement(22641..25517)
/gene="SPAC24H6.11c"
CDS
complement(22641..25517)
/gene="SPAC24H6.11c"
/note="SPAC24H6.11c, len:958, SIMILARITY:Saccharomyces
cerevisiae, YG35_YEAST, hypothetical 117.0 kd protein in
asn2-phb1 intergenic region., (1036 aa), fasta scores:
opt: 1804, E():0, (36.4% identity in 968 aa)"
/codon_start=1
/label=SPAC24H6.11c
/product="putative sulphate transporter"
/protein_id="CAA90855.1"
/db_xref="GI:984707"
/db_xref="SWISS-PROT:Q09764"
/translation="MPTHSLNQKRSLLYPSIDNGSVPDSSFLSQSFRDTNDIHLLSAR
LASVEVDPETLPSNSPETNDDVSNYDYMSISSYQRAHSLASTSRNDFNYRPSEINETR
PLLPEREEHSSQLPPAQNQVLPFQSLNAPSKKLRGRISLSKIYQFFFDDKGPILFILN
IPAVIIGLLLNILDALSYGLILFPISDPLFKNLGADGLAIYYVSCVVSQLVYSLGGSF
FKGSVGSEMIEVIPFFHQIAFTILNRVGEDNPKSVIATTILAYCLSSILTGLVFFILG
ILRLGRLIEFFPRHILLGCIGGVGSFLVLTAVEVSSRLEGSVSFNWASLSALFQPMTF
AKWSIPLFLSSALEFAQQRWPHPFLIPSFFVIAPAIFYVLVWAIPGMSLEYLRETGWV
FSSTETNVPWYHFYSLFSLRDTDWSALLATVPEMCALTFFGILHVPINVPALAISLGL
DFVDTDKELIAHGVSNTLSGAVGSIQNYMTYTNSLMFIRSGGNNRLAGIMLALATVAL
LVIGPGIIAYIPVWTVGCLIYLLGIELLKESLWDPIGITTKIEYFTICAIVFTMTVVD
FVVGIVIGIIMACVFFVIQASSRSALRGIYSGGIVRSTVRRPMNQQRFLNEIGRQIQV
CKLSGFLFFGTINGVEKNIAGLIEELNVSNNPLRFLIIDFSLVNGADFSVVQAFLRIR
RMLATMNVQLCVCGLDETRSSFKTLTTMCFGGDDNCGCQVFEDVNSSLEYCENMLLDD
YDVYRTKLLHKAGYSHTLAVPGKHKQNISMAETFSPSPRHDLLRQVAMSSVKAETKEL
SKFEKYAKYEQPFPLLMQVFGEITTKREDFWLGLCPFFKKAFLRKGDLLWRKGDTPSK
LVILETGMVKASYQMDRESLSENITCLCVVGELPFFSKTCYNATVVAELDSAVWILDR
EGWETMLKSSDKAEVIENEMLYLTLKMTRDKFNVFTNFALNLYR"
misc_feature complement(23763..24695)
/gene="SPAC24H6.11c"
/note="Match to PF00916 Sulfate_transp, Sulfate
transporter family Score 297.33"
gene
complement(27543..28877)
/gene="SPAC24H6.12c"
CDS
complement(27543..28877)
/gene="SPAC24H6.12c"
/note="SPAC24H6.12c, len:444, SIMILARITY:Physarum
polycephalum, P90586, ubiquitin-activating enzyme, (427
aa), fasta scores: opt: 876, E():0, (46.5% identity in 432
aa)"
/codon_start=1
/label=uba3
/product="ubiquitin activating enzyme"
/protein_id="CAA90856.1"
/db_xref="GI:984708"
/db_xref="SWISS-PROT:Q09765"
/translation="MPSSDVCKAGSHRHSGWIQSLKKPGPFNLDAPENPEETLKSAFS
SKILIIGAGGLGCEILKDLALSGFRDLSVIDMDTIDITNLNRQFLFNESNIDEPKANV
AASMIMKRIPSTVVTPFYGKIQDKTIEFYKEFKLIICGLDSVEARRWINSTLVAIAKT
GDLIPLVDGGSEGLKGQARVIIPTITSCYECSLDMLTPKISYPICTLANTPRLPEHCV
EWAYLLEWPRVFLNASVDSFSKQEVFEPLDGKNSNFEPDNIRHIDWLVKRSIERANKF
QIPSSSINRFFVQGIVKRIIPAVASTNAIIAASCCNEALKILTESNPFLDNYMMYVGE
DGAYTYTFNLEKRSDCPVCGVLSEVYDISASSTVTLKDILNHYSKSYNLQNPSVSTAA
GTPLYLASPPALQVATSKNLSQPILSITSVDVNLVITDKNLSTSLSVQLREC"
misc_feature complement(28329..28751)
/gene="SPAC24H6.12c"
/note="Match to PF00899 ThiF_family, ThiF family Score
190.94"
gene
31140..33755
/gene="SPAC24H6.13"
CDS
31140..33755
/gene="SPAC24H6.13"
/note="SPAC24H6.13, len:871, SIMILARITY:Saccharomyces
cerevisiae, YM8G_YEAST, hypothetical 107.7 kd protein ,
(953 aa), fasta scores: opt: 1905, E():0, (39.9% identity
in 909 aa)"
/codon_start=1
/label=SPAC24H6.13
/product="putative major facilitator superfamily protein"
/protein_id="CAA90857.1"
/db_xref="GI:984709"
/db_xref="SWISS-PROT:Q09766"
/translation="MSDSSSSSTSAFVSSLVFNFAIFCAFIGLFLCLRPREKHVYQPR
CIIDTQPKEEKPEPSPSSPFGLFAYVVKRSETYLIQYAGVDGYFFIRYLFTFGALCIL
GCLVLFPILLPVNATNGVGEKGFDILSFSNVKNHNRFYAHVFLSWLFFGFTIFIIYRE
LRYYVIFRHAMQSSGLYNNLPSSSTMLLTELPNSVLNDEETLHELFPNASEFTCVRDL
KKLEKKVKKRSDLGNKYESTLNSLINKSVKKHNKLVKKHKPLPSTLDYTAYVKKRPTH
RLKFLIGKKVDTIDYCRDTIAELDEVVDKLQTSLEERKKVGSVFIRFRSQTDLQTAYQ
AFLYSKKFRKYRFGRALVGIAPEDIVWSNLDLSMYTRRGKKTISNTILTLMIIFWAFP
VAVVGCISNVNYLIEKVHFLKFIDHMPPKLLGIITGILPSVALSILMSLVPPFIKFLG
KFGGALTVQEIENYCQNWYYAFQVVQVFLVTTMTSAATSAVVQVIKEPASSMTLLASN
LPKASNFYISYFLLQGLSIPGGALLQIVTLLLSKVLGRIFDNTPRKKWNRWNQLSAPS
WGTVYPVYSLLVTIMICYSIIAPIIIGFAAVAFVLIYFAYSYNLIYVLGHNADAKGRN
YPRALFQVFVGLYLAEVCLIGLFVLAKNWGATVLEAVFLGFTVACHLYFKYKFLPLMD
AVPISAIESVSERPEIKYPMDLGTSEMKNVGRAYPEILEKLSSSSGSDEFLETSSRTS
ENTKEKIDKDDEGFAITNISSVHKMPSFVLSYFSDLAASNRILTGFDRVLQLLPSFYD
IPVRVRNVQYVSPALKATPPSVWIPKDPLGLSTYAIEDARGKVDIFDDNTTFNEKGNL
QYTGPPPDYDEAIRS"
rRNA
complement(34171..34289)
/note="5S rRNA gene"
/label=5SrRNA
misc_feature 34290..36800
/note="low complexity gene free region"
BASE COUNT 11471 a 6785 c 6583
g 11994 t
ORIGIN
1 gatcctgtaa aaagattaac atctgatggc taaagatagc gtaccaaaaa ccctaggcct
61 ggatgaaggc tagaaaattt gcaataggca aaatattctt taaaaaacac cactgatata
121 tctaatgtcc atgtcaagag tggattaaaa atactgttct ttagagactt agaaatgaga
181 taattgatca atgcaattgt taaaacatat ataacgccag tcccgtagac aagtaatgca
241 aacagcaggg agaagcaaag tcgatagttg ttttttatag cgagttcatc attagtattt
301 tttgtgaaga caagcttaac gagtctccag agaaaattat agacaataat tacaagtagt
361 aagattggca tattttcacg aaacgcggaa tactgaaaat cagcggaatc gactttttgt
421 ccaaacaacc acccaggctt taaatatttt tcaatgtttt tgtaattggg atgttttgct
481 gacgcctagt tagttgctta ctttatttca aattgataaa tagaaaaggc tgattaaagc
541 ttttgacact tacgagaact aacataatga attttaaaac aagcaattgt tagaatcgat
601 agaatgacag aaaagtaaaa aataaattct atggaaccaa aagcgctctt ttccggaact
661 gcgcttcgaa cggtcaaacg tgaagatgaa gatggatgac ttgaaccaga agtcgaactt
721 aaccgtgaac ttttagaatt gggtctttca gtatctttgg tcgaagtctc aagaacatca
781 aatcgaaata aacgtaacat cctagaagtc tcctaccaac agcaaagaag atggtcggtt
841 ctcaaaatat catagagtaa acaaacctag tggtgttcac tagcaaaatg caagacactg
901 cgtaatttac actttgtgtc agcttcatag tccgtaaata attttaagag taaggtagag
961 agagtatgat gaaatcaatt atatttttaa aaataataaa ttcagtttgt attgtttatt
1021 ggaatgaata aaatagtcag cactatacaa aacgcgctat taaaatagaa atattaatta
1081 agcatcttta aagactaaat tttgattttg gtaaattagc aacttatata ttgaaaaatg
1141 ttttacattc gtttgcaata attctgtata gcccatttag atctctaaaa tcgttaattt
1201 ttataattaa ttaattaatt tatttttttt tggatatgat ttgctcaatg attgctaaat
1261 tatcacatct agtctaaaaa ttcaaaagat aatttcaagt taacaagctc atataataat
1321 taaatgttct attagaacgt cgtatatatt gaccaaagta ttacaaaaat aaataaaaat
1381 aatatatact atagaaatta aaataaacaa tgactatatc ttgcaattta tatgctccta
1441 aacccaattt gatcgctctc ctagttcatt tggaagaatt ggaagaagat gattttgctg
1501 gtcgaatttt aaattgaggg ggcttaaatt ctacaacatt gccattgatt acttgaccat
1561 agcctttttt aaaagtttcc cctttcttag ctaaaatgtc ttcaatcgtc acccgttctt
1621 cagagaatat ttttaaatga tcagccatca agtgccgatt tttgcactta ggacattgaa
1681 ctaaaacggt cccattatgg taggcctgct tggagaaatt gtgatttgac ctagtgttac
1741 atctttcatt tggttagtat ccttcaatga aattataaat acttacacgg tacaagtaaa
1801 agaaacatta taagtcggtt taggttgtaa ttgatcagca tctgatttta cagagggaac
1861 accacttgca agatctttag catcgctttg agtgaagcgt ttgcattgcc aattagaaat
1921 gtttctcaca ttcggggaac agaaaagtaa acttgaaacc tttaccttgt ttgaaaaatt
1981 ttttaactgc aaatttcgta aacagaattt ttgagtatac cgtgcaaagc tccacattat
2041 gtttgaaaaa tttttgtgtt cggagaatgt tgttcgtagt tagtaccttc taccaccttt
2101 tgtaccatta cttccgtctg tgctggcgtt tctttatgca aactcatatg cccatatatt
2161 tagctttttt gttgaaactt aactttaaat tccattatta atgcagcggt cagcgaagtt
2221 aaaaattcga gcgccacgaa aagtaagtgt acctgatgca tgctgtcctt attattttcc
2281 tcacatactg acatttagtt ttctgctaac caagtagact ttgctacaca ttgggaagtc
2341 ttgcagcgag ctattgggga tatatttcag aaaagcacat ctcaattatc gtttgaagag
2401 ttgtatagaa atgcatatat acttgttcta cataagtatg gagaaaagct ttataatcat
2461 gtacaggatg ttattagatc gaggcttaaa gaagaaactg ttccagctat ttacaaaaac
2521 tatgacgctt ccttattagg gaatgctcta ttggatatta gaaaaaatga ttcctattct
2581 acttcatggt cgagatcatt ggaagcggct catagatttt tatcgtcttt agttaattcc
2641 tggaaggatc acattgtttc aatgcaaatg atatctagtg tattaaaata tttggacaag
2701 gtttattcaa aatctgcaga caaagttcct gttaatgaaa acggaatata tatctttcgt
2761 gaagttgttt tgttaaatag ttttgaaatt ggagagaaat gtgtcgaaac tattctaatt
2821 ctggtttatc ttgaacgtaa agggaataca ataaatcgac ccttaataaa tgactgctta
2881 gatatgctaa actctttgcc ttcagaaaac aaaaaggaaa ctctttatga tgttttattt
2941 gctccaaagt ttttatctta cacaagaaat ttctatgaaa tagaatcatc aacagtcatc
3001 ggcgtatttg gagttgtgga gtatctgaaa aaagcagaga aaagatttga ggaagaaaag
3061 gagaggtcca agaattacct gttcacaaaa atagctagcc cccttctatc cgttgttgaa
3121 gatgagcttt tgagtaaaca tttggacgat ttgcttgaaa atcaatcaac tggctttttc
3181 tcgatgatcg actcttcaaa cttcgagggt cttcaattgg tttacgaaag cttttctaga
3241 gtggaacttg gcgtgaagtc tttaaaaaaa tacttggcga aatatgtagc gcatcatgga
3301 aaattaatca atgaaactac tagccaagcg cttgaaggaa agatggctgt tggtcgtcta
3361 tcatctaatg ctacaatggc aacattatgg gttcaaaaag ttttggcctt atgggataga
3421 ctaaatacaa ttataagtac tactatggat gctgatcggt ctatcttgaa ctctttatca
3481 gatgcttttg ttacgttcgt agacggatat acccgagccc ccgagtacat ttcactattc
3541 attgatgaca atctgaaaaa agatgctaga aaagcaatag aaggctctat tgaagcaaca
3601 cttcaaaatt ctgttacgtt atttcggttt atttcagaaa aagatgtttt tgaaaaatat
3661 tacaaaacac accttgcaaa aaggtatgct ttaattgttt ttactgtctt taacactttt
3721 aggttgttga ataatcgatc aatatcttcc gatgcagagc tcggaatgat cagtaggctg
3781 aaacaagaag ctggtaatgt ttttactcaa aagctcgaag gaatgtttaa tggtaagtat
3841 tcagacttta gcaagttttt ttgcgttccc ttattaatgc ttagatatga atctttcaca
3901 agaattgtta caggagtaca aacataattc tgcgttacag tcggccaagg ttagtggaaa
3961 tttttattga taatccttta tactaacagt tagcctgctt tggacttaaa cgtttcaatt
4021 ttggcttcta cattttggcc catcgattta tcgcctcaca aaattaaatg taactttccg
4081 aaagttctcc tggctcagat agatcaattt acggattttt atttatcgaa acatacgggg
4141 agaaaattat tgtggtatcc ttctatggta tagtaaactt ttatttgctt gttgcgtcac
4201 actaacaatc ctatttaggg aagtgctgat gtccgtgtta atttcaaaga cagaaaatat
4261 gacttgaatg tatcgactat tgcatctgtt attctattgc ttttccaaga cctaaaagaa
4321 aatcagtgtt taatttttga ggagattttg gaaaaaacca atatagaagt tggggatttg
4381 aagcgcaatc ttcagtccct tgcttgtgca aaatacaaga ttttgctaaa ggatccaaaa
4441 ggaagggagg tgaatgcagg tgacaagttt tatttcaatg agaactttgt ttcaaatctc
4501 gcccggatta aaatatcgac agttgctcag actcgtgttg aggatgatag tgagagaaaa
4561 cgtacattag agaaggtaga cgagtcaagg aagcatcaag cagatgcctg tattgttaga
4621 gtgatgaagt atgtcttttt cctttttttt ttttttcaaa catttttact aacagtaaat
4681 taagggatcg caaagtgtgc gaacacaatc agctgatggc agaggttaca agacagttaa
4741 accctcgttt ccaccctagt ccgatgatga ttaagcgacg aattgaagct ttaatagagc
4801 gggagtacct acaaagacaa gccgataatg gtcgcattta tgaatattta gcatagcagc
4861 cttaactgtt acacgaaatg caatattcgt atacgagaaa tcatgtttaa atggaaagat
4921 accaatttgt tcaagatgat atctagcaaa agctttgtga ttataatgac actcatgaat
4981 tttacaccaa cttccttttt atttttgaaa aaaagaaaat tttaagtaat gctataagaa
5041 ggtttgaaaa caggtagatt atattccaaa tcaatcaaaa tttgatacat taattctgcg
5101 agcagatagt agagtaatga gtttaaatta cattgttcca tatacattgt acaaaaagac
5161 caaaacagaa aaaccctcag tttaaaatag attaaggtat ttgatcaacg atagatggga
5221 agtgttaacc taaaaaatta ttatgggagg aagcgaattc gccgtaaata cactatccta
5281 cacagtatgc ttttgatgat gactctacta ataacatcga aaataccaga gcaagtaaag
5341 ctccttataa tctttaaggt gttaatcaaa taataaagct atgtatttga tttagtattg
5401 aaacacctgg ttgtttacat taagaatatt cttctttcct aacggactac gctatcctct
5461 ttagcaatct cggaagaacc cctaaaaata tctaccgttc aatgctattc tgtatattct
5521 ccgcttaatt ctatttgttt aggaaattgt ataaataacc tcaaatatct taacgaaccc
5581 cacttgttcg ccaatttcat ttaattggtt ccgcgtacca cacttccttt taaccaccaa
5641 acacttttaa gtatccgcaa ggttgccaaa ggttgattac aatctcttac tttaagctaa
5701 gatacaaagc tctttttgat atcctactcg tcttgcagag aacctacata ttctcttttt
5761 gtgactaaac gtgtcttgaa ctctttaaaa taagtggcct ttgcttttga aattttaaag
5821 atctttttct tttcctacgt aactccaagt gtcaaggaat tgccatcttt tcacttcttc
5881 ctttcttttg tgtatcacta ctgaagttat caaaatgtcc ttgcacgacg cttaccattg
5941 gccttctcgt acacctagtc gtaagggttc aaatatcaaa ttgaacaaaa ctttacaaga
6001 tcatttggat gaactggaag aacaattcac cattcccact gaacttttac atcgcgttac
6061 cgatcgcttt gtttctgagc tttacaaggg cttaaccacg aacccgggtg atgttccaat
6121 ggtccccaca tggatcattg gtactcctga tggcaatgag catggctctt atttggcatt
6181 agatttaggt ggtactaact tgcgtgtttg tgcagttgag gttcaaggca acggtaaatt
6241 cgacattact caaagcaaat accgtctacc tcaagaactc aaagttggca ctcgtgaggc
6301 cctctttgat tacattgccg actgtatcaa gaaatttgtg gaagaggttc acccaggtaa
6361 aagccaaaat ttggaaattg gtttcacctt ttcttacccc tgtgttcaac gctccattaa
6421 cgatgcttca ttagttgcct ggactaaggg ctttgatatt gatggcgttg agggtgaaag
6481 tgtaggtcct cttttatcag cagccttgaa gcgtgttggg tgtaacaacg ttagactcaa
6541 tgccattttg agtgatacta ctggtacatt ggttgcttcc aactatgcca gcccaggtac
6601 tgagattggt gtcatctttg gaactggatg taatgcttgt tacattgaaa agttctcaga
6661 aattcctaag cttcataagt atgacttccc tgaagatatg aacatgatca tcaactgtga
6721 atggtgcgat tttgacaacc agcatgttgt ccttcctcgt accaaatacg atgttgctat
6781 tgatgaagag tctcccagac ccggtcttca aacgtacgag aaaatgattg ctggatgcta
6841 tttgggtgat atcttgcgtc gtattcttct tgacctttat gaacagggag ctctctttaa
6901 cggtcaggac gttaccaaga ttcgtgaccc cttggccatg gatacctctg tgctcagcgc
6961 tattgaagtt gacccctttg agaaccttga tgaaactcaa accctatttg aggaaaccta
7021 tggtctcaag accaccgaag aagagcgtca attcattcgt cgtgcatgcg aattgattgg
7081 aactcgttct gcccgtcttt ctgcgtgtgg tgtatgcgcc cttgttcgta aaatgaataa
7141 gccatctatg attgtaggta ctgatggtag tgtctacaac ttataccctc gttttaagga
7201 tcgtcttgct caagcattta aggatatcct tggtgaggaa attggcagca aagttgttac
7261 catccccgcc gaagacggta gtggcgtagg tgctgcattg gtcagtgctc ttgaagccaa
7321 aggcaaggcc ctcacttctg atattcttgc cgagcatctt aagaattaag tccactcatt
7381 gtttttaggt tttacggata ctcatttgat tttgtgtcac tgaactccac gaagtgttcg
7441 acaaactgtt ttatactgca ctttttattt gtttcatact ccatcttttt gcgtacaatt
7501 tgttccagca atttttatgg ttacactttt ctttgtctac taatcacgta tcagggcgtt
7561 tttacaaaaa ggtgctccac ctgataaaat attttctttt ttgctctagt gtttctgtgg
7621 atacgatatc tgcctctgat tgctagaata ctttaaataa aggttagagt tttgttatat
7681 tttgtatgtt gaagtgtaat aatagcggta tctttgttga agatatccga gtttaacaag
7741 acaactggct atactatctt gcgaaatatg cctactacga caataatggt atcaacccat
7801 agctacataa catcgggaga ctttcaatga gttacaccct cgttttccaa ttcggaatga
7861 cagtccgagt cccgaagtag agagattcat cgtgttacga ttattcgcta acacctgata
7921 acatatttta tctcgctgct cgtgttacat atcaggagtg cgggtatgag attatgggga
7981 tttaatggct gtcaagcagg ttacctgacg tttgttctta tgtatttgtt tacacataca
8041 caacatttgg cgttgctagg gcaacaaatc agacaaactg cggtggatcg aatagctgag
8101 atgtatggac atgtaaacaa acatacaaga tgcaaaaatt tatgtgtttt catctagttc
8161 aatgttgtta atgaaatgat tatttgcttt tatataatta atagattaag aggtgacata
8221 ttgtagcttg aaacggcatt atctacgcaa ctaaactata caacaatagt tgtcatggct
8281 gaggtgttgg ctaagtatta cagcttacag tttaactttg ctttaatcat gtatttgttt
8341 ttattattgt tttctgttac tcgaatttgt tggttggtgt tttattgttg ttgttacctt
8401 tgcttttttt tcctctcgga ggcagagctg agatttctct ctaccctcaa ctatctacag
8461 cgcttcaagc tcttctcctt tagatcaagt gtattacatt gaaaatatta ttcttaagaa
8521 taccaaagtt gtaagtttaa aaagaacttg atgcctgtta gtttcgggca tccttctctc
8581 ctcaagttgg attcctgttg atgcttctca agtactaact gctggttcag tgattttttg
8641 aaacgtctat aatacaattg cttacgcttt ttaaggcaat taataggatt cttgaagtta
8701 tttgttcaag aagcctttat atggtaaagt ggggggttta ttacttgtaa cgaactttga
8761 gtctgaactt ttgggtattc tgttgttttc atattaatca actgctttct tttaactttc
8821 ttttttcttt cattttttct ttcaacgaca gtatctttta tattctatat tatagtcgtg
8881 accgttggtg ctcatctatt tttgtctcat taatctcttt ttttttatat acttttattc
8941 ggtcttcccg tcgtgtggag tctaaactta cttaaacttc tttccggttt cttttgtaaa
9001 ggttggtttt gtttaatatt tggctatatt gctttcatcc tgtctttctt ttagttttct
9061 ttttgtctca ctttggcatt aatatcttaa accggtcttt tgactcttcc ctaggatttt
9121 cctttataat ttatcttact tgctctattg atttctcatt ctgaaactca atcttttgca
9181 gtcgtgtcgt cccattagtt ctttttgcag tgtacttggt ttaaattaaa tttaccattt
9241 tgtctgcttt taataatagt taaacctcaa ctaaaatgga ttctccgctt tcttcacttt
9301 cctttaccaa cactctatct ggcaaacgga atgtattgcg tcctgctgcc cgtgaattaa
9361 agttgatgtc ggatcgcaat gcaaatcaag agctggattt tttcttcccg aaatccaaac
9421 acattgcctc aacattagtg gatccttttg gaaaaacttg ttcgacagct tcacctgcat
9481 cttccttagc tgctgatatg tcaatgaaca tgcatatcga tgaaagccct gccttaccga
9541 ctcctcgtcg tacgctcttt cgatctcttt cttgtactgt agaaacccct ctcgctaaca
9601 agactattgt ttcacctctc ccggagtcac cctcgaatga cgctttaacc gagtcctact
9661 tcttccggca acctgcatcg aagtattcca ttacccaaga ttccccacgt gtttccagca
9721 ctattgctta cagctttaag cctaaagcat caatcgcatt aaacaccacc aaaagcgaag
9781 ctactcgttc gtcattatcg agttcttctt ttgattccta ccttcgtcct aacgtctcac
9841 gttctcgatc atcaggcaac gcacccccat ttttgcgatc cagatcgagt tcctcatatt
9901 ccattaacaa aaagaaggga acttctggtg gacaagctac ccgccatttg acttatgcct
9961 tatcccgtac ctgtagtcag tcgagcaaca cgaccagctt acttgaaagt tgtcttactg
10021 atgatacgga tgatttcgaa ttaatgtctg atcatgagga tacatttacg atgggcaagg
10081 ttgctgattt accagaaagc tctgttgaac ttgttgaaga cgctgcctct attcaacgtc
10141 ccaacagtga ttttggtgct tgcaatgata attctttaga tgatcttttt caagcttcac
10201 ccattaaacc tattgatatg ttacccaaga taaataaaga tattgccttt cctagcttga
10261 aagttaggtc cccttctccg atggcattcg ctatgcaaga agatgcggaa tatgatgagc
10321 aagatacacc agtgcttcgt cgtacccaaa gcatgtttct caattccaca agactagggc
10381 ttttcaaaag ccaagatctt gtgtgcgtta cgccaaaaca atcgaccaaa gaaagtgagc
10441 gcttcatctc ttctcatgtc gaggatttat ctcttccttg cttcgccgtg aaggaggact
10501 cattgaaacg aattacccaa gaaacattgc tcggtctatt agatggtaaa tttaaggata
10561 tttttgacaa atgcatcatt attgattgcc gttttgaata tgaatacctt gggggccata
10621 ttagcactgc tgtgaattta aatacgaaac aagctattgt ggatgccttt ttaagtaaac
10681 ctcttactca tcgggttgca cttgtttttc attgtgaaca tagtgcgcac agagcacctc
10741 atttggcatt gcactttcga aataccgaca gacgaatgaa tagtcatcgt tatcctttcc
10801 tttactatcc cgaggtttat atacttcatg gtggttacaa gtcgttttac gaaaaccaca
10861 aaaaccgttg tgacccaatt aactacgttc cgatgaacga tgcttcgcat gttatgacct
10921 gcaccaaggc tatgaataat ttcaaacgaa acgctacttt tatgcgtact aaaagttata
10981 cttttggcca aagtgtgtta gcttccccag acgttaatga ttctcctact gccatgcatt
11041 ccctctctac acttagaaga ttttaatgat tttaggctga ctcaatactt tacctagcct
11101 gaagtatact cttatatatt caactttcta aacctaagtt ttttctataa actttgaatc
11161 atgaaacctt ttatttctat tttttattat tattattgaa agaaagcaat atacctttta
11221 aaaggtctcg acaacaaaaa atttaaactg gctaaacttt tcgaagtaca tcacttttac
11281 gactaaatat ctagaacgac tagtttggtt attattatta cgcatggatt ttggtttaat
11341 aattgattat tttataatca attgagctta ataatttcta agcaatcacg ggtaaacata
11401 cgtaccgaag tccattctaa ttgctcttct cgcaattgct gttgatagga atgctaagaa
11461 tttagcattc tttgattgaa gcagtaatgc gataggaact gagttgctta cgtttttgta
11521 actctaccta accatctctg atggaaatat ttaagtagaa aacgaataaa acttttattt
11581 ttaaatcact tccttttatg catgttctgt aagcggtagt attgtagctt accctttcca
11641 atggaaggtt aagttcacca gagagttacg aagttttatg tgatataaaa tagttacaat
11701 gcggaagttg gacattcaca cttaagtacc ttcccaaaaa aaaaattacg aaaattttta
11761 atcaatacta cttttttggt aaaaatttaa acatgttccg ttgagaaagt gaatcatatg
11821 tgtgcttttt taaaatttga agatatttgt agattttgtt ttgtttactt tttatttgca
11881 acaatcagaa tgtcgtcatc taagtcaacc ttaaattcgc ttatatgaat tttattgatt
11941 atgacatcaa cgccctatga agtgcagctt tgaacgccaa cgattgtatt taatacattg
12001 taggataaac ctcgtaatca taaataacga attgtttaag tatgaaagtt tgcagagcac
12061 ttaatgttca aacaaacaaa tctttttagg tacactaatt taaagatttt ggtatcttta
12121 tgaataacga ccatgcttcc aagaaatcat tttgtataaa ggtacttaca ttttaaaaat
12181 agtaaagatt agttgttgac atcttttagg ctccttcaaa ttgggaaaaa tcatatcttg
12241 aagtctggcc cttagttact gtaccaaggc agtgtatttg cctgcgatgg tgcatttcaa
12301 aggaatatca cgagtttact tgctcatcac tacaattcat agttataagg ccagctggca
12361 ctagtgtttt actcggacgt gtaaaaagtt cgaaagctaa tcaacttgtt ggaattgaac
12421 atgtggaagg gagccgttat gccttgattt ttctttcaga aaaattggat tttaagtctt
12481 tgaaggtgat agcaaatcat caattaacga aatcctctaa aagcttgtcc aacgtctcaa
12541 acaaaccttt aggagatcaa ttgttccgtt cgaactcttt aatgtcacct agtttgttaa
12601 aaaaggaact tcacagaatt caatcagatg cttctcaagc aaacgaacgc gaaagccaag
12661 ctccgcattc attcgttacg catgatctta tttcgtcttc aaaagatggc aattccctta
12721 cgcatgaatt tgcaaatgat tccgttactg aaatggttca agattatact cctagctgtt
12781 cacgtgacgt taaatctttg ttggaccatc tttacaattc ctacttttat caactattga
12841 tgaccaaaac acctgttgtt ttttatgtaa agcaaatggt tggaaaaact cgacaactag
12901 cagttgaagt acacaatcat gtagaagaaa aagctctagt tgatgaactg ttaaaatttt
12961 tagataatct taaatctgtc gatgaccgta aatcccgatt acttcaatgt tttgaatcac
13021 atttaaacta taaggcttgg catttggaat tcgaaaatga agcccatcag tatgagatta
13081 aaggctatcg cctctggctt caaaacatat taaatcgaga aaattgtcaa attacaaaac
13141 ttgactttga gagggagttt tcccaactga aattgaagga gtatgaaatt cgtgtacttt
13201 tatactttga gattctatac ctttttttaa aatgggatcc tgagtacgca aggcgtcggg
13261 ctaatgataa cagtttgttg gattctcgcg atagtggtaa acgaaaatct cggaaaaaaa
13321 acgcaaaaac gcttaatcct tttgaaactg ctcaattaaa attggaattt acatttgatg
13381 ggctttgcat acgaaggact atagaacaaa atgctaccga acgatctgag gatctcttat
13441 tgaagttttg taaggagaca atagtcccat attactcctc taaattccct cgaattacgc
13501 gcaacctttt agagaaatgt aacggtttag atttattacc agaacgctct cataagcata
13561 gacatagcgc tcctcctcgt tcaaaattga ttagtagcaa atcagaagcc ggtcgcgctt
13621 tacctggaaa tacttctggt gccagtatat ccaatacttc tagtcctcat tcggaggcct
13681 ctatctctaa ggattatgaa attcttaaac gtcgaagaag caattctggg gttcattctt
13741 tgactcggtc agattcttca ttcaatggtt ttgagaggga cactagaaga aggtcttccg
13801 atatagctcg catcaagaac cgtgaaataa accttccttc ttcgtctttg tcaaaacaaa
13861 gaaactcaat gcatgatatt tctacaaatt ttccacgaag aaacttatcg tttactgaga
13921 aacttaccat ggcaagctta caaggacaaa gtgaggaaag tgtacagccg aaaacaactt
13981 cttctctttc tcggtcaaaa acacttagca ttctcgaggg atctgtatct aagcgctcag
14041 aaccttcgat ggattcaatt ctagtgcaag ctactccgcg aaaaagttct tctgtaataa
14101 ctgaactacc cgacactcct ataaaaatga attcattaga taaagcttct gcttgcaccg
14161 ttgagaatca catcgttact gaatcccccg ctcacaaatc aaacaaagct caattgttcg
14221 tgtgtgtccc tacaacacct gttaaaaaaa aatcagccag tccttgatga attacttaat
14281 tttatgaaaa tctttctcag ctcgtgtagt atcttgaatg aagtcatata ggagggtatg
14341 agatagtctc tcaggttaaa gattacgata tttatttaca caatgaatca gctattaaat
14401 ttttttgaat tattttctca ataagtataa gtaaaaaaga aaaatgaaaa acatcaatga
14461 aataatgata ctgcggttca acaacagtaa caaacgttac caacgttttg gaagttggca
14521 tattttacaa caaattaaat caaatcgaga aaagcataat tcttacatat aaacaaatta
14581 acttgttgaa gtatagtcct acaacactgc gataaaaata aaaaccctac cctatattgg
14641 aagacagtca caagccacaa cacatctcta atgaacacgg tgtgcttttt ctcaccacga
14701 aaatgccttc cgcacctgta agtaatcaag tgtatataag agtattacta acattaaatt
14761 agcgcaagca atccaagact tacaaggtcc ctcgccgtcc tttcgagtcg gctcgtcttg
14821 atgccgaact caagcttgct ggtgaatacg gtttgagaaa caaacatgaa atttggcgtg
14881 tggctcttac cctttccaag attcgtcgtg ccgcccgtga gcttttgaca cttgatgaga
14941 aggatcccaa gcgtcttttt gaaggtaatg ccatcattcg tcgtcttgtt cgtttgggta
15001 ttttggatga gactcgtatg aagctcgatt atgtcttggc acttcgtatc gaagactttt
15061 tggagcgtcg tttacagacc caagttttca agcttggttt ggctaagtct atccatcatg
15121 cacgtgtgtt gatcttccaa cgtcacattc gtgttggtaa gcaaattgtc aatgttcctt
15181 cttttgttgt ccgtttggat actcaaaagc acattgactt tgccctctcc tctccttatg
15241 gtggtggtcg tcctggtcgc tgcaagagaa aacgtctccg ctctcaagag ggcggtgaag
15301 gagaagaagc cgaagaagag taaattaagc aggtgtgaca ttctcttcta tacatttcta
15361 aatgggattt cttcatagcc gcttctatct gcctaataaa gaagtttgtt tgattgcata
15421 gtggttaatt ttgttttgtg aaattttaat tattgtactt aatgtgtaga ttataattat
15481 gtaagattgt agtgtcagtt tatttcatat gagtaatatg atgaatttgt tgtttgtttt
15541 gtaaaaaagc tgcttcaggt aataaacgga ttttaaatag aaggtcgttg attaagtttt
15601 tcaggcttta gatggtagtt gagcttctac ggatattaaa tgttaattac ttctatcgct
15661 cgaatggata attattcaaa catcggagcc attgcaatat gctgaccttt attatttgaa
15721 atccattatt gtattaatta tgatattccc gtacgttgca cattcagcgc tattgtaaaa
15781 actgagtagg taatttttac tacaccaaaa tgtcgatatt attcgtcagc tacgaacttt
15841 gatgctctaa gtacttatat ttatccttat tataataacc ttaccatttt ttcctactac
15901 cgttgttttt aatagttccg ctgctgaaat caccttattg aacatgtgat atatggcaat
15961 gttgcattcg tctcttctta tccactacca caaaccttct tgaactgcta taaaggaata
16021 caccagccag tttctttggt cgagatagaa agattcttat ttatttcgtt cgtttaaaca
16081 aactaaacat agaacaacat tatttgcgtc aacacaacat gacttcacat caaactggga
16141 aaattgaggt gcactggaac gatccttctt ctggtatctt tttaagtcaa actcaacgaa
16201 cgtcttccac atcaagttta aagaaatcgg cttcgagtag acgtttggtt tatggtgatg
16261 acatgctggc acccaaaccg ttagctggtc aagtacttag taaatcacca cttcctcctt
16321 tttcaccttc aaaaattatg aacaggagta tatctgttcc tcccacgaat atctctgttc
16381 ctcaaattag cagtaatcct ctcaacttga tgaagaaaag cagtgacaat gatatcttta
16441 ccacgtttaa cgatacaaca aacgattgta tgaacgaagc ttcttgtgaa gatgtcaggc
16501 actccctatt gcaaattata gaaagtaaat caaatcttag tgattctgtt cataagatgt
16561 tgggtgaacg agtcaaaacg aatttactat tacaaggtca actagaaaac atggaaggct
16621 tttggctaca aaaattatcg caaacatgtc gcttggcgtt gacaggaaac attagtgagg
16681 cgaaggcatt tattgtagaa attatgtgtg ccggtgttgt aactaattgt gtgcgctggt
16741 gtccagtttt gaagactcta attgaaaacc ttgctatcta gctttacatg ttgttatata
16801 ttttctaaat cttcctgctg caagttatat gacactaatg ctcttttttt cttaattcta
16861 tcatttttaa tgatgttact catttgcatt ttaacaaacc atttataagc atgattagct
16921 atgtcggacg ctaacgaagt tacaagttaa aggatttaat aacacttcct ttttgaaatt
16981 agaagaaatc ttaatagata taatggaagt ttccattttt tttacaatat atatttttct
17041 tatggcctgt cgattaatag tatttttagt ttaatttact gaaacattat atttgttttt
17101 agtgaacacc gtagaatatt taggaatttt gactaggtct gtttatcata cacacttagt
17161 attagatatc gtgatttttt gaatatatga aaaaaagtac tacagacaga aaatgcagcg
17221 tattagcctc aagtaattat taaaaaaaat aaaaattttc ttttaaaatt atacagtgtt
17281 taccaaagtt atgtaagaca atcaactgcc gcttctctct catttctgtt tatatacatc
17341 tttaatgttt attacttatt aaaaaaaaaa aaaaaaaaaa aaaaaaaaca aagcaaatta
17401 cgtgcattac caagaaaaca aaaccgctct aaatatttaa gctaattaaa acaatgaaat
17461 aacaaattta ttaacttgta accgtaacag tggaatttat tttaactgct gtcatttcta
17521 gaatttcata aagggagaca tgaataaatt taaccttata ttacaaattg taaatacgta
17581 aatgtttatc acaaaatcga atactcaagt catattacta tgtacctgtg caagcaacca
17641 tttgtaaacc aagtacaaag gtaaactgcc tctgtccgcg gttggcgctc aaaaagctga
17701 gccaagccta tttttggcaa actatttttc gtgggtgatt acgttatgca ttggtaatga
17761 agataaggaa gtggaaggta ttgaaagtgg attaagaaga agttggcgcc aaatctccaa
17821 aacctgtgct ttaccaacaa cgttacctaa taaaattcat gtaaacaaaa tttccaaaat
17881 tgtcattcat tttattgttc tccaaatgtt ttcaatatgg agactttaaa agcagatctt
17941 tcggatatgt ccgattccgt atgtccttat acttgatttg aatgtgtgta tatgtgttaa
18001 caatgttgat cagcctgagt actttgaaac cttggcaaac agggatttac caagattgcc
18061 tggaacaagc aagcttcatc gagctgcatg tattaaacga aaatctatca atttaagcct
18121 tccttctaac agttattcgc tttcttaccg acaatcagac gacactgatg gcgatgtttc
18181 agaatcacag agtgaataca ggttgagtag tggaaggagg tctcgagcaa gctttgctcg
18241 tgctttacaa gatccccaaa ctccaaacac tcctcccgta tcgtctcagc accgaagatt
18301 cttttcggag ggctccttca atttgccaaa ctccaacatg agccactcat taaatgggga
18361 ttctacagcc tctaatagtt ctactttaac ccctaatcgt atttatgggg accgcaatcg
18421 tcaagattat gctcagtcta gccgatatac tttgccttcg ttgcctagtt ccccatcata
18481 taactgtcct acaacacttc gaaaaataca tacaaacacg agtagcaatg gaacttcacg
18541 aagagttagc ggcctagggt cttttatggc acagaattcc tctgaaacta gcagtaacag
18601 gacttcggcg tatctacctg gtagctctac ggatgaacaa gaaaagcgtt ccagtgtgac
18661 tttggcaagt atgccttcct ctcattcgtc tacagcgtcc ctcctttcac ctcttgatac
18721 tacttctttt tctactaagt tggactctac tattttggcc ttggaaactg atgagtctct
18781 cagccgaaca atttcgtatg ccactacttc tttaccgagc acaccaggta agcgtttatc
18841 aaaggaatct cttgctatgt ctgaagccag ttctatcaat ccagaatata agcgtattaa
18901 aaagcgtgca aatttgatta aagagctggt tactactgaa gctgcctatt taaacgatct
18961 cattgcgatt caacaaagtt atggcttgcg ggtcaaagaa tgcagtgcat tgaatccagt
19021 tgacgcccaa acagtttttg gcgatataga gtctttatta acttttactg tggaattcca
19081 tagtcgactt tatcaagctg gtgaaggctc ctggcgtgta aacttggata cccagcttat
19141 agatcctctt ccttgcaatt taggtcttat ttttttagaa agcctttctg aaatcggtca
19201 aatttatact ggttattgca accgtcaaga ttcggttttc aaaattatca ctaaatggag
19261 agagaaacct gcaactgcat cctggattat ggagggtgat aaaatagttc aaaaatacac
19321 gaacgcctgg gatttgggta gtttaataat taaaccttta cagcgcttac tcaagtaccc
19381 cctattattg caaaaaatca ttgatgttac tcctgaaagt agttctgagc gcccagattt
19441 ggtcctttct tatcaactat tacaggagtt aatatctggt ataaaccaga agcagaagcc
19501 ttctcataag cgtgggtcgt taagtgcttc tcataaacga gacgctgctt ggagtttact
19561 ctataaagca acatcgaata aatctaggcc caccactaca tctactgagt taaaaactga
19621 tgctcgattg aattttcaac gccaagtgct tcaagatttt cgacaacgat tcgctatatt
19681 aaaagcctta catgctacat tggaaacgtg gtacgttact gtacatcgtg gattttcggt
19741 ttttgaaaag gttttagcag agcttgaagg gcttagtgct ttagagcccg aagacaagcc
19801 ggtagatacg tggcgtaagt atcatttact tgcgcacatg atgactgcta accttccttc
19861 tcaaatacaa acgtctttaa attcttctat ccttaaccca atcacaaata tcctacgagt
19921 gattcaaaaa gttattcagt tcattatcag cgctgaattt gtgattcctg ctcaaaagct
19981 tgaggcaatt tctactcttg ttgagaagga atttcacagt gttgtttatc attttattgg
20041 tattcaacgc tctctctatg aaaactacgc tcaaggtttc ttgtttctta tccctcaaga
20101 catgagggat agcatcttag aagaaacgca agattatgca gagttagttc gagcctttga
20161 acccatgcat tatgatgacg aggtattgct ggaggaatta atgaaaagtg tctctttagc
20221 tgcgagggtt taagagcaac ttctttagtc gaatattttt tatctgttct atatatatac
20281 acatttagat atactgcatt catgttactt gaatgatgaa aacgtttctc tggaccatgg
20341 ttttttgcag cttccatgta tttaacatca caattacatt tatttaacct aatatctgcg
20401 tttgtctgat tacaattcgt ttaggttctt ttaacggtct ttttttgtct atcacagagg
20461 cctgttaagt ttagtttcaa attaggattc taaacgcttt tacgccttcc ttttatgcat
20521 ctcattattt ttgtctgctc aagtcgagct attaattatt gtcttttcac tttttttttt
20581 taaactagga acgttttggc ttttctgctg ttatgccatt ttatttgttc atacgcttcc
20641 acttttcatt tatatgcttt aataatgtat ataatatttg actttcactt taactatgtt
20701 ttatgtttaa aatatgtaat taggttttga ttgaattaat tttgttcgtc gttatttatc
20761 ctagttgact aacaaaaaca ggtcaaattc ttacattcgt agtaaaacca aaacgtaaat
20821 aaaaatactt acgatgaaac agtctggaac catccgataa agtgcaatga taagtacgat
20881 aaataacctt ttaaaagtca cactgttaat tacggcaaat acaaatttat tatacaagaa
20941 accaaaaaat ttatttttat aaaactgtct caataacctt cattataggt taatgagaac
21001 gacgagaacg gacggcagca gccaactgtt caaaaacggc ggtagtgtcg tcccaaccaa
21061 tgcaagcatc ggtgacactg actccatatt tcatacttga caaatcatcc tcaggtatag
21121 cttgcttacc ttcgttaagg tgagattcga tcatgacacc agtaattgct ttctgaccgt
21181 ttgccacctg ctcagcaata caagcagcaa ccttgggttg attcttatgg tttttagatg
21241 agtttccatg agaacagtcg atcatgatac taggtagctt gttgcattct tcgagtttag
21301 ccttggcacc agccactgaa tcagcatcga agttagtccc acttttacca ccacgcaaaa
21361 taatgtgtgt atcagggtta cctgtagtag taacaatggc tacaacaccc tgctttgtga
21421 cagacaagaa atgatgaggg ttggcagaag aattcatagc atctatagca ataccaatgt
21481 taccgtcagt agcattctta aaacctattg gaaaagaaag ccctgaagca agttctcgat
21541 gcaattggga ctcagtggta cgagcaccaa tagcgcccca gcaaattaga tcagctaaat
21601 attggggact tatagtgtcc agcatctctg acgctatacc gacaccagtt tccagaagct
21661 ccagaaaaat gcgtcgagcg acacgaatac ccttattaat gttataagat ccatccaaat
21721 cgggatcatt aatcaaccct ttccatccaa cagttgtacg aggcttttca agataggcgc
21781 gcataataat atgcaagtcc ttcttgtgct taatagcctc cttttgcaag cgaatggcat
21841 actccttggc ggcaacagga tcgtgtaaag aacagggtcc aacgattaga agaagtctat
21901 catcacgtcc cgcaatgata tcggcggctt ggcgtctttg atccgacaca aaagcaagtg
21961 tttcatcaga ggctgctaat tctgattgaa ttaaagcagg tgaaattact ggatcatagc
22021 ctttgatacg actatcttca gttttacaac gtgaaaaaac actgtcgcca ggaagaagag
22081 gagtgtgctt gtccatctct taatgatttt ctaacttgta gaagacaagc cattagtata
22141 atgtagttct caagacatct gtaataaaat tttttaacga gatcaatttt cgctacttta
22201 gattacgagt tacaaactac tgcgctgtca taagtcttgg acattatatt tccaactatc
22261 aatagaaact ggtaaacaat atccttttgt aagcatgtat tattaaaatg aatgcttcaa
22321 tttaacattt cgaagtcttt tcaaagtatt aaagtaagta taaactattc ctgcttaaca
22381 aaacgactat catgaacaaa tgcaaaaaaa aaaaagcata agaacgataa cttttattat
22441 tagaagataa attttattac cctaataaac aaaataaatt gattaggcaa ttagggattg
22501 acaaaagata tcaagcgtcg agtatgtttt aacaaaaaaa caatttcaat aattaaatga
22561 caagcctctt caggtgttat ttttatgctt ttaaatgacc caaaatgcag aattattgtt
22621 ttgataaatt gggaacaatt ttatcgatac aaatttagag caaaattggt aaatacatta
22681 aatttgtctc gagtcatttt caaagttaaa tataacattt cattttcaat gacttcagcc
22741 ttgtcagaag atttcaacat agtctcccat ccttctcggt caagaatcca tacagcagaa
22801 tctaattcag ctacaactgt ggcattatag catgtcttac tgaagaatgg aagttctcca
22861 acaacgcaca aacatgtgat attttcagat aagctttctc tatccatttg gtacgaagcc
22921 ttaaccatcc cagtttccaa aataacaagt tttgaaggtg tatcaccttt tctccataat
22981 aagtctcctt ttctcaagaa cgccttttta aagaatggac agagcccaag ccaaaaatct
23041 tctcgcttag tggtaatctc tccaaacact tgcataagta atgggaaagg ttgttcgtat
23101 ttggcgtact tttcgaattt cgataactcc tttgtttcag ctttgaccga tgacatagca
23161 acttgcctta gcaaatcgtg tctgggtgat ggtgaaaagg tttcagccat tgaaatgttt
23221 tgtttgtgtt ttccgggcac agctaaagta tgtgaatacc ctgccttgtg taaaagttta
23281 gtgcgataca cgtcgtagtc atctaataac atgttttcac agtattccaa tgaagagttg
23341 acatcttcaa agacctgaca accgcaatta tcatcgccgc caaagcacat agtagtcaat
23401 gttttgaagg aagaccgagt ctcatccaac ccacagacgc atagctgtac attcatagtc
23461 gctaacatac gacgaatgcg tagaaatgct tggacgacac taaaatctgc tccattaaca
23521 agactaaaat cgataataag gaatctaagc ggattatttg agacattcag ctcctcaatc
23581 aatccagcaa tgtttttctc aacaccgttg atagtaccga agaataagaa accagagagt
23641 ttacacactt ggatttgtct tccaatttca tttaagaatc gttgttgatt cataggacga
23701 cgtaccgtag aacgaacaat accacctgaa taaattccac gtaaagcgga acggcttgaa
23761 gcctgaatta caaagaaaac acatgccata atgatgccaa taacaatacc aacaacgaaa
23821 tccacaacag tcatggtgaa aacaattgca caaatagtaa aatattcgat tttagtggtt
23881 ataccaatgg gatcccataa agattctttt agaagttcaa taccgagtaa ataaattagg
23941 cacccgacag tccaaacagg gatataggca ataattccag gacctataac taaaagggct
24001 acagttgcca aagctagcat aatacctgcc aagcgattgt tacctccaga tcgaataaac
24061 attaaagaat tcgtatatgt catgtaattt tgaatgctgc ctacagcgcc agataatgta
24121 ttgctgactc catgagcaat tagttctttg tcagtgtcta caaagtcgag cccaagtgaa
24181 atagctagcg ctggaacgtt tataggaaca tgcaaaattc caaaaaatgt taaagcacac
24241 atttcaggaa cagtggctaa caatgcgctc caatcagtat cacgtaaact aaacaagcta
24301 taaaaatgat accaaggtac gttagtctca gttgaagaga aaacccatcc ggtctcacgt
24361 aaatactcta gactcatgcc tggtattgcc caaactaaaa cgtaaaagat agccggggca
24421 ataacaaaaa acgaaggaat gaggaatgga tgcggccacc tctgttgagc aaattccaaa
24481 gcgcttgaga gaaaaagagg aattgaccat tttgcgaacg tcataggctg aaaaagagca
24541 gatagggaag cccaattaaa tgaaacactt ccttctaacc tagaagacac ttcaacggcc
24601 gtaagaacaa gaaagcttcc aacaccacct atacaaccaa gcaagatatg tctaggaaaa
24661 aactcaataa gcctaccaag tcggaggatg ccaagaataa aaaacaccaa tccagtaagg
24721 atactggata aacaatacgc caaaattgta gttgcaataa cggactttgg gttgtcttca
24781 ccaacacgat tcaggatggt gaaagcgatc tgatggaaaa atggaattac ctcaatcatc
24841 tctgagccca cacttccttt aaaaaaggag ccaccaagag aatatactaa ttgtgataca
24901 acacaactta cgtagtaaat agcaaggcca tcggccccca aatttttgaa aagagggtcg
24961 gagattggga ataaaataag gccataagag agagcatcca aaatattaag gagtaatcca
25021 atgatgacag caggtatatt aaggataaat aagatgggac ctttatcatc gaagaagaat
25081 tgatagatct tgcttagcga aatgcgacct ctcagttttt tagagggagc gttcagtgac
25141 tggaaaggca gtacctgatt ctgagcaggt ggcagttgag aagaatgttc ttcacgttcc
25201 ggaagcaaag gtcttgtctc attgatttca gaggggcgat aattaaagtc attgcgagaa
25261 gtggaggcta gagaatgtgc acgttggtac gagcttatag acatatagtc ataattggaa
25321 acatcatcgt tcgtttctgg agaatttgaa ggaagagttt cgggatccac ttcgacagaa
25381 gccaaccgag ctgacagaag atgaatgtca ttagtatctc taaagctttg acttaggaac
25441 gaagagtctg ggaccgaccc gttgtcaata ctaggataaa gcaaacttct tttctgatta
25501 aggctatggg tgggcatgga ttattaattt ttttaaagtc ttcaaaagta ataaatatga
25561 gtttttttaa gggacaccca acgaataaga tgagtgagaa aatatggggt aggcgatttt
25621 ttttaaaaaa aataaaaata aaaaacagac aagactttga aggtagaaaa gtttgtttaa
25681 caatcaatta aattaaaagc attagaggga aaacacagtg agaataaaag ggcccagtta
25741 agcgatgaaa tgaaactgta aacttttctg tatttaaata gctatggaaa tgtattaaaa
25801 ccaagtacaa aagcaagctg aaagatctta aacctcaaaa ggacaacgtt ggtaaggaat
25861 gaaggatgaa tcaagaatat acaaaaaatg tacgtcttaa agacactttt tcacttgcaa
25921 cttccaatac agcctagaat gtaagacgat aacagtactt ccaatactcg caaatacttt
25981 gctaaagata ccttataaat gtcaaacaaa tgcaattatg aggtagaaag taacggtaaa
26041 atcacaccaa aagttgagat tttacaataa atacaagagg atcaccaatc tcaagcaacc
26101 gttaaagctt tccagagata attcccaggg gtcaatgatt tgcccgaatt aattgatact
26161 ttggtaaatt ggcactatcg ttcaattaat gcaatgaaga aaagcaccct gatctcctca
26221 attcaagcag agaatggtga atgtttaaga aaggcgtgca gatggcgggt gtgttgtgac
26281 aaaaaaagct cgcttcttac atgcttcggt atctgcagta atatatatat atatatattt
26341 atttatttaa cagattcaac caaatacaag ctgttccgga ctgatatttt ttttttctcc
26401 taatactcct atcaataaag taatgtctat aattgcaggc ggatcagtga aataattgtt
26461 tttctagtga gccattctat tgatagcggt aaatccccag tgagtatcag cagcaacaaa
26521 tccctctcta cgttatcata agcaacaatt ttagtgtcat aaatgcgaaa agttgataat
26581 aaaccatgcg gtttttctcg tctccgagat tgggattcca ttcatggaaa gtcgatgtca
26641 accggctctt agtagtaaga aaagcgtcaa taatgtatgt ccacggtcaa atatccgttg
26701 agagcaaaat catgtttttc tactgtatac ctacgtaaac gaataaaata attacaaaaa
26761 tgaaaatttt aatttattta tcgatttttt ttttttttgg gaaaaagact tattgctttg
26821 tttcgaacga tttttaattt ctatttattt gaaaatgtaa taacttgttt tatttaggcg
26881 gatttaatag ctgctgagca aatactaatt ctgttgatgt tattgctaaa tgatatcgct
26941 tgttttattc cgatgaaaat ttttgaaaat tcggtcgtta aaagtcatgt ttttgcaaca
27001 tgaatcaaaa taattgttta tggaaaagta atgtcgcaaa tgcctaaaca aagggcagtg
27061 catgttttta tcactcaatg acaatgctcg tagcaataaa attaatagca taaaaaaagg
27121 caaataaaat taacttaaag catttgattt actttctatt ttgcaatata ctttgcacat
27181 tgcatgctta tatatgctaa atacctaact gtatgcatac gtttcgtgtt tgttgtctgt
27241 tgtagatttg taatttagag ttgaataact ttgatttggt ttaattgttt ctttttttaa
27301 aattaaataa gatcttttca acttatatat tttttttgag atgtcgttaa actgaagaca
27361 ttgaaacaaa gttgaaatga cataactata aataagcaag gaattatagt aaggagcgtg
27421 aagcagattc tctatctctt tagtgatatc tcttttcaat ttataagtat attgaaatgc
27481 agcgaaactg ctattctaca caaaaatatc cacagtctat tagcgattga taaacatcaa
27541 atctagcatt ctctcaactg aacagacaaa cttgtagaaa gatttttatc agtaattaca
27601 aggttcacat caaccgacgt aatacttaat ataggttgag ataaattttt agatgtcgca
27661 acttgaagag ctggaggtga agccaagtac agaggggttc ctgcagctgt tgaaaccgaa
27721 ggattttgta gattataaga tttagaataa tgatttaaaa tatcctttaa agtaacggta
27781 gatgaggctg aaatatcata gacttcagac aatacgccgc aaacaggaca atctgatctc
27841 ttttccaaat taaatgtata tgtatacgct ccatcttcac cgacatacat catgtaatta
27901 tccaaaaacg gattgctttc cgtcaaaatt tttaatgctt cgttacaaca gctcgcggca
27961 attatggcat ttgtcgacgc aaccgctggt atgatacgct ttacgattcc ttgcacgaaa
28021 aaacgattga ttgaggaaga gggtatttga aatttattgg cacgttcgat agaacgtttt
28081 acgagccagt caatatgtcg gatattatct ggttcaaaat tggaattttt tccatccagc
28141 ggttcaaaaa cttcttgctt agaaaaacta tcaactgaag cattaagaaa aactcgaggc
28201 cattcgagta aatatgccca ttcgacgcaa tgctccggca atcgaggagt attcgccaat
28261 gtacatatgg gatacgaaat cttaggcgta agcatgtcta atgaacattc atagcaactt
28321 gttatcgttg gaatgattac tcgggcttga cccttaaggc cctctgagcc gccatcgacc
28381 aacggtataa gatctcctgt cttagcaatg gccacgagag ttgaattgat ccaccgtcga
28441 gcttctactg aatcaaggcc acatataata agtttaaact ccttatagaa ttcaattgtt
28501 ttgtcttgaa ttttcccata aaaaggtgtt actaccgtag aagggattcg tttcataatc
28561 attgatgcgg caacgttggc tttcggctca tcaatgttgc tttcattaaa tagaaattgt
28621 cgattcaaat ttgtaatgtc aatagtatcc atgtcgatta cgcttaaatc tcgaaatcca
28681 gataatgcta aatcctttaa aatctcgcat cctagaccac cagcaccaat aattaagatt
28741 ttagaagaaa aggcagactt aagagtttct tcagggtttt ctggagcatc caagttgaaa
28801 ggccctggtt ttttgagact ttggatccat ccagaatgcc tgtgactacc agctttacat
28861 acgtctgagg agggcatctc ctacaaaaaa ataatcgttt tttgaagtat ttcttaaaat
28921 ttctgagtca agaatttgta ttctctatta atgtcaacga aatgcgttgg aggttggctt
28981 tgttcccaat attcgttaca tgtaaacaga aaaggaattt gtgtatgaaa atactccaat
29041 gctgtcagga aatatggcta ctcatttaat aagaggataa tgaaatggag tatgaaagtt
29101 ggaagatact ttttaacatg taataatgtg atataggtgt aagtaacagt cctaccgaca
29161 cacgcagaac agaagagtaa ctgatttcac ttcgtaatga caatactagg ttaccaaagc
29221 agaattagtt gaataaacgt cttttttcat tttatttgaa attggctttt gtattggcat
29281 aaattgacta agaataaaaa gctgatagaa aaaatggata tactaacacc ccaacgcaaa
29341 agctccatcc cacaaatgag tttgcataga ttaatacttg ctctacaaat gcgttggata
29401 atctgatatt attaataaac tatttgcgca tagtttttga aattattcag ggcgaactgg
29461 gttgattaat ttacgcaaac gtatacattc caagaacgca tttgagcaaa tgcaacgaat
29521 acgtagcaaa agagacgtaa gttatattgg aggagtctcg ttctcaactc atcttcagtt
29581 tacctatata ctggtgttgt ttacagtatt gcgtctatat tgtaaaacga caatactcat
29641 cattttctag tatggttaag taatagtcga gaacaaaagc acacgagaac agaggaatgt
29701 ccgtcacatc aagaaatcaa gtattaatga aaatgtttca tccaagcaac cgatggttaa
29761 aatagcatga ataaacaaaa atcacctttt gcttgaataa ttttgtacat ttctttttta
29821 tataattgac acttaaacgt gtagtttaaa tatttctatt tacatttctg ctgcatacaa
29881 tatagctcga aaatgttggt ttacatgtaa gtgcgaaact atgacacatt tgaatagaaa
29941 gcgaagaaaa aaaccttaaa cctagaaaac ttgcaggtag aaaatgattg ctcacatacg
30001 tatgtgatta catatactac aagcatagaa tattaaattc taaattcctt ttttttttta
30061 cttacaccat gctctctcat atcattatta cctactaagt atatgaggct gtgtaaagcc
30121 agctgtcatt ttcgttgaca tgatgtgcaa tttggttttt ctttattggt gaaatttatg
30181 aaatccgaga ttgaccttta caacacttta gcgcgttcaa taactggctg gaaatgcttt
30241 attctgcgtg cgtacaagat agggcagcga aatttttttg gtgaaaagtc cggtaaagac
30301 actcgctgtt gcttaacgat gtcgtttaac atgttccaat gtgtaaatat tacactcccg
30361 actaagtcga tggtggcttt ttgtgtgaca tgaaaccgag ttagcgaatg attgtcacat
30421 aagtgaaagt ccgtgtttta tgaaattatt atctcaattt gaattatatc actaacactt
30481 gtacttcact atgtccttaa caaccagcaa gaattgcatg agttcgttat tgtgcgaaag
30541 aaggatgatt tcgtttttct ctcttttgct agtgactaac tatgtttatc ctttcctaaa
30601 tgaatccttt taacgttatc ataattttgc ttcggttctt attttacctg ggtgtgggat
30661 ctgcctgttt tacaatagca aagtgacatg tatgcgtcgc agtttcaata tttgtttaca
30721 tctgaaattg cgagacgcaa attcacgaga ttggacagtg ttaagggaga ataggtgaaa
30781 aatgctggta atatctaatt ctggctttgc atttgccagc gcaatatcgc caatgtaaac
30841 agaacctcaa ggctttttaa ttggagatga gcatttacta ttaaaagtcc atgcaccatc
30901 acttgcataa ttttctttcg cccactacct tctaagtcat taggattttt tcttttcatt
30961 gttttaattg ctttattaaa caacaaagaa ggttttctac tgggggtttg gatcattctt
31021 ttaaaacttc cggtgttgtg tgtatctttc acaaatctac ccaactcaac atcgcaccat
31081 tctctcattt ttctttctcc atctgctgat cgtgtcaagc gcaacactct tttgacagta
31141 tgtccgacag cagcagttct tctacgtcgg cgttcgtatc atcgttagtc ttcaattttg
31201 ctatcttttg cgccttcatc ggtctttttt tatgtttgcg tcctcgcgag aaacacgttt
31261 accaacctag atgtattata gatactcaac caaaagaaga gaaaccagag ccttccccct
31321 ctagcccttt tggtttgttt gcttacgttg tgaaacgctc tgagacatat cttatccaat
31381 acgctggtgt ggatggttat ttttttattc gctatctctt cacatttggt gccctttgta
31441 tcctaggctg tttagttctt ttccccatcc tccttcctgt aaacgcaaca aacggtgtgg
31501 gtgaaaaggg atttgatatt ctttcattct caaacgtcaa aaatcataat cgattttatg
31561 cccatgtttt tctctcttgg ttgtttttcg gtttcaccat tttcataata tatcgtgagc
31621 ttcgctatta tgttattttt cgacatgcca tgcaatcttc aggtctttat aacaatcttc
31681 cttcttcctc tacgatgttg ctgactgagc ttccaaactc agttttaaac gatgaggaaa
31741 ctcttcatga gctttttcca aacgcttctg agtttacatg cgtccgtgat ctcaagaagc
31801 tggaaaaaaa ggtcaagaaa cgcagtgacc ttggaaacaa gtatgagagt actcttaaca
31861 gccttattaa taagtctgtt aaaaaacata ataagcttgt caaaaagcat aagccacttc
31921 cttcaacctt ggattatacc gcttacgtga agaagcgtcc aactcatcgc cttaaattct
31981 tgattggaaa aaaggttgat actattgact actgtagaga cacgatcgct gaattggatg
32041 aggttgtcga taaattacaa acttcactcg aggagcgcaa aaaagttggt tctgtgttta
32101 tcagattccg tagtcaaacg gacttgcaaa ctgcttatca ggccttcctt tactcaaaaa
32161 agtttagaaa ataccgtttc ggtcgtgctt tggtcggcat tgctccagaa gatatcgttt
32221 ggtccaatct tgacctttct atgtacacca gaagaggcaa aaagactatt tcaaatacta
32281 ttcttactct tatgattatt ttctgggcat ttccagttgc agtagtcggt tgtatatcca
32341 acgttaacta tcttattgaa aaggttcatt tcttgaaatt tatcgaccat atgcctccaa
32401 aattgcttgg tatcattaca ggaattctcc cctctgttgc tctctccatt ttgatgtcgc
32461 ttgttccacc gtttatcaag tttttaggaa agtttggcgg cgctcttacc gttcaagaga
32521 ttgaaaatta ttgtcaaaac tggtattacg catttcaggt cgttcaagtc tttttggtaa
32581 ctacaatgac atcggctgct acgtctgccg ttgtacaagt tattaaagaa ccagcatctt
32641 ccatgacact acttgccagt aatcttccaa aggcgtctaa cttttacatt tcatatttcc
32701 ttttgcaagg actttcaatt cctggaggag ctttattgca aatagtaaca ttacttttgt
32761 cgaaagtttt agggcgcata ttcgataata cacctcgaaa gaagtggaat cgctggaatc
32821 aactttccgc acctagctgg ggtacggttt atccggtcta ttctttgttg gtgactatca
32881 tgatttgcta ctcgatcatt gctcctatta taattggatt tgccgctgta gcatttgttt
32941 taatttattt tgcatattcc tataatttaa tttatgtctt agggcataac gctgacgcaa
33001 agggcagaaa ttatcctaga gctcttttcc aagtatttgt cggtctttac ttagcagaag
33061 tctgcttaat tggcttattt gttttggcta agaattgggg cgctaccgta ctcgaagccg
33121 tatttttggg ttttacggtg gcatgccatc tttatttcaa atacaaattt ttacctttga
33181 tggatgctgt tccaattagt gctatcgaaa gtgtttccga gcgacctgaa attaaatatc
33241 caatggacct gggtacgtct gaaatgaaga acgtgggtcg tgcttatccc gaaattctgg
33301 aaaaattgtc atcatcttct ggaagtgatg aattcttaga aacaagtagc agaacttcgg
33361 aaaataccaa agaaaaaata gataaggacg acgagggctt tgctattacg aatatctcat
33421 ctgtacataa aatgcctagt ttcgttttaa gttatttttc tgaccttgct gcttctaata
33481 ggatcctgac tggattcgat cgtgttttac aattacttcc ttcattttac gatattcctg
33541 tgcgtgtacg taatgtacaa tatgtgagtc ctgctttgaa agctacacca ccatcagttt
33601 ggattccaaa agatcctctt ggattgtcga cctatgcaat tgaggatgcg cgtggaaagg
33661 tggatatttt cgacgataac acaacattta atgagaaggg taatctccaa tatactggtc
33721 cacctcccga ctacgatgag gcgatcagga gttaaggctt agccaacttt tatttgaagt
33781 tgtttggaaa catggggtgg tagtatgttt ccaatcgttt tattatttat ttcggttttt
33841 ttttttattt taggttttaa aatttgcatg cttcctttaa tagccatact aatttattat
33901 tattggaatt taccatccat tctgatcata gtactgcaat gtattcttag tattatttgt
33961 tgttgctctc agcgaaacaa aaagtctcat cacattgaat gttgtaaatg tttcgctaaa
34021 atttgttgca tctacttcaa accagaggtt gtaaaacgac gcattcgatt atttaaaaaa
34081 aaaaaaaaac ggaaagtgaa aaaaatatta cggattttgc aatgatcaat gttaatgact
34141 gagctaaaaa atgagaaaaa aaataaaagg aacctacagc accccggatt cccatgttgt
34201 ctccaaccat agtactaacg aggccctcag acgcttaact gcagtgatcg gacgggaact
34261 ggtgttttca cctaggtatg gccgtagaca gcacgttaag taaaatatag aatttatagg
34321 aaaaagaaat tttaatttta gttagttaat attttaaatt caggcttgca aaattcctga
34381 actttggagt atatttacat gagcaatttg atccttataa gaaaatgaat atggagacat
34441 tagcaacctt ctaaagaata acgggagatg cgctttcagg tttcatttat gttaaaaaag
34501 gaaagtgcat acacgataaa aaccaaaata agcaaatatc aacgcacgtg ctatacagat
34561 gtggaacgaa acttaagtag aacaaaatgg tagatacttt agttgatggt aatcgtaatg
34621 ataagattaa tttgatgaaa gctttatttt ttttggaatg tcaatttcat tcactatcag
34681 acaaaggttg tctttacagt aggaaatttt taaaagtaca gctagtggct actaaaagtc
34741 aaaattaaag gatctgcctt tgtatgtatt acaaattttc ctattaccct gggaaaattt
34801 tagataaaca aacgatactc aaagaaaaaa aaaatccttt gtgcgtaaaa cattgaacct
34861 ttgcaatgct actgattttg tttgaaaccc gagataagat acgaaggttg ctatgagaaa
34921 taatggtttc gttcatagta caaaaaaata tagccttcaa aaaaaaataa aattgattat
34981 gtacattgtt tcctaaataa gaacaatatg gacgttgtat tcattgatta tttgattctg
35041 gctgattgtt ttgctattgt tgacataaac aagctattaa gaatagagga gcaaacttac
35101 tctgagaccc tatgtggggg gaaatttatc catatactac tactcccttg ccaagggacc
35161 aacttgctga acttgaaaga tttctcaccc ccacgtagac tttattcacc cactttagaa
35221 ccatcccttc tatatctaca gtctcctact tggtgcatct tcctcgaatc gcatccccca
35281 cttggaactg gtaaaaaaaa gtggggctgc aacttactca tgatagagta gagtgcctaa
35341 acttccttat accccatcct ctctcaatgc atttgcaaag gagtccttct tacttggaac
35401 ccctcccccg tgattcaatc taaacaaccg tgccccgcaa agcacctgtc gtgacatcac
35461 cagagtggaa taagaaatat ctagttgttt atccatgtta caactgtgct gcatttgtca
35521 tagcgatgtt tagtggcagc gatggttgtt tttcatatac ccaatttcca gcaacaacag
35581 agagacactg agagaatgtg gacgggcgtt tgtttgagaa gaaaagggtt accactttat
35641 gctctatgct tatttaggaa attgtggaca tgtataatgc ttagaataag gtgcctccaa
35701 aaaagtaacg cggggagggg cggtttccct ttctaagaga ctttgactat tgataatatc
35761 cgaataggtg atttatgcat gcaaacacag tatgtgggaa aactttttga caaaaaaaat
35821 gaaatcttgc gtgtggccac gcaaaaagat atcgagatgc tgcgaggttg atcaacaaag
35881 aaagcgaaat ctgataggta aggaggttca aggcatgatg tgtttgcaaa ggagtgaaat
35941 gaatgtatta aaaaagtacc agcgttcgcc aaccgaaatg cgtttgcatg ggagactatt
36001 cctaggtgtt aactgacggg tttgtttgca tttcaggagt catttgtgct tgttcattta
36061 cattgtaccg tgtttgtcgg ctttccatgg tataacgtga gcatttgtaa tatgtaggcg
36121 ttatagtatg taatgaaacg gttacaaaat ggcgtgtaag aaattcaaca gggtattcat
36181 ttttttttct cttattttaa ataaatataa ttatttttta tttaaaaaag aattcgtacg
36241 aatggcgcac gaaaaatcaa gtgacctttg tgtcggcaaa cattttgctt ctccacgaaa
36301 agagaaaagg aaagttttcg catactatga ggaatgtttg ttccctcgga catgagcttg
36361 agtctccgta agaagtattt ctctctttca tatttttccc atctgtctat tttattttta
36421 tgtttttcag tttcttcttc ttcatcttct aatttctttc tttctttctt tctttcctct
36481 ttccgtttct acttcttcca aacttttact tattctcatg tctttttcct gcctcgccta
36541 ttcttacatt ctctctcctc gcttcccctc gatccgtact ccatgtgcgt aaatctcttt
36601 aagcgcaatc cgtttttctt cccaagctca agcccaatcc cgagctaaaa ccaaactgaa
36661 atttctttac tcccttttac ttctttaaag caattctctt ctttcttctt catacccttc
36721 ggaacctctt cttcttgaaa ttcccttgtt tactcttcat tctttcaaat taccaagttg
36781 actagttgcc attgccgcaa tccttttact attcggagtg cctgcaaccg atc
The source for these sequences is GenBank
Return to Protein Page
Return to My Molecular Biology Home Page
This page was produced as an assignment for an undergraduate course at Davidson College.