E. coli amino acid sequence used shown in bold.
LOCUS ECAE000161 16170 bp DNA BCT 02-SEP-1997 DEFINITION Escherichia coli K-12 MG1655 section 51 of 400 of the complete genome. ACCESSION AE000161 U00096 NID g1786766 KEYWORDS . SOURCE Escherichia coli. ORGANISM Escherichia coli Eubacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 16170) AUTHORS Blattner,F.R., Plunkett III,G., Bloch,C.A., Perna,N.T., Burland,V., Riley,M., Collado-Vides,J., Glasner,J.D., Rode,C.K., Mayhew,G.F., Gregor,J., Davis,N.W., Kirkpatrick,H.A., Goeden,M.A., Rose,D.J., Mau,B. and Shao,Y. TITLE The complete genome sequence of Escherichia coli K-12 JOURNAL Science 277 (5331), 1453-1474 (1997) MEDLINE 97426617 REFERENCE 2 (bases 1 to 16170) AUTHORS Blattner,F.R. TITLE Direct Submission JOURNAL Submitted (16-JAN-1997) Guy Plunkett III, Laboratory of Genetics, University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA. Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax: 608-263-7459 REFERENCE 3 (bases 1 to 16170) AUTHORS Blattner,F.R. TITLE Direct Submission JOURNAL Submitted (02-SEP-1997) Guy Plunkett III, Laboratory of Genetics, University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA. Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax: 608-263-7459 COMMENT The E. coli K-12 sequence and its annotations have been updated. All of the ambiguous residues in our original submission have been resolved, and mis-assemblies in two repetitive regions have been realigned. The annotations have been improved and updated as well. With this release we begin designating a version number for the annotated sequence, to assist in keeping track of corrections, updates, and other changes. This is version M52 (SEPT. 02, 1997). In addition, a revised notation has been instituted which assigns each gene (protein- or RNA-encoding) a unique numeric identifier beginning with a lowercase 'b' (in the '/label' field); this will remain constant through further updates, gene identifications, etc. This sequence was determined by the E. coli Genome Project at the University of Wisconsin-Madison (Frederick R. Blattner, director). Supported by NIH grants HG00301 and HG01428 (from the Human Genome Project and NCHGR). The entire sequence was independently determined from E. coli K-12 strain MG1655. Predicted open reading frames were determined using GeneMark software, kindly supplied by Mark Borodovsky, Georgia Institute of Technology, Atlanta, GA, 30332. e-mail: mark@amber.gatech.edu Open reading frames that have been correlated with genetic loci are being annotated with CG Site Nos., unique ID nos. for the genes in the E. coli Genetic Stock Center (CGSC) database at Yale University, kindly supplied by Mary Berlyn. A public version of the database is accessible (http://cgsc.biology.yale.edu). Annotation of the genome is an ongoing task whose goal is to make the genome sequence more useful by correlating it with other data. Comments to the authors are appreciated. Updated information will be available at the E. coli Genome Project's World Wide Web site (http://www.genetics.wisc.edu). FEATURES Location/Qualifiers source 1..16170 /organism="Escherichia coli" /strain="K-12" /sub_strain="MG1655" /db_xref="taxon:562" CDS 287..502 /note="o71; 85 pct identical amino acid sequence and equal length to VLYS_BPP21 SW: P27360" /codon_start=1 /label=b0554 /db_xref="PID:g1786767" /transl_table=11 /translation="MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGS LVFGLLTYLTNLYFKIKEDKRKAARGE" CDS 502..999 /EC_number="3.2.1.17" /note="o165; 99 pct identical to LYCV_BPPA2 SW: P10439" /codon_start=1 /label=b0555 /product="bacteriophage lambda lysozyme homolog" /db_xref="PID:g1786768" /transl_table=11 /translation="MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKD IVGVWTVCHGHTGKDIMLGKTYTKAECKALLNKDLATVARQINPYIKVDIPETTRGAL YSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRREIEREVC LWGQQ" CDS 996..1457 /EC_number="3.4.-.-" /note="o153; 96 pct identical to ENPP_LAMBD SW: P00726" /codon_start=1 /label=b0556 /product="bacteriophage lambda endopeptidase homolog" /db_xref="PID:g1786769" /transl_table=11 /translation="MSRVTAIISALIICIIVSLSWAVNHYRDNAIAYKVQRDKNAREL KLANAAITDMQMRQRDVAALDAKYTKELADAKAENDALRDDVAAGRRRLHIKAVCQSV REATTASGVDNAASPRLADTAERDYFTLRERLITMQKQLEGTQKYINEQCR" CDS complement(1489..1782) /note="f97; 96 pct identical amino acid sequence and equal length to VBOR_LAMBD SW: P26814" /codon_start=1 /label=b0557 /product="bacteriophage lambda Bor protein homolog" /db_xref="PID:g1786770" /transl_table=11 /translation="MKKMLLATALALLITGCAQQTFTVQNKQTAVAPKETITHHFFVS GIGQKKTVDAAKICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSK" CDS complement(2073..2525) /note="f150; phage stats; 24 pct identical (6 gaps) to 125 residues of approx. 2136 aa protein YCF2_SPIOL SW: P08973" /codon_start=1 /label=b0558 /db_xref="PID:g1786771" /transl_table=11 /translation="MQTTRPRITWKVLPMAQVAIFKEIFDQVRKDLNCELFYSELKRH NVSHYIYYLATDNIHIVLENDNTVLIKGLKKVVNVKFSRNTHLIETSYDRLKSREITF QQYRENLAKAGVFRWITNIHEHKRYYYTFDNSLLFTESIQNTTQIFPR" CDS 2769..2975 /note="o68; 28 pct identical (3 gaps) to 57 residues of approx. 1488 aa protein CFTR_XENLA SW: P26363" /codon_start=1 /label=b0559 /db_xref="PID:g1786772" /transl_table=11 /translation="MNKEQSADDPSVDLIRVKNMLNSTISMSYPDVVIACIEHKVSLE AFRAIEAALVKHDNNMKDYSLVVD" gene 3723..4268 /gene="nohB" CDS 3723..4268 /gene="nohB" /note="o181; 100 pct identical to fragment NOHB_ECOLI SW: P31062 (147 aa); 98 pct identical to TERS_LAMBD SW: P03707" /codon_start=1 /label=b0560 /product="DNA packaging protein" /db_xref="PID:g1786773" /transl_table=11 /translation="MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSA AVIKWYAERDAEIENEKLRREVEELLQASETDLQPGTIEYERHRLTRAQADAQELKNA RDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMN KAAALDELIPGLLSEYIEQSG" CDS 4243..4986 /note="o247; residues 1-103 are 100 pct identical to aa 1-103 from TERL_LAMBD SW: P03708 (641 aa); residues 166-233 are 66 pct identical to aa 30-97 from YCDD_SALTY SW: P40784 (106 aa)" /codon_start=1 /label=b0561 /db_xref="PID:g1786774" /transl_table=11 /translation="MNISNSQVNRLRHFVRAGLRSLFRPEPQTAVEWADANYYLPKES AYQEGRWETLPFQRAIMNAMGSDYIREVNVVKSARVGYSKMLLGVYAYFIEHKQRNTL IPAGFVAVFNSDESSWHLVEDHRGKTVYDVASGDALFISELGPLPENVTWLSPEGEFQ KWNGTAWVKDAEAEKLFRIREAEETKNSLMQVASEHIAPLQDAVDLEIATEEETSLLE AWKKYRVLLNRVDTSTAPDIEWPTNPVRE" CDS complement(5041..5472) /note="f143; This 143 aa ORF is 30 pct identical (7 gaps) to 96 residues of an approx. 568 aa protein NPRM_BACME SW: Q00891" /codon_start=1 /label=b0562 /db_xref="PID:g1786775" /transl_table=11 /translation="MDLNEASLNAASTRAGESKIKHKISHDVFEPYPAALHGQFDSIS MSYLLHCLPGNISTKSCVIRNAAQALTDDGTLYGATILGDGVVHNSFGQKLMRIYNQK GIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK" CDS 5764..5949 /note="o61; 100 pct identical to hypothetical E. coli protein from Genbank Accession U82598; residues 32-61 are 80 pct identical to aa 137-166 from tail fiber assembly protein, Genbank Accession D90798" /codon_start=1 /label=b0563 /db_xref="PID:g2367110" /transl_table=11 /translation="MLPQHSDIEIAWYASIQQEPNGWKTVTTQFYIQEFSEYIAPLQD AVDLEIATEEERSLLEA" gene 6570..7319 /gene="appY" CDS 6570..7319 /gene="appY" /note="o249; 100 pct identical to GB: ECM5_1 ACCESSION: Y00138; 100 pct identical to 219 aa of APPY_ECOLI SW: P05052 (243 aa)" /codon_start=1 /label=b0564 /product="M5 polypeptide" /db_xref="PID:g1786776" /transl_table=11 /translation="MDYVCSVVFICQSFDLIINRRVISFKKNSLFIVSDKIRRELPVC PSKLRIVDIDKKTCLSFFIDVNNELPGKFTLDKNGYIAEEEPPLSLVFSLFEGIKIAD SHSLWLKERLCISLLAMFKKRESVNSFILTNINTFTCKITGIISFNIERQWHLKDIAE LIYTSESLIKKRLRDEGTSFTEILRDTRMRYAKKLITSNSYSINVVAQKCGYNSTSYF ICAFKDYYGVTPSHYFEKIIGVTDGINKTID" gene complement(7569..8522) /gene="ompT" CDS complement(7569..8522) /gene="ompT" /EC_number="3.4.21.87" /note="f317; 100 pct identical to OMPT_ECOLI SW: P09169" /codon_start=1 /label=b0565 /product="protease VII precursor" /db_xref="PID:g1786777" /transl_table=11 /translation="MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGK TKERVYLAEEGGRKVSQLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMV DQDWMDSSNPGTWTDESRHPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYS FTARGGSYIYSSEEGFRDDIGSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTF KYSGWVESSDNDEHYDPGKRITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRV TNKKGNTSLYDHNNNTSDYSKNGAGIENYNFITTAGLKYTF" gene complement(9036..9797) /gene="envY" CDS complement(9036..9797) /gene="envY" /note="f253; This 253 aa ORF is 98 pct identical (2 gaps) to ENVY_ECOLI SW: P10805" /codon_start=1 /label=b0566 /db_xref="PID:g1786778" /transl_table=11 /translation="MQLSSSEPCVVILTEKEVEVSVNNHATFTLPKNYLAAFACNNNV IELSTLNHVLITHINRNIINDYLLFLNKNLTCVKPWSRLATPVIACHSRTPEVFRLAA NHSKQQPSRPCEAELTRALLFTVLSNFLEQSRFIALLMYILRSSVRDSVCRIIQSDIQ HYWNLRIVASSLCLSPSLLKKKLKNENTSYSQIVTECRMRYAVQMLLMDNKNITQVAQ LCGYSSTSYFISVFKAFYGLTPLNYLAKQRQKVMW" gene complement(9980..10870) /gene="ybcH" CDS complement(9980..10870) /gene="ybcH" /note="f296; 99 pct identical to 130 aa of YBCH_ECOLI SW: P37325 (145 aa) but has 167 alternate C-terminal residues" /codon_start=1 /label=b0567 /db_xref="PID:g1786779" /transl_table=11 /translation="MRKFIFVLLTLLLVSPFSFAMKGIIWQPQNRDSQVTDTQWQGLM SQLRLQGFDTLVLQWTRYGDAFTQPEQRTLLFKRAAAAQQAGLKLIVGLNADPEFFMH QKQSSAALESYLNRLLAADLQQARLWSAAPGITPDGWYISAEIDDLNWRSEAARQPLL TWLNNAQRLISDVSAKPVYISSFFAGNMSPDGYRQLLEHVKATGVNVWVQDGSGVDKL TAEQRERYLQASADCQSPAPASGVVYELFVAGKGKTFTAKPKPDAEIASLLAKRSSCG KDTLYFSLRYLPVAHGILEY" gene complement(10871..13843) /gene="nfrA" CDS complement(10871..13843) /gene="nfrA" /note="f990; 100 pct identical to NFRA_ECOLI SW: P31600" /codon_start=1 /label=b0568 /product="bacteriophage N4 adsorption protein A" /db_xref="PID:g1786780" /transl_table=11 /translation="MKENNLNRVIGWSGLLLTSLLSTSALADNIGTSAEELGLSDYRH FVIYPRLDKALKAQKNNDEATAIREFEYIHQQVPDNIPLTLYLAEAYRHFGHDDRARL LLEDQLKRHPGDARLERSLAAIPVEVKSVTTVEELLAQQKACDAAPTLRCRSEVGQNA LRLAQLPVARAQLNDATFAASPEGKTLRTDLLQRAIYLKQWSQADTLYNEARQQNTLS AAERRQWFDVLLAGQLDDRILALQSQGIFTDPQSYITYATALAYRGEKARLQHYLIEN KPLFTTDAQEKSWLYLLSKYSANPVQALANYTVQFADNRQYVVGATLPVLLKEGQYDA AQKLLATLPANEMLEERYAVSVATRNKAEALRLARLLYQQEPANLTRLDQLTWQLMQN EQSREAADLLLQRYPFQGDARVSQTLMARLASLLESHPYLATPAKVAILSKPLPLAEQ RQWQSQLPGIADNCPAIVRLLGDMSPSYDAAAWNRLAKCYRDTLPGVALYAWLQAEQR QPSAWQHRAVAYQAYQVEDYATALAAWQKISLHDMSNEDLLAAANTAQAAGNGAARDR WLQQAEKRGLGSNALYWWLHAQRYIPGQPELALNDLTRSINIAPSANAYVARATIYRQ RHNVPAAVSDLRAALELEPNNSNTQAALGYALWDSGDIAQSREMLEPAHKGLPDDPAL IRQLAYVNQRLDDMPATQHYARLVIDDIDNQALITPLTPEQNQQRFNFRRLHEEVGRR WTFSFDSSIGLRSGAMSTANNNVGGAAPGKSYRSYGQLEAEYRIGRNMLLEGDLLSVY SRVFADTGENGVMMPVKNPMSGTGLRWKPLRDQIFFIAVEQQLPLNGQNGASDTMLRA SASFFNGGKYSDEWHPNGSGWFAQNLYLDAAQYIRQDIQAWTADYRVSWHQKVANGQT IEPYAHVQDNGYRDKGTQGAQLGGVGVRWNIWTGETHYDAWPHKVSLGVEYQHTFKAI NQRNGERNNAFLTIGVHW" gene complement(13830..16067) /gene="nfrB" CDS complement(13830..16067) /gene="nfrB" /note="f745; 100 pct identical to NFRB_ECOLI SW: P31599" /codon_start=1 /label=b0569 /product="bacteriophage N4 adsorption protein B precursor" /db_xref="PID:g1786781" /transl_table=11 /translation="MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRR IKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHI FVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANF AFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSEL HGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKE KGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGI VFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWH FLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFM ANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTA LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVA LHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYA RRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLL LRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLLKAGLNTEQVAQLES ENEGE" BASE COUNT 4164 a 3582 c 3915 g 4509 t ORIGIN 1 ttggttattt gttgagattt gcttatgtat ttgtagtggt gttttcaata ctcggtagca 61 ttctctcaaa tatcatttag tggtttacgt acgtaaaaaa ttggttatgc tgttaagagt 121 ggttacttcg tcacacagct taaacccgcc gtcgagcggg tttttccatt ttttgagtct 181 cgatattagc tgataaccca atacctgagt tattcactga ctccgagtct gttacgtttc 241 gtagtattcc ctcaatttac acccgctttg tctgcgaggt ggggttatga aatccatgga 301 taagttaaca acgggtgtcg cctatggcac ctcagcaggt agtgccgggt actggttttt 361 acagctgcta gataaagtca ctccctcaca gtgggcagca ataggtgtgc tgggtagcct 421 ggtatttggc ctgctgacgt acctgacaaa cctttatttc aagattaaag aagataagcg 481 caaggctgcg agaggtgaat aatgcctcca tcattacgaa aagccgttgc tgctgctatt 541 ggtggcggag caattgctat agcatcagtg ttaatcactg gcccaagtgg taacgatggt 601 ctggaaggtg tcagctacat accatacaaa gatattgttg gtgtatggac tgtatgtcac 661 ggacacaccg gaaaagacat catgctcggt aaaacgtata ccaaagcaga atgcaaagca 721 ctcttgaata aagaccttgc cactgtcgcc agacaaatta acccgtatat caaagtcgat 781 ataccggaaa caacgcgcgg cgctctttac tcattcgttt acaacgtggg tgctggcaat 841 tttagaacat cgacgcttct tcgcaaaata aaccagggcg atatcaaagg cgcatgtgat 901 cagctgcgtc gctggacata cgctggcggt aagcaatgga aaggcctgat gactcgtcgt 961 gagattgagc gtgaagtctg tttgtggggg caacagtgag cagagtaacc gcgattatat 1021 ccgctctgat tatctgcatc atcgtcagcc tgtcatgggc ggtcaatcat taccgtgata 1081 acgcaatcgc ctacaaagtc cagcgcgaca aaaatgccag agaactgaag ctagcgaacg 1141 cggcaattac tgacatgcag atgcgtcagc gtgatgttgc tgcgctcgat gcaaaataca 1201 cgaaggagtt agctgatgcg aaagctgaaa atgatgctct gcgtgatgat gttgccgctg 1261 gtcgtcgtcg gttgcacatc aaagcagtct gtcagtcagt gcgtgaagcc accacggcct 1321 ccggcgtgga taatgcagcc tccccccgac tggcagacac cgctgaacgg gattatttca 1381 ccctcagaga gaggctgatc actatgcaaa aacaactgga aggaacccag aagtatatta 1441 atgagcagtg cagatagagc tgaccatatc gatgggcaac tcatgcaatt attttgagca 1501 atacacacgc gcttccagcg gagtataaat gcctaaagta ataaaaccga gcaatccatt 1561 tacgaatgtt tgctgggttt ctgttttaac aacattttct gcgccgccac aaattttagc 1621 tgcatcgaca gttttcttct gcccaattcc agaaacgaag aaatgatggg tgatggtttc 1681 ctttggtgct actgctgtct gtttgttttg aacagtaaat gtctgttgag cacatcctgt 1741 aataagcagg gccagcgcag tagcgagtag catttttttc atggtgttat tcccgatgct 1801 ttttgaagtt cgcagaatcg tatgtgtaga aaattaaaca aaccctaaac aatgagttga 1861 aatttcatat tgttaatatt tattaatgta tgccaggtgc gatgaatcgt cattgtattc 1921 ccggattaac tatgtccaca gccctgacgg ggaacttctc tgcgggagtg tccgggaata 1981 attaaaaacg atgcacacag ggtttagcgc gtacatgtat tgtattatgc caacaccccg 2041 gtgctgacac ggaagaaacc ggacgttatg atttagcgtg gaaagatttg tgtagtgttc 2101 tgaatgctct cagtaaatag taatgaatta tcaaaggtat agtaatatct tttatgttcg 2161 tggatatttg taatccatcg gaaaactcct gctttagcaa gattttccct gtattgctga 2221 aatgtgattt ctcttgattt caacctatca taggacgttt ctataagatg cgtatttctt 2281 gagaatttaa catttacaac ctttttaagt ccttttatta acacggtgtt atcgttttct 2341 aacacaatgt gaatattatc tgtggctaga tagtaaatat aatgtgagac attgtgacgt 2401 tttagttcag aataaaacaa ttcacagttt aaatcttttc gcacttgatc gaatatttct 2461 ttaaaaatgg caacctgagc cattggtaaa accttccatg tgatacgagg gcgcgtagtt 2521 tgcattatcg tttttatcgc ttcaatctgg tctgacctct ttgtgttttg ttgatgattt 2581 atgtcaaata ttaggaatgt tttcaattaa tagtattggt tgcgtaacaa agtgcggtcc 2641 tgctggcatt ctggagggaa atacaaccga cagatgtatg taaggccaac gtgctcaaac 2701 cttcatacag aaagatttga agtaatattt taaccgctag atgaagagca agcgcatgga 2761 gcgacaaaat gaataaagaa caatctgctg atgatccctc cgtggatctg attcgtgtaa 2821 aaaatatgct taatagcacc atttctatga gttaccctga tgttgtaatt gcatgtatag 2881 aacataaggt gtctctggaa gcattcaggg caattgaggc agcgttggtg aagcacgata 2941 ataatatgaa ggattattcc ctggtggttg actgatcacc ataactgcta atcattcaaa 3001 ctacttaacc tgtgacagag ccaacacgca gtctgtcact gtcaggaaag tggtaaaact 3061 gcaactcaat tactgcaatg ccctcgtaat taagtgaatt tacaatatcg tcctgttcgg 3121 agggaagaac gcgggatgtt cattcttcat cacttttaat tgatgtatat gctctctttt 3181 ctgacgttag cctccgacgg caggcttcaa tgacccaggc tgagaaattc ccggaccctt 3241 tttgctcaag agcgatgtta atttgttcaa tcatttggtt aggaaagcgg atgttgcggg 3301 ttgttgttct gcgggttctg ttcttagttg acatgaggtt gccccgtatt cagtgtcgct 3361 gatttgtatt gtctgaagtt gtttttacgt taagttgatg cagatcaatt aatacgatac 3421 ctgcgtcata attgattatt tgacgtggtt tgatggcgta gatgcacgtt gtgacatgta 3481 gatgataatt attatcattt tgtgggtcct ttccggcgat ccgacaggtt acggggcggc 3541 gacctcgcgg gttttcgcta tttatgaaaa ttttccggtt taaggtgttt ccgttcttct 3601 tcgtcgtaac ttaatgtatt tatttaaaat accccctgaa aagaaaggaa acgacaggtg 3661 ctgaaagcga gctttttggc ctctgtcgtt tcctttctct gtttttgtcc gtggaatgtg 3721 caatggaagt caacaaaaag cagctggctg acattttcgg tgcgagtatc cgtaccattc 3781 agaactggca ggaacaggga atgcccgttc tgcgaggcgg tggcaagggt aatgaggtgc 3841 tttatgactc tgccgccgtc ataaaatggt atgccgaaag ggatgctgaa attgagaacg 3901 aaaagctgcg ccgggaggtt gaagaactgc tgcaggccag cgagacagat ctccagccag 3961 ggactattga gtacgaacgc catcgactta cgcgtgcgca ggccgatgca caggagctga 4021 aaaatgccag agactccgct gaagtggtgg aaaccgcatt ctgtactttc gtgctgtcgc 4081 ggatcgcagg tgaaattgcc agtattctcg acgggatccc cctgtcggtg cagcggcgtt 4141 ttccggaact ggaaaaccga catgttgatt tcctgaaacg ggatatcatc aaagccatga 4201 acaaagcagc cgcgctggat gaactgatac cggggttgct gagtgaatat atcgaacagt 4261 caggttaaca ggctgcggca ttttgtccgc gccgggcttc gctcactgtt caggccggag 4321 ccacagaccg ccgttgaatg ggcggatgct aattactatc tcccgaaaga atccgcatac 4381 caggaagggc gctgggaaac actgcccttt cagcgggcca tcatgaatgc gatgggcagc 4441 gactacatcc gtgaggtgaa tgtggtgaag tctgcccgtg tcggttattc caaaatgctg 4501 ctgggtgttt atgcctactt tatagagcat aagcagcgca acacccttat tccagctggc 4561 ttcgtggctg ttttcaacag tgatgagtca tcgtggcatc tcgttgaaga tcatcggggt 4621 aaaacggttt atgacgtagc gtcaggggac gcgttattta tttctgaact cggtccgtta 4681 ccggaaaatg ttacctggtt atcgccggaa ggggagtttc agaagtggaa cggtacagcc 4741 tgggtgaaag atgcagaagc agaaaaactg ttccggattc gggaggcgga agaaacaaaa 4801 aacagcctga tgcaggtagc cagtgagcat attgcgccac ttcaggatgc tgtagatctg 4861 gaaatcgcaa cggaggaaga aacctcattg ctggaagcct ggaaaaaata tcgggtgttg 4921 ctgaaccgtg ttgatacatc aactgcacct gatattgagt ggcctacgaa ccctgtcagg 4981 gagtaatcat tgggattatg ccgcagcacg tcttaagcaa gaacatgctg cggttggatg 5041 ctattttttt cctgaagcgg aaaacattac tacagtacct tgaaccttgg ttttaacatt 5101 ctcgaaatgc tctgagagta tatgtgttaa gccttcttcg gaatcttttg tgtttgaaaa 5161 gatgcctttc tgattgtaaa tgcgcatcag tttttgaccg aagctattgt gcacaactcc 5221 atcgccaaga attgtggctc cgtatagagt tccatcgtca gttaaggcct gcgccgcatt 5281 gcgtattaca cagctttttg tagatatatt tccaggcagg cagtgaagaa ggtaagacat 5341 ggaaatggaa tcaaattgac catgtaacgc cgcgggataa ggttcaaaaa catcatggct 5401 aattttatgt ttaatttttg attccccagc ccttgtagat gccgcgttca ggctagcttc 5461 gttcaaatcc attaaagata tcagactact ctcaggtacg tgagtaaggt aaaacccagt 5521 tccaacacca atatccagat ggttgttacc taaatgttcc agaaagtgtg gaagaaggtg 5581 ttcctttgta ggacatcccc atgcaagccg atttgatact cccaaaaccc accagtcata 5641 aagctttagg gtaagtggtg tgtaaattct agccccatca tctgtgtttt tttattaatt 5701 tcaccatgtt atagttttat ttgtgaatta aatcaattat ggcaatgaat tacaaggggt 5761 taaatgctgc cgcagcatag cgatattgaa atagcctggt atgcttcaat acagcaggag 5821 ccgaatggct ggaagaccgt caccacacag ttctacatcc aggaattcag tgagtatatt 5881 gcgccactgc aggatgctgt agatctggaa atcgcaacgg aggaagaaag atcgttgctg 5941 gaagcctgaa aaaagtatcg ggtgctgcta aaccgtgtgg acacttccgt agcaccagat 6001 atcgagtggc ttattcaacc ataataaaca gtatgtatat cataggttat taattgtgag 6061 ttttttcggt gtgttatttg tttgtttgat gttatgcttt tgcgccccaa aaggttgttt 6121 agatgtattt tatcaattga ttttcaatat cgtttaataa agaaaaatta agcaagctgg 6181 atgttggttt tttgttaatt gaatggttct aataatgttt ttttactgtt gttgaatgtg 6241 acttgataag aaatgcaagt aaaaatgata ctctttttat tttaaattca aacggttgac 6301 atatatatag caagaggttt caggtgcgtt gtagtgagtt tatgttaata aaaagcatag 6361 taagcgttga aaaatgtaac tttgaaataa gttagaataa aaaacaacat acatataata 6421 atttaatctt aaatgaaatt tattaaaatt tgcaaactat aattttgtgt ataaaaatat 6481 aaatgcacat catcctgatt atgattgtgt atttaattgg ttgttatttg actactatca 6541 acttgtttta attttatgat aggtgcaaga tggattatgt ttgctccgta gttttcatct 6601 gtcaatcatt tgatttaatt ataaacagga gagttatctc gttcaaaaaa aattcattgt 6661 ttattgtaag cgacaaaatt agaagggagt taccagtatg cccctctaaa ctaagaattg 6721 ttgatataga taagaaaaca tgtttatcct tttttatcga cgtgaataat gagctgcctg 6781 gcaaatttac tcttgataag aatggctata ttgctgaaga ggaacctcca ttatcgcttg 6841 ttttttctct gtttgaaggg attaaaatag cagactcaca ctccctttgg ttaaaagaaa 6901 gactatgtat atccttactt gccatgttca aaaaacgcga aagtgtaaat tcatttatac 6961 taacaaatat aaatacattt acctgtaaaa ttactggaat aatcagtttt aatattgagc 7021 ggcaatggca tttaaaagat attgcggaat tgatttatac gagtgaaagt ttaataaaaa 7081 aaagattaag ggatgaagga acgtcattta ctgaaatatt gagagatact aggatgaggt 7141 atgcaaaaaa actcataact tcaaactctt attctatcaa tgtcgtagcc cagaaatgtg 7201 gctataacag tacttcatat ttcatatgtg catttaaaga ttattatggt gtcacgccat 7261 ctcattattt tgagaaaata atcggcgtca cagatggaat aaacaaaaca attgactgat 7321 aatgtttatt acaagttgtc tacatgttaa ttataatatt atacagcgtt ttttttgatg 7381 tgatattctg gaaccattaa tttgtaattg ggttgctgtc gcctatttta tacatactat 7441 aattgatggt tttctatgtg atttagttaa taaccttctg ggtttatttt aagggttaat 7501 tgttacattg aaatggctag ttattccccg gggcgatttt cacctcgggg aaattttagt 7561 tggcgttctt aaaatgtgta cttaagacca gcagtagtga tgaagttata gttttctata 7621 cctgctccat ttttgctgta gtctgaagtg ttattattgt gatcataaag tgaagtatta 7681 ccttttttat tcgtaacccg attccatgcg ccttcaacat aaacttttgc gttaggtgtg 7741 acgtaataac ctgcattgac tgcaacagaa tagtaatttt ggtctttgac cttactgcga 7801 taagtgattc tttttcccgg gtcatagtgt tcatcgttat cagatgattc cacccagccg 7861 ctgtatttaa atgtgccacc gagttcaaaa tcttcataac gataacttcc agtcaagcca 7921 atgtagggca ttttaaaacg ttgtttgtag ccgattgctc tttctccatt cgggaaggag 7981 ccgatatcat ctctgaatcc ctcctcagaa ctgtagatat aggaaccacc tctggctgta 8041 aagctataac ggctttcctg atatccggcc atgagtccca ggcggtaatt gggttcgttg 8101 aggagccagc ctttgatatt cagatcaaat tcgttggcat aattgagttg tgtatcaggg 8161 tgtctacttt catccgtcca ggttccgggg ttactggaat ccatccagtc ctgatcgacc 8221 atattgccac ctcggctgcc gagagttgtc cagccagcag ccccgataga tatctggggc 8281 atcaaatccc aattaattgc acctttaata attgcagcgt tattgaattt ccagtcgagt 8341 tgactgactt ttcggcctcc ttcttcggct agataaacac gctcttttgt ttttccgctc 8401 agagttccaa gactaatgtc cgcatttatg ttgtcaggag taaacgataa agtctcggta 8461 gaagcaaaag agctgatcgc aataggggtt gtcaggacta ttcccagaag tttcgcccgc 8521 ataaaagttc tccattcaat cgttttaatg attgaatatg tattttttat atctaactta 8581 atgagtcaat tacatattgc tccactgttt atattttgtt tagtattgaa tgattatcac 8641 aatgcgctat ctgtttttgg tttaattatc tgttattgtt tcatatttcg gttttactgt 8701 gtggtttttt tatgcttttg tggtgctttt atctatttaa gtgccatgcc tttagaggca 8761 tataagcgaa aatagcatga ggtttatcct caattactat gttttttagt acaaaaaaga 8821 gggacaaaac tgagacacat aaggcctcgc aatggcttgc aaggctttac atgttttgag 8881 gtagtgggac gtgtgagcgc agagatggcg cggtaagttg ttgacttaaa atgtcgttct 8941 aggaacttct aagtcgtggg ccgcaggttc gaatcctgca gggcgcgcca tttcttcctc 9001 atttatgccc gtcttatccg tttccgcttt gcccttcacc acatcacttt ttgtcgctgt 9061 ttggcgagat aattcaacgg tgtcaggccg taaaacgcct taaaaacaga gataaagtac 9121 gacgtgctgc tatagccaca taattgcgcc acctgagtga tatttttgtt atccatcaat 9181 aacatctgta cggcgtaacg catacgacac tctgtgacaa tctggctata gctggtattt 9241 tcgtttttta atttcttttt gagcaggctg gggcttaaac atagcgaact ggcgacaatt 9301 cgcagattcc agtaatgctg aatatcgctt tgaataatgc ggcagacgct gtcgcggacg 9361 ctgctgcgta agatatacat cagtagggca ataaaccgcg attgctcaag aaagttagac 9421 aatacggtaa aaagcaatgc gcgcgtcaac tccgcttcgc agggtctgct gggttgctgc 9481 ttgctgtggt tggcggctag ccggaacact tccggtgtac ggctatgaca agcgataacc 9541 ggggttgcca gccgcgacca gggctttaca caggttaagt tcttatttaa aaacaacaga 9601 taatcgttga tgatgttacg gttgatgtgg gtgattaata cgtgatttaa cgttgagagt 9661 tcaatgacgt tattgttgca cgcgaaggcg gccaggtagt ttttcggaag ggtaaacgta 9721 gcatggttat tgacgcttac ctctacctct ttttcggtca ggatcaccac gcaaggttca 9781 ctgctgctca attgcatttc gcactcctca gatatcagaa actccgctca aaggatctat 9841 gcttcctgca tgagtgatcg gcccgttcgc cgataacgat cttctttctt tagcacgctt 9901 tttagcaatt aatcttgatg gaattctgat gagagcgaaa gaggtaagcc aggtcgtacc 9961 cgacttacct ggaggagatt taatactcga gaatgccgtg cgcgacgggc aaatagcgca 10021 gagagaaata gagagtgtct ttaccgcaag aggaacgttt cgctaacagc gaggcaattt 10081 ctgcgtccgg tttcggtttc gctgtaaagg ttttgccttt gccggcgaca aaaagttcat 10141 aaacaacgcc gctggcaggg gcgggacttt ggcaatcggc gctggcctgt aaataacgtt 10201 cacgctgttc agcggtcagt ttatccacgc cgctgccatc ctgtacccag acattaacgc 10261 cggttgcttt aacgtgttcc agcagttggc gatagccatc gggcgacatg tttccggcga 10321 aaaaactact gatataaacc ggttttgctg aaacatcgct aatcagccgc tgcgcgttgt 10381 ttaaccatgt tagcaaaggc tgacgggcgg cttcgctgcg ccagttcagg tcgtcaattt 10441 ccgcgctgat gtaccagcca tccggcgtta tgccaggcgc ggcgctccat aatctggctt 10501 gctggagatc ggcagccagc aggcgattaa gatagctttc cagcgctgcg gacgactgtt 10561 tctggtgcat aaaaaattcc ggatcggcgt tcagcccgac aataagcttc agaccagcct 10621 gttgcgcagc tgcggcccgc ttaaacaata acgtgcgctg ttctggctgg gtaaatgcat 10681 cgccgtaacg ggtccattgc aaaacaaggg tatcgaagcc ttgcaaacgt aactgactca 10741 tcagcccctg ccactgggta tcggtaacct gactatctcg gttttgtggt tgccagataa 10801 tacctttcat cgcaaaggaa aaagggctga ccaaaagcag tgtcagcaat acgaaaatga 10861 acttacgcat ttaccagtgc actccaatgg tgagaaacgc gttgttgcgc tctccgttac 10921 gttgattaat cgccttaaag gtatgttgat actcgacgcc gagactgact ttgtgcggcc 10981 aggcgtcgta gtgcgtctcg ccggtccaga tattccagcg gaccccgact ccgccaagct 11041 gcgcgccctg agtgccttta tcacgatagc cgttgtcctg aacgtgagcg taaggctcaa 11101 tagtctgtcc gttagctacc ttctgatgcc agctgacgcg ataatctgcc gtccacgcct 11161 gaatatcctg gcggatatat tgcgccgcat cgaggtacag gttttgggca aaccagcctg 11221 aaccgttcgg gtgccattcg tcgctgtatt tgccgccatt aaagaatgag gcgctggcgc 11281 gcagcatggt atcggatgcg ccattttggc cgttcagcgg caactgctgt tcgacggcga 11341 tgaaaaagat ctgatcgcgc agcggcttcc agcgcagacc ggtgccggac atcggatttt 11401 tcaccggcat catcaccccg ttttctccgg tatcggcaaa gacgcggcta taaactgaga 11461 gcaggtcgcc ttccagcagc atattgcgtc cgatgcggta ctcggcttcc agttgtccgt 11521 agctacgata gcttttccct ggcgctgcgc cgccgacatt attgttagcg gtactcattg 11581 cgccggaacg caagccgatg gaagaatcga aactgaacgt ccagcggcga ccgacctcct 11641 catgcaaacg gcggaaattg aagcgttgtt gattttgttc tggggtcagt ggggttatca 11701 gcgcctgatt atcaatgtca tcaatcacca gccgggcgta gtgctgcgtc gcaggcatgt 11761 catccagacg ctggttcacg taggccagtt gtcggatcag tgccggatcg tccggaagcc 11821 ctttatgcgc cggttcgagc atttcccgcg actgtgcgat atcaccgcta tcccacaagg 11881 cgtaaccaag cgctgcctgg gtgttgctat tattcggttc cagttccagc gcggcgcgca 11941 aatcactcac cgcggccggg acattatgac gttggcgata aattgtcgcc cgcgcaacgt 12001 aagcgttggc agaaggcgca atattgattg agcgcgtgag atcgttcagt gcgagttccg 12061 gctgaccagg aatgtaacgt tgcgcatgca gccaccagta gagggcattg cttcccagtc 12121 cacgtttttc tgcctgttgt agccagcgat cgcgagccgc accatttcct gccgcctggg 12181 cggtattggc agcagcaagc agatcctcat tgctcatgtc gtgaagactg attttctgcc 12241 aggccgccag tgcggtggcg tagtcctcaa cctgatacgc ctgataggct accgcacgat 12301 gttgccaggc gctcggttgt cgttgttcgg cctgaagcca tgcatacaac gccacaccgg 12361 gtagcgtgtc ccgataacac tttgccagac ggttccaggc ggcggcatcg taggaaggcg 12421 acatatcgcc cagcaagcga actattgccg ggcaattatc tgcaataccc ggcaactgac 12481 tttgccactg acgttgctcc gccagcggta agggtttcga taaaatcgcc accttcgccg 12541 gcgttgccag gtaaggatga ctttccagca gagacgccag tcgcgccatt aaagtctggc 12601 tgacacgcgc atcgccctgg aaaggatagc gttgcagcaa taaatcggca gcttcgcgtg 12661 actgctcgtt ctgcatcagt tgccaggtta gttgatccag gcgggtaaga tttgccggtt 12721 cttgctgata cagcaatcgt gccagacgca gagcttcagc cttgttacgg gtcgccacgc 12781 tgacagcata acgctcctca agcatttcat tggcggggag ggtggcgagc agtttttgcg 12841 ctgcgtcgta ctgaccttct tttaacagca ccggtagcgt cgcgccaaca acatactggc 12901 ggttgtcggc aaactgtacc gtataattcg ccaacgcctg aacggggtta gcgctgtatt 12961 tagataacag atagagccaa cttttctctt gtgcgtccgt ggtaaatagt ggcttatttt 13021 caatgagata atgctggagg cgtgcttttt cgccacgata agccagcgcg gtcgcgtaag 13081 taatatatga ctgaggatcg gtgaagatcc cctgtgattg cagtgccagg atccgatcgt 13141 ccagctgccc ggcaagaagc acgtcaaacc actgacggcg ttctgccgcg cttaatgtgt 13201 tctgctggcg tgcttcattg tatagcgtat ctgcctggga ccattgtttc aggtagattg 13261 cccgttgcag cagatcggtt cgcagcgttt ttccttccgg cgatgcagca aacgtcgcat 13321 cgttcagttg cgctctggcg acaggtaact gtgccagccg cagggcattc tgcccgactt 13381 cactgcgaca acgcagggtc ggcgcagcat cgcacgcttt ttgctgggca agcagttctt 13441 caactgtcgt aacgcttttc acttcaaccg gaatagccgc cagactgcgc tcaagtcggg 13501 catctcctgg gtgacgtttc agttgatcct caagcaacag ccgcgcccgg tcatcatgac 13561 caaaatggcg ataggcttcc gcaaggtata aagtcagcgg aatattatcc ggcacctgct 13621 ggtgtatata ttcaaattcg cggatggcgg ttgcttcgtc gttatttttc tgtgccttca 13681 gcgctttatc gagacgggga taaataacaa aatggcgata atcgctcagc cccagctctt 13741 ctgcgctggt gccgatattg tctgcgagtg cgctggtact caataaagac gtcagcagta 13801 aaccagacca tccgatgacg cgattaaggt tattctcctt cattttcgga ctccagttgc 13861 gcaacctgtt ctgtgtttaa acctgctttg agtaatagtg attgcatcga aacttgtaat 13921 tcgcgttgaa ttgtcaggac gcgatccaac gtttcctggc tgataacgcc ttcggtgacc 13981 aaaaacttgc cgagcggcag agaactgcgt tcatggcgca ataacaacac gttaattgct 14041 gaacgattaa tatgaccgag cgtggtcagt atttcggcga acaggaactg atgcggcaca 14101 tattgccgcc agatttcacc ggcctgctgt tccgtgagcc actgatgctg aaccgcattg 14161 tacaacattg cccgcggatc gtgaccgcgt cggcgtgcat accagtgacg taaccctgtg 14221 acaatttgtc cccgcagaac aatgacgtaa cgcactttgc gtccgacttt acgcgtcagg 14281 gccgccagcg aaaccgggtc aataccatct tcactgccga caattaactc gtcattttcc 14341 agacgcagcg gcagtaccgc ataatgcagc gccacggagg ccggcatttc ggcaatcagc 14401 gaggaaggga tctgccaggc atcgatggat tcccacgcca cgccgttttg ctctgccagc 14461 gcctgtgcca gctgctcggc gctaatcagc ccctgcatca gcattgaacc gcccaggcgt 14521 agaccttcga cgcgattacg cagtgctgta tcgagttgtt cttcagtgat gacctgattt 14581 tccagcagaa tttgacctaa cgggcgcaac gagcgggtat cgccagtcac gctggggaag 14641 tcatgcgttg ttttatccca cgccacgcga cgtggatcgc cgtgttgaag tacctgtttt 14701 agcgcgcgcc agttggccat gaagttaatc aggttgcccc agaaaagacg caggacggaa 14761 agcagcccct gcgtcaggcc gtagtagcca gtaacgaaaa tcacccgctg cacgatgcgg 14821 ttaaccatca aaccaaagtt tagccacagc agggtcatta accatgcgct gccgctgaaa 14881 atagaaagga aatgccaggc atcgggccac aaactttcat acgccagcaa cagcaaaagc 14941 tggatcatca ccagcatcgc gaggaagctg acaaagttac tgattgcccc tttgcggtcg 15001 cgccagagaa agtagttcag cgtcaggctg gaggtccatt tatgggtttt aaagccttgg 15061 aaaacaatgc cgatgatcca gcgggatttt tgtcgaaccg cagtcgaaaa ggtatcgggg 15121 aaatattcgc gcacgcagat catgtttgat gtccgcgcgt gctgtaaaaa tttacgctgc 15181 tcgcgttctt tggcttcgtc caccaccgga aaacggacaa aaatttccgt catacctttt 15241 tctttcaggc ggaagccaat gtcgtaatct tcagtaagac tctgcacgtc gaaagcaata 15301 ccgtcaccgt cagctaacag tgcggtcacg gcgcggcggc tgaaacaggt gccgacgcct 15361 gcgctgggca cttgtccggc gagggcttca cgcaccggaa catctttgcc atgcagctct 15421 gaaaactcat caatgtaagt catgctggtg aagtgcgtcc attcgcgttc gaacggatac 15481 accgggatct gaatcagatc tttacgctcg accagatagt tgaacagacg caattccatc 15541 ggtgaaatca catcttcggc gtcatgcaga ataaaaccag caaaagcgaa attggcgcta 15601 cgctcaaatt gggtgatggc gtccagcacg ttgttcagac agtcggcttt gctggtgggg 15661 ccaggacgcg cgcagactac cttatgcaca ttcgggaagc gagcgcacac ttcgtcaaca 15721 tcacgctgag tatcggggtc gttggggtag gtgccaacaa agatatgata gttttcgtag 15781 tcgagcgtgg tcgccgccag ctcggccata ttgccgatga cgcccgtttc attccacgcc 15841 ggaaccataa tcgctaacgg tttttcatct ggtttataca gttcgcggta actcattcgc 15901 gggtagcggc gataaacact caacttgcgt ttaatgcggc gtacccagta gacgacatca 15961 ataaaaaaat cgtccagccc gctgatgaac atgatgaccg ctaacgttat cgcgattact 16021 tttaagccgt agagccaggt agcaaaaaca tcaagaagcc agtccacaca aaaaccttac 16081 attaacgctg gttatgttta gggtggcgta tattaaggtt ttttatgaat tgtgacagct 16141 ttttaccatt aataggtatg actattgcgg //