Gene list
Applied filters:
COG category: Function unknown
Gene type: CDS
Genomic element: chromosome
Number of genes found: 427
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Escherichia coli O157:H7 str. Sakai, Sakai >ECs3667 hypothetical protein MTSRFMLIFAAISGFIFVALGAFGAHVLSKTMGAVEMGWIQTGLEYQAFH TLAILGLAVAMQRRISIWFYWSSVFLALGTVLFSGSLYCLALSHLRLWAF VTPVGGVSFLAGWALMLVGAIRLKRKGVSHE >ECs2785 hypothetical protein MRRVNILCSFALLFASHTSLAVTYPLPPEGSRLVGQSLTVTVPDHNTQPL ETFAAQYGQGLSNMLEANPGADVFLPKSGSQLTIPQQLILPVTVRKGIVV NVAEMRLYYYPPDSNTVEVFPIGIGQAGRETPRNWVTTVERKQEAPTWTP TPNTRREYAKRGESLPAFVPAGPDNPMGLYAIYIGRLYAIHGTNANFGIG LRVSQGCIRLRNDDIKYLFDNVPVGTRVQIIDQPVKYTTEPDGSKWLEVH EPLSRNRAEYESDRKVPLPVTPSLRAFINGQEVDVNRANAALQRRSGMPV QISSGSRQMF >ECs3990 hypothetical protein MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVER VEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDAT AQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHT NIVHIETHDGVVFTQQACVAEGEQESPLSVLSRTTLAEILKFVNEVPFAA IRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLAR ALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMA ISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGI VAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR >ECs2498 hypothetical protein MFDVTLLILLGLAALGFISHNTTVAVSILVLIIVRVTPLSTFFPWIEKQG LSIGIIILTIGVMAPIASGTLPPSTLIHSFLNWKSLVAIAVGVIVSWLGG RGVTLMGSQPQLVAGLLVGTVLGVALFRGVPVGPLIATGLVSLIVGKQ >ECs3533 hypothetical protein MFNRPNRNDVDDGVQDIQNDVNQLADSLESVLKSWGSDAKGEAEAARSKA QALLKETRARMHGRTRVQQAARDAVGCADSFVRERPWCSVGTAAAVGIFI GALLSMRKS >ECs2949 putative tail length tape measure protein precursor MAGNFADLTAVLTLDSARFSEEAARVKKELGETSALADLMSGKVSQSFRK QADAAEQSLSRQALAAQKAGISVGQYKAAMRTLPAQFTDIVTQLAGGQNP FLIMLQQGGQISDSFGGPLSLLTLLKEELLGIRDASESSEESLSDTANAL AENARNAGELGRFMSVARVAAGGGVAVLAALAAAAWQAEQADRALLRSLT LTGGAAATTTAELWKMAGVISDEAGGGIRQAAENLARLAESGKYTAGQLR IMGETSQRWLQTVGDDAGKVEKAFEGIAADPVKALASLNQQYNFLSVSQL RHIDELERTKGKQAAVTEAMSLFADVMNARLEQLDKAATPVEKIWDDVKT WTSDAWAWIGDHTLGALSLITDVVAGTVEQVKLLLVQGDLALAEFIQSAW ETTKNVPGVGALFGELAEENRVFIEKTKRDELALRKSIAERDARIRQGEM GYINRSRATGVSKGPGQQEAVSRLAEELTGKKHTSPKTRSAGEREEEQAR EALLALEAELRTLEKHSGANEKISRQRRDLWKAESQYAVLKEAATKRQLS EQEKSLLAHKDETLEYKRQLAELGDKVEYQKRLNELAQQAVRFEEQQSAK QAAISAKARGLTDRQAQRESEAQRLRDVYGDNPAALAKATSALKNTWSAE EQLRGSWMAGLKSGWGEWAESATDSFSQVKSAATQTFDGIAQNMAAMLTG AEADWRGFTRSVLSMMTEILLKQAMVGIVGRIGSAIGGAFGGGASASTGT AIQAAAANFHFATGGFTGTGGKYEPAGIVHRGEFVFTKEATSRIGVGNLY RLMRGYAEGGYVGGAGSPAQMRRTEGINFNQNNHVVIQNDGTNGQAGPQL MKAVYDMARKGAQDELRLQLRDGGMLSGSRR >ECs0409 nucleoprotein/polynucleotide-associated enzyme MCRCLTRLIKQGRKGLAPNTPTAFTGILAKNYSAGLETKMAKLTLQEQLL KAGLVTSKKAAKVERTAKKSRVQAREARAAVEENKKAQLERDKQLSEQQK QAALAKEYKAQVKQLIEMNRITIANGDIGFNFTDGNLIKKIFVDKLTQAQ LINGRLAIARLLVDNNSEGEYAIIPASVADKIAQRDASSIVLHSALSAEE QDEDNPYADFKVPDDLMW >ECs2456 hypothetical protein MNAERKFLFACLIFALAIYAIHAFGLFDLLTDLPHLQTLIRQSGLFGYSL YILLFIIATLFLLPGSILVIAGGIVFGPFLGTLLSLIAATLASSCSFLLA RWMGRDLLLKYVGHSHTFQAIEKGIARNGIDFLILTRLIPLFPYNIQNYA YGLTTIAFWPYTLISALTTLPGIVIYTVMASDLANEGITLRFILQLCLAG LALFILVQLAKLYARHKHVDLSASRRNPLTHPKNEG >ECs0105 zinc-binding protein MSETITVNCPTCGKTVVWGEISPFRPFCSKRCQLIDLGEWAAEEKRIPSS GDLSESDDWSEEPKQ >ECs1677 hemolysin E MIMTEIVADKTVEVVKNAIETADGALDLYNKYLDQVIPWQTFDETIKELS RFKQEYSQAASVLVGNIKTLLMDSQDKYFEATQTVYEWCGVATQLLAAYI LLFDEYNEKKASAQKDILIKVLDDGITKLNEAQKSLLVSSQSFNNASGKL LALDSQLTNDFSEKSSYFQSQVDKIRKEAYAGAAAGVVAGPFGLIISYSI AAGVVEGKLIPELKNKLKSVQSFFTTLSNTVKQANKDIDAAKLKLTTEIA AIGEIKTETETTRFYVDYDDLMLSLLKEAANKMINTCNEYQKRHGKKTLF EVPEV >ECs0224 hypothetical protein MDEGSLSLPPFTGYDEKSLRDYHLALHGNSLNPMIDAATPLLGMVMRLST MNSQTMPEHLFAQVVTDVQAVEQLLQEQGYEPGVIISFRYILCTFIDEAA LGNGWSNKNEWIKQSLLVHFHNEAWGGEKVFILLERLIREPKRYQDLLEF LWLCFSLGFRGRYKVAVQDQGEFEQIYRRLYHVLHKLRGDAPFPLLHQDK KTQGGRYQLISRLTVKHIFCGGVVVLALFYLFYLLRLDSQTQDILHQLNK LLR >ECs0412 putative alpha helix chain MMRCEMPSTPEEKKKVLTRVRRIRGQIDALERSLEGDAECRAILQQIAAV RGAANGLMAEVLESHIRETFDRNDCYSREVSQSVDDTIELVRAYLK >ECs1488 hypothetical protein MNKSMLAGIGIGVAAALGVAAVASLNVFERGPQYAQVVSATPIKETVKTP RQECRNVTVTHRRPVQDEHRITGSVLGAVAGGVIGHQFGGGRGKDVATVV GALGGGYAGNQIQGSLQESDTYTTTQQRCKTVYDKSEKMLGYDVTYKIGD QQGKIRMDRDPGTQIPLDSNGQLILNNKV >ECs2248 putative portal protein MWNLLRRTRKNQKSGRDVREVGWRSLFQAVAEPFAGAWQQGVKADPETVL SFHAVFSCISLISQDIAKMRLRLMQTDVQGIRREKRQGDTARLCRRPNAQ QNRIQFFELWLNSKLRHGNTVVLKIRTPRGQIKELRILDWNRVEPLVADD GEVFYRITPDRNCGITESVTVPAREVIHDRFNCFFHPLVGLPPVYAAGLA AMQGHHIQANSTYFFRNGGRPSGVIEVPGSITEENAKKLKGNWDSGYTGE NAGKTAILSNGAKYSPTTFSPVDAQTVEQLKMTAEIVCSVFRVPAYKIGV GHPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETGENESTEFDVTT LLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYS LEALSRRDAREDPFASAGKTVSSQLPDGASDGNKAISETEHDAVKAMFRG DTEKMTERELSIIRALGEEFSTVLADLQRTFEGKMASQAQAFEEKLTSLS AVLQKHVTVDEVRPVLQAMVDDAVGAIPVPRDGRDYDPDVLQQAVNDAVA NIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPEVLQKAVN DAVANIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPEVLQ KAVNDAVANIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDP DVLQKAVLDAVSALPAPQDGRDATALEILPAIDDQKSFPRGTYATHQGGL WRAYEKTHGMRGWECLVDGVADIDVSMTGERLFSVVVRQSSGQRTEKTFS LPVMLYRGVFRAGETYHPGDTVTWGGSLWHCNSMTEDKPGEAHSSAWTLA AKRGRDAGG >ECs0205 hypothetical protein MRKNTYAMRYVAGQPAERILPPGSFASIGQALPPGEPLSTEERIRILVWN IYKQQRAEWLSVLKNYGKDAHLVLLQEAQTTPELVQFATANYLAADQVPA FVLPQHPSGVMTLSAAHPVYCCPLREREPILRLAKSALVTVYPLPDTRLL MVVNIHAVNFSLGVDVYSKQLLPIGDQIAHHSGPVIMAGDFNAWSRRRMN ALYRFAREMSLRQVRFTDDQRRRAFGRPLDFVFYRGLNVSEASVLVTRAS DHNPLLVEFSPGKPDK >ECs4853 hypothetical protein MSLEVFEKLEAKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREE LERENNHLKEQQNGWQERLQALLGRMEEV >ECs1200 DNA-binding protein MNELINSNAIKMTSIEIAELVGSRHDKVKQSIERLAVRGVIRNPPMVVFE KINNLGLLRGVEAYVFEGEQGKRDSIIVVAQLSPEFTARLVDRWRELEGA TAKIPQTFSEALRLAADLEDQKAELEKQLALAAPKVEFADRVGEASGILI GNFAKVVGIGPNKLFAWMRDHKILIASGSRRNVPMQEYMDRGYFTVKETA VNTNHGIQISFTTKITGRGQQWLTRKLLDNGMLKVTREAA >ECs4197 hypothetical protein MSRSLLTNETSELDLLDQRPFEQTDFDILKSYEAVVDGLAMLIGSHCEIV LHSLQDLKCSAIRIANGEHTGRKIGSPITDLALRMLHDMTGADSSVSKCY FTRAKSGVLMKSLTIAIRNREQRVIGLLCINMNLDVPFSQIMSTFVPPET PDVGSSVNFASSVEDLVTQTLEFTIEEVNADRNVSNNAKNRQIVLNLYEK GIFDIKDAINQVADRLNISKHTVYLYIRQFKSGDFQGQDK >ECs5211 hypothetical protein MTKQPEDWLDDVPGDDIEDEDDEIIWVSKSEIKRDAEELKRLGAEIVDLG KNALDKIPLDADLRAAIELAQRIKMEGRRRQLQLIGKMLRQRDVEPIRQA LDKLKNRHNQQVVLFHKLENLRDRLIDQGDDAIAEVLNLWPDADRQQLRT LIRNAKKEKEGNKPPKSARQIFQYLRELAENEG >ECs0441 hypothetical protein MLQSNEYFSGKVKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISG ALNVLLPDATDWQVYEAGSVFNVPGHSEFHLQVAEPTSYLCRYL >ECs0611 hypothetical protein MKKALQVAMFSLFTVIGFNAQANEHHHETMSEAQPQVISATGVVKGFDLE SKKITIHHDPIAAVNWPEMTMRFTITPQTKMSGIKTGDKVAFNFVQQGNL SLLQDIKVSQ >ECs4079 hypothetical protein MKFKTNKLSLNLVLASSLLAASIPAFAVTGDTDQPIHIESDQQSLDMQGN VVTFTGNVIVTQGTIKINADKVVVTRPGGEQGKEVIDGYGKPATFYQMQD NGKPVEGHASQMHYELAKDFVVLTGNAYLQQVDSNIKGDKITYLVKEQKM QAFSDKGKRVTTVLVPSQLQDKNNKGQTPAQKKGN >ECs5157 hypothetical protein MTWNPLALATALQTVPEQNIDVTNSENALIIKMNDYGDLQINILFTSRQM IIETFICPVSSISNPDEFNTFLLRNQKMMPLSSVGISSVQQEEYYIVFGA LSLKSSLEDILLEITSLVDNALDLAEITEEYSH >ECs4762 putative alpha helix chain MDFSIMVYAVIALVGVAIGWLFASYQHAQQKAEQLAEREEMVAELSAAKQ QITQSEHWRAECELLNNEVRSLQSINTSLEADLREVTTRMEAAQQHADDK IRQMINSEQRLSEQFENLANRIFEHSNRRVDEQNRQSLNSLLSPLREQLD GFRRQVQDSFGKEAQERHTLTHEIRNLQQLNAQMAQEAINLTRALKGDNK TQGNWGEVVLTRVLEASGLREGYEYETQVSIENDARSRMQPDVIVRLPQG KDVVIDAKMTLVAYERYFNAEDDYTRESALQEHIASVRNHIRLLGRKDYQ QLPGLRTLDYVLMFIPVEPAFLLALDRQPELITEALKNNIMLVSPTTLLV ALRTIANLWRYEHQSRNAQQIADRASKLYDKMRLFIDDMSAIGQSLDKAQ DNYRQAMKKLSSGRGNVLAQAEAFRGLGVEIKREINPDLAEQAVSQDEEY RLRSVPEQPNDEAYQRDDEYNQQSR >ECs1554 putative minor tail protein MAEIKTLHLVPREGMQVSEKPSVVRVRFGDGYEQRRPTGLNPQLKTFQAV FRVTDESTRRWLDEFLSWHGGYRAFLWRPPKHNRTVRGVCREWSVTDNAR YSDFSCTIEQVVN >ECs4776 hypothetical protein MESWLIPAAPVTVVEEIKKSRFITLLAHTDGVEAAKAFVESVRAEHPDAR HHCVAWVAGAPDDSQQLGFSDDGEPAGTAGKPMLAQLMGSGVGEITAVVV RYYGGILLGTGGLVKAYGGGVNQALRQLTTQRKTPLTEYTLQCEYSQLTG IEALLGQCDGKIINSDYQAFVLLRVALPAAKVAEFSAKLADFSRGSLQLL AIEE >ECs0681 putative alpha helical protein MNKVAQYYRELVASLSERLRNGERDIDALVEQARERVIKTGELTRTEVDE LTRAVRRDLEEFAMSYEESLKEESDSVFMRVIKESLWQELADITDKTQLE WREVFQDLNHHGVYHSGEVVGLGNLVCEKCHFHLPIYTPEVLTLCPKCGH DQFQRRPFEP >ECs2075 IpaH-like protein MTNINTACVKNNASYQFNNALPNKETISSNFCERLEQWGNKSLNNGEERA IAVERIKEAYNSNMASLDLSYLDLSELPPIPSTVNTLNLENNCLTCLDFT DNASLVNINLSFNKINTITFPNESNLENIYIDHNNLESLDLKNQHSLVNL EAQNNNLKKLIFLIVIN >ECs4264 hypothetical protein MNYELLTTENAPVKMWTKGVPVEADARQQLINTAKMPFIFKHIAVMPDVH LGKGSTIGSVIPTKGAIIPAAVGVDIGCGMNALRTALTAADLPENLAELR QAIETAVPHGRTTGRCKRDKGAWENPPVNVDAKWAELEAGYQWLTQKYPR FLNTNNYKHLGTLGTGNHFIEICLDESDQVWIMLHSGSRGIGNAIGTYFI DLAQKEMQETLETLPSRDLAYFMEGTEYFDDYLKAVAWAQLFASLNRDVM MENVVTALQSITQKTVRQPQTLAMEEINCHHNYVQKEQHFGEEIYVTRKG AVSARAGQYGIIPGSMGAKSFIVRGLGNEESFCSCSHGAGRVMSRTKAKK MFSVEDQIRATVHVECRKDAEVIDEIPMAYKDIDAVMAAQSDLVEVIYTL RQVVCVKG >ECs1985 putative minor tail protein MQDIHEESLNESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT WQGRQYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG ATVVRRRVYARFLDAVNFVAGNPEADPEQELSDRWVVEQMSQLTAMTASF VLATPTETDGALFPGRIMLANTCMWTYRSDECGYTGGAVADEFDNPTTDI RKDRCSKCMRGCEMRGMAVNFGGFLSINKLSQ >ECs4521 hypothetical protein MLLHILYLVGITAEAMTGALAAGRRRMDTFGVIIIATATAIGGGSVRDIL LGHYPLGWVKHPEYVIIVATAAVLTTIVAPVMPYLRKVFLVLDALGLVVF SIIGAQVALDMGHGPIIAVVAAVTTGVFGGVLRDMFCKRIPLVFQKELYA GVSFASAVLYIALQHYVSNHDVVIISTLVFGFFARLLALRLKLGLPVFYY SHEGH >ECs0225 hypothetical protein MATTRNKVMWQEGMLMRPHHFQQQQRYNDYLDNQRFRAMNDLSWGFTELT LNNELLAQGKIMIDSASGTLPDGTVFSIPDQDALPDPLHPQNFPDERSRN IYLALPVASDVRNEISDGRRIGRYRLNYADVRDLHSEEGDARTLTLGQLT PRIMSGAEDMSAYITLPLCRISDRHADGSLTLDDDFIPSCQNIQVSKKLR VYLKEVQGAIGGRASDLANRIGSPAQSGIADVAEFMMLQLLNRNQTRFTH RARRSQLHPEDFYLDLAELLGELMTFTEPSRLPCPLDVYDHHDLTKTFKT LLPEVKRALHTVLSPRAVNLPLHLRDGIWQADVHDSELLQSATFVLAVAA NMPVDQIQRQFIQQSKISSPEKIRNMVSVQIPGIPLRALMVAPRQLPYHS GFSYFELDKSGQAWTEMAAAGAVALHVSGSFPDLNMQLWAIRG >ECs4093 hypothetical protein MESLSERTSTGYQQIHDGIIHLVDSARTETVRSVNALMTATYWEIGRRIV EFEQGGEARAAYGAQLIKRLSKDLSLRYKRGFSAKNLRQMRLFYLFFQHV EIRQTVSGELTPLPWSTYVRLLSVKNADTRSFYEKETLRCGWSVRQLERQ IATQFYERTLLSHDKSAMLQQHAPAETHILPQQAIRDPFVLEFLELKDEY SESDFEEALINHLMDFMLELGDDFAFVGRQRRLRIDDNWFRVDLLFFHRR LRCLLIVDLKVGKFSYSDAGQMNMYLNYAKEHWTLPDENPPIGLVLCAEK GVGEAHYALAGLPNTVLASEYKMQLPDEKRLADELVRTQAVLEEGYRLR >ECs0221 hypothetical protein MAMDLRDPNVWISHLLENLPEEKLASALKDDNPDWEYIDGEIVKLGSLAH AQLDIPELQRRGLQLLASESKDFRLLAHLLRTLQHAGDPLLALHLLTLYV EHYWTVAAPQNMAHKKRFASQIIKRFETVLKAFHKTLPQRSAILCWVSWR NWRSAGSHITSRNWHRLPMIFLPCTSVRLIVRLLLRSPLRRPPVVHHKPP SRLKAA >ECs2413 hypothetical protein MTLSFITRWRDELPETYTALSPTPLNNARLIWHNTELANTLSIPSSLFKN GAGVWGGETLLPGMSPLAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGT TMDWHLKGAGLTPYSRMGDGRAVLRSTIRESLASEAMHYLGIPTTRALSI VTSDSPVYRETVEPGAMLMRVAPSHLRFGHFEHFYYRREPDKVRQLADFA IRHYWSHLEDDEDKYRLWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSL LGLTLDYGPFGFLNDYEPGFICNHSDHQGRYSFDNQPAVALWILQRLAQT LSPFVAVDALNEALDSYQQVLLTHYGQRMRQKLGFMTEQKEDNALLNELF SLMARERSDYTRTFRMLSLTEQHSAASPLRDEFIDRAAFDDWFARYRGRL QQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTELHRLHEAL RNPFSDRDDDYVSRPPDWGKRLEVSCSS >ECs4300 hypothetical protein MHYAVVVMLELLCCIHAYRTGQERYWIFIIFCFPVIGCVAYFVIVMLPET GADRHGHTLLMRLQDKLNPERHLRKLTEELAIAETNQNHYALANELARLG RYHEAVPHYQQALSGIFAHEAAMMLSLAQAQFAIQEFAACQQTLEDVMRY NPDFQSADGHLLFARTLAAQEKYADAESEFEVLISYYPGPQARIYYAEML AKMSRLREANEQYVAVVDTAKRSRPHYRKHHREWIKTANERLKQSVVQ >ECs3378 hypothetical protein MNTEATHDQNEALTTGARLRNAREQLGLSQQAVAERLCLKVSTVRDIEED KAPADLASTFLRGYIRSYARLVHIPEEELLPGLEKQAPLRAAKVAPMQSF SLGKRRKKRDGWLMTFTWLVLFVVIGLSGAWWWQDHKAQQEEITTMADQS SAELSSNSEQGQSVPLNTSTTTDPATTSTPPASVDTTATNTQTPVVTAPA PAVDPQQNAVVSPSQANVDTAATPAPTAATTPDGAAPLPTDQAGVTTPVA DPNALVMNFTADCWLEVTDATGKKLFSGMQRKDGNLNLTGQAPYKLKIGA PAAVQIQYQGKPVDLSRFIRTNQVARLTLNAEQSPAQ >ECs3933 hypothetical protein MDDIVNSVPSWMFTAIIAVCILFIIGIIFARLYRRASAEQAFVRTGLGGQ KVVMSGGAIVMPIFHEIIPINMNTLKLEVSRSTIDSLITKDRMRVDVVVA FFVRVKPSVEGIATAAQTLGQRTLSPEDLRMLVEDKFVDALRATAAQMTM HELQDTRENFVQGVQNTVAEDLSKNGLELESVSLTNFNQTSKEHFNPNNA FDAEGLTKLTQETERRRRERNEVEQDVEVAVREKNRDALSRKLEIEQQEA FMTLEQEQQVKTRTAEQNAKIAAFEAERRREAEQTRILAERQIQETEIDR EQAVRSRKVEAEREVRIKEIEQQQVTEIANQTKSIAIAAKSEQQSQAEAR ANLALAEAVSAQQNVETTRQTAEADRAKQVALIAAAQDAETKAVELTVRA KAEKEAAEMQAAAIVELAEATRKKGLAEAEAQRALNDAINVLSDEQTSLK FKLALLQALPAVIEKSVEPMKSIDGIKIIQVDGLNRGGTAGDANTGNVGG GNLAEQALSAALSYRTQAPLIDSLLNEIGVSGGSLAALTSLLTPTTPVAE NVE >ECs2850 putative galactokinase MKLLILGNHTCGNRGDSAILRGLLDAINILNPHTEVDVMSRYPVSSSWLL NRPVMGDPLFLQMKQHNSAAGVVGRVKKVLRRRYQHQVLLSRVTDTGKLR NIAIAQGFTDFVRLLSGYDAIIQVGGSFFVDLYGVPQFEHALCTFMAKKP LFMIGHSVGPFQDEQFNQLANYVFGHCDALILRESVSLDLMKRSNITTAK VEHGVDTAWLVDHHTEDFTASYAVQHWLDVAAQQKTVAITLRELAPFDKR LGTTQQVYEKAFAGVVNRILDEGYQVIALSTCTGIDSYNKDDRMVALNLR QHISDPARYHVVMDELNDLEMGKILGACELTVGTRLHSAIISMNFATPAI AINYEHKSAGVMQQLGLPEMAIDIRHLLDGSLQAMVADTLGQLPALNARL SEAVSRERQTGMQMVQSVLERIGEVK >ECs4308 hypothetical protein MSAVKKQRIDLRLTDDDKSMIEEAAAISNQSVSQFMLNSASQRAAEVIEQ HRRVILNEESWTRVMDALSNPPSPGEKLKRAAKRLQGM >ECs1555 putative minor tail protein MQNIHEESLNESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT WQGRQYQVYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG ATVVRRRVYARFLDAVNFVAGNPEADPEQELRDRWVVEQMSELTAMTASF VLATPTETDGALFPGRIMLANTCMWDYRGDECGYHGPAVADEFDNPTTDI RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ >ECs2670 hypothetical protein MCGRFAQSQTREDYLALLAEDIERDIPYDPEPIGRYNVAPGTKVLLLSER DEHLHLDPVFWGYAPGWWDKPPLINARVETAATSRMFKPLWQHGRAICFA DGWFEWKKEGDKKQPYFIYRADGQPVFMAAIGSTPFERGDEAEGFLIVTA AADQGLVDIHDRRPLVLSPEAARKWMRQEISGKEASEIAASGCVPANQFS WHPVSRAVGNVKNQGAELIQPV >ECs4986 hypothetical protein MADIAVVWDQGCGSLQLNGADLLTDNSLLTAVIISLFTDRRALDSDEIPD GTRDRRGWWGDSFRERPVGSRLWLLSREKTLSSVVSRAQAYADEALAWLH KSGAATSVVCHAMRVGHARLSLSVKITLPDGSRHPMIFYADMKGE >ECs1804 minor tail protein MAEIKTLHLVPREGMQVSEKPSVVRVRFGDGYEQRRPTGLNPQLKTFQAV FRVTDESTRRWLDEFLSWHGGYRAFLWRPPKHNRTVRVVCREWSVTDNAR YSDFSCTIEQVVN >ECs2161 putative host specificity protein MGKGGGKGHTPVEAKDNLKSTQMMSVIDAIGEGPIEGPVKGLQSILVNKT PLTDTDGNPVIHGVTAVWRAGEQEQTPPEGFESSGAETALGVEVTKAKPV TRTITSANIDRLRVTFGVQSLLETTSKGDRNHSSVRLLIQLQRNGNWVTE KDVTINGKTTSQFLASVILDNLPPRPFNIRMVRETADSTTDQLQNRTLWS SYTEIIDVKQCYPNTAIVGLQVDAEQFGGQQMTVNYHIRGRIIQVPSNYD PEKRTYSGIWDGSLKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWA LYAIAQYCDQMVPDGFGGTEPRMTFNAYLSQQRKAWDVLSDFCSAMRCMP VWNGQTLTFVQDSPSDVVWPYTNSDVVVDDNGVGFRYSFSALKDRHTAVE VNYTDPQNGWQTSTELVEDPEAILRYGRNLLKMDAFGCTSRGQAHRAGLW VIKTGLLETQTVDFTLGSQGLRHTPGDIIEICDNDYAGTMTGGRILSIDA ASRTLTLDREVTLPETGAATVNLINGSGKPVSVAITAHPAPDRIQVSTLP DGVETYGVWGLSLPSLRRRLFRCVSIRENTDGTFAITAVQHVPEKEAIVD NGAHFDGDQSGTLNSVIPPAVQHLTVEVSAADSQYLAQAKWDTPRVVKGV RFSLRLTSGSGQDSRLVTTAITADTEHRFSGLPLGEYTLTVRAINSYGQQ GEPAITTFRINAPAKPATIELTPGYFQITAVPVLAVYDPTVQFEFWFSEK RITNTAQVEKSARYLGSGSQWTVQGSRIKPGTDFWFYVRSVNLVGKSAFV EVSGQPSNDGEGYLEFFREKIGKLHLAQGLWELIDNSQLADEMAEMKTTI TETRNEITQTVSKTLENQSATIQQIQRVQKDTNDDLAALYMLKVQKTKDG IPYVAGIGAGIEDTDGQPLSNILLLADRIAMINPESGNSTPLFVAQGNQL FMNDVFLKRLFAVSITSSGNPPTFSLTPDGRLTAKNADISGSVNANSGTL NNVTINENCQIKGKLSANQIEGDIVKTVSKSFPRTSTYASGTITVRISDD QKFDRQVMIPPVLFRGGKHENFNSNNQQSYWYSTCRLRVTRNGQEIFNQS TTDAQGVFSSVIDMPAGQGTLTLTFTVSSSGANNWTPTTSISDLLVVVMK KSTAGISIS >ECs0455 putative glycoprotein MNFLAHLHLAHLAESSLSGNLLADFVRGNPEESFPPDVVAGIHMHRRIDV LTDNLPEVREAREWFRSETRRVAPITLDVMWDHFLSRHWSQLSPDFPLQE FVCYAREQVMTILPDSPPRFINLNNYLWSEQWLVRYRDMDFIQNVLNGMA SRRPRLDALRDSWYDLDAHYAALETRFWQFYPRMMAQASRKAL >ECs5309 hypothetical protein MEAKECKVQDILTENKKFIIPSYQRPYSWTVDNAEQLIDDIYKSSQSEEN GYFIGSMICINKGQNQYEVVDGQQRLTTLSIIVSELKKSSRFRG >ECs2724 putative minor tail protein MKTFRWKVKPDMEVNSQPSVREVRFGDGYSQRMAAGLNADLKTYRVTLSV TREEARHLEAFLAEHGGWKAFLWKPPYAYRQIKVTCAGWSARVGMLRVEF SAEFKQVVN >ECs3109 hypothetical protein MQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLT IPDSSATQRVSASPATTGARTMTVWTQDLIYAGDPVHYHGSRATEGTLSW RQAMAQAGKGERYDQILAFAYPDNSLSRWGAPRTTCQLLPKAKAWLAKKM PQWRRILQGETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRL DLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >ECs1692 hypothetical protein MGIIAWIIFGLIAGIIAKLIMPGRDGGGFFLTCILGIVGAVVGGWLATMF GIGGSISGFNLHSFLVAVVGAILVLGVFRLLRRE >ECs5089 hypothetical protein MPLSPYLSFAGNCSDAIAYYQRTLGAELLYKISFGEMPKSAQDSAENCPS GMQFPDTAIAHANVRIAGSDIMMSDAIPSGKASYSGFTLVLDSQQVEEGK RWFDNLAANGKIEMAWQETFWAHGFGKVTDKFGVPWMINVVKQQPTQ >ECs2556 hypothetical protein MANWLNQLQSLLGQSSSSTSSSADQGLGKLLVPGALGGLAGLLVANKSAR KLLTKYGTNALLIGGGAVAGTVLWNKYKDKIRAAHQDEPQFGSQSTPLDE RTERLILALVFAAKSDGHIDAKERAAIDQQLREAGVEEQGRVLIEQAIEQ PLDPQRLATGVRNEEEALEIYFLSCAAIDIDHFMERSYLNALGDALKIPQ DVREGIERDLEQQKRTLAE >ECs3599 tRNA pseudouridine synthase D MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILK NGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPD LSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLID ICVKGVPNYFGAQRFGIGGSNLQGALRWAQTNTPVRDRNKRSFWLSAARS ALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELVELQRRVND KELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRA MLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >ECs1568 hypothetical protein MLPTSQLRPTGTFCSYSAETSADIKSEITPIQIEEARASGRLYIKDCDIE YLPQLPNEITSVTIENCNNLTTLTGLPVNTQNLSVINCEKLQITDMPSTV KNLHIELTDSPFIHFISEGIECLTVCHCYISGVPESVRYLEIKGSATDSI KNVPNGLSSLSINSYNPENQARIDNLISPSLKTLSLTGCSNIILPEKLPE SVTSVTIHAEQKTTWNIGVEGMPDGLDLDLQNVLLSPDVVKAKNITFQGN ALDVALHFREGDIVYGLSSPREKLVNSIKLVNDFSKKDIITQNTLTNAVW DPRTPRKYKQDPLIKRALNEHERGIKFKQHLKNHNNYNVTMADLSVYNRD KLWAKTSKAGLEFQTLTRNKTVIFCADELVNSLKLIANKSEGYGQSITAS ELRWIYRNKDNNQIMKNIKFYLHGKEIPAERILDTPEWKDYRPKYSGSTY KYS >ECs3112 hypothetical protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRW YQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQ NWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLM VWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIY RLNFLAR >ECs2578 hypothetical protein MIYIGLPQWSHPKWVRLGITSLEEYARHFNCVEGNTTLYALPKPEVVLRW REQTTDDFRFCFKFPATISHQAALRHCDDLVTEFLTRMSPLAPRIGQYWL QLPATFGPRELPALWHFLDSLPGEFNYGVEVRHPQFFAKGEEEQTLNRGL HQRGVNQVILDSRPVHAARPHSEAIRDAQRKKPKVPVHAVLTATNPLIRF IGSDDMTQNRELFQVWLQKLAQWHQTTTPYLFLHTPDIAQAPELVHTLWE DLRKTLPEIGAVPAIPQQSSLF >ECs3198 putative lipoprotein MGTIVLVALGVIVLPGLLDGQKKHYQDEFAAIPLVPKAGDRDEPDMMPAA TQALPTQPPEGAAEEVRAGDAAAPSLDPATIAANNTEFEPEPAPVVPPKP KPVEPPKPKVEAPPAPKPEPKPVVEEKAAPTGKAYVVQLGALKNADKVNE IVGKLRGAGYRVYTSPSTPVQGKITRILVGPDASKDKLKGSLGELKQLSG LSGVVMGYTPN >ECs0999 hypothetical protein MSLPHLSLADARNLHLAAQGLLNKPRRRASLEDIPATISRMSLLQIDTIN IVARSPYLVLFSRLGNYPAQWLDESLARGELMEYWAHEACFMPRSDFRLI RHRMLAPEKMGWKYKDAWMQEHEAEIAQLIQHIHDKGPVRSADFEYPRKG ASGWWEWKPHKRHLEGLFTAGKVMVIERRNFQRVYDLTHRVMPDWDDERD LVSQTEAEIIMLDNSARSQGIFREQWLADYYRLKRPALAAWREARAEQRQ IIAVHVEKLGNLWLHADLLPLLERALAGKLTATHSAVLSPFDPVVWDRKR AEQLFDFSYRLECYTPAPKRQYGYFVLPLLHRGQLVGRMDAKMHRQTGIL EVISLWLQEGIKPTTTLQKGLRQAITDFANWQQATRVTLGHCPQGLFTDC RTGWEIDPVA >ECs1645 minor tail protein MQDIRQETLNECTRAEQSASVVLWEIDLTEVGGERYFFCNEQNEKGEPVT WQGRQYQPYPIQGSGFELNGKGTSTRPTLTVSNLYGMVTGMAEDLQSLVG GTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASF VLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYHGPAVADEYDQPTSDI TKDKCSKCLSGCKFRNNVGNFGGFLSINKLSQ >ECs5103 hypothetical protein MTRTLKPLILNTGALALTLILIYTGISAHDKLTWLMEVTPVIIVVPLLLA TAKRYPLTPLLYTLIFFHAIILMVGGQYTYAKVPIGFEVQEWLGLSRNPY DKLGHFFQGLVPALVAREILVRGMYVRGRKMVAFLVCCVALAISAMYELI EWWAALAMGQGADDFLGTQGDQWDTQSDMFCALLGALTTVIFLARFHCRQ LRRFGLITG >ECs1040 hypothetical protein MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENE PVLVNGWIDKHMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIV WQRLAGLAQRRGKTLSETIVQLIEDAENKEKYANKMSSLKQDLQALLGKE >ECs2695 hypothetical protein MSFMVSEEVTVKEGGPRMIVTGYSSGMVECRWYDGYGVKREAFHETELVP GEGSRSAEEV >ECs4051 hypothetical protein MITAPVEALGFELVGIEFIRGRTSTLRIYIDSEDGINVDDCADVSHQVSA VLDVEDPITVAYNLEVSSPGLDRPLFTAEHYARFVGEEVTLVLRMAVQNR RKWQGVIKAVDGEMITVTVEGKDEVFALSNIQKANLVPHF >ECs3390 hypothetical protein MGLKWTDSREIGEALYDAYPDLDPKTVRFTDMHQWICDLEDFDDDPQASN EKILEAILLVWLDEAE >ECs0230 hypothetical protein MILDRERTTGSLFERMEASSARNRQGGSIHSLRQSIRQNLRNILNTRSGS CRGAPELGIDEPEGAENFRESMSRAIEQCIERYEPRISHAEVQAVVSSAS SPLDMTFHITAWVTFNETHEVLEFDMAPNGSQHYRVD >ECs5255 hypothetical protein MMSHAPMGMAGNVTFVHNGKAYVTGGVNQNIFNGYFEDLNEAGKDSAAID KINAHYFDKKAEDYFFNKFLLSFDPSTQQWIFTCRLEQQKICEHDCNYFT AWDINGFQ >ECs4745 hypothetical protein MAHRLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTV NPQTERVKMYWQKANGEAWGTLHALLADMNSQGQVQMAMNGGIYDESYAP LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK EIQFAVQSGPMLMENSVINPRIHPNVASRKIRNGVGINKHGNAVFLLSQQ ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV ERKG >ECs4987 hypothetical protein MMPYQPLPLAQLITQTQQDISQRLPGSQPGVNETTLNAIAYALAGLSAQE HEHLAWISRQIIPTEADEAELLKHCAFWGVIRKPASRADGPVQLMLTTDA GITEGVLLQRSDGVVYRITGSATGKAGTLNVNVEAESAGRAGNTPTGTRL SFITPQAGINQTATVTGTGLTGGADVETVPELLSRLVFRVQNPPSGGTQY DFERWAREVPGVTRAWCKPEWPEAGSVGVTFVQDNNPDIFPGEGDVKRVA DYIRSHDDPATGQPVGQPLGPTISVFKLTNKPVAFEIRIVPKTPENQAAV KQALTDLLYNESRPGGLVLPSSFWRAVAGVKGLEDFEVRSPLKSVMAGDT ELLTVGEITWL >ECs4703 acetolactate synthase II small subunit MMQHQVNVSARFNPETLERVLRVVRHRGFHVCSMNMAAASDAQNINIELT VASPRSVDLLFSQLNKLVDVAHVAICQSTTTSQQIRA >ECs5160 hypothetical protein MHILDSLLAFSAYFFIGVAMVIIFLFIYSKITPHNEWQLIKNNNTAASLA FSGTLLGYVIPLSSAAINAVSIPDYFAWGGIALVIQLLVFAGVRLYMPAL SEKIINHNTAAGMFMGTAALAGGIFNAACMTW >ECs4846 hypothetical protein MTIQQWLFSFKGRIGRRDFWIWIGLWFAGMLVLFSLAGKNLLDIQTAAFC LVCLLWPTAAVTVKRLHDRGRSGAWAFLMIVAWMLLAGNWAILPGVWQWA VGRFVPTLILVMMLIDLGAFVGTQGENKYGKDTQDVKYKADNKSSN >ECs4230 DamX MDEFKPEDELKPDPSDRRTGRSRQSSERSERTERGEPQINFDDIELDDTD DRRPTRAQKERNEEPEIEEEIDESEDETVDEERVERRPRKRKKAASKPAS RQYMMMGVGILVLLLLIIGIGSALKAPSTTSSDQTASGEKSIDLAGNATD QANGVQPAPGTTSAENTQQDVSLPPISSTPTQGQTPAATDGQQRVEVQGD LNNALTQPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASRDTAKTQTAER PATTRPARQQAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAA TSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTL QLSSSSNYDNLNGWAKKENLKNYVVYETTRNGQPWYVLVSGVYASKEEAK KAVSTLPADVQAKNPWAKPLRQVQADLK >ECs5377 hypothetical protein MFFRNRKNIAMTIKEKFSQKYPHASFCTFGDSAALADHLATLIATGVKTA SCGSLAGCIEDNAFPMIGEYKIVENSRGEPVCVIRVIGLHLLRFSDVTAE LARKEGEGDLSLEYWRNEHRRFFQAEGSYSPEMDVIFEEYALIDVV >ECs1247 hypothetical protein MLELLDERERNQQYIKRRDQENEEIALTVGKLRVELGAAENNLIDSECHV AELEEALRDKQALLEASEKRIAELEAELVSQTYKLPHTQFEQIANLYEMQ FDDGRTCAFHTDAQKAEQWLQACDGNRVQEYVKLERLQNALSGNSPVTPD GWISCSERMPDTKTAVLVAVEFDRKGDWRMKWATYIPGHPDANDGWIIPG ASWKPSHWMPLPEPPQEVN >ECs1000 hypothetical protein MDHRLLEIIACPVCNGKLWYNQEKQELICKLDNLAFPLRDGIPVLLETEA RVLTADESKS >ECs5310 hypothetical protein MQKRVLPIDVYSDETDEPRLIVRKKEHDLYKYYILQDSKDYKPEKPSDTE LVFISNAETIRDYLLRLSVDELKLLAKYILQNVYIVFVQTDDFASSFRLF NVLNSRGLPLSNADLLKNALFESASTHNKKSEQIESAWSQIEDMVGVRRL DKFLTLHKLSEKKDRDRVLQKGFEAFIENLQQQFDGDAIAMSLMLVNSAK NYTKILENDFEHPSIRRKIASLSNLGVDEWIPPVMAFMNRMARTEDFNLD DFSQFITAFEKVYMHGWLKKQIKSQREMVCYSALVAINNDMPFDSVINQI NQHADNSGFIAALDEDLYEPRPNQVNLIKAILLRLDMEQQDESVIKTYTG RITIEHILPQALVNEYWINRFQPQEHVYWLHKIGNLTLISGSKNSEAQHY DFIKKKSIYEKLNSKSSFDLTKDVCNSSEWGLAELKMRHEKMKTQLKKLW LV >ECs1987 putative tail assembly protein MATTNAFSLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQVPG FRRQMNEGWYQIRIAGDDTAPEAVYARLHEQLGEGTVIHIVPRLAGAGKG GLQIVLGAAAIVGSFFTAGASMAAWGAALSAGGFSATTMLFSLGASMILG GVAQMLAPKPKTPEYRATDNGKQNTYFSSLDNMIAQGNPMPVPYGEMLVG SRRISQDISTRDEGGGGKVVVIGRQA >ECs0061 hypothetical protein MKVSVPGMPATLLNMSNNDIYKMVSGDKMDMKMNIFQRLWETLRHLFWSD KQTEAYKLLFNFVNKKAGNINASKYFTGAVNENEKEKFIHSLELFNELKT CAKNPDEMVAKGNMSWVAQTFGDIELSVTFFIENKEICTQTLQLHKGPGN LGVDLREAYLPGVDMRDCYLGLKTMKGHNKVLYLEPGWNANLDGATLDGA TLDGATVDGATHLYDEVIIINKITPKKIDTEEVATKQSTAEQITDNAIIE >ECs4922 hypothetical protein MLQNPIHLRLERLESWQHVTFMACLCERMYPNYAMFCQQTGFGDGQIYRR ILDLIWETLTVKDAKVNFDSQLEKFEEAIPSADDFDLYGVYPAIDACVAL SELVHSRLSGETLEHAVEVSKTSITTVAMLEMTQAGREMSDEELKENPAV EQEWDIQWEIFRLLAECEERDIELIKGLRADLREADESNIGIIFQQ >ECs3652 hypothetical protein MTTHDRVRLQLQALEALLREHQHWRNDEPQPHQFNSTQPFFMDTMEPLEW LQWVLIPRMHDLLNNNQPLPGAFAVAPYYEMALATDHPQRALILAELEKL DALFADDAS >ECs3178 hypothetical protein MIQESTMEMTNAQRLILSNQYKMMTMLDPANAERYRRLQTIIERGYGLQM RELDREFGELKEETCRTIIDIMEMYHALHVSWSNLQDQQSIDERRVTFLG FDAATEARYLGYVRFMVNVEGRYTHFDAGTHGFNAQTPMWEKYQRMLNVW HACPRQYHLSANEINQIINA >ECs4597 hypothetical protein MFKTKWFAREARSHAITDEELCRAILETEQGKADALGGGVFKKRLHQNRE RAIILAKGVSNWFYTFLYAKQDMSNINSQELAGFREIAKHYAFLTKAQLT AMINTKELTEICYDCKN >ECs5304 hypothetical protein MKATEARLLDFLKRSQQFVIPIYQRTYSWTEQQCRQLWDDIIRAGKRDDI SAHFIGSVVYIEQGLYQVSGISPLLVIDGQQRLTTAMLLIEALSRHLGED EVFDGFSAMKLRNYYLLNPYESGEKGFKLLLTETDKDSLLALIKQRPMPE NYSHRIMENFTFFDEQIAKLGDDLIPLCRGLAKLLIVDVALNRGQDNPQL IFESMNSTGKALSQADLVRNFILMGLEPEHQTRLYEDHWRPMEVACGQQG YSEYFDSFMRHYLTVKTGRSLGQMKSMRHLNSMPAARVLLKKA >ECs3375 hypothetical protein MEIYENENDQVEAIKRFFAENGKALAVGVILGVGALIGWRYWNSHQVDSA RSASLAYQNAVTAVSEGKPDSIPAAEKFAAENKNTYGALASLELAQQFVD KNELEKAAAQLQQGLADTSDENLKAVINLRLARVQVQLKQADAALKTLDT IKGEGWAAIVADLRGEALLSKGDKQGARSAWEAGVKSDVTPALSEMMQMK INNLSI >ECs4199 hypothetical protein MQDLSLEARLAELESRLAFQEITIEELNVTVTAHEMEMAKLRDHLRLLTE KLKASQPSNIASQAEETPPPHY >ECs4784 putative GTP-binding protein MIRKSATGVIVALAVIWGGGTWYTGTQIQPGVEKFIKDFNDAKKKGEHAY DMTLSYQNFDKGFFNSRFQMQMTFDNGAPDLNIKPGQKVVFDVDVEHGPL PITMLMHGNVIPALAAAKVNLVNNELTQPLFIAAKNKSPVEATLRFAFGG SFSTTLDVAPAEYGKFSFGEGQFTFNGDSSSLSNLDIEGKVEDIVLQLSP MNKVTAKSFTIDSLARLEEKKFPVGESESKFNQINIINHGEDVAQIDAFV AKTRLDRVKDKDYINVNLTYELDKLTKGNQQLGSGEWSLIAESIDPSAVR QFIIQYNIAMQKQLAAHPELANDEVALQEVNAALFKEYLPLLQQSEPTIK QPVRWKNALGELNANLDISIADPAKSSSSTNKDIKSLNFDVKLPLNVVTE TAKQLNLSEGMDAEKAQKQADKQISGMMTLGQMFQLITIDNNTASLQLRY TPGKVVFNGQEMSEEEFMSRAGRFVH >ECs1559 putative tail assembly protein MATTNAFSLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQMPG FRRQMNEGWYQTRIPTASQNRSPAPVFVVMSGEYLLDKKEEIIQ >ECs4150 hypothetical protein MFDVLMYLFETYIHTEAELRVDQDKLEQDLTDAGFDREDIYNALLWLEKL ADYQEGLAEPMQLASDPLSMRIYTPEECERLDASCRGFLLFLEQIQVLNL ETREMVIERVLALDTAEFDLEDLKWVILMVLFNIPGCENAYQQMEELLFE VNEGMLH >ECs3409 hypothetical protein MNSLRYFDFGAARPVLLLIARIAVVLIFIIFGFPKMMGFDGTVQYMASLG APMPMLAAIIAVVMEVPAAILIVLGFFTRPLAVLFIFYTLGTAVIGHHYW DMTGDAVGPNMINFWKNVSIAGAFLLLAITGPGAISLDRR >ECs4655 hypothetical protein MQYITFIACFFSHENMKYSTFHDINLDMCEIKNCNFNNSEMNFISCVGTN FSGSTFNNVKTTTAQLIKTPTKWTNNTLKYWFSSCNKRNIIFTFNTISDR NMKLKGIKDILLSLVDQKVNIYSVRQELLNFLNNDLYKNDGEILSYKESI MLFCAE >ECs1674 hypothetical protein MFCVIYRSSKRDQTYLYVEKKDDFSRVPEELMKGFGQPQLAMILPLDGRK KLVNADIEKVKQALTEQGYYLQLPPPPEDLLKQHLSVMGQKTDDTNK >ECs4035 hypothetical protein METLTAISRWLAKQHVVTWCVQQEGELWCANAFYLFDVQKVAFYILTEEK TRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKA YNRRFPVARMLSAPVWEIRPDEIKFTDNTLGFGKKMIWLRGSGTEQA >ECs3526 hypothetical protein MGLFNFVKDAGEKLWDAVTGQHDKDDQAKKVQEHLSKTGIPDADKVNIQI ADGKATVTGDGLSQEAKEKILVAVGNISGIASVDDQVKTATPATASQFYT VKSGDTLSAISKQVYGNANLYNKIFEANKPMLKSPDKIYPGQVLRIPEE >ECs3110 hypothetical protein MNWRRIVWLLALVTLPTLAEETPLQLALRGAQHDQLYQLSSSGVTKVSAL PDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESIT RDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLSTLKPETSV TVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWF ADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQ VASGQCVEVELFAR >ECs1445 hypothetical protein MKYQLTALEARVIGCLLEKQVTTPEQYPLSVNGVVTACNQKTNREPVMNL SESEVQEQLDNLVKRHYLRTVSGFGNRVTKYEQRFCNSEFGDLKLSAAEV ALITTLLLRGAQTPGELRSRAARMYEFSDMAEVELTLEQLANREDGPFVV RLAREPGKRESRYMHLFSGEVEDQPAVTDMSNAVDGDLQARVEALEIEVA ELKQRLDSLLAHLGD >ECs0607 Vgr MSTGLRFTLEVDGLPPDAFAVVSFHLTQSLSSLFSLDLSLVSQQFLSLEF AQVLDKMAYLTIWQGDDVQRRVKGVVTWFELGENDKNQMLYSMKVHPPLW RAGLRQNFRIFQNEDIKSILGTILQENGVTEWSPLFSEPHPSREFCVQYG ETDYDFLCRMAAEEGIFFYEEHAYKSTDQSLVLCDTVRHLPESFEIPWNP NTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQHQDY QRTQYEVYDYPGRFKGAHGQNFARWQMEGWRNNAETARGMSRSPEIWPGR RIVLTGHPQANLNREWQVVASELHGEQPQAVPGRRGAGTALENHFAVIPA DRTWRPQPRLKPLVDGPQSAVVTGPEGEEIFCDEHGRVRVKFNWDRYNPA DQDSSCWIRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRTY HQENRTPGSLPGTKTQMTIRSKTYMGSGFNELKFDDATGREQVYIHAQKN MDTEVLNDRTTTVKHDHRETVKNDQTVTIQEGNRLLTVEKGHKITGVLKG SLSEDVFQDRGTIAGSVHVDAVNNGGEGNGIQAYTAIKEIMLAVEESKIA LTPDGIQLQVGESTVIRLSKDGITIVGGSVFIN >ECs0232 hypothetical protein MTAHKNISGTYHLSVADILQVVYQVCFSPSVEINQDGVAALITTLDRRIS DLLDEIIHFCEFQQSASHWQRVLH >ECs4388 putative transport ATPase MTAEFIIRLILAAIACGAIGMERQMRGKGAGLRTHVLIGMGSALFMIVSK YGFADVLSLDHVGLDPSRIAAQVVTGVGFIGAGNILVRNQNIVGLTTAAD IWVTAAIGMVIGSGMYELGIYGSVMTLLVLEVFHQLTFRLMNKNYHLQLT LVNGNTVSMLDWFKQQKIKTDLVSLQENEDHEVVAIDIQLHATTSIEDLL RLLKGMAGVKGVSIS >ECs3893 hypothetical protein MAVIQDIIAALWQHDFAALADPHIVSVVYFVMFATLFLENGLLPASFLPG DSLLILAGALIAQGVMDFLPTIAILTAAASLGCWLSYIQGRWLGNTKTVK GWLAQLPAKYHQRATCMFDRHGLLALLAGRFLAFVRTLLPTMAGISGLPN RRFQFFNWLSGLLWVSVVTSFGYALSMIPFVKRHEDQVMTFLMILPIALL TAGLLGTLFVVIKKKYCNA >ECs0069 hypothetical protein MQALLEHFITQSTVYSLMAVVLVAFLESLALVGLILPGTVLMAGLGALIG SGELSFWHAWLAGIVGCLLGDWISFWLGWRFKKPLHRWSFLKKNKALLDK TEHALHQHSMFTILVGRFVGPTRPLVPMVAGMLDLPVAKFITPNIIGCLL WPPFYFLPGILAGAAIDIPAGMQSGEFKWLLLATAVFLWVGGWLCWRLWR SGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLVRHPLMPVYIDILRK VVGV >ECs3113 hypothetical protein MVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETIPVYQLRYNG NNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIASDLLSGKKRWQASF GLEERAAEKTPVRQRIVASARLLGFGYQRLMPSFAGVRFEMGNDGWHSFV ALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQEN DKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGA HESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYF FRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYIN PQGVAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLA QMEPGAAWQWLPIIWQPL >ECs2501 hypothetical protein MKGRTNTMNIQCKRVYDPAEQSDGYRVLVDRLWPRGIKKTDLALDEWDKE ITPSTELRKAFHGEVVDFATFREQYLAELAQHEQEGKRLADIAKKQPLTL LYSAKNTTQNHALVLADWLRSL >ECs0619 hypothetical protein MPLPDFHVSEPFTLGIELEMQVVNPPGYDLSQDSSMLIDAVKNKITAGEV KHDITESMLELATDVCRDINQAAGQFSAMQKVVLQAAADHHLEICGGGTH PFQKWQRQEVCDNERYQRTLENFGYLIQQATVFGQHVHVGCASGDDAIYL LHGLSRFVPHFIALSAASPYMQGTDTRFASSRPNIFSAFPDNGPMPWVSN WQQFEALFRCLSYTTMIDSIKDLHWDIRPSPHFGTVEVRVMDTPLTLSHA VNMAGLIQATAHWLLTERPFKHKEKDYLLYKFNRFQACRYGLEGVITDPY TGDRRPLTEDTLRLLEKIAPSAHKIGASSAIEALHRQVVSGLNEAQLMRD FVADGGSLIGLVKKHCEIWAGD >ECs2945 putative tail assembly protein MATTNAFCLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQMPG FRRQMNEGWYQIRIRGEDTAPEAVYARLHEPLGEGAVIHIVPRLAGAGKG GLQIVLGAAAIVGSFFTAGATMALWGAALSAGGLTATTMLFSLGASMILG GVAQMLAPKAKTPEYRATDNGKQNTYFSSLDNMIAQGNPMPVPYGEMLVG SRRISQDISTRDEGGDGKVVVIGRG >ECs1719 hypothetical protein MRSLADFEFNKAPLCEGMILACEAIRRDFPSQDVYDELERLVSLAKEEIS QLLPLEEQLEKLIALFYGDWGFKASRGVYRLSDALWLDQVLKNRQGSAVS LGAVLLWVANRLDLPLLPVIFPTQLILRIECPDGEIWLINPFNGESLSEH MLDVWLKGNISPSAELFYEDLDEADNIEVIRKLLDTLKASLMEENQMELA LRTSEALLQFNPEDPYEIRDRGLIYAQLDCEHVALNDLSYFVEQCPEDPI SEMIRAQINNIAHKHIVLH >ECs1990 putative host specificity protein MGKGGGRAHTPREAKDNLKSTQMMSVIDAIGEGPIEGPVKGLQSILVNKT PLTDTDGNPVIHGVTAVWRAGEQEQTPPEGFESSGAETGLGVEVTKAKPV TRTITSANIDRLRVTFGVQSLVETTSKGDRNPTSVRLLIQLERGGKWMTE KDVTINGKTTSQFLASVILDNLPPRPFNIRMVRETADSTTDQLQNKTLWS SYTEIIDVKQCYPNTAIVGLQVDAEQFGGQQMTVNYHIRGRIIQVPSNYD PEKRTYSGIWDGSLKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWA LYAIAQYCDQTVPDGFGGTEPRMTFNAYLSQQRKAWDVLSDFCSAMRCMP VWNGQTLTFVQDRPSDVVWPYTNSDVVVDDNGVGFRYSFSALKDRHTAVE VNYTDPQNGWQTSTELVEDPEAILRYGRNLLKMDAFGCTSRGQAHRAGLW VIKTGLLETQTVDFTLGSQGLRHTPGDIIEICDNDYAGTMTGGRILSIDA ASRTLTLDREVTLPETGAATVNLINGSGKPVSVAITAHPAPDRIQVSTLP DGVETYGVWGLSLPSLRRRLFRCVSIRENTDGTFAITAVQHVPEKEAIVD NGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGV SFLLRLTVAADDGSERLVSTARTTETTYRFTQLAPGNYRLTVRAVNAWGQ QGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSE KRIADIRQVETTARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAF VEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLTEIRTS ITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQD GRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQ IFMNEVFLKYLTAPTITSGGNPPAFSLTPDGRLTAKNADISGSVNANSGA LNNVTINQNCTIKGMLEATQVRGDFVKAVSKAFPKKVGTWGNTETPNGTV TVTISDDHNFDRQIIIPPIIFNGIAYDDPGSGNNPGGTRYTGYGFEVRKN GVLIASRETKGAIPGSYSAVIDMPSGGGSVTLEFKIFQKGNQGAGNITDC TVIVTKKAASGISIR >ECs1493 hypothetical protein MKQKELWINQIKGLCICLVVIYHSVITFYPHMTTFQHPLSEVLSKCWIYF NLYLAPFRMPVFFFISGYLIRRYIDSVPWGNCLDKRIWNIFWVLALWGVV QWLALSALNQWLAPERDLSNASNAAYADSTGEFLHGMITASTSLWYLYAL IVYFVVCKIFSRLALPLFALFVLLSVAVNFVPTPWWGMNSVIRNLPYYSL GAWFGATIMTCVKEVPLRRHLLMASLLAALAVGAWLFTISLLLSLVSIVV IMKLFYQYEQRFGMRSTSLLNVIGSNTIAIYTTHRILVEIFSLTLLAQMN AARWSPQVELTLLLVYPFVSLFICTVAGLLVRKLSQRAFSDLLFSPPSLP VAVSYSR >ECs5382 hypothetical protein MFDSLAKAGKYLGQAAKLMIGMPDYDNYVEHMRVNHPDQTPMTYEEFFRE RQDARYGGKGGARCC >ECs0231 hypothetical protein MSLQEEELVSSHAGQPEQESSLLDQIMAQTRIQPGSEGYDVARQGVTAFI ASILQSTASAEPVNKLAVDSMIADIDERISRQMDVIIHAPAFQQVESFWR SLKTMVDRVDFRENIKVNVLHVTKQELLEDFEFAPEIIQSGFYKHVYSSG FGQFGGEPIAAVLGAYEFKNTAPDMKLLQYVSAVGAMAHAPFLSSVSPEF MGLNSWTELPNIKDLYAIFEGPAYTKWRALRDSEDSRYLGLTAPRFLLRQ PYSPTDNPVKNFNYYEDVSQNHEDYLWGNTAWMLACNIADSFAKYRWCPN IIGPQSGGAVKDLPVHLFETMGQIQAKIPTEVLVTDRREFELAEEGFITL TMRKDSDNAAFFSANSVQKPKHFPGKDAETNYKLGTQLPYLFIINRLAHY IKVLQREQLGSWKERSDLERELNTWIRQYVADQENPPADVRSRKPLRAAK VEVMDVEGEPGWY >ECs1036 hypothetical protein MTIAALWLAGCSSGEINKNYYQLPVVQSGTQSTASQGNRLLWVEQVAVPD YLAGNGVVYQTSDVKYVIANNNLWASPLDQQLRNTLVANLSTQLPGWVVA SQPLGSAQDTLNVTVTEFNGRYDGKVIVSGEWLLNHQGQLIKRPFRLEGV QTQDGYDEMVKVLAGVWSQEAASIAQEIKRLP >ECs4970 hypothetical protein MLDIPQIASRYIEHPASGITPNRAAQCLRGAERGDLIAQSDLAADIEEKD THLFAELGKRRLAIQSVPWSIEPPPNASANEKKDAEMLDEYLHSADWFDA MLFDATDAILKGYSCMEIEHGMLGKMHIIRAIRWRDSGHFCLNPDDLSEL RLRDGSHAGVAFQPFGWVVHQSRSRTGYGGATGLVRTLIWPFIFKNYSVR DLAEFLEVYGLPMKVGKYPSGATSEQKSALMRAVMDIGRRTGGIIPAGMS LEFQVAANGQADPFETMISWGERSISKAILGGTLTTEAGDKGARSLGEVH NEVRREIRDSDLRQLAATLNRDLVYPLYALNTAHAIDIRRLPRICFQTKE PGDITKITSAVMQLSTGMDIPDPWVRDQTGIPQPAPGEAIFRVRQSGNEP AQTDKEMPPEKQEKTEQTALSARLPEAKSSPRDELDDMGDAVPARRLQDA IDPLLEPVIDAIRTRGLADALADLPALYREMDDSRLMTLLSDAMFAAEMK GMLDGTGD >ECs1592 putative head portal protein MWPFRRKKEQRSMTLDEFMALAGTSNTGAGEYVSSGTAESLPAVMNAVTV ISEAVATMPCYLYLVRNEKGKEAREWLDSHPVDHILNERPNAWQTPYQFK RMMVRHCLLNGNAYAVIQWGRDGFPVALHPYPPQSVNVEQTGEHNWRYCI TDAYTGNTHNYLPWEVLHLRYSTDDGFMGRSPVTICRESLGLGLAQQRHG ASVMRDGMMAAGVITSGEWLDGVKGKQALAALERYKGARNAGKTPILEGG MSYQQLGMSNQDAEWLASRRFTIEDIARMFNVSPIFLQEYSNSTYSNFSE ASRAFLTMTMRPWLANFEQQIKNALLVASPVPGIRYQVEFDSADLLRATP GERFATYERGIKSGVMCPNEAREREGLSPRDGGDEFSQAWKQEVKISEGE KPE >ECs5248 hypothetical protein MIKDAIFSPCGKYRYSLSRVWDESKPYTLFIGLNPSYADAEKDDRTLSRC ISFAKSWGYGGVYMANLFAFVHTQRHEMMKASDPIGKDNDSHLIRLVSGA GLVVAAWGNEGRHLKRSTTVRQLLPESTMCFVLNATGEPKHPLYMKNDSV LIPLG >ECs1654 hypothetical protein MQIKSVEDIFIHLLSDTYSAEKQLTKALSKLSRSAYSDKLTAAFQSHLDE THGQIERIDQVVDSEDGLKLKRIKCAAMEGLIEEANEVIESTDKNEVRDA ALIAAAQKVEHYEIASYGTLVTLAEQLGYKKAAKLLKETLEEEKATDVKL TDLAFNNVNKKAQDNS >ECs2948 putative minor tail protein MKTFRWKVKPDMEVNSQPSVREVRFGDGYSQRMAAGLNADLKTYRVTLSV TREEARHLEAFLAEHGGWKAFLWKPPYAYRQIKVTCAGWSARVGMLRVEF SAEFKQVVN >ECs5355 hypothetical protein MKYKHLILSLSLIMLGPLAHAEEIGSVDTVFKMIGPDHKIVVEAFDDPDV KNVTCYVSRAKTGGIKGGLGLAEDTSDAAISCQQVGPIELSDRIKNGKAQ GEVVFKKRTSLVFKSLQVVRFYDAKRNALAYLAYSDKVVEGSPKNAISAV PVMPWRQ >ECs3968 hypothetical protein MLRAFARLLLRICFSRRTLKIACLLLLVAGATIFIADRVMVNASKQLTWS DVNVVPARNVGLLLGARPGNRYFTRRIDTAAELYHAGKVKWLLVSGDNGR KNYDEASGMQQALIAKGVPAKVIFCDYAGFSTLDSVVRANKVFGENHITI ISQEFHNQRAIWLAKQYGIDAIGFNAPDLEKGRGKIVRLREKLARVSAVI DAKIFNRQPKYLGPSVIIGPFSEHGCPAQK >ECs1860 putative oxidoreductase MLSPIRLSPLPALRQDNDFLYDQGAPMEQRHITGKSHWYHETQSSTTEYD VLPLVPEAAKVSDPFLLDVILEKETLAPFLSWLDPARVLAVELFPDQLTV TRSQTFTAYERLSTALTVAQVCGVQRLCNYYSARLTPLPGPDSTRESNHR LAQITQYARQLASSPSIIDNRSRQHLNDVGLTAWDCVIINQIIGFIGFQA RTIATFQAYLGHPVRWLPGLEIQNYADASLFADESLRWRSSYEVEKLPEE HTKSSTAELCQLAEILSLHPISLSLLERLLNSTRVNTQPDNQLAALLCAR INGSPACFAACMDSSNEYKKISPLLRKGENEINQWADRHSVERATVQAIQ WLTRAPDRFSAAQFSPLLEHEKSSTQIINLLVWSGLCGWINRLKIALGET Y >ECs3480 hypothetical protein MRKRFTVPGKIAVEVAYALPKKQYLQRVTLQEGATVEEAIRASGLLELRT DIDLTENKVGIYSRPAKLSDSVHDGDRVEIYRPLIADPKELRRQRAEKSA NK >ECs1116 putative minor tail protein MQDIHEESLNESVKSEQSPRVVLREIDLTVQGGERYFFCNELNEKGEAVT WQGRQYQVYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG ATVVRRRVYARFLDAVNFVAGNPEADPEQELSDRWVVEQMSELTAMTASF VLATPTETDGALFPGRIMLANTCMWTYRSDECGYTGGAVADEFDKPTTDI RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ >ECs3982 hypothetical protein MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLR SWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG >ECs3927 hypothetical protein MTPLVKDIIMSSTRMPALFLGHGSPMNVLEDNLYTRSWQTLGMTLPRPQA IVVVSAHWFTRGTGVTAMETPPTIHDFGGFPQALYDTHYPAPGSPALAQR LVELLAPIPVTLDKEAWGFDHGSWGVLIKMYPDADIPMVQLSIDSSKPAA WHFEMGRKLAALRDEGIMLVASGNVVHNLRTVKWHGDSSPYPWAMSFNEY VKANLTWQGPVEQHPLVNYLDHEGGALSNPTPEHYLPLLYVLGAWDGQEP ITIPVDGIEMGSLSMLSVQIG >ECs2935 hypothetical protein MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQ RCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLK TDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAAR QHVKKS >ECs5381 hypothetical protein MAFSNPFDDPQGVFYILRNAQGQFSLWPQQCALPAGWDIVCQPQSQESCQ QWLEVHWRTLTPANFTQLQEAQ >ECs3201 hypothetical protein MDLIYFLIDFILHIDVHLAELVAEYGVWVYAILFLILFCETGLVVTPFLP GDSLLFVAGALASLETNDLNVHMMVVLMLIAAIVGDAVNYTIGRLFGEKL FSNPNSKIFRRSYLDKTHQFYEKHGGKTIILARFVPIVRTFAPFVAGMGH MSYRHFAAYNVIGALLWVLLFTYAGYFFGTIPMVQDNLKLLIVGIIVVSI LPGVIEIIRHKRAAARAAK >ECs4205 hypothetical protein MLIPWQDLSPETLENLIESFVLREGTDYGEHERTLEQKVADVKRQLQCGE AVLVWSELHETVNIMPRSQFRE >ECs3780 hypothetical protein MPGYNEMNQYLNQQGTGLTPAEMHGLISGMICGGNDDSSWLPLLHDLTNE GMAFGHELAQALRKMHSATSDALQDDGFLFQLYLPDGDDVSVFDRADALA GWVNHFLLGLGVTQPKLDKVTGETGEAIDDLRNIAQLGYDEDEDQEELEM SLEEIIEYVRVAALLCHDTFTHPQPTAPEVQKPTLH >ECs2496 hypothetical protein MTEMAKGSVTHQRLIALLSQEGADFRVVTHEAVGKCEAVSEIRGTALGQG AKALVCKVKGNGVNQHVLAILAADQQADLSQLASHIGGLRASLASPAEVD ELTGCVFGAIPPFSFHPKLKLVADPLLFERFDEIAFNAGMLDKSVILKTA DYLRIAQPELVNFRRTA >ECs0988 hypothetical protein MTQTFIPGKDAALEDSIARFQQKLSDLGFQIEEASWLNPVPNVWSVHIRD KECALCFTNGKGATKKAALASALGEYFERLSTNYFFADFWLGETIANGPF VHYPNEKWFPLTENDDVPEGLLDERLRAFYDPENELTGSMLIDLQSGNED RGICGLPFTRQSDNQTVYIPMNIIGNLYVSNGMSAGNTRNEARVQGLSEV FERYVKNRIIAESISLPEIPADVLARYPAVVEAIETLEAEGFPIFAYDGS LGGQYPVICVVLFNPANGTCFASFGAHPDFGVALERTVTELLQGRGLKDL DVFTPPTFDDEEVAEHTNLETHFIDSSGLISWDLFKQDADYPFVDWNFSG TTEEEFATLMAIFNKEDKEVYIADYEHLGVYACRIIVPGMSDIYPAEDLW LANNSMGSHLRETILSLPGSEWEKEDYLNLIEQLDEEGFDDFTRVRELLG LATGSDNGWYTLRIGELKAMLALAGGDLEQALVWTEWTMEFNSSVFSPER ANYYRCLQTLLLLAQEEDRQPLQYLNAFVRMYGADAVEAASAAMSGEAAF YGLQPVDSDLHAFAAHQSLLKAYEKLQRAKAAFWAK >ECs3980 hypothetical protein MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQS RYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSR R >ECs2005 hypothetical protein MMKRTLLIAVWAIGLMSDSAMALTLNEARSQGRVGETLNGYLVALQTDAE TQALVKDINEARNHSYQQLAKQNNVSTKEIAKLAGQKLVARAKSGQYVQG VNGKWLRK >ECs0590 hypothetical protein MATFSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEGQVKVDGAVETRKR CKIVAGQTVSFAGHSVQVVA >ECs4445 hypothetical protein MDNKISTYSPAFSIVSWIALVGGIVTYLLGLWNAEMQLNEKGYYFAVLVL GLFSAASYQKTVRDKYEGIPTTSIYYMTCLTVFIISVALLMVGLWNATLL LSEKGFYGLAFFLSLFGAVAVQKNIRDAGINPPKETQVTQEEYSE >ECs2493 hypothetical protein MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGE SVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGS GQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKT HRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA MQHIRDQDDIYPVFRELFHKQNATAKD >ECs0960 putative surface protein MFSGLLIILVPLIVGYLIPLRQQAALKVINQLLSWMVYLILFFMGISLAF LDNLASNLLAILHYSAVSITVILLCNIAALMWLERGLPWRNHHQQEKLPS RIAMALESLKLCGVVVIGFAIGLSGLAFLQHATEASEYTLILLLFLVGIQ LRNNGMTLKQIVLNRRGMIVAVVVVASSLIGGLINAFILDLPINTALAMA SGFGWYSLSGILLTESFGPVIGSAAFFNDLARELIAIMLIPGLIRRSRST ALGLCGATSMDFTLPVLQRTGGLDMVPAAIVHGFILSLLVPILIAFFSA >ECs0025 hypothetical protein MTDGISTSPHCLYKSNIVDDVIINKTRQNELVKVFCEYKTEFLILFDDFF RSQDLPKPSPVLHHFFQYTHLRDAHFYRCKLIEHTVQFSFFKHKGITLLR LDVFDDRTSECLSEEIKIYQECHEKFIKFLKANFNQEIYPELYTPEIFYE ACRNLQSFYDHQETSQNAKYSAIVKKKSYFNKEIRNLIKKNIYPELYNEQ CNKIPASSTDDNQKITWQNFKTSNAAYSQLCEKLSLLKSSPSRLIEKSAY CSNENMITDKFDVVFSYCGDNVKEFILLLPYNKSLEMHELNEQNIQYLTA LNINIHKLLLSNITIEKSNLSYGYYFGCVLSNISCFESDLSNTIFSNGEI NNLFIKKSNIFGTSFTNTMIKNLRCEDIMPGRWTTQLVNKHLGYRYTGVF KTLASIDDKPSRFEILIPLVQTLVRDNVKLNNDVYKELKKFMHDYDKTSP EMRKYLQSINESMFLMKKISHQD >ECs4584 hypothetical protein MIYFLTDLIKFYCLMKLYEFKNIKIDLVLTEDIIPEDKLQEIIQSDDIIK LARKKTYEHLLRARRKSKELKTESRKKIARKMILMRERIRKNNKIKLDKE VNQSIKWVKDIQAIELVLMQDIMNKIHLSLTNALHSLDTSSRINWDDLLN EVVRETLSNNNIVGAIKITKNPDIKLDPGEANNIQLINDANTPHNKIIIE NEYIRITLDPLEQISILLNSFKDNYLSIIQE >ECs0234 hypothetical protein MPTPCYISITGQTQGNITAGAFTADSVGNIYVQGHEDEMLVQEFLHNVTV PTDPQSGQPAGQRAHKPFIFTVALNKAVPLMYNALASGEMLPTTELHWWR TSVEGKQEHYFTTRLTDSTIVDMKLHMPHCQDPAKREFTQLLEVSLAYRK IEWEHVKSGTSGADDWRAPLEA >ECs1647 tail assembly protein MAATHTLPLASPGMARICLYGDLQRFGRRIDLRVKTGAEAIRALATQLPV FRQKLSDGWYQVRITGRDVSTSGLTAQLHETLPDGAVIHIVPRVAGAKSG GVFQIVLGAAAIAGSFFTAGATLAAWGAAIGAGGMTGILFSLGASMVLGG VAQMLAPKARTPRTQTTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGS RVVSQEISTADEGDGGQVVVIGR >ECs1795 putative portal protein MWNLLRRTRKNQKSGRDVREVGWRSLFQAVAEPFAGAWQQGVKADPETVL SFHAVFSCISLISQDIAKMRLRLMQTDVQGIRREKRQGDTARLCRRPNAQ QNRIQFFELWLNSKLRHGNTVVLKIRTPRGQIKELRILDWNRVEPLVADD GEVFYRITPDRNCGITESVTVPAREVIHDRFNCFFHPLVGLPPVYAAGLA AMQGHHIQANSTYFFRNGGRPSGVIEVPGSITEENAKKLKGNWDSGYTGE NAGKTAILSNGAKYSPTTFSPVDAQTVEQLKMTAEIVCSVFRVPAYKIGV GHPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETGENESTEFDVTT LLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYS LEALSRRDAREDPFASAGKTVSSQLPDGASDGNKAISETEHDAVKAMFRG DTEKMTERELSIIRALGEEFSTVLADLQRTFEGKMASQAQAFEEKLTSLS AVLQKHVTVDEVRPVLQAMVDDAVGAIPVPRDGRDYDPDVLQQAVNDAVA NIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPEVLQKAVN DAVANIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPEVLQ KAVNDAVANIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDP DVLQKAVLDAVSALPAPQDGRDATALEILPAIDDQKSFPRGTYATHQGGL WRAYEKTHGMRGWECLVDGVADIDVSMTGERLFSVVVRQSSGQRTEKTFS LPVMLYRGVFRAGETYHPGDTVTWGGSLWHCNSMTEDKPGEAHSSAWTLA AKRGRDAGG >ECs3182 putative S-transferase MQGNICAMSAITESKPTRRWAMPDTLVIIFFVAILTSLATWVVPVGMFDS QEVQYQVDGQTKTRKVVDPHSFRLLTNEAGEPEYHRVQLFTTGDERPGLM NFPFEGLTSGSKYGTAVGIIMFMLVIGGAFGIVMRTGTIDNGILALIRHT RGNEILFIPALFILFSLGGAVFGMGEEAVAFAIIIAPLMVRLGYDSITTV LVTYIATQIGFASSWMNPFCVVVAQGIAGVPVLSGSGLRIVVWVIATLIG LIFTMVYASRVKKNPLLSRVHESDRFFREKQADVEQRPFTFGDWLVLIVL TAVMVWVIWGVIVNAWFIPEIASQFFTMGLVIGIIGVVFRLNGMTVNTMA SSFTEGARMMIAPALLVGFAKGILLLVGNGEAGDASVLNTILNSIANAIS GLDNAVAAWFMLLFQAVFNFFVTSGSGQAALTMPLLAPLGDLVGVNRQVT VLAFQFGDGFSHIIYPTSASLMATLGVCRVDFRNWLKVGATLLGLLFIMS SVVVIGAQLMGYH >ECs0160 hypothetical protein MSDDVALPLEFTDAAANKVKSLIADEDNPNLKLRVYITGGGCSGFQYGFT FDDQVNEGDMTIEKQGVGLVVDPMSLQYLVGGSVDYTEGLEGSRFIVTNP NAKSTCGCGSSFSI >ECs2385 hypothetical protein MKRASLLTLTLIGAFSAIQAAWAVDYPLPPTGSRLVGQNQTYTVQEGDKN LQAIARRFDTAAMLILEANNTIAPVPKPGTTITIPSQLLLPDAPRQGIIV NLAELRLYYYPPGENIVQVYPIGIGLQGLETPVMETRVGQKIPNPTWTPT AGIRQRSLERGIKLPPVVPAGPNNPLGRYALRLAHGNGEYLIHGTSAPDS VGLRVSSGCIRMNAPDIKALFSSVRTGTPVKVINEPVKYSVEPNGMRYVE VHRPLSAEEQQNVQTMPYTLPAGFTQFKDNKAVDQKLVDKALYRRAGYPV AVSSGATPTASNAPSVESAQNGEPEQGNMLRATQ >ECs0841 putative tail assembly protein MAATHTLPLASPGMARICLYGDLQRFGRRIDLRVKTGAEAIRALAMQIPA FRQKLSDGWYLVRIAGRDTGENELSARLNEPLANGAVIHIVPRLAGAKSG GVFQAVLGAALIAVAWWNPVGWLGAAAVSGMYAAGASMILGGVAQMLAPK ARTPRTQTTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGSRVVSQEIS TADEGDGGQVVVIGR >ECs2145 hypothetical protein MKFQAIVLASFLVIPYALADDQGGLKQDAAPPPPHAIEDGYRGTDDAKKM TVDFAKTMHDGASVSLRGNLISHKGEDRYVFRDKSGEINVVIPATVFDGR EVQPDQMINISGSLDKKSAPAVVRVTHLQK >ECs0218 IcmF-like protein MVIGPAGSGKTTLLREGFPSDIIYAPEGARGTEQRLYLTPHVGKQAVIFD IDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTA DKRHREHLLQTLRSRLQDIRQHLHCQLPVYVVLTRLDLLQGFAALFQSLN RQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVTQTH TRASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQM DDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATES RAWLMRSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAF MDVPPPQGEDDYGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQGRRI GPYVEQTYLQLLEQRYLPSLFNGLVKAMNAAPPESEEKLAVLRVMRMLED KSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLMSHLDYALAHTDWHAERQA GDGDAISRWTPYDKPVVSAQKELSKLPVYQRVYQSLKTRALGVLPADLNL RDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDS WVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI GQLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKERDEALAEPD YQLLTRLGHEFAPENITLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPV PGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLTDQAWHVV MVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPD GILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFS KQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNM REGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRF SVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY >ECs0340 hypothetical protein MEKYLHLLSRGDKIGLALIRLSIAIVFMWIGLLKFVPYEADSITPFVANS PLMSFFYEHPEDYKQYLTHEGEYKPEARAWQSANNTYGFSNGLGVVEVII ALLVLANPVNRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGF PYLSGAGRLVLKDTLMLAGAVMIMADSAREILKQGSNESSSTLKTEY >ECs0839 putative minor tail protein MQDIPQETHHETTRLTQSAQAVLWEIDLTEVGGERYFFCNEQNEKGEPVT WQGRQYQAYPIQGTGFELNGKGSSARPTLTVSNLHGMVTGMAEDLQSLVG GTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASF VLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYHGPAVADEYDQPTSDI TKDKCSKCLSGCKFRNNVGNFGGFLSINKLSQ >ECs0236 VgrG MSTGLRFTLEVDGLPPDAFAVVFFHLNQSLSSLFSLALSLVSQQFLSLEF QQILDKMAYLTIWQGDDVQRRVKGVVTWFELGENDKNQKLYSMKVCPPLW RTGLRQNFRIFQNEDIASILGTILQENGVTEWSPLFSEPHPSREFCVQYG ETDYDFLCRMAAEEGIFFYEEHAQKSTDQSLVLCDTVLYLPESFEIPWNP NTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQHQDY QRTQYEVYDYPGRFKGAHGQNFARWQMDGWRNNAEVARGTSRSPEIWPGR RIVLTGHPQANLNREWQVVASELHGEQPQAVPGRRGSGTTLDNHFAVIPA DRTWRPQPLLKPLVDGPQSAVVTGPAGEEIFCDEHGRVRVKFNWDRYNPS NQDSSCWIRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRTY HQENRTPGSLPGTKTQMTIRSKTYKGSGFNELKFDDATGKEQVYIHAQKN MNTEVLNNRTTDVINNHAEKIGNNQAITVTNNQILNIGVNQIQTVGVNQV ETVGSNQIIKVGSNQVEKVGIIRALTVGVAYQTTVGGIMNTSVALLQSSQ VGLHKSLMVGMGYSVNVGNNVTFSVGKTMKENTGQTAVYSAGEHLELCCG KARLVLTKDGSIFLNGTHIHLEGESDVNGDAPVINWNCGATQPVPDAPVP KDLPPGMPDMRQF >ECs0252 hypothetical protein MNSGQFSKDVKLAQKRHKDMNKLKYLMTLLINNTLPLPAVYKDHPLQGSW KGYRDAHVEPDWILIYKLTDKLLRFERTGTHAALFG >ECs4078 hypothetical protein MSKARRWVIIVLSLAVLVMIGINMAEKDDTAQVVVNNNDPTYKSEHTDTL VYNPEGALSYRLIAQHVEYYSDQAVSWFTQPVLTTFDKDKIPTWSVKADK AKLTNDRMLYLYGHVEVNALVPDSQLRRITTDNAQINLVTQDVTSEDLVT LYGTTFNSSGLKMRGNLRSKNAELIEKVRTSYEIQNKQTQP >ECs0882 putative hydroxylase MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS TLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGA VRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLV LYPSSSLHCVTPVTRGVRVASFIWIQSMIRDDKKRAMLFELDKNIQNIQS LKSRYGENEEILSLLNLYHNLLREWSEI >ECs0123 hypothetical protein MAQCDFGALPGAEEHTMDYEFLRDITGVVKVRMSMGHEVVGHWFNEEVKE NLALLDEVEQAAHALKGSERSWQRAGHEYTLWMDGEEVMVRANQLEFAGD EMEEGMNYYDEESLSLCGVEDFLQVVAAYRNFVQQK >ECs1830 putative structural proteins MNMKTIEDVFIHLLSDTYSAEKQLTRALAKLARATSNEKLSQAFHAHLEE THGQIERIDQVVESESNLKIKRMKCVAMEGLIEEANEVIESTEKNEVRDA ALIAAAQKVEHYEIASYGTLATLAEQLGYRKAAKLLKETLEEEKATDIKL TDLALNNVNKKAENKA >ECs3079 hypothetical protein MPQISRYSDEQVEQLLAELLNVLEKHKAPTDLSLMVLGNMVTNLINTSIA PAQRQAIANSFARALQSSINEDKAH >ECs4356 HicB-like protein MIKLKTPNSMEIAGQPAVITYVPELNAFRGKFLGLSGYCDFVSDSIQGLQ KEGELSLREYLEDCKAAGIEPYARTEKIKTFTLRYPESLSERLNNAAAQQ QVSVNTYIIETLNERLNHL >ECs4836 hypothetical protein MGKFLVEREQMRYPVDVYTGKIQAYPEGKPSAIAKIQVDGELMLTELGLE GDEQAEKKVHGGPDRALCHYPREHYLYWAREFPEQAELFVAPAFGENLST DGLTESNVYIGDIFRWGEALIQVSQPRSPCYKLNYHFDISDIAQLMQNTG KVGWLYSVIAPGKVSADAPLELVSRVSDVTVQEAAAIAWHMPFDDDQYHR LLSAAGLSKSWTRTMQKRRLSGKIEDFSRRLWGK >ECs0262 hypothetical protein MEWYMGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMP DLHPGRGYPIGAAFFSVGRFYPTRRRGNGAGNRNGPLL >ECs5586 hypothetical protein MVKKTIAAIFSVLVLSTVLTACNTTRGVGEDISDGGNAISGAATKAQQ >ECs2947 putative minor tail protein MQDIHGESLIESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT WQGREYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG ATVVRRRVYARFLDAVNFVAGNPEADPEQELSDRWVVEQMSELTAMTASF VLATPTETDGALFPGRIMLANTCMWDYRGDECGYNGPAVADEFDNPTTDI RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ >ECs2870 hypothetical protein MKIETFDSVWDAVSDTPEQAENMRIRAELVTIINNWIEQQGFSQAQAASA LGVTQPRISELARGKIQIFSIDKLITMMAHAGLHIQRIEVQYPHAA >ECs0436 hypothetical protein MPHSCREIHCFDNRWQKHKQNYAGRQKRDTIEDYPTKDDFMTIWVDADAC PNVIKEILYRAAERMQMPLVLVANQSLRVPPSRFIRTLRVAAGFDVADNE IVRQCEAGDLVITADIPLAAEAIEKGAAALNPRGERYTPATIRERLTMRD FMDTLRASGIQTGGPDSLSQRDRQAFAAELEKWWLEVQRSRG >ECs1305 hypothetical protein MVHKSDSDELAALRAENARLVSLLEAHGIEWRRKPQTPVQRVSVLSTDEK VALFRRLFRGRDDVWALRWESKTSGKSGYSPACANEWQARICGKPRIKCG DCAHRQLIPVSDLVIYHHLAGTHTVGMYPLLEDDSCYFLAVDFDEAEWQK DASAFMRSCDELGVPAALEISRSRQGAHVWIFFASRVSAREARRLGTAII SYTCSRTRQLRLGSYDRLFPNQDTMPKGGFGNLIALPLQKRPRELGGSVF VDMNLQPYPDQWAFLVSVTPMNVQDIEPTILRATGSIHPLDVNFINEEDL GTPWEEKKSSGNRLNISIAEPLKITLANQIYFEKAQLPQVLINRLIRLAA FPNPEFYKAQAMRMSVWNKPRVIGCAENYPQHIALPRGCLDSVLSFLRDN NIAAELIDKRFAGTECNAVFMGNLRAEQEEAVSALLRYDTGVLCAPTAFG KTVTAAAVIAKRKVNTLILVHRTELLKQWQERLAVFLQAGDSIGIIGGGK HKPCGNIDIAVVQSISRQGEVEPLVRNYGQIIVDECHHIGAVSFSAILKE TNARYLLGLTATPIRRDGLHPIIFMYCGAIRHTAVRPKESPHNLEVLIRS RFTSGHLPSDARIQDIFREIALDHDRTVAIAEEAMKAFGQGRKVLVLTER TDHLDEIASVMNSLKLSPFILHGRLSKKKRAMLISGLNALPPDSPRILLS TGRLIGEGFDHPPLDTLILAMPVSWKGTLQQYAGRLHREHTGKSDVRIID FVDTAYPVLLRMWDKRQRGYKAMGYRIIADGDESVI >ECs0534 hypothetical protein MTPAVKSLEKNKISFQIHTYEHDPAETNFGDEVVKKLGLNPDQVYKTLLV AVNGDMKHLAVAVTPVAGQLDLKKVAKALGAKKVEMADPMVAQRSTGYLV GGISPLGQKKRLPTIIDAPAQEFATIYVSGGKRGLDIELAAGDLAKILDA KFADIARRD >ECs5294 hypothetical protein MFCHPETTAAPCGVYYFIARVKNSLSFIRILLICRTEFDEKKAIIRSNKL KKRITFIVFLCAIVAASLFFVQSCVRKSQHVAGFQNYQATIDGKEITGVT KNISSLTWSAQSNTLFSTINKPATIVEMTTEGDLIRTIPLDFVKDLETIE YIGDNKFVISDERDYAIYVISLNADSEVSILKKIKIPLQETPTNCGFEGL AYSSQDHTFWFFKEKNPIEVYKVTGLLRSDELHISKDKTLQRQFTLDDVS GAEFNPQKNTLLVLSHESRALQEVTVRGDVIGEMSLTKGKYGLSHNIKQA EGIAMDDSGNIYIVGEPNLFYRFTSTKSR >ECs3654 hypothetical protein MSSYANHQALAGLTLGKSTDYRDTYDASLLQGVPRSLNRDPLGLKADNLP FQGTDIWTLYELSWLNAKGLPQVAVGHVELDYTSVNLIESKSFKLYLNSF NQTRFNNWDEVRQTLERDLSTCAQGEVSVALYRLDELEGQPIGHFNGTCI DDQDITIDNYEFTTDYLENATSGEKVVEETLVSHLLKSNCLITHQPDWGS IQIQYRGRQIDREKLLRYLVSFRHHNEFHEQCVERIFNDLLRFCQPEKLS VYARYTRRGGLDINPWRSNSDFVPSTTRLVRQ >ECs5200 hypothetical protein MRIFVYGSLRHKQGNSHWMTNAQLLGDFSIDNYQLYSLGHYPGAVPGNGT VHGEVYRIDNATLAELDALRTRGGEYARQLIQTPYGSAWMYVYQRPVDGL KLIESGDWLDRDK >ECs2780 hypothetical protein MGRKWANIVAKKTAKDGATSKIYAKFGVEIYAAAKQGEPDPELNTSLKFV IERAKQAQVPKHVIDKAIDKAKGGGDETFVQGRYEGFGPNGSMIIAETLT SNVNRTIANVRTIFNKKGGNIGAAGSVSYMFDNTGVIVFKGSDPDHIFEI LLEAEVDVRDVTEEEGNIVIYTEPTDLHKGIAALKAAGISEFSTTELEMI AQSEVELSPEDLEIFEGLVDALEDDDDVQKVYHNVANL >ECs2320 hypothetical protein MNKSLVAVGVIVALGVVWTGGAWYTGKKIETHLEDMVAQANAQLKLTAPE SNLEVSYQNYHRGVFSSQLQLLVKPIAGKENPWIKSGQSVIFNESVDHGP FPLAQLKKLNLIPSMASIQTTLVNNEVSKPLFDMAKGETPFEINSRIGYS GDSSSDISLKPLNYEQKDEKVAFSGGEFQLNADRDGKAISLSGEAQSGRI DAVNEYNQKVQLTFNNLKTDGSSTLASFGERVGNQKLSLEKMTISVEGKE LALLEGMEISGKSDLVNDGKTINSQLDYSLNSLKVQNQDLGSGKLTLKVG QIDGEAWHQFSQQYNAQTQALLAQPEIANNPELYQEKVTEAFFSALPLML KGDPVITIAPLSWKNSQGESALNLSLFLKDPATTKEAPQTLAQEVDRSVK SLDAKLTIPVDMATELMTQVAKLEGYQEDQAKKLAKQQVEGASAMGQMFR LTTLQDNTITTSLQYTNGQITLNGQKMPLEDFVGMFAMPALNVPVVPAIP QQ >ECs5152 hypothetical protein MNSTIWLALALVLVLEGLGPMLYPKAWKKMISAMTNLPDNILRRFGGGLV VAGVVVYYMLRKTIG >ECs0346 putative transporter MDNRGEFLNNVAQALGRPLRLEPQAEDAPLNNYANERLTQLNQQQRCDAF IQFASDVMLTRCELTSEAKAAEAAIRLCKELGDQSVVISGDTRLEELGIS ERLQQECNAVVWDPAKGAKNISQAEQAKVGVVYAEYGLTESGGVVLFSAA ERGRSLSLLPESSLFILRKSTLLPRVAQLAERLHQKAQAGERMPSCINII SGPSSTADIELIKVVGVHGPVKAVYLIIEDC >ECs1718 hypothetical protein MTSFSTLLSVHLISIALSVGLLTLRFWLRYQKHPQAFARWTRIVPPVVDT VLLLSGIALMAKAHILPFSGQAQWLTEKLFGVIIYIVLGFIALDYRRMHS QQARIIAFPLTLVVLYIIIKLATTKVPLLG >ECs5039 hypothetical protein MTISELLQYCMAKPGAEQSVHNDWKATQIKVEDVLFAMVKEVENRPAVSL KTSPELAELLRQQHSDVRPSRHLNKAHWSTVYLDGSLPDSQIYYLVDASY QQAVNLLPEEKRKLLVQL >ECs2162 putative tail assembly protein MATTNAFCLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQMPG FRRQMNEGWYQIRIRGEDTAPEAVYARLHEPLGEGAVIHIVPRLAGAGKG GLQIVLGAAAIVGSFFTAGATMALWGAALSAGGLTATTMLFSLGASMILG GVAQMLAPKAKTPEYRATDNGKQNTYFSSLDNMIAQGNPMPVPYGEMLVG SRRISQDISTRDEGGDGKVVVIGRG >ECs2458 hypothetical protein MMKMQSRKIWYYRITLIILLFAMLLAWALLPGVHEFINRSVAAFAAVDQQ GIERFIQSYGALAAVVSFLLMILQAIAAPLPAFLITFANASLFGAFWGGS LSWTSSMAGAALCFFIARVMGREVVEKLTGKTVLDSMDGFFTRYGKHTIL VCRLLPFVPFDPISYAAGLTSIRFRSFFIATGLGQLPATIVYSWAGSMLT GGTFWFVTGLFILFALTVVIFMAKKIWLERQKRNA >ECs1644 minor tail protein MKTFRWKVKPGMDVASAPSVRKVRFGDGYSQRAPAGLNADLKTYSVTLSV PRWEATALESFLAEHGGWKAFLWTPPYEWRQIKVTCAKWSSRVSMLRVEF SAEFEQVVN >ECs1829 hypothetical protein MNRIEHYHDWLRDAHAMEKQAESMLESMASRIDNYPELRARIEQHLSETK NQIVQLETILDRNDISRSVIKDSMSKMAALGQSIGGIFPSDEIVKGSISG YVFEQFEIACYTSLLAAAKNAGDTASIPIIEAILNEEKQMADWLIQHIPQ TTEKFLIRSETDGVEAKK >ECs0952 hypothetical protein MQFSTTPTLEGQTIVEYCGVVTGEAILGANIFRDFFAGIRDIVGGRSGAY EKELRKAREIAFEELGSQARALGADAVVGIDIDYETVGQNGSMLMVSVSG TAVKTRR >ECs0228 hypothetical protein MDGKNRAASSYLSPGNPPADKEQNDPLAQVFHNACSYNFFAMAELLHRLA KGEKGTPELSLRDDPAQETLRFSADASLAFPCSDISALKRDTSGAFRMTT TFMGLQGSQSPLPGYYLDHLAWKAVHEQSPVGDFLDMFSHRLTQFVWHIW RKYRYHISFRNGGVDAFSQRMYSLVGLGHRQLRDKLAINHSKMLAYSGIL ANPGRSPEIICGLVSHCFDLSEVTLQNWQRRKVDIEPDQQNSLGSYSLKN GEKLAGRSVLGNFVLGTRVPDLSGKFQLSITSLTRKQFLSFLPSGENFLP LTTFVSFILRDQLAWDLHLGLAPEQVGAMRLGDNKSALLGWTSFLGTPEE RPSVTIRVRS >ECs2164 putative minor tail protein MQDIHEESLNESVKSEQSPRVVLWEIDLTAQGGERYFFCNELNEKGEAVT WQGRQYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG ATVVRRRVYARFLDAVNFVAGNPEADPEQELTDRWVVEQMSSLTAMTASF VLATPTETDGALFPGRIMLANTCMWDYRGDECGYNGPAVADEFDNPTTDI RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ >ECs3510 hypothetical protein MNNIFIFEPSNKNNPLDNVIKFIEFCKSTISNNNLTTSWESNKWKGLYRF TKFNSKNNLNSKECLDDSFINFAKAYMLHVHSFNKSKTKHSTLSMLKIVE FVLLKINMEANVNYCNNSIYDECIRIASEKYSKAHAFAIGKELEKLSSFL NDNRMTNSFYLFWVNPIRYRITQSWTGYDSSLEGHSRLPDIKSVIAIAEI FSKRDEQLSSRDIFTTSVLALLMCAPSRISEILALPADCEITECDGKGIQ RYGLRFFSAKGYEGNIKWIPTLMIPVAKKAISRLKELSSQARLLAAEIQK NYSNSTKGTLKENIPPDLFWYDREKKIKYSNALCLLTEGQLNQNKKEMSD KLFRPTTNFFKTDIIDSDYIKGYFNVFKRHGYINEDGSPYLLRTHQLRHL LNTFAQINGMDEFSIARWSGRKLISQNVSYDHRSHLQMSKAIREKKLSVC VNEHRIKDIPVVDLNEFDSLSSGAVLVSKHGYCKHSYAFKPCDNYPIKNS GLDNETISNIHDKILKRTLYDKNDGNINADKWYEFHKKIKKGE >ECs5352 hypothetical protein MLIMHQVVCATTNPAKIQAILQAFHEIFGEGSCHIASIAVESGVPEQPFG SEETRAGARNRVANARRLLPEADFWVAIEAGIDGDSTFSWVVIENTTQRG EARSATLPLPAVILEKVREGEALGPVMSRYTGIDEIGRKEGAIGVFTAGK LTRASVYHQAVILALSPFHNAVY >ECs2448 hypothetical protein MPITIHGYLNDNNEKNGPGASMEYFDMRKMSVNLWRNAAGETREICTFPP AKRDFYWRASIASIAANGEFSLFPGMERIVTLLEGGEMFLESADHFNHTL KPLQPFAFAADQVVKAKLTAGQMSMDFNIMTRLDVCKAKVRIAERTFTTF GSRGGVVFVINGAWQLGDKLLTTDQGACWFDGRHTLRLLQPQGKLLFSEI NWLAGHSPDQVQ >ECs4465 hypothetical protein MAIKSPPTLIPLSHLSGEELQAHLRFNRVTDEKGRYLPFDELQYRIKKGE NVDVAWTLTRLARNAAIQRINYCNEAGEQAGFNITPVIAEACELVDKRAT ALALKDQTERLRGAGAELSQLRLEEPITSSQLEGANTTTLVARKMLETGR SPRTEDEHMIAGNARLMAEIPHLLAEPLTPALIRQLHAIGMGGINDAKYR PGEFRETDDVVIADYDGNIVHQPPAAALLPERLEKVCQWLNSHEGYIHPL VRACILHFMLAHEHPFRDGNGRTSRALFYWYMLKSGYDVFKYISISRLLH AAPVKYAASYQYTESDGMDLTYFLEYQAGVIKRALQNWQQHIDEITQRSA KLDSVLFSSGVLKRLNPRQVTLLNVMLANPGKEYTVAEISASLGVSDNTV RADLRTIVKEGFAQEKKINDQQAVYFAHYPL >ECs1806 putative host specificity protein MGKGGGKAHTPVEAKDNLKSTQMMSVIDAIGEGPIEGPVKGLQSILVNKT PLTDTDGNPVIHGVTAVWRAGEQEQTPPEGFESSGAETALGVEVTKAKPV TRTITSANIDRLRVTFGVQSLLETTSKGDRNHSSVRLLIQLQRNGNWVTE KDVTINGKTTSQFLASVILDNLPPRPFNIRMVRETADSTTDQLQNRTLWS SYTEIIDVKQCYPNTAIVGLQVDAEQFGGQQMTVNYHIRGRIIQVPSNYD PEKRTYSGIWDGSLKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWA LYAIAQYCDQTVPDGFGGTEPRMTFNAYLSQQRKAWDVLSDFCSAMRCMP VWNGQTLTFVQDRPSDVVWPYTSSDVVVDDNGVGFRYSFSALKDRHTAVE VNYTDPQNGWQTSTELVEDPEAILRYGRNLLKMDAFGCTSRGQAHRAGLW VIKTGLLETQTVDFTLGSQGLRHTPGDIIEICDNDYAGTMTGGRILSIDA ASRTLTLDREVTLPETGAATVNLINGSGKPVSVAITAHPAPDRIQVSTLP DGVETYGVWGLSLPSLRRRLFRCVSVRENTDGTFAITAVQHVPEKEAIVD NGARFEPQSGTLNSVIPPAVQHLTVEVSAADGQYLAQAKWDTPKVVKGVS FMLRLTVAADDGSERLVSTARTTETTYRFTQLAPGNYRLTVRAVNAWGQQ GDPASVSFRIAAPAAPSQIELTPGYFQITAVPRLAVYDPTVQFEFWFSET RITDIRQVETTARYLGTGLYWIAASINIKPGHDYYFYIRSVNTVGKSAFV EAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLAEIRTSI TDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDG RLYIAGIGAGIENTSDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQI FMNEVFLKYLTAPTITSGGNPPAFSLTPDGRLTAKNADISGNVNANSGTL NNVTINENCRVLGKLSANQIEGDLVKTVGKAFPRDSRAPERWPSGTITVR VYDDQPFDRQIVIPAVAFSGAKHEKEHTDIYSSCRLIVRKNGAEIYNRTA LDNTLIYSGVIDMPAGHGHMTLEFSVSAWLVNNWYPTASISDLLVVVMKK ATAGITIS >ECs3835 hypothetical protein MAKNRSRRLRKKMHIDEFQELGFSVAWRFPEGTSEEQIDKTVDDFINEVI EPNKLAFDGSGYLAWEGLICMQEIGKCTEEHQAIVRKWLEERNLDEVRTS ELFDVWWD >ECs2934 hypothetical protein MEFYMKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYV YKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLT YTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKA AGFKEVK >ECs3820 hypothetical protein MKTSRLPIAIQQAVMRRLREKLAQANLKLGRNYPEPKLSYTQRGTSAGTA WLESYEIRLNPVLLLENSEAFIEEVVPHELAHLLVWKHFGRVAPHGKEWK WMMESVLGVPARRTHQFELQSVRRNTFPYSCKCQEHQLTVRRHNRVVRGE AVYRCVHCGEQLVAK >ECs2721 putative tail assembly protein MATTNAFCLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQMPG FRRQMNEGWYQIRIRGEDTAPEAVYARLHEPLGEGAVIHIVPRLAGAGKG GLQIVLGAAAIVGSFFTAGATMALWGAALSAGGLTATTMLFSLGASMILG GVAQMLAPKAKTPEYRATDNGKQNTYFSSLDNMIAQGNPMPVPYGEMLVG SRRISQDISTRDEGGDGKVVVIGRG >ECs3023 hypothetical protein MLTGNQRETDWPMDLNTLISQYGYAALVIGSLAEGETVTLLGGLAAHQGL LKFPLVVLSVALGGMIGDQVLYLCGRRFGGKLLRRFSKHQDKIERAQKLI QRHPYLFVIGTRFMYGFRVIGPTLIGASQLPPKIFLPLNILGAFAWALIF TTIGYAGGQVIAPWLHNLDQHLKHWVWLILVVVLVVGVRWWLKRRGKKKP DNQA >ECs2203 hypothetical protein MILRQCAGTMKVKSVGALIGRTEAAVRTKARELGISMMLRGDFHPSAKYS QRDIELARQLHQRGMQRREIARKLGMPLRIVNNYVYFDRRVSA >ECs3154 hypothetical protein MSNQFGDTRIDDDLTLLSETLEEVLRSSGDPADQKYVELKARAEKALDDV KKRVSQASDSYYYRAKQAVYRADDYVHEKPWQGIGVGAAVGLVLGLLLAR R >ECs2166 putative tail length tape measure protein precursor MSQPAGDLVIDLSLDAARFDEQMARVRRHFSSLEADARKTASTVEQGLSR QALAAQKAGISVGQYKAAMRTLPAQFTDIVTQLAGGQNPFLIMLQQGGQI SDSFGGPLSLLTLLKEELLGIRDASESSEESLSDTANALAENARNAGELG RFMSVARVAAGGGVAVLAALAAAAWQAEQADRALLRSLILTGGAAATTTA ELWKMAGVISDEAGGGIRQAAENLARLAESGKYTAGQLRIMGETSQRWLQ TVGDDAGKVEKAFEGIAADPVKALASLNQQYNFLSVSQLRHIDELERTKG KQVAVTEAMSLFADVMNARLEQLDKAATPVEKIWDDVKTWTSDAWAWIGD HTLGALSLITDVVAGTVEQVKLLLVQGDLALAEFIQSAWETTKNVPGVGA LFGELAEENRVFIEKTKRDELALRKSIAERDARIRQGEMGYINRSRATGV SKGPGQQEAVSRLAEELTGKKHTSPKTRSAGEREEEQAREALLALEAELR TLEKHSGANEKISRQRRDLWKAESQYAVLKEAATKRQLSEQEKSLLAHKD ETLEYKRQLAELGDKVEYQKRLNELAQQAVRFEEQQSAKQAAISAKARGL TDRQAQRESEAQRLRDVYGDNPAALAKATSALKNTRSAEEQLRGSWMAGL KSGWGEWAESATDSFSQVKSAATQTFDGIAQNMAAMLTGAEADWRGFTRS VLSMMTEILLKQAMVGIVGRIGSAIGGAFGGGASASSGTAIQAAAANFHF ATGGFTGTGGKYEPAGIVHRGEFVFTKEATSRIGVGNLYRLMRGYAEGGY VGGAGSPAQMRRAEGINFNQNNHVVIQNDGTNGQAGPQLMKAVYDMARKG AQDELRLQLRDGGMLSGSGR >ECs1434 hypothetical protein MKKSLLGLTFASLMFSAGSAVAADYKIDKEGQHAFVNFRIQHLGYSWLYG TFKDFDGTFTFDEKNPAADKVNVTINTTSVDTNHAERDKHLRSADFLNTA KYPQATFTSTSVKKDGDELDITGDLTLNGVTKPVTLEAKLIGQGDDPWGG KRAGFEAEGKIKLKDFNIKTDLGPASQEVDLIISVEGVQQK >ECs2459 hypothetical protein MGLPPLSKIPFILRPQAWLHRRHYGEVLSPIRWWGRIPFIFYLVSMFVGW LERKRSPLDPVVRSLVSARIAQMCLCEFCVDITSMKVAERTGSSDKLLAV ADWRQSPLFSDEERLALEYAEAASVTPPTVDDALRTRLAAHFDAQELTEL TALIGLQNLSARFNSAMDIPAQGLCRIPEKRS >ECs0216 Hcp-like protein MANLIYLTLNGDNQGLISSGCSSQPSIGNKAQTAHIDQIMIYEFMHGLNR DQNVNHHHLTIKKPIDKASPLLGKAICDNELLTCDFSFYRTNKFGINELF YKIKLTGAKISDIHVSISHIVVDNSVQPEESVSFSYESIIWEHCSAGTSA YSLWEDRLF >ECs1008 putative amidase MLLNMMCGRRLSAISLCLAVTFAPLFNAQADEPEVIPGDSPVAVSEQGEA LPQAQATAIMAGILPLPEGAAEKARTQIESQLPAGYKPVYLNQLQLLYAA RDMQPMWENRDAVKAFQQQLAEVAIAGFQPQFNKWVELLTDPGVNGMARD VVLSDAMMGYLHFIANIPVKGTRWLYSSKPYALATPPLSVINQWQQALDK GQLPTFVAGLAPQHPQYAVMHESLLALLSDTKPWPQLTGKATLRPGQWSN DVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMD KQTTSRSKPAPAVRAAYDNELVEAVKRFQAWQGLGADGAIGPATRDWLNV TPAQRAGVLALNIQRLRLLPTELSTGIMVNIPAYSLVYYQNGNQVLDSRV IVGRPDRKTPMMSSALNNVVVNPPWNVPPTLARKDILPKVRNDPGYLESH GYTVMRGWNSREAIDPWQVDWSTITASNLPFRFQQAPGPRNSLGRYKFNM PSSEAIYLHDTPNHNLFKRDTRALSSGCVRVNKASDLANMLLQDAGWNDK RISDALKQGDTRYVNIRQSIPVNLYYLTAFVGADGRTQYRTDIYNYDLPA RSSSQIVSKAEQLIR >ECs2439 hypothetical protein MERLLIVNADDFGLSKGQNYGIIEACRNGIVTSTTALVNGQAIDHAVQLS RDEPSLAIGMNFVLTMGKPLTAMPGLTRDGVLGKWIWQLAEEDALPLEEI TQELASQYLRFIELFGRKPTHLDSHHHVHMFPQIFPIVARFAAEEGIALR IDRQPLSNAGDLPANLRSSHGFSSAFYGEEISEALFLQVLDDSSHRGERS LEVMCHPAFVDNTIRQSAYCFPRLTELEVLTSASLKYAIAERGYRLGSYL DV >ECs1544 putative portal protein MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFSGAWQQGVKADPEAVL SFHAVFACISLISQDIAKMRLRLMQTDAHGIRRETRRGDIARLCRRPNAQ QNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWSRVEPLVADD GEVFYRITPDRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLA ATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGE NAGKTAILSNGAKYNPTTFSPVDAQTVEQLKMTAEIVCSVFRVPAYKIGV GQPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETGENESTEFDVTT LLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYS LEALSRRDAREDPFASAGKTVSSQLPDGASDGNKAISETEHDAVKAMFRG DTEKMTERELSIIRALGEEFSTVLADLQRTFEEKIAAQAQTFEEKLASQS VVLQKCVTGDDVRPMLEQMVKEAVSHIPVPRDGRDYDPDVLQKAVNDAVA NIPVPADGKSITPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPDVLQKAVN DAVAKIPVPADGKSITPDDVHPMLEQMVKEAVSHIPVPRDGRDYDPDVLQ KAVNDAVAKIPVPADGKSITPDDVHPMLEQMVKEAVSHIPVPRDGRDYDP DVLQKAVLEAVSALPAPQDGRDATALEILPAIDDQKSFPRGSYATHQGGL WRAYEKTYGMRGWECLVDGVADIDVSMTGERSFSVVVRQSSGQRTEKTFS LPVMLYRGVFRIGETYHPGDTVTWGARCGTATV >ECs0770 hypothetical protein MSKIIATLYAVMDKRPLRALSFVMALLLAGCMFWDPSRFAAKTSELEIWH GLLLMWAVCAGVIHGVGFRPQKVLWQGIFCPLLADIVLIVGLIFFFF >ECs1009 hypothetical protein MDKFDANRRKLLALGGVALGAAILPTPAFATLSTPRPRILTLNNLHTGES IKAEFFDGRGYIQEELAKLNHFFRDYRANKIKSIDPGLFDQLYRLQGLLG TRKPVQLISGYRSIDTNNELRARSRGVAKKSYHTKGQAMDFHIEGIALSN IRKAALSMRAGGVGYYPRSNFVHIDTGPARHW >ECs4740 hypothetical protein MKQPGEELQETLTELDDRVVVDYLIKNPEFFIRNARAVEAIRVPHPVRGT VSLVEWHMARARNHIHVLEENMALLMEQAIANEGLFYRLLYLQRSLTAAS SLDDMLMRFHRWARDLGLAGASLRLFPDRWRLGAPSNHTHLALSRQSFEP LRIQRLGQEQHYLGPLNGPELLVVLPEAKAVGSVAMSMLGSDADLGVVLF TSRDASHYQQGQGTQLLHEIALMLPELLERWIERV >ECs5199 hypothetical protein MSLWKKISLGVVIVILLLLGSVAFLVGTTSGLHLVFKAADRWVPGLDIGK VTGGWRDLTLSDVRYEQPGVAVKAGNLHLAVGLECLWNSSVCINDLALKD IQVNIDSKKMPPSEQVEEEEDSGPLDLSTPYPITLTRVALDNVNIKIDDT TVSVMDFTSGLNWQEKTLTLKPTSLKGLLIALPKVAEVAQEEVVEPKIEN PQPEEKPLGETLKDLFSRPVLPEMTDVHLPLNLNIEEFKGEQLRVTGDTD ITVRTMLLKVSSIDGNTKLDALDIDSNQGILNASGTAQLSDNWPVDITLN STLNVEPLKGEKVKLKVGGALREQLEIGVNLSGPVDMDLRAQTRLAEAGL PLNVEVNSKQIYWPFTGEKQYQADDLKLKLTGKMTDYTLSMRTAVKGLEI PPATITLDAKGNEQQVNLDKLTVAALEGKTELKALLDWQQAISWRGELTL NGINTAKEIPEWPSKLNGLIKTRGSLYGGTWQMEVPELKLTGNVKQNKVN VDGTLKGNSYMQWMIPGLHLELGPNSAEVKGELGVKDLNLDATINAPGLD NALPGLGGTAKGLVKVRGTVEAPQLLADITARGLRWQELTVAQVRVEGDI KSTDQIAGKLDVRVEQISQPDVNINLVTLNAKGSEKQHELQLRIQGEPVS GQLNLAGSFDRKEERWKGTLSNTRFQTLVGPWSLTRDIALDYRNKEQKIS IGPHCWLNPNAELCVPQTIDAGAEGRAVVNLNRFDLAMLKPFMPETTQAS GIFTGKADVAWDTTEEGLPQGSITLSGRNVQVTQTVNDAALPVAFQTLNL TAELRNNRAELGWTIRLTNNGQFDGQVQVTDPQGRRNLGGNVNIRNFNLA MINPIFTRGEKAAGMVSANLRLGGDVQSPQLFGQLQVTGVDIDGNFMPFD MQPSQLAVNFNGMRSTLAGTVRTQQGEIYLNGDADWSQIENWRARVTAKG SKVRITVPPMVRMDVSPDVVFEATPNLFTLDGRVDVPWARIVVHDLPESA VGVSSDVVMLNDNLQPEEPKTASIPINSNLIVHVGNNVRIDAFGLKARLT GDLNVVQDKQGLGLNGQINIPEGRFHAYGQDLIVRKGELLFSGPPDQPYL NIEAIRNPDATEDDVIAGVRVTGLADEPKAEIFSDPAMSQQAALSYLLRG QGLESDQSDSAAMTSMLIGLGVAQSGQIVGKIGETFGVSNLALDTQGVGD SSQVVVSGYVLPGLQVKYGVGIFDSIATLTLRYRLMPKLYLEAVSGVDQA LDLLYQFEF >ECs3930 hypothetical protein MALNTYQYRETTMIDPKKIEQIARQVHESMPKGIREFGEDVEKKIRQTLQ AQLTRLDLVSREEFDVQTQVLLRTREKLALLEQRSSELEARNNSVADLQS PPAIPPIDKAE >ECs0251 hypothetical protein MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFK EERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQR NQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQG IDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKP GYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >ECs5021 hypothetical protein MLGHISKFDGNNSLIKHGVVQGNNIVDFDLLRNFNGGPGLNRENFIYISN IFLNIKQRNEKNHSINMFREVSISGDIVSVKFYRNEKIECACDFMMAKDA QGYIDLSELDLTSCHFKGDVISKVSFISSNLQHVTFECKEIGDCNFTTAI VDNVIFKCRRLHNVIFIKASGDYVDFSKNILDTVDFSQSQLTHSNFCECQ IRNSNFDHCYLYASHFTRAEFLTDKEISFIKSNLTAVMFDHVRISTGNFK DSVTQLMVLSIDYSDIFGNEYLDGYINNIIKMIDSLPDDPAILKSVLAVK LVMQLKILNIVNKNFIENMKKIFSHGPYIKDPIIRSYIHPDEDNKFDNFM RQNRFSKVNFDTQQMIDFINRFNMNKWLIDRNNNFFIQLIDQALRSTNDT IKENAWHLYKEWIRSDDVSPLFIEIEDNLRTFNTNELTRNDNIFILFSSV DDGPVMVVSSQRLHDMLNPTKDTNWNSTYIYKSRHEMLPVNLTPETLFGS KSYDKHALFPIFTASWRANRIKNKGI >ECs2073 hypothetical protein MKFPSIFNKIKPQSIQQHPEKNQLNWMLELNKWKEERILTGEIHRPECRN EAAKRINCAFLSKQNDIDLSGLNLSTQPPGLQNFTSINLDNNQLTHFDAT NYDRLVKLSLNSNTLESINIHQGRNVSITHISMNNNCLRNIDIDRLSSIT YFSAAHNKLEFVQLESCEWLQYLNLSHNQLTDIVTGNKEELLLLDLSHNK LASLHNALFPNLNTLLINNNLLSEIKMFYSNFCKVQTLNAANNQLEKINL HFLTYLSSIKSLRLDNNKITRIDTENTSDIRSLFPIIKKSESLNFLNISG ENNCPTIQLMLFNLFSPALKLNTGLAILSPGAFEDHSDGLDVDNELFHYT INKAYTPYNIHTYKTEEVVNQRNIKIKNMTLDEINNTYCNNDYYNEAIRE EPIDFLDRSFSSSSWPFYH >ECs5495 hypothetical protein MEKEQLIEIANTIMPFGKYKGRRLIDLPEEYLLWFARKDEFPAGKLGELM QITLLIKTEGLTQLVQPLKRPL >ECs4317 hypothetical protein MLWSFIAVCLSAWLSVDASYRGPTWQRWVFKPLTLLLLLLLAWQAPMFDA ISYLVLAGLCASLLGDALTLLPRQRLMYAIGAFFLSHLLYTIYFASQMTL SFFWPLPLVLLVLGALLLAIIWTRLEEYRWPICTFIGMTLVMVWLAGELW FFRPTAPALSAFVGASLLFISNFVWLGSHYRRRFRADNAIAAACYFAGHF LIVRSLYL >ECs2096 hypothetical protein MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQ QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQR LGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN ETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR YDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ LDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS >ECs1683 putative sporulation protein MATIDSMNKDTTRLSDGPDWTFDLLDVYLAEIDRVAKLYRLDTYPHQIEV ITSEQMMDAYSSVGMPINYPHWSFGKKFIETERLYKHGQQGLAYEIVINS NPCIAYLMEENTITMQALVMAHACYGHNSFFKNNYLFRSWTDASSIVDYL IFARKYITECEERYGVDEVERLLDSCHALMNYGVDRYKRPQKISLQEEKA RQKSREEYLQSQVNMLWRTLPKREEEKTVAEARRYPSEPQENLLYFMEKN APLLESWQREILRIVRKVSQYFYPQKQTQVMNEGWATFWHYTILNHLYDE GKVTERFMLEFLHSHTNVVFQPPYNSPWYSGINPYALGFAMFQDIKRICQ SPTEEDKYWFPDIAGSDWLETLHFAMRDFKDESFISQFLSPKVMRDFRFF TVLDDDRHNYLEISAIHNEEGYREIRNRLSSQYNLSNLEPNIQIWNVDLR GDRSLTLRYIPHNRAPLDRGRKEVLKHVHRLWGFDVMLEQQNEDGSVELL ERCPPRMGNL >ECs2239 putative minor tail protein MAEIKTLHLVPREGMQVSEKPSVVRVRFGDGYEQRRPTGLNPQLKTFQAV FRVTDESTRRWLDEFLSWHGGYRAFLWRPPKHNRTVRVVCREWSVTDNAR YSDFSCTIEQVVN >ECs2585 hypothetical protein MANWQSIDELQDIASDLPRFTHALDELSRRLGLNITPLTADHISLRCHQN ATAERWRRGFEQCGELLSENMINGRPICLFKLHEPVQVAHWQFSIVELPW PREKRYPHEGWEHIEIVLPGDPETLNARALALLSDEGLSLPGISVKTSSP KGEHERLPNPTLAVTDGKTTIKFHPWSIEEIVASEQSA >ECs0273 hypothetical protein MKKAAILIDAGFFMQRVHATHRKHFAEHELTAQCIMKVIWSMVLSHLNGK RQSQERREPLELYRIYFYDCPPLDIQTRLPLPEPGNKTPGRKNFKLEKSY ILRTELHEELRKTRKTPIVFILKLIGATH >ECs0285 hypothetical protein MDITEFPSGVIEHLGWYVYRLIDPRDGSTFYVGKGKGNRVFAHMRGEVAA ADDDDLLSNKLKQIREIRLAGLEVIHVIHRHGMTDEKTAYEVEAALIDAY PGLTNIMNGAGSNEFGAAHVKELIATYQPETITFHHKALMISVNRSAKDS ELYDAVRFSWRINVSRASKAEVILATVRGIVRGVFIADKWLKSTREHFPT MKYWDEDPDFEATQSSRYGFEGREAPPEIANLYLGKKIPDELRKKGAMSP VRYSPNF >ECs3925 hypothetical protein MGIYHRSRKTKMKRTKSIRHASFRKNWSARHLTPVALAVATVFMLAGCEK SDETVSLYQNADDCSAANPGKSAECTTAYNNALKEAERTAPKYATREDCV AEFGEGQCQQAPAQAGMAPENQAQAQQSSGSFWMPLMAGYMMGRLMGGGA GFAQQPLFSSKNPASPAYGKYTDATGKNYGAAQPGRTMTVPKTAMAPKPA TTTTVTRGGFGESVAKQSTMQRSATGTSSRSMGG >ECs4391 hypothetical protein MLYIDKATILKFDLEMLKKHRRAIQFIAVLLFIVGLLCISFPFVSGDILS TVVGALLICSGIALIVGLFSNRSHNFWPVLSGFLVAVAYLLIGYFFIRAP ELGIFAIAAFIAGLFCVAGVIRLMSWYRQRSMKGSWLQLVIGVLDIVIAW IFLGATPMVSVTLVSTLVGIELIFSAASLFSFASLFVKQQ >ECs2473 hypothetical lipoprotein MESSVNKAPSLIAAIVLGLGISACGYFVGDGVKHLKTNNRYVNVRGLSEK EVRADTAELTIAINFKGNVPGELFPKLEEAQKKIVAELNAQGINEKEIIL GQWTSKRTDSFYLKDDPTMPRYNADGSVTIKTHNVAAVEKVVAKLNELQV ATDGAIAESKVAYRFNGIGALRAEMIAAATKDARNAALQFATDSGSQVGS ISDASQGVFQIFASGSDEDDPTAINKTVRVVTTVTYALQD >ECs0669 hypothetical protein MKTKLNELLEFPTPFTYKVMGQALPELVDQVVEVVQRHAPGDYTPTVKPS SKGNYHSVSITINATHIEQVETLYEELGKIDIVRMVL >ECs4106 hypothetical protein MFMTWEYALIGLVVGIIIGAVAMRFGNRKLRQQQALQYELEKNKAELDEY REELVSHFARSAELLDTMAHDYRQLYQHMAKSSSSLLPELSAEANPFRNR LAESEASNDQAPVQMPRDYSEGASGLLRTGAKRD >ECs1050 hypothetical protein MIASKFGIGQQVRHSLLGYLGVVVDIDPVYSLSEPSPDELAVNDELRAAP WYHVVMEDDNGLPVHTYLAEAQLSSELQDEHPEQPSMDELAQTIRKQLQA PRLRN >ECs3535 hypothetical protein MTTLRQPYYELSPAVYNALVQAKTALENSTLDTTLMELIYLRVSQINGCA FCLEMHSKALRKSGVPQHKLDALAGWRVSHHFDERERAALAWAESVTDIA RTHAEDEVYQPLLEHFSAAEISDLTFAIGLMNCFNRLAVSMRM >ECs3906 hypothetical protein MKKFAAVIAVMALCSAPVMAAEQGGFSGPSATQSQAGGFQGPNGSVTTVE SAKSLRDDTWVTLRGNIVERISDDLYVFKDASGTINVDIDHKRWNGVTVT PKDTVEIQGEVDKDWNSVEIDVKQIRKVNP >ECs4710 hypothetical protein MAINFSPKVGEILECNFGNYPVSQNGPFSTTYYDGRIPPEMIKNRLVVVL NGKINGNAFIVVPLSTTRDHDKLKRGMHVEIASNVINDLQFFDQQIRWAK TDLVQQVSRNRLNRARTYRGYLNQCLPHELVADIQRAVIKSINAISLIN >ECs0217 hypothetical protein MNSNVLTQTIVTGSDPRGLPEFSAIREEINKASHPSQPELNWKLVESLAL AIFKANGVDLHTATYYTLARTRTQGLAGFCEGAELLAAMVSHDWDKFWPQ GGPARTEMLDWFNSRTGNILRQQISFAESDLPLIYRTERALQLICDKLQQ VELKRVPRVENLLYFMQNTRKRLEPQLKSNTENAAQTTVRTLIYAPETQA SSTPEAVVPPLPGLPEMKVEVRSLTENPPQASVIKQGSTVRGFIAGIACS VAVASALWWWQVYPVQQQLLQVNDTAQGAATVWMASPELENYERRLQQLL DTSPVQPLETGMQMMRVADSRWPESLQQQQASTQWNEALKTRAQSSPQLR GWLQTRQDLHAFADLVMQREKEGLTLSYIKNVIWQAERGLGQETPVESLL TQYQDARAQKQNTDALEKQINERLEGVLSRWLLLKNNTIPTIKKALNFNN IHEYKGVLNGEFNLFNTKW >ECs3634 hypothetical protein MARSQANRASSDLQQTPGDEQKLQAWQQAQAQVTRTLGQLTIISERYPEL KSQELYQNLMVQLEGSENRIAVARGRYIKAIEQYNVTIRKFPAVLTAKVM DYTPKKNYLPDDVAAVSKAPTIDFSQNANAH >ECs0797 hypothetical protein MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGI GGGNPLTSKVAIISRSSDPRADVDYLFAQVIVHEKRVDTTPNCGNMLSGV GAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARID GVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVI IPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPK PVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIV PSVGYGNINIEHPSGGLDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >ECs2288 hypothetical protein MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLT LHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGPLIALCGMLI IVVGWGRT >ECs0161 hypothetical protein MLVYWLDIVGTAVFAISGVLLAGKLRMDPFGVLVLGVVTAVGGGTIRDMA LDHGPVFWVKDPTDLVVAMVTSMLTIVLVRQPRHLPKWMLPVLDAVGLAV FVGIGVNKAFNAEAGPLIAVCMGVITGVGGGIIRDVLAREIPMILRTEIY ATACIIGGIVHATAYYTFSVPLETASMMGMVVTLLIRLAAIRWHLKLPTF ALDENGR >ECs4113 hypothetical protein MDIFSIANQHIRFAVKLATAIVLALFVGFHFQLETPRWAVLTAAIVAAGP AFAAGGEPYSGAIRYRGFLRIIGTFIGCIAGLVIIIAMIRAPLLMILVCC IWAGFCTWISSLVRIENSYAWGLAGYTALIIVITIQPEPLLTPQFAVERC SEIVIGIVCAIMADLLFSPRSIKQEVDRELESLLVAQYQLMQLCIKHGDG EVVDKAWGDLVRRTTALQGMRSNLNMESSRWARANRRLKAINTLSLTLIT QSCETYLIQNTRPELITDTFREFFDTPVETAQDVHKQLKRLRRVIAWTGE RETPVTIYSWVAAATRYQLLKRGVISNTKINATEEEILQGEPEVKVESAE RHHAMVNFWRTTLSCILGTLFWLWTGWTSGSGAMVMIAVVTSLAMRLPNP RMVAIDFIYGTLAALPLGLLYFLVIIPNTQQSMLLLCISLAVLGFFLGIE VQKRRLGSMGALASTINIIVLDNPMTFHFSQFLDSALGQIVGCVLAFTVI LLVRDKSRDRTGRVLLNQFVSAAVSAMTTNVARRKENHLPALYQQLFLLM NKFPGDLPKFRLALTMIIAHQRLRDAPIPVNEDLSAFHRQMRRTADHVIS ARSDDKRRRYFGQLLEELEIYQEKLCIWQAPPQVTEPVHRLAGMLHKYQH ALTDS >ECs0837 putative tail length tape measure protein precursor MDQIANLVIDLGIDAAEFKNEIPRIKNLLNGAASDAERSSARMQRFMERQ TQAARQTMQAASSAATAASVHAQTVEKSAQAHERMAREVEQTRQRMEALS QKMREEQAQAMALAEAQDKAAAAFYRQIDSVKQAGAGLQELQRIQQQIRQ ARNSGGIGQQDYLALISEVTAKTRVLTQAEEEATRQKVAFIRQLKEQATR QNLSSSELLRAKAAQLGVSSAAEVYIRKMEQAGKATHSLGLKSAAARQEI GVLIGELARGNLGALRGSGITLANRAGWIDTLMSPKGMMPGAVIGGIAAA VYGLGKAWYDGQKEGEEFNRQLSLTGHYAGVTAGQLWTLSRAISGNGITQ HAAAGALAQVVGSGAFRGNDIGMVARAAAQMERSVGQSVSDTINQFKRLK DDPVNAAKALDNELHFLTATQLEQIRVLGEQGRSSDAARIAMSALAEETG RRTADIDNNLNALGSTLKYLSDLWSRFWDAAMNIGREDSLDEQIAALQEK VSRAKRLPWTASSSQVEYDQQRLNDLQEKKRQKDLQDAKEQAERNYQEQQ KRRNAENAALNRMNETEAARHQREIARINAMQYADQAVRDAAIQRENERY EKALASGKKKTRETRNDEATRLLLQYSQQQAQVEGQIAAARQSAGIATER MTEARKQLLALQQRISDLDGKKLTADEKSVLARKDELIQALTLLDVKQQE LQKQTALNELKKKTIQLTSQLAEEERAQRQQHDLDIATVGMGDQQRQRYQ VQLSLRQKYQQQLEQLRRDSEQKGTYNTDDYRKAEQALTESLNRQLNENR RYWQQLEVVQGNWKNGVLRAFQDFTVDADNTAETAEQVFSSAFSNMGNGL ATFVTTGKLNFKSFTSSVLSDMAKILAQATMMKSIKGIGSVLGFDLSSLS LNANGGIYQSADLSRYSGTVVNRPTFFAFAKGAGVMGEAGPEAILPLRRG ADGKLGVVADIGGSGMAMFSPQYNIEINNDGTNGQIGPAALKAVYDLGKK AAADFMQQQARDGGRLSGAYR >ECs5530 hypothetical protein MSRYQHTKGQIKDNAIEALLHDPLFRQRVEKNKKGKGSYMRKGKHGNRGN WEASGKKVNHFFTTGLLLSGAC >ECs4981 hypothetical protein MTTRKKKTAVSEAAVMEAIREALEGADPRTAGLTEQLAKGYVDLLDGLPF GETREYRVTFRELTAKDSIDAEAEAERVVETNNGPMLIASPSLRGVALLR RQIAAVGDIEGPLSPRQIGQLSERDLSRLMAAVSLLDTALAGKLAADRGR SGAVSGSD >ECs1653 hypothetical protein MNRIEHYHDWLRDAHAMEKQAEKMLESMASRIENYPELRSRIEQHISETK NQLSQLESILDRNNISRSVIKDSMSKMAAFGQSIGGIFPSDEIVKGSISG YVFEQFEIACYTSLLAAAKNAGDTASVPIIEAILNEEKQMAEWLLNHIPD TTEQFMVRSEIDGVEAKK >ECs3255 hypothetical protein MKRLIMATMVTAILASSTVWAADNAPVAAQQQTQQVQQTQKTAAAAERIS EQGLYAMRDVQVARLALFHGDPEKAKELTNEASALLSDDSTEWAKFAKPG KKTNVNDDQYIVINASVGISESYVATPEKEAAIKIANEKMAKGDKKGAME ELRLAGVGVMENQYLMPLKQTRNALADAQKLLDKKQYYEANLALKGAEDG IIVDSEALFVN >ECs4617 hypothetical protein MKISRLGEAPDYRFSLANERTFLAWIRTALGFLAAGVGLDQLAPDFATPV IRELLALLLCLFSGGLAMYGYLRWLRNEKAMRLKEDLPYTNSLLIISLIL MVVAVIVMGLVLYAG >ECs4320 hypothetical protein MNVFSQTQRYKALFWLSLFHLLVITSSNYLVQLPVSIFGFHTTWGAFSFP FIFLATDLTVRIFGAPLARRIIFAVMIPALLISYVISSLFYMGSWQGFGA LAHFNLFVARIATASFMAYALGQILDVHVFNRLRQSRRWWLAPTASTLFG NVSDTLAFFFIAFWRSPDAFMAEHWMEIALVDYCFKVLISIVFFLPMYGV LLNMLLKRLADKSEINALQAS >ECs2504 hypothetical protein MGILSWIIFGLIAGILAKWIMPGKDGGGFFMTILLGIVGAVVGGWISTLF GFGKVDGFNFGSFVVAVIGAIVVLFIYRKIKS >ECs1852 hypothetical protein MKYLLIFLLVLAIFVISVTLGAQNDQQVTFNYLLAQGEYRISTLLAVLFA AGFAIGWLICGLFWLRVRVSLVRAERKIKRLENQLSPATDVAVVQHSSAA KE >ECs3921 hypothetical protein MKRYTPDFPEMMRLCEMNFSQLRRLLPRNDAPGETVSYQVANAQYRLTIV ESTRYTTLVTIEQTAPAISYWSLPSMTVRLYHDAMVAEVCSSQQIFRFKA RYDYPNKKLHQRDEKHQINQFLADWLRYCLAHGAMAIPVY >ECs5142 hypothetical protein MTDHTMKKNPVSIPHTVWHADDIRRGEREAADALGLTLYELMLRAGEAAF QVCRSAYPDARHWLVLCGHGNNGGDGYVVARLAKAVGIEVTLLAQESDKP LPEEAALAREAWLNAGGEIHASNIVWPESVALIVDALLGTGLQQAPRESI SQLIDHANSHPAPIVAVDIPSGLLAETGATPGAVINADHTITFIALKPGL LTGKARDVTGQLHFDSLGLDSWLAGQETKIQRFSAEQLSHWLKPRRPTSH KGDHGRLVIIGGDHGTAGAIRMTGEAALRAGAGLVRVLTRSENIAPLLTA RPELMVHELTMDSLTESLEWADVVVIGPGLGQQEWGKKALQKVENFRKPM LWDADALNLLAINPDKRHNRVITPHPGEAARLLGCSVAEIESDRLHCAKR LVQRYGGVAVLKGAGTVVAAHPDALGIIDAGNAGMASGGMGDVLSGIIGA LLGQKLSPYDAACAGCVAHGAAADVLAARFGTRGMLATDLFSTLQRIVNP EVTDKNHDESSNSAP >ECs3226 hypothetical protein MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEK ARSVESEPCKISPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR >ECs2042 hypothetical protein MRETVEIMRYPVTLTPAPEGGYMVSFVDIPEALTQGETVAEAMEAAKDAL LTAFDFYFEDNELIPLPSPLNSHDHFIEVPLSVASKVLLLNAFLQSEITQ QELARRIGKPKQEITRLFNLHHATKIDAVQLAAKALGKELSLVMV >ECs1751 hypothetical protein MHGLNKDIIFPLQIFALSNNVLYNFPEQGVVPVLYVIYAQDKADSLEKRL SVRPAHLARLQLLHDEGRLLTAGPMPAVDSNDPGAAGFTGSTVIAEFESL EAAQAWADADPYVAAGVYEHVSVKPFKKVF >ECs0989 hypothetical protein MKAFDLHRMAFDKVPFDFLGEVALRSLYTFVLVFLFLKMTGRRGVRQMSL FEVLIILTLGSAAGDVAFYDDVPMVPVLIVFITLALLYRLVMWLMAHSEK LEDLLEGKPVVIIEDGELAWSKLNNSNMTEFEFFMELRLRGVEQLGQVRL AILETNGQISVYFFEDDKVKSGLLILPSDCTQRYKVVPESADYACIRCSE IIHMKAGEKQLCPRCANPEWTKASRAKRVT >ECs5400 hypothetical protein MGVYKARRFSQSTKKLGIHDKVLMAAAEEVMQGIWEADLGSGVIKKRLPL QQGKSGGARTIIFFKSANHVFFYDGWSKSGLSSKGSKEIEDDELAAYKKM ANAFLAFSNKQIEDLIETGFLIEVKNER >ECs0963 hypothetical protein MVKSTSCTTIDFMNMSQLTERTFTSSESLSSLSLFLSLARGQCRPGKFWH RRSFRQKFLLRSLIMPRLSVEWMNELSHWPNLNVLLTRQPRLPVRLHRPY LAANLSRKQLLEALRYHYALLRGCMSAEEFSLYLNTPGLQLAKLEGKNGE QFTLELTMMISMDKEGDSTILFRNSEGIPLAEITFTLCEYQGKRTMFIGG LQGAKWEIPHQEIQNATKACHGLFPKRLVMEAACLFAQRLQVEQIIAVSN ETHIYRSLRYRDKEGKIHADYNAFWESVGGVCDAERHYRLPAQIARKEIA EIASKKRAEYRRRYEMLDAIQPQMATMFRG >ECs0509 hypothetical protein MKYVDGFVVAVPADKKDAYREMAAKAAPLFKEFGALRIVECWASDVPDGK VTDFRMAVKAEENEEVVFSWIEYPSKEVRDAANQKMMSDPRMKEFGESMP FDGKRMIYGGFESIIDE >ECs5028 hypothetical protein MNKDEAGGNWKQFKGKVKEQWGKLTDDDMTIIEGKRDQLVGKIQERYGYQ KDQAEKEVVDWETRNEYRW >ECs5063 hypothetical protein MSALNSLPLPVVRLLAFFHEELSERRPGRVPQTVQLWVGCLLVILISMTF EIPFVALSLAVLFYGIQSNAFYTKFVAILFVVATVLEIGSLFLIYKWSYG EPLIRLIIAGPILMGCMFLMRTHRLGLVFFAVAIVAIYGQTFPAMLDYPE VVVRLTLWCIVVGLYPTLLMTLIGVLWFPSRAISQMHQALNDRLDDAISH LTDSLAPLPETRIEREALALQKLNVFCLADDANWRTQSAWWQSCVATVTY IYSTLNRYDPTSFADSQAIIEFRQKLASEINKLQHAITEGQCWQSDWRIS ESEAMAARECNLENICQTLLQLGQMDPNTPPTPAAKPPSMVADAFTNPDY MRYAVKTLLACLICYTFYSGVDWEGIHTCMLTCVIVANPNVGSSYQKMVL RFGGAFCGAILALLFTLLVMPWLDNIVELLFVLAPIFLLGAWIATSSERS SYIGTQMVVTFALATLENVFGPVYDLVEIRDRALGIIIGTVVSAVIYTFV WPESEARTLPQKLAGALGMLSKVMRIPRQQEVTALRTYLQIRIGLHAAFN ACEEMCQRVALERQLDSEERALLIERSQTVIHQGRDLLHAWDATWNSAQA LDNALQPDKAGQFADALEKYAAGLATALSRSPQITLEETPASQAILPTLL KQEQHVCQLFARLPDWTAPALTPATEQAQGATQ >ECs2869 hypothetical protein MRKKLAFLDTSLDDLRAFPESSRQEIGYQLDRIQQGLNPYDWKPFSTIGP GVREIRTRDADGIYRVMYVAKFEEAVYVLHCFQKKTQTTSQSDIDLAKRR YKELVQERKNEN >ECs5288 hypothetical protein MGIVMTQQGDAVAGELATEKVGIKGYLAFFLTIIFFSGVFSGTDSWWRVF DFSVLNGSFGQLPGANGATTSFRGAGGAGAKDGFLFALELAPSVILSLGI ISITDGLGGLRAAQQLMTPVLKPLLGIPGICSLALIANLQNTDAAAGMTK ELAQEGEITERDKVIFAAYQTSGSAIITNYFSSGVAVFAFLGTSVIVPLA VILVFKFVGANILRVWLNFEERRNPTQGAQA >ECs5234 hypothetical protein MAQVINEMDVPSHSFVFHGTGERYFLICVVNVLLTIITLGIYLPWALMKC KRYLYANMEVNGQRFSYGITGGNVFFSCLVFVFFYFAILMTVSADMPLIG CVLTLSLLVLLIFMAAKGLRYQALMTSLNGVRFSFNCSMKGVWWVTFFLP ILMAIGMGTVFFISTKMLHANSSSSVIVSVVLMAIVGIVSIGIFNGTLYS LVMSFLWSNTSFGIHRFKVKLDTAYCIKYAILAFLALLPFLAVAGYIIFD QILNAYDSSVYANDDIENLQQFMEMQRKMIIAQLIYYFGIAVSTSYLTVS LRNHFMSNLSLNDGRIRFRSTLTYHGMLYRMCALVVISGITGGLAYPLLK IWMIDWQAKNTYLLGDLDDLPLINKEEQPDKGFLASISRGIMPSLPFL >ECs3769 hypothetical protein MDINNKARIHWACRRGMRELDISIMPFFEHEYDSLSDDEKRIFIRLLECD DPDLFNWLMNHGKPADAELEMMVRLIQTRNRERGPVAI >ECs3449 hypothetical protein MTENAVLQLRAERIARATRPFLARGNRVRRCQRCLLPEKLCLCSTITPAQ AKSRFCLLMFDTEPMKPSNTGRLIADILPDTVAFQWSRTEPSQDLLDLVQ NPDYQPMVVFPASYADEQREVIFTPPAGKPPLFIMLDGTWPEARKMFRKS PYLDNLPVISVDLSRLSAYRLREAQAEGQYCTAEVAIALLDMAGDTGAAA GLGEHFTRFKTRYLAGKTQHLGSITAEQLESV >ECs1048 hypothetical protein MKTGIVTTLIALCLPVSVFATTLRLSTDVDLLVLDGKKVSSSLLRGADSI ELDNGPHQLVFRVEKTIHLSNSEERLYISPPLVVSFNTQLINQVNFRLPR LENEREANHFDAAPRLELLDGDATPIPVKLDILAITSTAKTIDYEVEVER YNKSAKRASLPQFATMMADDSTLLSGVSELDAIPPQSQVLTEQRLKYWFK LADPQTRNTFLQWAEKQPSS >ECs4444 hypothetical protein MQPKIYWIDNLRGIACLMVVMIHTTTWYVTNAHSVSPVTWDIANVLNSAS RVSVPLFFMISGYLFFGERSAQPRHFLRIGLCLFFYSAIALLYIALFTSI NVELALKNLLQKPVFYHLWFFFAIAVIYLVSPLIQVKNVGGKMLLVLMVV IGIIANPNTVPQKIDGFEWLPINLYINGDTFYYILYGMLGRALGMMDTQH KALSWVSAALFATGVFIISRGTLYELQWRGNFADTWYPYCGPMVFICAIA LLTLVKNTLDTRTIRGLGLISRHSLGIYGFHALIIHALRTRGIELKNWPI LDIIWIFCATLAASLLLSMLVQRIDRNRLVS >ECs3772 hypothetical protein MQPNDITFFQRFQDDILAGRKTITIRDESESHFKTGDVLRVGRFEDDGYF CTIEVTATSTVTLDTLTEKHAEQENMTLTELKKVIADIYPGQTQFYVIEF KCL >ECs5299 hypothetical protein MNKLIELRRAKMLALSLLLIAAATFVVTLFLPPNFWVSGVKAIAEAAMVG ALADWFAVVALFRRVPIPIISRHTAIIPRNKDRIGENLGQFVQKKFLDTQ SLVALIRRHEPALLIGNWFSQPENARRVGQHLLQIMSGFLELTDDARIQR LLKRAVHRAIDKVDLSGTSALMLESMTKNDRHQVLLDTLIAQLIALLQRD KSRKFIAQQIVRWLESEHPLKAKILPTEWLGEHSAELVSDAVNSLLDDIS RDRAHQIRHAFDRATFALIDKLKNDPEMTARADAVKSYLKEDEAFLSELW GDLREWLKADINSEDSRVKERIARAGQWFGETLIADDALRASLNGHLEQA AHRVAPEFSAFLTRHISDTVKSWDARDMSRQIELNIGKDLQFIRVNGTLV GGCIGLILYLLSQLPALFPLGNF >ECs1114 putative tail length tape measure protein MSQPVGDLVIDLSLDAVRFDEQMSRVRRHFSGLDTDVRKTASAVEQGLSR QALAAQKAGISVGQYKAAMRTLPAQFTDIATQLAGGQNPWLILLQQGGQV KDSFGGMIPMFRGLAGAITLPMVGVTSLAVATGALVYAWYQGDSTLSAFN KTLVLSGNQSGLTADRMLTLSRAGQAAGLTFNQARESLAALVNAGVRGGE QFDAINQSVARFASASGVEVDKVAEAFGKLTTDPTSGLIAMVRQFRNVTA EQIAYVAQLQRSGDEAGALQAANDIATKGFDEQTRRLKENMGTLETWADK TGKAFKSMWDAILDIGRPESSADMLASAQKAFDEADKKWQWYQSRSQRRG KTASFRANLQGAWNDRENARLGLAAATLQSDMEKAGELAARDRAERDASQ LKYTGEAQKAYERLLTPLEKYTARQEELNKALKDGKILRADYNTLMAAAK KDYESTLKKPKSSGVKVSAGERQEDQAHAALLALETELRTLEKHSGANEK ISQQRRDLWKAENQYAVLKEAATKRQLSEQEKFLLAHKDETLEYKRQLAE LGDKVEHQKRLNELAQQAVRFEEQQSAKQAAISAKARGLTDRQAQRESEA QRLRDVYGDNPAALAKATSALKNTWSAEEQLRGSWMAGLKSGWGEWAESA TDSFSQVKSAATQTFDGIAQNMAAMLTGAEADWRGFTRSVLSMLTEIFLK QAMVGIVGSIGSAIGGAFGGGASASTGTAIQAAAANFHFATGGFTGTGGK YEPAGIVHRGEFVFTKEATSRIGVGNLYRLMRGYAEGGYVGGAGSPAQMR RAEGINFNQNNHVVIQNDGTNGQAGPQLMKAVYDMARKGAQDELRLQLRD GGMLSGSGR >ECs1244 hypothetical protein MKKVLIAALISGVSFGAFAQQGGFQGPEAERSTVAQAKELKDDAWVILEG SIVKKVGDERYEFRDNSGTIVTDIDDSIWAGQNVSPKDKVRIEGEIDKDL SSVEVDVKALKLLK >ECs3036 vancomycin sensitivity MLKRVFLSLLVLIGLLLLTVLGLDRWMSWKTAPYIYDELQDLPYRQVGVV LGTAKYYRTGVINQYYRYRIQGAINAYNSGKVNYLLLSGDNALQSYNEPM TMRKDLIAAGVDPSDIVLDYAGFRTLDSIVRTRKVFDTNDFIIITQRFHC ERALFIALHMGIQAQCYAVPSPKDMLSVRIREFAARFGALADLYIFKREP RFLGPLVPIPAMHQVPEDAQGYPAVTPEQLLELQKKQGK >ECs4699 hypothetical protein MAESFTTTNRYFDNKHYPRGFSRHGDFTIKEAQLLERHGYAFNELDLGKR EPVTEEEKLFVAVCRGEREPVTEAERVWSKYMTRIKRPKRFHTLSGGKPQ VEGAEDYTDSDD >ECs3374 putative dehydrogenase MQLRKLLLPGLLSVTLLSGCSLFNSEEDVVKMSPLPTVENQFTPTTAWST SVGSGIGNFYSNLHPALADNVVYAADRAGLVKALNADDGKEIWSVSLAEK DGWFSKEPALLSGGVTVSGGHVYIGSEKAQVYALNTSDGTVAWQTKVAGE ALSRPVVSDGLVLIHTSNGQLQALNEADGAVKWTVNLDMPSLSLRGESAP ATAFGAAVVGGDNGRVSAVLMEQGQMIWQQRISQATGSTEIDRLSDVDTT PAVVNGVVFALAYNGNLTALDLRSGQIMWKRELGSVNDFIVDGNRIYLVD QNDRVMALTIDGGVTLWTQSDLLHRLLTSPVLYNGNLVVGDSEGYLHWIN VEDGRFVAQQKVDSSGFQTEPVAADGKLLIQAKDGTVYSITR >ECs1643 tail length tape measure protein precursor MAEPVGDLVVDLSLDAARFDEQMARVRRHFSGTESDAKKTAAVVEQSMSR QALAAQKAGISVGQYKAAMRMLPAQFTDVATQLAGGQSPWLILLQQGGQV KDSFGGMIPMFRGLAGAITLPMVGATSLAVATGALAYAWYQGNSTLSDFN KTLVLSGNQAGLTADRMLVLSRAGQAAGLTFNQTSESLSALVKAGVSGEA QIASISQSVARFSSASGVEVDKVAEAFGKLTTDPTSGLTAMARQFHNVTA EQIAYVAQLQRSGEEAGALQAANEAATKGFDDQTRRLKENMGTLETWADR TARAFKSMWDAVLDIGRPDTAQEMLIKAEAAFKKADDIWNLRKDDYFVND EARARYWDDREKARLALEAARKKAEQQSQQDKNAQQQSDTEASRLKYTEE AQKAYERLQTPLEKYTARQEELNKALKDGKILQADYNTLMAAAKKDYEAT LKKPKQSGVKVSAGDRQEDSAHAALLTLQAELRMLEKHAGANEKISQQRR DLWKAESQFAVLEEAAQRRQLSAQEKSLLAHKDETLEYKRQLAVLGDKVT YQEHLNALAQQADKFAQQQRAKRAAIDAKNRGLTDRQAAREATEQRLKEQ YGDNPLALNNVMSEQKKTWAAEDQLRGSWMAGLRSGWSEWEESATDSMSQ VKSAATQTFDGIAQNMAAMLTGSEQNWRSFTRSVLSMMTEILLKQAMVGI VGSIGSAIGGGASASGGTAIQAAAAKFHFAAGGFTGTGGKYEPAGIVHRG EFVFTKEATSRIGVGNLYRLMRGYATGGYVGTPGSMADSRSQASGTFEQN NHVVINNDGTNGQIGPQALKAVYDVARKAAMDVVTGQMRDGGLFSGGGR >ECs2236 putative tail assembly protein MATTNAFSLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQVPG FRRQMNEGWYQIRIAGYDTAPEAVYARLHEQLGEGTVIHIVPRLAGAGKG GLQIVLGAAAIVGSFFTAGASMALWGSALAAGGFSATTMLFSLGASMILG GVAQMLAPKAKTPDYRATDNGRQNTYFSSLDNMIAQGNPMPVPYGEMLVG SRRISQDISTRDEGGGGTVVVVGRQG >ECs4459 hypothetical protein MNISEVDLHKLTVSDPFLGQYQQLVRDVVIPYQWDALNDRIPEAEPSHAI ENFRIAAGLQEGEFYGMVFQDSDVAKWLEAVAWSLCQKPDAELEKTADEV IELIASAQCEDGYLNTYFTVKAPEERWSNLAECHELYCAGHLIEAGVAFF QATGKRRLLGVVCRLADHIDSVFGPDESKLHGYPGHPEIELALMRLYEVT EEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAY SQAHLPLAQQQTAIGHAVRFVYLMTGVAHLARLSHDDSKRQDCLRLWNNM AQRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAESCASIGLMMFARRMLE MEGDSQYADVMERALYNTVLGGMALDGKHFFYVNPLEVHPKSLKFNHIYD HVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPREDALYINIYAGNSMEV PVENGTLRLRVSGNYPWQEQVTIAVESPQPVRHTLALRLPDWCTQPQIIL NGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVRRVYGNPLVRHVAGKV AIQRGPLVYCLEKADNGESLHNLWLPTDAPFTTFEGKGLFSHKILIQAPG YRYEQSNPEQQPLWHYDSAPAKRQTQTLTFIPWFSWANRGEGEMRIWVNE EKHCHP >ECs2031 hypothetical protein MFPEYRDLISRLKNENPRFMSLFDKHNKLDHEIARKEGSDDRGYNAEVVR MKKQKLQLKDEMLKILQHESVKEV >ECs1676 hypothetical protein MAEHLMSDVPFWQSKTLDEMSDAEWESLCDGCGQCCLHKLMDEDTDEIYF TNVACRQLNIKTCQCRNYERRFEFEPDCIKLTRENLPTFEWLPMTCAYRL LAEGKDLPAWHPLLTGSKAAMHGERISVRHIAVKESEVIDWQDHILNKPD WAQ >ECs3886 hypothetical protein MERFLENAMYASRWLLAPVYFGLSLALVALALKFFQEIIHVLPNIFSMAE SDLILVLLSLVDMTLVGGLLVMVMFSGYENFVSQLDISENKEKLNWLGKM DATSLKNKVAASIVAISSIHLLRVFMDAKNVPDNKLMWYVIIHLTFVLSA FVMGYLDRLTRHNH >ECs4988 hypothetical protein MAVTLTPHQRALLQLLPDGLAWDKRPSSVLAALCLGLSHSTERVSWTGNQ MLAERFPDSSRLLLEDWERYLGLPECDMTGATIQERQRYAGNKYRMKPSL NREFYIRFAAEFGYEIDIQPSPDSQWVSIVTINSETGYRNMNVLDDILTP LRIYEGGALECILNRYKPAWQTFIYVYANSHEEENI >ECs2056 putative receptor MHLRHLFSLRLRGSLLLGSLLVASSFSTQAAEEMLRKAVGKGAYEMAYSQ QENALWLATSQSRKLDKGGVVYRLDPVTLEVTQAIHNDLKPFGATINNTT QTLWFGNTVNSAVTAIDAKTGEVKGRLVLDDRKRTEEVRPLQPRELVADD ATNTVYISGIGKDSVIWVVDGENIKLKTAIQNTGKMSTGLALDSKGKRLY TTNADGELITIDTADNKILSRKKLLDDGKEHFFINISLDTARQRAFITDS KAAEVLVVDTRNGNILAKVAAPESLAVLFNPARNEAYVTHRQAGKVSVID AKSYKVVKTFDTPTHPNSLALSADGKTLYVSVKQKSTKQQEATQPDDVIR IAL >ECs4591 hypothetical protein MKRKPLIAFFIAIFIALTILILFPFNCTYKKSPQEHSLSFIKQLPNTVNL SSLTYNKEDDFLYATQNSPAQLLKITKSGDIMDRAPLPFISDAETIEHIQ GNIFAAVDEKTSELFFFTVTKDMHISFRNKIQLEKFNKKNRGFEGLAWKA DDRMLFVAKERRPSKIFIYQLSPDLLSVKQATIPEALNDIRVNDISGLAF NNESLMILSDESRKLLKFNLTEMSFIEMLDLTKGNHSLTSDLPQPEGIVT LPDESIYVASEPDILAKFVPNK >ECs2165 putative minor tail protein MKTFRWKVKPDMEVNSQPSVREVRFGDGYSQRMAAGLNADLKTYRVTLSV TREEARHLEAFLAEHGGWKAFLWKPPYAYRQIKVTCAGWSARVGMLRVEF SAEFKQVVN >ECs2520 hypothetical protein MFAGLPSLTHEQQQKAVERIQELMAQGMSSGQAIALVAEELRANHSGERI VARFEDEDE >ECs0006 hypothetical protein MLILISPAKTLDYQSPLTTTRYTLPELLDNSQQLIHEARKLTPPQISTLM RISDKLAGINAARFHDWQPDFTPENARQAILAFKGDVYTGLQAETFSEDD FDFAQQHLRMLSGLYGVLRPLDLMQPYRLEMGIRLENARGKDLYQFWGDI ITNKLNEALAAQGDNVVINLASDEYFKSVKPKKLNAEIIKPVFLDEKNGK FKIISFYAKKARGLMSRFIIENRLTKPEQLTGFNSEGYFFDEASSSNGEL VFKRYEQR >ECs5013 phosphate-starvation-inducible protein PsiE MTSLSRPRVEFISTILQTVLNLGLLCLGLILVVFLGKETVHLADVLFAPE QASKYELVEGLVVYFLYFEFIALIVKYFQSGFHFPLRYFVYIGITAIVRL IIVDHKSPLDVLIYSAAILLLVVTLWLCNSKRLKRE >ECs0777 hypothetical protein MSSNFRHQLLSLSLLVGIAAPWAAFAQAPISSVGSGSVEDRVIQLERISN AHSQLLTQLQQQLSDNQSDIDSLRGQIQENQYQLNQVVERQKQILLQIDS LSSGGAAAQSTSGDQSGAAASTTPTADAGTANAGAPVKSGDANTDYNAAI ALVQDKSRQDDAMVAFQNFIKNYPDSTYLPNANYWLGQLNYNKGKKDDAA YYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKAVYQQVISKYPGT DGAKQAQKRLNAM >ECs2197 hypothetical protein MKKVLIAALISGVSFGAFAQQGGFQGPEAERSTVAQAKELKDDAWVILEG SIVKKVGDERYEFRDNSGTIVTDIDDSVWAGQNVSPKDKVRIEGEIDKDL SSVEVDVKALKLLK >ECs0735 hypothetical protein MKNTELEQLINEKLNSAAISDYAPNGLQVEGKETVQKIVTGVTASQALLD EAVRLGADAVIVHHGYFWKGESPVIRGMKRNRLKTLLANDINLYGWHLPL DAHPELGNNAQLAALLGITVMGEIEPLVPWGELTMPVPGLELASWIEARL GRKPLWCGDTGPEVVQRVAWCTGGGQSFIDSAARFGVDAFITGEVSEQTI HSAREQGLHFYAAGHHATERGGIRALSEWLNENTDLDVTFIDIPNPA >ECs2809 hypothetical protein METTKPSFQDVLEFVRLFRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSD YIKPGMSVEAIQGIIASMKGDYEDRVDDYIIKNAELSKERRDISKKLKAM GEMKNGEAK >ECs5108 hypothetical protein MDQALLDGGYRCYTGEKIDVYFNTAICQHSGNCVRGNGKLFNLKRKPWIM PDEVDVATVVKVIDTCPSGALKYRHK >ECs1748 hypothetical protein MDMDLNNRLTEDETLEQAYDIFLELAADNLDPADVLLFNLQFEERGGAEL FDPAEDWQEHVDFDLNPDFFAEVVIGLADSEDGEINDVFARILLCREKDH KLCHIIWRE >ECs2696 putative methyl-independent mismatch repair protein MLLAGSSLLTLLDDIATLLDDISVMGKLAAKKTAGVLGDDLSLNAQQVSG VRANRELPVVWGVAKGSLINKVILVPLALIISAFIPWAITPLLMIGGAFL CFEGVEKVLHMLEARKHKEDPAQSQQRLEKLAAQDPLKFEKDKIKGAIRT DFILSAEIVAITLGIVAEAPLLNQVLVLSGIALVVTVGVYGLVGVIVKID DLGYWLAEKSSALMQALGKGLLIIAPWLMKALSIVGTLAMFLVGGGIVVH GIAPLHHAIEHFAGQQSAVVAMILPTVLNLILGFIIGGIVVLGVKAVAKM RGQVH >ECs0814 putative outer membrane protein MKKSVIAGVFIALSFTTCSAIANSLALSLANDDAGKFQPILNDIYGNKHE NRDDYSQGLFLGYSHDISDSSQLSLHIAQDIYSPSGSNKRHNTAVTGDRA FSAYTHTGIEWNSLANDWIRYRLGTDIGVVGPDAGGQKVQNKAHEIIGAE KYHAWDDQIENRYGYTVKGMLSMTPSMDILGANVGLYPEVSAVTGNLFQY VAYGATIAIGNDKTFNSDNGFGLLAPRGLMHMSDTSGFKYKIFAGMERRD VNRNYTLEGKTIQTKQTTVSLNKTVDEYQVGATIGYAPVAFTLAFNKVTS EFKTGDDYSFINGAITFFF >ECs0838 putative minor tail protein METFHWKVRPDMNVVSEPKVVTVKLGDGYEQRRAAGLNNQLSTYSVTIRV RKCEHPSLKAFLERHGGVRAFQWTPPYDWKPIRVVCRKWSASVGALWVTI TADFEQVVA >ECs2354 hypothetical protein MNASSWSLRNLPWFRATLAQWRYALRNTIAMCLALTVAYYLNLDEPYWAM TSAAVVSFPTVGGVISKSLGRIAGSLLGAIAALLLAGHTLNEPWFFLLSM SAWLGFCTWACAHFTNNVAYAFQLAGYTAAIIAFPMVNITEASQLWDIAQ ARVCEVIVGILCGGMMMMILPSSSDATALLTALKNMHARLLEHASLLWQP ETTDAIRAAHEGVIGQILTMNLLRIQAFWSHYRFRQQNARLNALLHQQLR MTSVISSLRRMLLNWPSPPGATREILEQLLTALASSQTDVYTVARIIAPL RPTNVADYRHVAFWQRLRYFCRLYLQSSQELHRLQSDVDDHARLPRTSGL ARHTDNAEAMWSGLRTFCTLMMIGAWSIASQWDAGANALTLAAISCVLYS AVAAPFKSLSLLMRTLVLLSLFSFVVKFGLMVQISDLWQFLLFLFPLLAT MQLLKLQMPKFAALWGQLIVFMGSFIAVTNPPVYNFADFLNDNLAKIVGV ALAWLAFAILRPGSDARKSRRHIRALRRDFVDQLSRHPTLSESEFESLTY HHVSQLSNSQDALARRWLLRWGIVLLNCSHVVWQLRDWESRSDPLSRVRD NCISLLRGVMSERGVQQKSLAATLEELQRICDSLARHHQPAARELAAIVW RLYCSLSQLEQAPPQGTLAS >ECs0010 hypothetical protein MGNTKLANPAPLGLMGFGMTTILLNLHNVGYFALDGIILAMGIFYGGIAQ IFAGLLEYKKGNTFGLTAFTSYGSFWLTLVAILLMPKLGLTDAPNAQFLG VYLGLWGVFTLFMFFGTLKGARVLQFVFFSLTVLFALLAIGNIAGNAAII HFAGWIGLICGASAIYLAMGEVLNEQFGRTVLPIGESH >ECs2723 putative minor tail protein MQDIHGESLIESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT WQGREYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG ATVVRRRVYARFLDAVNFVAGNPEADPEQELSDRWVVEQMSELTAMTASF VLATPTETDGALFPGRIMLANTCMWDYRGDECGYNGPAVADEFDNPTTDI RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ >ECs2587 hypothetical protein MFKFLVLTLGIISCQVYAEDTLIVNDHDISAIKDCWQKNSGDDTDINVIK SCLRQEYNLVDAQLNKAYGEAYRYIEQVPRTGAKKPDTEQLNLLKKSQRA WLDFRDKECELILSNEDVQDLSNPYSESEWLSCMIIQTNTRTRQLQLYRN SEDFYPSPLTRG >ECs3129 hypothetical protein MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPT SFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYAT AAMQRHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLY TEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLV RGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQKERLMTIAERL RQEGHQIGWQEGKLEGLQEGMHEQAIKIALRMLEQGFDRDLVLAATQLSE ADLAANNH >ECs3551 hypothetical protein MSEALSLFSLFASSFLSATLLPGNSEVVLVAMLLSGISHPWVLVLTATMG NSLGGLTNVILGRFCPLRKTSRWQEKATGWLKRYGAVTLLLSWMPVVGDL LCLLAGWMRISWGPVIFFLCLGKALRYVAVAAATVQGMMWWH >ECs1924 hypothetical protein MNLDDKSLFLDAMEDVQPLKRATDVHWHPTRNQRAPQRIDTLQLDNFLTT GFLDIIPLSQPLEFRREGLQHGVLDKLRSGKYPQQASLNLLRQPVEECRK MMFSFIQQAMADGLRNVLIIHGKGRDDKSHANIVRSYVARWLTEFDDVQA YCTALPHHGGSGACYVALRKTAQAKQENWERHAKRSR >ECs3822 hypothetical protein MRIPRIYHPEPLTSHSHIALCEDAANHIGRVLRMGPGQALQLFDGSNQVF DAEITSASKKSVEVKVLEGQIDDRESPLHIHLGQVMSRGEKMEFTIQKSI ELGVSLITPLFSERCGVKLDSERLNKKLQQWQKIAIAACEQCGRNRVPEI RPAMDLEAWCAEQDEGLKLNLHPRASNSINTLPLPVERVRLLIGPEGGLS ADEIAMTARYQFTDILLGPRVLRTETTALTAITALQVRFGDLG >ECs3965 hypothetical protein MHLITQKALKDAAEKYPQHKTELVALGNTIAKGYFKKPESLKAVFPSLDN FKYLDKHYVFNVGGNELRVVAMVFFESQKCYIREVMTHKEYDFFTAVHRT KGKK >ECs5232 hypothetical protein MANPEQLEEQREETRLIIEELLEDGSDPDALYTIEHHLSADDLETLEKAA VEAFKLGYEVTDPEELEVEDGDIVICCDILSECALNADLIDAQVEQLMTL AEKFDVEYDGWGTYFEDPNGEDGDDEDFVDEDDDGVRH >ECs4555 EspD MLNVNNDTLSVTSGVNTASGTSGITQSETGLSLDLQLVKSMNSSAGWTES SPLPTPPAGHSLVTPSAAEDVLSKLFGGISGEVTSRTEEAEPQRTSYPYL SQVNTVDPQQMMMMVTLLSLDTSAQKVSSLKNSNEIYMDGQTKALENKTQ EYKKQLEEQQKAEEKSQKSKIVGQVFGWLGVALTAVAAVFNPALWAVVAI GATAMALQTAVDVMGENAPQGLKTAAQVFGGISMAASILTAGVGGVSSLL SKFGNVANKIGSSVVKVVEKAAEALVKNVFAKISTVAEGVTNGIRSAGTT ALNNEAAQLQMLSQLAAFAVQNLTRQSESLGESAKLELDKAASELQNQAS YLQSVSQLMSDSARVNSRIVSGRI >ECs5323 hypothetical protein MILMTSGLNIEWSTFMASMLVGTIGIQWSRWYLAHPKVFTVAAVIPMFPG ISAYTAMISAVKISQLGYSEPLMITLLTNFLTASSIVGALSIGLSIPGLW LYRKRPRV >ECs1901 hypothetical protein MTEPLKPRIDFDGPLEVEQNPKFRAQQTFDENQAQNFAPATLDEAQEEEG QVEAVMDAALRPKRSLWRKMVMGGLALFGASVVGQGVQWTMNAWQTQDWV ALGGCAAGALIIGAGVGSVVTEWRRLWRLRQRAHERDEARDLLHSHGTGK GRAFCEKLAQQAGIDQSHPALQRWYASIHETQNDREVVSLYAHLVQPVLD AQARREISRSAAESTLMIAVSPLALVDMAFIAWRNLRLINRIATLYGIEL GYYSRLRLFKLVLLNIAFAGASELVREVGMDWMSQDLAARLSTRAAQGIG AGLLTARLGIKAMELCRPLPWIDDDKPRLGDFRRQLIGQVKETLQKGKTP SEK >ECs0085 hypothetical protein MFRGATLVNLDSKGRLSVPTRYREQLLENAAGQMVCTIDIHHPCLLLYPL PEWEIIEQKLSRLSSMNPVERRVQRLLLGHASECQMDGAGRLLIAPVLRQ HAGLTKEVMLVGQFNKFELWDETTWHQQVKEDIDAEQLATGDLSERLQDL SL >ECs4118 hypothetical protein MRRLPGILLLTGAALVVIAALLVSGLRIALPHLDAWRPEILNKIESATGM PVEASQLSASWQNFGPTLEAHDIRAELKDGGEFSVKRVTLALDVWQSLLH MRWQFRDLTFWQLRFRTNTPITSGGSDDSLEASHISDLFLRQFDHFDLRD SEVSFLTPSGQRAELAIPQLTWLNDPRRHRAEGLVSLSSLTGQHGVMQVR MDLRDDEGLLSNGRVWLQADDIDLKPWLGKWMQDNIALETAQFSLEGWMT IDKGDVTGGDVWLKQGGASWLGEKETHTLSVDNLTAHITRENPGWQFSIP DTRITMDGKPWPSGALTLAWIPEQDVGGKDNKRSDELRIRASNLELAGLE GIRPLAAKLSPALGDVWRSTQPSGKINTLALDIPLQAADKTRFQASWSDL AWKQWKLLPGAEHFSGTLSGSVENGLLTASMKQAKMPYETVFRAPLEIAD GQATISWLNNDKGFQLDGRNIDVKAKAVHARGGFRYLQPANDEPWLGILA GISTDDGSQAWRYFPENLMGKDLVDYLSGAIQGGEADNATLVYGGNPQLF PYKHNEGQFEVLVPLRNAKFAFQPDWPALTNLGIELDFINDGLWMKTDGV NLGGVRASNLTAVIPDYSKEKLLIDADIKGPGKAVGPYFDETPLKDSLGA TLQELQLDGDVNARLHLDIPLNGELVTAKGEVTLRNNSLFIKPLDSTLKN LSGKFSFINGDLQSEPLTASWFNQPLNVDFSTKEGAKAYQVAVNLNGNWQ PAKTGVLPAAVNEALSGSVAWDGKVGIVLPYHAGATYNVELNGDLKNVSS HLPSPLAKPAGEPLPVNVKVDGNLNSFELTGQAGADNHFNSRWLLGQKLT LDRAIWAADSKTLPPLPEQSGVELNMPPMNGAEWLALFQKGAAESVGGAA SFPQHITLRTPMLSLGNQQWNNLSIVSQPTANGTQVEAQGREINATLAMR NNAPWLANIKYLYYNPSVAKTRGDSTPSSPFPTTERINFRGWSDAQIRCA ECWFWGQKFGRIDSDITISGNTLTLTNGLIDTGFSRLTADGEWINNPGNE RTSLKGKLRGQKIDAAAEFFGVTTPIRQSSFNVDYDLHWRKAPWQPDEAT LNGIIHTQLGKGEITEINTGHAGQLLRLLSVDALMRKLRFDFRDTFGEGF YFDSIRSTAWIKDGVMHTDDTLVDGLEADIAMKGSVNLVRRDLNMEAVVA PEISATVGVAAAFAVNPIVGAAVFAASKVLGPLWSKVSILRYHISGPLDD PQINEVLRQPRKEKAQ >ECs0926 putative DEOR-type transcriptional regulator MRRANDPQRREKIIQATLEAVKLYGIHAVTHRKIATLAGVPLGSMTYYFS GIDELLLEAFSSFTEIMSRQYQAFFSDVSDAQGACQAITDMIYSSQVATP DNMELMYQLYALASRKPLLKTVMQNWMQRSQQTLEQWFEPGTARALDAFI EGMTLHFVTDRKPLSREEILRMVERVAG >ECs1233 putative tail tip fiber protein MTRKPWRAGKDLSTVVENMEIGTGQRGDGRHAFVTREELVGLKLARRRTS GGASYALNPGIEIDSTLMTVDFPTKPLNFKATGGFGSVLLEWDMPNYRGH SLTEIWRGTEDDLADAVLVATTPGQVYGDPVDPGWSGFYWIRFVNAAGVK GPWNAEKGTQAQTQIGVKAIIDQIRDEAAKSPVVSELRKEIKNAQGQAVK DAAIKTTEVVGTLREETTRTIGGIETRISTLDSSTSESLNEVDKRITKLD KEGGEAFLAMWSKKAGVDGITAGIGIVAGKDSEGRPVSQVAISASQLFVF DPNNPDNTAYPFAVSGGKVVIPKAMIYDAVIETLVSRKVVADEVKAGVSI TSPVIRSAVIQNGNFQVDSQGNLNIGGLFSVTSQGQLTIRYSNQNVGLVI RNDKIEVYDQNGRLAVRIGRLR >ECs0606 hypothetical protein MQFTFNEGHIQLPSQWQDQSMQVLVSTDNSGINLVITREPVSQGTLTPEL YQETLALYQGKLDGYTEHACREITLAEAPAWLLDYSWNGPEDEGNQGRIS QIAVFQRRGDTLLTFTFSTSLSLKNSQKTMLLEVIKSFTPLPPENDIQKD QPR >ECs3210 putative transporting ATPase MNSTHHYEQLIEIFNSCFADDFNTRLIKGDDEPIYLPADAEVPYNRIVFA HGFYASAIHEISHWCIAGKARREQVDFGYWYCPDGRDAQTQSQFEDVEVK PQALDWLFCVAAGYPFNVSCDNLEGDFEPDRVVFQRRVHAQVMDYLTNGI PERPARFIKALQNYYHTPELTAEQFPWPEALN >ECs3781 hypothetical protein MSAQPVDIQIFGRSLRVNCPPDQRDALNQAADDLNQRLQDLKERTRVTNT EQLVFIAALNISYELAQEKAKTRDYAASMEQRIRMLQQTIEQALLEQGRI TEKTNQNFE >ECs3983 hypothetical protein MILSIDSNDANTAPLHKKTISSLSGAVESMMKKLEDVGVLVARILMPILF ITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRT TALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAY SIDRLLNKKW >ECs0220 hypothetical protein MTQPSAPAPQIAIDSHDDKAWRDTLLKVAAILCERQPDSPQGYRLRRHAL WQSITSTPQAESDGRTPLAAVSADMVADYQSRLASADMALWQQVEKSVLL APYWLDGHCLSAQTALRLGYKQVADTIRDEVIRFLERLPQLTGLLFNDRT PFLSEQTKQWLAASPDGKVAPVAQIGEESQAARACFAGQGLEAALRYLDM LPEGDPRDQFHRQYLAAQLTEEAGLIQLAQQQYRMLLMIGSQMMVSDWEP SLLTQLEQKFTAEQ >ECs3937 hypothetical protein MAQEIELKFIVNHSAVEALRDHLNTLGGEHHDPVQLLNIYYETPDNWLRG HDMGLRIRGENGRYEMTMKVAGRVTGGLHQRPEYNVALSEPTLDLAQLPT EVWPNGELPADLASRVQPLFSTDFYREKWLVAVDGSRIEIALDQGEVKAG EFAEPICELELELLSGDTRAVLKLANQLVSQTGLRQGSLSKAARGYHLAQ GNPAREIKPTTILHVAAKADVEQGLEAALELALAQWQYHEELWVRGNDAA KEQVLAAISLVRHTLMLFGGIVPRKASTHLRDLLTQCEATIASAVSAVTA VYSTETAMAKLALTEWLVSKAWQPFLDAKAQGKISDSFKRFADIHLSRHA AELKSVFCQPLGDRYRDQLPRLTRDIDSILLLAGYYDPVVAQAWLENWQG LHHAIATGQRIEIEHFRNEANNQEPFWLHSGKR >ECs1983 putative tail length tape measure protein MATLRELIIKISANSQSFQSEIQRASRMGSEYYRTLQNGGRQAAAAARDQ RRALAELNSQLTEIRGSAVGMAGAFAGAFATGHLISLADEWSSVNARLKQ ASQSSDEFSSSQKVLMDISQRTGTAFSDNAALFARSAASMREYGYSADDV LKVTEAISTGLKISGASTAEAGSVITQFSQALAQGVLRGEEFNSVNESGD RIVRALAAGMGVARKDLKAMADDGQLTADKVVPALISQLEVLRDEYAAMP ETVSDGITKVENAFMAWVGGANEASGVTKTLSGVLNGVAGQIDNVATAVG ALVAVGVARYFGNMASGAMSATAGLVTAARNEVALAEAQFRGTQIATARA RAAVYRAQQAVAAARGTEMQIAAEARLAATQERLNRNIAARSAAQNALNS TTAVGSRLMSGALGLVGGVPGLVMLGAAAWYTLYQNQEQARESARQYALT IDEIAHKTPSMSLPEASDNEGRTRAALTEQNRLIDEQASRVKSLQEKAQS IQDVLAGLEDRRVALIRQQAAEQNKVYQSMLVMNGQHTEFNRLLGLGNEL LQQRQGLVNVPLRLPQATLDDKQQSALTKTERELALSRLKGEEKERARLG YAADDLGFVGDSYQEARQRYISNALEAWRNNEANKPKSRGGKSETEKAED SFSRLLKQQKAQLALAGQNTELAKLKYQTAQGELKTLTEIQKQELLRNAA LIDQQKIREQLRSREETLKNENAAARASNDAELLGYGQGERARERMRELQ QIRDSFRQKDADLQSQYQTGDISEDFYRQALAQNAQYLSERLKDQETFYA ESDAQRADWQKGLQEGFSNWVDNASDYASQAAQLATEGISGMVNNITEML NGNKVEWRSWASSVLQEISKVLMNAAIVNGIKTAANGMSGAGGFLGSIGD WLGGAVANAKGGVYTSANLSAYSNSIVDTPTYFAFAKGAGLMGEAGPEAI MPLTRAADGSLGVRAVGSMNGSAGLVYSPVYHIAIQNDGTNGQIGPEAAG SLVQLIDQRVQAVMLSMRRDGGMLSG >ECs3456 hypothetical protein MSKLIVPQWPLPKGVAACSSTRIGGVSLPPYDSLNLGAHCGDNPDHVEEN RKRLFAAGNLPSKPVWLEQVHGKDVLNLTGEPYASKRADASYSNTPGRVC AVMTADCLPVLFCNRAGTEVAAAHAGWRGLCAGVLEETVSCFADNPENIL AWLGPAIGPRAFEVGAEVREAFMAVDAEASTAFIQHGDKYLADIYQLARQ RLANVGVEQIFGGDRCTYTENETFFSYRRDKTTGRMASFIWLI >ECs2278 hypothetical protein MTLKEFIKSLRVGDAKKFAARLGVSPSYLSQMASGRTAISPTRALMIESA TEGQVSRAELRPHDWELIWPEYASGIRLGQTHVVHAEGDCSACLSDGVDS >ECs3050 hypothetical protein MTNITLQKQHRTLWHFIPGLALSAVITGVALWGGSIPAVAGAGFSALTLA ILLGMVLGNTIYPHIWKSCDGGVLFAKQYLLRLGIILYGFRLTFSQIADV GISGIIIDVLTLSSTFLLACFLGQKVFGLDKHTSWLIGAGSSICGAAAVL ATEPVVKAEASKVTVAVATVVIFGTVAIFLYPAIYPLMSQWFSPETFGIY IGSTVHEVAQVVAAGHAISPDAENAAVISKMLRVMMLAPFLILLAARVKQ LSGANSGEKSKITIPWFAILFIVVAIFNSFHLLPQSVVNMLVTLDTFLLA MAMAALGLTTHVSALKKAGAKPLLMALVLFAWLIVGGGAINYVIQSVIA >ECs5295 hypothetical protein MGSLFNIYKDIFPTLGMYSGLKACHEKNNLPFDINTEIETIQKQINYDIN HLNDGLIKRVLNLFIHLISNPDNLELTLNRYSSTTEQIIGRTKRNGLHEF DDGDLKIIFNRQDDNESVLTVKDKDKDKDKDKDKDKDISHHCNVKTEQLQ QFIKIMEQKAQLPIYIDKNNLKESIFSVLHNDPQQVDKDQHLPCEKFLKH ACKSSNSFEVKLDATHQYQHLNNFMISFDPVENQLTIRDNNNKTETFSFT NLQWENLLQYYKENHQQPNIAGSRNLTDNIDKIKNTISTSEIIECASPEI RSSVLNDLYSIANFLPDNNLTPNESWKRFCETCERFYVAQKSITGDKSER LTRKLSISDAGITMTFKIGDVVINTISTAIPEDATGQRCIEGLNLAEMDL TDIDLSKMALRNVNFNGSILRNAKFSGTICEGVDFTDCDLRNAEFENASL ENNDFRKVRHLTYVNFKNANLRNSNFNGKVLTGVTFTGSDLSNAYLEHID FTTVILYETSKIPGIPGTPQIPGTPKVILTGAILNYSDLSGKDLSEYNLT GILCMYTNFSNANLTNCKISNANFSNAKFYNTNCTGANCSNILFDYAWFD NTIFIKTLFKNTCFYNVRAKNVYLEGAYLNNDNIVNQANNSTEKQSIDST DKQANDSTVQQSIDSTVQQANDSTDKQANDNIDKQVNDSTDKQAKNSTEQ QDSNSFNQARLKKEVNRRFSIPGLTSYQPTYIVEE >ECs4847 hypothetical protein MKDVVDKCSTKGCAIDIGTVIDNDNCTSKFSRFFATREEAESFMTKLKEL AAAASSADEGASVAYKIKDLEGQVELDAAFTFSCQAEMIIFELSLRSLA >ECs2774 hypothetical protein MIKWPWKVQESAHQTALPWQEALSIPLLTCLTEQEQSKLVTLAERFLQQK RLVPLQGFELDSLRSCRIALLFCLPVLELGLEWLDGFHEVLIYPAPFVVD DEWEDDIGLVHNQRIVQSGQSWQQGPIVLNWLDIQDSFDASGFNLIIHEV AHKLDTRNGDRASGVPFIPLREVAGWEHDLHAAMNNIQEEIELVGENAAS IDAYAASDPAECFAVLSEYFFSAPELFAPRFPSLWQRFCQFYQQDPLQRL HHANDTDSFSATNVH >ECs3985 putative cytochrome MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAG GEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIV FNCQAGTPGENRFGPDPKLEP >ECs1805 minor tail protein MQDIHEESLNESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT WQGREYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG ATVVRRRVYARFLDAVNFVAGGRGYRAAEVKRIKKSRRVAGTDKD >ECs4446 hypothetical protein MEIVMKTSKTVAKLLFVVGALVYLVGLWISCPLLSGKGYFLGVLMTATFG NYAYLRAEKLGQLDNFFTHICQLVALITIGLLFIGVLNAPINAYEMVIYP IAFFVCLFGQMRLFRSV >ECs3911 hypothetical protein MLTVIAEIRTRPGQHHRQAVLDQFAKIVPTVLKEEGCHGYAPMVDCAAGV SFQSMAPDSIVMIEQWESIAHLEAHLQTPHMKAYSEAVKGDVLEMNIRIL QPGI >ECs3942 hypothetical protein MSAIAPGMILIAYLCGSISSAILVCRLCGLPDPRTSGSGNPGATNVLRIG GKGAAVAVLIFDVLKGMLPVWGAYELGVSPFWLGLIAIAACLGHIWPVFF GFKGGKGVATAFGAIAPIGWDLTGVMAGTWLLTVLLSGYSSLGAIVSALI APFYVWWFKPQFTFPVSMLSCLILLRHHDNIQRLWRRQETKIWTKFKRKR EKDPE >ECs4873 hypothetical protein MKASLALVSLLTAFTSYSLKSPAIPPTVVQIQANTNLAIADGARQQIGST LFYDPAYVQLTYPGGDVPQERGVCSDVVIRALRSQKVDLQKLVHEDMAKN FAEYPQKWQLKRPDSNIDHRRVPNLETWFTRHDKTRPISKNPSDYQAGDI VSWRLDNGLAHIGVVSDGFARDGTPLVIHNIGAGAQEEDVLFSWRMVGHY RYFVK >ECs1984 putative minor tail protein MAEIKTLHLVPREGMQVSEKPSVVRVRFGDGYEQRRPTGLNPQLKTFQAV FRVTDESTRRWLEEFLSWHGGYRAFLWRPPKHNRTVRVVCREWSVTDNAR YSDFSCTIEQVVN >ECs5287 hypothetical protein MTTQVRKNVMDMFIDGARRGFTIATTNLLPNVVMAFVIIQALKITGLLDW VGHICEPVMALWGLPGEAATVLLAALMSMGGAVGVAASLATAGALTGHDV TVLLPAMYLMGNPVQNVGRCLGTAEVNAKYYPHIITVCVINALLSIWVMQ LIV >ECs4209 hypothetical protein MWRRLIYHPDINYALRQTLVLCLPVAVGLMLGELRFGLLFSLVPACCNIA GLDTPHKRFFKRLIIGASLFATCSLLTQLLLAKDVPLPFLLTGLTLVLGV TAELGPLHAKLLPASLLAAIFTLSLAGYMPVWEPLLIYALGTLWYGLFNW FWFWIWREQPLRESLSLLYRELADYCEAKYSLLTQHTDPEKALPPLLVRQ QKAVDLITQCYQQMHMLSAQNNTDYKRMLRIFQEALDLQEHISVSLHQPE EVQKLVERSHAEEVIRWNAQTVAARLRVLADDILYHRLPTRFTMEKQIGA LEKIARQHPDNPVGQFCYWHFSRIARVLRTQKPLYARDLLADKQRRMPLL PALKSYLSLKSPALRNAGRLSVMLSVASLMGTALHLPKSYWILMTVLLVT QNGYGATRLRIVNRSVGTVVGLIIAGVALHFKIPEGYTLTLMLITTLASY LILRKNYGWATVGFTITAVYTLQLLWLNGEQYILPRLIDTIIGCLIAFGG TVWLWPQWQSGLLRKNAHDALEAYQEAIRLILSEDPQPTPLAWQRMRVNQ AHNTLYNSLNQAMQEPAFNSHYLADMKLWVTHSQFIVEHINAMTTLAREH RALPPELAQEYLQSCEIAIQRCQQRLEYDEPGSSGDANIMDAPEMQPHEG AAGTLEQHLQRVIGHLNTMHTISSMAWRQRPHHGIWLSRKLRDSKA >ECs5269 hypothetical protein MTTWTVRVFTTAEIIYRKTVIALVCHLNCSRQETVTMNKTITALAILMAS FAANASVLPETPVPFKSGTGAIDNDTVYIGLGSAGTAWYKLETQAKDKKW TALAAFPGGPRDQATSAFIDGNLYVFGGIGKNSEGLTQVFNDVHKYNPKT NSWVKLISHAPMGMAGHVTFVHNGKAYVTGGVNQNIFNGYFEDLNEAGKD STAVDKINAHYFDKKAEDYFFNKFLLSFDPSTQQWSYAGESPWYGTAGAA VVNKGDKTWLINGEAKPGLRTDAVFELDFTGNNLKWNRLAPVSSPDGVAG GFAGISNDSLIFAGGAGFKGSRENYQNGKNYAHEGLKKSYSTDIHLWHNG KWDKSGELSQGRAYGVSLPWNNSLLIIGGETAGGKAVTDSVLISVKDNKV TVQN >ECs0674 hypothetical protein MKLQLVAVGTKMPDWVQTGFTEYLRRFPKDMPFELIEIPAGKRGKNADIK RILDKEGEQMLAAAGKNRIVTLDIPGKPWDTPQLAAELERWKLDGRDVSL LIGGPEGLSPACKAAAEQSWSLSALTLPHPLVRVLVAESLYRAWSITTNH PYHRE >ECs5324 putative structural protein MDSHYLNNTQHVYDKGRVMQTEQQRAVTRLCIQCGLFLLQHGAESALVDE LSSRLGRALGMDSVESSISSNAIVLTTIKDGQCLTSTRKNHDRGINMHVV TEVQHIVILAEHHLLDYKGVEKRFSQIQPLRYPRWLVALMVGLSCACFCK LNNGGWDGAVITFFASTTAMYIRQLLAQRHLHPQINFCLTAFAATTISGL LLQHPTFSNTPTIAMAASVLLLVPGFPLINAVADMFKGHINTGLARWAIA SLLTLATCVGVVMALTIWGLRGWV >ECs0675 hypothetical protein MQGKALQDFVIDKIDDLKGQDIIALDVQGKSSITDCMIICTGTSSRHVMS IADHVVQESRAAGLLPLGVEGENSADWIVVDLGDVIVHVMQEESRRLYEL EKLWS >ECs3108 hypothetical protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPAEGEDASFSQSINY PASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDG SFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWD TDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPV HGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGEL TLVKSFDW >ECs3829 hypothetical protein MDGVMSAVTVNDDGLVLRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQ ANSHLVKFLGKQFRVAKSQVVIEKGELGRHKQIKIINPQQIPPEVAALIN >ECs2574 hypothetical protein MAGHSKWANTRHRKAAQDAKRGKIFTKIIRELVTAAKLGGGDPDANPRLR AAIDKALSNNMTRDTLNRAIARGVGGDDDANMETIIYEGYGPGGTAIMIE CLSDNRNRTVAEVRHAFSKCGGNLGTDGSVAYLFSKKGVISFEKGDEDTI MEAALEAGAEDVVTYDDGAIDVYTAWEEMGKVRDALEAAGLKADSAEVSM IPSTKADMDAETAPKLMRLIDMLEDCDDVQEVYHNGEISDEVAATL >ECs0192 hypothetical protein MALKATIYKATVNVADLDRNQFLDASLTLARHPSETQERMMLRLLAWLKY ADERLQFTRGLCADDEPEAWLRNDHLGIDLWIELGLPDERRIKKACTQAA EVALFTYNSRAAQIWWQQNQSKCAQFANLSVWYLDDEQLAKVSAFADRTM TL >ECs1491 hypothetical protein MMIKTRFSRWLTFFTFAAAVALALPAKANTWPLPPAGSRLVGENKFHVVE NDGGSLEAIAKKYNVGFLALLQANPGVDPYVPRAGSVLTIPLQTLLPDAP REGIVINIAELRLYYYPPGKNSVTVYPIGIGQLGGDTLTPTMVTTVSDKR ANPTWTPTANIRARYKAQGIELPAVVPAGPDNPMGHHAIRLAAYGGVYLL HGTNADFGIGMRVSSGCIRLRDDDIKTLFSQVTPGTKVNIINTPIKVSAE PNGARLVEVHQPLSEKIDDDPQLLPITLNSAMQSFKDAAQTDAEVMQHVM DVRSGMPVDVRRHQVSPQTL >ECs2051 hypothetical protein MNQSLTLAFLIAAGIGLVVQNSLMVRITQTSSTILIAMLLNSLVGIVLFV SILWFKQGMAGFGELVSSVRWWTLIPGLLGSFFVFASISGYQNVGAATTI AVLVASQLIGGLVLDIFRSHGVPLRALFGPICGAILLVVGAWLVARRSF >ECs4985 hypothetical protein MMNDEVISRLLAPVMRGVRLLFGRGVLTGTTDTLKIQNVQITGMDGETFD DVERPQQYGQISVPLPGAEVFLACAGGQRDQAVVLVVEDRRSRPTGLTAG DTGVYHHEGHRIRLTKNGRIIVTCKTLEIYADEGVQVDTPEAHFTGNVTV DKNLHVKGNVSIDGTGRSQGTFTMSEAVIAGITYSGHVHHDNGEGSKRGG PENG >ECs4402 hypothetical protein MTQENEIKRPTQDLEHEPIKQLDNSEKGGKVSQALETVTTTAEKVQRQPV IAHLIRATERFNDRLGNQFGAAITYFSFLSMIPILMVSFAAGGFVLASHP MLLQDIFDKILQNISDPTLAATLKNTINTAVQQRTTVGLVGLAVALYSGI NWMGNLREAIRAQSRDVWERSPQDQEKFWVKYLRDFISLIGLLIALIVTL SITSVAGSAQQMIISALHLNSIEWLKPTWRLIGLAISIFANYLLFFWIFW RLPRHRPRKKALIRGTFLAAIGFEVIKIVMTYTLPSLMKSPSGAAFGSVL GLMAFFYFFARLTLFCAAWIATAEYKDDPRMPGKTQP >ECs2913 hypothetical protein MTIKNKMLLGALLLVTSAAWAAPATAGSTNTSGISKYELSSFIADFKHYK PGDTVPEMYRTDEYNIKQWQLRNLPAPDAGTHWTYMGGAYVLISDTDGKI IKAYDGEIFYHR >ECs4971 hypothetical protein MAQGIDLGYAATLPSKEAVAYFRAKGAHISWNWFETDADVHARSFTAAKA ARLDVLTTLQAEVQRAIDEGISQKAFIRTLTPRLQKLGWWGKQIVVDSAG NAEEVQLGSPRRLALIYNVNTRVAYNAGRYTQMMNNTDTHPFWQYVAVMD SRTRPSHSTLNGLVFRYDDPFWKTHYPPNGWNCRCRVRPLSQARLDAMGL SVSSGEDHLSTRNVEAGVDKQTGEVREMPVTTYSDGTRTMTPDVGWSYNP GSAAFGTDQALIRKLIEVKSPALREMVVQEMNNSPERQLAFRIWAKNIMK TRRGGHDIRTLGFMTESIAQAVESRTGTPPARLLAMSGKNVLHADSVKHQ NDGIALTPEDFAQLPAMLAAPDAVLWDHVHQNLLYITETRDGTAKIAVNA PYGVKRQPDKLDVVINAYRVNKFDIEKAIEGGKLELLEGKL >ECs4976 hypothetical protein MNYATETDMRARYREDLLRPLLAVPRSDEPDTRKLNRALTDASALIDSYL SARYTLPLEVIPAVLVQHCCAIAFYYLCDQRASDQARDRYREALAWLKDV MNGNVPVGVDTNGAAPESGDLPQVQSDAAVFGRNQKGFI >ECs5050 hypothetical protein MNGTIYQRIEDNAHFRELVEKRQRFATILSIIMLAVYIGFILLIAFAPGW LGTPLNPNTSVTRGIPVGVGVIVISFVLTGIYIWRANGEFDRLNNEVLHE VQAS >ECs1003 hypothetical protein MLFTLKKVIGNMLLPLPLMLLIIGAGLALLWFSRFQKTGKIFISIGWLAL LLLSLQPVADRLLRPIESTYPTWNNSQKVDYIVVLGGGYTWNPQWAPSSN LINNSLPRLNEGIRLWRENPGSKLIFTGGVAKTNTVSTAEVGARVAQSLG VPREQIITLDLPKDTEEEAAAVKQAIGDAPFLLVTSASHLPRAMIFFQQE GLNPLPAPANQLAIDSPLNPWERAIPSPVWLMHSDRVGYETLGRIWQWLK GSSGEPRQE >ECs4764 hypothetical protein MPFKPLVTAGIESLLNTFLYRSPALKTARSRLLGKVLRVEVKGFSTSLIL VFSERQVDVLGEWAGDADCTVIAYASVLPKLRDRQQLAALIRSGELEVQG DIQVVQNFVALAGLAEFDPAELLAPYTGDIAAEGISKAMRGGAKFLHHGI KRQQRYVAEAITEEWRMAPGPLEVAWFAEETAAVERAVDALIKRLEKLEA K >ECs1115 putative minor tail protein MKTFRWKVKPDMEVNSQPSVREVRFGDGYSQRMAAGLNADLKTYRVTLSV TREEARHLEAFLAEHGGWKAFLWTPPYAWRQIKVTCAAWSSRVRMLRVEF SAEFKQVVN >ECs0586 UDP-2,3-diacylglucosamine hydrolase MATLFIADLHLCVEEPAITAGFLRFLAGEARKADALYILGDLFEAWIGDD DPNPLHRQMAAAIKAVSDSGVPCYFIHGNRDFLLGKRFARESGMTLLPEE KVLELYGRRVLIMHGDTLCTDDAGYQAFRTKVHKPWLQTLFLALPLFVRK RIAVRMRANSKEANSSKSLAIMDVNQNAVVSAMEKHQVQWLIHGHTHRPA VHELIANQQPAFRVVLGAWHTEGSMVKVTADDVELIHFPF >ECs3611 hypothetical protein MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDD TERLNAFNRHYSLVVCASRNPRWARDYHTVQMPKEVRKARYFSRREELSA PDLLSAIISRRDYYTDAWWMVAVATTPDAPYSLEQLQDGLRHPVFPLYLG RKSHPLALPLAPLLLEGNASDVLRNAYQQYQDSFRELKVSLPKLQDECWW EGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >ECs0472 hypothetical protein MMKTITKQPILFTDVPVADLRNSMKQDLNQNLIERLWNKIRDFFLDSDKQ KAFKSIHKYINTLSVLNYNSALTPDPNFNIDATSDLDSYLKLDFDRLSPK QKQTTLCCFWNKIASSLPEPYNSTIKHNIIFYKDGENLMIRGTISIVNEV VKTYSLPIEKDDNGYYDFSGLYLAHSNISGKDPNKDPDIDFGIDMGNCNC SNVNFEHTYFYGVKFTNANCTNANFNNCRFKKCDLTNMNCTGAILDNAMI YGKEKEPEMQYPEADQIIQRITYQKSDGNETKGMILTNCSCVKTTFNWAD LSESDCQNVDFSEANLSNTILPDIVRMKGTKLYRTDLFNPILKTEAESTE EKDISPLAKIILDYIESDKNPESLNFEEKSTVIKIKQDIDNFIFYNQHLK KIFNRAMNLQEKISRKKYNEFFKYIQAEAKQYFKDQYKLTKNDYLKKVPL TAQLIAKYKMDDQLDQLLVTREIQDEIKSKIQDKIDELSKNLFNTMTETI ENNFDDIFRQQSENMSNYYEFVD >ECs2710 hypothetical protein MRLTAKQVTWLKVCLHLAGLLPFLWLVWAINHGGLGADPVKDIQHFTGRT ALKFLLATLLITPLARYAKQPLLIRTRRLLGLWCFAWATLHLTSYALLEL GVNNLALLGKELITRPYLTLGIISWIILLALAFTSTQAMQRKLGKHWQQL HNFVYLVAILAPIHYLWSVKIISPQPLIYAGLAVLLLALRYKKSRSLFNR LRKQVHNKLSV >ECs4788 hypothetical protein MKPSSPSRSKGHAKARRKTREELNQEARDRKRQKKRRGHAPGSRAAGGNN TSGSKGQNAPKDPRIGSKTPIPLGVTEKVTKQHKPKSEKPMLSPQAELEL LETDERLDALLERLEAGETLSAEEQSWVDAKLDRIDELMQKLGLSYDDDE EEEEDEKQEDMMRLLRGN >ECs2125 hypothetical protein MHVTLVEINVHEDKVDEFIEVFRQNHLGSVQEEGNLRFDVLQDPEVNSRF YIYEAYKDEDAVAFHKTTPHYKTCVAKLESLMTGPRKKRLFNGLMP >ECs0858 putative structural protein MRNRTLADLDRVVALGGGHGLGRVLSSLSSLGSRLTGIVTTTDNGGSTGR IRRSEGGIAWGDMRNCLNQLITEPSVASAMFEYRFGGNGELSGHNLGNLM LKALDHLSVRPLEAINLIRNLLKVDAHLIPMSEHPVDLMAIDDQGHEVYG EVNIDQLTTPIQELLLTPNVPATREAVHAINEADLIIIGPGSFYTSLMPI LLLKEIAQALRRTPAPMVYIGNLGRELSLPAANLKLESKLAIMEQYVGKK VIDAVIVGPKVDVSAVKERIVIQEVLEASDIPYRHDRQLLHSALEKALQA LG >ECs0866 hypothetical protein MSKSHPRWRLAKKILTWLFFIAVIVLLVVYAKKVDWEEVWKVIRDYNRVA LLSAVGLVVVSYLIYGCYDLLARFYCGHKLAKRQVMLVSFICYAFNLTLS TWVGGIGMRYRLYSRLGLPGSTITRIFSLSITTNWLGYILLAGIIFTAGV VELPDHWYVDQTTLRILGIGLLMIIAVYLWFCAFAKHRHMTIKGQKLVLP SWKFALAQMLISSVNWMVMGAIIWLLLGQSVNYFFVLGVLLVSSIAGVIV HVPAGIGVLEAVFIALLAGEHTSKGTIIAALLAYRVLYYFIPLLLALICY LLLESQAKKLRAKNEAAM >ECs5312 hypothetical protein MFGNLGQAKKYLGQAAKMLIGIPDYDNYVEHMKTNHPDKPYMSYEEFFRE RQNARYGGDGKGGMRCC >ECs0524 hypothetical protein MFGKGGLGNLMKQAQQMQEKMQKMQEEIAQLEVTGESGAGLVKVTINGAH NCRRVEIDPSLLEDDKEMLEDLVAAAFNDAARRIEETQKEKMASVSSGMQ LPPGFKMPF >ECs0197 hypothetical protein MSSFQFEQIGVIRSPYKEKFAVPRQPGLVKSANGELHLIAPYNQADAVRG LEAFSHLWILFVFHQTMEGGWRPTVRPPRLGGNARMGVFATRSTFRPNPI GMSLVELKEVVCHKDCVILKLGSLDLVDGTPVVDIKPYLPFAESLPDASA SYAQSAPAAEMAVSFTAEVEKQLLTLEKRYPQLTLFIREVLAQDPRPAYR KGEETGKTYAVWLHDFNVRWRVTDAGFEVFALEPR >ECs1507 hypothetical protein MKKDNYSFKRACAVVGGQSAMARLLGVSPPSVNQWIKGVRQLPAERCPAI ERATKGGVLCEELRPDVDWTYLRRSSCYSQNMSMKQPNDENDHTRSIKRQ MIHENQT >ECs2530 hypothetical protein MTITDLVLILFIAALLAFAIYDQFIMPRRNGPTLLAIPLLRRGRIDSVIF VGLIVILIYNNVTNHGALITTWLLSALALMGFYIFWIRVPKIIFKQKGFF FANVWIEYSRIKAMNLSEDGVLVMQLEQRRLLIRVRNIDDLERIYKLLVS TQ >ECs3986 putative cytochrome MQWYLSVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLELPY LSMLYLLATFLPVLALAIRRLHDTDRSGAWALLFFVPFIGWLVLLVFFCT EGTSGSNRYGNDPKFGSN >ECs1044 hypothetical protein MAFMLSPLLKRYTWNSAWLYYARIFIALCGTTAFPWWLGDVKLTIPLTLG MVAAALTDLDDRLAGRLRNLIITLFCFFIASASVELLFPWPWLFAIGLTL STSGFILLGGLGQRYATIAFGALLIAIYTMLGTSLYEHWYQQPMYLLAGA VWYNVLTLIGHLLFPVRPLQDNLARCYEQLARYLELKSRMFDPDIEDESQ APLYDLALANGLLMATLNQTKLSLLTRLRGDRGQRGTRRTLHYYFVAQDI HERASSSHIQYQTLREHFRHSDVLFRFQRLMSMQGQACQQLSRCILLRQP YQHDPHFERAFTHIDAALERMRDNGAPADLLKTLGFLLNNLRAIDAQLAT IESEQAQALPHNNDENELADDSPHGLSDIWLRLSRHFTPESALFRHAVRM SLVLCFGYAIIQITGMHHGYWILLTSLFVCQPNYNATRHRLKLRIIGTLV GIAIGIPVLWFVPSLEGQLVLLVITGVLFFAFRNVQYAHATMFITLLVLL CFNLLGEGFEVALPRVIDTLIGCAIAWAAVSYIWPDWQFRNLPRMLERAT EANCRYLDAILEQYHQGRDNRLAYRIARRDAHNRDAELASVVSNMSSEPN VTPQIREAAFRLLCLNHTFTSYISALGAHREQLTNPEILAFLDDAVCYVD DALHHQPADEERVNEALASLKQRMQQLEPRADSKEPLVVQQVGLLIALLP EIGRLQRQITQVPQETPVSA >ECs4633 hypothetical protein MGLFDEVVGAFLKGDAGKYQAILSWVEEQGGIQVLLEKLQSGGLGAILST WLSNQQGNQPVSGEQLESALGTNAVSDLGQKLGVDTSTASCLLAEQLPKI IDALSPQGEVSPQANNDLLSAGMELLKGKLFR >ECs2410 hypothetical protein MDNAVDRHVFYISDGTAITAEVLGHAVMSQFPVTISSITLPFVENESRAR AVKDQIDAIYHQTGVRPLVFYSIVLPEIRAIILQSEGFCQDIVQALVAPL QQEMKLDPTPIAHRTHGLNPNNLNKYDARIAAIDYTLAHDDGISLRNLDQ AQVILLGVSRCGKTPTSLYLAMQFGIRAANYPFIADDMDNLVLPASLKPL QHKLFGLTIDPERLAAIREERRENSRYASLRQCRMEVAEVEALYRKNQIP WINSTNYSVEEIATKILDIMGLSRRMY >ECs3044 hypothetical protein MERNVTLDFVRGVAILGILLLNISAFGLPKAAYLNPAWYGAITPQDAWTW AFLDLIGQVKFLTLFALLFGAGLQMLLPRGRRWIQSRLTLLVLLGFIHGL LFWDGDILLAYGLVGLICWRLVRDAPSVKSLFNTGVMLYLVGLGVLLLLG LISDSQTSRAWTPDASAILYEKYWKLHGGVEAISNRADGVGNSLLALGAQ YGWQLAGMMLIGAALMRSGWLKGQFSLRHYRRTGFVLVAIGVIINLPAIA LQWQLDWAYRWCAFLLQMPRELSAPFQAIGYASLFYGFWPQLSRFKLVLA IACVGRMALTNYLLQTLICTTLFYHLGLFMQFDRLELLAFVIPVWLANIL FSVIWLRYFRQGPVEWLWRQLTLRAAGPAISKTSR >ECs0106 hypothetical protein MQTQVLFEHPLNEKMRTWLRIEFLIQQLTVNLPIVDHAGALHFFRNVSEL LDVFERGEVRTELLKELDRQQRKLQTWIGVPGVDQSRIEALIQQLKAAGS VLISAPRIGQFLREDRLIALVRQRLSIPGGCCSFDLPTLHIWLHLPQAQR DCQVETWIASLNPLTQALTMVLDLIRQSAPFRKQTSLNGFYQDNGGDADL LRLNLSLDSQLYPQISGHKSRFAIRFMPLDTENGQVPERLDFELACC >ECs2240 putative tail length tape measure protein MATLRELIIKISANSQSFQSEIQRASRMGSEYYRTLQNGGRQAAAAAREQ RRALAELHSQLTEIRASAVGMTGAFAGAFATGHLISLADEWSSVNARLKQ ASQSSDEFASSQKVLMDISQRTGTAFSDNAALFARSAASMREYGYSADDV LKVTEAISTGLKISGASTAEAGSVITQFSQALAQGVLRGEEFNSVNESGD RIVRALAAGMGVARKDLKAMADDGKLTADKVVPALISQLGILRDEYAAMP ETVSSSITKVENAFMAWVGGANEASGVTKTLSGMLNGVAGQIDNVATAVG ALVAVGVARYFGNMASGAMSATAGLVTAARNEVALAEAQFRGTQIATARA RAAVYRAQQAVAAARGTEMQIAAEARLAATQERLNRNIAARSAAQNALNS TTAVGSRLMSGALGLVGGVPGLVMLGAAAWYTLYQNQEQARESARQYALT IDEIAHKTPSMSLPEASDNEGRTRAALTEQNRLIDEQASRVKSLQEKIAG YQYVLANPGWTTGDGFMINHLTSVKTVTEGLAQATEQLAVEQSRLAQMQE KAQSIQDVLAGLEDRRVALIRQQAAEQNKVYQSMLVMNGQYTEFNRLLGL GNELLQQRQGLVNVPLRLPQATLDDKQQSALTKTERELALSRLKGEEKER VRLGYAADDLGFVGDPYQEARQRYISNALEAWRNNEVNKPKSRGGKSETE KAEDSFSRLLKQQKEQLALAGQNTELAKLKYQTALGELKTLSEIQKQELL RNAALIDQQKIREQLRYREETLKNDNVAARASNESELLGYGQGERARERM RELQQIRDSFRQKDADLQSQYQTGDISEDFYRQALAQNAQYLSERLKDQA VFYAESDVQRADWQKGLQEGFSNWVDNASDYASQAAQLATEGISGMVNNI TEMLNGNKVEWRSWASSVLQEISKVLMNAAIVNGIKTAANGMSGAGGFLG SIGDWLGGAVANAKGGVYTSANLSAYSNSIVDTPTYFAFAKGAGLMGEAG PEAIMPLTRAADGSLGVRAVGSMNGSAGLVYSPVYHIAIQNDGANGQIGP EAAGSLVQLIDQRVQAVMLSMRRDGGMLSG >ECs0732 hypothetical protein MNQQRFDDSTLIRIFALHELHRLKEHGLTRGALLDYHSRYKLVFLAHSQP EYRKLGPFVADIHQWQNLDDFYNQYRQRVIVLLSHPANPRDHTNVLMHVQ GYFRPHIESTERQQLAALIDSYRRGEQPLLAPLMRIKHYMALYPYAWLSG QRYFELWPRVINLRHSGVL >ECs0012 putative oxidoreductase MNVNYLNDSDLDFLQHCSEEQLANFARLLTHNEKGKTRLSSVLMRNELFK SMEGHPEQHRRNWQLIAGELQHFGGDSIANKLRGHGKLYRAILLDVSKRL KLKADKEMSTFEIEQQLLEQFLRNTWKKMDEEHKQEFLHAVDARVNELEE LLPLLMKDKLLAKGVSHLLSSQLTRILRTHAAMSVLGHGLLRGAGLGGPV GAALNGVKAVSGSSYRVTIPAVLQIACLRRMVSATQV >ECs0229 hypothetical protein MEFEERYFREELDYLRQLSKLLATEKPHLARFLAEKDADPDIERLLEGVA FLTGNLRQKIEDEFPELTHGLIKMLWPNYLRPVPAMTLIEYTPDMDKSSV PVLIPRNEQFTTNAGEIRVDEVLPSDAKKEEPPPCTFTLCRDIWLLPVRL EQIENRSTTRNGVINITFSVAPGTDFRTLDLNKLRFWLGNDDNYTRDQLY LWFCEYLQGADLTVGEQHIRLPEFMLKAVGFEPQDAMLPWPKNVHSGYRI LQEYFCYPDAFLFFDLCGCPALPDGLQAEFFTLQLRFSRPLPVDIRLRRD SLRLYCAPAINLFIHHAEAITLDNRRADYPLVPSRHYPQHYDVFSVNSVV SQVQDMFRKKDLGRPVSTQAARQWPAFESFSHQMEYSRKREVVYWHHRTK TSLFHRGFDHTLAFIHADGSYPSDESLLSNEVVSVSLTCTNRELPSQIRS GDITGTTGKNAAVASFRNITRPTQPLWPVIDGSLHWSLLSAMNLNYLSLL DTDALKQVIANFDRHAIHHPQTARLSQQKLDAIERLETRPVDRLFTGIPV RGLASTLYLHPEPFVCEGEMYLLGTVLSHFLSLYASVNSFHMLTVVNTES QETWKWTERIGQHPLI >ECs2888 hypothetical protein MAGWFELSKSSDSQFRFVLKAGNGETILTSELYTSKASAEKGIASVRSNS PQEERYEKKTASNGKFYFNLKAANHQIIGSSQMYATAQSRETGIASVKAN GTSQTVKDNT >ECs4828 hypothetical protein MIRKAFVMQVNPDAHEEYQRRHNPIWPELEAVLKSHGAHNYAIYLDKARN LLFATVEIESEERWNAVASTEICQRWWKYMTDVMPANPDNSPVSSELQEV FYLP >ECs2545 putative rRNA methylase MLVAQHTVYFPDAFLTQMREAMPSTLSFDDFLAACQRPLRRSIRVNTLKT SVADFLRLTAPYGWTLTPIPWCEEGFWIERDDEDALPLGSTAEHLSGLFY IQEASSMLPVAALFADGNAPQRVMDVAAAPGSKTTQIAARMNNEGAILAN EFSASRVKVLHANISRCGISNVALTHFDGRVFGAALPEMFDAILLDAPCS GEGVVRKDPDALKNWSPESNQEIAATQRELIDSAFHALRSGGTLVYSTCT LNREENESVCLWLKETYPDAVEFLPLGDLFPGANKALTEEGFLHVFPQIY DCEGFFVARLRKTQAIPALPTPKYKVGNFPFSPVKDREAAQIRQAAASVG LNWDGNLRLWQRDKEVWLFPVGIEALIGKVRFSRLGIKLAETHNKGYRWQ HEAVIALASPDNENAFELTPQEAEEWYRGRDVYPQAAPVADDVLVTFQHQ PIGLAKRIGSRLKNSYPRELVRDGKLFTSNA >ECs1974 putative portal protein MWNLLRRTRKNQKSGRDVREVGWRSLFQAVAEPFAGAWQQGVKADPETVL SFHAVFSCISLISQDIAKMRLRLMQTDVQGIRREKRQGDTARLCRRPNAQ QNRIQFFELWLNSKLRHGNTVVLKIRTPRGQIKELRILDWNRVEPLVADD GEVFYRITPDRNCGITESVTVPAREVIHDRFNCFFHPLVGLPPVYAAGLA AMQGHHIQANSTYFFRNGGRPSGVIEVPGSITEENAKKLKGNWDSGYTGE NAGKTAILSNGAKYSPTTFSPVDAQTVEQLKMTAEIVCSVFRVPAYKIGV GHPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETGENESTEFDVTT LLRMDSERRMKTLGESVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYS LEALSRRDAREDPFASAGKTVSAQLPDGASDGNKAISETEHDAVKAMFRG ILRK >ECs2911 hypothetical protein MSHTIRDKQKLKARASKIQGQVVALKKMLDEPHECAAVLQQIAAIRGAVN GLMREVIKGHLTEHIVHQGDELKREEDLDVVLKVLDSYIK >ECs4781 hypothetical protein MKCKRLNEVIELLQPAWQKEPDLNLLQFLQKLAKESGFDGELADLTDDIL IYHLKMRDSAKDAVIPGLQKDYEEDFKTALLRARGVIKE >ECs4744 hypothetical protein MKKTLATLFLLTCLGSSSAYADNALILQTDFSLKDGAVSAMKGVAFGVDH NLKIFDLTHEIPPYNIWEGAYRLYQTASYWPQGSVFVSVVDPGVGTDRKS IVLKTKNGQYFVSPDNGTLTLVAESLGIESVREIDEKANRLKGSEKSYTF HGRDVYAYTGARLASGAITFEQVGPELPAKVVELSYQKAKATKGEVKGNI PILDIQYGNVWSNISDELLNQAGIKLNDTLCVTISEGSRQKYAGKMPYVA SFGDVPEGQPMVYLNSLLNVSVALNMDNFAQKHQVASGADWNIDVKKCDK >ECs2028 hypothetical protein MANSITADEIREQFSQAMSAMYQQEVPQYGTLLELVADVNLAVLENNPQL HEKMVNADELARLNVERHGAIRVGTAQELATLRRMFAIMGMYPVSYYDLS QAGVPVHSTAFRPIDDASLARNPFRVFTSLLRLELIENEILRQKAAEILR QRDIFTPRCRQLLEEYDQRGGFNETQAQEFVQEALETFRWHQSATVDEET YRALHNEHRLIADVVCFPGCHINHLTPRTLDIDRVQSMMPECGIEPKILI EGPPRREVPILLRQTSFKALEETVLFAGQKQGTHTARFGEIEQRGVALTP KGRQLYDDLLRNAGTGQDNLTHQMHLQETFRTFPDSEFLMRQQGLAWFRY RLTPSGEAHRQAIHPGDDPQPLIERGWVAAQPITYEDFLPVSAAGIFQSN LGNETQARSHGNASREAFEQALGCPVLDEFQLYQEAEERSKRRCGLL >ECs0896 hypothetical protein MNMKLKTLFAAAFAVVGFCSTASAVTYPLPTDGSRLVGQNQVITIPEGNT QPLEYFAAEYQMGLSNMMEANPGVDTFLPKGGTVLNIPQQLILPDTVHEG IVINSAEMRLYYYPKGTNTVIVLPIGIGQLGKDTPINWTTKVERKKAGPT WTPTAKMHAEYRAAGEPLPAVVPAGPDNPMGLYALYIGRLYAIHGTNANF GIGLRVSHGCVRLRNEDIKFLFEKVPVGTRVQFIDEPVKATTEPDGSRYI EVHNPLSTTEAQFEGQEIVPITLTKSVQTVTGQPDVDQVVLDEAIKNRSG MPVRLN >ECs4330 hypothetical membrane protein MSGIRSLPMIKLLTGLLLLAWPFVIWFGLAHNGLHWLLPLMALLLLLRLR QTRRQAGPLQAVTQLVAVVGIALCVASFMLKTHQLLLFYPVVVNAVMLAV FGGSLWSAMPIVERLARLQEPDLPEKGVRYTRHVTQIWCGFFIINGGIAL FTALYADMSLWTAWNGMIAYLLMGTLMAGEWLLRRQVMKRDRA >ECs2238 minor tail protein MQNIHEESLNESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT WQGRQYQVYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG ATVVRRRVYARFLDAVNFVAGNPEADPEQELRDRWVVEQMSELTAMTASF VLATPTETDGALFPGRIMLANTCMWTYRSDECGYTGGAVADEFDNPTTDI RKDRCSKCMRGCEMRSMVANFGGFLSINKLSQ >ECs1121 putative host specificity protein MGKGGGRAHTPVEAKDNLKSTQMMSVIDAIGEGPIEGPVKGLQSILVNKT PLTDTDGNPVIHGVTAVWRAGEQEQTPPEGFESSGAETALGVEVTKAKPV TRTITSANIDRLRVTFGVQSLLETTSKGDRNPSSVRLLIQLQRNGNWVTE KDVTINGKTTSQFLASVILDNLPPRPFNIRMVRETADSTSDQLQNKTLWS SYTEIIDVKQCYPNTAIVGLQVDAEQFGGQQMTVNYHIRGRIIQVPSNYD PEKRTYSGIWDGSLKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWA LYAIAQYCDQMVPDGFGGTEPRMTFNAYLSQQRKAWDVLSDFCSAMRCMP VWNGQTLTFVQDSPSDVVWPYTNSDVVVDDNGVGFRYSFSALKDRHTAVE VNYTDPQNGWQTSTELVEDPEAILRYGRNLLKMDAFGCTSRGQAHRAGLW VIKTGLLETQTVDFTLGSQGLRHTPGDIIEICDNDYAGTMTGGRVLSIDA ASRTLTLDREVTLPETGAATVNLINGSGKPVSVAITAHPAPDRIQVSTLP DGVETYGVWGLSLPSLRRRLFRCVSIRENTDGTFAITAVQHVPEKEAIVD NGARFEPQSGTLNSVIPPAVQHLTVEVSAADGQYLAQAKWDTPRVVKGVR FSLRLTSGSGEGSRLVTTAITADTEHRSSGLPPGEYTLTVRAINSYGQQG EPATTTFRINAPAVPATIELTPGYFQITAVPRLAVYDPTVQFEFWFSETK IADTSQVETSARYLGTGSQWSVSGPHIKPGKDFWFYVRSVNLVGKSAFVE ASGRASNDAEGYLGLFREKIGKLHLAQGLWELIDNSQLADEMAEMKTSIT ETRNEITQTVSKTLEDQSAIIQQIQRVQKDTNDDLAALYMLKVQKTKNGI PYVAGIGAGIEDTDGQPLSNILLLADRIAMINPEDGNTTPLFVAQGNQLF MNDVFLKRLFAVSITSSANPPTFSLTPEGRLTARNADISGNVNANSGTLN NVTINENCRVLGKLSANQIEGDLVKTVGKAFPRDSRAPERWPSGTITVRV YDDQPFDRQIVIPAVAFSGAKHEREHTDIYSSCRLIVRKNGAEIYNRTAL DNTLIYSGVIDMPAGHGHMTLEFSVSAWLVNNWYPTASISDLLVVVMKKA TAGISIS >ECs2486 hypothetical protein MNLDDIINSMTPEVYQRLSTAVELGKWPDGVALTEEQKENCLQLVMLWQA RNNTEAQHMTIDTNGQMVMKSKQQLKEDFGISAKPIAMFK >ECs1803 putative tail length tape measure protein MATLRELIIKISANSQSFQSEIQRASRMGSEYYRTLQNGGRQAAAAAREQ RRALAELNSQLTEIRGSAVGMAGAFAGAFASGHLISLADEWSSVNARLKQ ASQSSDEFASSQKVLMDISQRTGTAFSDNAALFARSAASMREYGYSAGDV LKVTEAISTGLKISGASTAEAGSVITQFSQALAQGVLRGEEFNSVNESGD RIVRALAAGMGVARKDLKAMADDGKLTADKVVPALISQLGILRDEYAAMP ETVSSSITKVENAFMAWVGGANEASGVTKTLSGMLNGVAGQIDNVATAVG ALVAVGVARYFGNMASGAMSATAGLVTAARNEVALAEAQFRGTQIATARA RAAVYRAQQAVAAARGTEMQIAAEARLAATQERLNRNIAARSAAQNALNS TTAVGSRLMSGALGLVGGVPGLVMLGAAAWYTLYQNQEQARESARQYALT IDEIAHKTPSMSLPEASDNEGRTRAALTEQNRLIDEQASRVKSLQEKAQS IQDVLAGLEDRRVALIRQQAAEQNKVYQSMLVMNGQHTEFNRLLGLGNEL LQQRQGLVNVPLRLPQATLDDKQQSALTKTERELALSRLKGEEKERVRLG YAADDLGFVGDPYQEARQRYISNALEAWRNNEANKPKSRGGKSETEKAED SFSRLLKQQKEQLALVGQNTELAKLKYQTALGELKTLTEMQKQELLRNAT LIDQQKIREQLRSREETLKNENAAARASNDAELLGYGQGERARERMRELQ QIRDSFRQKDADLQSQYQTGDISEDFYRQALAQNAQYLSERLKDQAVFYA ESDVQRADWQKGLQEGFSNWVDNASDYASQAAQLATEGISGMVNNITEML NGNKVEWRSWASSVLQEISKVLMNAAIVNGIKTAANGMSGAGGFLGSIGD WLGGAVANAKGGVYTSANLSAYSNSIVDTPTYFAFAKGAGLMGEAGPEAI MPLTRAADGSLGVRAVGSMNGSAGLVYSPVYHIAIQNDGANGQIGPEAAG SLVQLIDQRVQAVMLSMRRDGGMLSG >ECs1573 hypothetical protein MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVD SRLVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPF RELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQM KQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSV GFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKL REMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG DVLVRLGGLRVLRIGDDVYTNGEKIDSPHRPALDALASNIALTAENFGDA LEDPSFLAMLAALVNSGYWFFEG >ECs1045 hypothetical protein MRTVLNILNFVLGGFATTLGWLLATLVSIALIFTLPLTRSCWEITKLSLV PYGNEAIHVDELNPAGKNVLLNTGGTVLNIFWLIFFGWWLCLMHIATGIA QCISIIGIPVGIANFKIAAIALWPVGRRVVSVETAQAAREANARRRFE >ECs5048 hypothetical protein MRYNGLNNMFFPLCLINDNHSVTSLSHTKKTKSDNYSKHHKNTLIDNKAL SLFKMDDHEKVIDLIQKMKRIYDSLPSGKITKETDRKIHKYFIDIASYAN NKCDDRITRRVYLNKDKEVSIKVVYFINNVTVHNNTIEIPQTVNGGYDFS HLSLKGIVIKDEDLSNSNFAGCRLQNAIFQDCNMYKTNFNFAIMEKILFD NCILDDSYFAQIKMTDGTLNSCSAMHVQFYNATMNRANIKNTFLDYSNFY MAYMAEVNLYKVIAPYINLFRADLSFSKLDLINFKHADLSRVNLNKAILQ NINLIDSKLFFTRLTNTFLEMVICTDSNMANVNFNNANLNNCHFNCSVLT KAWMFNTRLYRVNFDEASVQGMGISILRGEENIPINSDTLVTLQKFFEED CTSHTGMSQTENNTHEVAMKITADIMQHAD >ECs4315 hypothetical protein MLINIGRLLMLCVWGFLILNLVHPFPRPLNIFVNVALIFTVLMHGMQLAL LKSTLPKDGPQMTTAEKVRIFLFGVFELLVWQKKFKVKK >ECs4261 hypothetical protein MNMKPESKEAPINIRAKASQRDLIDMAANLVAKSRTDFMLDAACREAQDI LLDQRLFILDDEQYDAFLAALDAPITAERQAKINALMNRKSPWE >ECs0830 hypothetical protein MYLKSAPERGCAETVMAKNFVEEGKTVAIVASAAISSGDLVQVGDVFAVA LTDIPQGETGDGMTEGVFMLPKLKTDDMKTGKKVYLKSGKVQLTNSGSDP LVGVVWADAGTSAEEVPVKLNV >ECs3828 putative resistance protein MNTLTFLLSTVIELYTMVLLLRIWMQWAHCDFYNPFSQFVVKVTQPIIGP LRRVIPAMGPIDSASLLVAYILSFIKAIVLFKVVTFLPIIWIAGLLILLK TIGLLIFWVLLVMAIMSWVSQGRSPIEYVLIQLADPLLRPIRRLLPAMGG IDFSPMILVLLLYVINMGVAEVLQATGNMLLPGLWMAL >ECs2127 hypothetical protein MIVITFNRATFPRLKITMIVRPQQHWLRRIFVWHGSVLSKISSRLLLNFL FSIAVIFMLPWYTHLGIKFTLAPFSILGVAIAIFLGFRNNAGYARYVEAR KLWGQLMIASRSLLREVKTTLPDSASVREFARLQIAFAHCLRMTLRKQPQ VEVLAHYLKTEDLQRVLASNSPANRILLIMGEWLAVQRRNGQLSDILFIS LNDRLNDISAVLAGCERIAYTPIPFAYTLILHRTVYLFCIMLPFALVVDL HYMTPFISVLISYTFISLDCLAEELEDPFGTENNDLPLDAICNAIEIDLL QMNDEAEIPAKVLPDRHYQLT >ECs4519 putative alpha helix protein MIRSMTAYARREIKGEWGSATWEMRSVNQRYLETYFRLPEQFRSLEPVVR ERIRSRLTRGKVECTLRYEPDVSAQGELILNEKLAKQLVTAANWVKMQSD EGEINPVDILRWPGVMAAQEQDLDAIAAEILAALDGTLDDFIVARETEGQ ALKALIEQRLEGVTAEVVKVRAHMPEILQWQRERLVAKLEDAQVQLENNR LEQELVLLAQRIDVAEELDRLEAHVKETYNILKKKEAVGRRLDFMMQEFN RESNTLASKSINAEVTNSAIELKVLIEQMREQIQNIE >ECs4129 hypothetical protein MDTRFVQAHKEARWALGLTLLYLAVWLVAAYLPGVAPGFTGFPRWFEMAC ILTPLLFIGLCWAMVKFIYRDIPLEDDDAA >ECs5305 hypothetical protein MALGKESDKSLATAFQDLRELKVDVAYPFLLALYHDYKNDDLSHEDFLSI IRLIESYVFRRAVCAIPTNSLNKTFATFYKVINKEKYLESIQVHFLNLPS YRRFPNDDEFKRELKVRDLYNFRSRSYWLRRLENDKRRERVEEFTIEHIM PQNENLSAKWREELGSDWQRIHKELLHTLGNLTLTRYNSRYSDRPFAEKR DIEDGFKHSPLYLNIGLGQCEKWDEAAIHARADRLAELAVQVWQAPSLPE EVLAVYRGQPENKTSYSLSDYPFLADGSHSRVLFDHLRDEIMRLDAGITQ EVLKLYIAFKAETNFVDVVPQKSRLRLSLNMQFHELVDPKGIAKDVTNVG RWGNGDVEIGFSDLAQLPYIMGLIRQAFEKQMESALV >ECs0480 hypothetical protein MKGEEKMPSFDIVSEVDLQEARNAVDNASREVESRFDFRNVEASFELNDA SKTIKVLSESDFQVNQLLDILRAKLLKRGIEGSSLDVPENIVHSGKTWFV EAKLKQGIESATQKKIVKMIKDSKLKVQAQIQGDEIRVTGKSRDDLQAVM AMVRGGDLGQPFQFKNFRD >ECs2375 hypothetical protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENS KMMLANIASIEIPPIYCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRI LLGEYFRDQFLRLVDQARKQKFAVAVYESCQVTDLQITNAGVMLATNQDL PSETFDLAVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTS LSGLDAAMAVAIQHGSFIEDDKQHVVFHRDNASEKLNITLMSRTGILPEA DFYCPIPYEPLHIVTDQALNAEIQKGEYGLLDRVFRLIVEEIKFADPDWS QRIALESLNVDSFAQAWFAERKQRDQFDWAEKNLQEVERNKREKHTVPWR YVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLA LREAGIIHILALGEDYKMEINESRTVLKTEDNSYSFDVFIDARGQRPLKV KDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQ PFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >ECs2543 hypothetical protein MALNTPQITPTKKITVRAIGEELPRGDYQRCPQCDMLFSLPEINSHQSAY CPRCQAKIRDGRDWSLTRLAAMAFTMLLLMPFAWGEPLLHIWLLGIRIDA NVMQGIWQMTKQGDAITGSMVFFCVIGAPLILVTSIAYLWFGNRLGMNLR PVLLMLERLKEWVMLDIYLVGIGVASIKVQDYAHIQAGVGLFSFVALVIL TTVTLSHLNVEELWERFYPQRPATRRDEKLRVCLGCHFTGYPDQRGRCPR CHIPLRLRRRHSLQKCWAALLASIVLLLPANLLPISIIYLNGGRQEDTIL SGIMSLASSNIAVAGIVFIASILVPFTKVIVMFTLLLSIHFKCQQGLRTR ILLLRMVTWIGRWSMLDLFVISLTMSLINRDQILAFTMGPAAFYFGAAVI LTILAVEWLDSRLLWDAHESGNARFDD >ECs0521 hypothetical protein MQRIILIIIGWLAVVLGTLGVVLPVLPTTPFILLAAWCFARSSPRFHAWL LYRSWFGSYLRFWQKHHAMPRGVKPRAILLILLTFAISLWFVQMPWVRIM LLVILACLLFYMWRIPVIDEKQEKH >ECs0617 hypothetical protein MDKQSLHETAKRLALELPFVELCWPFGPEFDVFKIGGKIFMLSSELRGVP FINLKSDPQKSLLNQQIYPSIKPGYHMNKKHWISVYPGEEISEALLRDLI NDSWNLVVDGLAKRDQKRVRPG >ECs0842 putative host specificity protein MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPIEGPVDGLKSVLLNST PVLDSEGNTNISGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPI TRTITSANIDRLRFTFGVQALRETTSKGDRNPSEVRLLVQIQRNGGWVTE KDITIKGKTTSQYLASVVVDNLPPRPFNIRMRRMTPDSTTDQLQNKTLWS SYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYD PEKRTYSGIWDGTLKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWA LYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMP VWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVE VNWTDPDNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLW LIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISIGGRVLAVNS QTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVP DGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEGIVD NGAHFDGDQSSTVNGVTPPAVQHLTAEVSADSGEYQVLARWDTPKVVKGV SFLLRLTVAADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQ QGDPASVSFRIAAPAAPSQIELTPGYFQITATPHLAVYDPTVQFEFWFSE TRIADIRQVETSARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAF VEAVGRASDDAEGYLDFFKGKITESHLGKELLEKVDLTEDNASRLDEFSK EWKDANDKWNAMWGVKIEQTKDGKHYVAGIGLSMEDTEEGKLSQFLVAAN RIAFIDPANGNETPMFVAQGNQIFMNDVFLKRLTAPTITSGGNPPVFSLT SDGKLTAKNADISGSVNANSGTLNNVTVNENCTIKGMLEATQVRGDFVKA VSKSFPKQAGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYSD PGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRG SVTLEFKVFHKGNQRAGNITDCTVIVTKKAASGISIR >ECs2810 hypothetical protein MRANKSLSPFEIRVYRHYRIVHGTRVALAFLLTFLIIRLFTIPEGTWPLV TMVVIMGPISFWGNVVPRAFERIGGTVLGSILGLIALQLELISLPLMLVW CAAAMFLCGWLALGKKPYQGLLIGVTLAIVVGSPTGEIDTALWRSGDVIL GSLLAMLFTGIWPQRAFIHWRIQLAKSLTEYNRVYQSAFSPNLLERPRLE SHLQKLLTDAVKMRGLIAPASKETRIPKSIYEGIQTINRNLVCMLELQIN AYWATRPSHFVLLNAQKLRDTQHMMQQILLSLVHALYEGNPQPVFANTEK LNDAVEELRQLLNNHHDLKVVETPIYGYVWLNMETAHQLELLSSLICRAL RK >ECs3179 hypothetical protein MSTPDNRSVNFFSLFRRGQHYSKTWPLEKRLAPVFVENRVIKMTRYAIRF MPPIAVFTLCWQIALGGQLGPAVATAPVRLKFTHAGIVVAGQAFCHAITP CNPQLVL >ECs4492 hypothetical protein MFPFRRNVLAFAALLALSSPVLAGKLAIVIDDFGYRPHNENQVLAMPSAI SVAVLPDSPHAREMATKAHNSGHEVLIHLPMAPLSKQPLEKNTLRPEMSS DEIERIIRSAVNNVPYAVGINNHMGSKMTSNLFGMQKVMQALERYNLYFL DSVTIGNTQAMRAAQGTGVKVIKRKVFLDDSQNEADIRVQFNRAIDLARR NGSTIAIGHPHPSTVRVLQQMVYNLPPDITLVKASSLLNEPQVDTSTPPK NAVPDAPRNPFRGVKLCKPKKPIEPVYANRFFEVLSESISQSTLIVYFQH QWQGWGKQPEAAKFNASAN >ECs1648 host specificity protein MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVEGPVDGLKSVLLNST PVLDSEGNTNISGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPI TRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTE KDITIKGKTTSQYLASVVVDNLPPRPFNIRMRRMTPDSTTDQLQNKTLWS SYTEIIDVKQCYPNTALVGVQVDSEQFGSQKVSRNYHLRGRILQVPSNYN PQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWA LYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMP VWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGASFRYSFSALKDRHNAVE VNWIDPDNGWETATELVEDTQAILRYGRNVTKMDAFGCTSRGQAHRAGLW LIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISIGGRVLAVNS QTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVP DGVAGYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVD NGAHFDGDLSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGV SFLLRLTVAADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQ QGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSE TRIADIRQVETSARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAF VEAVGRASDDAEGYLDFFKGKITESHLGKELLEKVDLTEDNASRLDEFSK EWKDANDKWNAMWGVKIEQTKDGKHYVAGIGLSMEDTEEGKLSQFLVAAN RIAFIDPANGNETPMFVAQGNQIFMNDVFLKRLTAPTITSGGNPPAFSLT PDGKLTAKNADISGSVNANSGTLNNVTINENCQIKGKLSANQIEGDIVKT VSKSFPRTSTYASGTITVRISDDQKFDRQVMIPPVLFRGGKHENFNSNNQ QSYWYSTCRLRVTRNGQEIFNQSTTDAQGVFSSVIDMPAGQGTLTLTFTV SSSGANNWTPTTSISDLLVVVMKKSTAGISIS >ECs5038 hypothetical protein MWYQKTLTLSAKSRGFHLVTDEILNQLADMPRVNIGLLHLLLQHTSASLT LNENCDPTVRHDMERFFLRTVPDNGNYEHDYEGADDMPSHIKSSMLGTSL VLPVHKGRIQTGTWQGIWLGEHRIHGGSRRIIATLQGE >ECs0226 hypothetical lipoprotein MNRKAFLACVLMCVILTGCETAKKISEVIKNPDIQVGSLKEQPSEITVTL LTEPDTNTNAEGESAAVDVQLVYLTDDSKLQAADYDQIASTPLPDVLGKN YIDHQDFNLLPDTIKTLPPIKLDEKTQFIGVVAYFSDDQATEWKQIETVE GTGHHYRLLVHVRQSSIEMKKEDE >ECs0507 glycoprotein/polysaccharide metabolism MASGLAVAIALAACADKSADIQTPAPAANTSISATQQPAIQQPNVSGTVW IRQKVALPPDAVLTVTLSDASLADAPSKVLAQKAVRTEGKQSPFSFVLPF NPADVQPNARILLSAAITVNDKLVFITDTVQPVINQGGTKADLTLVPVQQ TAVPVQASGGATTTVPSTSPTQVNPSSAVPAPTQY >ECs3977 hypothetical protein MELLTQLLQALWAQDFETLANPSMIGMLYFVLFVILFLENGLLPAAFLPG DSLLVLVGVLIAKGAMGYPQTILLLTVAASLGCWVSYIQGRWLGNTRTVQ NWLSHLPAHYHQRAHHLFHKHGLSALLIGRFIAFVRTLLPTIAGLSGLNN ARFQFFNWMSGLLWVLILTTLGYMLGKTPVFLKYEDQLMSCLMLLPVVLL VFGLAGSLVVLWKKKYGNRG >ECs3793 putative actin MKFKVIALAALMGISGMAAQANELPDGPHIVTSGTASVDAVPDIATLAIE VNVAAKDAATAKKQADERVAQYISFLELNQIAKKDISSANLRTQPDYDYQ DGKSILKGYRAVRTVEVTLRQLDKLNSLLDGALKAGLNEIRSVSLGVAQP DAYKDKARKAAIDNAIHQAQELANGFHRKLGPVYSVRYHVSNYQPSPMVR MMKADAAPVSAQETYEQAAIQFDDQVDVVFQLEPVDQQPAKTPAAQ >ECs4960 hypothetical protein MRGKLISAIHVAKRELALDDETYTSALLAATGKTSCRDMSPDELSRVLDV FKKRGFKVRQNPVNRALKPGTVTAKIRAIWKVMHRQGFITDGAETALNRW VKSQTAAQNGGEGVANWQWLEQHPALASDVLERLKRWHRRKMLAAMGMPE RTLMGYDAVCRQYEKSLPR >ECs3527 hypothetical protein MGFWRIVITIILPPLGVLLGKGFGWAFIINILLTLLGYIPGLIHAFWVQT RD >ECs1735 hypothetical protein MSQLCPCGSAVEYSLCCHPYVSGEKVAPDPEHLMRSRYCAFVMQDADYLI KTWHPSCGAAALRAELIAGFAHTEWLGLTVFEHCWQDADNIGFVSFVARF TEGGKTGAIIERSRFLKENGQWYYIDGTRPQFGRNDPCPCGSGKKFKKCC GQ >ECs2060 VgrE MSTGLRFTLEVDGLPPDAFAVVSFHLNQSLSSLFSLDLSLVSQQFLSLEF AQVLDKMAYLTVWQGDDVQRRVKGVVTWFELGENDKNQMLYSMKVCPPLW RTGLRQNFRIFQNEDIESILATILKENGVTEWSPLFSEPHPSREFCVQYG ETDYDFLCRMAAEEGIFFYEEHAQKSIDQSLVLCDTVRYLPESFEIPWNP NTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQYQDY QRTQYEVYDYPGRFKGAHGQNFARWQMDGWRNNAEVARGTSRSPEIWPGR RIVLTGHPQANLNREWQVVASELHGEQPQAVPGRRGSGTTLNNHFAVIPA DRTWRPQPLLKPLVDGPQSAVVTGPAGEEIFCDEHGRVRVKFNWDRYNPS NQDSSCWIRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRTY HQENRTPGSLPGTKTQMTIRSKNYKGSGFNELKFDDATGKEQVYIHAQKN MNTEVLNNRTTDVINNHAETIGNNQMIAVTNNQIQTVGVNQIETVGSNQI INVGSVQVETIGLVRALTVGVAYQTTVGGIMNTSVALMQSSQIGLHKSLR VGLGYDVKVGNNVTFTVGKTKKDDTGQTAIYSAGEHLELCCGKARLVLTK DGQIFLNGTKIHLQGKEQVNGDSLLINWNCAASKSPPKTPDEKQDTPDMR EY >ECs2020 hypothetical protein MNITPFPTLSTATIDAINVIGQWLAQDDFSGEVPYQADCVILAGNAVMPT IDAACKIARDQQIPLLISGGIGHSTTFLYSAIAQHPHYNTIRTTGRAEAT ILADIAHQFWHIPHEKIWIEDQSTNCGENARFSIALLNQAVERVHTAIVV QDPTMQRRTMATFRRMTGDNPDVPRWLSYPGFVPQLGNNADSVIFINQLQ GLWPVERYLSLLTGELPRLRDDSDGYGPRGRDFIVHVDFPAEVIHAWQTL KHDAVLIEAMESRSLR >ECs1034 paraquat-inducible protein A MCEHHHAAKHILCSQCDMLVALPRLEHGQKAACPRCGTTLTVAWDAPRQR PTAYALAALFMLLLSNLFPFVNMNVAGVTSEITLLEIPGVLFSEDYASLG TFFLLFVQLVPAFCLITILLLVNRAELPVRLKEQLARVLFQLKTWGMAEI FLAGVLVSFVKLMAYGSIGVGSSFLPWCLFCVLQLRAFQCVDRRWLWDDI APMPELRQPLKPGVTGIRQGLRSCSCCTAILPADEPVCPRCGTKGYVRRR NSLQWTLALLVTSIMLYLPANILPIMVTDLLGSKMPSTILAGVILLWSEG SYPVAAVIFLASIMVPTLKMIAIAWLCWDAKGHGKRDSERMHLIYEVVEF VGRWSMIDVFVIAVLSALVRMGGLMSIYPAMGALMFALVVIMTMFSAMTF DPRLSWDRQPESEHEES >ECs0317 hypothetical protein MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNA ACGPESLIRAAGQIDCSRNFLNPPYIFLRDWLGLTDPNAAVYTFAGHVFN WVGVTHIIFSIVFAVGYCVVAEVFPKIKLWQGLLAGALAQLFVHMISFPL MGLTPPLFDLPWYENVSEIFGHLVWFWSIEIIRRDLRNRITHEPDPEIPL GSNR >ECs0535 putative ligase MDLLYRVKTLWAALRGNHYTWPAIDISLPGNRHFHLIGSIHMGSHDMAPL PTRLLKKLKNADALIVEADVSTSDTPFANLPACEALEERISEEQLQKLQH ISQEMGISPSLFSTQPLWQIAMVLQATQAQKLGLRAEYGIDYQLLQAAKQ QHKPVIELEGAENQIAMLLQLPDKGLALLDDTLTHWHTNARLLQQMMSWW LNAPPQNNEITLPNTFSQSLYDVLMHQRNLAWRDKLRAMPPGRYVVAVGA LHLYGEGNLPQMLR >ECs2531 hypothetical protein MFAGGDDVFYGYPGQDVVMNITATVLLAFGMSMDAFAASIGKGATLHKPK FSEALRTGLIFGAVETLTPLIGWGMGMLASRFVLEWNHWIAFVLLIFLGG RMIIEGFRGADDEDEEPRRRHGFWLLVTTAIATSLDAMAVGVGLAFLQVN IIATALAIGCATLIMSTLGMMVGRFIGSIIGKKAEILGGLVLIGIGVQIL WTHFHG >ECs0233 hypothetical protein MSKKFEGSVAPRERINISYVPKTDGQTAEVELPLNMLVVGDTGNTQETSS LDERQAVSVNKHNFGAVMAEAAIGLNFTVPATLKGSTTDDEMNVALNIKS LDDFSPDSVARQVPEVNKLLELREALTALKGPMGNLPAFRTQLQALLENE ESREQLLKEIGQVSNK >ECs2558 hypothetical protein MAVEVKYVVIREGEEKMSFTSKKEADAYDKMLDTADLLDTWLTNSPVQME DEQREALSLWLAEQKDVLSTILKTGKLPSPQVVGAESEEEDASHAA >ECs1118 putative tail assembly protein MATTNAFSLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSMQMPG FRRQMNEGWYQIRIAGDDTAPEAVYARLHEQLGEGTVIHIVPRLAGAGKG GLQIVLGAAAIVGSFFTAGASMALWGSALAAGGFSATTMLFSLGASMILG GVAQMLAPKAKTPDYRATDNGRQNTYFSSLDNMIAQGNPMPVPYGEMLVG SRRISQDISTRDEGGDGKVVVIGRQA >ECs4961 putative transcription regulator MAETQMSMFGGDSEQLHALIDRLDDIPDDVLKKNWPRTLSELVEVTGAEL QRQGIEPVLAGKLARKVAAAQAAYMGGRGYYLPVGESLFAELRNNEIFSR WDRGEKIESLRRHYRMSETQIYTVIREQRRLHLARTQPPLF >ECs3214 hypothetical protein MKKKTTLSEEDQALFRQLMAGTRKIKQDTIVHRPQRKKVSEVPVKRLIQE QVDASHYFSDEFQPLLNADGPVKYVRPGVDHFEAKKLRRGDYSPELFLDL HGLTQLQAKQELGALIAACRREHVFCACVMHGHGKHILKQQTPLWLAQHP HVMAFHQAPKEYGGDAALLVLIEVEEWLPPELP >ECs0967 clpS, ATP-dependent Clp protease adaptor protein ClpS MGKTNDWLDFDQLAEEKVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKF FSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLL CTLEKA >ECs3394 iscA, iron-sulfur cluster assembly protein MSITLSDSAAARVNTFLANRGKGFGLRLGVRTSGCSGMAYVLEFVDEPTP EDIVFEDKGVKVVVDGKSLQFLDGTQLDFVKEGLNEGFKFTNPNVKDECG CGESFHV >ECs4809 rbn, ribonuclease BN MLKTIQDKARHRTRPLWAWLKLLWQRIDEDNMTTLAGNLAYVSLLSLVPL VAVVFALFAAFPMFSDVSIQLRHFIFANFLPATGDVIQRYIEQFVANSNK MTAVGACGLIVTALLLMYSIDSALNTIWRSKRARPKIYSFAVYWMILTLG PLLAGASLAISSYLLSLRWASDLNTVIDNVLRIFPLLLSWISFWLLYSIV PTIRVPNRDAIVGAFVAALLFEAGKKGFALYITMFPSYQLIYGVLAVIPI LFVWVYWTWCIVLLGAEITVTLGEYRKLKQAAEQEEDDEP >ECs2391 sufA, iron-sulfur cluster assembly scaffold protein MDMHSGTFNPQDFAWQGLTLTPAAAVHIRELVAKQPGMVGVRLGVKQTGC AGFGYVLDSVSEPDKDDLLFEHDGAKLFVPLQAMPFIDGTEVDFVREGLN QIFKFHNPKAQNECGCGESFGV >ECs5169 ulaA, ascorbate-specific PTS system enzyme IIC MEILYNIFTVFFNQVMTNAPLLLGIVTCLGYILLRKSVSVIIKGTIKTII GFMLLQAGSGILTSTFKPVVAKMSEVYGINGAISDTYASMMATIDRMGDA YSWVGYAVLLALALNICYVLLRRITGIRTIMLTGHIMFQQAGLIAVTLFI FGYSMWTTIICTAILVSLYWGITSNMMYKPTQEVTDGCGFSIGHQQQFAS WIAYKVAPFLGKKEESVEDLKLPGWLNIFHDNIVSTAIVMTIFFGAILLS FGIDTVQAMAGKVHWTVYILQTGFSFAVAIFIITQGVRMFVAELSEAFNG ISQRLIPGAVLAIDCAAIYSFAPNAVVWGFMWGTIGQLIAVGILVACGSS ILIILGFIPMFFSNATIGVFANHFGGWRAALKICLVMGMIEIFGCVWAVK LTGMSAWMGMADWSILAPPMMQGFFSIGIAFMAVIIVIALAYMFFAGRAL RAEEDAEKQLAEQSA >ECs0940 ulaA, ascorbate-specific PTS system enzyme IIC MEGVPTMFAKFIDVIQTFLTEPAILIGLLVGIGYALDKKSPIKIITGMVS AMVGLMMVLFGGFQFSATFKPVAEAVSKAYGVHGYLMDSYAMKAATQIAL GDNFGYVGYVFVLAFFTNLLLVLFGRYTGAKGIFLTGNTGVSHSQAVLWL IVFWLGFSWTTSIIIAGILTGVFWAFSTTLIVKPIAKVTKDAGFTIAHNQ MLGLWFFSKFAHKFGDPEKHDAENLKLPGWLAIFNHNVTAIAIVMTLFVG GFLLSTGIDNVQLMAKGKPWYIYIINLGLQFSMYMVILLQGVRMMVGEIN GSFKGWQDRFIPNAIPAVDVAALLPFSPNAATLGFVFCTFGTIFSMGILL LVHSPIMVLPGFVPLFFSGGPIGVLANRMGGYRSVIICTFLLGIIQTFGT VWAIPLTGLAENGVGWTGIFDWATVWPAICEVLKFIAATFHLGPYAG