GenColors

Gene list

Applied filters:

COG category: Function unknown
Gene type: CDS
Genomic element: chromosome

Number of genes found: 427

# Escherichia coli O157:H7 str. Sakai, Sakai

>ECs3667 hypothetical protein
MTSRFMLIFAAISGFIFVALGAFGAHVLSKTMGAVEMGWIQTGLEYQAFH
TLAILGLAVAMQRRISIWFYWSSVFLALGTVLFSGSLYCLALSHLRLWAF
VTPVGGVSFLAGWALMLVGAIRLKRKGVSHE
>ECs2785 hypothetical protein
MRRVNILCSFALLFASHTSLAVTYPLPPEGSRLVGQSLTVTVPDHNTQPL
ETFAAQYGQGLSNMLEANPGADVFLPKSGSQLTIPQQLILPVTVRKGIVV
NVAEMRLYYYPPDSNTVEVFPIGIGQAGRETPRNWVTTVERKQEAPTWTP
TPNTRREYAKRGESLPAFVPAGPDNPMGLYAIYIGRLYAIHGTNANFGIG
LRVSQGCIRLRNDDIKYLFDNVPVGTRVQIIDQPVKYTTEPDGSKWLEVH
EPLSRNRAEYESDRKVPLPVTPSLRAFINGQEVDVNRANAALQRRSGMPV
QISSGSRQMF
>ECs3990 hypothetical protein
MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVER
VEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDAT
AQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHT
NIVHIETHDGVVFTQQACVAEGEQESPLSVLSRTTLAEILKFVNEVPFAA
IRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI
RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLAR
ALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMA
ISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGI
VAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR
>ECs2498 hypothetical protein
MFDVTLLILLGLAALGFISHNTTVAVSILVLIIVRVTPLSTFFPWIEKQG
LSIGIIILTIGVMAPIASGTLPPSTLIHSFLNWKSLVAIAVGVIVSWLGG
RGVTLMGSQPQLVAGLLVGTVLGVALFRGVPVGPLIATGLVSLIVGKQ
>ECs3533 hypothetical protein
MFNRPNRNDVDDGVQDIQNDVNQLADSLESVLKSWGSDAKGEAEAARSKA
QALLKETRARMHGRTRVQQAARDAVGCADSFVRERPWCSVGTAAAVGIFI
GALLSMRKS
>ECs2949 putative tail length tape measure protein precursor
MAGNFADLTAVLTLDSARFSEEAARVKKELGETSALADLMSGKVSQSFRK
QADAAEQSLSRQALAAQKAGISVGQYKAAMRTLPAQFTDIVTQLAGGQNP
FLIMLQQGGQISDSFGGPLSLLTLLKEELLGIRDASESSEESLSDTANAL
AENARNAGELGRFMSVARVAAGGGVAVLAALAAAAWQAEQADRALLRSLT
LTGGAAATTTAELWKMAGVISDEAGGGIRQAAENLARLAESGKYTAGQLR
IMGETSQRWLQTVGDDAGKVEKAFEGIAADPVKALASLNQQYNFLSVSQL
RHIDELERTKGKQAAVTEAMSLFADVMNARLEQLDKAATPVEKIWDDVKT
WTSDAWAWIGDHTLGALSLITDVVAGTVEQVKLLLVQGDLALAEFIQSAW
ETTKNVPGVGALFGELAEENRVFIEKTKRDELALRKSIAERDARIRQGEM
GYINRSRATGVSKGPGQQEAVSRLAEELTGKKHTSPKTRSAGEREEEQAR
EALLALEAELRTLEKHSGANEKISRQRRDLWKAESQYAVLKEAATKRQLS
EQEKSLLAHKDETLEYKRQLAELGDKVEYQKRLNELAQQAVRFEEQQSAK
QAAISAKARGLTDRQAQRESEAQRLRDVYGDNPAALAKATSALKNTWSAE
EQLRGSWMAGLKSGWGEWAESATDSFSQVKSAATQTFDGIAQNMAAMLTG
AEADWRGFTRSVLSMMTEILLKQAMVGIVGRIGSAIGGAFGGGASASTGT
AIQAAAANFHFATGGFTGTGGKYEPAGIVHRGEFVFTKEATSRIGVGNLY
RLMRGYAEGGYVGGAGSPAQMRRTEGINFNQNNHVVIQNDGTNGQAGPQL
MKAVYDMARKGAQDELRLQLRDGGMLSGSRR
>ECs0409 nucleoprotein/polynucleotide-associated enzyme
MCRCLTRLIKQGRKGLAPNTPTAFTGILAKNYSAGLETKMAKLTLQEQLL
KAGLVTSKKAAKVERTAKKSRVQAREARAAVEENKKAQLERDKQLSEQQK
QAALAKEYKAQVKQLIEMNRITIANGDIGFNFTDGNLIKKIFVDKLTQAQ
LINGRLAIARLLVDNNSEGEYAIIPASVADKIAQRDASSIVLHSALSAEE
QDEDNPYADFKVPDDLMW
>ECs2456 hypothetical protein
MNAERKFLFACLIFALAIYAIHAFGLFDLLTDLPHLQTLIRQSGLFGYSL
YILLFIIATLFLLPGSILVIAGGIVFGPFLGTLLSLIAATLASSCSFLLA
RWMGRDLLLKYVGHSHTFQAIEKGIARNGIDFLILTRLIPLFPYNIQNYA
YGLTTIAFWPYTLISALTTLPGIVIYTVMASDLANEGITLRFILQLCLAG
LALFILVQLAKLYARHKHVDLSASRRNPLTHPKNEG
>ECs0105 zinc-binding protein
MSETITVNCPTCGKTVVWGEISPFRPFCSKRCQLIDLGEWAAEEKRIPSS
GDLSESDDWSEEPKQ
>ECs1677 hemolysin E
MIMTEIVADKTVEVVKNAIETADGALDLYNKYLDQVIPWQTFDETIKELS
RFKQEYSQAASVLVGNIKTLLMDSQDKYFEATQTVYEWCGVATQLLAAYI
LLFDEYNEKKASAQKDILIKVLDDGITKLNEAQKSLLVSSQSFNNASGKL
LALDSQLTNDFSEKSSYFQSQVDKIRKEAYAGAAAGVVAGPFGLIISYSI
AAGVVEGKLIPELKNKLKSVQSFFTTLSNTVKQANKDIDAAKLKLTTEIA
AIGEIKTETETTRFYVDYDDLMLSLLKEAANKMINTCNEYQKRHGKKTLF
EVPEV
>ECs0224 hypothetical protein
MDEGSLSLPPFTGYDEKSLRDYHLALHGNSLNPMIDAATPLLGMVMRLST
MNSQTMPEHLFAQVVTDVQAVEQLLQEQGYEPGVIISFRYILCTFIDEAA
LGNGWSNKNEWIKQSLLVHFHNEAWGGEKVFILLERLIREPKRYQDLLEF
LWLCFSLGFRGRYKVAVQDQGEFEQIYRRLYHVLHKLRGDAPFPLLHQDK
KTQGGRYQLISRLTVKHIFCGGVVVLALFYLFYLLRLDSQTQDILHQLNK
LLR
>ECs0412 putative alpha helix chain
MMRCEMPSTPEEKKKVLTRVRRIRGQIDALERSLEGDAECRAILQQIAAV
RGAANGLMAEVLESHIRETFDRNDCYSREVSQSVDDTIELVRAYLK
>ECs1488 hypothetical protein
MNKSMLAGIGIGVAAALGVAAVASLNVFERGPQYAQVVSATPIKETVKTP
RQECRNVTVTHRRPVQDEHRITGSVLGAVAGGVIGHQFGGGRGKDVATVV
GALGGGYAGNQIQGSLQESDTYTTTQQRCKTVYDKSEKMLGYDVTYKIGD
QQGKIRMDRDPGTQIPLDSNGQLILNNKV
>ECs2248 putative portal protein
MWNLLRRTRKNQKSGRDVREVGWRSLFQAVAEPFAGAWQQGVKADPETVL
SFHAVFSCISLISQDIAKMRLRLMQTDVQGIRREKRQGDTARLCRRPNAQ
QNRIQFFELWLNSKLRHGNTVVLKIRTPRGQIKELRILDWNRVEPLVADD
GEVFYRITPDRNCGITESVTVPAREVIHDRFNCFFHPLVGLPPVYAAGLA
AMQGHHIQANSTYFFRNGGRPSGVIEVPGSITEENAKKLKGNWDSGYTGE
NAGKTAILSNGAKYSPTTFSPVDAQTVEQLKMTAEIVCSVFRVPAYKIGV
GHPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETGENESTEFDVTT
LLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYS
LEALSRRDAREDPFASAGKTVSSQLPDGASDGNKAISETEHDAVKAMFRG
DTEKMTERELSIIRALGEEFSTVLADLQRTFEGKMASQAQAFEEKLTSLS
AVLQKHVTVDEVRPVLQAMVDDAVGAIPVPRDGRDYDPDVLQQAVNDAVA
NIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPEVLQKAVN
DAVANIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPEVLQ
KAVNDAVANIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDP
DVLQKAVLDAVSALPAPQDGRDATALEILPAIDDQKSFPRGTYATHQGGL
WRAYEKTHGMRGWECLVDGVADIDVSMTGERLFSVVVRQSSGQRTEKTFS
LPVMLYRGVFRAGETYHPGDTVTWGGSLWHCNSMTEDKPGEAHSSAWTLA
AKRGRDAGG
>ECs0205 hypothetical protein
MRKNTYAMRYVAGQPAERILPPGSFASIGQALPPGEPLSTEERIRILVWN
IYKQQRAEWLSVLKNYGKDAHLVLLQEAQTTPELVQFATANYLAADQVPA
FVLPQHPSGVMTLSAAHPVYCCPLREREPILRLAKSALVTVYPLPDTRLL
MVVNIHAVNFSLGVDVYSKQLLPIGDQIAHHSGPVIMAGDFNAWSRRRMN
ALYRFAREMSLRQVRFTDDQRRRAFGRPLDFVFYRGLNVSEASVLVTRAS
DHNPLLVEFSPGKPDK
>ECs4853 hypothetical protein
MSLEVFEKLEAKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREE
LERENNHLKEQQNGWQERLQALLGRMEEV
>ECs1200 DNA-binding protein
MNELINSNAIKMTSIEIAELVGSRHDKVKQSIERLAVRGVIRNPPMVVFE
KINNLGLLRGVEAYVFEGEQGKRDSIIVVAQLSPEFTARLVDRWRELEGA
TAKIPQTFSEALRLAADLEDQKAELEKQLALAAPKVEFADRVGEASGILI
GNFAKVVGIGPNKLFAWMRDHKILIASGSRRNVPMQEYMDRGYFTVKETA
VNTNHGIQISFTTKITGRGQQWLTRKLLDNGMLKVTREAA
>ECs4197 hypothetical protein
MSRSLLTNETSELDLLDQRPFEQTDFDILKSYEAVVDGLAMLIGSHCEIV
LHSLQDLKCSAIRIANGEHTGRKIGSPITDLALRMLHDMTGADSSVSKCY
FTRAKSGVLMKSLTIAIRNREQRVIGLLCINMNLDVPFSQIMSTFVPPET
PDVGSSVNFASSVEDLVTQTLEFTIEEVNADRNVSNNAKNRQIVLNLYEK
GIFDIKDAINQVADRLNISKHTVYLYIRQFKSGDFQGQDK
>ECs5211 hypothetical protein
MTKQPEDWLDDVPGDDIEDEDDEIIWVSKSEIKRDAEELKRLGAEIVDLG
KNALDKIPLDADLRAAIELAQRIKMEGRRRQLQLIGKMLRQRDVEPIRQA
LDKLKNRHNQQVVLFHKLENLRDRLIDQGDDAIAEVLNLWPDADRQQLRT
LIRNAKKEKEGNKPPKSARQIFQYLRELAENEG
>ECs0441 hypothetical protein
MLQSNEYFSGKVKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISG
ALNVLLPDATDWQVYEAGSVFNVPGHSEFHLQVAEPTSYLCRYL
>ECs0611 hypothetical protein
MKKALQVAMFSLFTVIGFNAQANEHHHETMSEAQPQVISATGVVKGFDLE
SKKITIHHDPIAAVNWPEMTMRFTITPQTKMSGIKTGDKVAFNFVQQGNL
SLLQDIKVSQ
>ECs4079 hypothetical protein
MKFKTNKLSLNLVLASSLLAASIPAFAVTGDTDQPIHIESDQQSLDMQGN
VVTFTGNVIVTQGTIKINADKVVVTRPGGEQGKEVIDGYGKPATFYQMQD
NGKPVEGHASQMHYELAKDFVVLTGNAYLQQVDSNIKGDKITYLVKEQKM
QAFSDKGKRVTTVLVPSQLQDKNNKGQTPAQKKGN
>ECs5157 hypothetical protein
MTWNPLALATALQTVPEQNIDVTNSENALIIKMNDYGDLQINILFTSRQM
IIETFICPVSSISNPDEFNTFLLRNQKMMPLSSVGISSVQQEEYYIVFGA
LSLKSSLEDILLEITSLVDNALDLAEITEEYSH
>ECs4762 putative alpha helix chain
MDFSIMVYAVIALVGVAIGWLFASYQHAQQKAEQLAEREEMVAELSAAKQ
QITQSEHWRAECELLNNEVRSLQSINTSLEADLREVTTRMEAAQQHADDK
IRQMINSEQRLSEQFENLANRIFEHSNRRVDEQNRQSLNSLLSPLREQLD
GFRRQVQDSFGKEAQERHTLTHEIRNLQQLNAQMAQEAINLTRALKGDNK
TQGNWGEVVLTRVLEASGLREGYEYETQVSIENDARSRMQPDVIVRLPQG
KDVVIDAKMTLVAYERYFNAEDDYTRESALQEHIASVRNHIRLLGRKDYQ
QLPGLRTLDYVLMFIPVEPAFLLALDRQPELITEALKNNIMLVSPTTLLV
ALRTIANLWRYEHQSRNAQQIADRASKLYDKMRLFIDDMSAIGQSLDKAQ
DNYRQAMKKLSSGRGNVLAQAEAFRGLGVEIKREINPDLAEQAVSQDEEY
RLRSVPEQPNDEAYQRDDEYNQQSR
>ECs1554 putative minor tail protein
MAEIKTLHLVPREGMQVSEKPSVVRVRFGDGYEQRRPTGLNPQLKTFQAV
FRVTDESTRRWLDEFLSWHGGYRAFLWRPPKHNRTVRGVCREWSVTDNAR
YSDFSCTIEQVVN
>ECs4776 hypothetical protein
MESWLIPAAPVTVVEEIKKSRFITLLAHTDGVEAAKAFVESVRAEHPDAR
HHCVAWVAGAPDDSQQLGFSDDGEPAGTAGKPMLAQLMGSGVGEITAVVV
RYYGGILLGTGGLVKAYGGGVNQALRQLTTQRKTPLTEYTLQCEYSQLTG
IEALLGQCDGKIINSDYQAFVLLRVALPAAKVAEFSAKLADFSRGSLQLL
AIEE
>ECs0681 putative alpha helical protein
MNKVAQYYRELVASLSERLRNGERDIDALVEQARERVIKTGELTRTEVDE
LTRAVRRDLEEFAMSYEESLKEESDSVFMRVIKESLWQELADITDKTQLE
WREVFQDLNHHGVYHSGEVVGLGNLVCEKCHFHLPIYTPEVLTLCPKCGH
DQFQRRPFEP
>ECs2075 IpaH-like protein
MTNINTACVKNNASYQFNNALPNKETISSNFCERLEQWGNKSLNNGEERA
IAVERIKEAYNSNMASLDLSYLDLSELPPIPSTVNTLNLENNCLTCLDFT
DNASLVNINLSFNKINTITFPNESNLENIYIDHNNLESLDLKNQHSLVNL
EAQNNNLKKLIFLIVIN
>ECs4264 hypothetical protein
MNYELLTTENAPVKMWTKGVPVEADARQQLINTAKMPFIFKHIAVMPDVH
LGKGSTIGSVIPTKGAIIPAAVGVDIGCGMNALRTALTAADLPENLAELR
QAIETAVPHGRTTGRCKRDKGAWENPPVNVDAKWAELEAGYQWLTQKYPR
FLNTNNYKHLGTLGTGNHFIEICLDESDQVWIMLHSGSRGIGNAIGTYFI
DLAQKEMQETLETLPSRDLAYFMEGTEYFDDYLKAVAWAQLFASLNRDVM
MENVVTALQSITQKTVRQPQTLAMEEINCHHNYVQKEQHFGEEIYVTRKG
AVSARAGQYGIIPGSMGAKSFIVRGLGNEESFCSCSHGAGRVMSRTKAKK
MFSVEDQIRATVHVECRKDAEVIDEIPMAYKDIDAVMAAQSDLVEVIYTL
RQVVCVKG
>ECs1985 putative minor tail protein
MQDIHEESLNESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT
WQGRQYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG
ATVVRRRVYARFLDAVNFVAGNPEADPEQELSDRWVVEQMSQLTAMTASF
VLATPTETDGALFPGRIMLANTCMWTYRSDECGYTGGAVADEFDNPTTDI
RKDRCSKCMRGCEMRGMAVNFGGFLSINKLSQ
>ECs4521 hypothetical protein
MLLHILYLVGITAEAMTGALAAGRRRMDTFGVIIIATATAIGGGSVRDIL
LGHYPLGWVKHPEYVIIVATAAVLTTIVAPVMPYLRKVFLVLDALGLVVF
SIIGAQVALDMGHGPIIAVVAAVTTGVFGGVLRDMFCKRIPLVFQKELYA
GVSFASAVLYIALQHYVSNHDVVIISTLVFGFFARLLALRLKLGLPVFYY
SHEGH
>ECs0225 hypothetical protein
MATTRNKVMWQEGMLMRPHHFQQQQRYNDYLDNQRFRAMNDLSWGFTELT
LNNELLAQGKIMIDSASGTLPDGTVFSIPDQDALPDPLHPQNFPDERSRN
IYLALPVASDVRNEISDGRRIGRYRLNYADVRDLHSEEGDARTLTLGQLT
PRIMSGAEDMSAYITLPLCRISDRHADGSLTLDDDFIPSCQNIQVSKKLR
VYLKEVQGAIGGRASDLANRIGSPAQSGIADVAEFMMLQLLNRNQTRFTH
RARRSQLHPEDFYLDLAELLGELMTFTEPSRLPCPLDVYDHHDLTKTFKT
LLPEVKRALHTVLSPRAVNLPLHLRDGIWQADVHDSELLQSATFVLAVAA
NMPVDQIQRQFIQQSKISSPEKIRNMVSVQIPGIPLRALMVAPRQLPYHS
GFSYFELDKSGQAWTEMAAAGAVALHVSGSFPDLNMQLWAIRG
>ECs4093 hypothetical protein
MESLSERTSTGYQQIHDGIIHLVDSARTETVRSVNALMTATYWEIGRRIV
EFEQGGEARAAYGAQLIKRLSKDLSLRYKRGFSAKNLRQMRLFYLFFQHV
EIRQTVSGELTPLPWSTYVRLLSVKNADTRSFYEKETLRCGWSVRQLERQ
IATQFYERTLLSHDKSAMLQQHAPAETHILPQQAIRDPFVLEFLELKDEY
SESDFEEALINHLMDFMLELGDDFAFVGRQRRLRIDDNWFRVDLLFFHRR
LRCLLIVDLKVGKFSYSDAGQMNMYLNYAKEHWTLPDENPPIGLVLCAEK
GVGEAHYALAGLPNTVLASEYKMQLPDEKRLADELVRTQAVLEEGYRLR
>ECs0221 hypothetical protein
MAMDLRDPNVWISHLLENLPEEKLASALKDDNPDWEYIDGEIVKLGSLAH
AQLDIPELQRRGLQLLASESKDFRLLAHLLRTLQHAGDPLLALHLLTLYV
EHYWTVAAPQNMAHKKRFASQIIKRFETVLKAFHKTLPQRSAILCWVSWR
NWRSAGSHITSRNWHRLPMIFLPCTSVRLIVRLLLRSPLRRPPVVHHKPP
SRLKAA
>ECs2413 hypothetical protein
MTLSFITRWRDELPETYTALSPTPLNNARLIWHNTELANTLSIPSSLFKN
GAGVWGGETLLPGMSPLAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGT
TMDWHLKGAGLTPYSRMGDGRAVLRSTIRESLASEAMHYLGIPTTRALSI
VTSDSPVYRETVEPGAMLMRVAPSHLRFGHFEHFYYRREPDKVRQLADFA
IRHYWSHLEDDEDKYRLWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSL
LGLTLDYGPFGFLNDYEPGFICNHSDHQGRYSFDNQPAVALWILQRLAQT
LSPFVAVDALNEALDSYQQVLLTHYGQRMRQKLGFMTEQKEDNALLNELF
SLMARERSDYTRTFRMLSLTEQHSAASPLRDEFIDRAAFDDWFARYRGRL
QQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTELHRLHEAL
RNPFSDRDDDYVSRPPDWGKRLEVSCSS
>ECs4300 hypothetical protein
MHYAVVVMLELLCCIHAYRTGQERYWIFIIFCFPVIGCVAYFVIVMLPET
GADRHGHTLLMRLQDKLNPERHLRKLTEELAIAETNQNHYALANELARLG
RYHEAVPHYQQALSGIFAHEAAMMLSLAQAQFAIQEFAACQQTLEDVMRY
NPDFQSADGHLLFARTLAAQEKYADAESEFEVLISYYPGPQARIYYAEML
AKMSRLREANEQYVAVVDTAKRSRPHYRKHHREWIKTANERLKQSVVQ
>ECs3378 hypothetical protein
MNTEATHDQNEALTTGARLRNAREQLGLSQQAVAERLCLKVSTVRDIEED
KAPADLASTFLRGYIRSYARLVHIPEEELLPGLEKQAPLRAAKVAPMQSF
SLGKRRKKRDGWLMTFTWLVLFVVIGLSGAWWWQDHKAQQEEITTMADQS
SAELSSNSEQGQSVPLNTSTTTDPATTSTPPASVDTTATNTQTPVVTAPA
PAVDPQQNAVVSPSQANVDTAATPAPTAATTPDGAAPLPTDQAGVTTPVA
DPNALVMNFTADCWLEVTDATGKKLFSGMQRKDGNLNLTGQAPYKLKIGA
PAAVQIQYQGKPVDLSRFIRTNQVARLTLNAEQSPAQ
>ECs3933 hypothetical protein
MDDIVNSVPSWMFTAIIAVCILFIIGIIFARLYRRASAEQAFVRTGLGGQ
KVVMSGGAIVMPIFHEIIPINMNTLKLEVSRSTIDSLITKDRMRVDVVVA
FFVRVKPSVEGIATAAQTLGQRTLSPEDLRMLVEDKFVDALRATAAQMTM
HELQDTRENFVQGVQNTVAEDLSKNGLELESVSLTNFNQTSKEHFNPNNA
FDAEGLTKLTQETERRRRERNEVEQDVEVAVREKNRDALSRKLEIEQQEA
FMTLEQEQQVKTRTAEQNAKIAAFEAERRREAEQTRILAERQIQETEIDR
EQAVRSRKVEAEREVRIKEIEQQQVTEIANQTKSIAIAAKSEQQSQAEAR
ANLALAEAVSAQQNVETTRQTAEADRAKQVALIAAAQDAETKAVELTVRA
KAEKEAAEMQAAAIVELAEATRKKGLAEAEAQRALNDAINVLSDEQTSLK
FKLALLQALPAVIEKSVEPMKSIDGIKIIQVDGLNRGGTAGDANTGNVGG
GNLAEQALSAALSYRTQAPLIDSLLNEIGVSGGSLAALTSLLTPTTPVAE
NVE
>ECs2850 putative galactokinase
MKLLILGNHTCGNRGDSAILRGLLDAINILNPHTEVDVMSRYPVSSSWLL
NRPVMGDPLFLQMKQHNSAAGVVGRVKKVLRRRYQHQVLLSRVTDTGKLR
NIAIAQGFTDFVRLLSGYDAIIQVGGSFFVDLYGVPQFEHALCTFMAKKP
LFMIGHSVGPFQDEQFNQLANYVFGHCDALILRESVSLDLMKRSNITTAK
VEHGVDTAWLVDHHTEDFTASYAVQHWLDVAAQQKTVAITLRELAPFDKR
LGTTQQVYEKAFAGVVNRILDEGYQVIALSTCTGIDSYNKDDRMVALNLR
QHISDPARYHVVMDELNDLEMGKILGACELTVGTRLHSAIISMNFATPAI
AINYEHKSAGVMQQLGLPEMAIDIRHLLDGSLQAMVADTLGQLPALNARL
SEAVSRERQTGMQMVQSVLERIGEVK
>ECs4308 hypothetical protein
MSAVKKQRIDLRLTDDDKSMIEEAAAISNQSVSQFMLNSASQRAAEVIEQ
HRRVILNEESWTRVMDALSNPPSPGEKLKRAAKRLQGM
>ECs1555 putative minor tail protein
MQNIHEESLNESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT
WQGRQYQVYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG
ATVVRRRVYARFLDAVNFVAGNPEADPEQELRDRWVVEQMSELTAMTASF
VLATPTETDGALFPGRIMLANTCMWDYRGDECGYHGPAVADEFDNPTTDI
RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ
>ECs2670 hypothetical protein
MCGRFAQSQTREDYLALLAEDIERDIPYDPEPIGRYNVAPGTKVLLLSER
DEHLHLDPVFWGYAPGWWDKPPLINARVETAATSRMFKPLWQHGRAICFA
DGWFEWKKEGDKKQPYFIYRADGQPVFMAAIGSTPFERGDEAEGFLIVTA
AADQGLVDIHDRRPLVLSPEAARKWMRQEISGKEASEIAASGCVPANQFS
WHPVSRAVGNVKNQGAELIQPV
>ECs4986 hypothetical protein
MADIAVVWDQGCGSLQLNGADLLTDNSLLTAVIISLFTDRRALDSDEIPD
GTRDRRGWWGDSFRERPVGSRLWLLSREKTLSSVVSRAQAYADEALAWLH
KSGAATSVVCHAMRVGHARLSLSVKITLPDGSRHPMIFYADMKGE
>ECs1804 minor tail protein
MAEIKTLHLVPREGMQVSEKPSVVRVRFGDGYEQRRPTGLNPQLKTFQAV
FRVTDESTRRWLDEFLSWHGGYRAFLWRPPKHNRTVRVVCREWSVTDNAR
YSDFSCTIEQVVN
>ECs2161 putative host specificity protein
MGKGGGKGHTPVEAKDNLKSTQMMSVIDAIGEGPIEGPVKGLQSILVNKT
PLTDTDGNPVIHGVTAVWRAGEQEQTPPEGFESSGAETALGVEVTKAKPV
TRTITSANIDRLRVTFGVQSLLETTSKGDRNHSSVRLLIQLQRNGNWVTE
KDVTINGKTTSQFLASVILDNLPPRPFNIRMVRETADSTTDQLQNRTLWS
SYTEIIDVKQCYPNTAIVGLQVDAEQFGGQQMTVNYHIRGRIIQVPSNYD
PEKRTYSGIWDGSLKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWA
LYAIAQYCDQMVPDGFGGTEPRMTFNAYLSQQRKAWDVLSDFCSAMRCMP
VWNGQTLTFVQDSPSDVVWPYTNSDVVVDDNGVGFRYSFSALKDRHTAVE
VNYTDPQNGWQTSTELVEDPEAILRYGRNLLKMDAFGCTSRGQAHRAGLW
VIKTGLLETQTVDFTLGSQGLRHTPGDIIEICDNDYAGTMTGGRILSIDA
ASRTLTLDREVTLPETGAATVNLINGSGKPVSVAITAHPAPDRIQVSTLP
DGVETYGVWGLSLPSLRRRLFRCVSIRENTDGTFAITAVQHVPEKEAIVD
NGAHFDGDQSGTLNSVIPPAVQHLTVEVSAADSQYLAQAKWDTPRVVKGV
RFSLRLTSGSGQDSRLVTTAITADTEHRFSGLPLGEYTLTVRAINSYGQQ
GEPAITTFRINAPAKPATIELTPGYFQITAVPVLAVYDPTVQFEFWFSEK
RITNTAQVEKSARYLGSGSQWTVQGSRIKPGTDFWFYVRSVNLVGKSAFV
EVSGQPSNDGEGYLEFFREKIGKLHLAQGLWELIDNSQLADEMAEMKTTI
TETRNEITQTVSKTLENQSATIQQIQRVQKDTNDDLAALYMLKVQKTKDG
IPYVAGIGAGIEDTDGQPLSNILLLADRIAMINPESGNSTPLFVAQGNQL
FMNDVFLKRLFAVSITSSGNPPTFSLTPDGRLTAKNADISGSVNANSGTL
NNVTINENCQIKGKLSANQIEGDIVKTVSKSFPRTSTYASGTITVRISDD
QKFDRQVMIPPVLFRGGKHENFNSNNQQSYWYSTCRLRVTRNGQEIFNQS
TTDAQGVFSSVIDMPAGQGTLTLTFTVSSSGANNWTPTTSISDLLVVVMK
KSTAGISIS
>ECs0455 putative glycoprotein
MNFLAHLHLAHLAESSLSGNLLADFVRGNPEESFPPDVVAGIHMHRRIDV
LTDNLPEVREAREWFRSETRRVAPITLDVMWDHFLSRHWSQLSPDFPLQE
FVCYAREQVMTILPDSPPRFINLNNYLWSEQWLVRYRDMDFIQNVLNGMA
SRRPRLDALRDSWYDLDAHYAALETRFWQFYPRMMAQASRKAL
>ECs5309 hypothetical protein
MEAKECKVQDILTENKKFIIPSYQRPYSWTVDNAEQLIDDIYKSSQSEEN
GYFIGSMICINKGQNQYEVVDGQQRLTTLSIIVSELKKSSRFRG
>ECs2724 putative minor tail protein
MKTFRWKVKPDMEVNSQPSVREVRFGDGYSQRMAAGLNADLKTYRVTLSV
TREEARHLEAFLAEHGGWKAFLWKPPYAYRQIKVTCAGWSARVGMLRVEF
SAEFKQVVN
>ECs3109 hypothetical protein
MQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLT
IPDSSATQRVSASPATTGARTMTVWTQDLIYAGDPVHYHGSRATEGTLSW
RQAMAQAGKGERYDQILAFAYPDNSLSRWGAPRTTCQLLPKAKAWLAKKM
PQWRRILQGETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRL
DLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD
>ECs1692 hypothetical protein
MGIIAWIIFGLIAGIIAKLIMPGRDGGGFFLTCILGIVGAVVGGWLATMF
GIGGSISGFNLHSFLVAVVGAILVLGVFRLLRRE
>ECs5089 hypothetical protein
MPLSPYLSFAGNCSDAIAYYQRTLGAELLYKISFGEMPKSAQDSAENCPS
GMQFPDTAIAHANVRIAGSDIMMSDAIPSGKASYSGFTLVLDSQQVEEGK
RWFDNLAANGKIEMAWQETFWAHGFGKVTDKFGVPWMINVVKQQPTQ
>ECs2556 hypothetical protein
MANWLNQLQSLLGQSSSSTSSSADQGLGKLLVPGALGGLAGLLVANKSAR
KLLTKYGTNALLIGGGAVAGTVLWNKYKDKIRAAHQDEPQFGSQSTPLDE
RTERLILALVFAAKSDGHIDAKERAAIDQQLREAGVEEQGRVLIEQAIEQ
PLDPQRLATGVRNEEEALEIYFLSCAAIDIDHFMERSYLNALGDALKIPQ
DVREGIERDLEQQKRTLAE
>ECs3599 tRNA pseudouridine synthase D
MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILK
NGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPD
LSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLID
ICVKGVPNYFGAQRFGIGGSNLQGALRWAQTNTPVRDRNKRSFWLSAARS
ALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELVELQRRVND
KELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRA
MLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE
>ECs1568 hypothetical protein
MLPTSQLRPTGTFCSYSAETSADIKSEITPIQIEEARASGRLYIKDCDIE
YLPQLPNEITSVTIENCNNLTTLTGLPVNTQNLSVINCEKLQITDMPSTV
KNLHIELTDSPFIHFISEGIECLTVCHCYISGVPESVRYLEIKGSATDSI
KNVPNGLSSLSINSYNPENQARIDNLISPSLKTLSLTGCSNIILPEKLPE
SVTSVTIHAEQKTTWNIGVEGMPDGLDLDLQNVLLSPDVVKAKNITFQGN
ALDVALHFREGDIVYGLSSPREKLVNSIKLVNDFSKKDIITQNTLTNAVW
DPRTPRKYKQDPLIKRALNEHERGIKFKQHLKNHNNYNVTMADLSVYNRD
KLWAKTSKAGLEFQTLTRNKTVIFCADELVNSLKLIANKSEGYGQSITAS
ELRWIYRNKDNNQIMKNIKFYLHGKEIPAERILDTPEWKDYRPKYSGSTY
KYS
>ECs3112 hypothetical protein
MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRW
YQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQ
NWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLM
VWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIY
RLNFLAR
>ECs2578 hypothetical protein
MIYIGLPQWSHPKWVRLGITSLEEYARHFNCVEGNTTLYALPKPEVVLRW
REQTTDDFRFCFKFPATISHQAALRHCDDLVTEFLTRMSPLAPRIGQYWL
QLPATFGPRELPALWHFLDSLPGEFNYGVEVRHPQFFAKGEEEQTLNRGL
HQRGVNQVILDSRPVHAARPHSEAIRDAQRKKPKVPVHAVLTATNPLIRF
IGSDDMTQNRELFQVWLQKLAQWHQTTTPYLFLHTPDIAQAPELVHTLWE
DLRKTLPEIGAVPAIPQQSSLF
>ECs3198 putative lipoprotein
MGTIVLVALGVIVLPGLLDGQKKHYQDEFAAIPLVPKAGDRDEPDMMPAA
TQALPTQPPEGAAEEVRAGDAAAPSLDPATIAANNTEFEPEPAPVVPPKP
KPVEPPKPKVEAPPAPKPEPKPVVEEKAAPTGKAYVVQLGALKNADKVNE
IVGKLRGAGYRVYTSPSTPVQGKITRILVGPDASKDKLKGSLGELKQLSG
LSGVVMGYTPN
>ECs0999 hypothetical protein
MSLPHLSLADARNLHLAAQGLLNKPRRRASLEDIPATISRMSLLQIDTIN
IVARSPYLVLFSRLGNYPAQWLDESLARGELMEYWAHEACFMPRSDFRLI
RHRMLAPEKMGWKYKDAWMQEHEAEIAQLIQHIHDKGPVRSADFEYPRKG
ASGWWEWKPHKRHLEGLFTAGKVMVIERRNFQRVYDLTHRVMPDWDDERD
LVSQTEAEIIMLDNSARSQGIFREQWLADYYRLKRPALAAWREARAEQRQ
IIAVHVEKLGNLWLHADLLPLLERALAGKLTATHSAVLSPFDPVVWDRKR
AEQLFDFSYRLECYTPAPKRQYGYFVLPLLHRGQLVGRMDAKMHRQTGIL
EVISLWLQEGIKPTTTLQKGLRQAITDFANWQQATRVTLGHCPQGLFTDC
RTGWEIDPVA
>ECs1645 minor tail protein
MQDIRQETLNECTRAEQSASVVLWEIDLTEVGGERYFFCNEQNEKGEPVT
WQGRQYQPYPIQGSGFELNGKGTSTRPTLTVSNLYGMVTGMAEDLQSLVG
GTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASF
VLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYHGPAVADEYDQPTSDI
TKDKCSKCLSGCKFRNNVGNFGGFLSINKLSQ
>ECs5103 hypothetical protein
MTRTLKPLILNTGALALTLILIYTGISAHDKLTWLMEVTPVIIVVPLLLA
TAKRYPLTPLLYTLIFFHAIILMVGGQYTYAKVPIGFEVQEWLGLSRNPY
DKLGHFFQGLVPALVAREILVRGMYVRGRKMVAFLVCCVALAISAMYELI
EWWAALAMGQGADDFLGTQGDQWDTQSDMFCALLGALTTVIFLARFHCRQ
LRRFGLITG
>ECs1040 hypothetical protein
MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENE
PVLVNGWIDKHMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIV
WQRLAGLAQRRGKTLSETIVQLIEDAENKEKYANKMSSLKQDLQALLGKE
>ECs2695 hypothetical protein
MSFMVSEEVTVKEGGPRMIVTGYSSGMVECRWYDGYGVKREAFHETELVP
GEGSRSAEEV
>ECs4051 hypothetical protein
MITAPVEALGFELVGIEFIRGRTSTLRIYIDSEDGINVDDCADVSHQVSA
VLDVEDPITVAYNLEVSSPGLDRPLFTAEHYARFVGEEVTLVLRMAVQNR
RKWQGVIKAVDGEMITVTVEGKDEVFALSNIQKANLVPHF
>ECs3390 hypothetical protein
MGLKWTDSREIGEALYDAYPDLDPKTVRFTDMHQWICDLEDFDDDPQASN
EKILEAILLVWLDEAE
>ECs0230 hypothetical protein
MILDRERTTGSLFERMEASSARNRQGGSIHSLRQSIRQNLRNILNTRSGS
CRGAPELGIDEPEGAENFRESMSRAIEQCIERYEPRISHAEVQAVVSSAS
SPLDMTFHITAWVTFNETHEVLEFDMAPNGSQHYRVD
>ECs5255 hypothetical protein
MMSHAPMGMAGNVTFVHNGKAYVTGGVNQNIFNGYFEDLNEAGKDSAAID
KINAHYFDKKAEDYFFNKFLLSFDPSTQQWIFTCRLEQQKICEHDCNYFT
AWDINGFQ
>ECs4745 hypothetical protein
MAHRLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTV
NPQTERVKMYWQKANGEAWGTLHALLADMNSQGQVQMAMNGGIYDESYAP
LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK
EIQFAVQSGPMLMENSVINPRIHPNVASRKIRNGVGINKHGNAVFLLSQQ
ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV
ERKG
>ECs4987 hypothetical protein
MMPYQPLPLAQLITQTQQDISQRLPGSQPGVNETTLNAIAYALAGLSAQE
HEHLAWISRQIIPTEADEAELLKHCAFWGVIRKPASRADGPVQLMLTTDA
GITEGVLLQRSDGVVYRITGSATGKAGTLNVNVEAESAGRAGNTPTGTRL
SFITPQAGINQTATVTGTGLTGGADVETVPELLSRLVFRVQNPPSGGTQY
DFERWAREVPGVTRAWCKPEWPEAGSVGVTFVQDNNPDIFPGEGDVKRVA
DYIRSHDDPATGQPVGQPLGPTISVFKLTNKPVAFEIRIVPKTPENQAAV
KQALTDLLYNESRPGGLVLPSSFWRAVAGVKGLEDFEVRSPLKSVMAGDT
ELLTVGEITWL
>ECs4703 acetolactate synthase II small subunit
MMQHQVNVSARFNPETLERVLRVVRHRGFHVCSMNMAAASDAQNINIELT
VASPRSVDLLFSQLNKLVDVAHVAICQSTTTSQQIRA
>ECs5160 hypothetical protein
MHILDSLLAFSAYFFIGVAMVIIFLFIYSKITPHNEWQLIKNNNTAASLA
FSGTLLGYVIPLSSAAINAVSIPDYFAWGGIALVIQLLVFAGVRLYMPAL
SEKIINHNTAAGMFMGTAALAGGIFNAACMTW
>ECs4846 hypothetical protein
MTIQQWLFSFKGRIGRRDFWIWIGLWFAGMLVLFSLAGKNLLDIQTAAFC
LVCLLWPTAAVTVKRLHDRGRSGAWAFLMIVAWMLLAGNWAILPGVWQWA
VGRFVPTLILVMMLIDLGAFVGTQGENKYGKDTQDVKYKADNKSSN
>ECs4230 DamX
MDEFKPEDELKPDPSDRRTGRSRQSSERSERTERGEPQINFDDIELDDTD
DRRPTRAQKERNEEPEIEEEIDESEDETVDEERVERRPRKRKKAASKPAS
RQYMMMGVGILVLLLLIIGIGSALKAPSTTSSDQTASGEKSIDLAGNATD
QANGVQPAPGTTSAENTQQDVSLPPISSTPTQGQTPAATDGQQRVEVQGD
LNNALTQPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASRDTAKTQTAER
PATTRPARQQAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAA
TSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTL
QLSSSSNYDNLNGWAKKENLKNYVVYETTRNGQPWYVLVSGVYASKEEAK
KAVSTLPADVQAKNPWAKPLRQVQADLK
>ECs5377 hypothetical protein
MFFRNRKNIAMTIKEKFSQKYPHASFCTFGDSAALADHLATLIATGVKTA
SCGSLAGCIEDNAFPMIGEYKIVENSRGEPVCVIRVIGLHLLRFSDVTAE
LARKEGEGDLSLEYWRNEHRRFFQAEGSYSPEMDVIFEEYALIDVV
>ECs1247 hypothetical protein
MLELLDERERNQQYIKRRDQENEEIALTVGKLRVELGAAENNLIDSECHV
AELEEALRDKQALLEASEKRIAELEAELVSQTYKLPHTQFEQIANLYEMQ
FDDGRTCAFHTDAQKAEQWLQACDGNRVQEYVKLERLQNALSGNSPVTPD
GWISCSERMPDTKTAVLVAVEFDRKGDWRMKWATYIPGHPDANDGWIIPG
ASWKPSHWMPLPEPPQEVN
>ECs1000 hypothetical protein
MDHRLLEIIACPVCNGKLWYNQEKQELICKLDNLAFPLRDGIPVLLETEA
RVLTADESKS
>ECs5310 hypothetical protein
MQKRVLPIDVYSDETDEPRLIVRKKEHDLYKYYILQDSKDYKPEKPSDTE
LVFISNAETIRDYLLRLSVDELKLLAKYILQNVYIVFVQTDDFASSFRLF
NVLNSRGLPLSNADLLKNALFESASTHNKKSEQIESAWSQIEDMVGVRRL
DKFLTLHKLSEKKDRDRVLQKGFEAFIENLQQQFDGDAIAMSLMLVNSAK
NYTKILENDFEHPSIRRKIASLSNLGVDEWIPPVMAFMNRMARTEDFNLD
DFSQFITAFEKVYMHGWLKKQIKSQREMVCYSALVAINNDMPFDSVINQI
NQHADNSGFIAALDEDLYEPRPNQVNLIKAILLRLDMEQQDESVIKTYTG
RITIEHILPQALVNEYWINRFQPQEHVYWLHKIGNLTLISGSKNSEAQHY
DFIKKKSIYEKLNSKSSFDLTKDVCNSSEWGLAELKMRHEKMKTQLKKLW
LV
>ECs1987 putative tail assembly protein
MATTNAFSLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQVPG
FRRQMNEGWYQIRIAGDDTAPEAVYARLHEQLGEGTVIHIVPRLAGAGKG
GLQIVLGAAAIVGSFFTAGASMAAWGAALSAGGFSATTMLFSLGASMILG
GVAQMLAPKPKTPEYRATDNGKQNTYFSSLDNMIAQGNPMPVPYGEMLVG
SRRISQDISTRDEGGGGKVVVIGRQA
>ECs0061 hypothetical protein
MKVSVPGMPATLLNMSNNDIYKMVSGDKMDMKMNIFQRLWETLRHLFWSD
KQTEAYKLLFNFVNKKAGNINASKYFTGAVNENEKEKFIHSLELFNELKT
CAKNPDEMVAKGNMSWVAQTFGDIELSVTFFIENKEICTQTLQLHKGPGN
LGVDLREAYLPGVDMRDCYLGLKTMKGHNKVLYLEPGWNANLDGATLDGA
TLDGATVDGATHLYDEVIIINKITPKKIDTEEVATKQSTAEQITDNAIIE
>ECs4922 hypothetical protein
MLQNPIHLRLERLESWQHVTFMACLCERMYPNYAMFCQQTGFGDGQIYRR
ILDLIWETLTVKDAKVNFDSQLEKFEEAIPSADDFDLYGVYPAIDACVAL
SELVHSRLSGETLEHAVEVSKTSITTVAMLEMTQAGREMSDEELKENPAV
EQEWDIQWEIFRLLAECEERDIELIKGLRADLREADESNIGIIFQQ
>ECs3652 hypothetical protein
MTTHDRVRLQLQALEALLREHQHWRNDEPQPHQFNSTQPFFMDTMEPLEW
LQWVLIPRMHDLLNNNQPLPGAFAVAPYYEMALATDHPQRALILAELEKL
DALFADDAS
>ECs3178 hypothetical protein
MIQESTMEMTNAQRLILSNQYKMMTMLDPANAERYRRLQTIIERGYGLQM
RELDREFGELKEETCRTIIDIMEMYHALHVSWSNLQDQQSIDERRVTFLG
FDAATEARYLGYVRFMVNVEGRYTHFDAGTHGFNAQTPMWEKYQRMLNVW
HACPRQYHLSANEINQIINA
>ECs4597 hypothetical protein
MFKTKWFAREARSHAITDEELCRAILETEQGKADALGGGVFKKRLHQNRE
RAIILAKGVSNWFYTFLYAKQDMSNINSQELAGFREIAKHYAFLTKAQLT
AMINTKELTEICYDCKN
>ECs5304 hypothetical protein
MKATEARLLDFLKRSQQFVIPIYQRTYSWTEQQCRQLWDDIIRAGKRDDI
SAHFIGSVVYIEQGLYQVSGISPLLVIDGQQRLTTAMLLIEALSRHLGED
EVFDGFSAMKLRNYYLLNPYESGEKGFKLLLTETDKDSLLALIKQRPMPE
NYSHRIMENFTFFDEQIAKLGDDLIPLCRGLAKLLIVDVALNRGQDNPQL
IFESMNSTGKALSQADLVRNFILMGLEPEHQTRLYEDHWRPMEVACGQQG
YSEYFDSFMRHYLTVKTGRSLGQMKSMRHLNSMPAARVLLKKA
>ECs3375 hypothetical protein
MEIYENENDQVEAIKRFFAENGKALAVGVILGVGALIGWRYWNSHQVDSA
RSASLAYQNAVTAVSEGKPDSIPAAEKFAAENKNTYGALASLELAQQFVD
KNELEKAAAQLQQGLADTSDENLKAVINLRLARVQVQLKQADAALKTLDT
IKGEGWAAIVADLRGEALLSKGDKQGARSAWEAGVKSDVTPALSEMMQMK
INNLSI
>ECs4199 hypothetical protein
MQDLSLEARLAELESRLAFQEITIEELNVTVTAHEMEMAKLRDHLRLLTE
KLKASQPSNIASQAEETPPPHY
>ECs4784 putative GTP-binding protein
MIRKSATGVIVALAVIWGGGTWYTGTQIQPGVEKFIKDFNDAKKKGEHAY
DMTLSYQNFDKGFFNSRFQMQMTFDNGAPDLNIKPGQKVVFDVDVEHGPL
PITMLMHGNVIPALAAAKVNLVNNELTQPLFIAAKNKSPVEATLRFAFGG
SFSTTLDVAPAEYGKFSFGEGQFTFNGDSSSLSNLDIEGKVEDIVLQLSP
MNKVTAKSFTIDSLARLEEKKFPVGESESKFNQINIINHGEDVAQIDAFV
AKTRLDRVKDKDYINVNLTYELDKLTKGNQQLGSGEWSLIAESIDPSAVR
QFIIQYNIAMQKQLAAHPELANDEVALQEVNAALFKEYLPLLQQSEPTIK
QPVRWKNALGELNANLDISIADPAKSSSSTNKDIKSLNFDVKLPLNVVTE
TAKQLNLSEGMDAEKAQKQADKQISGMMTLGQMFQLITIDNNTASLQLRY
TPGKVVFNGQEMSEEEFMSRAGRFVH
>ECs1559 putative tail assembly protein
MATTNAFSLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQMPG
FRRQMNEGWYQTRIPTASQNRSPAPVFVVMSGEYLLDKKEEIIQ
>ECs4150 hypothetical protein
MFDVLMYLFETYIHTEAELRVDQDKLEQDLTDAGFDREDIYNALLWLEKL
ADYQEGLAEPMQLASDPLSMRIYTPEECERLDASCRGFLLFLEQIQVLNL
ETREMVIERVLALDTAEFDLEDLKWVILMVLFNIPGCENAYQQMEELLFE
VNEGMLH
>ECs3409 hypothetical protein
MNSLRYFDFGAARPVLLLIARIAVVLIFIIFGFPKMMGFDGTVQYMASLG
APMPMLAAIIAVVMEVPAAILIVLGFFTRPLAVLFIFYTLGTAVIGHHYW
DMTGDAVGPNMINFWKNVSIAGAFLLLAITGPGAISLDRR
>ECs4655 hypothetical protein
MQYITFIACFFSHENMKYSTFHDINLDMCEIKNCNFNNSEMNFISCVGTN
FSGSTFNNVKTTTAQLIKTPTKWTNNTLKYWFSSCNKRNIIFTFNTISDR
NMKLKGIKDILLSLVDQKVNIYSVRQELLNFLNNDLYKNDGEILSYKESI
MLFCAE
>ECs1674 hypothetical protein
MFCVIYRSSKRDQTYLYVEKKDDFSRVPEELMKGFGQPQLAMILPLDGRK
KLVNADIEKVKQALTEQGYYLQLPPPPEDLLKQHLSVMGQKTDDTNK
>ECs4035 hypothetical protein
METLTAISRWLAKQHVVTWCVQQEGELWCANAFYLFDVQKVAFYILTEEK
TRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKA
YNRRFPVARMLSAPVWEIRPDEIKFTDNTLGFGKKMIWLRGSGTEQA
>ECs3526 hypothetical protein
MGLFNFVKDAGEKLWDAVTGQHDKDDQAKKVQEHLSKTGIPDADKVNIQI
ADGKATVTGDGLSQEAKEKILVAVGNISGIASVDDQVKTATPATASQFYT
VKSGDTLSAISKQVYGNANLYNKIFEANKPMLKSPDKIYPGQVLRIPEE
>ECs3110 hypothetical protein
MNWRRIVWLLALVTLPTLAEETPLQLALRGAQHDQLYQLSSSGVTKVSAL
PDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESIT
RDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLSTLKPETSV
TVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWF
ADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQ
VASGQCVEVELFAR
>ECs1445 hypothetical protein
MKYQLTALEARVIGCLLEKQVTTPEQYPLSVNGVVTACNQKTNREPVMNL
SESEVQEQLDNLVKRHYLRTVSGFGNRVTKYEQRFCNSEFGDLKLSAAEV
ALITTLLLRGAQTPGELRSRAARMYEFSDMAEVELTLEQLANREDGPFVV
RLAREPGKRESRYMHLFSGEVEDQPAVTDMSNAVDGDLQARVEALEIEVA
ELKQRLDSLLAHLGD
>ECs0607 Vgr
MSTGLRFTLEVDGLPPDAFAVVSFHLTQSLSSLFSLDLSLVSQQFLSLEF
AQVLDKMAYLTIWQGDDVQRRVKGVVTWFELGENDKNQMLYSMKVHPPLW
RAGLRQNFRIFQNEDIKSILGTILQENGVTEWSPLFSEPHPSREFCVQYG
ETDYDFLCRMAAEEGIFFYEEHAYKSTDQSLVLCDTVRHLPESFEIPWNP
NTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQHQDY
QRTQYEVYDYPGRFKGAHGQNFARWQMEGWRNNAETARGMSRSPEIWPGR
RIVLTGHPQANLNREWQVVASELHGEQPQAVPGRRGAGTALENHFAVIPA
DRTWRPQPRLKPLVDGPQSAVVTGPEGEEIFCDEHGRVRVKFNWDRYNPA
DQDSSCWIRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRTY
HQENRTPGSLPGTKTQMTIRSKTYMGSGFNELKFDDATGREQVYIHAQKN
MDTEVLNDRTTTVKHDHRETVKNDQTVTIQEGNRLLTVEKGHKITGVLKG
SLSEDVFQDRGTIAGSVHVDAVNNGGEGNGIQAYTAIKEIMLAVEESKIA
LTPDGIQLQVGESTVIRLSKDGITIVGGSVFIN
>ECs0232 hypothetical protein
MTAHKNISGTYHLSVADILQVVYQVCFSPSVEINQDGVAALITTLDRRIS
DLLDEIIHFCEFQQSASHWQRVLH
>ECs4388 putative transport ATPase
MTAEFIIRLILAAIACGAIGMERQMRGKGAGLRTHVLIGMGSALFMIVSK
YGFADVLSLDHVGLDPSRIAAQVVTGVGFIGAGNILVRNQNIVGLTTAAD
IWVTAAIGMVIGSGMYELGIYGSVMTLLVLEVFHQLTFRLMNKNYHLQLT
LVNGNTVSMLDWFKQQKIKTDLVSLQENEDHEVVAIDIQLHATTSIEDLL
RLLKGMAGVKGVSIS
>ECs3893 hypothetical protein
MAVIQDIIAALWQHDFAALADPHIVSVVYFVMFATLFLENGLLPASFLPG
DSLLILAGALIAQGVMDFLPTIAILTAAASLGCWLSYIQGRWLGNTKTVK
GWLAQLPAKYHQRATCMFDRHGLLALLAGRFLAFVRTLLPTMAGISGLPN
RRFQFFNWLSGLLWVSVVTSFGYALSMIPFVKRHEDQVMTFLMILPIALL
TAGLLGTLFVVIKKKYCNA
>ECs0069 hypothetical protein
MQALLEHFITQSTVYSLMAVVLVAFLESLALVGLILPGTVLMAGLGALIG
SGELSFWHAWLAGIVGCLLGDWISFWLGWRFKKPLHRWSFLKKNKALLDK
TEHALHQHSMFTILVGRFVGPTRPLVPMVAGMLDLPVAKFITPNIIGCLL
WPPFYFLPGILAGAAIDIPAGMQSGEFKWLLLATAVFLWVGGWLCWRLWR
SGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLVRHPLMPVYIDILRK
VVGV
>ECs3113 hypothetical protein
MVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETIPVYQLRYNG
NNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIASDLLSGKKRWQASF
GLEERAAEKTPVRQRIVASARLLGFGYQRLMPSFAGVRFEMGNDGWHSFV
ALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQEN
DKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGA
HESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYF
FRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYIN
PQGVAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLA
QMEPGAAWQWLPIIWQPL
>ECs2501 hypothetical protein
MKGRTNTMNIQCKRVYDPAEQSDGYRVLVDRLWPRGIKKTDLALDEWDKE
ITPSTELRKAFHGEVVDFATFREQYLAELAQHEQEGKRLADIAKKQPLTL
LYSAKNTTQNHALVLADWLRSL
>ECs0619 hypothetical protein
MPLPDFHVSEPFTLGIELEMQVVNPPGYDLSQDSSMLIDAVKNKITAGEV
KHDITESMLELATDVCRDINQAAGQFSAMQKVVLQAAADHHLEICGGGTH
PFQKWQRQEVCDNERYQRTLENFGYLIQQATVFGQHVHVGCASGDDAIYL
LHGLSRFVPHFIALSAASPYMQGTDTRFASSRPNIFSAFPDNGPMPWVSN
WQQFEALFRCLSYTTMIDSIKDLHWDIRPSPHFGTVEVRVMDTPLTLSHA
VNMAGLIQATAHWLLTERPFKHKEKDYLLYKFNRFQACRYGLEGVITDPY
TGDRRPLTEDTLRLLEKIAPSAHKIGASSAIEALHRQVVSGLNEAQLMRD
FVADGGSLIGLVKKHCEIWAGD
>ECs2945 putative tail assembly protein
MATTNAFCLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQMPG
FRRQMNEGWYQIRIRGEDTAPEAVYARLHEPLGEGAVIHIVPRLAGAGKG
GLQIVLGAAAIVGSFFTAGATMALWGAALSAGGLTATTMLFSLGASMILG
GVAQMLAPKAKTPEYRATDNGKQNTYFSSLDNMIAQGNPMPVPYGEMLVG
SRRISQDISTRDEGGDGKVVVIGRG
>ECs1719 hypothetical protein
MRSLADFEFNKAPLCEGMILACEAIRRDFPSQDVYDELERLVSLAKEEIS
QLLPLEEQLEKLIALFYGDWGFKASRGVYRLSDALWLDQVLKNRQGSAVS
LGAVLLWVANRLDLPLLPVIFPTQLILRIECPDGEIWLINPFNGESLSEH
MLDVWLKGNISPSAELFYEDLDEADNIEVIRKLLDTLKASLMEENQMELA
LRTSEALLQFNPEDPYEIRDRGLIYAQLDCEHVALNDLSYFVEQCPEDPI
SEMIRAQINNIAHKHIVLH
>ECs1990 putative host specificity protein
MGKGGGRAHTPREAKDNLKSTQMMSVIDAIGEGPIEGPVKGLQSILVNKT
PLTDTDGNPVIHGVTAVWRAGEQEQTPPEGFESSGAETGLGVEVTKAKPV
TRTITSANIDRLRVTFGVQSLVETTSKGDRNPTSVRLLIQLERGGKWMTE
KDVTINGKTTSQFLASVILDNLPPRPFNIRMVRETADSTTDQLQNKTLWS
SYTEIIDVKQCYPNTAIVGLQVDAEQFGGQQMTVNYHIRGRIIQVPSNYD
PEKRTYSGIWDGSLKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWA
LYAIAQYCDQTVPDGFGGTEPRMTFNAYLSQQRKAWDVLSDFCSAMRCMP
VWNGQTLTFVQDRPSDVVWPYTNSDVVVDDNGVGFRYSFSALKDRHTAVE
VNYTDPQNGWQTSTELVEDPEAILRYGRNLLKMDAFGCTSRGQAHRAGLW
VIKTGLLETQTVDFTLGSQGLRHTPGDIIEICDNDYAGTMTGGRILSIDA
ASRTLTLDREVTLPETGAATVNLINGSGKPVSVAITAHPAPDRIQVSTLP
DGVETYGVWGLSLPSLRRRLFRCVSIRENTDGTFAITAVQHVPEKEAIVD
NGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGV
SFLLRLTVAADDGSERLVSTARTTETTYRFTQLAPGNYRLTVRAVNAWGQ
QGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSE
KRIADIRQVETTARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAF
VEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLTEIRTS
ITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQD
GRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQ
IFMNEVFLKYLTAPTITSGGNPPAFSLTPDGRLTAKNADISGSVNANSGA
LNNVTINQNCTIKGMLEATQVRGDFVKAVSKAFPKKVGTWGNTETPNGTV
TVTISDDHNFDRQIIIPPIIFNGIAYDDPGSGNNPGGTRYTGYGFEVRKN
GVLIASRETKGAIPGSYSAVIDMPSGGGSVTLEFKIFQKGNQGAGNITDC
TVIVTKKAASGISIR
>ECs1493 hypothetical protein
MKQKELWINQIKGLCICLVVIYHSVITFYPHMTTFQHPLSEVLSKCWIYF
NLYLAPFRMPVFFFISGYLIRRYIDSVPWGNCLDKRIWNIFWVLALWGVV
QWLALSALNQWLAPERDLSNASNAAYADSTGEFLHGMITASTSLWYLYAL
IVYFVVCKIFSRLALPLFALFVLLSVAVNFVPTPWWGMNSVIRNLPYYSL
GAWFGATIMTCVKEVPLRRHLLMASLLAALAVGAWLFTISLLLSLVSIVV
IMKLFYQYEQRFGMRSTSLLNVIGSNTIAIYTTHRILVEIFSLTLLAQMN
AARWSPQVELTLLLVYPFVSLFICTVAGLLVRKLSQRAFSDLLFSPPSLP
VAVSYSR
>ECs5382 hypothetical protein
MFDSLAKAGKYLGQAAKLMIGMPDYDNYVEHMRVNHPDQTPMTYEEFFRE
RQDARYGGKGGARCC
>ECs0231 hypothetical protein
MSLQEEELVSSHAGQPEQESSLLDQIMAQTRIQPGSEGYDVARQGVTAFI
ASILQSTASAEPVNKLAVDSMIADIDERISRQMDVIIHAPAFQQVESFWR
SLKTMVDRVDFRENIKVNVLHVTKQELLEDFEFAPEIIQSGFYKHVYSSG
FGQFGGEPIAAVLGAYEFKNTAPDMKLLQYVSAVGAMAHAPFLSSVSPEF
MGLNSWTELPNIKDLYAIFEGPAYTKWRALRDSEDSRYLGLTAPRFLLRQ
PYSPTDNPVKNFNYYEDVSQNHEDYLWGNTAWMLACNIADSFAKYRWCPN
IIGPQSGGAVKDLPVHLFETMGQIQAKIPTEVLVTDRREFELAEEGFITL
TMRKDSDNAAFFSANSVQKPKHFPGKDAETNYKLGTQLPYLFIINRLAHY
IKVLQREQLGSWKERSDLERELNTWIRQYVADQENPPADVRSRKPLRAAK
VEVMDVEGEPGWY
>ECs1036 hypothetical protein
MTIAALWLAGCSSGEINKNYYQLPVVQSGTQSTASQGNRLLWVEQVAVPD
YLAGNGVVYQTSDVKYVIANNNLWASPLDQQLRNTLVANLSTQLPGWVVA
SQPLGSAQDTLNVTVTEFNGRYDGKVIVSGEWLLNHQGQLIKRPFRLEGV
QTQDGYDEMVKVLAGVWSQEAASIAQEIKRLP
>ECs4970 hypothetical protein
MLDIPQIASRYIEHPASGITPNRAAQCLRGAERGDLIAQSDLAADIEEKD
THLFAELGKRRLAIQSVPWSIEPPPNASANEKKDAEMLDEYLHSADWFDA
MLFDATDAILKGYSCMEIEHGMLGKMHIIRAIRWRDSGHFCLNPDDLSEL
RLRDGSHAGVAFQPFGWVVHQSRSRTGYGGATGLVRTLIWPFIFKNYSVR
DLAEFLEVYGLPMKVGKYPSGATSEQKSALMRAVMDIGRRTGGIIPAGMS
LEFQVAANGQADPFETMISWGERSISKAILGGTLTTEAGDKGARSLGEVH
NEVRREIRDSDLRQLAATLNRDLVYPLYALNTAHAIDIRRLPRICFQTKE
PGDITKITSAVMQLSTGMDIPDPWVRDQTGIPQPAPGEAIFRVRQSGNEP
AQTDKEMPPEKQEKTEQTALSARLPEAKSSPRDELDDMGDAVPARRLQDA
IDPLLEPVIDAIRTRGLADALADLPALYREMDDSRLMTLLSDAMFAAEMK
GMLDGTGD
>ECs1592 putative head portal protein
MWPFRRKKEQRSMTLDEFMALAGTSNTGAGEYVSSGTAESLPAVMNAVTV
ISEAVATMPCYLYLVRNEKGKEAREWLDSHPVDHILNERPNAWQTPYQFK
RMMVRHCLLNGNAYAVIQWGRDGFPVALHPYPPQSVNVEQTGEHNWRYCI
TDAYTGNTHNYLPWEVLHLRYSTDDGFMGRSPVTICRESLGLGLAQQRHG
ASVMRDGMMAAGVITSGEWLDGVKGKQALAALERYKGARNAGKTPILEGG
MSYQQLGMSNQDAEWLASRRFTIEDIARMFNVSPIFLQEYSNSTYSNFSE
ASRAFLTMTMRPWLANFEQQIKNALLVASPVPGIRYQVEFDSADLLRATP
GERFATYERGIKSGVMCPNEAREREGLSPRDGGDEFSQAWKQEVKISEGE
KPE
>ECs5248 hypothetical protein
MIKDAIFSPCGKYRYSLSRVWDESKPYTLFIGLNPSYADAEKDDRTLSRC
ISFAKSWGYGGVYMANLFAFVHTQRHEMMKASDPIGKDNDSHLIRLVSGA
GLVVAAWGNEGRHLKRSTTVRQLLPESTMCFVLNATGEPKHPLYMKNDSV
LIPLG
>ECs1654 hypothetical protein
MQIKSVEDIFIHLLSDTYSAEKQLTKALSKLSRSAYSDKLTAAFQSHLDE
THGQIERIDQVVDSEDGLKLKRIKCAAMEGLIEEANEVIESTDKNEVRDA
ALIAAAQKVEHYEIASYGTLVTLAEQLGYKKAAKLLKETLEEEKATDVKL
TDLAFNNVNKKAQDNS
>ECs2948 putative minor tail protein
MKTFRWKVKPDMEVNSQPSVREVRFGDGYSQRMAAGLNADLKTYRVTLSV
TREEARHLEAFLAEHGGWKAFLWKPPYAYRQIKVTCAGWSARVGMLRVEF
SAEFKQVVN
>ECs5355 hypothetical protein
MKYKHLILSLSLIMLGPLAHAEEIGSVDTVFKMIGPDHKIVVEAFDDPDV
KNVTCYVSRAKTGGIKGGLGLAEDTSDAAISCQQVGPIELSDRIKNGKAQ
GEVVFKKRTSLVFKSLQVVRFYDAKRNALAYLAYSDKVVEGSPKNAISAV
PVMPWRQ
>ECs3968 hypothetical protein
MLRAFARLLLRICFSRRTLKIACLLLLVAGATIFIADRVMVNASKQLTWS
DVNVVPARNVGLLLGARPGNRYFTRRIDTAAELYHAGKVKWLLVSGDNGR
KNYDEASGMQQALIAKGVPAKVIFCDYAGFSTLDSVVRANKVFGENHITI
ISQEFHNQRAIWLAKQYGIDAIGFNAPDLEKGRGKIVRLREKLARVSAVI
DAKIFNRQPKYLGPSVIIGPFSEHGCPAQK
>ECs1860 putative oxidoreductase
MLSPIRLSPLPALRQDNDFLYDQGAPMEQRHITGKSHWYHETQSSTTEYD
VLPLVPEAAKVSDPFLLDVILEKETLAPFLSWLDPARVLAVELFPDQLTV
TRSQTFTAYERLSTALTVAQVCGVQRLCNYYSARLTPLPGPDSTRESNHR
LAQITQYARQLASSPSIIDNRSRQHLNDVGLTAWDCVIINQIIGFIGFQA
RTIATFQAYLGHPVRWLPGLEIQNYADASLFADESLRWRSSYEVEKLPEE
HTKSSTAELCQLAEILSLHPISLSLLERLLNSTRVNTQPDNQLAALLCAR
INGSPACFAACMDSSNEYKKISPLLRKGENEINQWADRHSVERATVQAIQ
WLTRAPDRFSAAQFSPLLEHEKSSTQIINLLVWSGLCGWINRLKIALGET
Y
>ECs3480 hypothetical protein
MRKRFTVPGKIAVEVAYALPKKQYLQRVTLQEGATVEEAIRASGLLELRT
DIDLTENKVGIYSRPAKLSDSVHDGDRVEIYRPLIADPKELRRQRAEKSA
NK
>ECs1116 putative minor tail protein
MQDIHEESLNESVKSEQSPRVVLREIDLTVQGGERYFFCNELNEKGEAVT
WQGRQYQVYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG
ATVVRRRVYARFLDAVNFVAGNPEADPEQELSDRWVVEQMSELTAMTASF
VLATPTETDGALFPGRIMLANTCMWTYRSDECGYTGGAVADEFDKPTTDI
RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ
>ECs3982 hypothetical protein
MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLR
SWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG
>ECs3927 hypothetical protein
MTPLVKDIIMSSTRMPALFLGHGSPMNVLEDNLYTRSWQTLGMTLPRPQA
IVVVSAHWFTRGTGVTAMETPPTIHDFGGFPQALYDTHYPAPGSPALAQR
LVELLAPIPVTLDKEAWGFDHGSWGVLIKMYPDADIPMVQLSIDSSKPAA
WHFEMGRKLAALRDEGIMLVASGNVVHNLRTVKWHGDSSPYPWAMSFNEY
VKANLTWQGPVEQHPLVNYLDHEGGALSNPTPEHYLPLLYVLGAWDGQEP
ITIPVDGIEMGSLSMLSVQIG
>ECs2935 hypothetical protein
MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQ
RCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLK
TDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAAR
QHVKKS
>ECs5381 hypothetical protein
MAFSNPFDDPQGVFYILRNAQGQFSLWPQQCALPAGWDIVCQPQSQESCQ
QWLEVHWRTLTPANFTQLQEAQ
>ECs3201 hypothetical protein
MDLIYFLIDFILHIDVHLAELVAEYGVWVYAILFLILFCETGLVVTPFLP
GDSLLFVAGALASLETNDLNVHMMVVLMLIAAIVGDAVNYTIGRLFGEKL
FSNPNSKIFRRSYLDKTHQFYEKHGGKTIILARFVPIVRTFAPFVAGMGH
MSYRHFAAYNVIGALLWVLLFTYAGYFFGTIPMVQDNLKLLIVGIIVVSI
LPGVIEIIRHKRAAARAAK
>ECs4205 hypothetical protein
MLIPWQDLSPETLENLIESFVLREGTDYGEHERTLEQKVADVKRQLQCGE
AVLVWSELHETVNIMPRSQFRE
>ECs3780 hypothetical protein
MPGYNEMNQYLNQQGTGLTPAEMHGLISGMICGGNDDSSWLPLLHDLTNE
GMAFGHELAQALRKMHSATSDALQDDGFLFQLYLPDGDDVSVFDRADALA
GWVNHFLLGLGVTQPKLDKVTGETGEAIDDLRNIAQLGYDEDEDQEELEM
SLEEIIEYVRVAALLCHDTFTHPQPTAPEVQKPTLH
>ECs2496 hypothetical protein
MTEMAKGSVTHQRLIALLSQEGADFRVVTHEAVGKCEAVSEIRGTALGQG
AKALVCKVKGNGVNQHVLAILAADQQADLSQLASHIGGLRASLASPAEVD
ELTGCVFGAIPPFSFHPKLKLVADPLLFERFDEIAFNAGMLDKSVILKTA
DYLRIAQPELVNFRRTA
>ECs0988 hypothetical protein
MTQTFIPGKDAALEDSIARFQQKLSDLGFQIEEASWLNPVPNVWSVHIRD
KECALCFTNGKGATKKAALASALGEYFERLSTNYFFADFWLGETIANGPF
VHYPNEKWFPLTENDDVPEGLLDERLRAFYDPENELTGSMLIDLQSGNED
RGICGLPFTRQSDNQTVYIPMNIIGNLYVSNGMSAGNTRNEARVQGLSEV
FERYVKNRIIAESISLPEIPADVLARYPAVVEAIETLEAEGFPIFAYDGS
LGGQYPVICVVLFNPANGTCFASFGAHPDFGVALERTVTELLQGRGLKDL
DVFTPPTFDDEEVAEHTNLETHFIDSSGLISWDLFKQDADYPFVDWNFSG
TTEEEFATLMAIFNKEDKEVYIADYEHLGVYACRIIVPGMSDIYPAEDLW
LANNSMGSHLRETILSLPGSEWEKEDYLNLIEQLDEEGFDDFTRVRELLG
LATGSDNGWYTLRIGELKAMLALAGGDLEQALVWTEWTMEFNSSVFSPER
ANYYRCLQTLLLLAQEEDRQPLQYLNAFVRMYGADAVEAASAAMSGEAAF
YGLQPVDSDLHAFAAHQSLLKAYEKLQRAKAAFWAK
>ECs3980 hypothetical protein
MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQS
RYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSR
R
>ECs2005 hypothetical protein
MMKRTLLIAVWAIGLMSDSAMALTLNEARSQGRVGETLNGYLVALQTDAE
TQALVKDINEARNHSYQQLAKQNNVSTKEIAKLAGQKLVARAKSGQYVQG
VNGKWLRK
>ECs0590 hypothetical protein
MATFSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEGQVKVDGAVETRKR
CKIVAGQTVSFAGHSVQVVA
>ECs4445 hypothetical protein
MDNKISTYSPAFSIVSWIALVGGIVTYLLGLWNAEMQLNEKGYYFAVLVL
GLFSAASYQKTVRDKYEGIPTTSIYYMTCLTVFIISVALLMVGLWNATLL
LSEKGFYGLAFFLSLFGAVAVQKNIRDAGINPPKETQVTQEEYSE
>ECs2493 hypothetical protein
MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGE
SVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGS
GQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKT
HRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN
SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV
MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE
VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW
ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA
MQHIRDQDDIYPVFRELFHKQNATAKD
>ECs0960 putative surface protein
MFSGLLIILVPLIVGYLIPLRQQAALKVINQLLSWMVYLILFFMGISLAF
LDNLASNLLAILHYSAVSITVILLCNIAALMWLERGLPWRNHHQQEKLPS
RIAMALESLKLCGVVVIGFAIGLSGLAFLQHATEASEYTLILLLFLVGIQ
LRNNGMTLKQIVLNRRGMIVAVVVVASSLIGGLINAFILDLPINTALAMA
SGFGWYSLSGILLTESFGPVIGSAAFFNDLARELIAIMLIPGLIRRSRST
ALGLCGATSMDFTLPVLQRTGGLDMVPAAIVHGFILSLLVPILIAFFSA
>ECs0025 hypothetical protein
MTDGISTSPHCLYKSNIVDDVIINKTRQNELVKVFCEYKTEFLILFDDFF
RSQDLPKPSPVLHHFFQYTHLRDAHFYRCKLIEHTVQFSFFKHKGITLLR
LDVFDDRTSECLSEEIKIYQECHEKFIKFLKANFNQEIYPELYTPEIFYE
ACRNLQSFYDHQETSQNAKYSAIVKKKSYFNKEIRNLIKKNIYPELYNEQ
CNKIPASSTDDNQKITWQNFKTSNAAYSQLCEKLSLLKSSPSRLIEKSAY
CSNENMITDKFDVVFSYCGDNVKEFILLLPYNKSLEMHELNEQNIQYLTA
LNINIHKLLLSNITIEKSNLSYGYYFGCVLSNISCFESDLSNTIFSNGEI
NNLFIKKSNIFGTSFTNTMIKNLRCEDIMPGRWTTQLVNKHLGYRYTGVF
KTLASIDDKPSRFEILIPLVQTLVRDNVKLNNDVYKELKKFMHDYDKTSP
EMRKYLQSINESMFLMKKISHQD
>ECs4584 hypothetical protein
MIYFLTDLIKFYCLMKLYEFKNIKIDLVLTEDIIPEDKLQEIIQSDDIIK
LARKKTYEHLLRARRKSKELKTESRKKIARKMILMRERIRKNNKIKLDKE
VNQSIKWVKDIQAIELVLMQDIMNKIHLSLTNALHSLDTSSRINWDDLLN
EVVRETLSNNNIVGAIKITKNPDIKLDPGEANNIQLINDANTPHNKIIIE
NEYIRITLDPLEQISILLNSFKDNYLSIIQE
>ECs0234 hypothetical protein
MPTPCYISITGQTQGNITAGAFTADSVGNIYVQGHEDEMLVQEFLHNVTV
PTDPQSGQPAGQRAHKPFIFTVALNKAVPLMYNALASGEMLPTTELHWWR
TSVEGKQEHYFTTRLTDSTIVDMKLHMPHCQDPAKREFTQLLEVSLAYRK
IEWEHVKSGTSGADDWRAPLEA
>ECs1647 tail assembly protein
MAATHTLPLASPGMARICLYGDLQRFGRRIDLRVKTGAEAIRALATQLPV
FRQKLSDGWYQVRITGRDVSTSGLTAQLHETLPDGAVIHIVPRVAGAKSG
GVFQIVLGAAAIAGSFFTAGATLAAWGAAIGAGGMTGILFSLGASMVLGG
VAQMLAPKARTPRTQTTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGS
RVVSQEISTADEGDGGQVVVIGR
>ECs1795 putative portal protein
MWNLLRRTRKNQKSGRDVREVGWRSLFQAVAEPFAGAWQQGVKADPETVL
SFHAVFSCISLISQDIAKMRLRLMQTDVQGIRREKRQGDTARLCRRPNAQ
QNRIQFFELWLNSKLRHGNTVVLKIRTPRGQIKELRILDWNRVEPLVADD
GEVFYRITPDRNCGITESVTVPAREVIHDRFNCFFHPLVGLPPVYAAGLA
AMQGHHIQANSTYFFRNGGRPSGVIEVPGSITEENAKKLKGNWDSGYTGE
NAGKTAILSNGAKYSPTTFSPVDAQTVEQLKMTAEIVCSVFRVPAYKIGV
GHPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETGENESTEFDVTT
LLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYS
LEALSRRDAREDPFASAGKTVSSQLPDGASDGNKAISETEHDAVKAMFRG
DTEKMTERELSIIRALGEEFSTVLADLQRTFEGKMASQAQAFEEKLTSLS
AVLQKHVTVDEVRPVLQAMVDDAVGAIPVPRDGRDYDPDVLQQAVNDAVA
NIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPEVLQKAVN
DAVANIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPEVLQ
KAVNDAVANIPQPADGKSLTPDDVRPMLEQMVKEAVSHIPVPRDGRDYDP
DVLQKAVLDAVSALPAPQDGRDATALEILPAIDDQKSFPRGTYATHQGGL
WRAYEKTHGMRGWECLVDGVADIDVSMTGERLFSVVVRQSSGQRTEKTFS
LPVMLYRGVFRAGETYHPGDTVTWGGSLWHCNSMTEDKPGEAHSSAWTLA
AKRGRDAGG
>ECs3182 putative S-transferase
MQGNICAMSAITESKPTRRWAMPDTLVIIFFVAILTSLATWVVPVGMFDS
QEVQYQVDGQTKTRKVVDPHSFRLLTNEAGEPEYHRVQLFTTGDERPGLM
NFPFEGLTSGSKYGTAVGIIMFMLVIGGAFGIVMRTGTIDNGILALIRHT
RGNEILFIPALFILFSLGGAVFGMGEEAVAFAIIIAPLMVRLGYDSITTV
LVTYIATQIGFASSWMNPFCVVVAQGIAGVPVLSGSGLRIVVWVIATLIG
LIFTMVYASRVKKNPLLSRVHESDRFFREKQADVEQRPFTFGDWLVLIVL
TAVMVWVIWGVIVNAWFIPEIASQFFTMGLVIGIIGVVFRLNGMTVNTMA
SSFTEGARMMIAPALLVGFAKGILLLVGNGEAGDASVLNTILNSIANAIS
GLDNAVAAWFMLLFQAVFNFFVTSGSGQAALTMPLLAPLGDLVGVNRQVT
VLAFQFGDGFSHIIYPTSASLMATLGVCRVDFRNWLKVGATLLGLLFIMS
SVVVIGAQLMGYH
>ECs0160 hypothetical protein
MSDDVALPLEFTDAAANKVKSLIADEDNPNLKLRVYITGGGCSGFQYGFT
FDDQVNEGDMTIEKQGVGLVVDPMSLQYLVGGSVDYTEGLEGSRFIVTNP
NAKSTCGCGSSFSI
>ECs2385 hypothetical protein
MKRASLLTLTLIGAFSAIQAAWAVDYPLPPTGSRLVGQNQTYTVQEGDKN
LQAIARRFDTAAMLILEANNTIAPVPKPGTTITIPSQLLLPDAPRQGIIV
NLAELRLYYYPPGENIVQVYPIGIGLQGLETPVMETRVGQKIPNPTWTPT
AGIRQRSLERGIKLPPVVPAGPNNPLGRYALRLAHGNGEYLIHGTSAPDS
VGLRVSSGCIRMNAPDIKALFSSVRTGTPVKVINEPVKYSVEPNGMRYVE
VHRPLSAEEQQNVQTMPYTLPAGFTQFKDNKAVDQKLVDKALYRRAGYPV
AVSSGATPTASNAPSVESAQNGEPEQGNMLRATQ
>ECs0841 putative tail assembly protein
MAATHTLPLASPGMARICLYGDLQRFGRRIDLRVKTGAEAIRALAMQIPA
FRQKLSDGWYLVRIAGRDTGENELSARLNEPLANGAVIHIVPRLAGAKSG
GVFQAVLGAALIAVAWWNPVGWLGAAAVSGMYAAGASMILGGVAQMLAPK
ARTPRTQTTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGSRVVSQEIS
TADEGDGGQVVVIGR
>ECs2145 hypothetical protein
MKFQAIVLASFLVIPYALADDQGGLKQDAAPPPPHAIEDGYRGTDDAKKM
TVDFAKTMHDGASVSLRGNLISHKGEDRYVFRDKSGEINVVIPATVFDGR
EVQPDQMINISGSLDKKSAPAVVRVTHLQK
>ECs0218 IcmF-like protein
MVIGPAGSGKTTLLREGFPSDIIYAPEGARGTEQRLYLTPHVGKQAVIFD
IDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTA
DKRHREHLLQTLRSRLQDIRQHLHCQLPVYVVLTRLDLLQGFAALFQSLN
RQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVTQTH
TRASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQM
DDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATES
RAWLMRSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAF
MDVPPPQGEDDYGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQGRRI
GPYVEQTYLQLLEQRYLPSLFNGLVKAMNAAPPESEEKLAVLRVMRMLED
KSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLMSHLDYALAHTDWHAERQA
GDGDAISRWTPYDKPVVSAQKELSKLPVYQRVYQSLKTRALGVLPADLNL
RDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDS
WVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
GQLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKERDEALAEPD
YQLLTRLGHEFAPENITLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPV
PGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLTDQAWHVV
MVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPD
GILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFS
KQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNM
REGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRF
SVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
>ECs0340 hypothetical protein
MEKYLHLLSRGDKIGLALIRLSIAIVFMWIGLLKFVPYEADSITPFVANS
PLMSFFYEHPEDYKQYLTHEGEYKPEARAWQSANNTYGFSNGLGVVEVII
ALLVLANPVNRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGF
PYLSGAGRLVLKDTLMLAGAVMIMADSAREILKQGSNESSSTLKTEY
>ECs0839 putative minor tail protein
MQDIPQETHHETTRLTQSAQAVLWEIDLTEVGGERYFFCNEQNEKGEPVT
WQGRQYQAYPIQGTGFELNGKGSSARPTLTVSNLHGMVTGMAEDLQSLVG
GTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASF
VLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYHGPAVADEYDQPTSDI
TKDKCSKCLSGCKFRNNVGNFGGFLSINKLSQ
>ECs0236 VgrG
MSTGLRFTLEVDGLPPDAFAVVFFHLNQSLSSLFSLALSLVSQQFLSLEF
QQILDKMAYLTIWQGDDVQRRVKGVVTWFELGENDKNQKLYSMKVCPPLW
RTGLRQNFRIFQNEDIASILGTILQENGVTEWSPLFSEPHPSREFCVQYG
ETDYDFLCRMAAEEGIFFYEEHAQKSTDQSLVLCDTVLYLPESFEIPWNP
NTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQHQDY
QRTQYEVYDYPGRFKGAHGQNFARWQMDGWRNNAEVARGTSRSPEIWPGR
RIVLTGHPQANLNREWQVVASELHGEQPQAVPGRRGSGTTLDNHFAVIPA
DRTWRPQPLLKPLVDGPQSAVVTGPAGEEIFCDEHGRVRVKFNWDRYNPS
NQDSSCWIRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRTY
HQENRTPGSLPGTKTQMTIRSKTYKGSGFNELKFDDATGKEQVYIHAQKN
MNTEVLNNRTTDVINNHAEKIGNNQAITVTNNQILNIGVNQIQTVGVNQV
ETVGSNQIIKVGSNQVEKVGIIRALTVGVAYQTTVGGIMNTSVALLQSSQ
VGLHKSLMVGMGYSVNVGNNVTFSVGKTMKENTGQTAVYSAGEHLELCCG
KARLVLTKDGSIFLNGTHIHLEGESDVNGDAPVINWNCGATQPVPDAPVP
KDLPPGMPDMRQF
>ECs0252 hypothetical protein
MNSGQFSKDVKLAQKRHKDMNKLKYLMTLLINNTLPLPAVYKDHPLQGSW
KGYRDAHVEPDWILIYKLTDKLLRFERTGTHAALFG
>ECs4078 hypothetical protein
MSKARRWVIIVLSLAVLVMIGINMAEKDDTAQVVVNNNDPTYKSEHTDTL
VYNPEGALSYRLIAQHVEYYSDQAVSWFTQPVLTTFDKDKIPTWSVKADK
AKLTNDRMLYLYGHVEVNALVPDSQLRRITTDNAQINLVTQDVTSEDLVT
LYGTTFNSSGLKMRGNLRSKNAELIEKVRTSYEIQNKQTQP
>ECs0882 putative hydroxylase
MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS
TLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGA
VRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLV
LYPSSSLHCVTPVTRGVRVASFIWIQSMIRDDKKRAMLFELDKNIQNIQS
LKSRYGENEEILSLLNLYHNLLREWSEI
>ECs0123 hypothetical protein
MAQCDFGALPGAEEHTMDYEFLRDITGVVKVRMSMGHEVVGHWFNEEVKE
NLALLDEVEQAAHALKGSERSWQRAGHEYTLWMDGEEVMVRANQLEFAGD
EMEEGMNYYDEESLSLCGVEDFLQVVAAYRNFVQQK
>ECs1830 putative structural proteins
MNMKTIEDVFIHLLSDTYSAEKQLTRALAKLARATSNEKLSQAFHAHLEE
THGQIERIDQVVESESNLKIKRMKCVAMEGLIEEANEVIESTEKNEVRDA
ALIAAAQKVEHYEIASYGTLATLAEQLGYRKAAKLLKETLEEEKATDIKL
TDLALNNVNKKAENKA
>ECs3079 hypothetical protein
MPQISRYSDEQVEQLLAELLNVLEKHKAPTDLSLMVLGNMVTNLINTSIA
PAQRQAIANSFARALQSSINEDKAH
>ECs4356 HicB-like protein
MIKLKTPNSMEIAGQPAVITYVPELNAFRGKFLGLSGYCDFVSDSIQGLQ
KEGELSLREYLEDCKAAGIEPYARTEKIKTFTLRYPESLSERLNNAAAQQ
QVSVNTYIIETLNERLNHL
>ECs4836 hypothetical protein
MGKFLVEREQMRYPVDVYTGKIQAYPEGKPSAIAKIQVDGELMLTELGLE
GDEQAEKKVHGGPDRALCHYPREHYLYWAREFPEQAELFVAPAFGENLST
DGLTESNVYIGDIFRWGEALIQVSQPRSPCYKLNYHFDISDIAQLMQNTG
KVGWLYSVIAPGKVSADAPLELVSRVSDVTVQEAAAIAWHMPFDDDQYHR
LLSAAGLSKSWTRTMQKRRLSGKIEDFSRRLWGK
>ECs0262 hypothetical protein
MEWYMGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMP
DLHPGRGYPIGAAFFSVGRFYPTRRRGNGAGNRNGPLL
>ECs5586 hypothetical protein
MVKKTIAAIFSVLVLSTVLTACNTTRGVGEDISDGGNAISGAATKAQQ
>ECs2947 putative minor tail protein
MQDIHGESLIESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT
WQGREYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG
ATVVRRRVYARFLDAVNFVAGNPEADPEQELSDRWVVEQMSELTAMTASF
VLATPTETDGALFPGRIMLANTCMWDYRGDECGYNGPAVADEFDNPTTDI
RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ
>ECs2870 hypothetical protein
MKIETFDSVWDAVSDTPEQAENMRIRAELVTIINNWIEQQGFSQAQAASA
LGVTQPRISELARGKIQIFSIDKLITMMAHAGLHIQRIEVQYPHAA
>ECs0436 hypothetical protein
MPHSCREIHCFDNRWQKHKQNYAGRQKRDTIEDYPTKDDFMTIWVDADAC
PNVIKEILYRAAERMQMPLVLVANQSLRVPPSRFIRTLRVAAGFDVADNE
IVRQCEAGDLVITADIPLAAEAIEKGAAALNPRGERYTPATIRERLTMRD
FMDTLRASGIQTGGPDSLSQRDRQAFAAELEKWWLEVQRSRG
>ECs1305 hypothetical protein
MVHKSDSDELAALRAENARLVSLLEAHGIEWRRKPQTPVQRVSVLSTDEK
VALFRRLFRGRDDVWALRWESKTSGKSGYSPACANEWQARICGKPRIKCG
DCAHRQLIPVSDLVIYHHLAGTHTVGMYPLLEDDSCYFLAVDFDEAEWQK
DASAFMRSCDELGVPAALEISRSRQGAHVWIFFASRVSAREARRLGTAII
SYTCSRTRQLRLGSYDRLFPNQDTMPKGGFGNLIALPLQKRPRELGGSVF
VDMNLQPYPDQWAFLVSVTPMNVQDIEPTILRATGSIHPLDVNFINEEDL
GTPWEEKKSSGNRLNISIAEPLKITLANQIYFEKAQLPQVLINRLIRLAA
FPNPEFYKAQAMRMSVWNKPRVIGCAENYPQHIALPRGCLDSVLSFLRDN
NIAAELIDKRFAGTECNAVFMGNLRAEQEEAVSALLRYDTGVLCAPTAFG
KTVTAAAVIAKRKVNTLILVHRTELLKQWQERLAVFLQAGDSIGIIGGGK
HKPCGNIDIAVVQSISRQGEVEPLVRNYGQIIVDECHHIGAVSFSAILKE
TNARYLLGLTATPIRRDGLHPIIFMYCGAIRHTAVRPKESPHNLEVLIRS
RFTSGHLPSDARIQDIFREIALDHDRTVAIAEEAMKAFGQGRKVLVLTER
TDHLDEIASVMNSLKLSPFILHGRLSKKKRAMLISGLNALPPDSPRILLS
TGRLIGEGFDHPPLDTLILAMPVSWKGTLQQYAGRLHREHTGKSDVRIID
FVDTAYPVLLRMWDKRQRGYKAMGYRIIADGDESVI
>ECs0534 hypothetical protein
MTPAVKSLEKNKISFQIHTYEHDPAETNFGDEVVKKLGLNPDQVYKTLLV
AVNGDMKHLAVAVTPVAGQLDLKKVAKALGAKKVEMADPMVAQRSTGYLV
GGISPLGQKKRLPTIIDAPAQEFATIYVSGGKRGLDIELAAGDLAKILDA
KFADIARRD
>ECs5294 hypothetical protein
MFCHPETTAAPCGVYYFIARVKNSLSFIRILLICRTEFDEKKAIIRSNKL
KKRITFIVFLCAIVAASLFFVQSCVRKSQHVAGFQNYQATIDGKEITGVT
KNISSLTWSAQSNTLFSTINKPATIVEMTTEGDLIRTIPLDFVKDLETIE
YIGDNKFVISDERDYAIYVISLNADSEVSILKKIKIPLQETPTNCGFEGL
AYSSQDHTFWFFKEKNPIEVYKVTGLLRSDELHISKDKTLQRQFTLDDVS
GAEFNPQKNTLLVLSHESRALQEVTVRGDVIGEMSLTKGKYGLSHNIKQA
EGIAMDDSGNIYIVGEPNLFYRFTSTKSR
>ECs3654 hypothetical protein
MSSYANHQALAGLTLGKSTDYRDTYDASLLQGVPRSLNRDPLGLKADNLP
FQGTDIWTLYELSWLNAKGLPQVAVGHVELDYTSVNLIESKSFKLYLNSF
NQTRFNNWDEVRQTLERDLSTCAQGEVSVALYRLDELEGQPIGHFNGTCI
DDQDITIDNYEFTTDYLENATSGEKVVEETLVSHLLKSNCLITHQPDWGS
IQIQYRGRQIDREKLLRYLVSFRHHNEFHEQCVERIFNDLLRFCQPEKLS
VYARYTRRGGLDINPWRSNSDFVPSTTRLVRQ
>ECs5200 hypothetical protein
MRIFVYGSLRHKQGNSHWMTNAQLLGDFSIDNYQLYSLGHYPGAVPGNGT
VHGEVYRIDNATLAELDALRTRGGEYARQLIQTPYGSAWMYVYQRPVDGL
KLIESGDWLDRDK
>ECs2780 hypothetical protein
MGRKWANIVAKKTAKDGATSKIYAKFGVEIYAAAKQGEPDPELNTSLKFV
IERAKQAQVPKHVIDKAIDKAKGGGDETFVQGRYEGFGPNGSMIIAETLT
SNVNRTIANVRTIFNKKGGNIGAAGSVSYMFDNTGVIVFKGSDPDHIFEI
LLEAEVDVRDVTEEEGNIVIYTEPTDLHKGIAALKAAGISEFSTTELEMI
AQSEVELSPEDLEIFEGLVDALEDDDDVQKVYHNVANL
>ECs2320 hypothetical protein
MNKSLVAVGVIVALGVVWTGGAWYTGKKIETHLEDMVAQANAQLKLTAPE
SNLEVSYQNYHRGVFSSQLQLLVKPIAGKENPWIKSGQSVIFNESVDHGP
FPLAQLKKLNLIPSMASIQTTLVNNEVSKPLFDMAKGETPFEINSRIGYS
GDSSSDISLKPLNYEQKDEKVAFSGGEFQLNADRDGKAISLSGEAQSGRI
DAVNEYNQKVQLTFNNLKTDGSSTLASFGERVGNQKLSLEKMTISVEGKE
LALLEGMEISGKSDLVNDGKTINSQLDYSLNSLKVQNQDLGSGKLTLKVG
QIDGEAWHQFSQQYNAQTQALLAQPEIANNPELYQEKVTEAFFSALPLML
KGDPVITIAPLSWKNSQGESALNLSLFLKDPATTKEAPQTLAQEVDRSVK
SLDAKLTIPVDMATELMTQVAKLEGYQEDQAKKLAKQQVEGASAMGQMFR
LTTLQDNTITTSLQYTNGQITLNGQKMPLEDFVGMFAMPALNVPVVPAIP
QQ
>ECs5152 hypothetical protein
MNSTIWLALALVLVLEGLGPMLYPKAWKKMISAMTNLPDNILRRFGGGLV
VAGVVVYYMLRKTIG
>ECs0346 putative transporter
MDNRGEFLNNVAQALGRPLRLEPQAEDAPLNNYANERLTQLNQQQRCDAF
IQFASDVMLTRCELTSEAKAAEAAIRLCKELGDQSVVISGDTRLEELGIS
ERLQQECNAVVWDPAKGAKNISQAEQAKVGVVYAEYGLTESGGVVLFSAA
ERGRSLSLLPESSLFILRKSTLLPRVAQLAERLHQKAQAGERMPSCINII
SGPSSTADIELIKVVGVHGPVKAVYLIIEDC
>ECs1718 hypothetical protein
MTSFSTLLSVHLISIALSVGLLTLRFWLRYQKHPQAFARWTRIVPPVVDT
VLLLSGIALMAKAHILPFSGQAQWLTEKLFGVIIYIVLGFIALDYRRMHS
QQARIIAFPLTLVVLYIIIKLATTKVPLLG
>ECs5039 hypothetical protein
MTISELLQYCMAKPGAEQSVHNDWKATQIKVEDVLFAMVKEVENRPAVSL
KTSPELAELLRQQHSDVRPSRHLNKAHWSTVYLDGSLPDSQIYYLVDASY
QQAVNLLPEEKRKLLVQL
>ECs2162 putative tail assembly protein
MATTNAFCLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQMPG
FRRQMNEGWYQIRIRGEDTAPEAVYARLHEPLGEGAVIHIVPRLAGAGKG
GLQIVLGAAAIVGSFFTAGATMALWGAALSAGGLTATTMLFSLGASMILG
GVAQMLAPKAKTPEYRATDNGKQNTYFSSLDNMIAQGNPMPVPYGEMLVG
SRRISQDISTRDEGGDGKVVVIGRG
>ECs2458 hypothetical protein
MMKMQSRKIWYYRITLIILLFAMLLAWALLPGVHEFINRSVAAFAAVDQQ
GIERFIQSYGALAAVVSFLLMILQAIAAPLPAFLITFANASLFGAFWGGS
LSWTSSMAGAALCFFIARVMGREVVEKLTGKTVLDSMDGFFTRYGKHTIL
VCRLLPFVPFDPISYAAGLTSIRFRSFFIATGLGQLPATIVYSWAGSMLT
GGTFWFVTGLFILFALTVVIFMAKKIWLERQKRNA
>ECs1644 minor tail protein
MKTFRWKVKPGMDVASAPSVRKVRFGDGYSQRAPAGLNADLKTYSVTLSV
PRWEATALESFLAEHGGWKAFLWTPPYEWRQIKVTCAKWSSRVSMLRVEF
SAEFEQVVN
>ECs1829 hypothetical protein
MNRIEHYHDWLRDAHAMEKQAESMLESMASRIDNYPELRARIEQHLSETK
NQIVQLETILDRNDISRSVIKDSMSKMAALGQSIGGIFPSDEIVKGSISG
YVFEQFEIACYTSLLAAAKNAGDTASIPIIEAILNEEKQMADWLIQHIPQ
TTEKFLIRSETDGVEAKK
>ECs0952 hypothetical protein
MQFSTTPTLEGQTIVEYCGVVTGEAILGANIFRDFFAGIRDIVGGRSGAY
EKELRKAREIAFEELGSQARALGADAVVGIDIDYETVGQNGSMLMVSVSG
TAVKTRR
>ECs0228 hypothetical protein
MDGKNRAASSYLSPGNPPADKEQNDPLAQVFHNACSYNFFAMAELLHRLA
KGEKGTPELSLRDDPAQETLRFSADASLAFPCSDISALKRDTSGAFRMTT
TFMGLQGSQSPLPGYYLDHLAWKAVHEQSPVGDFLDMFSHRLTQFVWHIW
RKYRYHISFRNGGVDAFSQRMYSLVGLGHRQLRDKLAINHSKMLAYSGIL
ANPGRSPEIICGLVSHCFDLSEVTLQNWQRRKVDIEPDQQNSLGSYSLKN
GEKLAGRSVLGNFVLGTRVPDLSGKFQLSITSLTRKQFLSFLPSGENFLP
LTTFVSFILRDQLAWDLHLGLAPEQVGAMRLGDNKSALLGWTSFLGTPEE
RPSVTIRVRS
>ECs2164 putative minor tail protein
MQDIHEESLNESVKSEQSPRVVLWEIDLTAQGGERYFFCNELNEKGEAVT
WQGRQYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG
ATVVRRRVYARFLDAVNFVAGNPEADPEQELTDRWVVEQMSSLTAMTASF
VLATPTETDGALFPGRIMLANTCMWDYRGDECGYNGPAVADEFDNPTTDI
RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ
>ECs3510 hypothetical protein
MNNIFIFEPSNKNNPLDNVIKFIEFCKSTISNNNLTTSWESNKWKGLYRF
TKFNSKNNLNSKECLDDSFINFAKAYMLHVHSFNKSKTKHSTLSMLKIVE
FVLLKINMEANVNYCNNSIYDECIRIASEKYSKAHAFAIGKELEKLSSFL
NDNRMTNSFYLFWVNPIRYRITQSWTGYDSSLEGHSRLPDIKSVIAIAEI
FSKRDEQLSSRDIFTTSVLALLMCAPSRISEILALPADCEITECDGKGIQ
RYGLRFFSAKGYEGNIKWIPTLMIPVAKKAISRLKELSSQARLLAAEIQK
NYSNSTKGTLKENIPPDLFWYDREKKIKYSNALCLLTEGQLNQNKKEMSD
KLFRPTTNFFKTDIIDSDYIKGYFNVFKRHGYINEDGSPYLLRTHQLRHL
LNTFAQINGMDEFSIARWSGRKLISQNVSYDHRSHLQMSKAIREKKLSVC
VNEHRIKDIPVVDLNEFDSLSSGAVLVSKHGYCKHSYAFKPCDNYPIKNS
GLDNETISNIHDKILKRTLYDKNDGNINADKWYEFHKKIKKGE
>ECs5352 hypothetical protein
MLIMHQVVCATTNPAKIQAILQAFHEIFGEGSCHIASIAVESGVPEQPFG
SEETRAGARNRVANARRLLPEADFWVAIEAGIDGDSTFSWVVIENTTQRG
EARSATLPLPAVILEKVREGEALGPVMSRYTGIDEIGRKEGAIGVFTAGK
LTRASVYHQAVILALSPFHNAVY
>ECs2448 hypothetical protein
MPITIHGYLNDNNEKNGPGASMEYFDMRKMSVNLWRNAAGETREICTFPP
AKRDFYWRASIASIAANGEFSLFPGMERIVTLLEGGEMFLESADHFNHTL
KPLQPFAFAADQVVKAKLTAGQMSMDFNIMTRLDVCKAKVRIAERTFTTF
GSRGGVVFVINGAWQLGDKLLTTDQGACWFDGRHTLRLLQPQGKLLFSEI
NWLAGHSPDQVQ
>ECs4465 hypothetical protein
MAIKSPPTLIPLSHLSGEELQAHLRFNRVTDEKGRYLPFDELQYRIKKGE
NVDVAWTLTRLARNAAIQRINYCNEAGEQAGFNITPVIAEACELVDKRAT
ALALKDQTERLRGAGAELSQLRLEEPITSSQLEGANTTTLVARKMLETGR
SPRTEDEHMIAGNARLMAEIPHLLAEPLTPALIRQLHAIGMGGINDAKYR
PGEFRETDDVVIADYDGNIVHQPPAAALLPERLEKVCQWLNSHEGYIHPL
VRACILHFMLAHEHPFRDGNGRTSRALFYWYMLKSGYDVFKYISISRLLH
AAPVKYAASYQYTESDGMDLTYFLEYQAGVIKRALQNWQQHIDEITQRSA
KLDSVLFSSGVLKRLNPRQVTLLNVMLANPGKEYTVAEISASLGVSDNTV
RADLRTIVKEGFAQEKKINDQQAVYFAHYPL
>ECs1806 putative host specificity protein
MGKGGGKAHTPVEAKDNLKSTQMMSVIDAIGEGPIEGPVKGLQSILVNKT
PLTDTDGNPVIHGVTAVWRAGEQEQTPPEGFESSGAETALGVEVTKAKPV
TRTITSANIDRLRVTFGVQSLLETTSKGDRNHSSVRLLIQLQRNGNWVTE
KDVTINGKTTSQFLASVILDNLPPRPFNIRMVRETADSTTDQLQNRTLWS
SYTEIIDVKQCYPNTAIVGLQVDAEQFGGQQMTVNYHIRGRIIQVPSNYD
PEKRTYSGIWDGSLKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWA
LYAIAQYCDQTVPDGFGGTEPRMTFNAYLSQQRKAWDVLSDFCSAMRCMP
VWNGQTLTFVQDRPSDVVWPYTSSDVVVDDNGVGFRYSFSALKDRHTAVE
VNYTDPQNGWQTSTELVEDPEAILRYGRNLLKMDAFGCTSRGQAHRAGLW
VIKTGLLETQTVDFTLGSQGLRHTPGDIIEICDNDYAGTMTGGRILSIDA
ASRTLTLDREVTLPETGAATVNLINGSGKPVSVAITAHPAPDRIQVSTLP
DGVETYGVWGLSLPSLRRRLFRCVSVRENTDGTFAITAVQHVPEKEAIVD
NGARFEPQSGTLNSVIPPAVQHLTVEVSAADGQYLAQAKWDTPKVVKGVS
FMLRLTVAADDGSERLVSTARTTETTYRFTQLAPGNYRLTVRAVNAWGQQ
GDPASVSFRIAAPAAPSQIELTPGYFQITAVPRLAVYDPTVQFEFWFSET
RITDIRQVETTARYLGTGLYWIAASINIKPGHDYYFYIRSVNTVGKSAFV
EAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLAEIRTSI
TDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDG
RLYIAGIGAGIENTSDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQI
FMNEVFLKYLTAPTITSGGNPPAFSLTPDGRLTAKNADISGNVNANSGTL
NNVTINENCRVLGKLSANQIEGDLVKTVGKAFPRDSRAPERWPSGTITVR
VYDDQPFDRQIVIPAVAFSGAKHEKEHTDIYSSCRLIVRKNGAEIYNRTA
LDNTLIYSGVIDMPAGHGHMTLEFSVSAWLVNNWYPTASISDLLVVVMKK
ATAGITIS
>ECs3835 hypothetical protein
MAKNRSRRLRKKMHIDEFQELGFSVAWRFPEGTSEEQIDKTVDDFINEVI
EPNKLAFDGSGYLAWEGLICMQEIGKCTEEHQAIVRKWLEERNLDEVRTS
ELFDVWWD
>ECs2934 hypothetical protein
MEFYMKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYV
YKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLT
YTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKA
AGFKEVK
>ECs3820 hypothetical protein
MKTSRLPIAIQQAVMRRLREKLAQANLKLGRNYPEPKLSYTQRGTSAGTA
WLESYEIRLNPVLLLENSEAFIEEVVPHELAHLLVWKHFGRVAPHGKEWK
WMMESVLGVPARRTHQFELQSVRRNTFPYSCKCQEHQLTVRRHNRVVRGE
AVYRCVHCGEQLVAK
>ECs2721 putative tail assembly protein
MATTNAFCLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQMPG
FRRQMNEGWYQIRIRGEDTAPEAVYARLHEPLGEGAVIHIVPRLAGAGKG
GLQIVLGAAAIVGSFFTAGATMALWGAALSAGGLTATTMLFSLGASMILG
GVAQMLAPKAKTPEYRATDNGKQNTYFSSLDNMIAQGNPMPVPYGEMLVG
SRRISQDISTRDEGGDGKVVVIGRG
>ECs3023 hypothetical protein
MLTGNQRETDWPMDLNTLISQYGYAALVIGSLAEGETVTLLGGLAAHQGL
LKFPLVVLSVALGGMIGDQVLYLCGRRFGGKLLRRFSKHQDKIERAQKLI
QRHPYLFVIGTRFMYGFRVIGPTLIGASQLPPKIFLPLNILGAFAWALIF
TTIGYAGGQVIAPWLHNLDQHLKHWVWLILVVVLVVGVRWWLKRRGKKKP
DNQA
>ECs2203 hypothetical protein
MILRQCAGTMKVKSVGALIGRTEAAVRTKARELGISMMLRGDFHPSAKYS
QRDIELARQLHQRGMQRREIARKLGMPLRIVNNYVYFDRRVSA
>ECs3154 hypothetical protein
MSNQFGDTRIDDDLTLLSETLEEVLRSSGDPADQKYVELKARAEKALDDV
KKRVSQASDSYYYRAKQAVYRADDYVHEKPWQGIGVGAAVGLVLGLLLAR
R
>ECs2166 putative tail length tape measure protein precursor
MSQPAGDLVIDLSLDAARFDEQMARVRRHFSSLEADARKTASTVEQGLSR
QALAAQKAGISVGQYKAAMRTLPAQFTDIVTQLAGGQNPFLIMLQQGGQI
SDSFGGPLSLLTLLKEELLGIRDASESSEESLSDTANALAENARNAGELG
RFMSVARVAAGGGVAVLAALAAAAWQAEQADRALLRSLILTGGAAATTTA
ELWKMAGVISDEAGGGIRQAAENLARLAESGKYTAGQLRIMGETSQRWLQ
TVGDDAGKVEKAFEGIAADPVKALASLNQQYNFLSVSQLRHIDELERTKG
KQVAVTEAMSLFADVMNARLEQLDKAATPVEKIWDDVKTWTSDAWAWIGD
HTLGALSLITDVVAGTVEQVKLLLVQGDLALAEFIQSAWETTKNVPGVGA
LFGELAEENRVFIEKTKRDELALRKSIAERDARIRQGEMGYINRSRATGV
SKGPGQQEAVSRLAEELTGKKHTSPKTRSAGEREEEQAREALLALEAELR
TLEKHSGANEKISRQRRDLWKAESQYAVLKEAATKRQLSEQEKSLLAHKD
ETLEYKRQLAELGDKVEYQKRLNELAQQAVRFEEQQSAKQAAISAKARGL
TDRQAQRESEAQRLRDVYGDNPAALAKATSALKNTRSAEEQLRGSWMAGL
KSGWGEWAESATDSFSQVKSAATQTFDGIAQNMAAMLTGAEADWRGFTRS
VLSMMTEILLKQAMVGIVGRIGSAIGGAFGGGASASSGTAIQAAAANFHF
ATGGFTGTGGKYEPAGIVHRGEFVFTKEATSRIGVGNLYRLMRGYAEGGY
VGGAGSPAQMRRAEGINFNQNNHVVIQNDGTNGQAGPQLMKAVYDMARKG
AQDELRLQLRDGGMLSGSGR
>ECs1434 hypothetical protein
MKKSLLGLTFASLMFSAGSAVAADYKIDKEGQHAFVNFRIQHLGYSWLYG
TFKDFDGTFTFDEKNPAADKVNVTINTTSVDTNHAERDKHLRSADFLNTA
KYPQATFTSTSVKKDGDELDITGDLTLNGVTKPVTLEAKLIGQGDDPWGG
KRAGFEAEGKIKLKDFNIKTDLGPASQEVDLIISVEGVQQK
>ECs2459 hypothetical protein
MGLPPLSKIPFILRPQAWLHRRHYGEVLSPIRWWGRIPFIFYLVSMFVGW
LERKRSPLDPVVRSLVSARIAQMCLCEFCVDITSMKVAERTGSSDKLLAV
ADWRQSPLFSDEERLALEYAEAASVTPPTVDDALRTRLAAHFDAQELTEL
TALIGLQNLSARFNSAMDIPAQGLCRIPEKRS
>ECs0216 Hcp-like protein
MANLIYLTLNGDNQGLISSGCSSQPSIGNKAQTAHIDQIMIYEFMHGLNR
DQNVNHHHLTIKKPIDKASPLLGKAICDNELLTCDFSFYRTNKFGINELF
YKIKLTGAKISDIHVSISHIVVDNSVQPEESVSFSYESIIWEHCSAGTSA
YSLWEDRLF
>ECs1008 putative amidase
MLLNMMCGRRLSAISLCLAVTFAPLFNAQADEPEVIPGDSPVAVSEQGEA
LPQAQATAIMAGILPLPEGAAEKARTQIESQLPAGYKPVYLNQLQLLYAA
RDMQPMWENRDAVKAFQQQLAEVAIAGFQPQFNKWVELLTDPGVNGMARD
VVLSDAMMGYLHFIANIPVKGTRWLYSSKPYALATPPLSVINQWQQALDK
GQLPTFVAGLAPQHPQYAVMHESLLALLSDTKPWPQLTGKATLRPGQWSN
DVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMD
KQTTSRSKPAPAVRAAYDNELVEAVKRFQAWQGLGADGAIGPATRDWLNV
TPAQRAGVLALNIQRLRLLPTELSTGIMVNIPAYSLVYYQNGNQVLDSRV
IVGRPDRKTPMMSSALNNVVVNPPWNVPPTLARKDILPKVRNDPGYLESH
GYTVMRGWNSREAIDPWQVDWSTITASNLPFRFQQAPGPRNSLGRYKFNM
PSSEAIYLHDTPNHNLFKRDTRALSSGCVRVNKASDLANMLLQDAGWNDK
RISDALKQGDTRYVNIRQSIPVNLYYLTAFVGADGRTQYRTDIYNYDLPA
RSSSQIVSKAEQLIR
>ECs2439 hypothetical protein
MERLLIVNADDFGLSKGQNYGIIEACRNGIVTSTTALVNGQAIDHAVQLS
RDEPSLAIGMNFVLTMGKPLTAMPGLTRDGVLGKWIWQLAEEDALPLEEI
TQELASQYLRFIELFGRKPTHLDSHHHVHMFPQIFPIVARFAAEEGIALR
IDRQPLSNAGDLPANLRSSHGFSSAFYGEEISEALFLQVLDDSSHRGERS
LEVMCHPAFVDNTIRQSAYCFPRLTELEVLTSASLKYAIAERGYRLGSYL
DV
>ECs1544 putative portal protein
MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFSGAWQQGVKADPEAVL
SFHAVFACISLISQDIAKMRLRLMQTDAHGIRRETRRGDIARLCRRPNAQ
QNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWSRVEPLVADD
GEVFYRITPDRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLA
ATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGE
NAGKTAILSNGAKYNPTTFSPVDAQTVEQLKMTAEIVCSVFRVPAYKIGV
GQPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETGENESTEFDVTT
LLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYS
LEALSRRDAREDPFASAGKTVSSQLPDGASDGNKAISETEHDAVKAMFRG
DTEKMTERELSIIRALGEEFSTVLADLQRTFEEKIAAQAQTFEEKLASQS
VVLQKCVTGDDVRPMLEQMVKEAVSHIPVPRDGRDYDPDVLQKAVNDAVA
NIPVPADGKSITPDDVRPMLEQMVKEAVSHIPVPRDGRDYDPDVLQKAVN
DAVAKIPVPADGKSITPDDVHPMLEQMVKEAVSHIPVPRDGRDYDPDVLQ
KAVNDAVAKIPVPADGKSITPDDVHPMLEQMVKEAVSHIPVPRDGRDYDP
DVLQKAVLEAVSALPAPQDGRDATALEILPAIDDQKSFPRGSYATHQGGL
WRAYEKTYGMRGWECLVDGVADIDVSMTGERSFSVVVRQSSGQRTEKTFS
LPVMLYRGVFRIGETYHPGDTVTWGARCGTATV
>ECs0770 hypothetical protein
MSKIIATLYAVMDKRPLRALSFVMALLLAGCMFWDPSRFAAKTSELEIWH
GLLLMWAVCAGVIHGVGFRPQKVLWQGIFCPLLADIVLIVGLIFFFF
>ECs1009 hypothetical protein
MDKFDANRRKLLALGGVALGAAILPTPAFATLSTPRPRILTLNNLHTGES
IKAEFFDGRGYIQEELAKLNHFFRDYRANKIKSIDPGLFDQLYRLQGLLG
TRKPVQLISGYRSIDTNNELRARSRGVAKKSYHTKGQAMDFHIEGIALSN
IRKAALSMRAGGVGYYPRSNFVHIDTGPARHW
>ECs4740 hypothetical protein
MKQPGEELQETLTELDDRVVVDYLIKNPEFFIRNARAVEAIRVPHPVRGT
VSLVEWHMARARNHIHVLEENMALLMEQAIANEGLFYRLLYLQRSLTAAS
SLDDMLMRFHRWARDLGLAGASLRLFPDRWRLGAPSNHTHLALSRQSFEP
LRIQRLGQEQHYLGPLNGPELLVVLPEAKAVGSVAMSMLGSDADLGVVLF
TSRDASHYQQGQGTQLLHEIALMLPELLERWIERV
>ECs5199 hypothetical protein
MSLWKKISLGVVIVILLLLGSVAFLVGTTSGLHLVFKAADRWVPGLDIGK
VTGGWRDLTLSDVRYEQPGVAVKAGNLHLAVGLECLWNSSVCINDLALKD
IQVNIDSKKMPPSEQVEEEEDSGPLDLSTPYPITLTRVALDNVNIKIDDT
TVSVMDFTSGLNWQEKTLTLKPTSLKGLLIALPKVAEVAQEEVVEPKIEN
PQPEEKPLGETLKDLFSRPVLPEMTDVHLPLNLNIEEFKGEQLRVTGDTD
ITVRTMLLKVSSIDGNTKLDALDIDSNQGILNASGTAQLSDNWPVDITLN
STLNVEPLKGEKVKLKVGGALREQLEIGVNLSGPVDMDLRAQTRLAEAGL
PLNVEVNSKQIYWPFTGEKQYQADDLKLKLTGKMTDYTLSMRTAVKGLEI
PPATITLDAKGNEQQVNLDKLTVAALEGKTELKALLDWQQAISWRGELTL
NGINTAKEIPEWPSKLNGLIKTRGSLYGGTWQMEVPELKLTGNVKQNKVN
VDGTLKGNSYMQWMIPGLHLELGPNSAEVKGELGVKDLNLDATINAPGLD
NALPGLGGTAKGLVKVRGTVEAPQLLADITARGLRWQELTVAQVRVEGDI
KSTDQIAGKLDVRVEQISQPDVNINLVTLNAKGSEKQHELQLRIQGEPVS
GQLNLAGSFDRKEERWKGTLSNTRFQTLVGPWSLTRDIALDYRNKEQKIS
IGPHCWLNPNAELCVPQTIDAGAEGRAVVNLNRFDLAMLKPFMPETTQAS
GIFTGKADVAWDTTEEGLPQGSITLSGRNVQVTQTVNDAALPVAFQTLNL
TAELRNNRAELGWTIRLTNNGQFDGQVQVTDPQGRRNLGGNVNIRNFNLA
MINPIFTRGEKAAGMVSANLRLGGDVQSPQLFGQLQVTGVDIDGNFMPFD
MQPSQLAVNFNGMRSTLAGTVRTQQGEIYLNGDADWSQIENWRARVTAKG
SKVRITVPPMVRMDVSPDVVFEATPNLFTLDGRVDVPWARIVVHDLPESA
VGVSSDVVMLNDNLQPEEPKTASIPINSNLIVHVGNNVRIDAFGLKARLT
GDLNVVQDKQGLGLNGQINIPEGRFHAYGQDLIVRKGELLFSGPPDQPYL
NIEAIRNPDATEDDVIAGVRVTGLADEPKAEIFSDPAMSQQAALSYLLRG
QGLESDQSDSAAMTSMLIGLGVAQSGQIVGKIGETFGVSNLALDTQGVGD
SSQVVVSGYVLPGLQVKYGVGIFDSIATLTLRYRLMPKLYLEAVSGVDQA
LDLLYQFEF
>ECs3930 hypothetical protein
MALNTYQYRETTMIDPKKIEQIARQVHESMPKGIREFGEDVEKKIRQTLQ
AQLTRLDLVSREEFDVQTQVLLRTREKLALLEQRSSELEARNNSVADLQS
PPAIPPIDKAE
>ECs0251 hypothetical protein
MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFK
EERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQR
NQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQG
IDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKP
GYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK
>ECs5021 hypothetical protein
MLGHISKFDGNNSLIKHGVVQGNNIVDFDLLRNFNGGPGLNRENFIYISN
IFLNIKQRNEKNHSINMFREVSISGDIVSVKFYRNEKIECACDFMMAKDA
QGYIDLSELDLTSCHFKGDVISKVSFISSNLQHVTFECKEIGDCNFTTAI
VDNVIFKCRRLHNVIFIKASGDYVDFSKNILDTVDFSQSQLTHSNFCECQ
IRNSNFDHCYLYASHFTRAEFLTDKEISFIKSNLTAVMFDHVRISTGNFK
DSVTQLMVLSIDYSDIFGNEYLDGYINNIIKMIDSLPDDPAILKSVLAVK
LVMQLKILNIVNKNFIENMKKIFSHGPYIKDPIIRSYIHPDEDNKFDNFM
RQNRFSKVNFDTQQMIDFINRFNMNKWLIDRNNNFFIQLIDQALRSTNDT
IKENAWHLYKEWIRSDDVSPLFIEIEDNLRTFNTNELTRNDNIFILFSSV
DDGPVMVVSSQRLHDMLNPTKDTNWNSTYIYKSRHEMLPVNLTPETLFGS
KSYDKHALFPIFTASWRANRIKNKGI
>ECs2073 hypothetical protein
MKFPSIFNKIKPQSIQQHPEKNQLNWMLELNKWKEERILTGEIHRPECRN
EAAKRINCAFLSKQNDIDLSGLNLSTQPPGLQNFTSINLDNNQLTHFDAT
NYDRLVKLSLNSNTLESINIHQGRNVSITHISMNNNCLRNIDIDRLSSIT
YFSAAHNKLEFVQLESCEWLQYLNLSHNQLTDIVTGNKEELLLLDLSHNK
LASLHNALFPNLNTLLINNNLLSEIKMFYSNFCKVQTLNAANNQLEKINL
HFLTYLSSIKSLRLDNNKITRIDTENTSDIRSLFPIIKKSESLNFLNISG
ENNCPTIQLMLFNLFSPALKLNTGLAILSPGAFEDHSDGLDVDNELFHYT
INKAYTPYNIHTYKTEEVVNQRNIKIKNMTLDEINNTYCNNDYYNEAIRE
EPIDFLDRSFSSSSWPFYH
>ECs5495 hypothetical protein
MEKEQLIEIANTIMPFGKYKGRRLIDLPEEYLLWFARKDEFPAGKLGELM
QITLLIKTEGLTQLVQPLKRPL
>ECs4317 hypothetical protein
MLWSFIAVCLSAWLSVDASYRGPTWQRWVFKPLTLLLLLLLAWQAPMFDA
ISYLVLAGLCASLLGDALTLLPRQRLMYAIGAFFLSHLLYTIYFASQMTL
SFFWPLPLVLLVLGALLLAIIWTRLEEYRWPICTFIGMTLVMVWLAGELW
FFRPTAPALSAFVGASLLFISNFVWLGSHYRRRFRADNAIAAACYFAGHF
LIVRSLYL
>ECs2096 hypothetical protein
MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQ
QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQR
LGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH
KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG
DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN
ETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR
NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR
YDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ
LDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS
>ECs1683 putative sporulation protein
MATIDSMNKDTTRLSDGPDWTFDLLDVYLAEIDRVAKLYRLDTYPHQIEV
ITSEQMMDAYSSVGMPINYPHWSFGKKFIETERLYKHGQQGLAYEIVINS
NPCIAYLMEENTITMQALVMAHACYGHNSFFKNNYLFRSWTDASSIVDYL
IFARKYITECEERYGVDEVERLLDSCHALMNYGVDRYKRPQKISLQEEKA
RQKSREEYLQSQVNMLWRTLPKREEEKTVAEARRYPSEPQENLLYFMEKN
APLLESWQREILRIVRKVSQYFYPQKQTQVMNEGWATFWHYTILNHLYDE
GKVTERFMLEFLHSHTNVVFQPPYNSPWYSGINPYALGFAMFQDIKRICQ
SPTEEDKYWFPDIAGSDWLETLHFAMRDFKDESFISQFLSPKVMRDFRFF
TVLDDDRHNYLEISAIHNEEGYREIRNRLSSQYNLSNLEPNIQIWNVDLR
GDRSLTLRYIPHNRAPLDRGRKEVLKHVHRLWGFDVMLEQQNEDGSVELL
ERCPPRMGNL
>ECs2239 putative minor tail protein
MAEIKTLHLVPREGMQVSEKPSVVRVRFGDGYEQRRPTGLNPQLKTFQAV
FRVTDESTRRWLDEFLSWHGGYRAFLWRPPKHNRTVRVVCREWSVTDNAR
YSDFSCTIEQVVN
>ECs2585 hypothetical protein
MANWQSIDELQDIASDLPRFTHALDELSRRLGLNITPLTADHISLRCHQN
ATAERWRRGFEQCGELLSENMINGRPICLFKLHEPVQVAHWQFSIVELPW
PREKRYPHEGWEHIEIVLPGDPETLNARALALLSDEGLSLPGISVKTSSP
KGEHERLPNPTLAVTDGKTTIKFHPWSIEEIVASEQSA
>ECs0273 hypothetical protein
MKKAAILIDAGFFMQRVHATHRKHFAEHELTAQCIMKVIWSMVLSHLNGK
RQSQERREPLELYRIYFYDCPPLDIQTRLPLPEPGNKTPGRKNFKLEKSY
ILRTELHEELRKTRKTPIVFILKLIGATH
>ECs0285 hypothetical protein
MDITEFPSGVIEHLGWYVYRLIDPRDGSTFYVGKGKGNRVFAHMRGEVAA
ADDDDLLSNKLKQIREIRLAGLEVIHVIHRHGMTDEKTAYEVEAALIDAY
PGLTNIMNGAGSNEFGAAHVKELIATYQPETITFHHKALMISVNRSAKDS
ELYDAVRFSWRINVSRASKAEVILATVRGIVRGVFIADKWLKSTREHFPT
MKYWDEDPDFEATQSSRYGFEGREAPPEIANLYLGKKIPDELRKKGAMSP
VRYSPNF
>ECs3925 hypothetical protein
MGIYHRSRKTKMKRTKSIRHASFRKNWSARHLTPVALAVATVFMLAGCEK
SDETVSLYQNADDCSAANPGKSAECTTAYNNALKEAERTAPKYATREDCV
AEFGEGQCQQAPAQAGMAPENQAQAQQSSGSFWMPLMAGYMMGRLMGGGA
GFAQQPLFSSKNPASPAYGKYTDATGKNYGAAQPGRTMTVPKTAMAPKPA
TTTTVTRGGFGESVAKQSTMQRSATGTSSRSMGG
>ECs4391 hypothetical protein
MLYIDKATILKFDLEMLKKHRRAIQFIAVLLFIVGLLCISFPFVSGDILS
TVVGALLICSGIALIVGLFSNRSHNFWPVLSGFLVAVAYLLIGYFFIRAP
ELGIFAIAAFIAGLFCVAGVIRLMSWYRQRSMKGSWLQLVIGVLDIVIAW
IFLGATPMVSVTLVSTLVGIELIFSAASLFSFASLFVKQQ
>ECs2473 hypothetical lipoprotein
MESSVNKAPSLIAAIVLGLGISACGYFVGDGVKHLKTNNRYVNVRGLSEK
EVRADTAELTIAINFKGNVPGELFPKLEEAQKKIVAELNAQGINEKEIIL
GQWTSKRTDSFYLKDDPTMPRYNADGSVTIKTHNVAAVEKVVAKLNELQV
ATDGAIAESKVAYRFNGIGALRAEMIAAATKDARNAALQFATDSGSQVGS
ISDASQGVFQIFASGSDEDDPTAINKTVRVVTTVTYALQD
>ECs0669 hypothetical protein
MKTKLNELLEFPTPFTYKVMGQALPELVDQVVEVVQRHAPGDYTPTVKPS
SKGNYHSVSITINATHIEQVETLYEELGKIDIVRMVL
>ECs4106 hypothetical protein
MFMTWEYALIGLVVGIIIGAVAMRFGNRKLRQQQALQYELEKNKAELDEY
REELVSHFARSAELLDTMAHDYRQLYQHMAKSSSSLLPELSAEANPFRNR
LAESEASNDQAPVQMPRDYSEGASGLLRTGAKRD
>ECs1050 hypothetical protein
MIASKFGIGQQVRHSLLGYLGVVVDIDPVYSLSEPSPDELAVNDELRAAP
WYHVVMEDDNGLPVHTYLAEAQLSSELQDEHPEQPSMDELAQTIRKQLQA
PRLRN
>ECs3535 hypothetical protein
MTTLRQPYYELSPAVYNALVQAKTALENSTLDTTLMELIYLRVSQINGCA
FCLEMHSKALRKSGVPQHKLDALAGWRVSHHFDERERAALAWAESVTDIA
RTHAEDEVYQPLLEHFSAAEISDLTFAIGLMNCFNRLAVSMRM
>ECs3906 hypothetical protein
MKKFAAVIAVMALCSAPVMAAEQGGFSGPSATQSQAGGFQGPNGSVTTVE
SAKSLRDDTWVTLRGNIVERISDDLYVFKDASGTINVDIDHKRWNGVTVT
PKDTVEIQGEVDKDWNSVEIDVKQIRKVNP
>ECs4710 hypothetical protein
MAINFSPKVGEILECNFGNYPVSQNGPFSTTYYDGRIPPEMIKNRLVVVL
NGKINGNAFIVVPLSTTRDHDKLKRGMHVEIASNVINDLQFFDQQIRWAK
TDLVQQVSRNRLNRARTYRGYLNQCLPHELVADIQRAVIKSINAISLIN
>ECs0217 hypothetical protein
MNSNVLTQTIVTGSDPRGLPEFSAIREEINKASHPSQPELNWKLVESLAL
AIFKANGVDLHTATYYTLARTRTQGLAGFCEGAELLAAMVSHDWDKFWPQ
GGPARTEMLDWFNSRTGNILRQQISFAESDLPLIYRTERALQLICDKLQQ
VELKRVPRVENLLYFMQNTRKRLEPQLKSNTENAAQTTVRTLIYAPETQA
SSTPEAVVPPLPGLPEMKVEVRSLTENPPQASVIKQGSTVRGFIAGIACS
VAVASALWWWQVYPVQQQLLQVNDTAQGAATVWMASPELENYERRLQQLL
DTSPVQPLETGMQMMRVADSRWPESLQQQQASTQWNEALKTRAQSSPQLR
GWLQTRQDLHAFADLVMQREKEGLTLSYIKNVIWQAERGLGQETPVESLL
TQYQDARAQKQNTDALEKQINERLEGVLSRWLLLKNNTIPTIKKALNFNN
IHEYKGVLNGEFNLFNTKW
>ECs3634 hypothetical protein
MARSQANRASSDLQQTPGDEQKLQAWQQAQAQVTRTLGQLTIISERYPEL
KSQELYQNLMVQLEGSENRIAVARGRYIKAIEQYNVTIRKFPAVLTAKVM
DYTPKKNYLPDDVAAVSKAPTIDFSQNANAH
>ECs0797 hypothetical protein
MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGI
GGGNPLTSKVAIISRSSDPRADVDYLFAQVIVHEKRVDTTPNCGNMLSGV
GAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARID
GVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVI
IPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPK
PVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIV
PSVGYGNINIEHPSGGLDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP
>ECs2288 hypothetical protein
MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLT
LHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGPLIALCGMLI
IVVGWGRT
>ECs0161 hypothetical protein
MLVYWLDIVGTAVFAISGVLLAGKLRMDPFGVLVLGVVTAVGGGTIRDMA
LDHGPVFWVKDPTDLVVAMVTSMLTIVLVRQPRHLPKWMLPVLDAVGLAV
FVGIGVNKAFNAEAGPLIAVCMGVITGVGGGIIRDVLAREIPMILRTEIY
ATACIIGGIVHATAYYTFSVPLETASMMGMVVTLLIRLAAIRWHLKLPTF
ALDENGR
>ECs4113 hypothetical protein
MDIFSIANQHIRFAVKLATAIVLALFVGFHFQLETPRWAVLTAAIVAAGP
AFAAGGEPYSGAIRYRGFLRIIGTFIGCIAGLVIIIAMIRAPLLMILVCC
IWAGFCTWISSLVRIENSYAWGLAGYTALIIVITIQPEPLLTPQFAVERC
SEIVIGIVCAIMADLLFSPRSIKQEVDRELESLLVAQYQLMQLCIKHGDG
EVVDKAWGDLVRRTTALQGMRSNLNMESSRWARANRRLKAINTLSLTLIT
QSCETYLIQNTRPELITDTFREFFDTPVETAQDVHKQLKRLRRVIAWTGE
RETPVTIYSWVAAATRYQLLKRGVISNTKINATEEEILQGEPEVKVESAE
RHHAMVNFWRTTLSCILGTLFWLWTGWTSGSGAMVMIAVVTSLAMRLPNP
RMVAIDFIYGTLAALPLGLLYFLVIIPNTQQSMLLLCISLAVLGFFLGIE
VQKRRLGSMGALASTINIIVLDNPMTFHFSQFLDSALGQIVGCVLAFTVI
LLVRDKSRDRTGRVLLNQFVSAAVSAMTTNVARRKENHLPALYQQLFLLM
NKFPGDLPKFRLALTMIIAHQRLRDAPIPVNEDLSAFHRQMRRTADHVIS
ARSDDKRRRYFGQLLEELEIYQEKLCIWQAPPQVTEPVHRLAGMLHKYQH
ALTDS
>ECs0837 putative tail length tape measure protein precursor
MDQIANLVIDLGIDAAEFKNEIPRIKNLLNGAASDAERSSARMQRFMERQ
TQAARQTMQAASSAATAASVHAQTVEKSAQAHERMAREVEQTRQRMEALS
QKMREEQAQAMALAEAQDKAAAAFYRQIDSVKQAGAGLQELQRIQQQIRQ
ARNSGGIGQQDYLALISEVTAKTRVLTQAEEEATRQKVAFIRQLKEQATR
QNLSSSELLRAKAAQLGVSSAAEVYIRKMEQAGKATHSLGLKSAAARQEI
GVLIGELARGNLGALRGSGITLANRAGWIDTLMSPKGMMPGAVIGGIAAA
VYGLGKAWYDGQKEGEEFNRQLSLTGHYAGVTAGQLWTLSRAISGNGITQ
HAAAGALAQVVGSGAFRGNDIGMVARAAAQMERSVGQSVSDTINQFKRLK
DDPVNAAKALDNELHFLTATQLEQIRVLGEQGRSSDAARIAMSALAEETG
RRTADIDNNLNALGSTLKYLSDLWSRFWDAAMNIGREDSLDEQIAALQEK
VSRAKRLPWTASSSQVEYDQQRLNDLQEKKRQKDLQDAKEQAERNYQEQQ
KRRNAENAALNRMNETEAARHQREIARINAMQYADQAVRDAAIQRENERY
EKALASGKKKTRETRNDEATRLLLQYSQQQAQVEGQIAAARQSAGIATER
MTEARKQLLALQQRISDLDGKKLTADEKSVLARKDELIQALTLLDVKQQE
LQKQTALNELKKKTIQLTSQLAEEERAQRQQHDLDIATVGMGDQQRQRYQ
VQLSLRQKYQQQLEQLRRDSEQKGTYNTDDYRKAEQALTESLNRQLNENR
RYWQQLEVVQGNWKNGVLRAFQDFTVDADNTAETAEQVFSSAFSNMGNGL
ATFVTTGKLNFKSFTSSVLSDMAKILAQATMMKSIKGIGSVLGFDLSSLS
LNANGGIYQSADLSRYSGTVVNRPTFFAFAKGAGVMGEAGPEAILPLRRG
ADGKLGVVADIGGSGMAMFSPQYNIEINNDGTNGQIGPAALKAVYDLGKK
AAADFMQQQARDGGRLSGAYR
>ECs5530 hypothetical protein
MSRYQHTKGQIKDNAIEALLHDPLFRQRVEKNKKGKGSYMRKGKHGNRGN
WEASGKKVNHFFTTGLLLSGAC
>ECs4981 hypothetical protein
MTTRKKKTAVSEAAVMEAIREALEGADPRTAGLTEQLAKGYVDLLDGLPF
GETREYRVTFRELTAKDSIDAEAEAERVVETNNGPMLIASPSLRGVALLR
RQIAAVGDIEGPLSPRQIGQLSERDLSRLMAAVSLLDTALAGKLAADRGR
SGAVSGSD
>ECs1653 hypothetical protein
MNRIEHYHDWLRDAHAMEKQAEKMLESMASRIENYPELRSRIEQHISETK
NQLSQLESILDRNNISRSVIKDSMSKMAAFGQSIGGIFPSDEIVKGSISG
YVFEQFEIACYTSLLAAAKNAGDTASVPIIEAILNEEKQMAEWLLNHIPD
TTEQFMVRSEIDGVEAKK
>ECs3255 hypothetical protein
MKRLIMATMVTAILASSTVWAADNAPVAAQQQTQQVQQTQKTAAAAERIS
EQGLYAMRDVQVARLALFHGDPEKAKELTNEASALLSDDSTEWAKFAKPG
KKTNVNDDQYIVINASVGISESYVATPEKEAAIKIANEKMAKGDKKGAME
ELRLAGVGVMENQYLMPLKQTRNALADAQKLLDKKQYYEANLALKGAEDG
IIVDSEALFVN
>ECs4617 hypothetical protein
MKISRLGEAPDYRFSLANERTFLAWIRTALGFLAAGVGLDQLAPDFATPV
IRELLALLLCLFSGGLAMYGYLRWLRNEKAMRLKEDLPYTNSLLIISLIL
MVVAVIVMGLVLYAG
>ECs4320 hypothetical protein
MNVFSQTQRYKALFWLSLFHLLVITSSNYLVQLPVSIFGFHTTWGAFSFP
FIFLATDLTVRIFGAPLARRIIFAVMIPALLISYVISSLFYMGSWQGFGA
LAHFNLFVARIATASFMAYALGQILDVHVFNRLRQSRRWWLAPTASTLFG
NVSDTLAFFFIAFWRSPDAFMAEHWMEIALVDYCFKVLISIVFFLPMYGV
LLNMLLKRLADKSEINALQAS
>ECs2504 hypothetical protein
MGILSWIIFGLIAGILAKWIMPGKDGGGFFMTILLGIVGAVVGGWISTLF
GFGKVDGFNFGSFVVAVIGAIVVLFIYRKIKS
>ECs1852 hypothetical protein
MKYLLIFLLVLAIFVISVTLGAQNDQQVTFNYLLAQGEYRISTLLAVLFA
AGFAIGWLICGLFWLRVRVSLVRAERKIKRLENQLSPATDVAVVQHSSAA
KE
>ECs3921 hypothetical protein
MKRYTPDFPEMMRLCEMNFSQLRRLLPRNDAPGETVSYQVANAQYRLTIV
ESTRYTTLVTIEQTAPAISYWSLPSMTVRLYHDAMVAEVCSSQQIFRFKA
RYDYPNKKLHQRDEKHQINQFLADWLRYCLAHGAMAIPVY
>ECs5142 hypothetical protein
MTDHTMKKNPVSIPHTVWHADDIRRGEREAADALGLTLYELMLRAGEAAF
QVCRSAYPDARHWLVLCGHGNNGGDGYVVARLAKAVGIEVTLLAQESDKP
LPEEAALAREAWLNAGGEIHASNIVWPESVALIVDALLGTGLQQAPRESI
SQLIDHANSHPAPIVAVDIPSGLLAETGATPGAVINADHTITFIALKPGL
LTGKARDVTGQLHFDSLGLDSWLAGQETKIQRFSAEQLSHWLKPRRPTSH
KGDHGRLVIIGGDHGTAGAIRMTGEAALRAGAGLVRVLTRSENIAPLLTA
RPELMVHELTMDSLTESLEWADVVVIGPGLGQQEWGKKALQKVENFRKPM
LWDADALNLLAINPDKRHNRVITPHPGEAARLLGCSVAEIESDRLHCAKR
LVQRYGGVAVLKGAGTVVAAHPDALGIIDAGNAGMASGGMGDVLSGIIGA
LLGQKLSPYDAACAGCVAHGAAADVLAARFGTRGMLATDLFSTLQRIVNP
EVTDKNHDESSNSAP
>ECs3226 hypothetical protein
MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEK
ARSVESEPCKISPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR
>ECs2042 hypothetical protein
MRETVEIMRYPVTLTPAPEGGYMVSFVDIPEALTQGETVAEAMEAAKDAL
LTAFDFYFEDNELIPLPSPLNSHDHFIEVPLSVASKVLLLNAFLQSEITQ
QELARRIGKPKQEITRLFNLHHATKIDAVQLAAKALGKELSLVMV
>ECs1751 hypothetical protein
MHGLNKDIIFPLQIFALSNNVLYNFPEQGVVPVLYVIYAQDKADSLEKRL
SVRPAHLARLQLLHDEGRLLTAGPMPAVDSNDPGAAGFTGSTVIAEFESL
EAAQAWADADPYVAAGVYEHVSVKPFKKVF
>ECs0989 hypothetical protein
MKAFDLHRMAFDKVPFDFLGEVALRSLYTFVLVFLFLKMTGRRGVRQMSL
FEVLIILTLGSAAGDVAFYDDVPMVPVLIVFITLALLYRLVMWLMAHSEK
LEDLLEGKPVVIIEDGELAWSKLNNSNMTEFEFFMELRLRGVEQLGQVRL
AILETNGQISVYFFEDDKVKSGLLILPSDCTQRYKVVPESADYACIRCSE
IIHMKAGEKQLCPRCANPEWTKASRAKRVT
>ECs5400 hypothetical protein
MGVYKARRFSQSTKKLGIHDKVLMAAAEEVMQGIWEADLGSGVIKKRLPL
QQGKSGGARTIIFFKSANHVFFYDGWSKSGLSSKGSKEIEDDELAAYKKM
ANAFLAFSNKQIEDLIETGFLIEVKNER
>ECs0963 hypothetical protein
MVKSTSCTTIDFMNMSQLTERTFTSSESLSSLSLFLSLARGQCRPGKFWH
RRSFRQKFLLRSLIMPRLSVEWMNELSHWPNLNVLLTRQPRLPVRLHRPY
LAANLSRKQLLEALRYHYALLRGCMSAEEFSLYLNTPGLQLAKLEGKNGE
QFTLELTMMISMDKEGDSTILFRNSEGIPLAEITFTLCEYQGKRTMFIGG
LQGAKWEIPHQEIQNATKACHGLFPKRLVMEAACLFAQRLQVEQIIAVSN
ETHIYRSLRYRDKEGKIHADYNAFWESVGGVCDAERHYRLPAQIARKEIA
EIASKKRAEYRRRYEMLDAIQPQMATMFRG
>ECs0509 hypothetical protein
MKYVDGFVVAVPADKKDAYREMAAKAAPLFKEFGALRIVECWASDVPDGK
VTDFRMAVKAEENEEVVFSWIEYPSKEVRDAANQKMMSDPRMKEFGESMP
FDGKRMIYGGFESIIDE
>ECs5028 hypothetical protein
MNKDEAGGNWKQFKGKVKEQWGKLTDDDMTIIEGKRDQLVGKIQERYGYQ
KDQAEKEVVDWETRNEYRW
>ECs5063 hypothetical protein
MSALNSLPLPVVRLLAFFHEELSERRPGRVPQTVQLWVGCLLVILISMTF
EIPFVALSLAVLFYGIQSNAFYTKFVAILFVVATVLEIGSLFLIYKWSYG
EPLIRLIIAGPILMGCMFLMRTHRLGLVFFAVAIVAIYGQTFPAMLDYPE
VVVRLTLWCIVVGLYPTLLMTLIGVLWFPSRAISQMHQALNDRLDDAISH
LTDSLAPLPETRIEREALALQKLNVFCLADDANWRTQSAWWQSCVATVTY
IYSTLNRYDPTSFADSQAIIEFRQKLASEINKLQHAITEGQCWQSDWRIS
ESEAMAARECNLENICQTLLQLGQMDPNTPPTPAAKPPSMVADAFTNPDY
MRYAVKTLLACLICYTFYSGVDWEGIHTCMLTCVIVANPNVGSSYQKMVL
RFGGAFCGAILALLFTLLVMPWLDNIVELLFVLAPIFLLGAWIATSSERS
SYIGTQMVVTFALATLENVFGPVYDLVEIRDRALGIIIGTVVSAVIYTFV
WPESEARTLPQKLAGALGMLSKVMRIPRQQEVTALRTYLQIRIGLHAAFN
ACEEMCQRVALERQLDSEERALLIERSQTVIHQGRDLLHAWDATWNSAQA
LDNALQPDKAGQFADALEKYAAGLATALSRSPQITLEETPASQAILPTLL
KQEQHVCQLFARLPDWTAPALTPATEQAQGATQ
>ECs2869 hypothetical protein
MRKKLAFLDTSLDDLRAFPESSRQEIGYQLDRIQQGLNPYDWKPFSTIGP
GVREIRTRDADGIYRVMYVAKFEEAVYVLHCFQKKTQTTSQSDIDLAKRR
YKELVQERKNEN
>ECs5288 hypothetical protein
MGIVMTQQGDAVAGELATEKVGIKGYLAFFLTIIFFSGVFSGTDSWWRVF
DFSVLNGSFGQLPGANGATTSFRGAGGAGAKDGFLFALELAPSVILSLGI
ISITDGLGGLRAAQQLMTPVLKPLLGIPGICSLALIANLQNTDAAAGMTK
ELAQEGEITERDKVIFAAYQTSGSAIITNYFSSGVAVFAFLGTSVIVPLA
VILVFKFVGANILRVWLNFEERRNPTQGAQA
>ECs5234 hypothetical protein
MAQVINEMDVPSHSFVFHGTGERYFLICVVNVLLTIITLGIYLPWALMKC
KRYLYANMEVNGQRFSYGITGGNVFFSCLVFVFFYFAILMTVSADMPLIG
CVLTLSLLVLLIFMAAKGLRYQALMTSLNGVRFSFNCSMKGVWWVTFFLP
ILMAIGMGTVFFISTKMLHANSSSSVIVSVVLMAIVGIVSIGIFNGTLYS
LVMSFLWSNTSFGIHRFKVKLDTAYCIKYAILAFLALLPFLAVAGYIIFD
QILNAYDSSVYANDDIENLQQFMEMQRKMIIAQLIYYFGIAVSTSYLTVS
LRNHFMSNLSLNDGRIRFRSTLTYHGMLYRMCALVVISGITGGLAYPLLK
IWMIDWQAKNTYLLGDLDDLPLINKEEQPDKGFLASISRGIMPSLPFL
>ECs3769 hypothetical protein
MDINNKARIHWACRRGMRELDISIMPFFEHEYDSLSDDEKRIFIRLLECD
DPDLFNWLMNHGKPADAELEMMVRLIQTRNRERGPVAI
>ECs3449 hypothetical protein
MTENAVLQLRAERIARATRPFLARGNRVRRCQRCLLPEKLCLCSTITPAQ
AKSRFCLLMFDTEPMKPSNTGRLIADILPDTVAFQWSRTEPSQDLLDLVQ
NPDYQPMVVFPASYADEQREVIFTPPAGKPPLFIMLDGTWPEARKMFRKS
PYLDNLPVISVDLSRLSAYRLREAQAEGQYCTAEVAIALLDMAGDTGAAA
GLGEHFTRFKTRYLAGKTQHLGSITAEQLESV
>ECs1048 hypothetical protein
MKTGIVTTLIALCLPVSVFATTLRLSTDVDLLVLDGKKVSSSLLRGADSI
ELDNGPHQLVFRVEKTIHLSNSEERLYISPPLVVSFNTQLINQVNFRLPR
LENEREANHFDAAPRLELLDGDATPIPVKLDILAITSTAKTIDYEVEVER
YNKSAKRASLPQFATMMADDSTLLSGVSELDAIPPQSQVLTEQRLKYWFK
LADPQTRNTFLQWAEKQPSS
>ECs4444 hypothetical protein
MQPKIYWIDNLRGIACLMVVMIHTTTWYVTNAHSVSPVTWDIANVLNSAS
RVSVPLFFMISGYLFFGERSAQPRHFLRIGLCLFFYSAIALLYIALFTSI
NVELALKNLLQKPVFYHLWFFFAIAVIYLVSPLIQVKNVGGKMLLVLMVV
IGIIANPNTVPQKIDGFEWLPINLYINGDTFYYILYGMLGRALGMMDTQH
KALSWVSAALFATGVFIISRGTLYELQWRGNFADTWYPYCGPMVFICAIA
LLTLVKNTLDTRTIRGLGLISRHSLGIYGFHALIIHALRTRGIELKNWPI
LDIIWIFCATLAASLLLSMLVQRIDRNRLVS
>ECs3772 hypothetical protein
MQPNDITFFQRFQDDILAGRKTITIRDESESHFKTGDVLRVGRFEDDGYF
CTIEVTATSTVTLDTLTEKHAEQENMTLTELKKVIADIYPGQTQFYVIEF
KCL
>ECs5299 hypothetical protein
MNKLIELRRAKMLALSLLLIAAATFVVTLFLPPNFWVSGVKAIAEAAMVG
ALADWFAVVALFRRVPIPIISRHTAIIPRNKDRIGENLGQFVQKKFLDTQ
SLVALIRRHEPALLIGNWFSQPENARRVGQHLLQIMSGFLELTDDARIQR
LLKRAVHRAIDKVDLSGTSALMLESMTKNDRHQVLLDTLIAQLIALLQRD
KSRKFIAQQIVRWLESEHPLKAKILPTEWLGEHSAELVSDAVNSLLDDIS
RDRAHQIRHAFDRATFALIDKLKNDPEMTARADAVKSYLKEDEAFLSELW
GDLREWLKADINSEDSRVKERIARAGQWFGETLIADDALRASLNGHLEQA
AHRVAPEFSAFLTRHISDTVKSWDARDMSRQIELNIGKDLQFIRVNGTLV
GGCIGLILYLLSQLPALFPLGNF
>ECs1114 putative tail length tape measure protein
MSQPVGDLVIDLSLDAVRFDEQMSRVRRHFSGLDTDVRKTASAVEQGLSR
QALAAQKAGISVGQYKAAMRTLPAQFTDIATQLAGGQNPWLILLQQGGQV
KDSFGGMIPMFRGLAGAITLPMVGVTSLAVATGALVYAWYQGDSTLSAFN
KTLVLSGNQSGLTADRMLTLSRAGQAAGLTFNQARESLAALVNAGVRGGE
QFDAINQSVARFASASGVEVDKVAEAFGKLTTDPTSGLIAMVRQFRNVTA
EQIAYVAQLQRSGDEAGALQAANDIATKGFDEQTRRLKENMGTLETWADK
TGKAFKSMWDAILDIGRPESSADMLASAQKAFDEADKKWQWYQSRSQRRG
KTASFRANLQGAWNDRENARLGLAAATLQSDMEKAGELAARDRAERDASQ
LKYTGEAQKAYERLLTPLEKYTARQEELNKALKDGKILRADYNTLMAAAK
KDYESTLKKPKSSGVKVSAGERQEDQAHAALLALETELRTLEKHSGANEK
ISQQRRDLWKAENQYAVLKEAATKRQLSEQEKFLLAHKDETLEYKRQLAE
LGDKVEHQKRLNELAQQAVRFEEQQSAKQAAISAKARGLTDRQAQRESEA
QRLRDVYGDNPAALAKATSALKNTWSAEEQLRGSWMAGLKSGWGEWAESA
TDSFSQVKSAATQTFDGIAQNMAAMLTGAEADWRGFTRSVLSMLTEIFLK
QAMVGIVGSIGSAIGGAFGGGASASTGTAIQAAAANFHFATGGFTGTGGK
YEPAGIVHRGEFVFTKEATSRIGVGNLYRLMRGYAEGGYVGGAGSPAQMR
RAEGINFNQNNHVVIQNDGTNGQAGPQLMKAVYDMARKGAQDELRLQLRD
GGMLSGSGR
>ECs1244 hypothetical protein
MKKVLIAALISGVSFGAFAQQGGFQGPEAERSTVAQAKELKDDAWVILEG
SIVKKVGDERYEFRDNSGTIVTDIDDSIWAGQNVSPKDKVRIEGEIDKDL
SSVEVDVKALKLLK
>ECs3036 vancomycin sensitivity
MLKRVFLSLLVLIGLLLLTVLGLDRWMSWKTAPYIYDELQDLPYRQVGVV
LGTAKYYRTGVINQYYRYRIQGAINAYNSGKVNYLLLSGDNALQSYNEPM
TMRKDLIAAGVDPSDIVLDYAGFRTLDSIVRTRKVFDTNDFIIITQRFHC
ERALFIALHMGIQAQCYAVPSPKDMLSVRIREFAARFGALADLYIFKREP
RFLGPLVPIPAMHQVPEDAQGYPAVTPEQLLELQKKQGK
>ECs4699 hypothetical protein
MAESFTTTNRYFDNKHYPRGFSRHGDFTIKEAQLLERHGYAFNELDLGKR
EPVTEEEKLFVAVCRGEREPVTEAERVWSKYMTRIKRPKRFHTLSGGKPQ
VEGAEDYTDSDD
>ECs3374 putative dehydrogenase
MQLRKLLLPGLLSVTLLSGCSLFNSEEDVVKMSPLPTVENQFTPTTAWST
SVGSGIGNFYSNLHPALADNVVYAADRAGLVKALNADDGKEIWSVSLAEK
DGWFSKEPALLSGGVTVSGGHVYIGSEKAQVYALNTSDGTVAWQTKVAGE
ALSRPVVSDGLVLIHTSNGQLQALNEADGAVKWTVNLDMPSLSLRGESAP
ATAFGAAVVGGDNGRVSAVLMEQGQMIWQQRISQATGSTEIDRLSDVDTT
PAVVNGVVFALAYNGNLTALDLRSGQIMWKRELGSVNDFIVDGNRIYLVD
QNDRVMALTIDGGVTLWTQSDLLHRLLTSPVLYNGNLVVGDSEGYLHWIN
VEDGRFVAQQKVDSSGFQTEPVAADGKLLIQAKDGTVYSITR
>ECs1643 tail length tape measure protein precursor
MAEPVGDLVVDLSLDAARFDEQMARVRRHFSGTESDAKKTAAVVEQSMSR
QALAAQKAGISVGQYKAAMRMLPAQFTDVATQLAGGQSPWLILLQQGGQV
KDSFGGMIPMFRGLAGAITLPMVGATSLAVATGALAYAWYQGNSTLSDFN
KTLVLSGNQAGLTADRMLVLSRAGQAAGLTFNQTSESLSALVKAGVSGEA
QIASISQSVARFSSASGVEVDKVAEAFGKLTTDPTSGLTAMARQFHNVTA
EQIAYVAQLQRSGEEAGALQAANEAATKGFDDQTRRLKENMGTLETWADR
TARAFKSMWDAVLDIGRPDTAQEMLIKAEAAFKKADDIWNLRKDDYFVND
EARARYWDDREKARLALEAARKKAEQQSQQDKNAQQQSDTEASRLKYTEE
AQKAYERLQTPLEKYTARQEELNKALKDGKILQADYNTLMAAAKKDYEAT
LKKPKQSGVKVSAGDRQEDSAHAALLTLQAELRMLEKHAGANEKISQQRR
DLWKAESQFAVLEEAAQRRQLSAQEKSLLAHKDETLEYKRQLAVLGDKVT
YQEHLNALAQQADKFAQQQRAKRAAIDAKNRGLTDRQAAREATEQRLKEQ
YGDNPLALNNVMSEQKKTWAAEDQLRGSWMAGLRSGWSEWEESATDSMSQ
VKSAATQTFDGIAQNMAAMLTGSEQNWRSFTRSVLSMMTEILLKQAMVGI
VGSIGSAIGGGASASGGTAIQAAAAKFHFAAGGFTGTGGKYEPAGIVHRG
EFVFTKEATSRIGVGNLYRLMRGYATGGYVGTPGSMADSRSQASGTFEQN
NHVVINNDGTNGQIGPQALKAVYDVARKAAMDVVTGQMRDGGLFSGGGR
>ECs2236 putative tail assembly protein
MATTNAFSLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSLQVPG
FRRQMNEGWYQIRIAGYDTAPEAVYARLHEQLGEGTVIHIVPRLAGAGKG
GLQIVLGAAAIVGSFFTAGASMALWGSALAAGGFSATTMLFSLGASMILG
GVAQMLAPKAKTPDYRATDNGRQNTYFSSLDNMIAQGNPMPVPYGEMLVG
SRRISQDISTRDEGGGGTVVVVGRQG
>ECs4459 hypothetical protein
MNISEVDLHKLTVSDPFLGQYQQLVRDVVIPYQWDALNDRIPEAEPSHAI
ENFRIAAGLQEGEFYGMVFQDSDVAKWLEAVAWSLCQKPDAELEKTADEV
IELIASAQCEDGYLNTYFTVKAPEERWSNLAECHELYCAGHLIEAGVAFF
QATGKRRLLGVVCRLADHIDSVFGPDESKLHGYPGHPEIELALMRLYEVT
EEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAY
SQAHLPLAQQQTAIGHAVRFVYLMTGVAHLARLSHDDSKRQDCLRLWNNM
AQRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAESCASIGLMMFARRMLE
MEGDSQYADVMERALYNTVLGGMALDGKHFFYVNPLEVHPKSLKFNHIYD
HVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPREDALYINIYAGNSMEV
PVENGTLRLRVSGNYPWQEQVTIAVESPQPVRHTLALRLPDWCTQPQIIL
NGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVRRVYGNPLVRHVAGKV
AIQRGPLVYCLEKADNGESLHNLWLPTDAPFTTFEGKGLFSHKILIQAPG
YRYEQSNPEQQPLWHYDSAPAKRQTQTLTFIPWFSWANRGEGEMRIWVNE
EKHCHP
>ECs2031 hypothetical protein
MFPEYRDLISRLKNENPRFMSLFDKHNKLDHEIARKEGSDDRGYNAEVVR
MKKQKLQLKDEMLKILQHESVKEV
>ECs1676 hypothetical protein
MAEHLMSDVPFWQSKTLDEMSDAEWESLCDGCGQCCLHKLMDEDTDEIYF
TNVACRQLNIKTCQCRNYERRFEFEPDCIKLTRENLPTFEWLPMTCAYRL
LAEGKDLPAWHPLLTGSKAAMHGERISVRHIAVKESEVIDWQDHILNKPD
WAQ
>ECs3886 hypothetical protein
MERFLENAMYASRWLLAPVYFGLSLALVALALKFFQEIIHVLPNIFSMAE
SDLILVLLSLVDMTLVGGLLVMVMFSGYENFVSQLDISENKEKLNWLGKM
DATSLKNKVAASIVAISSIHLLRVFMDAKNVPDNKLMWYVIIHLTFVLSA
FVMGYLDRLTRHNH
>ECs4988 hypothetical protein
MAVTLTPHQRALLQLLPDGLAWDKRPSSVLAALCLGLSHSTERVSWTGNQ
MLAERFPDSSRLLLEDWERYLGLPECDMTGATIQERQRYAGNKYRMKPSL
NREFYIRFAAEFGYEIDIQPSPDSQWVSIVTINSETGYRNMNVLDDILTP
LRIYEGGALECILNRYKPAWQTFIYVYANSHEEENI
>ECs2056 putative receptor
MHLRHLFSLRLRGSLLLGSLLVASSFSTQAAEEMLRKAVGKGAYEMAYSQ
QENALWLATSQSRKLDKGGVVYRLDPVTLEVTQAIHNDLKPFGATINNTT
QTLWFGNTVNSAVTAIDAKTGEVKGRLVLDDRKRTEEVRPLQPRELVADD
ATNTVYISGIGKDSVIWVVDGENIKLKTAIQNTGKMSTGLALDSKGKRLY
TTNADGELITIDTADNKILSRKKLLDDGKEHFFINISLDTARQRAFITDS
KAAEVLVVDTRNGNILAKVAAPESLAVLFNPARNEAYVTHRQAGKVSVID
AKSYKVVKTFDTPTHPNSLALSADGKTLYVSVKQKSTKQQEATQPDDVIR
IAL
>ECs4591 hypothetical protein
MKRKPLIAFFIAIFIALTILILFPFNCTYKKSPQEHSLSFIKQLPNTVNL
SSLTYNKEDDFLYATQNSPAQLLKITKSGDIMDRAPLPFISDAETIEHIQ
GNIFAAVDEKTSELFFFTVTKDMHISFRNKIQLEKFNKKNRGFEGLAWKA
DDRMLFVAKERRPSKIFIYQLSPDLLSVKQATIPEALNDIRVNDISGLAF
NNESLMILSDESRKLLKFNLTEMSFIEMLDLTKGNHSLTSDLPQPEGIVT
LPDESIYVASEPDILAKFVPNK
>ECs2165 putative minor tail protein
MKTFRWKVKPDMEVNSQPSVREVRFGDGYSQRMAAGLNADLKTYRVTLSV
TREEARHLEAFLAEHGGWKAFLWKPPYAYRQIKVTCAGWSARVGMLRVEF
SAEFKQVVN
>ECs2520 hypothetical protein
MFAGLPSLTHEQQQKAVERIQELMAQGMSSGQAIALVAEELRANHSGERI
VARFEDEDE
>ECs0006 hypothetical protein
MLILISPAKTLDYQSPLTTTRYTLPELLDNSQQLIHEARKLTPPQISTLM
RISDKLAGINAARFHDWQPDFTPENARQAILAFKGDVYTGLQAETFSEDD
FDFAQQHLRMLSGLYGVLRPLDLMQPYRLEMGIRLENARGKDLYQFWGDI
ITNKLNEALAAQGDNVVINLASDEYFKSVKPKKLNAEIIKPVFLDEKNGK
FKIISFYAKKARGLMSRFIIENRLTKPEQLTGFNSEGYFFDEASSSNGEL
VFKRYEQR
>ECs5013 phosphate-starvation-inducible protein PsiE
MTSLSRPRVEFISTILQTVLNLGLLCLGLILVVFLGKETVHLADVLFAPE
QASKYELVEGLVVYFLYFEFIALIVKYFQSGFHFPLRYFVYIGITAIVRL
IIVDHKSPLDVLIYSAAILLLVVTLWLCNSKRLKRE
>ECs0777 hypothetical protein
MSSNFRHQLLSLSLLVGIAAPWAAFAQAPISSVGSGSVEDRVIQLERISN
AHSQLLTQLQQQLSDNQSDIDSLRGQIQENQYQLNQVVERQKQILLQIDS
LSSGGAAAQSTSGDQSGAAASTTPTADAGTANAGAPVKSGDANTDYNAAI
ALVQDKSRQDDAMVAFQNFIKNYPDSTYLPNANYWLGQLNYNKGKKDDAA
YYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKAVYQQVISKYPGT
DGAKQAQKRLNAM
>ECs2197 hypothetical protein
MKKVLIAALISGVSFGAFAQQGGFQGPEAERSTVAQAKELKDDAWVILEG
SIVKKVGDERYEFRDNSGTIVTDIDDSVWAGQNVSPKDKVRIEGEIDKDL
SSVEVDVKALKLLK
>ECs0735 hypothetical protein
MKNTELEQLINEKLNSAAISDYAPNGLQVEGKETVQKIVTGVTASQALLD
EAVRLGADAVIVHHGYFWKGESPVIRGMKRNRLKTLLANDINLYGWHLPL
DAHPELGNNAQLAALLGITVMGEIEPLVPWGELTMPVPGLELASWIEARL
GRKPLWCGDTGPEVVQRVAWCTGGGQSFIDSAARFGVDAFITGEVSEQTI
HSAREQGLHFYAAGHHATERGGIRALSEWLNENTDLDVTFIDIPNPA
>ECs2809 hypothetical protein
METTKPSFQDVLEFVRLFRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSD
YIKPGMSVEAIQGIIASMKGDYEDRVDDYIIKNAELSKERRDISKKLKAM
GEMKNGEAK
>ECs5108 hypothetical protein
MDQALLDGGYRCYTGEKIDVYFNTAICQHSGNCVRGNGKLFNLKRKPWIM
PDEVDVATVVKVIDTCPSGALKYRHK
>ECs1748 hypothetical protein
MDMDLNNRLTEDETLEQAYDIFLELAADNLDPADVLLFNLQFEERGGAEL
FDPAEDWQEHVDFDLNPDFFAEVVIGLADSEDGEINDVFARILLCREKDH
KLCHIIWRE
>ECs2696 putative methyl-independent mismatch repair protein
MLLAGSSLLTLLDDIATLLDDISVMGKLAAKKTAGVLGDDLSLNAQQVSG
VRANRELPVVWGVAKGSLINKVILVPLALIISAFIPWAITPLLMIGGAFL
CFEGVEKVLHMLEARKHKEDPAQSQQRLEKLAAQDPLKFEKDKIKGAIRT
DFILSAEIVAITLGIVAEAPLLNQVLVLSGIALVVTVGVYGLVGVIVKID
DLGYWLAEKSSALMQALGKGLLIIAPWLMKALSIVGTLAMFLVGGGIVVH
GIAPLHHAIEHFAGQQSAVVAMILPTVLNLILGFIIGGIVVLGVKAVAKM
RGQVH
>ECs0814 putative outer membrane protein
MKKSVIAGVFIALSFTTCSAIANSLALSLANDDAGKFQPILNDIYGNKHE
NRDDYSQGLFLGYSHDISDSSQLSLHIAQDIYSPSGSNKRHNTAVTGDRA
FSAYTHTGIEWNSLANDWIRYRLGTDIGVVGPDAGGQKVQNKAHEIIGAE
KYHAWDDQIENRYGYTVKGMLSMTPSMDILGANVGLYPEVSAVTGNLFQY
VAYGATIAIGNDKTFNSDNGFGLLAPRGLMHMSDTSGFKYKIFAGMERRD
VNRNYTLEGKTIQTKQTTVSLNKTVDEYQVGATIGYAPVAFTLAFNKVTS
EFKTGDDYSFINGAITFFF
>ECs0838 putative minor tail protein
METFHWKVRPDMNVVSEPKVVTVKLGDGYEQRRAAGLNNQLSTYSVTIRV
RKCEHPSLKAFLERHGGVRAFQWTPPYDWKPIRVVCRKWSASVGALWVTI
TADFEQVVA
>ECs2354 hypothetical protein
MNASSWSLRNLPWFRATLAQWRYALRNTIAMCLALTVAYYLNLDEPYWAM
TSAAVVSFPTVGGVISKSLGRIAGSLLGAIAALLLAGHTLNEPWFFLLSM
SAWLGFCTWACAHFTNNVAYAFQLAGYTAAIIAFPMVNITEASQLWDIAQ
ARVCEVIVGILCGGMMMMILPSSSDATALLTALKNMHARLLEHASLLWQP
ETTDAIRAAHEGVIGQILTMNLLRIQAFWSHYRFRQQNARLNALLHQQLR
MTSVISSLRRMLLNWPSPPGATREILEQLLTALASSQTDVYTVARIIAPL
RPTNVADYRHVAFWQRLRYFCRLYLQSSQELHRLQSDVDDHARLPRTSGL
ARHTDNAEAMWSGLRTFCTLMMIGAWSIASQWDAGANALTLAAISCVLYS
AVAAPFKSLSLLMRTLVLLSLFSFVVKFGLMVQISDLWQFLLFLFPLLAT
MQLLKLQMPKFAALWGQLIVFMGSFIAVTNPPVYNFADFLNDNLAKIVGV
ALAWLAFAILRPGSDARKSRRHIRALRRDFVDQLSRHPTLSESEFESLTY
HHVSQLSNSQDALARRWLLRWGIVLLNCSHVVWQLRDWESRSDPLSRVRD
NCISLLRGVMSERGVQQKSLAATLEELQRICDSLARHHQPAARELAAIVW
RLYCSLSQLEQAPPQGTLAS
>ECs0010 hypothetical protein
MGNTKLANPAPLGLMGFGMTTILLNLHNVGYFALDGIILAMGIFYGGIAQ
IFAGLLEYKKGNTFGLTAFTSYGSFWLTLVAILLMPKLGLTDAPNAQFLG
VYLGLWGVFTLFMFFGTLKGARVLQFVFFSLTVLFALLAIGNIAGNAAII
HFAGWIGLICGASAIYLAMGEVLNEQFGRTVLPIGESH
>ECs2723 putative minor tail protein
MQDIHGESLIESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT
WQGREYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG
ATVVRRRVYARFLDAVNFVAGNPEADPEQELSDRWVVEQMSELTAMTASF
VLATPTETDGALFPGRIMLANTCMWDYRGDECGYNGPAVADEFDNPTTDI
RKDRCSKCMRGCEMRGMVANFGGFLSINKLSQ
>ECs2587 hypothetical protein
MFKFLVLTLGIISCQVYAEDTLIVNDHDISAIKDCWQKNSGDDTDINVIK
SCLRQEYNLVDAQLNKAYGEAYRYIEQVPRTGAKKPDTEQLNLLKKSQRA
WLDFRDKECELILSNEDVQDLSNPYSESEWLSCMIIQTNTRTRQLQLYRN
SEDFYPSPLTRG
>ECs3129 hypothetical protein
MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPT
SFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYAT
AAMQRHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLY
TEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLV
RGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQKERLMTIAERL
RQEGHQIGWQEGKLEGLQEGMHEQAIKIALRMLEQGFDRDLVLAATQLSE
ADLAANNH
>ECs3551 hypothetical protein
MSEALSLFSLFASSFLSATLLPGNSEVVLVAMLLSGISHPWVLVLTATMG
NSLGGLTNVILGRFCPLRKTSRWQEKATGWLKRYGAVTLLLSWMPVVGDL
LCLLAGWMRISWGPVIFFLCLGKALRYVAVAAATVQGMMWWH
>ECs1924 hypothetical protein
MNLDDKSLFLDAMEDVQPLKRATDVHWHPTRNQRAPQRIDTLQLDNFLTT
GFLDIIPLSQPLEFRREGLQHGVLDKLRSGKYPQQASLNLLRQPVEECRK
MMFSFIQQAMADGLRNVLIIHGKGRDDKSHANIVRSYVARWLTEFDDVQA
YCTALPHHGGSGACYVALRKTAQAKQENWERHAKRSR
>ECs3822 hypothetical protein
MRIPRIYHPEPLTSHSHIALCEDAANHIGRVLRMGPGQALQLFDGSNQVF
DAEITSASKKSVEVKVLEGQIDDRESPLHIHLGQVMSRGEKMEFTIQKSI
ELGVSLITPLFSERCGVKLDSERLNKKLQQWQKIAIAACEQCGRNRVPEI
RPAMDLEAWCAEQDEGLKLNLHPRASNSINTLPLPVERVRLLIGPEGGLS
ADEIAMTARYQFTDILLGPRVLRTETTALTAITALQVRFGDLG
>ECs3965 hypothetical protein
MHLITQKALKDAAEKYPQHKTELVALGNTIAKGYFKKPESLKAVFPSLDN
FKYLDKHYVFNVGGNELRVVAMVFFESQKCYIREVMTHKEYDFFTAVHRT
KGKK
>ECs5232 hypothetical protein
MANPEQLEEQREETRLIIEELLEDGSDPDALYTIEHHLSADDLETLEKAA
VEAFKLGYEVTDPEELEVEDGDIVICCDILSECALNADLIDAQVEQLMTL
AEKFDVEYDGWGTYFEDPNGEDGDDEDFVDEDDDGVRH
>ECs4555 EspD
MLNVNNDTLSVTSGVNTASGTSGITQSETGLSLDLQLVKSMNSSAGWTES
SPLPTPPAGHSLVTPSAAEDVLSKLFGGISGEVTSRTEEAEPQRTSYPYL
SQVNTVDPQQMMMMVTLLSLDTSAQKVSSLKNSNEIYMDGQTKALENKTQ
EYKKQLEEQQKAEEKSQKSKIVGQVFGWLGVALTAVAAVFNPALWAVVAI
GATAMALQTAVDVMGENAPQGLKTAAQVFGGISMAASILTAGVGGVSSLL
SKFGNVANKIGSSVVKVVEKAAEALVKNVFAKISTVAEGVTNGIRSAGTT
ALNNEAAQLQMLSQLAAFAVQNLTRQSESLGESAKLELDKAASELQNQAS
YLQSVSQLMSDSARVNSRIVSGRI
>ECs5323 hypothetical protein
MILMTSGLNIEWSTFMASMLVGTIGIQWSRWYLAHPKVFTVAAVIPMFPG
ISAYTAMISAVKISQLGYSEPLMITLLTNFLTASSIVGALSIGLSIPGLW
LYRKRPRV
>ECs1901 hypothetical protein
MTEPLKPRIDFDGPLEVEQNPKFRAQQTFDENQAQNFAPATLDEAQEEEG
QVEAVMDAALRPKRSLWRKMVMGGLALFGASVVGQGVQWTMNAWQTQDWV
ALGGCAAGALIIGAGVGSVVTEWRRLWRLRQRAHERDEARDLLHSHGTGK
GRAFCEKLAQQAGIDQSHPALQRWYASIHETQNDREVVSLYAHLVQPVLD
AQARREISRSAAESTLMIAVSPLALVDMAFIAWRNLRLINRIATLYGIEL
GYYSRLRLFKLVLLNIAFAGASELVREVGMDWMSQDLAARLSTRAAQGIG
AGLLTARLGIKAMELCRPLPWIDDDKPRLGDFRRQLIGQVKETLQKGKTP
SEK
>ECs0085 hypothetical protein
MFRGATLVNLDSKGRLSVPTRYREQLLENAAGQMVCTIDIHHPCLLLYPL
PEWEIIEQKLSRLSSMNPVERRVQRLLLGHASECQMDGAGRLLIAPVLRQ
HAGLTKEVMLVGQFNKFELWDETTWHQQVKEDIDAEQLATGDLSERLQDL
SL
>ECs4118 hypothetical protein
MRRLPGILLLTGAALVVIAALLVSGLRIALPHLDAWRPEILNKIESATGM
PVEASQLSASWQNFGPTLEAHDIRAELKDGGEFSVKRVTLALDVWQSLLH
MRWQFRDLTFWQLRFRTNTPITSGGSDDSLEASHISDLFLRQFDHFDLRD
SEVSFLTPSGQRAELAIPQLTWLNDPRRHRAEGLVSLSSLTGQHGVMQVR
MDLRDDEGLLSNGRVWLQADDIDLKPWLGKWMQDNIALETAQFSLEGWMT
IDKGDVTGGDVWLKQGGASWLGEKETHTLSVDNLTAHITRENPGWQFSIP
DTRITMDGKPWPSGALTLAWIPEQDVGGKDNKRSDELRIRASNLELAGLE
GIRPLAAKLSPALGDVWRSTQPSGKINTLALDIPLQAADKTRFQASWSDL
AWKQWKLLPGAEHFSGTLSGSVENGLLTASMKQAKMPYETVFRAPLEIAD
GQATISWLNNDKGFQLDGRNIDVKAKAVHARGGFRYLQPANDEPWLGILA
GISTDDGSQAWRYFPENLMGKDLVDYLSGAIQGGEADNATLVYGGNPQLF
PYKHNEGQFEVLVPLRNAKFAFQPDWPALTNLGIELDFINDGLWMKTDGV
NLGGVRASNLTAVIPDYSKEKLLIDADIKGPGKAVGPYFDETPLKDSLGA
TLQELQLDGDVNARLHLDIPLNGELVTAKGEVTLRNNSLFIKPLDSTLKN
LSGKFSFINGDLQSEPLTASWFNQPLNVDFSTKEGAKAYQVAVNLNGNWQ
PAKTGVLPAAVNEALSGSVAWDGKVGIVLPYHAGATYNVELNGDLKNVSS
HLPSPLAKPAGEPLPVNVKVDGNLNSFELTGQAGADNHFNSRWLLGQKLT
LDRAIWAADSKTLPPLPEQSGVELNMPPMNGAEWLALFQKGAAESVGGAA
SFPQHITLRTPMLSLGNQQWNNLSIVSQPTANGTQVEAQGREINATLAMR
NNAPWLANIKYLYYNPSVAKTRGDSTPSSPFPTTERINFRGWSDAQIRCA
ECWFWGQKFGRIDSDITISGNTLTLTNGLIDTGFSRLTADGEWINNPGNE
RTSLKGKLRGQKIDAAAEFFGVTTPIRQSSFNVDYDLHWRKAPWQPDEAT
LNGIIHTQLGKGEITEINTGHAGQLLRLLSVDALMRKLRFDFRDTFGEGF
YFDSIRSTAWIKDGVMHTDDTLVDGLEADIAMKGSVNLVRRDLNMEAVVA
PEISATVGVAAAFAVNPIVGAAVFAASKVLGPLWSKVSILRYHISGPLDD
PQINEVLRQPRKEKAQ
>ECs0926 putative DEOR-type transcriptional regulator
MRRANDPQRREKIIQATLEAVKLYGIHAVTHRKIATLAGVPLGSMTYYFS
GIDELLLEAFSSFTEIMSRQYQAFFSDVSDAQGACQAITDMIYSSQVATP
DNMELMYQLYALASRKPLLKTVMQNWMQRSQQTLEQWFEPGTARALDAFI
EGMTLHFVTDRKPLSREEILRMVERVAG
>ECs1233 putative tail tip fiber protein
MTRKPWRAGKDLSTVVENMEIGTGQRGDGRHAFVTREELVGLKLARRRTS
GGASYALNPGIEIDSTLMTVDFPTKPLNFKATGGFGSVLLEWDMPNYRGH
SLTEIWRGTEDDLADAVLVATTPGQVYGDPVDPGWSGFYWIRFVNAAGVK
GPWNAEKGTQAQTQIGVKAIIDQIRDEAAKSPVVSELRKEIKNAQGQAVK
DAAIKTTEVVGTLREETTRTIGGIETRISTLDSSTSESLNEVDKRITKLD
KEGGEAFLAMWSKKAGVDGITAGIGIVAGKDSEGRPVSQVAISASQLFVF
DPNNPDNTAYPFAVSGGKVVIPKAMIYDAVIETLVSRKVVADEVKAGVSI
TSPVIRSAVIQNGNFQVDSQGNLNIGGLFSVTSQGQLTIRYSNQNVGLVI
RNDKIEVYDQNGRLAVRIGRLR
>ECs0606 hypothetical protein
MQFTFNEGHIQLPSQWQDQSMQVLVSTDNSGINLVITREPVSQGTLTPEL
YQETLALYQGKLDGYTEHACREITLAEAPAWLLDYSWNGPEDEGNQGRIS
QIAVFQRRGDTLLTFTFSTSLSLKNSQKTMLLEVIKSFTPLPPENDIQKD
QPR
>ECs3210 putative transporting ATPase
MNSTHHYEQLIEIFNSCFADDFNTRLIKGDDEPIYLPADAEVPYNRIVFA
HGFYASAIHEISHWCIAGKARREQVDFGYWYCPDGRDAQTQSQFEDVEVK
PQALDWLFCVAAGYPFNVSCDNLEGDFEPDRVVFQRRVHAQVMDYLTNGI
PERPARFIKALQNYYHTPELTAEQFPWPEALN
>ECs3781 hypothetical protein
MSAQPVDIQIFGRSLRVNCPPDQRDALNQAADDLNQRLQDLKERTRVTNT
EQLVFIAALNISYELAQEKAKTRDYAASMEQRIRMLQQTIEQALLEQGRI
TEKTNQNFE
>ECs3983 hypothetical protein
MILSIDSNDANTAPLHKKTISSLSGAVESMMKKLEDVGVLVARILMPILF
ITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRT
TALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAY
SIDRLLNKKW
>ECs0220 hypothetical protein
MTQPSAPAPQIAIDSHDDKAWRDTLLKVAAILCERQPDSPQGYRLRRHAL
WQSITSTPQAESDGRTPLAAVSADMVADYQSRLASADMALWQQVEKSVLL
APYWLDGHCLSAQTALRLGYKQVADTIRDEVIRFLERLPQLTGLLFNDRT
PFLSEQTKQWLAASPDGKVAPVAQIGEESQAARACFAGQGLEAALRYLDM
LPEGDPRDQFHRQYLAAQLTEEAGLIQLAQQQYRMLLMIGSQMMVSDWEP
SLLTQLEQKFTAEQ
>ECs3937 hypothetical protein
MAQEIELKFIVNHSAVEALRDHLNTLGGEHHDPVQLLNIYYETPDNWLRG
HDMGLRIRGENGRYEMTMKVAGRVTGGLHQRPEYNVALSEPTLDLAQLPT
EVWPNGELPADLASRVQPLFSTDFYREKWLVAVDGSRIEIALDQGEVKAG
EFAEPICELELELLSGDTRAVLKLANQLVSQTGLRQGSLSKAARGYHLAQ
GNPAREIKPTTILHVAAKADVEQGLEAALELALAQWQYHEELWVRGNDAA
KEQVLAAISLVRHTLMLFGGIVPRKASTHLRDLLTQCEATIASAVSAVTA
VYSTETAMAKLALTEWLVSKAWQPFLDAKAQGKISDSFKRFADIHLSRHA
AELKSVFCQPLGDRYRDQLPRLTRDIDSILLLAGYYDPVVAQAWLENWQG
LHHAIATGQRIEIEHFRNEANNQEPFWLHSGKR
>ECs1983 putative tail length tape measure protein
MATLRELIIKISANSQSFQSEIQRASRMGSEYYRTLQNGGRQAAAAARDQ
RRALAELNSQLTEIRGSAVGMAGAFAGAFATGHLISLADEWSSVNARLKQ
ASQSSDEFSSSQKVLMDISQRTGTAFSDNAALFARSAASMREYGYSADDV
LKVTEAISTGLKISGASTAEAGSVITQFSQALAQGVLRGEEFNSVNESGD
RIVRALAAGMGVARKDLKAMADDGQLTADKVVPALISQLEVLRDEYAAMP
ETVSDGITKVENAFMAWVGGANEASGVTKTLSGVLNGVAGQIDNVATAVG
ALVAVGVARYFGNMASGAMSATAGLVTAARNEVALAEAQFRGTQIATARA
RAAVYRAQQAVAAARGTEMQIAAEARLAATQERLNRNIAARSAAQNALNS
TTAVGSRLMSGALGLVGGVPGLVMLGAAAWYTLYQNQEQARESARQYALT
IDEIAHKTPSMSLPEASDNEGRTRAALTEQNRLIDEQASRVKSLQEKAQS
IQDVLAGLEDRRVALIRQQAAEQNKVYQSMLVMNGQHTEFNRLLGLGNEL
LQQRQGLVNVPLRLPQATLDDKQQSALTKTERELALSRLKGEEKERARLG
YAADDLGFVGDSYQEARQRYISNALEAWRNNEANKPKSRGGKSETEKAED
SFSRLLKQQKAQLALAGQNTELAKLKYQTAQGELKTLTEIQKQELLRNAA
LIDQQKIREQLRSREETLKNENAAARASNDAELLGYGQGERARERMRELQ
QIRDSFRQKDADLQSQYQTGDISEDFYRQALAQNAQYLSERLKDQETFYA
ESDAQRADWQKGLQEGFSNWVDNASDYASQAAQLATEGISGMVNNITEML
NGNKVEWRSWASSVLQEISKVLMNAAIVNGIKTAANGMSGAGGFLGSIGD
WLGGAVANAKGGVYTSANLSAYSNSIVDTPTYFAFAKGAGLMGEAGPEAI
MPLTRAADGSLGVRAVGSMNGSAGLVYSPVYHIAIQNDGTNGQIGPEAAG
SLVQLIDQRVQAVMLSMRRDGGMLSG
>ECs3456 hypothetical protein
MSKLIVPQWPLPKGVAACSSTRIGGVSLPPYDSLNLGAHCGDNPDHVEEN
RKRLFAAGNLPSKPVWLEQVHGKDVLNLTGEPYASKRADASYSNTPGRVC
AVMTADCLPVLFCNRAGTEVAAAHAGWRGLCAGVLEETVSCFADNPENIL
AWLGPAIGPRAFEVGAEVREAFMAVDAEASTAFIQHGDKYLADIYQLARQ
RLANVGVEQIFGGDRCTYTENETFFSYRRDKTTGRMASFIWLI
>ECs2278 hypothetical protein
MTLKEFIKSLRVGDAKKFAARLGVSPSYLSQMASGRTAISPTRALMIESA
TEGQVSRAELRPHDWELIWPEYASGIRLGQTHVVHAEGDCSACLSDGVDS
>ECs3050 hypothetical protein
MTNITLQKQHRTLWHFIPGLALSAVITGVALWGGSIPAVAGAGFSALTLA
ILLGMVLGNTIYPHIWKSCDGGVLFAKQYLLRLGIILYGFRLTFSQIADV
GISGIIIDVLTLSSTFLLACFLGQKVFGLDKHTSWLIGAGSSICGAAAVL
ATEPVVKAEASKVTVAVATVVIFGTVAIFLYPAIYPLMSQWFSPETFGIY
IGSTVHEVAQVVAAGHAISPDAENAAVISKMLRVMMLAPFLILLAARVKQ
LSGANSGEKSKITIPWFAILFIVVAIFNSFHLLPQSVVNMLVTLDTFLLA
MAMAALGLTTHVSALKKAGAKPLLMALVLFAWLIVGGGAINYVIQSVIA
>ECs5295 hypothetical protein
MGSLFNIYKDIFPTLGMYSGLKACHEKNNLPFDINTEIETIQKQINYDIN
HLNDGLIKRVLNLFIHLISNPDNLELTLNRYSSTTEQIIGRTKRNGLHEF
DDGDLKIIFNRQDDNESVLTVKDKDKDKDKDKDKDKDISHHCNVKTEQLQ
QFIKIMEQKAQLPIYIDKNNLKESIFSVLHNDPQQVDKDQHLPCEKFLKH
ACKSSNSFEVKLDATHQYQHLNNFMISFDPVENQLTIRDNNNKTETFSFT
NLQWENLLQYYKENHQQPNIAGSRNLTDNIDKIKNTISTSEIIECASPEI
RSSVLNDLYSIANFLPDNNLTPNESWKRFCETCERFYVAQKSITGDKSER
LTRKLSISDAGITMTFKIGDVVINTISTAIPEDATGQRCIEGLNLAEMDL
TDIDLSKMALRNVNFNGSILRNAKFSGTICEGVDFTDCDLRNAEFENASL
ENNDFRKVRHLTYVNFKNANLRNSNFNGKVLTGVTFTGSDLSNAYLEHID
FTTVILYETSKIPGIPGTPQIPGTPKVILTGAILNYSDLSGKDLSEYNLT
GILCMYTNFSNANLTNCKISNANFSNAKFYNTNCTGANCSNILFDYAWFD
NTIFIKTLFKNTCFYNVRAKNVYLEGAYLNNDNIVNQANNSTEKQSIDST
DKQANDSTVQQSIDSTVQQANDSTDKQANDNIDKQVNDSTDKQAKNSTEQ
QDSNSFNQARLKKEVNRRFSIPGLTSYQPTYIVEE
>ECs4847 hypothetical protein
MKDVVDKCSTKGCAIDIGTVIDNDNCTSKFSRFFATREEAESFMTKLKEL
AAAASSADEGASVAYKIKDLEGQVELDAAFTFSCQAEMIIFELSLRSLA
>ECs2774 hypothetical protein
MIKWPWKVQESAHQTALPWQEALSIPLLTCLTEQEQSKLVTLAERFLQQK
RLVPLQGFELDSLRSCRIALLFCLPVLELGLEWLDGFHEVLIYPAPFVVD
DEWEDDIGLVHNQRIVQSGQSWQQGPIVLNWLDIQDSFDASGFNLIIHEV
AHKLDTRNGDRASGVPFIPLREVAGWEHDLHAAMNNIQEEIELVGENAAS
IDAYAASDPAECFAVLSEYFFSAPELFAPRFPSLWQRFCQFYQQDPLQRL
HHANDTDSFSATNVH
>ECs3985 putative cytochrome
MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAG
GEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIV
FNCQAGTPGENRFGPDPKLEP
>ECs1805 minor tail protein
MQDIHEESLNESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT
WQGREYQAYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG
ATVVRRRVYARFLDAVNFVAGGRGYRAAEVKRIKKSRRVAGTDKD
>ECs4446 hypothetical protein
MEIVMKTSKTVAKLLFVVGALVYLVGLWISCPLLSGKGYFLGVLMTATFG
NYAYLRAEKLGQLDNFFTHICQLVALITIGLLFIGVLNAPINAYEMVIYP
IAFFVCLFGQMRLFRSV
>ECs3911 hypothetical protein
MLTVIAEIRTRPGQHHRQAVLDQFAKIVPTVLKEEGCHGYAPMVDCAAGV
SFQSMAPDSIVMIEQWESIAHLEAHLQTPHMKAYSEAVKGDVLEMNIRIL
QPGI
>ECs3942 hypothetical protein
MSAIAPGMILIAYLCGSISSAILVCRLCGLPDPRTSGSGNPGATNVLRIG
GKGAAVAVLIFDVLKGMLPVWGAYELGVSPFWLGLIAIAACLGHIWPVFF
GFKGGKGVATAFGAIAPIGWDLTGVMAGTWLLTVLLSGYSSLGAIVSALI
APFYVWWFKPQFTFPVSMLSCLILLRHHDNIQRLWRRQETKIWTKFKRKR
EKDPE
>ECs4873 hypothetical protein
MKASLALVSLLTAFTSYSLKSPAIPPTVVQIQANTNLAIADGARQQIGST
LFYDPAYVQLTYPGGDVPQERGVCSDVVIRALRSQKVDLQKLVHEDMAKN
FAEYPQKWQLKRPDSNIDHRRVPNLETWFTRHDKTRPISKNPSDYQAGDI
VSWRLDNGLAHIGVVSDGFARDGTPLVIHNIGAGAQEEDVLFSWRMVGHY
RYFVK
>ECs1984 putative minor tail protein
MAEIKTLHLVPREGMQVSEKPSVVRVRFGDGYEQRRPTGLNPQLKTFQAV
FRVTDESTRRWLEEFLSWHGGYRAFLWRPPKHNRTVRVVCREWSVTDNAR
YSDFSCTIEQVVN
>ECs5287 hypothetical protein
MTTQVRKNVMDMFIDGARRGFTIATTNLLPNVVMAFVIIQALKITGLLDW
VGHICEPVMALWGLPGEAATVLLAALMSMGGAVGVAASLATAGALTGHDV
TVLLPAMYLMGNPVQNVGRCLGTAEVNAKYYPHIITVCVINALLSIWVMQ
LIV
>ECs4209 hypothetical protein
MWRRLIYHPDINYALRQTLVLCLPVAVGLMLGELRFGLLFSLVPACCNIA
GLDTPHKRFFKRLIIGASLFATCSLLTQLLLAKDVPLPFLLTGLTLVLGV
TAELGPLHAKLLPASLLAAIFTLSLAGYMPVWEPLLIYALGTLWYGLFNW
FWFWIWREQPLRESLSLLYRELADYCEAKYSLLTQHTDPEKALPPLLVRQ
QKAVDLITQCYQQMHMLSAQNNTDYKRMLRIFQEALDLQEHISVSLHQPE
EVQKLVERSHAEEVIRWNAQTVAARLRVLADDILYHRLPTRFTMEKQIGA
LEKIARQHPDNPVGQFCYWHFSRIARVLRTQKPLYARDLLADKQRRMPLL
PALKSYLSLKSPALRNAGRLSVMLSVASLMGTALHLPKSYWILMTVLLVT
QNGYGATRLRIVNRSVGTVVGLIIAGVALHFKIPEGYTLTLMLITTLASY
LILRKNYGWATVGFTITAVYTLQLLWLNGEQYILPRLIDTIIGCLIAFGG
TVWLWPQWQSGLLRKNAHDALEAYQEAIRLILSEDPQPTPLAWQRMRVNQ
AHNTLYNSLNQAMQEPAFNSHYLADMKLWVTHSQFIVEHINAMTTLAREH
RALPPELAQEYLQSCEIAIQRCQQRLEYDEPGSSGDANIMDAPEMQPHEG
AAGTLEQHLQRVIGHLNTMHTISSMAWRQRPHHGIWLSRKLRDSKA
>ECs5269 hypothetical protein
MTTWTVRVFTTAEIIYRKTVIALVCHLNCSRQETVTMNKTITALAILMAS
FAANASVLPETPVPFKSGTGAIDNDTVYIGLGSAGTAWYKLETQAKDKKW
TALAAFPGGPRDQATSAFIDGNLYVFGGIGKNSEGLTQVFNDVHKYNPKT
NSWVKLISHAPMGMAGHVTFVHNGKAYVTGGVNQNIFNGYFEDLNEAGKD
STAVDKINAHYFDKKAEDYFFNKFLLSFDPSTQQWSYAGESPWYGTAGAA
VVNKGDKTWLINGEAKPGLRTDAVFELDFTGNNLKWNRLAPVSSPDGVAG
GFAGISNDSLIFAGGAGFKGSRENYQNGKNYAHEGLKKSYSTDIHLWHNG
KWDKSGELSQGRAYGVSLPWNNSLLIIGGETAGGKAVTDSVLISVKDNKV
TVQN
>ECs0674 hypothetical protein
MKLQLVAVGTKMPDWVQTGFTEYLRRFPKDMPFELIEIPAGKRGKNADIK
RILDKEGEQMLAAAGKNRIVTLDIPGKPWDTPQLAAELERWKLDGRDVSL
LIGGPEGLSPACKAAAEQSWSLSALTLPHPLVRVLVAESLYRAWSITTNH
PYHRE
>ECs5324 putative structural protein
MDSHYLNNTQHVYDKGRVMQTEQQRAVTRLCIQCGLFLLQHGAESALVDE
LSSRLGRALGMDSVESSISSNAIVLTTIKDGQCLTSTRKNHDRGINMHVV
TEVQHIVILAEHHLLDYKGVEKRFSQIQPLRYPRWLVALMVGLSCACFCK
LNNGGWDGAVITFFASTTAMYIRQLLAQRHLHPQINFCLTAFAATTISGL
LLQHPTFSNTPTIAMAASVLLLVPGFPLINAVADMFKGHINTGLARWAIA
SLLTLATCVGVVMALTIWGLRGWV
>ECs0675 hypothetical protein
MQGKALQDFVIDKIDDLKGQDIIALDVQGKSSITDCMIICTGTSSRHVMS
IADHVVQESRAAGLLPLGVEGENSADWIVVDLGDVIVHVMQEESRRLYEL
EKLWS
>ECs3108 hypothetical protein
MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPAEGEDASFSQSINY
PASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDG
SFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWD
TDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPV
HGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGEL
TLVKSFDW
>ECs3829 hypothetical protein
MDGVMSAVTVNDDGLVLRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQ
ANSHLVKFLGKQFRVAKSQVVIEKGELGRHKQIKIINPQQIPPEVAALIN
>ECs2574 hypothetical protein
MAGHSKWANTRHRKAAQDAKRGKIFTKIIRELVTAAKLGGGDPDANPRLR
AAIDKALSNNMTRDTLNRAIARGVGGDDDANMETIIYEGYGPGGTAIMIE
CLSDNRNRTVAEVRHAFSKCGGNLGTDGSVAYLFSKKGVISFEKGDEDTI
MEAALEAGAEDVVTYDDGAIDVYTAWEEMGKVRDALEAAGLKADSAEVSM
IPSTKADMDAETAPKLMRLIDMLEDCDDVQEVYHNGEISDEVAATL
>ECs0192 hypothetical protein
MALKATIYKATVNVADLDRNQFLDASLTLARHPSETQERMMLRLLAWLKY
ADERLQFTRGLCADDEPEAWLRNDHLGIDLWIELGLPDERRIKKACTQAA
EVALFTYNSRAAQIWWQQNQSKCAQFANLSVWYLDDEQLAKVSAFADRTM
TL
>ECs1491 hypothetical protein
MMIKTRFSRWLTFFTFAAAVALALPAKANTWPLPPAGSRLVGENKFHVVE
NDGGSLEAIAKKYNVGFLALLQANPGVDPYVPRAGSVLTIPLQTLLPDAP
REGIVINIAELRLYYYPPGKNSVTVYPIGIGQLGGDTLTPTMVTTVSDKR
ANPTWTPTANIRARYKAQGIELPAVVPAGPDNPMGHHAIRLAAYGGVYLL
HGTNADFGIGMRVSSGCIRLRDDDIKTLFSQVTPGTKVNIINTPIKVSAE
PNGARLVEVHQPLSEKIDDDPQLLPITLNSAMQSFKDAAQTDAEVMQHVM
DVRSGMPVDVRRHQVSPQTL
>ECs2051 hypothetical protein
MNQSLTLAFLIAAGIGLVVQNSLMVRITQTSSTILIAMLLNSLVGIVLFV
SILWFKQGMAGFGELVSSVRWWTLIPGLLGSFFVFASISGYQNVGAATTI
AVLVASQLIGGLVLDIFRSHGVPLRALFGPICGAILLVVGAWLVARRSF
>ECs4985 hypothetical protein
MMNDEVISRLLAPVMRGVRLLFGRGVLTGTTDTLKIQNVQITGMDGETFD
DVERPQQYGQISVPLPGAEVFLACAGGQRDQAVVLVVEDRRSRPTGLTAG
DTGVYHHEGHRIRLTKNGRIIVTCKTLEIYADEGVQVDTPEAHFTGNVTV
DKNLHVKGNVSIDGTGRSQGTFTMSEAVIAGITYSGHVHHDNGEGSKRGG
PENG
>ECs4402 hypothetical protein
MTQENEIKRPTQDLEHEPIKQLDNSEKGGKVSQALETVTTTAEKVQRQPV
IAHLIRATERFNDRLGNQFGAAITYFSFLSMIPILMVSFAAGGFVLASHP
MLLQDIFDKILQNISDPTLAATLKNTINTAVQQRTTVGLVGLAVALYSGI
NWMGNLREAIRAQSRDVWERSPQDQEKFWVKYLRDFISLIGLLIALIVTL
SITSVAGSAQQMIISALHLNSIEWLKPTWRLIGLAISIFANYLLFFWIFW
RLPRHRPRKKALIRGTFLAAIGFEVIKIVMTYTLPSLMKSPSGAAFGSVL
GLMAFFYFFARLTLFCAAWIATAEYKDDPRMPGKTQP
>ECs2913 hypothetical protein
MTIKNKMLLGALLLVTSAAWAAPATAGSTNTSGISKYELSSFIADFKHYK
PGDTVPEMYRTDEYNIKQWQLRNLPAPDAGTHWTYMGGAYVLISDTDGKI
IKAYDGEIFYHR
>ECs4971 hypothetical protein
MAQGIDLGYAATLPSKEAVAYFRAKGAHISWNWFETDADVHARSFTAAKA
ARLDVLTTLQAEVQRAIDEGISQKAFIRTLTPRLQKLGWWGKQIVVDSAG
NAEEVQLGSPRRLALIYNVNTRVAYNAGRYTQMMNNTDTHPFWQYVAVMD
SRTRPSHSTLNGLVFRYDDPFWKTHYPPNGWNCRCRVRPLSQARLDAMGL
SVSSGEDHLSTRNVEAGVDKQTGEVREMPVTTYSDGTRTMTPDVGWSYNP
GSAAFGTDQALIRKLIEVKSPALREMVVQEMNNSPERQLAFRIWAKNIMK
TRRGGHDIRTLGFMTESIAQAVESRTGTPPARLLAMSGKNVLHADSVKHQ
NDGIALTPEDFAQLPAMLAAPDAVLWDHVHQNLLYITETRDGTAKIAVNA
PYGVKRQPDKLDVVINAYRVNKFDIEKAIEGGKLELLEGKL
>ECs4976 hypothetical protein
MNYATETDMRARYREDLLRPLLAVPRSDEPDTRKLNRALTDASALIDSYL
SARYTLPLEVIPAVLVQHCCAIAFYYLCDQRASDQARDRYREALAWLKDV
MNGNVPVGVDTNGAAPESGDLPQVQSDAAVFGRNQKGFI
>ECs5050 hypothetical protein
MNGTIYQRIEDNAHFRELVEKRQRFATILSIIMLAVYIGFILLIAFAPGW
LGTPLNPNTSVTRGIPVGVGVIVISFVLTGIYIWRANGEFDRLNNEVLHE
VQAS
>ECs1003 hypothetical protein
MLFTLKKVIGNMLLPLPLMLLIIGAGLALLWFSRFQKTGKIFISIGWLAL
LLLSLQPVADRLLRPIESTYPTWNNSQKVDYIVVLGGGYTWNPQWAPSSN
LINNSLPRLNEGIRLWRENPGSKLIFTGGVAKTNTVSTAEVGARVAQSLG
VPREQIITLDLPKDTEEEAAAVKQAIGDAPFLLVTSASHLPRAMIFFQQE
GLNPLPAPANQLAIDSPLNPWERAIPSPVWLMHSDRVGYETLGRIWQWLK
GSSGEPRQE
>ECs4764 hypothetical protein
MPFKPLVTAGIESLLNTFLYRSPALKTARSRLLGKVLRVEVKGFSTSLIL
VFSERQVDVLGEWAGDADCTVIAYASVLPKLRDRQQLAALIRSGELEVQG
DIQVVQNFVALAGLAEFDPAELLAPYTGDIAAEGISKAMRGGAKFLHHGI
KRQQRYVAEAITEEWRMAPGPLEVAWFAEETAAVERAVDALIKRLEKLEA
K
>ECs1115 putative minor tail protein
MKTFRWKVKPDMEVNSQPSVREVRFGDGYSQRMAAGLNADLKTYRVTLSV
TREEARHLEAFLAEHGGWKAFLWTPPYAWRQIKVTCAAWSSRVRMLRVEF
SAEFKQVVN
>ECs0586 UDP-2,3-diacylglucosamine hydrolase
MATLFIADLHLCVEEPAITAGFLRFLAGEARKADALYILGDLFEAWIGDD
DPNPLHRQMAAAIKAVSDSGVPCYFIHGNRDFLLGKRFARESGMTLLPEE
KVLELYGRRVLIMHGDTLCTDDAGYQAFRTKVHKPWLQTLFLALPLFVRK
RIAVRMRANSKEANSSKSLAIMDVNQNAVVSAMEKHQVQWLIHGHTHRPA
VHELIANQQPAFRVVLGAWHTEGSMVKVTADDVELIHFPF
>ECs3611 hypothetical protein
MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDD
TERLNAFNRHYSLVVCASRNPRWARDYHTVQMPKEVRKARYFSRREELSA
PDLLSAIISRRDYYTDAWWMVAVATTPDAPYSLEQLQDGLRHPVFPLYLG
RKSHPLALPLAPLLLEGNASDVLRNAYQQYQDSFRELKVSLPKLQDECWW
EGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE
>ECs0472 hypothetical protein
MMKTITKQPILFTDVPVADLRNSMKQDLNQNLIERLWNKIRDFFLDSDKQ
KAFKSIHKYINTLSVLNYNSALTPDPNFNIDATSDLDSYLKLDFDRLSPK
QKQTTLCCFWNKIASSLPEPYNSTIKHNIIFYKDGENLMIRGTISIVNEV
VKTYSLPIEKDDNGYYDFSGLYLAHSNISGKDPNKDPDIDFGIDMGNCNC
SNVNFEHTYFYGVKFTNANCTNANFNNCRFKKCDLTNMNCTGAILDNAMI
YGKEKEPEMQYPEADQIIQRITYQKSDGNETKGMILTNCSCVKTTFNWAD
LSESDCQNVDFSEANLSNTILPDIVRMKGTKLYRTDLFNPILKTEAESTE
EKDISPLAKIILDYIESDKNPESLNFEEKSTVIKIKQDIDNFIFYNQHLK
KIFNRAMNLQEKISRKKYNEFFKYIQAEAKQYFKDQYKLTKNDYLKKVPL
TAQLIAKYKMDDQLDQLLVTREIQDEIKSKIQDKIDELSKNLFNTMTETI
ENNFDDIFRQQSENMSNYYEFVD
>ECs2710 hypothetical protein
MRLTAKQVTWLKVCLHLAGLLPFLWLVWAINHGGLGADPVKDIQHFTGRT
ALKFLLATLLITPLARYAKQPLLIRTRRLLGLWCFAWATLHLTSYALLEL
GVNNLALLGKELITRPYLTLGIISWIILLALAFTSTQAMQRKLGKHWQQL
HNFVYLVAILAPIHYLWSVKIISPQPLIYAGLAVLLLALRYKKSRSLFNR
LRKQVHNKLSV
>ECs4788 hypothetical protein
MKPSSPSRSKGHAKARRKTREELNQEARDRKRQKKRRGHAPGSRAAGGNN
TSGSKGQNAPKDPRIGSKTPIPLGVTEKVTKQHKPKSEKPMLSPQAELEL
LETDERLDALLERLEAGETLSAEEQSWVDAKLDRIDELMQKLGLSYDDDE
EEEEDEKQEDMMRLLRGN
>ECs2125 hypothetical protein
MHVTLVEINVHEDKVDEFIEVFRQNHLGSVQEEGNLRFDVLQDPEVNSRF
YIYEAYKDEDAVAFHKTTPHYKTCVAKLESLMTGPRKKRLFNGLMP
>ECs0858 putative structural protein
MRNRTLADLDRVVALGGGHGLGRVLSSLSSLGSRLTGIVTTTDNGGSTGR
IRRSEGGIAWGDMRNCLNQLITEPSVASAMFEYRFGGNGELSGHNLGNLM
LKALDHLSVRPLEAINLIRNLLKVDAHLIPMSEHPVDLMAIDDQGHEVYG
EVNIDQLTTPIQELLLTPNVPATREAVHAINEADLIIIGPGSFYTSLMPI
LLLKEIAQALRRTPAPMVYIGNLGRELSLPAANLKLESKLAIMEQYVGKK
VIDAVIVGPKVDVSAVKERIVIQEVLEASDIPYRHDRQLLHSALEKALQA
LG
>ECs0866 hypothetical protein
MSKSHPRWRLAKKILTWLFFIAVIVLLVVYAKKVDWEEVWKVIRDYNRVA
LLSAVGLVVVSYLIYGCYDLLARFYCGHKLAKRQVMLVSFICYAFNLTLS
TWVGGIGMRYRLYSRLGLPGSTITRIFSLSITTNWLGYILLAGIIFTAGV
VELPDHWYVDQTTLRILGIGLLMIIAVYLWFCAFAKHRHMTIKGQKLVLP
SWKFALAQMLISSVNWMVMGAIIWLLLGQSVNYFFVLGVLLVSSIAGVIV
HVPAGIGVLEAVFIALLAGEHTSKGTIIAALLAYRVLYYFIPLLLALICY
LLLESQAKKLRAKNEAAM
>ECs5312 hypothetical protein
MFGNLGQAKKYLGQAAKMLIGIPDYDNYVEHMKTNHPDKPYMSYEEFFRE
RQNARYGGDGKGGMRCC
>ECs0524 hypothetical protein
MFGKGGLGNLMKQAQQMQEKMQKMQEEIAQLEVTGESGAGLVKVTINGAH
NCRRVEIDPSLLEDDKEMLEDLVAAAFNDAARRIEETQKEKMASVSSGMQ
LPPGFKMPF
>ECs0197 hypothetical protein
MSSFQFEQIGVIRSPYKEKFAVPRQPGLVKSANGELHLIAPYNQADAVRG
LEAFSHLWILFVFHQTMEGGWRPTVRPPRLGGNARMGVFATRSTFRPNPI
GMSLVELKEVVCHKDCVILKLGSLDLVDGTPVVDIKPYLPFAESLPDASA
SYAQSAPAAEMAVSFTAEVEKQLLTLEKRYPQLTLFIREVLAQDPRPAYR
KGEETGKTYAVWLHDFNVRWRVTDAGFEVFALEPR
>ECs1507 hypothetical protein
MKKDNYSFKRACAVVGGQSAMARLLGVSPPSVNQWIKGVRQLPAERCPAI
ERATKGGVLCEELRPDVDWTYLRRSSCYSQNMSMKQPNDENDHTRSIKRQ
MIHENQT
>ECs2530 hypothetical protein
MTITDLVLILFIAALLAFAIYDQFIMPRRNGPTLLAIPLLRRGRIDSVIF
VGLIVILIYNNVTNHGALITTWLLSALALMGFYIFWIRVPKIIFKQKGFF
FANVWIEYSRIKAMNLSEDGVLVMQLEQRRLLIRVRNIDDLERIYKLLVS
TQ
>ECs3986 putative cytochrome
MQWYLSVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLELPY
LSMLYLLATFLPVLALAIRRLHDTDRSGAWALLFFVPFIGWLVLLVFFCT
EGTSGSNRYGNDPKFGSN
>ECs1044 hypothetical protein
MAFMLSPLLKRYTWNSAWLYYARIFIALCGTTAFPWWLGDVKLTIPLTLG
MVAAALTDLDDRLAGRLRNLIITLFCFFIASASVELLFPWPWLFAIGLTL
STSGFILLGGLGQRYATIAFGALLIAIYTMLGTSLYEHWYQQPMYLLAGA
VWYNVLTLIGHLLFPVRPLQDNLARCYEQLARYLELKSRMFDPDIEDESQ
APLYDLALANGLLMATLNQTKLSLLTRLRGDRGQRGTRRTLHYYFVAQDI
HERASSSHIQYQTLREHFRHSDVLFRFQRLMSMQGQACQQLSRCILLRQP
YQHDPHFERAFTHIDAALERMRDNGAPADLLKTLGFLLNNLRAIDAQLAT
IESEQAQALPHNNDENELADDSPHGLSDIWLRLSRHFTPESALFRHAVRM
SLVLCFGYAIIQITGMHHGYWILLTSLFVCQPNYNATRHRLKLRIIGTLV
GIAIGIPVLWFVPSLEGQLVLLVITGVLFFAFRNVQYAHATMFITLLVLL
CFNLLGEGFEVALPRVIDTLIGCAIAWAAVSYIWPDWQFRNLPRMLERAT
EANCRYLDAILEQYHQGRDNRLAYRIARRDAHNRDAELASVVSNMSSEPN
VTPQIREAAFRLLCLNHTFTSYISALGAHREQLTNPEILAFLDDAVCYVD
DALHHQPADEERVNEALASLKQRMQQLEPRADSKEPLVVQQVGLLIALLP
EIGRLQRQITQVPQETPVSA
>ECs4633 hypothetical protein
MGLFDEVVGAFLKGDAGKYQAILSWVEEQGGIQVLLEKLQSGGLGAILST
WLSNQQGNQPVSGEQLESALGTNAVSDLGQKLGVDTSTASCLLAEQLPKI
IDALSPQGEVSPQANNDLLSAGMELLKGKLFR
>ECs2410 hypothetical protein
MDNAVDRHVFYISDGTAITAEVLGHAVMSQFPVTISSITLPFVENESRAR
AVKDQIDAIYHQTGVRPLVFYSIVLPEIRAIILQSEGFCQDIVQALVAPL
QQEMKLDPTPIAHRTHGLNPNNLNKYDARIAAIDYTLAHDDGISLRNLDQ
AQVILLGVSRCGKTPTSLYLAMQFGIRAANYPFIADDMDNLVLPASLKPL
QHKLFGLTIDPERLAAIREERRENSRYASLRQCRMEVAEVEALYRKNQIP
WINSTNYSVEEIATKILDIMGLSRRMY
>ECs3044 hypothetical protein
MERNVTLDFVRGVAILGILLLNISAFGLPKAAYLNPAWYGAITPQDAWTW
AFLDLIGQVKFLTLFALLFGAGLQMLLPRGRRWIQSRLTLLVLLGFIHGL
LFWDGDILLAYGLVGLICWRLVRDAPSVKSLFNTGVMLYLVGLGVLLLLG
LISDSQTSRAWTPDASAILYEKYWKLHGGVEAISNRADGVGNSLLALGAQ
YGWQLAGMMLIGAALMRSGWLKGQFSLRHYRRTGFVLVAIGVIINLPAIA
LQWQLDWAYRWCAFLLQMPRELSAPFQAIGYASLFYGFWPQLSRFKLVLA
IACVGRMALTNYLLQTLICTTLFYHLGLFMQFDRLELLAFVIPVWLANIL
FSVIWLRYFRQGPVEWLWRQLTLRAAGPAISKTSR
>ECs0106 hypothetical protein
MQTQVLFEHPLNEKMRTWLRIEFLIQQLTVNLPIVDHAGALHFFRNVSEL
LDVFERGEVRTELLKELDRQQRKLQTWIGVPGVDQSRIEALIQQLKAAGS
VLISAPRIGQFLREDRLIALVRQRLSIPGGCCSFDLPTLHIWLHLPQAQR
DCQVETWIASLNPLTQALTMVLDLIRQSAPFRKQTSLNGFYQDNGGDADL
LRLNLSLDSQLYPQISGHKSRFAIRFMPLDTENGQVPERLDFELACC
>ECs2240 putative tail length tape measure protein
MATLRELIIKISANSQSFQSEIQRASRMGSEYYRTLQNGGRQAAAAAREQ
RRALAELHSQLTEIRASAVGMTGAFAGAFATGHLISLADEWSSVNARLKQ
ASQSSDEFASSQKVLMDISQRTGTAFSDNAALFARSAASMREYGYSADDV
LKVTEAISTGLKISGASTAEAGSVITQFSQALAQGVLRGEEFNSVNESGD
RIVRALAAGMGVARKDLKAMADDGKLTADKVVPALISQLGILRDEYAAMP
ETVSSSITKVENAFMAWVGGANEASGVTKTLSGMLNGVAGQIDNVATAVG
ALVAVGVARYFGNMASGAMSATAGLVTAARNEVALAEAQFRGTQIATARA
RAAVYRAQQAVAAARGTEMQIAAEARLAATQERLNRNIAARSAAQNALNS
TTAVGSRLMSGALGLVGGVPGLVMLGAAAWYTLYQNQEQARESARQYALT
IDEIAHKTPSMSLPEASDNEGRTRAALTEQNRLIDEQASRVKSLQEKIAG
YQYVLANPGWTTGDGFMINHLTSVKTVTEGLAQATEQLAVEQSRLAQMQE
KAQSIQDVLAGLEDRRVALIRQQAAEQNKVYQSMLVMNGQYTEFNRLLGL
GNELLQQRQGLVNVPLRLPQATLDDKQQSALTKTERELALSRLKGEEKER
VRLGYAADDLGFVGDPYQEARQRYISNALEAWRNNEVNKPKSRGGKSETE
KAEDSFSRLLKQQKEQLALAGQNTELAKLKYQTALGELKTLSEIQKQELL
RNAALIDQQKIREQLRYREETLKNDNVAARASNESELLGYGQGERARERM
RELQQIRDSFRQKDADLQSQYQTGDISEDFYRQALAQNAQYLSERLKDQA
VFYAESDVQRADWQKGLQEGFSNWVDNASDYASQAAQLATEGISGMVNNI
TEMLNGNKVEWRSWASSVLQEISKVLMNAAIVNGIKTAANGMSGAGGFLG
SIGDWLGGAVANAKGGVYTSANLSAYSNSIVDTPTYFAFAKGAGLMGEAG
PEAIMPLTRAADGSLGVRAVGSMNGSAGLVYSPVYHIAIQNDGANGQIGP
EAAGSLVQLIDQRVQAVMLSMRRDGGMLSG
>ECs0732 hypothetical protein
MNQQRFDDSTLIRIFALHELHRLKEHGLTRGALLDYHSRYKLVFLAHSQP
EYRKLGPFVADIHQWQNLDDFYNQYRQRVIVLLSHPANPRDHTNVLMHVQ
GYFRPHIESTERQQLAALIDSYRRGEQPLLAPLMRIKHYMALYPYAWLSG
QRYFELWPRVINLRHSGVL
>ECs0012 putative oxidoreductase
MNVNYLNDSDLDFLQHCSEEQLANFARLLTHNEKGKTRLSSVLMRNELFK
SMEGHPEQHRRNWQLIAGELQHFGGDSIANKLRGHGKLYRAILLDVSKRL
KLKADKEMSTFEIEQQLLEQFLRNTWKKMDEEHKQEFLHAVDARVNELEE
LLPLLMKDKLLAKGVSHLLSSQLTRILRTHAAMSVLGHGLLRGAGLGGPV
GAALNGVKAVSGSSYRVTIPAVLQIACLRRMVSATQV
>ECs0229 hypothetical protein
MEFEERYFREELDYLRQLSKLLATEKPHLARFLAEKDADPDIERLLEGVA
FLTGNLRQKIEDEFPELTHGLIKMLWPNYLRPVPAMTLIEYTPDMDKSSV
PVLIPRNEQFTTNAGEIRVDEVLPSDAKKEEPPPCTFTLCRDIWLLPVRL
EQIENRSTTRNGVINITFSVAPGTDFRTLDLNKLRFWLGNDDNYTRDQLY
LWFCEYLQGADLTVGEQHIRLPEFMLKAVGFEPQDAMLPWPKNVHSGYRI
LQEYFCYPDAFLFFDLCGCPALPDGLQAEFFTLQLRFSRPLPVDIRLRRD
SLRLYCAPAINLFIHHAEAITLDNRRADYPLVPSRHYPQHYDVFSVNSVV
SQVQDMFRKKDLGRPVSTQAARQWPAFESFSHQMEYSRKREVVYWHHRTK
TSLFHRGFDHTLAFIHADGSYPSDESLLSNEVVSVSLTCTNRELPSQIRS
GDITGTTGKNAAVASFRNITRPTQPLWPVIDGSLHWSLLSAMNLNYLSLL
DTDALKQVIANFDRHAIHHPQTARLSQQKLDAIERLETRPVDRLFTGIPV
RGLASTLYLHPEPFVCEGEMYLLGTVLSHFLSLYASVNSFHMLTVVNTES
QETWKWTERIGQHPLI
>ECs2888 hypothetical protein
MAGWFELSKSSDSQFRFVLKAGNGETILTSELYTSKASAEKGIASVRSNS
PQEERYEKKTASNGKFYFNLKAANHQIIGSSQMYATAQSRETGIASVKAN
GTSQTVKDNT
>ECs4828 hypothetical protein
MIRKAFVMQVNPDAHEEYQRRHNPIWPELEAVLKSHGAHNYAIYLDKARN
LLFATVEIESEERWNAVASTEICQRWWKYMTDVMPANPDNSPVSSELQEV
FYLP
>ECs2545 putative rRNA methylase
MLVAQHTVYFPDAFLTQMREAMPSTLSFDDFLAACQRPLRRSIRVNTLKT
SVADFLRLTAPYGWTLTPIPWCEEGFWIERDDEDALPLGSTAEHLSGLFY
IQEASSMLPVAALFADGNAPQRVMDVAAAPGSKTTQIAARMNNEGAILAN
EFSASRVKVLHANISRCGISNVALTHFDGRVFGAALPEMFDAILLDAPCS
GEGVVRKDPDALKNWSPESNQEIAATQRELIDSAFHALRSGGTLVYSTCT
LNREENESVCLWLKETYPDAVEFLPLGDLFPGANKALTEEGFLHVFPQIY
DCEGFFVARLRKTQAIPALPTPKYKVGNFPFSPVKDREAAQIRQAAASVG
LNWDGNLRLWQRDKEVWLFPVGIEALIGKVRFSRLGIKLAETHNKGYRWQ
HEAVIALASPDNENAFELTPQEAEEWYRGRDVYPQAAPVADDVLVTFQHQ
PIGLAKRIGSRLKNSYPRELVRDGKLFTSNA
>ECs1974 putative portal protein
MWNLLRRTRKNQKSGRDVREVGWRSLFQAVAEPFAGAWQQGVKADPETVL
SFHAVFSCISLISQDIAKMRLRLMQTDVQGIRREKRQGDTARLCRRPNAQ
QNRIQFFELWLNSKLRHGNTVVLKIRTPRGQIKELRILDWNRVEPLVADD
GEVFYRITPDRNCGITESVTVPAREVIHDRFNCFFHPLVGLPPVYAAGLA
AMQGHHIQANSTYFFRNGGRPSGVIEVPGSITEENAKKLKGNWDSGYTGE
NAGKTAILSNGAKYSPTTFSPVDAQTVEQLKMTAEIVCSVFRVPAYKIGV
GHPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETGENESTEFDVTT
LLRMDSERRMKTLGESVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYS
LEALSRRDAREDPFASAGKTVSAQLPDGASDGNKAISETEHDAVKAMFRG
ILRK
>ECs2911 hypothetical protein
MSHTIRDKQKLKARASKIQGQVVALKKMLDEPHECAAVLQQIAAIRGAVN
GLMREVIKGHLTEHIVHQGDELKREEDLDVVLKVLDSYIK
>ECs4781 hypothetical protein
MKCKRLNEVIELLQPAWQKEPDLNLLQFLQKLAKESGFDGELADLTDDIL
IYHLKMRDSAKDAVIPGLQKDYEEDFKTALLRARGVIKE
>ECs4744 hypothetical protein
MKKTLATLFLLTCLGSSSAYADNALILQTDFSLKDGAVSAMKGVAFGVDH
NLKIFDLTHEIPPYNIWEGAYRLYQTASYWPQGSVFVSVVDPGVGTDRKS
IVLKTKNGQYFVSPDNGTLTLVAESLGIESVREIDEKANRLKGSEKSYTF
HGRDVYAYTGARLASGAITFEQVGPELPAKVVELSYQKAKATKGEVKGNI
PILDIQYGNVWSNISDELLNQAGIKLNDTLCVTISEGSRQKYAGKMPYVA
SFGDVPEGQPMVYLNSLLNVSVALNMDNFAQKHQVASGADWNIDVKKCDK
>ECs2028 hypothetical protein
MANSITADEIREQFSQAMSAMYQQEVPQYGTLLELVADVNLAVLENNPQL
HEKMVNADELARLNVERHGAIRVGTAQELATLRRMFAIMGMYPVSYYDLS
QAGVPVHSTAFRPIDDASLARNPFRVFTSLLRLELIENEILRQKAAEILR
QRDIFTPRCRQLLEEYDQRGGFNETQAQEFVQEALETFRWHQSATVDEET
YRALHNEHRLIADVVCFPGCHINHLTPRTLDIDRVQSMMPECGIEPKILI
EGPPRREVPILLRQTSFKALEETVLFAGQKQGTHTARFGEIEQRGVALTP
KGRQLYDDLLRNAGTGQDNLTHQMHLQETFRTFPDSEFLMRQQGLAWFRY
RLTPSGEAHRQAIHPGDDPQPLIERGWVAAQPITYEDFLPVSAAGIFQSN
LGNETQARSHGNASREAFEQALGCPVLDEFQLYQEAEERSKRRCGLL
>ECs0896 hypothetical protein
MNMKLKTLFAAAFAVVGFCSTASAVTYPLPTDGSRLVGQNQVITIPEGNT
QPLEYFAAEYQMGLSNMMEANPGVDTFLPKGGTVLNIPQQLILPDTVHEG
IVINSAEMRLYYYPKGTNTVIVLPIGIGQLGKDTPINWTTKVERKKAGPT
WTPTAKMHAEYRAAGEPLPAVVPAGPDNPMGLYALYIGRLYAIHGTNANF
GIGLRVSHGCVRLRNEDIKFLFEKVPVGTRVQFIDEPVKATTEPDGSRYI
EVHNPLSTTEAQFEGQEIVPITLTKSVQTVTGQPDVDQVVLDEAIKNRSG
MPVRLN
>ECs4330 hypothetical membrane protein
MSGIRSLPMIKLLTGLLLLAWPFVIWFGLAHNGLHWLLPLMALLLLLRLR
QTRRQAGPLQAVTQLVAVVGIALCVASFMLKTHQLLLFYPVVVNAVMLAV
FGGSLWSAMPIVERLARLQEPDLPEKGVRYTRHVTQIWCGFFIINGGIAL
FTALYADMSLWTAWNGMIAYLLMGTLMAGEWLLRRQVMKRDRA
>ECs2238 minor tail protein
MQNIHEESLNESVKSEQSPRVVLWEIDLTVQGGERYFFCNELNEKGEAVT
WQGRQYQVYPIDGSGFEMNGKGSSARPSLTVSNLFGLVTGMAEDLQSLVG
ATVVRRRVYARFLDAVNFVAGNPEADPEQELRDRWVVEQMSELTAMTASF
VLATPTETDGALFPGRIMLANTCMWTYRSDECGYTGGAVADEFDNPTTDI
RKDRCSKCMRGCEMRSMVANFGGFLSINKLSQ
>ECs1121 putative host specificity protein
MGKGGGRAHTPVEAKDNLKSTQMMSVIDAIGEGPIEGPVKGLQSILVNKT
PLTDTDGNPVIHGVTAVWRAGEQEQTPPEGFESSGAETALGVEVTKAKPV
TRTITSANIDRLRVTFGVQSLLETTSKGDRNPSSVRLLIQLQRNGNWVTE
KDVTINGKTTSQFLASVILDNLPPRPFNIRMVRETADSTSDQLQNKTLWS
SYTEIIDVKQCYPNTAIVGLQVDAEQFGGQQMTVNYHIRGRIIQVPSNYD
PEKRTYSGIWDGSLKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWA
LYAIAQYCDQMVPDGFGGTEPRMTFNAYLSQQRKAWDVLSDFCSAMRCMP
VWNGQTLTFVQDSPSDVVWPYTNSDVVVDDNGVGFRYSFSALKDRHTAVE
VNYTDPQNGWQTSTELVEDPEAILRYGRNLLKMDAFGCTSRGQAHRAGLW
VIKTGLLETQTVDFTLGSQGLRHTPGDIIEICDNDYAGTMTGGRVLSIDA
ASRTLTLDREVTLPETGAATVNLINGSGKPVSVAITAHPAPDRIQVSTLP
DGVETYGVWGLSLPSLRRRLFRCVSIRENTDGTFAITAVQHVPEKEAIVD
NGARFEPQSGTLNSVIPPAVQHLTVEVSAADGQYLAQAKWDTPRVVKGVR
FSLRLTSGSGEGSRLVTTAITADTEHRSSGLPPGEYTLTVRAINSYGQQG
EPATTTFRINAPAVPATIELTPGYFQITAVPRLAVYDPTVQFEFWFSETK
IADTSQVETSARYLGTGSQWSVSGPHIKPGKDFWFYVRSVNLVGKSAFVE
ASGRASNDAEGYLGLFREKIGKLHLAQGLWELIDNSQLADEMAEMKTSIT
ETRNEITQTVSKTLEDQSAIIQQIQRVQKDTNDDLAALYMLKVQKTKNGI
PYVAGIGAGIEDTDGQPLSNILLLADRIAMINPEDGNTTPLFVAQGNQLF
MNDVFLKRLFAVSITSSANPPTFSLTPEGRLTARNADISGNVNANSGTLN
NVTINENCRVLGKLSANQIEGDLVKTVGKAFPRDSRAPERWPSGTITVRV
YDDQPFDRQIVIPAVAFSGAKHEREHTDIYSSCRLIVRKNGAEIYNRTAL
DNTLIYSGVIDMPAGHGHMTLEFSVSAWLVNNWYPTASISDLLVVVMKKA
TAGISIS
>ECs2486 hypothetical protein
MNLDDIINSMTPEVYQRLSTAVELGKWPDGVALTEEQKENCLQLVMLWQA
RNNTEAQHMTIDTNGQMVMKSKQQLKEDFGISAKPIAMFK
>ECs1803 putative tail length tape measure protein
MATLRELIIKISANSQSFQSEIQRASRMGSEYYRTLQNGGRQAAAAAREQ
RRALAELNSQLTEIRGSAVGMAGAFAGAFASGHLISLADEWSSVNARLKQ
ASQSSDEFASSQKVLMDISQRTGTAFSDNAALFARSAASMREYGYSAGDV
LKVTEAISTGLKISGASTAEAGSVITQFSQALAQGVLRGEEFNSVNESGD
RIVRALAAGMGVARKDLKAMADDGKLTADKVVPALISQLGILRDEYAAMP
ETVSSSITKVENAFMAWVGGANEASGVTKTLSGMLNGVAGQIDNVATAVG
ALVAVGVARYFGNMASGAMSATAGLVTAARNEVALAEAQFRGTQIATARA
RAAVYRAQQAVAAARGTEMQIAAEARLAATQERLNRNIAARSAAQNALNS
TTAVGSRLMSGALGLVGGVPGLVMLGAAAWYTLYQNQEQARESARQYALT
IDEIAHKTPSMSLPEASDNEGRTRAALTEQNRLIDEQASRVKSLQEKAQS
IQDVLAGLEDRRVALIRQQAAEQNKVYQSMLVMNGQHTEFNRLLGLGNEL
LQQRQGLVNVPLRLPQATLDDKQQSALTKTERELALSRLKGEEKERVRLG
YAADDLGFVGDPYQEARQRYISNALEAWRNNEANKPKSRGGKSETEKAED
SFSRLLKQQKEQLALVGQNTELAKLKYQTALGELKTLTEMQKQELLRNAT
LIDQQKIREQLRSREETLKNENAAARASNDAELLGYGQGERARERMRELQ
QIRDSFRQKDADLQSQYQTGDISEDFYRQALAQNAQYLSERLKDQAVFYA
ESDVQRADWQKGLQEGFSNWVDNASDYASQAAQLATEGISGMVNNITEML
NGNKVEWRSWASSVLQEISKVLMNAAIVNGIKTAANGMSGAGGFLGSIGD
WLGGAVANAKGGVYTSANLSAYSNSIVDTPTYFAFAKGAGLMGEAGPEAI
MPLTRAADGSLGVRAVGSMNGSAGLVYSPVYHIAIQNDGANGQIGPEAAG
SLVQLIDQRVQAVMLSMRRDGGMLSG
>ECs1573 hypothetical protein
MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVD
SRLVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPF
RELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQM
KQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSV
GFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKL
REMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG
DVLVRLGGLRVLRIGDDVYTNGEKIDSPHRPALDALASNIALTAENFGDA
LEDPSFLAMLAALVNSGYWFFEG
>ECs1045 hypothetical protein
MRTVLNILNFVLGGFATTLGWLLATLVSIALIFTLPLTRSCWEITKLSLV
PYGNEAIHVDELNPAGKNVLLNTGGTVLNIFWLIFFGWWLCLMHIATGIA
QCISIIGIPVGIANFKIAAIALWPVGRRVVSVETAQAAREANARRRFE
>ECs5048 hypothetical protein
MRYNGLNNMFFPLCLINDNHSVTSLSHTKKTKSDNYSKHHKNTLIDNKAL
SLFKMDDHEKVIDLIQKMKRIYDSLPSGKITKETDRKIHKYFIDIASYAN
NKCDDRITRRVYLNKDKEVSIKVVYFINNVTVHNNTIEIPQTVNGGYDFS
HLSLKGIVIKDEDLSNSNFAGCRLQNAIFQDCNMYKTNFNFAIMEKILFD
NCILDDSYFAQIKMTDGTLNSCSAMHVQFYNATMNRANIKNTFLDYSNFY
MAYMAEVNLYKVIAPYINLFRADLSFSKLDLINFKHADLSRVNLNKAILQ
NINLIDSKLFFTRLTNTFLEMVICTDSNMANVNFNNANLNNCHFNCSVLT
KAWMFNTRLYRVNFDEASVQGMGISILRGEENIPINSDTLVTLQKFFEED
CTSHTGMSQTENNTHEVAMKITADIMQHAD
>ECs4315 hypothetical protein
MLINIGRLLMLCVWGFLILNLVHPFPRPLNIFVNVALIFTVLMHGMQLAL
LKSTLPKDGPQMTTAEKVRIFLFGVFELLVWQKKFKVKK
>ECs4261 hypothetical protein
MNMKPESKEAPINIRAKASQRDLIDMAANLVAKSRTDFMLDAACREAQDI
LLDQRLFILDDEQYDAFLAALDAPITAERQAKINALMNRKSPWE
>ECs0830 hypothetical protein
MYLKSAPERGCAETVMAKNFVEEGKTVAIVASAAISSGDLVQVGDVFAVA
LTDIPQGETGDGMTEGVFMLPKLKTDDMKTGKKVYLKSGKVQLTNSGSDP
LVGVVWADAGTSAEEVPVKLNV
>ECs3828 putative resistance protein
MNTLTFLLSTVIELYTMVLLLRIWMQWAHCDFYNPFSQFVVKVTQPIIGP
LRRVIPAMGPIDSASLLVAYILSFIKAIVLFKVVTFLPIIWIAGLLILLK
TIGLLIFWVLLVMAIMSWVSQGRSPIEYVLIQLADPLLRPIRRLLPAMGG
IDFSPMILVLLLYVINMGVAEVLQATGNMLLPGLWMAL
>ECs2127 hypothetical protein
MIVITFNRATFPRLKITMIVRPQQHWLRRIFVWHGSVLSKISSRLLLNFL
FSIAVIFMLPWYTHLGIKFTLAPFSILGVAIAIFLGFRNNAGYARYVEAR
KLWGQLMIASRSLLREVKTTLPDSASVREFARLQIAFAHCLRMTLRKQPQ
VEVLAHYLKTEDLQRVLASNSPANRILLIMGEWLAVQRRNGQLSDILFIS
LNDRLNDISAVLAGCERIAYTPIPFAYTLILHRTVYLFCIMLPFALVVDL
HYMTPFISVLISYTFISLDCLAEELEDPFGTENNDLPLDAICNAIEIDLL
QMNDEAEIPAKVLPDRHYQLT
>ECs4519 putative alpha helix protein
MIRSMTAYARREIKGEWGSATWEMRSVNQRYLETYFRLPEQFRSLEPVVR
ERIRSRLTRGKVECTLRYEPDVSAQGELILNEKLAKQLVTAANWVKMQSD
EGEINPVDILRWPGVMAAQEQDLDAIAAEILAALDGTLDDFIVARETEGQ
ALKALIEQRLEGVTAEVVKVRAHMPEILQWQRERLVAKLEDAQVQLENNR
LEQELVLLAQRIDVAEELDRLEAHVKETYNILKKKEAVGRRLDFMMQEFN
RESNTLASKSINAEVTNSAIELKVLIEQMREQIQNIE
>ECs4129 hypothetical protein
MDTRFVQAHKEARWALGLTLLYLAVWLVAAYLPGVAPGFTGFPRWFEMAC
ILTPLLFIGLCWAMVKFIYRDIPLEDDDAA
>ECs5305 hypothetical protein
MALGKESDKSLATAFQDLRELKVDVAYPFLLALYHDYKNDDLSHEDFLSI
IRLIESYVFRRAVCAIPTNSLNKTFATFYKVINKEKYLESIQVHFLNLPS
YRRFPNDDEFKRELKVRDLYNFRSRSYWLRRLENDKRRERVEEFTIEHIM
PQNENLSAKWREELGSDWQRIHKELLHTLGNLTLTRYNSRYSDRPFAEKR
DIEDGFKHSPLYLNIGLGQCEKWDEAAIHARADRLAELAVQVWQAPSLPE
EVLAVYRGQPENKTSYSLSDYPFLADGSHSRVLFDHLRDEIMRLDAGITQ
EVLKLYIAFKAETNFVDVVPQKSRLRLSLNMQFHELVDPKGIAKDVTNVG
RWGNGDVEIGFSDLAQLPYIMGLIRQAFEKQMESALV
>ECs0480 hypothetical protein
MKGEEKMPSFDIVSEVDLQEARNAVDNASREVESRFDFRNVEASFELNDA
SKTIKVLSESDFQVNQLLDILRAKLLKRGIEGSSLDVPENIVHSGKTWFV
EAKLKQGIESATQKKIVKMIKDSKLKVQAQIQGDEIRVTGKSRDDLQAVM
AMVRGGDLGQPFQFKNFRD
>ECs2375 hypothetical protein
MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENS
KMMLANIASIEIPPIYCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRI
LLGEYFRDQFLRLVDQARKQKFAVAVYESCQVTDLQITNAGVMLATNQDL
PSETFDLAVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTS
LSGLDAAMAVAIQHGSFIEDDKQHVVFHRDNASEKLNITLMSRTGILPEA
DFYCPIPYEPLHIVTDQALNAEIQKGEYGLLDRVFRLIVEEIKFADPDWS
QRIALESLNVDSFAQAWFAERKQRDQFDWAEKNLQEVERNKREKHTVPWR
YVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLA
LREAGIIHILALGEDYKMEINESRTVLKTEDNSYSFDVFIDARGQRPLKV
KDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQ
PFVQGLTACAEIGEAMARAVVKPASRARRRLSFD
>ECs2543 hypothetical protein
MALNTPQITPTKKITVRAIGEELPRGDYQRCPQCDMLFSLPEINSHQSAY
CPRCQAKIRDGRDWSLTRLAAMAFTMLLLMPFAWGEPLLHIWLLGIRIDA
NVMQGIWQMTKQGDAITGSMVFFCVIGAPLILVTSIAYLWFGNRLGMNLR
PVLLMLERLKEWVMLDIYLVGIGVASIKVQDYAHIQAGVGLFSFVALVIL
TTVTLSHLNVEELWERFYPQRPATRRDEKLRVCLGCHFTGYPDQRGRCPR
CHIPLRLRRRHSLQKCWAALLASIVLLLPANLLPISIIYLNGGRQEDTIL
SGIMSLASSNIAVAGIVFIASILVPFTKVIVMFTLLLSIHFKCQQGLRTR
ILLLRMVTWIGRWSMLDLFVISLTMSLINRDQILAFTMGPAAFYFGAAVI
LTILAVEWLDSRLLWDAHESGNARFDD
>ECs0521 hypothetical protein
MQRIILIIIGWLAVVLGTLGVVLPVLPTTPFILLAAWCFARSSPRFHAWL
LYRSWFGSYLRFWQKHHAMPRGVKPRAILLILLTFAISLWFVQMPWVRIM
LLVILACLLFYMWRIPVIDEKQEKH
>ECs0617 hypothetical protein
MDKQSLHETAKRLALELPFVELCWPFGPEFDVFKIGGKIFMLSSELRGVP
FINLKSDPQKSLLNQQIYPSIKPGYHMNKKHWISVYPGEEISEALLRDLI
NDSWNLVVDGLAKRDQKRVRPG
>ECs0842 putative host specificity protein
MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPIEGPVDGLKSVLLNST
PVLDSEGNTNISGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPI
TRTITSANIDRLRFTFGVQALRETTSKGDRNPSEVRLLVQIQRNGGWVTE
KDITIKGKTTSQYLASVVVDNLPPRPFNIRMRRMTPDSTTDQLQNKTLWS
SYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYD
PEKRTYSGIWDGTLKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWA
LYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMP
VWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVE
VNWTDPDNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLW
LIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISIGGRVLAVNS
QTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVP
DGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEGIVD
NGAHFDGDQSSTVNGVTPPAVQHLTAEVSADSGEYQVLARWDTPKVVKGV
SFLLRLTVAADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQ
QGDPASVSFRIAAPAAPSQIELTPGYFQITATPHLAVYDPTVQFEFWFSE
TRIADIRQVETSARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAF
VEAVGRASDDAEGYLDFFKGKITESHLGKELLEKVDLTEDNASRLDEFSK
EWKDANDKWNAMWGVKIEQTKDGKHYVAGIGLSMEDTEEGKLSQFLVAAN
RIAFIDPANGNETPMFVAQGNQIFMNDVFLKRLTAPTITSGGNPPVFSLT
SDGKLTAKNADISGSVNANSGTLNNVTVNENCTIKGMLEATQVRGDFVKA
VSKSFPKQAGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYSD
PGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRG
SVTLEFKVFHKGNQRAGNITDCTVIVTKKAASGISIR
>ECs2810 hypothetical protein
MRANKSLSPFEIRVYRHYRIVHGTRVALAFLLTFLIIRLFTIPEGTWPLV
TMVVIMGPISFWGNVVPRAFERIGGTVLGSILGLIALQLELISLPLMLVW
CAAAMFLCGWLALGKKPYQGLLIGVTLAIVVGSPTGEIDTALWRSGDVIL
GSLLAMLFTGIWPQRAFIHWRIQLAKSLTEYNRVYQSAFSPNLLERPRLE
SHLQKLLTDAVKMRGLIAPASKETRIPKSIYEGIQTINRNLVCMLELQIN
AYWATRPSHFVLLNAQKLRDTQHMMQQILLSLVHALYEGNPQPVFANTEK
LNDAVEELRQLLNNHHDLKVVETPIYGYVWLNMETAHQLELLSSLICRAL
RK
>ECs3179 hypothetical protein
MSTPDNRSVNFFSLFRRGQHYSKTWPLEKRLAPVFVENRVIKMTRYAIRF
MPPIAVFTLCWQIALGGQLGPAVATAPVRLKFTHAGIVVAGQAFCHAITP
CNPQLVL
>ECs4492 hypothetical protein
MFPFRRNVLAFAALLALSSPVLAGKLAIVIDDFGYRPHNENQVLAMPSAI
SVAVLPDSPHAREMATKAHNSGHEVLIHLPMAPLSKQPLEKNTLRPEMSS
DEIERIIRSAVNNVPYAVGINNHMGSKMTSNLFGMQKVMQALERYNLYFL
DSVTIGNTQAMRAAQGTGVKVIKRKVFLDDSQNEADIRVQFNRAIDLARR
NGSTIAIGHPHPSTVRVLQQMVYNLPPDITLVKASSLLNEPQVDTSTPPK
NAVPDAPRNPFRGVKLCKPKKPIEPVYANRFFEVLSESISQSTLIVYFQH
QWQGWGKQPEAAKFNASAN
>ECs1648 host specificity protein
MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVEGPVDGLKSVLLNST
PVLDSEGNTNISGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPI
TRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTE
KDITIKGKTTSQYLASVVVDNLPPRPFNIRMRRMTPDSTTDQLQNKTLWS
SYTEIIDVKQCYPNTALVGVQVDSEQFGSQKVSRNYHLRGRILQVPSNYN
PQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWA
LYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMP
VWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGASFRYSFSALKDRHNAVE
VNWIDPDNGWETATELVEDTQAILRYGRNVTKMDAFGCTSRGQAHRAGLW
LIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISIGGRVLAVNS
QTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVP
DGVAGYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVD
NGAHFDGDLSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGV
SFLLRLTVAADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQ
QGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSE
TRIADIRQVETSARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAF
VEAVGRASDDAEGYLDFFKGKITESHLGKELLEKVDLTEDNASRLDEFSK
EWKDANDKWNAMWGVKIEQTKDGKHYVAGIGLSMEDTEEGKLSQFLVAAN
RIAFIDPANGNETPMFVAQGNQIFMNDVFLKRLTAPTITSGGNPPAFSLT
PDGKLTAKNADISGSVNANSGTLNNVTINENCQIKGKLSANQIEGDIVKT
VSKSFPRTSTYASGTITVRISDDQKFDRQVMIPPVLFRGGKHENFNSNNQ
QSYWYSTCRLRVTRNGQEIFNQSTTDAQGVFSSVIDMPAGQGTLTLTFTV
SSSGANNWTPTTSISDLLVVVMKKSTAGISIS
>ECs5038 hypothetical protein
MWYQKTLTLSAKSRGFHLVTDEILNQLADMPRVNIGLLHLLLQHTSASLT
LNENCDPTVRHDMERFFLRTVPDNGNYEHDYEGADDMPSHIKSSMLGTSL
VLPVHKGRIQTGTWQGIWLGEHRIHGGSRRIIATLQGE
>ECs0226 hypothetical lipoprotein
MNRKAFLACVLMCVILTGCETAKKISEVIKNPDIQVGSLKEQPSEITVTL
LTEPDTNTNAEGESAAVDVQLVYLTDDSKLQAADYDQIASTPLPDVLGKN
YIDHQDFNLLPDTIKTLPPIKLDEKTQFIGVVAYFSDDQATEWKQIETVE
GTGHHYRLLVHVRQSSIEMKKEDE
>ECs0507 glycoprotein/polysaccharide metabolism
MASGLAVAIALAACADKSADIQTPAPAANTSISATQQPAIQQPNVSGTVW
IRQKVALPPDAVLTVTLSDASLADAPSKVLAQKAVRTEGKQSPFSFVLPF
NPADVQPNARILLSAAITVNDKLVFITDTVQPVINQGGTKADLTLVPVQQ
TAVPVQASGGATTTVPSTSPTQVNPSSAVPAPTQY
>ECs3977 hypothetical protein
MELLTQLLQALWAQDFETLANPSMIGMLYFVLFVILFLENGLLPAAFLPG
DSLLVLVGVLIAKGAMGYPQTILLLTVAASLGCWVSYIQGRWLGNTRTVQ
NWLSHLPAHYHQRAHHLFHKHGLSALLIGRFIAFVRTLLPTIAGLSGLNN
ARFQFFNWMSGLLWVLILTTLGYMLGKTPVFLKYEDQLMSCLMLLPVVLL
VFGLAGSLVVLWKKKYGNRG
>ECs3793 putative actin
MKFKVIALAALMGISGMAAQANELPDGPHIVTSGTASVDAVPDIATLAIE
VNVAAKDAATAKKQADERVAQYISFLELNQIAKKDISSANLRTQPDYDYQ
DGKSILKGYRAVRTVEVTLRQLDKLNSLLDGALKAGLNEIRSVSLGVAQP
DAYKDKARKAAIDNAIHQAQELANGFHRKLGPVYSVRYHVSNYQPSPMVR
MMKADAAPVSAQETYEQAAIQFDDQVDVVFQLEPVDQQPAKTPAAQ
>ECs4960 hypothetical protein
MRGKLISAIHVAKRELALDDETYTSALLAATGKTSCRDMSPDELSRVLDV
FKKRGFKVRQNPVNRALKPGTVTAKIRAIWKVMHRQGFITDGAETALNRW
VKSQTAAQNGGEGVANWQWLEQHPALASDVLERLKRWHRRKMLAAMGMPE
RTLMGYDAVCRQYEKSLPR
>ECs3527 hypothetical protein
MGFWRIVITIILPPLGVLLGKGFGWAFIINILLTLLGYIPGLIHAFWVQT
RD
>ECs1735 hypothetical protein
MSQLCPCGSAVEYSLCCHPYVSGEKVAPDPEHLMRSRYCAFVMQDADYLI
KTWHPSCGAAALRAELIAGFAHTEWLGLTVFEHCWQDADNIGFVSFVARF
TEGGKTGAIIERSRFLKENGQWYYIDGTRPQFGRNDPCPCGSGKKFKKCC
GQ
>ECs2060 VgrE
MSTGLRFTLEVDGLPPDAFAVVSFHLNQSLSSLFSLDLSLVSQQFLSLEF
AQVLDKMAYLTVWQGDDVQRRVKGVVTWFELGENDKNQMLYSMKVCPPLW
RTGLRQNFRIFQNEDIESILATILKENGVTEWSPLFSEPHPSREFCVQYG
ETDYDFLCRMAAEEGIFFYEEHAQKSIDQSLVLCDTVRYLPESFEIPWNP
NTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQYQDY
QRTQYEVYDYPGRFKGAHGQNFARWQMDGWRNNAEVARGTSRSPEIWPGR
RIVLTGHPQANLNREWQVVASELHGEQPQAVPGRRGSGTTLNNHFAVIPA
DRTWRPQPLLKPLVDGPQSAVVTGPAGEEIFCDEHGRVRVKFNWDRYNPS
NQDSSCWIRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRTY
HQENRTPGSLPGTKTQMTIRSKNYKGSGFNELKFDDATGKEQVYIHAQKN
MNTEVLNNRTTDVINNHAETIGNNQMIAVTNNQIQTVGVNQIETVGSNQI
INVGSVQVETIGLVRALTVGVAYQTTVGGIMNTSVALMQSSQIGLHKSLR
VGLGYDVKVGNNVTFTVGKTKKDDTGQTAIYSAGEHLELCCGKARLVLTK
DGQIFLNGTKIHLQGKEQVNGDSLLINWNCAASKSPPKTPDEKQDTPDMR
EY
>ECs2020 hypothetical protein
MNITPFPTLSTATIDAINVIGQWLAQDDFSGEVPYQADCVILAGNAVMPT
IDAACKIARDQQIPLLISGGIGHSTTFLYSAIAQHPHYNTIRTTGRAEAT
ILADIAHQFWHIPHEKIWIEDQSTNCGENARFSIALLNQAVERVHTAIVV
QDPTMQRRTMATFRRMTGDNPDVPRWLSYPGFVPQLGNNADSVIFINQLQ
GLWPVERYLSLLTGELPRLRDDSDGYGPRGRDFIVHVDFPAEVIHAWQTL
KHDAVLIEAMESRSLR
>ECs1034 paraquat-inducible protein A
MCEHHHAAKHILCSQCDMLVALPRLEHGQKAACPRCGTTLTVAWDAPRQR
PTAYALAALFMLLLSNLFPFVNMNVAGVTSEITLLEIPGVLFSEDYASLG
TFFLLFVQLVPAFCLITILLLVNRAELPVRLKEQLARVLFQLKTWGMAEI
FLAGVLVSFVKLMAYGSIGVGSSFLPWCLFCVLQLRAFQCVDRRWLWDDI
APMPELRQPLKPGVTGIRQGLRSCSCCTAILPADEPVCPRCGTKGYVRRR
NSLQWTLALLVTSIMLYLPANILPIMVTDLLGSKMPSTILAGVILLWSEG
SYPVAAVIFLASIMVPTLKMIAIAWLCWDAKGHGKRDSERMHLIYEVVEF
VGRWSMIDVFVIAVLSALVRMGGLMSIYPAMGALMFALVVIMTMFSAMTF
DPRLSWDRQPESEHEES
>ECs0317 hypothetical protein
MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNA
ACGPESLIRAAGQIDCSRNFLNPPYIFLRDWLGLTDPNAAVYTFAGHVFN
WVGVTHIIFSIVFAVGYCVVAEVFPKIKLWQGLLAGALAQLFVHMISFPL
MGLTPPLFDLPWYENVSEIFGHLVWFWSIEIIRRDLRNRITHEPDPEIPL
GSNR
>ECs0535 putative ligase
MDLLYRVKTLWAALRGNHYTWPAIDISLPGNRHFHLIGSIHMGSHDMAPL
PTRLLKKLKNADALIVEADVSTSDTPFANLPACEALEERISEEQLQKLQH
ISQEMGISPSLFSTQPLWQIAMVLQATQAQKLGLRAEYGIDYQLLQAAKQ
QHKPVIELEGAENQIAMLLQLPDKGLALLDDTLTHWHTNARLLQQMMSWW
LNAPPQNNEITLPNTFSQSLYDVLMHQRNLAWRDKLRAMPPGRYVVAVGA
LHLYGEGNLPQMLR
>ECs2531 hypothetical protein
MFAGGDDVFYGYPGQDVVMNITATVLLAFGMSMDAFAASIGKGATLHKPK
FSEALRTGLIFGAVETLTPLIGWGMGMLASRFVLEWNHWIAFVLLIFLGG
RMIIEGFRGADDEDEEPRRRHGFWLLVTTAIATSLDAMAVGVGLAFLQVN
IIATALAIGCATLIMSTLGMMVGRFIGSIIGKKAEILGGLVLIGIGVQIL
WTHFHG
>ECs0233 hypothetical protein
MSKKFEGSVAPRERINISYVPKTDGQTAEVELPLNMLVVGDTGNTQETSS
LDERQAVSVNKHNFGAVMAEAAIGLNFTVPATLKGSTTDDEMNVALNIKS
LDDFSPDSVARQVPEVNKLLELREALTALKGPMGNLPAFRTQLQALLENE
ESREQLLKEIGQVSNK
>ECs2558 hypothetical protein
MAVEVKYVVIREGEEKMSFTSKKEADAYDKMLDTADLLDTWLTNSPVQME
DEQREALSLWLAEQKDVLSTILKTGKLPSPQVVGAESEEEDASHAA
>ECs1118 putative tail assembly protein
MATTNAFSLASPPLARICLHGDLQRFGRRLSLYVNTAAEAIRALSMQMPG
FRRQMNEGWYQIRIAGDDTAPEAVYARLHEQLGEGTVIHIVPRLAGAGKG
GLQIVLGAAAIVGSFFTAGASMALWGSALAAGGFSATTMLFSLGASMILG
GVAQMLAPKAKTPDYRATDNGRQNTYFSSLDNMIAQGNPMPVPYGEMLVG
SRRISQDISTRDEGGDGKVVVIGRQA
>ECs4961 putative transcription regulator
MAETQMSMFGGDSEQLHALIDRLDDIPDDVLKKNWPRTLSELVEVTGAEL
QRQGIEPVLAGKLARKVAAAQAAYMGGRGYYLPVGESLFAELRNNEIFSR
WDRGEKIESLRRHYRMSETQIYTVIREQRRLHLARTQPPLF
>ECs3214 hypothetical protein
MKKKTTLSEEDQALFRQLMAGTRKIKQDTIVHRPQRKKVSEVPVKRLIQE
QVDASHYFSDEFQPLLNADGPVKYVRPGVDHFEAKKLRRGDYSPELFLDL
HGLTQLQAKQELGALIAACRREHVFCACVMHGHGKHILKQQTPLWLAQHP
HVMAFHQAPKEYGGDAALLVLIEVEEWLPPELP
>ECs0967 clpS, ATP-dependent Clp protease adaptor protein ClpS
MGKTNDWLDFDQLAEEKVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKF
FSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLL
CTLEKA
>ECs3394 iscA, iron-sulfur cluster assembly protein
MSITLSDSAAARVNTFLANRGKGFGLRLGVRTSGCSGMAYVLEFVDEPTP
EDIVFEDKGVKVVVDGKSLQFLDGTQLDFVKEGLNEGFKFTNPNVKDECG
CGESFHV
>ECs4809 rbn, ribonuclease BN
MLKTIQDKARHRTRPLWAWLKLLWQRIDEDNMTTLAGNLAYVSLLSLVPL
VAVVFALFAAFPMFSDVSIQLRHFIFANFLPATGDVIQRYIEQFVANSNK
MTAVGACGLIVTALLLMYSIDSALNTIWRSKRARPKIYSFAVYWMILTLG
PLLAGASLAISSYLLSLRWASDLNTVIDNVLRIFPLLLSWISFWLLYSIV
PTIRVPNRDAIVGAFVAALLFEAGKKGFALYITMFPSYQLIYGVLAVIPI
LFVWVYWTWCIVLLGAEITVTLGEYRKLKQAAEQEEDDEP
>ECs2391 sufA, iron-sulfur cluster assembly scaffold protein
MDMHSGTFNPQDFAWQGLTLTPAAAVHIRELVAKQPGMVGVRLGVKQTGC
AGFGYVLDSVSEPDKDDLLFEHDGAKLFVPLQAMPFIDGTEVDFVREGLN
QIFKFHNPKAQNECGCGESFGV
>ECs5169 ulaA, ascorbate-specific PTS system enzyme IIC
MEILYNIFTVFFNQVMTNAPLLLGIVTCLGYILLRKSVSVIIKGTIKTII
GFMLLQAGSGILTSTFKPVVAKMSEVYGINGAISDTYASMMATIDRMGDA
YSWVGYAVLLALALNICYVLLRRITGIRTIMLTGHIMFQQAGLIAVTLFI
FGYSMWTTIICTAILVSLYWGITSNMMYKPTQEVTDGCGFSIGHQQQFAS
WIAYKVAPFLGKKEESVEDLKLPGWLNIFHDNIVSTAIVMTIFFGAILLS
FGIDTVQAMAGKVHWTVYILQTGFSFAVAIFIITQGVRMFVAELSEAFNG
ISQRLIPGAVLAIDCAAIYSFAPNAVVWGFMWGTIGQLIAVGILVACGSS
ILIILGFIPMFFSNATIGVFANHFGGWRAALKICLVMGMIEIFGCVWAVK
LTGMSAWMGMADWSILAPPMMQGFFSIGIAFMAVIIVIALAYMFFAGRAL
RAEEDAEKQLAEQSA
>ECs0940 ulaA, ascorbate-specific PTS system enzyme IIC
MEGVPTMFAKFIDVIQTFLTEPAILIGLLVGIGYALDKKSPIKIITGMVS
AMVGLMMVLFGGFQFSATFKPVAEAVSKAYGVHGYLMDSYAMKAATQIAL
GDNFGYVGYVFVLAFFTNLLLVLFGRYTGAKGIFLTGNTGVSHSQAVLWL
IVFWLGFSWTTSIIIAGILTGVFWAFSTTLIVKPIAKVTKDAGFTIAHNQ
MLGLWFFSKFAHKFGDPEKHDAENLKLPGWLAIFNHNVTAIAIVMTLFVG
GFLLSTGIDNVQLMAKGKPWYIYIINLGLQFSMYMVILLQGVRMMVGEIN
GSFKGWQDRFIPNAIPAVDVAALLPFSPNAATLGFVFCTFGTIFSMGILL
LVHSPIMVLPGFVPLFFSGGPIGVLANRMGGYRSVIICTFLLGIIQTFGT
VWAIPLTGLAENGVGWTGIFDWATVWPAICEVLKFIAATFHLGPYAG