TitleGenColors Logo

Gene list

Applied filters:

COG category: Unclassified
Gene type: CDS
Genomic element: chromosome

Number of genes found: 1125

Free access
Sort by:

 



# Escherichia coli O157:H7 str. Sakai, Sakai

>ECs1824 hypothetical protein
MPVILNFSSERVLSESELEALRHVGRVSQSEQLVVRGRTMRLHHISFMDS
FSVEPVSGGLLDRLSARGHRLLAENLEIQLNRGHTFLQAFRLYMEQSRAT
PCTRQNVSSAIQNKINSHAFTVSHQDFSCHEQHLNCPITLCIPETGVFVR
NAKNSEICSLYDHNALTELIRRNAPHPLSREPFVPEMIVSKDECHFNLIE
QYFCILATQNICTRI
>ECs2285 hypothetical protein
MNTAFALVLTVFLVSGEPVDIAVSVHRTMQECVTAATEQKIPGNCYPVDK
VIHQDNNEIPAGL
>ECs2642 putative phage tail protein
MCGCRKFFERCGLWREERTGDGSLITVVCFEHIEDFVADIAVIFNWSPAE
IFMMTPGEVVSWRERAALRSGNADNEDS
>ECs1068 hypothetical protein
MGTALSPIVSEFETTEQENSYNEWLRTKVTSSLADTRPAIPHDEVMAEME
NLIAQIAVTNKSE
>ECs1161 putative excisionase
MRELVNQHNHGIQPVITPVVQINANEWVTLELLMAVTGLRKGTILRARDS
AWMNGREYKQIAPDGTPKKNSECLYHLPTINTWIKNQPLPSQDV
>ECs2153 putative damage-inducible protein
MPHRYPAAKINKMPKGSVPALQQEMLRRVSKRYDDVEVIIKSTSNDGLSV
TRTADKDSAKTFVQETLKDTWESADEWFVR
>ECs5374 hypothetical protein
MFLPGNNIFASGHGVAFHTDFYKTRSVRFFMDSFGYILRFKVMLFNIKVF
MQAQCL
>ECs3209 hypothetical protein
MIAEFESRILALIDGMVDHASDDELFASGYLRGHLTLAIAELESGDDHSA
QAVHTTVSQSLEKAIGAGELSPRDQALVTDMWENLFQQASQQ
>ECs2804 hypothetical protein
MKIITRGEAMRIHQQHPTSRLFPFCTGKYRWHGSAEAYTGREVQDIPGVL
AVFAERRKDSFGPYVRLMSVTLN
>ECs3707 hypothetical protein
MKPRNINNSLPLQPLVPEQENKNKKNEEKSVNPDKITMGSGLNYIEQESL
GGKYLTHDLSIKIADISEEIIQQAILSAMSIYKFPITDDLMSMAVNELIK
LTKIENNVDLNKFTTICTDVLSPLVTKHNKQKSQNDILPFAKIPFLIFIE
KMEIDNMNPGHVH
>ECs0809 exonuclease
MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKW
PDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSGVNV
TESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAI
KSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMAGFD
EMVPEFIEKMDEALAEIGFVFGEQWR
>ECs2665 FliT
MNHAPHLYFAWQQLVEKSQLMLRLATEEQWDELIASEMAYVNAVQEIAHL
TEEVDPSTTMQEQLRPMLRLILDNESKVKQLLQIRMDELAKLVGQSSVQK
SVLSAYGDQGGFVLAPQDNLF
>ECs2941 putative tail fiber protein
MAVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYS
MDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRP
EALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAANADT
SAGDALESARQAAESAAAAKQSEDASSSSASAAAQKASESSQSAAEAELS
RKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAEEAVNRIPTVV
GPPGPKGEQGPAGPQGPKGDKGERGDTGPVGATGERGPAGDAGPAGPQGP
KGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGATG
PQGPKGDPGETQIRFRLGPASIIETNSNGWFPDTDGALITGLTFLDPKDA
TQVQGLFRHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
>ECs1230 hypothetical protein
MLIRWSEGCRVILVQEFFMPENRRIILDSKESWLIICDSQLGHLMRSMYQ
GRRFIQLNLEKLKGVHDVALPVKWEFTRRQ
>ECs3486 hypothetical protein
MPVILNFSNGSVLPENELEALRHIARSNQNDTITIGGRNMRLHYIQFMDG
FSVEPILGGLWDHLGAREAHHLADRLTRQLNGGNTFLQAYSLYLEQRQAA
PLVQESVIKTLLDRINSNAFPVSLQDFSCTEEHLNCPITLHIPETGVFVR
NARNSEICALYDQEALTELILRNALHPLSRDPFAPEMIISKDKCHFNITK
QCFYALPIYPLQQNSI
>ECs1930 hypothetical protein
MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKP
DSKRGIIWYNYPKINQFIKDS
>ECs0243 hypothetical protein
MLQIIRGKLVIFLITLCLFVVYLGFDNNSNSDIVFYGHKTPKSVEIYLSE
KILFIK
>ECs1551 putative phage tail protein
MRLALRLGRTLSELRHSLSASEAMMWMEFDRVSPLGDERGDIRNAQIVKA
VFGAQGMNVALKDAMLCWGEDEDKPEVDPFAALEDALSLAAMS
>ECs2142 hypothetical protein
MPHGLHRYNVAPRNTPQIAYRRAIRPFFLRSFAYGVNGERLGYADARSPL
HVQPHEVKRNHRQYKPESHEHRRNLFRHSSARENAPQQLLASLW
>ECs4562 hypothetical protein
MFSPMTMAGRSLVQATAQTLKPAVTRAAMQAGTGATGMRFMPVQSNFVIN
HGKLTNQLLQAVAKQTRNGDTQQWFQQEQTTYISRTVNRTLDDYCRSNNS
VISKETKGHIFRAVENALQQPLDMNGAQSSIGHFLQSNKYFNQKVDEQCG
KRVDPITRFNTQTKMIEQVSQEIFERNFSGFKVSEIKAITQNAILEHVQD
TRL
>ECs1549 putative major tail subunit
MSALYERSQLTQVMISSAPATAETMEKAEYLRLDCTIKEVQFTAGQKQDI
DVTTLCSTEQENINGLGASSEISMSGNFYLNQAQNALRDAYDNDTVYAFK
VQFPSGKGFKFLAEVRQHTWSSGTNGVVAATFSLRLKGKPVSYVVPLAFV
KNLDKTLTVNTGALLTMSVSVNGGTPPYKHAWKKDGQPVEGQTTDTFSKP
GAQSGDKGAYTCEVTDSAEQPQSITSDACTVTVNGAGG
>ECs1502 putative excisionase
MSEQYLITLDEWKPKRFSLPITNTTLVKYGKLGYIVPRPQKIRGRWLIDR
RAVFVGPGETGIAPEIHTGDDDALKEILTHVTEATKKQH
>ECs2534 hypothetical protein
MCGIFSKEVLSKHVDVEYRFSAEPYIGASCSNVSVLSMLCLRAKKTI
>ECs1611 putative replication protein
MLRNAVLNLSIFHQLLVFLAVMRKTYGFNKKLDWVSNEQLSELTGILPHK
CSAAKSVLVKRGIFIQSGRNTGINNVVSEWSTLPESGKKNKVYLKEVNLP
ESGKKSLPKSGKGVYPNQVNTKDKLTKDNIKPFSSENSGESSDQPENDLP
VVKPDAAIQSGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIR
LMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQ
QAGMIASKPKLDLTNTDWIYGVDL
>ECs0073 hypothetical protein
MKVRNPEQISIPASNTTKDPGLTNSQIIRMANLVKKTENMNIFEELWETL
RNLFQSDKHSQTAARQILKDAFYFQNSDDYSKYFTGAVDGKARDKLTHWL
IKFNELKEYAKDPENMAAKASLSPEGTLCVSFFIGDEAIFTLELQLKKST
RTGGIDLSNAYFNGVVICGIDCLEVDLSNAETNNSRWYD
>ECs0876 hypothetical protein
MDCSKCNGYATMLLNMVQGSDPVNLLELHGFLEHFAYYVSFGKFNAGHQR
YNAFKKFVSEISEISANDINMTIKTGQSRHENVISINMNDAIPRDEKGIT
VRIDNINGKKNNSNSSDVFIPYVNTFPDLKNKILRMKIELTEGSGFSKSL
SDSQIEMHILRTVNSLNVGEKLNDDNLADHSIFTNEFSVIIPPSYYDATS
AVNANNIVREKLFESDSKVKDIVDDMSNHDVESEKDIFVIGGMIEKLNSL
ADESFNDSTDNIQTVKDLLTQLTDGMELFALRDIVAFPSTIIAKLIKSPL
NSDHELVMRALDTYLCYFRNKNLNNNAEIINFFHALFLKRPELMVAENYR
FIQFIDLLFENGNVEEKNLAFDLYHNYLSLSEIKQFVTEEIKLNFNEQQG
LLDKDNKCYILLSSDNSGRVMRLSHQALISMLEPEVKKKTIWNNYSIYPS
LQDTHEVVRDDPETICMRAFPLFAKGWEYAQKNKKHQLILNALGFKGYIR
DIFMSAIMRKTDFVLECNNQPTELNSSFSSLMNDSDQWQQHTLKDKHYAN
LLTMLDLNDASESDKSKIFFCLSAVFANISHSNVFNGIPDASKTLKRYAF
ALLAKAHSLDESMISNQTFNTYKTVLLDFNNLSNEEANQLRISSLYRDMV
RYAQYRFSKVLSEWTPDAWL
>ECs2334 hypothetical protein
MTVQDYLLKFRKISSLESLEKLYDHLNYTLTDDQELINMYRAADHRRAEL
VSGGRLFDLGQVPKSVWHYVQ
>ECs5534 hypothetical protein
MLASGLLIRWITCLIIKDVLCLKVMQPGLDKDYRRARAELYRLAGVNFPT
PQRNLSGIRSGSKK
>ECs2925 putative regulator
MDKELPWLADNAQLELKYKKGKTPLSHRRWPGEPVSVITGSLIQTLGDEL
LQQAGQKENITWNYDKCSLEWQSAIQQAINLTGEHKPSIPALTMAALICI
AQNDSQQLLDEIVQQEGLEYATDVVISRQCIARRYESDSLVVTLQYQDED
YGYGYGSATYNDFDLRLRKHLSLAEESCWQRCADKLIAALPGIPKIRRPF
IALILPEKPEIANELASLESSRSSLHSKEWLKVVATDNTAVKKLERYWGL
DVFSDREASYMSQENRFGYAACASLLREQGLAAVPRLAMYAHKEDCGSLL
VQINHPQVIRTLLLVADKNKPSLQRVAKYSKNFPHATLAALAELLALKEP
PARPGYPIIEDKKLPAQQKAREEYWRTLLQTLMASQPQLAAEVMSWLSTQ
ARAVLKSYLSALPKPVIDGTDNSNLPEILVSPPWRSKKKMTAPRLDLARL
ELTPQVYWQPGERERLAATESARYFSTESLAQRMEQKSGRVVLQELGFGD
DVWLFLNYILPGKLDAARNSLIVQWHYYQGRVEEILNGWNSPEAQLAEQA
LRSGHIEALINIWENDNYSRYRPEKSVWNLYLLAQLPREMALTFWLRINE
KKHLFAGEDYFLSILGLDALPGLLLAFSHRPKETFPLILNFGATELALPV
ARVWRRFAAQRDLARQWILQWPEHTATALIPLVFTKSSDKSEAALLALRL
LYEHGHGELLQTVANRWQRTDVWPALEHLLKQGPMEIYPARIPKAPDFWH
PQMWSRPRLITNNQPVTDDALEIIGEMLRFTQGGRFYSGLEQLKTFCQPQ
TLAAFAWDLFTAWQQAGAPAKDNWAFLALSLFGDESTARDLTTQILAWPQ
GKSARAVSGLNILTLMNNDMALIQLHHISQRAKSRPLRDNAAEFLQVVAE
NRGLSQEELADRLVPTLGLDDPQALSFDFGPRQFTVRFDENLNPVIFDQQ
NVRQKSVPRLRADDDQLKAPEALARLKGLKKDATQVSKNLLPRLEAALRT
TRRWSLADFHSLFVNHPFTRLVTQRLIWGVYPANEPRRLLNAFRVAAEGG
FCNAQDEPIDLPADALIGIAHPLEMTAEMRSEFAQLFADYEIMPPFRQLA
RRTVLLTPDESTSNSLTRWEGKSATVGQLMGMRYKGWESGYEDAFVYDLG
EYRLVLKFSPGFNHYNVDSKALMSFRSLRVYRDNKSVTFAELDVFNLSEA
LSAPDVIFH
>ECs4478 hypothetical protein
MKEVEKNEIKRLSDRLDAIRHQQADLSLVEAADKYAELEKEKATLEAEIA
RLREVHSQKLSKEAQKLMKMPFQRAITKKEQADMGKLKKSVRGLVVVHPM
TALGREMGLQEMTGFSKTAF
>ECs5042 hypothetical protein
MATLTTGVVLLRWQLLSAVMMFLASTLNIRFRRSDYVGLAVISSGLGVVS
ACWFAMGLLGITMADITAIWHNIESVMIEEMNQTPPQWPMILT
>ECs5496 hypothetical protein
MDWLAKYWWILVIVFLVGVLLNVIKDLKRVDHKKFLANKPELPPHRDFND
KWDDDDDWPKKDQPKK
>ECs1845 hypothetical protein
MNKETQPIDRETLLKEANKIIREHEDTLAGIEATGVTQRNGVLVFTGDYF
LDEQGLPTAKSTAVFNMFKHLAHVLSEKYHLVD
>ECs0302 Cnr-like protein
MICCESLLALRAALYRRAVACAWLALSNHQERYSGLTLAELEDAIARELE
GFYLRQHGQQRGLEIACALLSDLMESGPLKACPVLSLLGMTVMDELCSRH
LNKPALH
>ECs5159 hypothetical protein
MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPG
EEFTVVAVSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFV
YEESYGISKENHWREAINAKTMGAMTLNWQEKRWQRFFNSEEPGNIEPVY
MLEKVENQNHAKWEVHNFTMGYQRQVTEDTYEYLLLNGEESFNDLGEPEW
LFSRALGVDIPLTSLHIIG
>ECs4924 hypothetical protein
MNSFNEGVVSPLLSFWRRSLMLAGALFLTACSHNSSLPPFTASGFAEDQG
AVRIWRKDSGDNVHFLAVFSPWRSGDTTTREYRWQGDNLTLININVYSKP
PVNIRARFDDRGDLSFMQRESDGEKQQLSNDQIDLYRYRADQIRQISDAL
RQGRVVLRQGRWHAMEQTVTTCEGQTIKPDLDSQAIAHIERRQSRSSVDV
SVAWLEAPEGSQLLLVANSDFCRWQPNEKTF
>ECs2714 hypothetical protein
MSIIKNCLSLINNALNIQKTSYSLTKMEQAGKLLNRKITPENTPPMLLSY
RNADLTQEKNITERVLSIFKIKRDFVAVRIQNNQFTDLKNKKIQGHQNTV
ASVMDWYNPQKNALGITMGTPRKSADIAKEEHRNALNFMIMEKNTFHEKI
LNSNDNLQKSYSKTEDSSWVAASVGSLLDKGAKVYPDTSCSLRLGEPFIF
TLPESVRVDVDIYPLKK
>ECs1342 hypothetical protein
MSKPRWTLDQKKHHVAAWRASGLTREQYCELYDIPFKSLRQWPQDVAKAE
KRARAPEIIPVSVSGSSGMTDGRPLSDEPVTLFLPGGIRMCCQPSQLTDV
FRALRHADA
>ECs1529 hypothetical protein
MSEIKSLVTAEAVKDVLRSEEVRSALKQQLRQNLEARLDAEVDAILDELL
GGPAAPEPEDGAGDSAVSDGVVSQPDGSSEPQPGGEMMM
>ECs4540 hypothetical protein
MMKLALTLEADSVNVQALNMGRIVVDIDGVNLAELINMVCDNGYSLRVVD
ESDRTSADCTPPFTALTGIRCSTAHITETDNAWLYSLSHQTSDFGESEWI
HFTGNGYLLRTDAWSYPVLRLKRLGLSKTFRRLVVTLIRRYGVSLIHLDA
SAGCLPGLPTFNW
>ECs3934 hypothetical protein
MFLYFRLKKNQRQEGFINTYPWPKKPGIFYICVPLKGANAQDVAAFCVTA
RGRQNCFRESVARGPALINNRGVSKGLKRESPSRGRGLIEGKGYDEARHH
TGCVVTVKFPDLLTTHQRGEKSSLTLVPLL
>ECs3158 hypothetical protein
MAITTLRLVARITLTRNVLALSRSGGLYCLVGRIQDINAVRQVKTALLSS
RDLSVYSMNAPWFIHGIDFSDHLNYWQHDILAVMITDTAFYRNKQYHLPG
DTADRLNYQKMAQMVDGVITLLHNSK
>ECs5284 hypothetical protein
MPLSNVLQSQIITDNHFLHHPKVESELTRKYERARLDTENIYLLPLARGN
NHNYDGKSVVEIRKLDISKESWPFNYVTEACRESDGITTTGRMLYRNLKI
TSALDEIYGGICKKAHAATELAEGLRLNLFMKSPFDPVEDYTVHEITLGP
GCNVPGYAGTTIGYISTLPASQAKRWTNEQPRIDIYIDQIITVSGVANSS
GFALAALLNANIELGNDPIIGIEAYPGTAEIHAKMGYKVIPGDEDAPLKR
MTLQPSSLPELFELKNGEWNYIGK
>ECs4912 HtrC
MKQEVEKWRPFKHPDGDVRDLSFLDAHQVIYVQHHEDKEPLEYRFWVTYS
LHCFTKDYEHQTNEEKLSLMYHAPKESRPFCQHRYNLARTHLKRTILALP
ESNVIHAGYGSYAVIEVDLDGGDKAFYFVVFRAFSEIFHHCLKLIEK
>ECs1223 hypothetical protein
MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQ
TSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA
DFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIV
HLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFE
QIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTP
RQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYA
GMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQK
AGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT
AVKL
>ECs1277 putative outer membrane protein
MSKSTFLHILISSIILVALIQSSAWANCTNTQIGQTEDGRTALIEFGKIN
MTDTYFAPAGSLLATTVVPPTNYTSGGATGSSVLWECDATDLPNIYFLVA
TNGDDRVGGFYDAGGPDGLSDVYATWFAFVGLKQTMAGVTLGRYWKKVPI
TSYATQGTKIQIRLQDIPPLHAELYRISTLPDTSATTSWCGNNNTDSSGV
GFAKPSGTIYNCVQPNAYIQLSGTSGILFGHDEPGEDSSVHWDFWGADNG
FGYGMRSANRLYNNATCVARSATPLVLLPTIAEAQLNAGMESTGNFNVRV
ECSNSVQSGISDTQTALGIQVSEGAYTAAQKLGIINSNGGVSALVSDNYD
AAEMAKGVGIYISNSAHPDTAMTLVGQPGIAKLTPGGNAAGWYPVFEGAT
LEGATHPGYSSYSYSFIARLKKLPNQTVSAGKVRATAYILVKMQ
>ECs5440 hypothetical protein
MKRNPLVVCLLIICITILTFTLLTRQTLYELRFRDSDKEVAALMACTSR
>ECs1996 hypothetical protein
MPVDLTPYILPGVSFLSDIPQETLSEIRNQTIRGEAQIRLGELMVSIRPM
QVNGYFMGSLNQDGLSNDNIQIGLQYIEHIERTLNHGSLTSREVTVLREI
EMLENMDLLSNYQLEELLDKIEVCAFNVEHAQLQVPESLRTCPVTLCEPE
DGVFMRNSMNSNVCMLYDKMALIHLVKTRAAHPLSRESIAVSMIVGRDNC
AFDPDRGNFVLKN
>ECs4294 hypothetical protein
MKRLLILTALLPFVGFAQPINTLNNPNQPGYQIPSQQRMQTQMQTQQIQQ
KGMLNQQLKTQTQLQQQHLENQINNNSQRVLQSQPGERNPARQQMLPNTN
GGMLNSNRNPDSSLNQQHMLPERRNGDMLNQPGTPQPDIPLKTIGP
>ECs2030 hypothetical protein
MADKVYLKYTPSDYSFNLGKNASGIVFNQTAPPEEGAEEKTINSSRGRQH
TDVYPALAGNTDTAMFH
>ECs1315 hypothetical protein
MIDADLSRYAQQLWIQGIFSARYSYQIDDLFAQMLFVLSQPGFCGHIHCS
PMGNRDRRLAVMRFCR
>ECs4548 hypothetical protein
MVFNSIFVIQGGIEDIRKNLKIGTDRKLRGQQKRMVTPGNATPTSRANPP
LQSWFAVLHTRLPGYTGAVWSENIHAA
>ECs1133 HyaF
MSETFFHLLGPRTQPNDDSFSMNPLPITCQVNDEPSMAALEQCAHSPQVI
ALLNELQHQLSQRQPPLGEVLAVDLLNLNADDRHFINTLLGEGEVSVRIQ
QADDSESEIQEAIFCGLWRVRRRRGEKLLEDKLEAGCAPLALWQAATQNL
LPTDSLLPPPIDGLMNGLPLAHELLAHVRNPDAQPHSINLTQLPISEADR
LFLSRLCGPGNIQIRTIGYGESYINATGLRHVWHLRCTDTLKGPLLESYE
ICPIPEVVLAAPEDLVDSAQRLSEVCQWLAEAAPT
>ECs1123 putative tail fiber protein
MNMAVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGR
YSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDV
RPEALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAANA
DTSAGDASESARQAAESAAAAKQSEEASSSSASAAAQKASESSQSAAEAE
LSRKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAEEAVNRIPT
VVGPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGERGPAGDAGPAGPQ
GPKGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGA
TGPQGPKGDPGETQIRFRLGPASIIETNSHGWFPGTDGALITGLTFLAPK
DATRVQVFFQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
>ECs1262 hypothetical protein
MRPISNLNTSAATYIPPQQLPSSCLETLTLLVQMNNYIKRDADYSTGMAV
APLVPEEVHLLAAAMTTELHRHQFQPVVSANDLQLPEPFTFDVAGFTITF
TKTQEHDNTGNLMKIAVSKSGVCTSTNITLELFHSIVTTLMSRSQYGTFD
LWSVRPILTDESQNRVHEAARYSPAQQYGREETHFTNNGMREFGNMSPLA
WRNDVELQQSCNTSNIPPNAANDDINNTRQTIEDQPEADEPQQYVKLTVD
DMRKWAAMDQQARNALQGVSGWCTRNHFNIKNARNYLTDHGLNYAGQVKV
NRPHEYAKFTLEHIRQWAALNKHVRKPVGYLEKWCKERNLAPTTARNYLK
NDGLTALGELKLTGPQKWVTFTFGDIVQWANMTQEERNSAGGAEKWSKKR
GFQWSTARSYLKSSGVTKQGARKLAWLKNSGNMSNPFYLAQPTSRRT
>ECs3545 hypothetical protein
MSYEVLLLGLLVGAANYCFRYLPLRLRVGNARPTKRGAVGILLDTIGIAS
ICALLVVSTAPEVMHDTRRFVPTLVGFAVLGASFYKTRSIIIPTLLSALA
YGLAWKVMAII
>ECs4654 hypothetical protein
MDNLIGTPPNHAVPHNYIDMEQMEYLCHLNRFSKLSNDFLINGPREDLFT
YKYVLDGSFSNLQHLLPKGQLQKIQQRLSSLMHKNMFHCFHVFLLKLCSI
ESIPLPNADYAFFDDEMPFTLTDEQIENISFLNAYHKEKKGNNDIVTFDF
MSADHHYNFSTTIALTNDSFHISSINNHNSQVIFDENIHLHPYELPESSQ
WCYQLIKNMISLHCRYNNNFKIN
>ECs1141 hypothetical protein
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVT
QPQLRDRLWWPGALLTDSAVKAKALKDYQHVMAQLASWEAEADDDVAATI
KSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGNYTLYTVQRPVTI
TLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVIVITPEGETVVA
PVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
>ECs2957 hypothetical protein
MLARAGFENVEKDNAYNGMTLREWARMSLTERGIGVASYNPMQMVGLALT
HSTSDFGNILLDVSNKGLIQGWEESEETFQKWTRKGRLSDFKTAYRVGMG
GFGSLRQVREGAEYKYITTSDRKETIALATYGEIFSITRQAIINDDLNML
VDVPMKMGRAAKATIGDLVYKVLTDNPKLSDGKALFHADHKNIATGGISV
SGLDAARQMMRLQKEGDRALNIRPAFMLVPVALETVANQTIKSASVKGAD
ANAGVINPIQNFAEVIAEARLDAADPKTWYLAAAQGTDTIEVAWLDGVDT
PYIDQQEGFTTDGIATKIRIDAGVAPLDWRGLVRSSVA
>ECs2660 hypothetical protein
MPHFNFDYQEFLMMVQHLKRRPLSRYLKDFKHSQTHCAHCRKLLDRITLV
RDGKIVNKIEISRLDTLLDENGWQTEQKSWAALCRFCGDLHCKTQSDFFD
IIGFKQFLFEQTEMSPGTVREYVVRLRRLGNHLHEQNISLDQLQDGFLDE
ILAPWLPTTSTNNYRIALRKYQHYQRQTCTGLVQKSSSLPASDIY
>ECs4964 hypothetical protein
MRRVRCWQISSTPCNSFHKSVRRPRMKNNRTFSTARGSRALSAGRWQQCL
RPLPLFLMMWLIGCAGPSVKYVPVKPVPIPAEWLADCLVPPAPEPFTFGA
SVTYNLQLLAVIKNCNVDKASIRRLETRRQHEFTDMAGTPAVPAGKTK
>ECs3768 hypothetical protein
MVLWQSDLRVSWRAQWLSLLIHGLVAAVILLMPWPLSYTPLWMVLLSLVV
FDCVRSQRRINARQGEIRLLMDGRLRWQGQEWSIVKAPWMIKSGMMLRLR
SDSGKRQHLWLAADSMDEAEWRDLRRILLQQETQR
>ECs4357 HicA-like protein
MRTKVQALRKKQKNTLDQIFKTPVPQGIKWSDIESLVKALGGEIKEGRGS
RCKLILNMSVACFHRPHPSPDTDKGAVESVRDWLLSIGVKP
>ECs3503 hypothetical protein
MRIYRRKCKCCNEWFIPKYQNQYWCNEICGTKIALERRSKEREKAEKAEK
AAEKKRRREEQKQKDKLKIQKLALKPRSYWIKQAQQAVHAFIRERDRDLP
CISCGTLTSAQWDAGHYRTTAAAPQLRFDERNIHKQCVVCNQHKSGNLVP
YRVELISRIGQEAVEEIESNHNRYRWTVEECRAIKAEYQQKLKKLRNSRS
EVA
>ECs2602 transcriptional activator FlhD
MHTSELLKHIYDINLSYLLLAQRLIVQDKASAMFRLGINEEMATTLAALT
LPQMVKLAETNQLVCHFRFDSHQTITLLTQDSRVDDLQQIHTGIMLSTRL
LNDVNQPEEALRKKRA
>ECs0275 Cro
MEQRITLKDYAMRFGQTKTAKDLGVYQSAINKAIHAGRKIFLTINADGSV
YAEEVKPFPSNKKTTA
>ECs1293 hypothetical protein
MTQMHKGFLLIGVFLYAAFVAPKIYAATPDAVNQVLQQGLPAQPAALATL
TEALKKEYEARHNVVALVFYAYGLLRQADGYAATNDFIHASEYAKSGFFW
LDEAVDLQEKNQRVRYLRARVDAYLPADSGRCVVTVKDTEHMLADPAIWT
TTIRDHILAMRYRALRHCKDTTRANALLAQIKGQNAALAQSLTQDFNVVP
EWDSEELTQVLLPLMKGE
>ECs0079 leu operon leader peptide
MTHIVRFIGLLLLNASSLRGRRVSGIQH
>ECs4967 hypothetical protein
MSDFITEDQRLVILRSLADYNGELGESVLQDCLDDYGHRVSRDTVHTHIA
WLAEQGLVRKRVLINGYFIAELTGRGQDVAEGRVCVPGVKKPRARG
>ECs5364 hypothetical protein
MVFSLSLWERAGVRAPARTFTLTLTLSLKGEGTDRAQILRDIFFCLVTEE
QKIGLLRLNIAEKKHGRE
>ECs2976 hypothetical protein
MTFTVKTIPDMLVEAYENQTEVARILNCSRNTVRKYTGDKEGKRHAIVNG
VLMVHRGWGKDTDA
>ECs1059 putative cell division inhibition protein
MEKLSCNASTSELRFEIGVITGDKTFIEDAIKQRKLEQDLLNEVCIPSML
ARLDLLQKGYKQ
>ECs1957 hypothetical protein
MVNLRKAAKGQICQIRIPGYCNHNPETSVLAHYRLAGTCGTAIKPHDMQA
AIACNSCHDLIDGRVKTSDYTKEELRLMHAEGVFRTQEIWRKDGYL
>ECs0301 hypothetical protein
MNQFEISYDDVVRLKHLRNVGEYVTGMAALQDCYEKPAGAQCEQLVSLIY
LMTEQLDGVVQRCHDDLMNAEVA
>ECs2601 FlhC
MSEKSIVQEARDIQLAMELITLGARLQMLESETQLSRGRLIKLYKELRGS
PPPKGMLPFSTDWFMTWEQNVHASMFCNAWQFLLKTGLCNGVDAVIKAYR
LYLEQCPQAEEGPLLALTRAWTLVRFVESGLLQLSSCNCCGGNFITHAHQ
PVGSFACSLCQPPSRAVKRRKLSQNPADIIPQLLDEQRVQAV
>ECs1415 CsgF
MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNS
YKDPSYNDDFGIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDY
IVDIANRDGQLQLNVTDRKTGQTSTIQVSGLQNNSTDF
>ECs1514 hypothetical protein
MNIDPAITIDMALNAGLALLGYFYIMFCSGRWLSLLFMKKWNKRRKQEQR
QKAMDAFFEAFGIDGMEPGDPARAISRGGVVILVYRSEEKNDDHKTTCRR
NHIPH
>ECs4472 hypothetical protein
MADKAILWALISASTKEGRKACSLSYFACKAAEAELGLAYMAANDNKEFL
TSLSNIMRYKIDAGLSESYTCYLLSKGKIIRPYLKNLNPLQLAADCIETV
NKIKDKNKKNH
>ECs1392 hypothetical protein
MAQPFQQSSIVIPVHPRECGQFHFCTVFPALTVYYLRFVKTIDALRHRVV
SKRKLTAVCSHQTKIGNLDAHLI
>ECs2612 hypothetical protein
MRLLILTLSLITLAGCTVTRQAHVSEVDAATGIVRLVYDQAFLQHAHTDR
YVSRGISDRACQQEGYTHAIPFGQPVGNCSLFAGSLCLNTEFTLSYQCHH
SAFPVFL
>ECs4140 hypothetical protein
MKRLIPVALLTALLAGCAHDSPCVPVYDDQGRLVHTNTCMKGTTQDNWET
AGAIAGGAAAVAGLTMGIIALSK
>ECs1231 putative outer membrane protein
MLPFFHCIIIAAVATRSITPHTQDVSHSHHTVHFSPFPYTESYSLSSCEF
PFHWQGDIMYPFQFFQIQLNKTASLIHTAH
>ECs0013 hypothetical protein
MKSVFTISASLAISLMLCCTAQANDHKLLGVIAMPRNETNALALKLPVCR
IVKRIQLSADHGDLQLSGASVYFKAARSASQSLNIPSEIKEGQTTDWINI
NSDNDNKRCVSKITFSGHTVNSSDMATLKIIGDD
>ECs1512 hypothetical protein
MSGWKKWRWAEILILRQCAGTMRVESIGYLIGRSESAVRTKARELGISMM
LRGDYHQSAKCSQRDIELAWQLHQRGVPRREIAEKFGMKLGAVNNYVYFD
RRVQE
>ECs4135 putative transport protein
MIRKYWWLVVFAVFVFLFDTLLMQWIELLATETDKCRNMNSVNPLKLVNC
DELNFQDRM
>ECs5420 hypothetical protein
MRISPELKEQFDKEAQSDGISLANWLKELGRTELKRRGIEPKG
>ECs1695 hypothetical protein
MPTLILSVDKIANRITAPRNVLSRTSAGVLARLTTMSVSGYIAGINNKML
VPSPLPAATGRSSGGIAYRRRHCDDFPFSGTNRCVTSGRYPGR
>ECs2019 hypothetical protein
MDERFINITPTSLSFKHADLSGLSYKNARLELALICPAIKSEVKISFDWI
HSFRVTDEGDLLGMSGKQGINQTGIYRVANSTYLTWFTMQSCEIHKNKMI
EHYVIATSNDVIDVLSTVSPQIVYD
>ECs5549 hypothetical protein
MPKTPRVYVAFCFYICNLNAALAMLGKFLEFAGMLCNLHIKWLIFAQDWW
VRNGLSLLNKGLRGYTSANNFL
>ECs5092 hypothetical protein
MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLA
RVQLQTWTNGGMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQK
MDHRQHSAFPELPQQIAALYEWFSARCRWKEKALTQRGLLVQAGEQSEQI
FTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRLGSPQAVALLLGDLR
VKATQHLAESINAAPTTRHYYHQWFASSTVPTGGDHADFLSWLGKWTTAD
KQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLV
>ECs2914 putative type-1 fimbrial protein
MMFRNRILLIFILWANFTWAGCRTTASLNITDGINVGEILANETSFSKSV
VFTGISCDTSTDKIVYKNIQSDWVEVGPFGNGEKLKVKIESLGKTSDTIG
KSSNAQAVLPYVVKIARGTPDFTGERKSTWFISDTVIANIGGESSSSIDF
WLGICKALKFNWCVNYLTSKLAGDTFTLGLNISYYPKNTTCKPENTVIKV
DDIALFQLRNQGKIAANSKEGTITLKCDNLFGDKKQASRNMVVYLSSSDL
VKGSNTILRGKTDNGVGFVLDLTEPPKGTEAAIKISANGDQGAATSLWKT
DKPGVSLNSNIINIPVMASYYVYDEKKVKSGALEATALINVKYD
>ECs0429 hypothetical protein
MADFTLSKSLFSGKYRNASSTPGNIAYALFVLFCFWAGAQLLNLLVHAPG
VYERLMQVQETGRPRVEIGLGVGTIFGLIPFLVGCLIFAVVALWLHWRHR
RQ
>ECs2627 hypothetical protein
MAIKHFPVVRFTSRGREYEVDERLITTIDKHRSEKDAHHIYLTDGTYFCA
TNVARVNLIRQVQEPRR
>ECs2962 hypothetical protein
MDNIRRLVVSTEEAREMIQRYREAEMAVLEGKSVIFNGQQLTLESLSQIR
AGRQEWERRLAAMVSRRRGKPGFKLARF
>ECs0260 hypothetical protein
MRVFKTKLIRLQLTAEELDALTADFISYKRDGVLPDIFGRDALYDDSFTW
PLIKFERVAHIHLANVNNPFPPQLRQFSRTNDESHLVYCQGAFDEQAWLL
IAILKPEPHKLARDNNQMHKIGKMAEAFRMRF
>ECs0019 hypothetical protein
MLFEIEVSSCTGNNTNCNSVRFNFNLNTILQVNLMTCGFEDQSIDTGLFV
LSRINNENYQFHHIRFDCIQGEQGELVFEPSDVEYYFEPVTIQESGTSIL
KNELENSEDGAGQIGFQLSNDGTNEIQYGKSNYYSFQHPHEGSNQIPLFI
RPRTYGNNVSSGQIMSRVKIVVMYN
>ECs1513 hypothetical protein
MRVRVYIAGPMTGYENFNREAFHKTEEVLKREGHTVLNPAVLPDGLTQPH
YMDICMAMLRCVDAVYMLKGWQQSAGAGAELALEEKPGHAVIFQEVGSEY
>ECs2249 hypothetical protein
MMPGPGISVMWHSRVRRCWFAWGRRSTSQRDVRQVRKRNVMRSMANVEPF
AANPKKPEIRT
>ECs1398 hypothetical protein
MTSPFIQQIADNRVCQVLTCLPEKFVVDFANGIDVAQEHIRTAGERTFFR
RLKEGLTGEGAARQNAINASLAQGVEASLRWLTEMTTSLATTNYAITRVN
DRVSSLVSDTARLAHYSADTREQLLTLADQVHHKLNHLEEKLHRVDQVQR
AQLHLEQIFSWWSAGRYASFSPAGRCYVALEELRWGAFGDVIRQGETGQV
NQLLDILRHKALTQMAQESGGSATVRLNTLDWLGGQGREQADNEWHDAIN
WLGDWCSEEQHPVIWSTTQAAEHLPVRMPRLCSAERLSESMVDEIFQKGA
A
>ECs3857 hypothetical protein
MLSSLNVLQSSFRGKTALSNSTLLQKVSFAGKEYPLEPIDEKTPILFQWF
EARPERYEKGEVPILNTKEHPYLSNIINAAKIENERIIGVLVDGNFTYEQ
KKEFLSLENEYQNIKIIYRADVDFSMYDKKLSDIYLENIHKQESYPASER
DNYLLGLLREELKNIPEGKDSLIESYAEKREHTWFDFFRNLAMLKAGSLF
TETGKTGCHNISPCSGCIYLDADMIITDKLGVLYAPDGIAVHVDCNDEIK
SLENGAIVVNRSNHPALLAGLDIMKSKVDAHPYYDGLGKGIKRHFNYSSL
HDYNAFCDFIEFKHENIIPNTSMYTCSSW
>ECs2025 hypothetical protein
MNQAIQMLASYPPSGKEKGYEAQPSGGVSAHYLHYDSDIHTPDPTNALRT
AVPGQ
>ECs2632 hypothetical protein
MILANDFLEYLLNTERDLAARVRDRYDMYLKSLPVPQLADGKIVIDGRYM
IDSHEGNYRLYRIEGGTPSVIGIYQRPSSAIVDVIADSIRITHRHADTED
TVLEIQRLATACRDTLNGMTK
>ECs2805 putative structural protein
MSDTLPGTTLPDDNHDRPWWGLPCTVTPCFGARLVQEGNRLHYLADRAGI
RGLFSDADAYHPDQAFPLLMKQLELMLTSGELNPRHQHTVTLYAKGLTCK
ADTLSSCGYVYLAVYPTPEMKN
>ECs1798 hypothetical protein
MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAP
VRRGKLRRNVVVLSRRSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNA
FYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR
>ECs2253 hypothetical protein
MARSGPTESLACDQTRGGSSGEKKRSPWTPRPLSRREKIPVSGQLTC
>ECs5565 hypothetical protein
MSFEKKEQNEVKIFGCVFSLVIIQGKLIRRVMTPISRTYIFDKSTDDFFM
KFYRTRETDGSAGQTFDSGSQGQIVTLNVLSKNLTSQMLLLW
>ECs0933 putative sensory transduction regulator
MTSLVVPGLDTLRQWLDDLGMSFFECDNCQALHLPHMQNFDGVFDAKIDL
IDNTILFSAMAEVRPSAVLPLAADLSAINASSLTVKAFLDMQDDNLPKLV
VCQSLSVMQGVTYEQFAWFVRQSEEQISMVILEANAHQLLLPTDDEGQNN
VTENYFLH
>ECs0290 hypothetical protein
MSQPLSEILTWDDEQWEVFVHDWLIVCKSDDYPWSERLGGAGDKGRDVVG
YKSDPNVEGYSWDNYQCKLYKKSLGFSDVVVEFGKLIYFTLNGDYPIPQK
YFFVAPYDLSTTFSNLLKNKNELKKAVLDSWDSAISKK
>ECs2918 hypothetical protein
MNKYWLSGAVFLAYGLASPAFSSDTTTLTINGRISSPTCSMDVVNNHLQQ
RCGQLLQRVETNYRASSTAKGVTTEVVAVGNNDKRKIVLNRYD
>ECs2715 EspF-like protein
MINNVSSLFPTVNRNITAVYKKSSFSVSPQKITLNPVKISSPFSPSSSSI
SATTLFRAPNAHSASFHRQSTAESSLHQQLPNVRQRLIQHLAEHGIKPAR
SMAEHIPPAPNWPAPPPPVQNEQSRPLPDVAQRLVQHLAEHGIQPARNMA
EHIPPAPNWPAPPLPVQNEQSRPLPDVAQRLVQHLAEHGIQPARSMAEHI
PPAPNWPAPPPPVQNEQSRPLPDVAQRLMQHLAEHGIQPARNMAEHIPPA
PNWPAPTPPVQNEQSRPLPDVAQRLMQHLAEHGIQPARNMAEHIPPAPNW
PAPTPPVQNEQSRPLPDVAQRLMQHLAEHGINTSKRS
>ECs1313 hypothetical protein
MESFVQDSPFYSGRDLYWLRPKVELTLEEKLYYCSCIRRNRHKYSYGRQA
NRTLKNLLVPSLDSVPAWVYGVTGKIISELSER
>ECs1755 hypothetical protein
MSITAQSVYRDTGNFFRNQFMTILLVSLLCAFITVVLGHVFSPSDAQLAQ
LNDGVPVSGSSGLFDLVQNMSPEQQQILLQASAASTFSGLIGNAILAGGV
ILIIQLVSAGQRVSALRAIGASAPILPKLFILIFLTTLLVQIGIMLVVVP
GIIMAILLALAPVMLVQDKMGVFASIRSSMRLTWANMRLVAPAVLSWLLA
KTLLLLFASSFAALTPEIGAVLANTLSNLISAILLIYLFRLYMLIRQ
>ECs1788 hypothetical protein
MSSKNRPRRTTTRNIRFPNQMIEQINIALDQKGSENFSAWVIESCRRELA
ADIKYARQLTIKKNDTQYALRWLFI
>ECs1725 putative factor
MAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQ
VNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQL
GLTQQDDGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAW
GEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSV
SVEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGE
NQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY
RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLT
PGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLT
LVEPFDALSNDELRWEP
>ECs3517 hypothetical protein
MLINKSNGFNASAVWGSGSYNENKSSKHMELLAHSIVKLICKEAASETYR
GALEILQKIMSECIYQEGNAFVIMGAGEQLKRIKYDVDENNLKVFNVHFD
NNEVLVTDGEPDVVCLSKQVWENLLIKLKPEIKENVASEVHKSANKGEIE
QLVEWSKRNEQTLFDNIIKSDFHVGSLKPGSMNGVILEMPPNVCMEPRNS
YENKIDEVSSLSESEEHPIDIQKITDAFVKEFKGILFDKNGRSSELLFNF
YECCYTFLPRAQPQDKIDSYNSALQAFSIFCSSTLTHNNVGFDFKLFPEV
KLSGEHLETVFKYKNGDDVREIAKINITLQKEEGGLYNLRGLDFKGCFFS
GQNFSNYDIQYVNWGMSLFDVDTPCIFNTPANHESYEKSLKPVSENGLNG
VLSDRNKKIKMITGVAPFDDILFMDDDFDDNSPEDAPIENSPVVNSPLV
>ECs2739 putative endopeptidase
MNRVLCVVIIVLLVACGVLSLGLNHYRDNAITYKAQRDKKASELKLANVT
ITDMQVRQRDVAALDARYSRELADARAENETLRADVAAGRKRLRINANCP
GSLRKAPITSGVDNATGPRLAEAAERDYFILRERLMAMQKQLEGAQEYIR
TQCIP
>ECs2432 hypothetical protein
MTYQQAGRIAVLKRIMGWVIFIPALISTLISLLKFMNTRQENQEGINAVM
LDFTHVMIDMMQANTPFLNLFWYNSPTPNFNGGVNVMFWVIFILIFVGLA
LQDSGTRMSRQARFLREGVEDQLILEKAKGEEGLTREQIESRIVVPHHTI
FLQFFSLYILPVICIAAGYVFFSLLGFI
>ECs2286 hypothetical protein
MSKVFICAAIPDEQAIKEEGAVAVATAIEAGDERRARAKFHWQFLEHYPA
AQDCAYKFLVCEDKPGIPRPALDSWDAEYMQENRWDEASASFVPVETESD
PMNVTFDNLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHI
VEALMKMPEVNAMYPELKLHAIGWVKHKCKPGAKWPEIQAEMRIWKKRRE
GERKEAGKYTSVVDLARARANQQHTENSTGKINPVIAAIHREYKQTWKTL
DDELAYALWPGDVDAGNIDGSIHRWAKNEVIDNGREDWKRISASMRKQPD
ALRYDRQTIFGLVRERPIDIHKDPVALNKYITEYLTTKGVFEDEGTNQSA
TDTLSSPVPETDAVETAIPDNEKTECKVEVEPSVEREGPFYFLFTDKDGE
KYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNANTAQNSEQ
PEPVKVTADEVKKIMQAANISQPDAEELLAVSRGEFVEGISDPNDPKWVK
GIQTRDSVNQNQQETEQNDQKAEQNSPNTQQNEPETKQPEPVVQQEPEKI
CTACGQSGGGNCPDCGAVMGDATYQEIFDGENQPEVQENDPEEMEGTAHQ
HKENTGGNQHHASDSETGEASDPLIKANGHHNLTSTSRAGIHLMIDLETM
GKNPDAPIISIGAIFFDPQTGDMGPEFSKTIDLDTAGGVIDRDTMKWWLK
QSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVHVWGNGANFDNTIL
RRSYERQGSPCPWRYYNDRDVRTIVELGKAIDFDARTAIPFEGERHNALD
DARYQAKYVSAIWQKLIPSQADS
>ECs4656 hypothetical protein
MLSFPSEWCILLTEPFESTNAIKKTGIISLYLLFIAIALYGFIQLKRDCF
QRLWIIPGR
>ECs0281 hypothetical protein
MKPVFDENGLATVPGDMRCFYYDAETSEYTGWSDEYINTGVSMPACSTGI
DPGENIPGRVAVFTGKGWSHEEDHRNETVYSIENGAAVTVDYIGAIKDGY
VTISPLTPYDKWDGEKWVTDTEAQHSAAVDAAEAQRQSLIDAAMASISLI
QLKLQAGRKLTQAETTRLNAVLDYIDAVTATDTSTAPDVIWPELPEA
>ECs1158 periplasmic glucose-1-phosphatase
MNKTLIAAAVAGVVLLASNAQAQTVPEGYQLQQVLMMSRHNLRAPLANNG
SVLEQSTPNKWPEWDVPGGQLTTKGGVLEVYMGHYMREWLAEQGMVKSGE
CPPPDTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKMGTMDPTFN
PVITDDSAAFSEQAVAAMEKELSKLQLTDSYQLLEKIVNYKDSPACKEKQ
QCSLVDGKNTFSAKYQQEPGVSGPLKVGNSLVDAFTLQYYEGFPMDQVAW
GEIKSDQQWKVLSKLKNGYQDSLFTSPEVARNVAKPLVSYIDKALVTDRT
SAPKITVLVGHDSNIASLLTALDFKPYQLHDQNERTPIGGKIVFQRWRDS
KANRDLMKIEYVYQSAEQLRNADALTLQAPAQRVTLELSGCPIDADGFCP
MDKFDSVLNEAVK
>ECs0816 hypothetical protein
MKDGALLRSSSLFIAYMGCLGWGSAYFYGWGTSFYYGFPWWIVGAGVDDV
ARSLFFAVIVIAIFLIGWGIGVVFFFAVKRKHSMQELNVFRLYFAVELLF
VPAIIEFSILRQKIQVPLLLLSAAIALAVTISIRSYGRFLSVSCFYDKPF
IKKHFFEIVMIAFVAYFWLFSFLTGYYKPQFKKEYEMINYNDGWYYVLAR
YDNCLVLSTSFNAGSKRFVIYQSAQDKNLQVDIVRTRI
>ECs2230 hypothetical protein
MNILKKIMQRLCGCGKHDDRENGELLTAQLRLGPADILESDENGIIPEQD
RVITQVVILDADKKQIQCVVRPLQILRADGTWENIGGMK
>ECs1617 NinE
MATPLIRVMNGHIYKVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLA
ARRNHG
>ECs4528 hypothetical protein
MNASQRQQVRQFLLDTALQRMDNERGFNNVLCWLAVFNTLGGAAPLIRSL
WSRWWALDTPGKAVCAIQYAAHLIYPIEANPLWSQEWIGWGHPLGHKDGW
SSDNRAFLRQMLTPEMIVAGVQAAAEILRGEPEGAMAARIAQDAYEAMDI
LTIQIEDLLRDLSCDESGHALE
>ECs5360 hypothetical protein
MTKVRNCVLDALSINVNNIISLVVGTFPQDPTVSKTAVILTILTAT
>ECs2691 hypothetical protein
MKVNDRVTVKTDGGPRRPGVVLAVEEFSEGTMYLVSLEDYPLGIWFFNEA
GHQDGIFVEKAE
>ECs5423 hypothetical protein
MSNNLNECDDTFWNGSILGYWLKYTHTRKELLLICRVVSSFTSVRVGILQ
HREKSHNFYEITVLTMGNDKYQ
>ECs1796 hypothetical protein
MTALLTLEEIKAHLRVDHDADDDMLMDKVRQATAVLLAYIQGSRDKVIRE
DGELIPGEALTRMKGAAMRLTGMLYRNPDLAEREELIQGELPFSVSVLIY
DLRCPTVL
>ECs1815 hypothetical protein
MLPTSGSSANLYSWMYVSGRGNPSTPESVSELNHNHFLSPELQDKLDVMV
SIYSCARNNNELEEIFQELSAFVSGLMDKRNSVFEVRNENTDEVVGALRA
GMTIEDRDSYIRDLFFLHSLKVKIEESRQGKEDSKCKVYNLLCPHHSSEL
YGDLRAMKCLVEGCSDDFNPFDIIRVPDLTYNKGSLQCG
>ECs1769 putative phage replication protein
MQQHQVAPHHGQFRKFGQHVSSGNVKTDLSATETAWKLWELMGVVYSNRW
IQKNGAAPSKLWIAQIGAMTEQQIRQVCRQCMDRCRAGETWPPDLAEFVA
LISKSGANPFGLTVDAVMEEYRRWRNESWRYDGSDKYPWSQPVLYHICLE
MRSKGIERQMTEGELKRLAERQLTKWAKHVSNGLSVPPVRRQLAAPKRPS
GPTPIELLKQEYERRKAAGFV
>ECs1180 hypothetical protein
MEFHESAICDFRANANSVKPQPIAVLFKTMGAWAVLCFAADDTDARMAIG
QEMEMDPTNDEFIIYGAPSNYLLDTCNIYNKAA
>ECs1094 hypothetical protein
MKGIEVETPASLDLTRAAAFAIRIVAIAVLVWAIRWW
>ECs3713 hypothetical protein
MINDLKSILLKSSEEVDVFIKIFESWVTKLPSISGPVNLHIPTSFKDKSL
EVESYFVDKSIWNVHIAYHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLIN
NHCFSPDKVNEICEQARHYLVEKMCETHSLAMNNSVLTNPEDS
>ECs4240 putative dehydrogenase
MSTIVIFLAALLACSLLAGWLIKVRSRRRQLPWTNAFADAQTRKLTPEER
SAVENYLESLTQVLQVPGPTGASAAPISLALNAESNNVMMLTHAITRYGI
STDDPNKWRYYLDSVEVHLPPFWEQYINDENTVELIHTDSLPLVISLNGH
TLQEYMQETRGYALQPVPSTQASIRGEESEQIELLNIRKETHEEYALSRP
RGLREALLIVASFLMFFFCLITPDVFVPWLAGGALLLLGAGLWGLFAPPA
KSSLREIHCLRGTPRRWGLFGENDQEQINNISLGIIDLVYPAHWQPYIAQ
DLGQQTDIDIYLDRHLVRQGRYLSLHDEVKNFPLQHWLRSTIIAAGSLLV
LFMLLFWIPLDMPLKFTLSWMKGAQTIEATSVKQLADAGVRVGDTLRISG
TGMCNIRTSGTWSAKTNSPFLPFDCSQIIWNDARSLPLPESELVNKATAL
TEAVNRQLHPKPEDESRVSASLRSAIQKSGMVLLDDFGDIVLKTADLCSA
KDDCVRLKNALVNLGNSKDWDALVKRANAGKLDGVNVLLRPVSAESLDNL
VATSTAPFITHETARAAQSLNSPAPGGFLIVSDEGSDFVDQPWPSASLYD
YPPQEQWNAFQKLAQMLMHTPFNAEGIVTKIFTDANGTQHIGLHPIPDRS
GLWRYLSTTLLLLTMLGSAIYNGVQAWRRYQRHRTRMMKIQAYYESCLNP
QLITPSESLIE
>ECs2799 hypothetical protein
MKLPVKLLMSLISLVRVIARAGEYKNYSRDEIKYWRYTSYKGGKLPEGFT
DEKFSSAIYNGRIFTMKRLHTLMLFLAVLFTGFNFNAEAATVKQALSCNP
EAWAEQPGACPSTYELYEGDATYKAAIDKALKPVGLSGMFGKGGYMDGPG
GGITPVNINGTVWFQGDGCKANTCGWDFIVTLYNPKTHEVVGYRYFGLDD
PAYLVWFGEIGVHEFAYLVKNYVAAVN
>ECs0741 hypothetical protein
MRFAKGVLLAICLIFLPLKAALALNCYFGTANGAVEKSEAIMPFAVPANS
KPGDKIWESDDIKIPVYCDNNTNGNFESEHVYAWVNPYPGIQDPYYQLGV
TYEGVDYDASLGKSRIDTNQCIDSKNIDIYTPEQIIAMGWQNKLCSGDPT
VMHKSRTFVARMRLYVKVRAMPPHDYQSKLSDYIVVQFDGAGSVNEDPTA
KNLKYHITGLENIRVLDCSVNFAISPETQVVDFGRFNVLDIRRHTMSQQF
KITTTKSQNDQCTDGFKVSSSFYTDETLIDEDKSLLIGNGLKLRLLDENA
SPYTFNKYSEYADFTSDLLVYEKTYTAELSSTPGTPIDVGPFDTVVLFKI
NYN
>ECs0272 putative transcription antitermination protein
MCDTKRSIGRKCESGLAANVPLRGVFVQDYDSHTQPKLTNRRIQMDAQTR
RRERRAEKQAQWKADRILASTFRRNATKRHY
>ECs2189 hypothetical protein
MSIKHYDVVRAASPSDLAEKLTHKLKEGWQPFGSPVAITPYTLMQAITAE
GDVVVSGATEPDWYYVIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLAR
RSTVTPGGAACRYNDIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQG
LHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGIFSESTGASQDSARW
GVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSAATHAQQPALF
TAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYG
GYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGN
QVSSNRPTHFSSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPG
GNTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQS
NKTWSLTHPVDDAITLLTQGGRLTCKFRLSGALTNNQFGLGIYLYTDAPV
PDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQ
TLELVFTAGSATVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAY
GV
>ECs4252 hypothetical protein
MASLIQVRDLLALRGRMEAAQISQTLNTPQPMINAMLQQLESMGKAVRIQ
EEPDGCLSGSCKSCPEGKACLREWWALR
>ECs0898 hypothetical protein
MASTFTSDTLPADHKAAIRQMKHALRAQLGDVQQIFNQLSDDIATRVAEI
NALKAQGDAVWPVLSYADIKAGHVTAEQREQIKRRGCAVIKGHFPREQAL
GWDQSMLDYLDRNRFDEVYKGPGDNFFGTLSASRPEIYPIYWSQAQMQAR
QSEEMANAQSFLNRLWTFESDGKQWFNPDVSVIYPDRIRRRPPGTTSKGL
GAHTDSGALERWLLPAYQHVFANVFNGNLAKYDPWHAAHRTEVEEYTVDN
TTKCSVFRTFQGWTALSDMLPSQGLLHVVPIPEAMAYVLLRPLLDDVPED
ELCGVAPGRVLPVSEQWHPLLIEALTSIPKLEAGDSVWWHCDVIHSVAPV
ENQQGWGNVMYIPAAPMCEKNLAYAHKVKAALEKGASPGDFPREDYETNW
EGRFTLADLNIHGKRALGMDV
>ECs2939 hypothetical protein
MRVEICIAKEKITKMPNGAVDALKEELTRRISKRYDDVEVIVKATSNDGL
SVTRTADKDSAKTFVQETLKDTWESADEWFVR
>ECs3897 hypothetical protein
MKIILLFLAALASFTVHAQPPSQTVEQTVRHIYQNYKSDATAPYFGETGE
RAITSVRIQQALTLNDNLTLPGNIGWIMIRFVIVRILAIWC
>ECs1767 hypothetical protein
MKITTEQVCEALDTWVCRPGMTQEQATILITEAFWALKERPNIDVQRVTF
NDGEVDQRALGVNRVKIFERWKAIDTRDKRDKFTALIPAIMEAIRISDFR
LYCEITDGKSITYMIAGLNKEYGDVVESGLLFADPVVVERETDELIEKAI
AFKHAYRQQYQYYFADKQMSARGAYEYRCTTMG
>ECs1965 putative antirepressor protein
MNMMAVPFHGNSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKLRQRFAS
TITEIVMVAEDGKQRNMVSMPLRKLAGWLQTINPNKVKPEIRDKVIRYQE
ECDDVLYEYWTKGFVVNPRKMSVMEELNQACADMKRDKNIASVFATGLNE
WKQVKAAHVSKIRTLVNEANMLIDFVLADTGKGKITKAD
>ECs1800 putative major tail subunit
MKCRTIFRKTAVTVQPWSFRSRCNFFNRTHNPPRAGFLLSGGRMSALYER
SQLTQVMISSAPATAETMEKAEYLRLDCTIKEVQFTAGQKQDIDVTTLCS
TEQENINGLGASSEISMSGNFYLNQAQNALRDAYDNDALYAFKVLFPSGK
GFKFLAEVRQHTWSSGTNGVVAATFSLRLKGKPVSFVVPLAFVKNPDKTL
TVNTGALLTMSVSVNGGTPPYKHAWKKDGQPVEGQTTDTFSKANTQSGDK
GAYTCEVTDSAEQPQSITSDACTVTVNGAGG
>ECs3234 hypothetical protein
MEIKLIDNPVKLAEFLNNPVNTGNIVDSGDKYYIKPDAVYLGIYEGLVLA
GVHEVRNFWHSVVECHAVYDPGFRGEYALQGHRLFCKWLLENSPFLNSIT
MVPDTTKYGRAIIRLLGATRVGHLDDAYMSNGKPVGITLYQLPRSKYEEL
LNVST
>ECs0802 hypothetical protein
MHFRVTGEWNGEPFNRVIEAENINDCYDHWMIWAQIAHADITNIRIEELK
EHQAA
>ECs1421 CsgC
MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGS
SGQSQTKQEKTLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLS
QQWPPSSEKS
>ECs4827 putative glycoprotein
MKKSTLSLAIGLLLACSTGMAKTQHLMLEQRMALLEERLEAAEMRAAKAE
SQVKQLQTQQAAEIREIKAAQGNTPVNGQATAESAKKNATSPNLLLSGYG
DLKIYGDVEFNMDAESNHGLLAMTNADVNSDPTNEQWNLNGRILLGFDGM
RKMDNGYFAGFSAQPLGDMHGSVNIDDAVFFFGKENDWKVKVGRFEAYDM
FPLNQDTFVEHSGNTANDLYDDGSGYIYMMKEGRGRSNAGGNFLVSKQLD
NWYFELNTLLEDGTSLYNDGNYHGRDMEQQKNVAYLRPVIAWSPTEEFTV
SAAMEANVVNNAYGYTDSKGNFVDQSDRTGYGMSMTWNGLKTDPDNGIVV
NLNTAYLDANNEKDFTAGINALWKRFELGYIYAHNKIDEFSGVVCDNDCC
IDDEGTYTIHTIHASYQFANVMDMENFNIYLGTYYSILDSDGDKKHGDDT
DDRYGARVRFKYFF
>ECs1610 excisionase
MSRLITLRDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRN
SRFVGTLAEPQLPINANPKLQRIIADGC
>ECs1967 hypothetical protein
MSSKNRPRRTTTRNIRFPNQMIEQINIALDQKGSENFSAWVIESCRRELA
ADIKYARQLTIKKNDTQYALRWLFI
>ECs2228 hypothetical protein
MPGLVSYISSTSFANEMAEMRQQVMEGQIGGFLLGGERVRVSYLFQLH
>ECs4940 hypothetical protein
MTNTALPDALRLPGLPDNCNILNFRDYVGRIRLYSASGINNVYFATNLSA
P
>ECs1357 hypothetical protein
MPGFFLGIDPLSTGEYGKGGGRKQSYVNSGMNLLCRPRVYLWLLVKASIP
CLISTACPSDICYFCDL
>ECs1956 hypothetical protein
MAHIQLVKQTSSGLLLPATPESCDFLHQIKIGEWIHADFKRVRNYAFHKR
FFKLLQLGFDYWTPVGGAITPRERELLSGFVDYLCESVGREHTPALSDAA
EQYLNTVATRRTRDTALLKSFEAFREWVTIQAGFYTEHIYPDGSRGRRAK
SIAFANMDEVEFQQVYKSVLNVLWNWILFRKFSSQEEVENVAAQLLEFA
>ECs1187 CII
MEQTSYSKLSQREIDRAETDLLINLSTLTQRGLAKMIGCHESKISRTDWR
FIASVLCAFGMASDISPISRAFKYALDEITKKKSPAATEDFKQIDMQF
>ECs2050 hypothetical protein
MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRT
AIAHHGRKHKEDMEARHEQLTKGGTIL
>ECs1082 hypothetical protein
MRVLLRPVLVPELGLVIVKPGRESMSAFHNGRILVEPEPKSMRALPSGVV
PAVHQPLAEDKSLLPFFSDERVIRAAGGAGALSDWLLRHVKSCQWPHGDY
HHSETVIHRYGTGAMVLCWHCDNQLRDQTSESLEQLAQQNLAAWMIDVIR
HAMNGIQERELSLAELSWWAVCNQVVDALPEAVSRRSLGLPAEKIRSVYR
ESDIIPGEQTATSILKQRTKNIALPPHTHQQQNPPQEKTVVSIAVDPESP
KSFMKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTK
SHDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFLDHAFATGVLG
>ECs4966 hypothetical protein
MWVSIVKDYVVPILSATATAGGIFMALMRKTFVPREAFEKLSDRVEKVET
RLSSLPTEAEVNRLNVEIVTLRGELKTTNATLRSVSYQNELLLEQAVRKK
TQ
>ECs4608 hypothetical protein
MVSVTSYSYPTETENDNSQFQNQSLRESIDKSLEDNIYKFNMIEQSGLLD
NYRARFKDDTFRDIFRSFLLYLLDSFHIYRHDISDKLVPYFHVTPESENA
NYKT
>ECs3852 hypothetical protein
MYKRQVTGVVNKTANVAAHAVVNAALAVVQGNNARASAAGAATGDVVGMI
ATEMCGISPGTVTCKR
>ECs1963 hypothetical protein
MKKKYELVVKEINNYPDKIAVTVALEIGGHPSLLLPHVAISLDRTEGATL
EFYEAEAKKQAKQFFMDVAAGLCEGDGPLPEKRPVILEAQDVLITYRGKL
PGIITGSLKTPPLA
>ECs1084 putative anti-termination protein
MARDIQMVLERWGAWAANNHEDVTWPSIAAGFKGLIPTKVKSRPKCSDDD
AMIICGCMARLNKNNQYLHDLLVDYYVGGMTFMALARKHRCSDGLIGKRL
YKAEGIIEGMLMALNVRLDMDMR
>ECs4587 hypothetical protein
MITITELEDEIIKNKEAANVFIEKINDKKNEIHEKMKHPLDKVTYDEAKE
LLIACDAAIRTIEIMRIRINNK
>ECs3157 putative sulfatase/phosphatase
MMVTVVSNYCQLSQTQLSQTFAEKFTVTEELLQSLKKTALSGDEESIELL
HNIALGYDEFGKKAEDILYHIVRNPTNDTLSIIKLIKNACLKLYNLAHTA
TKHPLKSHDSDNLLFKKLFSPSKLMAIIGEDIPLISEKQSLSKVLLNDKN
NELSDGTNFWDKNRQLTTDEIACYLKKIAANAKNTQVNYPTDFYLPNSNS
TYLEVALNDNIKSDPSWPKEVQLFPINTGGHWILVSLQKIVNEKNNTQQI
KCIIFNSLRALGHEKENSLKRIINSFNSFNCDPTRETPNNKNITDHLTEP
EIIFLHADLQQYLSQSCGAFVCMAAQEVIEQMESNSDSAPYTLLKNYADR
FKKYSAEEQYEIDFQHRLENRNCYLDKYGDANINHYYRNLEIKNSHPKNR
ASSKRVS
>ECs2333 beta-lactam resistance protein
MNRLIELTGWIVLVVSVILLGVASHIDNYQPPEQSTTVQHK
>ECs5418 hypothetical protein
MAKIRYLQGTHDARAGDIRDVAQPCAEVLVRLGKAEYITVRRPAGQKKKR
DAEHGECGTFYGEPEKTRNQDVT
>ECs1763 hypothetical protein
MEFKDLPTPLQEMASNIVRSQLATLDLSTAEKETIDNMVRNVRNAFSGLY
GSDNQKQESDVNKRVISVCVNGHVLSSIKTETATVFDCLCIVQSLVDALF
RSVNLENDANLRGRIIAHPYAHTLGSVDIKDPTNL
>ECs2967 putative antirepressor protein
MNMMAVPFHGNSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKLRQRFAS
TITEIVMVAEDGKQRNMVSMPLRKLAGWLQTINPNKVKPEIRDKVIRYQE
ECDDVLYEYWTKGFVVNPRKMSVMEELNQACADMKRDKNIASVFATGLNE
WKQVKAAHVSKIRTLVNEANMLIDFVLADTGKGKITKAD
>ECs4975 hypothetical protein
MNEHHTVAATDDSGLQVSGDNSVTVLAEVRCSRSAFRRAGFLFTRGRQQV
EVTPEQLARLEEEPCLTVRILQTSADDAGSVAGVVHAVAGTDLAEAEAEA
EAEAEAEAEAGQDAPRKKTGNKAEQARA
>ECs1821 hypothetical protein
MPFSIKSIFSGHTWHQPEISRPIADKSSTKNCILDSTTCNVDGFTVFNRR
SCSFDMRPPGSADRTPQLRLSISEVAWMSKIIETETNNTNKS
>ECs0320 putative receptor
MRVNLLIAMIIFALIWPVTALRAAVSKTTWADAPAREFVFVENNSDDNFF
VTPGGALDPRLTGANRWTGLKYNGSGTIYQQSLGYIDNGYNTGLYTNWKF
DMWLENSPVSSPLTGLRCINWYAGCNMTTSLILPQTTDTSGFYGATVTSG
GAKWMHGMLSDAFYQYLQQMPVGSSFTMTINACQTSVNYDASSGARCKDQ
ASGNWYVRNVTHTKAANLRLINTHSLAEVFINSDGVPTLGEGNADCRTQT
IGSRSGLSCKMVNYTLQTNGLSNTSIHIFPAIANSSLASAVGAYDMQFSL
NGSSWKPVSNTAYYYTFNEMKSADSIYVFFSSNFFKQMVNQGISDINTKD
LFNFRFQNTTSPESGWYEFSTSNTLIIKPRDFSISIISDEYTQTPSREGY
VGSGESALDFGYIVTTSGKTAADEVLIKVTGPAQVIGGRSYCVFSSDDGK
AKVPFPATLSFITRNGATKTYDAGCDDSWRDMTDALWLTTPWTDISGEVG
QMDKTTVKFSIPMDNAISLRTVDDNGWFGEVSASGEIHVQATWRNIN
>ECs0304 hypothetical protein
MTSYSNFSNQIKETINNKFDHEIHDWDIIKNSITTLINKNIHGAGRNIVD
FIDLGNWDFISNFSFDDSTRRLELEWHPNDKFHIYIESVVFVEFNDTIYA
FLKGYYHNQLSLNRIYNTKCSSCSFENSGSYMVDVYRTVKRVNETIQTPN
INCYTTCILTRPANGHVTSTGFSRNLMDAINISLAEHKIASLHNEVMSIE
EYDRDSLQEKGNTARRYLEYILMLVNIRIMHLNNVQYQEQMLGSLVSVIE
ALDYEPLMKNDVEITKDILNACSHHGGVRIEKKDVIFSLEVIENLIKAIK
KTDINKLQLDGMFKSIQK
>ECs5246 hypothetical protein
MRTVEILQYTLRKGSGAAFHAIMQEISVPLHQSHGIDVVSFGNSLDDLDC
YYLIRAFDSAKSMTAVLDAFYASADWRSGPREDIIGSIENCIKTVISLPS
ESVKGLRMQS
>ECs3719 EprH
MLFFLFVIIFIAFYIVNASNDPQLRHIDKILVNKNRNYEILYGRDHVIYI
NTNSLDEAVWVKQALEKNQPGKPVRVINPDDESIRIFSWLADNFPDLQYF
KLQLLDASNPRLTVSKQRNAITQQLIDNLIKGLLQTMPYASNISIAVLDD
NVLESQAIETLSAIGLSYEKYKTANNVYFNIIGTLSDSELNKINNYVDEY
YKQWGKQYVRFNVNLKNQDTNNSSFSYGDNRFEKSQGSKWTFQE
>ECs3783 hypothetical protein
MHRPEPNFLIATSSPTFTAVINPLFLQCMSALHDVLRVRSKKRFKGRSMR
CGYLKRVYDGRAAWGCNGKALTEARA
>ECs5438 hypothetical protein
MAKIRYLQGTHDARAGDIRDVAQPCAEVLVRLGKAEYITARRPAGQKKKR
DAEHGECGTFCGEPEKTRNQDVT
>ECs5593 hypothetical protein
MTDTHSIAQPFEAEVSPANNRQLTVSYASRYPDYSRIPAITLKGQWLEAA
GFATGTVVDVKVMEGCIVLTAQPPAAAESELMQSLRQVCKLSARKQRQVQ
EFIGVIAGKQKVA
>ECs4993 hypothetical protein
MTSKWVQLSSMPGNFTVKVSGGTAAFLEAPFPPAETKGGMTFADCLISFN
TRDCLWVRPVSGDPSVEITGAGIGAVIPLSADVAGTAEPSDWDNAETHTR
PSGNETASSSPSWYYVVVLAGQSNGMAYGEGLPLPETYDRPEPRIMQLAR
RSTVTPGGKACQYNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQG
LHIAKKLLPFIPADAGILLVPCCRGGSAFTAGADGTYSDSTGASEDSARW
GVDKPLYKDLISRTKAALAKNPKNRLLAVVWMQGEFDIDAKPTEHSALFL
AMVEKFRADLAEQAEQCTGGSAAGVPWICGDTTYFWKQKNEPAYQAIYGG
YKNKTDKNIHFVPLMTDENDANVPTNNPAEDPDIESIGYYGSTWRNSAAT
WTSADRASHFSSWARRGIISDRLATAVLTHAGRTTVKADVPSSETEAPVP
SPSETEAVTTTLLSYRVSESEGNLKAQGWEPAGGKAEIISDAGGTGGKAM
KLTKETGKSSWYLDHDAGTGAELLKNGGLISCRFKVPGDLVANQYVMGLY
WPVSSLPQGVTLTGDAGNNLLASFYIQTDAKDLNVMYHNAKVATNNQKLG
TFGAFDNEWHTLAFRFAGNNSLQVIPVIDGQDGAAFTLTQSPVGTFPVDK
LRVTDITKNATYPVLIDSIVVEVKKAVTE
>ECs1205 Shiga toxin 2 subunit A
MKCILFKWVLCLLLGFSSVSYSREFTIDFSTQQSYVSSLNSIRTEISTPL
EHISQGTTSVSVINHTPPGSYFAVDIRGLDVYQARFDHLRLIIEQNNLYV
AGFVNTATNTFYRFSDFTHISVPGVTTVSMTTDSSYTTLQRVAALERSGM
QISRHSLVSSYLALMEFSGNTMTRDASRAVLRFVTVTAEALRFRQIQREF
RQALSETAPVYTMTPGDVDLTLNWGRISNVLPEYRGEDGVRVGRISFNNI
SAILGTVAVILNCHHQGARSVRAVNEESQPECQITGDRPVIKINNTLWES
NTAAAFLNRKSQFLYTTGK
>ECs1982 hypothetical protein
MRLALRLGRTLSELRHSLSASEAMMWMEFDRVSPLGDERGDIRNAQIVKA
VFGAQGMNVALKDAMLCWGEDEDKPEVDPFAALEDALSLAAMS
>ECs4955 hypothetical protein
MFFKTSNPSALAAWQKYQQDCQKVKDEAKRLEAVLNVACRSVFVSGISGF
CFKGLRFMDDKYPFHRDLWRKPTASNGWSCTPRTSRIPKALRVASDELNS
LWREYSPVTYARTDALLFWLGIDFSAILHGPVKWFCVDDVIYLQCEDDSA
KRKMTEILSDEFYAAEKRVGG
>ECs4539 hypothetical protein
MKTLPDTHVREASRCPSPITIWQTLLGRLLDQHYGLTLNDTPFADERVIE
QHIEAGISLCDAVNFLVEKYVLVRTDQPGFSACTRSQLINSIDILRARRA
TGLMTRDNYRTVNNITLGKHPEAK
>ECs1071 hypothetical protein
MKIKHEHIRMAMNAWAHPDGEKVPAAKITKAYFELGMTFPELYDDSHPEA
MARNTQKIFRWVEKDTPDAVKKIQALLPAIEKAMPPPLVARMRSHSSAYF
RELVETKERLVKDIDDFVASAIVLFDQMNRGGPAGNTLAVH
>ECs1086 hypothetical protein
MLKQQDMTETARVVFDELSVTEPATVGEIAQNTYLSRERCQLILTQLVMR
VWQTISSVVTDAFSPEGFFICGKWAAGGC
>ECs4957 hypothetical protein
MRKITTLSELQEMNMSIELRSSYEYRKILIAGGMKPEDAEKIVSFMDKEC
DKRDMPEIIMDDMILDSAVALSPLWIVHSLAEIAKGTDKQAAVAALQTLN
EMRISPRPTLIHMILSSMEDKANE
>ECs2272 hypothetical protein
MTTFTKEQLISHVSENVKAMKFAVKQTAFKNSLEAIELDLALALVAQASL
EAEPVLYMNRFTGKTFSLEEQPGADKEPEIYVPLYAAPPDSAAMLQAGNF
REKKGSSTNNFREISETSTNYPVTLDDWISCSERMPDDGQHVIILCDGAF
VLYAQYRDGEFFDVVRNGDEFFETQSRNVTDWMPLPEPPQEVRQ
>ECs1931 hypothetical protein
MYKITATIEKEGGTPTNWTRYSKSKLTKSECEKMLSGKKEAGVSREQKVK
LINFNCEKLQSS
>ECs2754 prophage maintenance protein
MLHKRRLASYAPKGKEKQVMKQQKAMLIALIVICLTVIVTALVTRKDLCE
VRIRTGQTEVAVFTAYEPEE
>ECs3737 hypothetical protein
MVIYAFNKRLMEYFMKGKSALTLLLAGIFSCGTCQATGAEVTSESVFNIL
NSTGAATDKSYLSLNPDKYPNYRLLIHSAKLQNEIKSHYTKDEIQGLLTL
TENTRKLTLTEKPWGTFILASTFEDDKTAAETHYDAVWLRDSLWGYMALV
SDQGNSVAAKKVLLTLWDYMSTPDQIKRMQDIISNPKRLDGIPGQMNAVH
I
>ECs2040 hypothetical protein
MFTKALSVVLLTCALFSGQLMAGHKGHEFVWVKNVDHQLRHEADSDELRA
VAEESAEGLREHFYWQKSRKPEAGQR
>ECs2003 hypothetical protein
MLGKYKAVLALLLLIILVPLTLLMTLGLWVPTLAGIWLPLGTRIALDESP
RITRKGLIIPDLRYLVGDCQLAHITNASLSHPSRWLLNVGTVELDSACLA
KLPQTEQSPAAPKTLAQWQSMLPNTWINIDKLIFSPWQEWQGKLSLALTS
DIQQLRYQGEKVKFQGQLKGQQLTVSELDVVAFENQPSVKLVGEFTMPLM
PDGLPVSGHATATLNLPQEPSLVDAELDWQENSGQLIVLARDNGDPLLDL
PWQITRQQLTVSDGRWSWPYAGFPLSGRLGVKVDNWQAGLENALVSGRLS
VLTQGQAGKGNAVLNFGPGKLSMDNSQLPLQLTGEAKQADLILYARLPAQ
LSGSLSDPTLTFEPGALLRSKGRVIDSLDIDEIRWPLAGVKVTQRGVDGR
LQAILQAHENELGDFVLHMDGLANDFLPDAGRWQWRYWGKGSFTPMNATW
DVAGKGEWHDSTITLTDLSTGFDQLQYGTMTVEKPRLILDKPVVWVRDAQ
HPSFSGALSLDAGQTLFTGGSVLPPSTLKFSVDGRDPTYFLFKGDLHAGE
IGPVRVNGRWDGIRLRGNAWWPKQSLTVFQPLVPPDWKMNLRDGELYAQV
AFSAAPEQGFRAGGHGVLKGGSAWMPDNQVNGVDFVLPFRFADGAWHLGT
RGPVTLRIAEVINLVTAKNITADLQGRYPWTEEEPLLLTDVSVDVLGGNV
LMKQLRMPQHDPALLRLNNLSSSELVSAVNPKQFAMSGAFSGALPLWLNN
EKWIVKDGWLANAGPMTLRLDKDTADAVAKDNMTAGTAINWLRYMEISRS
STRINLDNLGVLTMQANISGTSRVDGKSGTVNLNYHHEENIFTLWRSLRF
GDNLQVWLEQNAQLPETGCPSGKECEEKQ
>ECs4433 putative lipase
MIIKKSGGRWQLSLLASVVISAFFLNTAYAWQQEYIVDTQPGHSTERYTW
DSDHQPDYNDILSQRIQSSQRALGLEVNLAEETPVDVTSSMSMGWNFPLY
EQVTTGPVAALHYDGTTTSMYNEFGDSTTTLTDPLWHASVSTLGWRVDSR
LGDLRPWAQISYNQQFGENIWKAQSGLSRMTATNQNGNWLDVTVGADMLL
NQNIAAYAALSQAENTTNNSDYLYTMGVSARF
>ECs1948 hypothetical protein
MAKVFTPEEREKIKGQVVELVRLSGRETLRALEAKTGASRYYISTLAREL
VASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKVKKPVVVDPDLIWSLP
DGEIRRYDRRLNIICSECRKSEAMQRVLAFYQGNFQKVLL
>ECs1197 NinE
MRRQRRSITDIICENCKYLPTKRSRNKRKPIPKESDVKTFNYTAHLWDIR
WLRHRARKTR
>ECs0760 hypothetical protein
MARSFKVLSPTAILGYGFPEESFRKAMAESPDLIAVDAGSSDPGPHYLGA
GKPFTDRAGVKRDLRYMITAGVQNNIPVVIGTAGGSGAAPHLEWCREIIH
EIAREEHLSFSMALIPADVDKAIVHQALDNGKITALDFVPPLTHDAIDES
TYIVAQMGIEPFQRALKERAQVVLGGRAYDPACFAALPIMQGFDEGLALH
CGKILECAAIAATPGSGSDCAMGIIDDNGFTLKTFNPKRKFTETSAAAHT
LYEKSDPYFLPGPGGVLNLKGCTFKAVNDGEVYVSGSKHEETPYALKLEG
ARRVGFRCLTIAGTRDPIMIAGIDKIIDEVKTSVSRNLSLDDDSIRINFH
LYGKNGVMGDHEPMQTAGHELGIVLDVVAPTQEIANSVCSLVRSTMLHYG
YENRIATAGNLAFPFSPSDIQGGPVYEFSIYHLIEANDALRFDFHIEQVT
PEGVQA
>ECs2289 hypothetical protein
MKITLSKRIDLLAFLLPCALALSTTVHAETNKLVIESGDSAQSRQHAAME
KEQWNDTGNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTL
RCLDRRTGRVITP
>ECs2748 hypothetical protein
MTVKLRLTVAALLLFLVVMVDFTSRIMSVLADGVLVCGIMVLLWPVIKRN
SLHNA
>ECs5296 hypothetical protein
MKEFLFLFHSTVGVIQTRKALQAAGMTFRVSDIPRDLRGGCGLCIWLTCP
PGEEIQWVIPGQTEAIYCQQGSDWQCVVHYDAEASTRQ
>ECs2436 hypothetical protein
MRREMFSTNLIQSNYGDLNIKSLAFDSFKERLQSTMTALTFFISTGQCDC
DEIAESNFNYMIAYMSNINYDASKPGAPALSFDTYLQDNVKYRVIINNLY
GSEIRIRGINKDFIGMDVTSVFRPEKMTNLISIKNRLVIHYLRTIYYEQY
YIHPVGSIFAAIQKNESLLKFPSIISMLNINLLFNPLNLPGMGSGILEDI
MSIPDSSLRKRLGYEVLSFSLQAHSLSQECIDKLDIFSLTICLNMSQYAS
PRWNI
>ECs2188 putative holin protein
MEKITTGVSYTTSAVGTGYWLLQLLDKVSPSQWVAIGVLGSLLFGLLTYL
TNLYFKIKEDRRKAARGE
>ECs2242 putative tail assembly chaperone
MKKDLKTLALARLSGFRHKTVKVPEWGNVSVVLREPSAEAWYLWQEVLNG
DGEDDDTLSVVAKTRRNLEADVTLFCDVLCDTDLQRVFTPDDREQVLAVY
GPVHARLLRQALELIADAESARKK
>ECs2769 hypothetical protein
MTSAFALVMTVFLITGESQNVITGIYASKESCLQARDEQKISGECLPVKK
VSLYLNNETPAG
>ECs1171 hypothetical protein
MTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLM
GFTPTHLSLAIGMLNGVFKER
>ECs1196 putative DNA methylase
MTIKSNTPAHDKDCWQTPLWLFDALDIEFGFWLDSAASDKNALCAHWLTE
ADDALNSEWVSHGAIWNNPPYSNIRPWVEKAAEQCIQQRQTVVMLVPEDM
SVGWFSKALESVDEVRIITDGRINFIEPSTGLEKKGNSKGSMLLIWRPFI
SPRRMFTTVSKAALMAIGQGVRRAA
>ECs2797 hypothetical protein
MTSPFIQQIADNRVCQVLTCLPEKFVVDFANGIDVAQEHIRTAGERTFFR
RLKEGLTGEGAARQNAINASLAQGVEASLRWLTEMTTSLATTNYAITRVN
DRVSSLVSDTARLAHYSADTREQLLTLADQVHHKLNHLEEKLHRVDQVQR
AQLHLEQIFSWWSAGRYASFSPAGRCYVALEELRWGAFGDVIRQSETGQV
NQLLDILRHKALTQMAQESGGSATVRLNTLDWLGGQGREQADNEWHDAIN
WLGDWCSEEQHPVIWSTTQAAEHLPVRMPRLCSAERLSESMVDEIFQKGA
A
>ECs4577 hypothetical protein
MESKNKNGDYVIPDSVKNYDGEPLYILVSLWCKLQEKWISRNDIAEAFGI
NLRRASFIITYISRRKEKISFRVRYVSYGNLHYKRLEIFIYDVNLEAVPI
ESPGTTGPKRKTYRVGNGIVGQSNIWNEMIMRRKKES
>ECs5250 hypothetical protein
MNDGVMLDSDVIAGLEWLASTQENKQHFYQRLAKAQQFYIATTQKAANFG
KQFDPEWYGSDVVAGYFAQAKSLIDNRRSYEVTNASNIIPWVKQLGVCAK
SLDRITGARDRAIRMLKNTTVLPDTALLELVLAGNYASEGYEVEFIPEQK
GIAKTPEFRCRLGDGDYFFVECKRLQKGPYVKQEILQHHIRSHLAELHIK
SKKMNVWTDVTYLCEVKDAPENYIISHLKNFKGVFYEWNDDYGKGTIRPA
NLNLIREDITENGSLLVNTKLARLIKGSPLEDENYQVVAMGRPDERDSRF
ITKVKLASLITWRCINEKSFDARSRHVTKTLAEIDNQLLDYGLGVGHIAL
DVDIQKDVADKRREKNYAAIVNFEQKSKIVRLNVHYLVPRIDENSAWMVD
ETADDFFTHDLVKYVIPLTKIFPEAEVFENDQPGWHQN
>ECs3817 hypothetical protein
MVADPTTTLQVKNTGSLSVNRYGWINIWMAILGQFFTRFPLFFESCLILL
KTWLEIFPDNAGILRIYLLQFSAIVGYKTRRAA
>ECs0297 putative polarity suppression protein
MMARVTPDQALISFRNARILWAGHSESRKAVEQQIESLLTAMEKPADYAR
QLELQREHLDVLKWQINCAARECIYSQHQLMEACTEDALSNFMQANGAAL
TSALAPFLKGRGGVDVASRILRNALVRQLAITPPEIAGDYRMILDESGVM
PDPMMVRDCQSSYTPAQQLRFQQRLDYINGMQE
>ECs4855 hypothetical protein
MQNIILLFIYLLLIAVNHFRYKFLLSGNAGEMILYFANVFFNPLALSFII
ATIICITIKKRSSRSFIRGTCWLMGLFLIYSSYTVWERHSIWDYTFPEPG
ITVTVPSKQWVANIVKQGPTLVTRDAHAILAIVAFSQNELGVHSIEELTA
TQNGTAEMHVCDVQGFACAYQEGVQTMSDGKVRHAIFMTLLDNTNLIQLV
AAIDPDYLDEYQEEVMKMMLSARQ
>ECs1581 hypothetical protein
MKLKYPGLTDSGKTRTKFMRGDIYRDQYGGTVMIKGVEERRVTYHREGYE
YDCVMPVYQFRRDFSLVQAAPRSKPTSREKARANIQEIKKMLNVFRGKK
>ECs3011 hypothetical protein
MFALIQRGQIYTDRAGYPVVITRSTQHSVFFRRMDGRSGRVRIGEFNNLF
EHIDQQEYRKILAGTEQEMRLKKLRAMQRR
>ECs5011 hypothetical protein
MIKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQP
RLANSWWPGAVISEELATAAALRQQQALLTRLAEQGADSSADDAAAINAL
RQQIQALKVTGRQKINLDPDIVRVAERANPPLQGNYTLWVGPPPSTVTLF
GLISRPGNQPFTPGRDVASYLSGQSLLGGADRSYAWVVYPDGRTQKAPVA
YWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLTQRIPQ
>ECs5193 hypothetical protein
MLPRIRHNNFIGAVELFVKSSYTKTHSNNFFNNIHHAFKKKDWISNYDSL
LTLREFFRCATQIDKSSYQVLSSKNETVNAMDKFLISFSLKDNGAEYTMT
LRGSGFEYEEIPITINEYNSFMDFKNREFPLEQNRRLYAWDILQKKQSDI
PKRIKGYIHQAIGDVSLGYALLDDIVSKLKRGKFELQGPGGGIKQCDGWY
IYEKIIDDNFAIVIESLGFALKIYGGDERFRNGSSVVLEDEDYSLIYNFL
VNAGCQQVELAEQVDAIVSANLAADSDITKEKICEKYKSTIEAFKKEQLA
LPVLVRRKNSET
>ECs5301 hypothetical protein
MTNFTTSTPHDALFKSFLMHPDTARDFMEIHLPKDLRELCDLDSLKLESA
SFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSM
AVMQRHIEHDKRRPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKL
YNAAFPLVDITVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLL
VTECANDSQITALLNYILLTGDEERFNEFISELTSRMPQHRERIMTIAER
IHNDDCRANS
>ECs1596 hypothetical protein
MNEREQNRLIRGLIRQRDAWKTQETGHKDKASGRAERITAKRLTDRDREV
MECFRNR
>ECs1184 hypothetical protein
MAKSNVSVQAFKDFLEELMSLNIMKEATARNLKNSSARLLTVVQEEEMGD
VTQLDVNELAERYINATEPKPSDSSITAYKSRMESAIKKFVAFQSGEEIP
YTPIDKESSEEKDLTGEPTKVEGKANALHTYDLPVVLRPESGVTVTIKGI
PNDITNEEAERISSILKVYVRPQ
>ECs2292 hypothetical protein
MMKLSTCYAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTL
SIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSTQDGAPAE
PQ
>ECs2284 inhibitor of cell division
METLLPNVNTSEGCFEIGVTISNPVFTEDAINKRKHERELLNKICILSML
ARLRPIQKGCWQ
>ECs1812 hypothetical protein
MNIQPTIQSGITSQNNQHHQTEQIPSTQIPQSELPLGCQAGFVVNIPDDI
QQHAPECGETTALLSLIKDKGLLSGLDEYIAPHLEEGSIGKKTLDMFGLF
NVTQMALEIPSSVSGISGKYGVQLNIVKPDIHPTSGNYFLQIFPLHDEIG
FNFKDLPGPLKNALSNSNISTTAVSTIASTGTSATTSTVTTEPKDPIPWF
GLTAQVVRNHGVELPIVKTENGWKLVGETPLTPDGPKANYTEEWVIRPGE
ADFKYGASPLQATLGLEFGAHFKWDLDNPNTKYAVLTNAAANALGALGGF
AVSRFASTDPMLSPHIGAMVGQAAGHAIQYNTPGLKPDTILWWAGATLGA
ADLNKAEFEVARFTDYPRIWWHAREGAIFPNKADIEHATGADIRAMEEGI
PVGQRHPNPEDVVIDIESNGLPHHNPSNHVDIFDIIQETRV
>ECs1228 putative tail fiber protein
MSVVVSGTLKSPDGEAISGANITLTALTVSPDALSGTSASAVTREGGYYG
MTMDPGEYAVSVTVKGKTAVYGRVRIEGTESTVTLNMLLRRSLVEVSIPG
ELLTDFRQIQNNVADDLATIRRLNEDTATKNTQATQSKESAAASAKSASD
SAKTATSRAAEAGQKATDATEAATRAVTAAGNAEESSTRAGESEKAAGAD
AEKARQHAEKARLAQESAGEILKRAEAATVSAEEARRMAENARGPRGPQG
ETGPKGDVGPKGETGPVGPQGPAGPKGERGDVGAQGAVGPAGPRGEKGEQ
GERGPQGIPGLKGDTGERGPKGDQGDMGPKGEKGDPGGPAGPQGPKGERG
EAGPQGPMGARGERGETGPRGEPGPAGPRGERGETGPQGPRGEPGPAGSA
ANVADATTAQKGIVQLSSATDSDDETKAATPKAVKAAMDVANEAKTKAEE
AAAGGGVPGPKGDKGDTGPAGPAGPKGDKGERGDTGPVGATGERGPAGDA
GPAGPQGPKGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGA
AGPVGATGPQGPKGDPGETQIRFRLGPGNIIETNSHGWFPDTDGALITGL
TFLDPKDATRVQGFFQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
>ECs0832 putative minor tail protein
MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAGSVATRQVAGNTVAGD
NQVKGIPLKLVRQRVRVFKASPSGKMTARIRVNRGNLPAIKLGTARVRLA
QRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMRRIDGKNRYPIDVV
KIPLSGPLTQAFEDARDRIIAAEMPKQLGYALKQQLRLWLTR
>ECs4610 hypothetical protein
MASPINSGIMMANLCPSTFLDKNRNVAAELDIKNNEKKYSPGSNFAKWML
QEIKRLILNIMSGSRSINTDILDYFHPMPGTENNGNRTWVAATGEDEYIE
IKQTGDKSFNITLVGRDKPSRKEIPYSGVAVATIIKSLSEKTSALETHSA
DTVLRKKLVNSIVMKNTDFNYEIPAGILSNIYDLLKLRIKKDEGYVPVQE
SFKRTDVFFDSMIMDAH
>ECs1515 hypothetical protein
MTITKQRVEEIISRIEMYGHGAGYIADEVNDLAILALNLSNIANLKRYEL
DMGGCDSCGQDCGADMTEDPDGDYVLFDDVVKLFEFDTTTQKLEIPAKEA
ASGQD
>ECs2159 putative tail fiber protein
MAVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYS
MDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRP
EALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAANADT
SAGDASESARQAAESAAAAKQSEEASSSSASAAAQKASESSQSAADAELS
KKTAESAAGNAARMQRPQQKKPGSQQKAHSQREQSRIAAEEAVNRIPTVV
GLPAKGGTGSRGSSGAEGR
>ECs1366 hypothetical protein
MSYQPGGLVLPDLSRRGYLSKNASSSNRFGLVNGDGLSKDSRCWCTFSRW
RSSMDNAIACSSSLVIVCTWEAIYRSTSHYGKNVLLLQLA
>ECs2174 putative major capsid protein
MSMYTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNM
ALYVSPIVSGEVIRSRGGSTSEFTPGYVKPKHEVNPQMTLRRLPDEDPQN
LADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPV
EVDMGRSAANNITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVF
DPKGWALFRSFKAVREKLDTRRGSHSELETAVKDLGKAVSYKGMYGDVAI
VVYSGQYVENGVKKNFLPDNTMVLGNTQARGLRTYGCIQDADAQREGINA
SARYPKNWVTTGDPAREFTMIQSAPLMLLADPDEFVSVQLA
>ECs5303 hypothetical protein
MEFHENKAKTPFIRLVQLWQAVRRWRRQMQARRVLQQMSDERLRDIGLRR
EDVE
>ECs4642 hypothetical protein
MIYMGLWLKIFKIVVGENENYLKDVMMQLEAKNNEGKYVISKANGNPVFK
ELFWKAIDEFNFPQEELNRLKQYRSL
>ECs1191 hypothetical protein
MNKKQLAILEKAWDAQISYALKEQALPIIQTKSKIARQLCDGGFLNEIEI
TRQMVTFKGYEINHHGIAAYCSHLPDDVDIDEMEREMKQ
>ECs1625 Bor precursor
MKKMLLATALALLITGCAQQTFTVQNKPAAVTPKETITHHFFVSGIGQKK
TVDAAKICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ
>ECs2761 hypothetical protein
METVFDALKALKKASSHEIAARLEISRDDAVTELWKLKRRGEADNKGSMW
WLTSEATEAAPKTTAEMLINAIEQHGPQSADELALMFGITSRRANSSLAM
AVSKGRLIRVNQDGKYRYCIPGDNLPAEPKAASVTETDGKAFPQPAGVAL
PVREAETQEEIKTESVAVTVQSQSSFIRKHPDGLILPSLHVANRELRRAK
GQVQKWERLCAALREINKHRDVIQKITTGDNG
>ECs2908 hypothetical protein
MTETQIYENIKQAISSAPRNSQTMEMHLQMIKYADHLKNVTAKEFCEGVG
LKASFATEFSKMRNLTERLKAAGLDTTKL
>ECs4321 hypothetical protein
MNEPLWPFIERKKSMRNLVKYVGIGLLVMGLAACDDKDTNATAQGSVAES
NATGNPVNLLDGKLSFSLPADMTDQSGKLGTQANNMHVWSDATGQKAVIV
IMGDDPKEDLAVLAKRLEDQQRSRDPQLQVVTNKAIELKGHKMQQLDSII
SAKGQTAYSSVILGNVGNQLLTMQITLPADDQQKAQTTAENIINTLVIQ
>ECs4552 EscF
MNLSEITQQMGEVGKTLSDSVPELLNSTDLVNDPEKMLELQFAVQQYSAY
VNVESGMLKTIKDLVSTISNRSF
>ECs1226 hypothetical protein
MAELSDFLPYVRRHISGPLNIMMTDALSMAAVAFSRQSLVCRREVTVVPV
AGKEIVLPYDKDDEECVHIIRISDDNHELFVGRDVDISSGRSLRFACSPG
EVSVLYAVAPKAGRSQIPDELLTWPEEVAAGALERLFMQTGVSWSDPLRA
QYFSVQFSEGIRRAYRHTLATSPYSSYRNPVRRQRFF
>ECs1691 hypothetical protein
MSHYHEQFLKQNPLAVLGVLRDLHKAAIPLRLSWIGGQLISKILVITPDK
LVLDFGSQAEDNIAVLKAQHITITAETQGAKVEFTVEQLQQSEYLQLPAF
ITVPPPTLWFVQRRRYFRISAPLHPPYFCQTKLADNSTLRFRLYDLSLGG
MGALLETAKPAGLHEGMRFAQIEVNMGQWGVFHFDAQLISISERKVID
>ECs2215 hypothetical protein
MTSAFALMMTVFLITGESQNVITGIYASKESCLQARDEQKISGECLPVKK
VSLYLNNETPAG
>ECs1436 hypothetical protein
MRPFLQEYLMRRLLHYLINNIREHLMLYLFLWGLLAIMDLIYVFYF
>ECs2316 DNA replication terminus site-binding protein
MARYDLVDRLNTTFRQMEQELAAFAAHLEQHKLLVARVFSLPEVKKEDEH
NPLNRIEVKQHLGNDAQSQALRHFRHLFIQQQSENRSSKAAVRLPGVLCY
QVDNLSQAALVSHIQHINKLKTTFEHIVTVESELPTAARFEWVHRHLPGL
ITLNAYRTLTVLHDPATLRFGWANKHIIKNLHRDEVLAQLEKSLKSPRSV
APWTREEWQRKLEREYQDIAALPQNAKLKIKRPVKVQPIARVWYKGDQKQ
VQHACPTPLIALINRDNGAGVPDVGELLNYDADNVQHRYKPQAQPLRLII
PRLHLYVAD
>ECs1988 hypothetical protein
MKAGAERSWLSGGRHKSEKIPQCSRTGTAGALRRLSVRNYSYVTTKTLTQ
RGRMCRSFQGERIYRPEE
>ECs1126 EspF-like protein
MAEHIPPAPNWPAPPPPVQNEQSRPLPDVAQRLMQHLAEHGIQPARNMAE
HIPPAPNWPAPTPPVQNEQSRPLPDVAQRLMQHLAEHGIQPARNMAEHIP
PAPNWPAPTPPVQNEQSRPLPDVAQRLMQHLAEHGIQPARNMAEHIPPAP
NWPAPTPPVQNEQSRPLPDVAQRLMQHLAEHGINTSKRS
>ECs1179 Ea10
MSNIKKYIIDYDWKASIEIEIDHDVMTEEKLHQINNFWSDSEYRLNKHGS
LLNAVLIMLAQHALLIAISSDLNAYGVVCEFDWNDGNGQEGWPPMDGSEG
IRITDIDTSGIFDSDDMTIKAA
>ECs3735 hypothetical protein
MINKLLLAYLIGLVVTSTLIFIFSEEKVTYRLFAAIITGLTWPLSLIPSI
ISLMIRKSD
>ECs5328 hypothetical protein
MILVIIINERDKTMLQRTLGSGWGVLLPGFLIAGLMYADLSPDQWRIVIL
MGLVLTPVMLYHKQLRHYILLPSCLALIAGIMLMIMNLNQG
>ECs2280 hypothetical protein
MPPRLSYKTGGNMNRALSPMVSEFETIEQENSYNEWLRAKVATSLADPRP
AIPHDEVERRMAERFAKMRKERSKQ
>ECs4335 hypothetical protein
MKFLPLLALLISPFVSALTLDDLQQRFTEQPVIRAHFDQTRTIKDLPQPL
RSQGQMLIARDQGLLWDQTSPFPMQLLLDDKRMVQVINGQPPQIITAENN
PQMFQFNHLLRALFQADRKVLEQNFRVEFADKGEGRWTLRLTPTTTPLDK
IFNTIDLAGKTYLESIQLNDKQGDRTDIALTQHQLTPAQLTDDERQRFAA
Q
>ECs2059 hypothetical protein
MLCFLIYITLPFIQLVYFISSEKKLTIHIVQMFHLLSQVFYNLKMFLMMD
MLGVGDAININTNKNIRQVC
>ECs0812 NinG
MMAKPARRRCKNDECREWFHPAFANQWWCSPECGTKIALERRSKEREKAE
KAEKAAEKKRRREEQKQKDKLKIQKLALKPRSYWIKQAQQAVHAFIRERD
RDLPCISCGTLTSAQWDAGHYRTTAAAPQLRFDERNIHKQCVVCNQHKSG
NLVPYRVELISRIGQEAVEEIESNHNRYRWTVEECRAIKAEYQQKLKKLR
NSRSEVA
>ECs3924 hypothetical protein
MTTTGLRPRLNVRQRKDTGYLPHSSPFSLQFRPAILYSDGYLPQVPEDKN
ETDKIHTPRIVPQKLERTPSDTSRSRGCHCFYAGWL
>ECs4561 Tir
MPIGNLGHNPNVNNSIPPAPPLPSQTDGAGGRGQLINSTGPLGSRALFTP
VRNSMADSGDNRASDVPGLPVNPMRLAASEITLNDGFEVLHDHGPLDTLN
RQIGSSVFRVETQEDGKHIAVGQRNGVETSVVLSDQEYARLQSIDPEGKD
KFVFTGGRGGAGHAMVTVASDITEARQRILELLEPKGTGESKGAGESKGV
GELRESNSGAENTTETQTSTSTSSLRSDPKLWLALGTVATGLIGLAATGI
VQALALTPEPDSPTTTDPDAAASATETATRDQLTKEAFQNPDNQKVNIDE
LGNAIPSGVLKDDVVANIEEQAKAAGEEAKQQAIENNAQAQKKYDEQQAK
RQEELKVSSGAGYGLSGALILGGGIGVAVTAALHRKNQPVEQTTTTTTTT
TTTSARTVENKPANNTPAQGNVDTPGSEDTMESRRSSMASTSSTFFDTSS
IGTVQNPYADVKTSLHDSQVPTSNSNTSVQNMGNTDSVVYSTIQHPPRDT
TDNGARLLGNPSAGIQSTYARLALSGGLRHDMGGLTGGSNSAVNTSNNPP
APGSHRFV
>ECs1406 hypothetical protein
MKTLSDTHVREVSRCPSPVTIWQTLLIRLLDQHYGLTLNDTPFADERVIE
QHIEAGISLCDAVNFLVEKYALVRTDQPGFSTCPRSQLINSIDILRARRA
TGLMTRDNYRTVNNITLGKYPEAK
>ECs4415 hypothetical protein
MNNNEPDTLPDPAIGYIFQNDILALKQAFSLPGIDYADISQREQLAAALK
RWPLLAEFAQQK
>ECs5421 hypothetical protein
MMFESYMAERLRHRWMRLRLYRFPGSVLTDYRILKNYAKTLKGAAA
>ECs1586 hypothetical protein
MTRTRRDRTEPKYKALDITELALKVAIRTIDRHVGEGYAKEHPDLISAFM
TTAAANFATLTEREIAEAEQVTTINVKTGEVES
>ECs4086 hypothetical protein
MIRLSEQSPLGTGRHRKCYAHPEDAQRCIKIVYHRGDGGDKEIRRELKYY
AHLGRRLKDWSGIPRYHGTVETDCGTGYVYDVIADFDGKPSITLTEFAEQ
CRYEEDVAQLRQLLKQLKRYLQDNRIVTMSLKPQNILCHRISESEVIPVV
CDNIGESTLIPLATWSKWCCLRKQERLWKRFIAQPALAIALQKDLQPRES
KTLALASREA
>ECs4715 rho operon leader peptide
MRSEQISGSSLNPSCRFSSAYSPVTRQRKDMSR
>ECs2291 hypothetical protein
MIENHLYSLVTDVKYKLLPCLLAILLTGCDRTEVTLSFTPEMASFSNEFD
FDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVV
ALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASS
KQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLN
NQRVGNVKQSCEYDSHANPVGCQLIIVDEGVKPAVERVYTIKNTIDYY
>ECs2208 putative regulatory protein
MKKSEVLGYFGGVVKTAAALGTSKTTVSMWGEDVPWKWALLIQAVTAGAL
KYELHIPTVVIPDSDHNPPSNQGGIHENQA
>ECs2277 hypothetical protein
MKIKHEHIRMAMNVWAHPDGEKVPAAKITKAYFELGMTFPELYDDSHPEA
LARNTQKIFRWLDKDTPDAVEKMQALLPAIEKAMPPLLVARMRSHSSEYY
REIVERRDRLVKDVDDFVASAVVLYDQMNRGGPAGNAVVMH
>ECs3706 hypothetical protein
MDIEFSQIHEMVYMHDIVNSDSKKKPRIPLKKFLNAEKVLTQTTSWALNS
RFVNVNSVNKVNVKSKVKNSYISRSVNDEFSLTDDEINSFKETLVLSSID
SLSKLVLNNPLSVLFTSTVRRNNNRAKMNVEFDSWICTRCC
>ECs1627 hypothetical protein
MNKEQSADDPSVDLIRVKNMLNSTISMSYPDVVIACIEHKVSLEAFRAIE
AALVKHDNNMKDYSLVVD
>ECs5432 hypothetical protein
MSEFDAQRVAERIDIVLDILVADDYHSAIHNLEILKAELLRQVAESTPDI
PKAPWEI
>ECs3496 hypothetical protein
MKKKYELVVKGINNYPDKITVTVALEIGGYPSLLLPDVAISLDRTEGATL
EFYEAEAKKQAKQFFMDVAAGLCEGDGPLPEKRPVILEAQDVLITYRGKL
PGIITGSLKTPPLA
>ECs5413 hypothetical protein
MPQKTIIVGMLCLTMLLTVWVLHASPCEFRVSFMWSEIAAFLQCKP
>ECs1081 hypothetical protein
MAHDTKLYNSDDSAVFASRRGRCFHAFKSDWYQHPPCTEEQAEWLIQCYR
RRGCEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR
>ECs2552 DNA polymerase III theta subunit
MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRS
WFRERLIAHRLASVNLSRLPYEPKLK
>ECs2890 hypothetical protein
MWRVRIFFGKRQTCAFWLCLAGTCASTMPLRERHRAMKGDSIDVVNGRRL
SGIGLMHKK
>ECs0571 hypothetical protein
MNRPAILKKKAAKDVASVLKIIFLFYFFLIARLKQRYSIREIKRDLWNIR
ENYSSNAAIAKIYCRKRKASGPGKHLTILPYGWVRFITFPIM
>ECs0848 hypothetical protein
MLSPYSVNLGCSWNSLTRNLTSPDNRVLSSVRDAAVHSDNGAQVKVGNRT
YRVVATDNKFCVTRESHSGCFTNLLHRLGWPKGEISRKIEVMLNASPVSA
AMERGIVHSNRPDLPPVDYAPPELPSVDYNRLSVPGNVIGKGGNAVVYED
AEDATKVLKMFTTSQSNEEVTSEVRCFNQYYGAGSAEKIYGNNGDIIGIR
MDKINGESLLNISSLPAQAEHAIYDMFDRLEQKGILFVDTTETNVLYDRA
KNEFNPIDISSYNVSDRSWSESQIMQSYHGGKQDLISVVLSKI
>ECs4417 hypothetical protein
MTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGT
LRRTEKARATKK
>ECs4609 hypothetical protein
MVELVCFNDDDCAQVRELLKQSNGNITDDKIDEITACLSNPQGITCFYYS
PLQGYQSFLKNTSQDVSELLSKSLNGTISSTSGLKKSGNAFFGTGKYLKE
IDFTINGFSQKMYLMAVNNVGNEVKLQCSCYFTTGSVDEEKSKSDKIDFT
FSGAATTADL
>ECs1991 putative outer membrane protein
MRKLYAAILSAAICLTVSGAPAWASEQQATLSAGYLHVSTNAPGSDNLNG
INVKYRYEFTDTLGLVTSFSYAGDRNRQITRYSDTRWHEDSVRNRWFSVM
AGPSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDD
GRHSNTSLAWGAGVQFNPTESVAIDIAYEGSGSGDWRTDGFIVGVGYKF
>ECs1637 minor capsid protein
MADFDNLFDAAIARADETIRGYMGTSATMTSGERSGAVIRGVFDDPENIS
YAGQGVRVEGSSPSLFVRTDDVRQLRRGDTLTIGEENFWIDRISTDDGGS
CHLWLGRGVPPAVNRRR
>ECs1154 hypothetical protein
MANVTVTFTITEFCLHTGISEEELNEIVGLGVVEPREIQETTWVFDDHAA
IVVQRAVRLRHELALDWPGIAVALTLMDDIAHLKQENRLLRQRLSRFVAH
P
>ECs2983 hypothetical protein
MKSWCVGIGIVHRPVCRLMSCMSIHLPHDAIRVTVFFDGCSNSTMSFSAT
ELSGVWILSEGVSMSDLSLTQPKLKECPFCGGNARLWVEAGINIDVWGYA
ECDLCEARGAWAPSVAAAAEKWNRRAGDEANLSASQRSNQK
>ECs1626 hypothetical protein
MQTTRPRITWKVLPMAQVAIFKEIFDQVRKDLNCELFYSELKRHNVSHYI
YYLATDNIHIVLENDNTVLIKGLKKVVNVKFSRNTHLIETSYDRLKSREI
TFQQYRENLAKAGVFRWVTNIHEHKRYYYAFDNSLLFTESIQNTTQIFPR
>ECs2199 hypothetical protein
MSTITRERAEIKSYITGFLSDSAHDNKSSDSLLANVFRIALASLEAEPIA
MVVPDEMDLLTCHLDGVTKTYADGWNACRVAMLQAGNFRENKNSSTNNFR
EISETSTRSPITLDGWISCTERMPEKSQNVLISMNIDSEAGPLIYSARYL
GGTFRRGGIAVSPGNDLRQATHWMSLPEPPQEVNQ
>ECs2771 hypothetical protein
MHCCNSGNFIDENSGEFSVQVWGNGATFDNVILRRSYERQGIPCPWRYTN
DRDVRTMVALGLVMDFDARTTIPFEGERHNALHDARYQAKYVSAIWQKLL
PSQADF
>ECs1119 hypothetical protein
MKAVTGRWWLSGGRHKSEKIPQCSRTGTAGALRRLSVRNYSYVTTKTLTQ
RGRMCRSFQGERIYRPEE
>ECs1657 hypothetical protein
MLPQHSDIEIAWYAAIQQEPNGWKTVTTQFYIQEFSEYIAPLQDAVDLEI
ATEEERSLLEAWNKYRVLLNRVDTSVARKRTA
>ECs2713 hypothetical protein
MNHGHQRIINVYASAYRTLSTINKEEQKMCKSYNGKTYQKYRLILVFKPF
LLVSMIQRLSKKLGNSTTLGNYGLCPAALI
>ECs1026 hypothetical protein
MATGIAVQILDAQSQQEIPLNQVQPLTPLKAGDNTLKYQLRYKSTKAGAT
GGNATAVLYFDLVYQ
>ECs2310 hypothetical protein
MKLKNTLLASALLSATAFSVNAATELTPEQAAAVKPFDRVVVTGRFNAIG
EAVKAVSRRADKEGAASFYVVDTSDFGNSGNWRVVADLYKADAEKAEETS
NRVINGVVELPKDQAVLIEPFDTVTVQGFYRSQPEVNDAITKAAKAKGAY
SFYIVRQIDANQGGNQRITAFIYKKDAKKRIVQSPDVIPADSEAGRAALA
AGGEAAKKVEIPGVATTASPSSEVGRFFETQSSKGGRYTVTLPDGTKVEE
LNKATAAMMVPFDSIKFSGNYGNMTEVSYQVAKRAAKKGAKYYHITRQWQ
ERGNNLTVSADLYK
>ECs2717 putative tail fiber protein
MAVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYS
MDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRP
EALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAANADT
SAGDASESARQAAESAAAAKQSEEASSSSASAAAQKASESSQSAADAELS
KKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAEEAVNRIPTVV
GPPGPKGEPGPAGPQGPKGDKGERGDTGPVGATGERGPAGDAGPAGPQGP
KGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGATG
PQGPKGDPGETQIRFRLGPASIIETNSHGWFPGTDGALITGLTFLAPKDT
TRVQGFFQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
>ECs1181 N
MSRKTEFKGTAASRRRARRANLQSQEAISSDKLHRPTPSRVVLQCKRKPA
MRAEVITLTTLTRKYEGSTCLPNVALYAAGYRKSKQLTAR
>ECs2547 hypothetical protein
MAGYLSWLFPRCKISPKLNGTAPHFGDEMFALVLFVCYLDGGCEDIVVDV
YNTEQQCLYSMSDQRIRHGGCFPIEDFIDGFWRPAQEYGDF
>ECs2718 putative outer membrane protein precursor
MRKLYAAILSAAICLAVSGAPAWASEQQATLSAGYLHARTSAPGSDNLNG
INVKYRYEFTDTLGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVM
AGPSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDD
GRHSNTSLAWGAGVQFNPTESVAIDIAYEGSGSGDWRTDGFIVGVGYKF
>ECs2535 hypothetical protein
MNEVVNSGVMNIASLVVSVVVLLIGLILWFFINRASSRTNEQIELLEALL
DQQKRQNALLRRLCEANEPEKADKKTIESQKSVEDEDIIRLVAER
>ECs1429 MsyB
MTMYATLEEAIDAAREEFLADNPGIDAEDANVQQFNAQKYVLQDGDIMWQ
VEFFADEGEEGECLPMLSGEAAQSVFDGDYDEIEIRQEWQEENTLHEWDE
GEFQLEPPLDTEEGRAAADEWDER
>ECs2018 hypothetical protein
MNVRNVKFDNLSFYEEKIRWRMRRQISNAEAIFYENQEEHIRRFLGRYTC
RKPVLVYWESEQRWTSVNGLEVMSMFGEHCHHIYLDDIKKKIQTELPENK
TLDFKRQLQFIILGDDKIRIWVPEGKLYFALMNVLQMFPMKNG
>ECs0244 hypothetical protein
MVNNYKTHCGVVDINLNFFNDILYSVRLKNISKLENMEFCATKQRVYFSD
KNKKASYKIINYGDYYDVDYYDNNLKNEVFDWIGKWS
>ECs2183 putative lipoprotein Rz1 precursor
MRELKMKLCVLMLPLVVSACASTPTVQAPCVKPPSPPAWIMQPVPDWQKP
LNGIISSSENG
>ECs5592 hypothetical protein
MSRKRQEVLYPLGGKVLCKQLRRGWWHKRMVTESGIRKWLLVVEVVIVPH
AGGVKSADGCE
>ECs3009 hypothetical protein
MTTITDKELIKEIKERIGSLDVRDNIERRAYEIALASLEAEPIAWECGEN
IILFNPDTVEAYAKRAEISPKPLFSAPPALVVPDKLPREYRNGWPLAYSD
YAEGWNDCREAMLQGDKS
>ECs0139 putative fimbrial protein
MQRKGNKLLIQLCSVILLFFTTSWYALANECYIERNAEGDYHMKISSTQL
SLASQMVEVPTEIAEATWDVNIQLRGDAIGCKSLGDSKAVHFLNTADPSL
ISTYTTTNGAALLKTTVPGIVYSVELLCLSCGAADELDLWLPAQSGADNF
IPSTQTKWAYEYSDQSWYLRFRLFITPEFKPKNGVSSGTTIAGKIASWYI
GTNDQPWINFYIDNDSLKFFVDEPTCATVALAQDQGNVSGNQVTLGNSYV
SEVKNGLTREIPFSIRAEYCYASKITVKLKAANKPSDATLVGKTTGSASG
VAVKVNSTYDNSKVLLKADGSNTVDYNFAAWSNNLLFLPFTAQLVPDGSG
NAVGVGTFSGNATFSFTYE
>ECs2572 hypothetical protein
MKKCVIKIYIAPPEKINFLCYYVYSYLFQWNDNVNINYPAEYEIGDKVFT
CIGAALFGQISAASNCWSNHVGIIIGHNGEDFLVAESRVPLSTITTLSRF
IKRSSNQRYAIKRLDAGLTEQQKQRIVEQVPSRLRKLYHTGFKYESSRQF
CSKFVFDIYKEALCIPVGEIETFGELLNSNPNAKLTFWKFWFLGSIPWER
KTVTPASLWHHPGLVLIHAEGVETPQPELTEAV
>ECs1207 hypothetical protein
MAFKHYDVVRAASPSDLAKRITQKLKEGWQPYGSALISTAGYGAEFIQPV
VSEGSISSPEEPGNRPTTSAPSVAPEYYYVIALAGQSNGMSYGEGLPLPD
TFDSPDPRIKQLARRSTVTPGGAVCKYNDIIPADHCLHDVQDMSRLNHPK
ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGT
YSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEF
DFGGTPVNHAAQFGALVDKFRADLADMAGQCVGGSAGGVPWICGDTTYFW
KQKNESTYQTVYGSYKNKTEKNIHFVPFMTDENGVNVPTNKPEEDPDIPG
IGYYGSKWRDSSATWTSQDRASHFSSWARRGIISDRLATAILRHAGRVAL
NAGASSTVSEVRPSSPSGAEATGVTTLLSYLASESEGSLKVQGWSASGGR
AEVVSDAEGTGGKAVKLTKEAGKSSWVLEYAAGNGAALLQKGGQIRCRFK
VSGALAANQYVMAFYWPVSSLPQGVALTGDGGNNLLAAFYIQTDAKDLNV
MYHNAKVNRPGFPGECFICELRLPDHRFRWKHNKLFLLLPEEYGPAFPAI
VDCYTSPPTLVWPVPLLHGFSSSYGVLPPLCKDH
>ECs1193 hypothetical protein
MSTIAELVRANFREELVRWYRYRSSSSLPLDELYEHSPAARRYPRDRVLR
RLFKLNNEFQRNRIIRSLDLK
>ECs4187 hypothetical protein
MNLLSTTTSPRIIRKSEPDASIGQKLIATRRLRAAFHEICKVVKIAFFTG
IRLTNQRSLCC
>ECs5380 hypothetical protein
MALDLARRELELREIPYIKNSLHANYSYKSISIGSKQGWLISAKLKVPET
FEPDMIFIEISDPEGFINIPDVL
>ECs5267 hypothetical protein
MPPPSTQLFAEHLPTEWIQHFLTLSAHATVRRLNLSVDGEAGMNLLAPAA
LSPRRVSAWGPPRWNGSSARPHRHGARNVT
>ECs2734 putative head completion protein
MVTVAELQALRQARLDLLTGKRVVSVQKDGRRIEYTAASLDELNRAINDA
ESVLGTTRRRRRPLGVRL
>ECs1300 hypothetical protein
MYNFITIMYDVFSCFGVLAKNQNSRDIRNIKNFSSHQHSLGDMFDELINI
IDKEQVLSKEQRKVIFRRYEDLYVKLMHYSVFTDKTHQIIKQKYFNDIVP
MILALDIRNTYRPDNEMAFYYHIHSFLTQIPDNEDDIYHAARTYLRNYVK
LCLSGYTPANAHFKDIFDGVYEFIRNIRKNSTPGKTKLIATINTCKETCK
HLLYLSNEDKEKIISDLDKVQVACYYLTILLAFERRTSLTSTLATLYKML
ISEREVSEYECQLLYLTNPIDVMNILNKYIYYFPNENSPFYTLKIDSALS
WDAIDAIRDYSISDIYLYPEQKTINCVVEIENIVFGGYIYTLNNGVTLQN
IENSLKDSSCHYVLNGYTEFVNCLRQLTSGKTESVHRTINKLNYEKLPFG
FIIAAFAILKIAFKIKFSKNHVNIRALLNDINYFMTYQGESINLISLDHE
YPESCLQNDTNTYLLGRVIFLYNSMIYKFINCQEHETNNIHSAMINNLLQ
EVDIALGKINDIIDSRNISAPHELANILTREKILTTREKKGNLISLFDGF
TLFHCVGMITFLIHYLRTPEEKVENIFMLYGADKNNKLRRRLIYDALGII
QSQQE
>ECs4592 hypothetical protein
MFQNEIKDKFKTGLAYVRGPEKSGSSAQNAGSEGGKKCAGRSRRLSRDVR
LRNTSLIKRVSMMILVADQRNYIRRKQ
>ECs4541 hypothetical protein
MTSLTPEAALDILIAWLQDNIDSESGIIFDNDEDKTDSAALLPCIEQVRE
DVRTLRQLQLLQQNR
>ECs2139 multiple antibiotic resistance protein
MKPLLSAIAAALILFSAQGVAEQTQQPLVTSCGDVVVVPPSQEQPPFDLN
HMGTGSDKSDALGVPYYNQQAM
>ECs2998 Kil protein
MPLQGGLLLAALPNLYLNESPVNYVTDGNALSTYLISQESQRMDQTLMAI
QTKFTIATFIGDEKMFREAVDAYKKWILMLKRRSSKSIH
>ECs1775 hypothetical protein
MAQNSRLHNSDNSAVFASRHGRRSHAFKSDWFRHAPCTEEQAEWLIQNYR
RRGYEFRKALSLDYRHWIIYVRLPYSERPPRPSRTFQQRIWR
>ECs5270 hypothetical protein
MKKAKILSGVLLLCFSSPLISQAATLDVRGGYRSGSHAYETRLKVSEGWQ
NGWWASMESNTWNTIHDNKKENAALNDVQVEVNYAIKLDDQWTVRPGMLT
HFSSNGTRYGPYVKLSWDATKDLNFGIRYRYDWKAYRQQDLSGDMSRDNV
HRWDGYVTYHINSDFTFAWQTTLYSKQNDYRYANHKKWATENAFVLQYHM
TPDITPYIEYDYLDRQGVYNGRDNLSENSYRIGVSFKL
>ECs0821 putative lipoprotein Rz1 precursor
MRKLKMMLFGASLIMVVGCSSKENALCHPQTKPPAPPAWAMMPPSNSLQL
LDETFSVSGTELSATKQH
>ECs0716 hypothetical protein
MYYGALSIRAEAWLIVSPEVTKIMAKEQTDRTTLDLFAHERRPGRPKTNP
LSRDEQLRINKRNQLKRDKVRGLKRVELKLNAEAVEALNELAESRNMSRS
ELIEEMLMQQLAALRSQGIV
>ECs1408 hypothetical protein
MKSLTTETALDILIAWLQDNIDCGSGIIFDNDEDKTDSAALLPCIEQARE
DVRTLRHLQLLHQNR
>ECs1177 Kil protein
MPLQGGLLLAALPNLYLNESPVNYVTDGNALSTYLISQESQKMDQTLMAI
QTKFTIATFIGDEKMFREAVDAYKKWILILKLRSSKSIH
>ECs2982 hypothetical protein
MKQTFLLRNEAIRNNAIDAILSLPIDDKSPHEVHVKEPRRSKAQNDRMWP
MLNDVSRQVLWHGQRLAPEDWKDLFTALWLKTKKLEQRSVPGIDGGVVML
GVRTSKMRKASMTELIEIMFWFGSERNVRWSDDSRREYEWSQRKGRAA
>ECs1206 Shiga toxin 2 subunit B
MKKMFMAVLFALASVNAMAADCAKGKIEFSKYNEDDTFTVKVDGKEYWTS
RWNLQPLLQSAQLTGMTVTIKSSTCESGSGFAEVQFNND
>ECs2261 putative holin protein
MEKITTGVSYTTSAVGTGYWLLQLLDKVSPSQWVAIGVLGSLLFGLLTYL
TNLYFKIKEDRRKAARGE
>ECs5363 hypothetical protein
MLLWQTPSAQARGKKRNGKAASEVFYFVAAKDKKFARRYEEGWGRGDYPP
TMERINLSATSENALKGRVPVRA
>ECs1569 hypothetical protein
MACFLLSSKQHASIHKIKQLQSNFGERFFFSLLANRYKKTTNGIKMSNPN
PCMTDWRKYSQKLKSTVCWF
>ECs5417 hypothetical protein
MAFSTEGPEVRLLITTTELKCNAVIEFTVTKVNHASTETTVSGIPPPDTT
FEGNPGTATDTTLFPPQDIYPPGVQSFDSAR
>ECs4023 truncated putative fimbrial protein
MKRAPLITGLLLISTSCAYASSEGCGADSTSGATNYSSVVDDVTVNQTDN
VTGREFTSATLSSTNWQYACSCSAGKAVKLVYMVSPVLTTTGHQTGYYKL
NDSLDIKTMNRPGNPGD
>ECs3506 hypothetical lipoprotein
MGGRFSLRYKKLSYRFVFLTLAGCSSVGNQSLKNETQESVKTKIVKGKTT
KQDVLASFGEPDSRSLIDGEEQWSYTMYNSQSKATSFIPVVGLLAGGADS
QTKSLTVSFKGEKVSTYIFNAGTSNVKTGIF
>ECs5362 hypothetical protein
MTRFEAIKQGHIKIVDISIVCNFTVDKCELNPAYVIKNIDSPKDLLNGQK
KRSSSENRISYSIKLADEKYPP
>ECs0599 hypothetical protein
MRKFIFVLLTLLLVSPFSFAMKGIIWQPQNRDSQVTDTQWQGLMSQLRLQ
GFDTLVLQWTRYGDAFTQPEQRALLFKRAAAAQQAGLKLIVGLNADPEFF
MHQKQSSAALESYLNRLLAADLQQARLWSAVPGVTPDGWYFSAEIDDLNW
RSEAARQPLLTWLNNAQRLISDVSAKPVYISSFFAGNMSPDGYRQLLEHV
KATGVNVWVQDGSGVDKLTAEQRERYLQASADCQSSAPASGIVYELFVAG
KGKTFTAKPKPDAEIASLLAKRSSCGKDTLYFSLRYLPVAHGILEY
>ECs4542 hypothetical protein
MSGIIISRPEVDTGHTDVICSTSIRHVVTVRNAALQQTETLIRQLAEISV
LTAADIGGKTALDRAMKQDFHCGCWLMEKPETAMKAITRNLDCEIWRDLM
QRSGMLSLMDAQG
>ECs0443 hypothetical protein
MTQRPWSKLQREIYDLLTPTINLQIHCTRYPMRSQNGGSTDLPRYWITLD
KNVIWDYPKDFIAGNGGVRNFHGETCWYPYLTDICSISDLLREYIDTPKA
ELLTKQFTSDKWGLVNILRAADRRIGMRRLDQLRRKTHNIAALKNYCPP
>ECs0750 hypothetical protein
MNALSGLQVITRRPDKRSASGVTKKCRKSLKTAPDTKTVFKGSFASHYVD
KSSQVNPGTHITVRGSIHGEVSIKKMLKGSRCRTAL
>ECs0319 hypothetical protein
MITAGLAKSALQEKFMFRRRGVTLTKALLTAVCMLAAPLTQAISVGNLTF
SLPSETDFVSKRVVNNNKSARIYRIAISAIDSPGSSELRTRPVDGELLFA
PRQLALQAGESEYFKFYYHGPQDNRERYYRVSFREVPTRNLTKRSPTGGE
VSTEPVVVMDTILVVRPRQVQFKWSFDQVTGTVSNTGNTWFKLLIKPGCD
STEEEGDAWYLRPEDVVHQPELRQPGNHYLVYNDKFIKISDSCPAKPPSA
D
>ECs0198 RcsF
MRALPICLVALMLSGCSMLSRSPVEPVQSTAPQPKAEPAKPKAPRATPVR
IYTNAEELVGKPFRDLGEVSGDSCQASNQDSPPSIPTARKRMQINASKMK
ANAVLLHSCEVTSGTPGCYRQALCIGSALNITAK
>ECs2457 hypothetical protein
MLQHYSVSWKKGLTALCLLAVAGLSGCDQKENAAAKVEYDGLSNSQPLRV
DANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYE
ALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKTYSFDEVIVDS
NGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGE
VKFKGNASVLPANNTLATVTFKITE
>ECs4390 hypothetical protein
MKKVLGVILGGLLLLPVVSNAADAQKAADNKKPVNSWTCEDFLAVDESFQ
PTAVGFAEALNNKDKPEDAVLDVQGIATVTPAIVQACTQDKQANFKDKVK
GEWDKIKKDM
>ECs0280 putative tail fiber protein
MAKNDFKAFATDRNANVMSQEEWEALPALISGFTAGKASSAQVNKVIRQA
SFIAAALAQFVSDKTQRDVLDNGDLPGFVELLGSGFAVEYLSRKNPFGDI
KSDGTVKTALQNLGLGEGAPAIGVPFFWPSAAMPNTVIDSWSCMVFLKFN
GAKFSATDYPVLAKVFPSLVLPEARGDFIRIWDDGRGADGGRELLSWQAA
TNFSQFAGNIGEGAGHAINFHDGIAGNQPGFSRFNFTSNSVGDGVNFVAV
RPRNIAFNFLVRAK
>ECs1521 hypothetical protein
MAHDTKLYNSDDSAVFASRRGRCFHAFKSDWYQHPPCTEEQAEWLIQCYR
RRGCEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR
>ECs4574 SepD
MNNNNGIAKNDCDWLTALDFVKDVNGSPTHLTFYIYQKNAFLHDFGNYWV
LYIELSGDFRQVPTDTFIRLCNILAVSNEYKQMGIFLSNKKWYLCQIFHK
DNNHRANMSKAIMQHTLASLLDKQFDKLEQLSSSDTMMPPTHLFSDIGRI
V
>ECs0368 hypothetical protein
MWALTADADFLAQRGQGQVEQVFARAVNIALPARQQLLTLLCEEYDNAPN
SCRLALTHFDDLFRHGDKVQFDDQGITVGQHLHIEMSRCRRWLSPTLQMT
AVNFRLIAWQQWHDIIHQHLGENETLFNYRGDNPFYQALNKELHIKRRAV
IQAVIDKQNIASAVASMMGLGIGLTPSADDYLTGLALILFISGHPAGKYK
EEFYLGLQRGRNNTTLLSAITLEAALQQRCRENIHRFIHNIIYDIPGNAT
QAIEKIKHIGSSSGCDMLYGMADGCALSQTYGGNYVS
>ECs4496 involved in lipopolysaccharide biosynthesis
MVLLVMKSSTTIITAYFDIGRGDWTANKGFREKLARSVDVYFSYFERLAA
LENEMIIFTSPDLKPRVEAIRNGKPTTVIVIDIKKKFRYIRSRIEKIQKD
ESFTNRLEPRQLKNPEYWSPEYVLVCNLKAYFVNKAINMGLVKTPLVAWI
DFGYCRKPNVTRGLKIWDFPFDESKMHLFTIKKGLTVTSQQQVFDFMIGN
HVYIIGGAIVGSQHKWKEFYKLVLESQKITLNNNIVDDDQGIFVMCYYKR
PDLFNLNYLGRGKWFDLFRCFRSNTLGAKMQALRIFLSRK
>ECs1766 putative regulatory protein
MLKIDAIAFFGSKTKLANVAGVRLASVAAWGELVPEGRAMRLQEASGGEL
QYDPKVYDEYRKAKRAGRLNNENHH
>ECs2505 hypothetical protein
MGKATYTVTVTNNSNGVSVDYETETPMTLLVPEVAAEVIKDLVNTVRSYD
TENEHDVCGW
>ECs0353 hypothetical protein
MKKPLVIISACQFTRLALESLIPADRYIVRVYSNVTTEVEFVLKTSCGYL
LADIP
>ECs2986 phage replication protein P
MKNIAAQMVNFDREQMRRIANNMPEQYDEKPQVQQVAQIINGVFSQLLAT
FPASLANRDQNEVNEIRRQWVLAFRENGITSMEQVNAGMRVARRQNRPFL
PSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKS
NAHYWLVTNLYQNMRANALTDAELRRKAADELTCMTARINRGEAIPEPVK
QLPVMGGRPLNRVQALAKIAEIKAKFGLKGASV
>ECs1088 hypothetical protein
MAFKHYDVVRAASPSDLAEKLTHKLKEGWQPYGGPVAITPYTLMQAVAIE
GDPQVGPSSKPDWFYVVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLAR
RSTVTPGGESCTYNDIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQG
LHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQDSARW
GVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSAATHAQQPALF
TAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYG
GYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGN
QVSSNRPTHFSSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPG
GNTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQS
NKTWSLTHPVDDAITLLTQGGRLNCKFRLSGALTNNQFGLGIYLYTDAPV
PDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQ
TLELVFTAGSATVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAY
GVEIESLVLEINAPAA
>ECs2205 putative replication protein
MQHQVARHHGQFRKSGLHANSGNVTTDLSATETAWKLWELMGEVYSNRWT
QKNGAAPSKLWIAQIGAMTEQQIRLVCRQCMDRCRAGETWPPDLAEFVAL
ISESGANPFGLTVDAVMEEYRRWRNESWRYDGSDKYPWPQPVLYHICLEM
RTRGIERQMTQGELKRLAERQLTKWAKHVGNGMSVPPVRRQLEGAKHPQG
PTPIERLKQEYERRKAAGFI
>ECs2930 hypothetical protein
MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDP
ERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIED
VVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARRIVRQVVEEIMAR
LAKEVRQAFSGVRDRRRRSSIPLARDFDFKSTLRANLQHWHPQHGKLYIE
SPRFNSRIKRHSEQWQLVLLVDLSGSMVDSVIHSAVMAACLWQLPGIRTH
LVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKS
VIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDMA
QALVNVGAQIAAMTPGELATWLAENLQS
>ECs1167 hypothetical protein
MVFAKSPARSKTAGKTTCALKESDMAIAASYTMHLYCDCLQCTDGKYKSP
DFGEYIGTSWAGCAKEARKDGWRISKDKTRAFAPGHKILRSNKGE
>ECs2800 hypothetical protein
MKTFIKTLLVAVTILFSVFATAKQVKLPNNIKYVNTTEAFSCTEIDGMNC
QTKNQFNYKDNSYVFVLERGGAWCYDYTVSVVNLKTGKAQMIEYGDNQLC
SGSNKPFFEIKNGVPTVGVIDTSGKPVGVAQDKLKI
>ECs3022 hypothetical protein
MNSQQGGGMSHVWGLFSHPDREMQVINRENETISHHYTHHVLLMAAIPVI
CAFIGTTQIGWNFCDGTILKLSWFTGLALAVLFYGVMLAGVAVMGRVIWW
MARNYPQRPSLAHCMVFAGYVATPLFLSGLVALYPLVWLCALVGTVALFY
TGYLLYLGIPSFLNINKEEGLSFSSSTLAIGVLVLEVLLALTVILWGYGY
RLF
>ECs5411 hypothetical protein
MMLQLCATMLNVTCVASLKMNEMLIKEIDMTGIKKITQTFSLRQLTFLKG
ATAKNVRECNLMKNSVAEH
>ECs2614 hypothetical protein
MDSIHGHEVLNMMIESGEQYTHASLEAAIKARFGEQARFHTCSAEGMTAG
ELVAFLAAKGKFIPSEEGFSTDQSKICRH
>ECs2622 putative DNA-binding protein
MSGITININVNAPYVSLQKYAEITGIPLNTCKKMLADGRIIIRPKRAKME
KPEVNLVAMLKDALANS
>ECs4572 hypothetical protein
MDVLCPCLFHKKRLTVNMNNINQSENINIQLNKAPQTNFVDEHTSLASAP
SAAGAAQFLDQLLPKTAGVSSPEQVLIEEIKKRHLATMNSDLSFDALSAG
GLSPEDVLTLQKNVLNANVNVDVVSKLASLLSTSVTKLVSMQ
>ECs1227 hypothetical protein
MTTITEIIGRVNTQLVDPMMVRWPLQELCDYYNDAVRAVILARPDAGASL
ETISCVPGARQVLPDGVIQLLDVICLSDGSAVRPLSREVLDAQYPEWPTM
KGIPECFISNDLSPRVFWLFPAPDKEISIDAVVSRIPEAVYVLTQDDDTP
VPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGA
DSALYARKKVFNGGGV
>ECs2379 hypothetical protein
MGKMNHQDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIR
QHSVAGQLVARAVFLSPPYSVAEEELSVLLENIKQNGDYADIACMTGSQD
DYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLM
QAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKA
YGLCEWFEVEQFQNP
>ECs1439 DinI
MRIEVTIAKTSPLPAGAIDALAGELSRRIQYAFPDNEGHVSVRYAAANNL
SVIGATKEDKQRISEILQETWESADDWFVSE
>ECs2226 hypothetical protein
MLQLSSNIGWKKGAENALKNKIHSHSFVVNPDEFSCDTQFLKCPITLCVP
EKGVFVKNALNSNICTLYDKSAFMNLTREHLPHPLSREKIVKEMIIERNM
CYFDTISQHFIIMDADQQKQHCK
>ECs2232 putative outer membrane protein
MRKLYAAILSAAICLAVSGAPAWASEQQATLSAGYLHARTSAPGSDNLNG
INVKYRYEFTDTLGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVM
AGPSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDD
GRHSNTSLAWGAGVQFNPTESVAIDIAYEGSGSGDWRTDGFIVGVGYKF
>ECs3850 putative virulence-related membrane protein
MILYVMSGSRLADNHTLSAGYAQSKVQDFKNIKGVNLQYRYEWDSPVSVV
GSFSYMKGDWADSHRDEADDFYRHQADIKYYSFLAGPAYRLNDYISFYGL
VGISHTKAKGDYEWRNSVGADESDGYLSESVSKKSTDFAYAAGVIINPWG
NMSVNVGYEGTKADIYGKHSVNGFTVGVGYRF
>ECs1954 hypothetical protein
MSGQYIPYHLRHNKSIDREIFLESLNLLSKRLNIQEYTYIGFGGPMLEDF
RIMHNRIALSDMISLEEQESTHIRQKYNLPYNCIDCKLISAHDFILDYSF
SKPSITWLDYASPKKIQTDLDDIHLLSTKVSSFDILKVTFPINPSSYYQR
RVGESLDIFKEAFINSLKSLLGKKYLDFNLQISNADLSDRKIKALLIRII
TNAFRSAIERGLSGRKDKIQYYPLSLNQYNDGSHTMLTISGFFSSEQEYI
ELSKACDLSNWQFYSTNWEDVQEIAIPTLTIKEKINLDSKLPDKEAYQAA
AEEFSLNENERDNYYKYYRLYPNFQRIMV
>ECs1856 osmotically inducible lipoprotein
MFVTSKKMTAAVLAITLAMSLSACSNWSKRDRNTAIGAGAGALGGAVLTD
GSTLGTLGGAAVGGVIGHQVGK
>ECs2625 hypothetical protein
MGKEYKTLINKAPERFYFRLSASGAHAERAARDSLTRAIRSLYDVAFYAD
DLDALNELSELICAAECGEHIEPYKLGNIA
>ECs3507 hypothetical protein
MTIAKDSSTDEIIHGNDLRGMDDFYIKTTSFECPYEPCKIKATPCSFTMK
HVNQSYFRYGDKHKDGCGIHDPRYKNNHTSNDERKHNSPPAPVISLLKID
VKPRGGVKNARNIKNENHKDEKKGNEHPVSSSSIKPVVDYYINNSNHNEQ
LSIPPYGTRSYKDTFQLIFYKNNIRYYKPAIYYGVAQSNIRLHEDSDKHC
ITFLARDKKTQKPFTLEIDVSDWNKSQKDVFWKEYEKQRKEADRYYKGLK
DKRNAKKYLTVFFFGMPDENDKFLFKTNHFKLVYVAFLGKFESSYNDSNY
YIENDLSVSSLNEQPISLSDIDHKNHEYNIETSLDFSSPLPEPEPEPEPE
PEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPIRSSLKENT
ETVEMNSKVCRRLKKIITFFKK
>ECs3588 hypothetical protein
MSGKRISREKLTIKKMIDLYQAKCPQASAEPEHYEALFVYAQKRLDKCVF
GEEKPACKQCPVHCYQPAKREEMKQIMRWAGPRMLWRHPILTVRHLIDDK
CPVPELPEKYRPKKPRE
>ECs1578 hypothetical protein
MLKTFRVFASAVNPLGHTIGIAQNVKAVNVQTAIAAVRSESSEYGLSQVI
ISAVYELKEVH
>ECs0767 hypothetical protein
MSPDNIGSAVNRFSGWPSNDSLKTDKESMTSPVTVQVIKMIAAILDPHRI
VVGIDTLYTTQRETFTSESQQLIESALQTLHVMNLNNLETNRDEESHYLA
FEYNVDTSDRITVSAYNEKKPGCYGQCLVIQDDETDLFNLIKIKIFEARH
NYVAQLMSDPEFMFKFSYSAQQNRLEPHLLVPAYFPLKTNEPVTQEDVLL
LYRFFKMNDDFNKLTSDEYMSILTPLMKCERSVHDNNRYVTGKDTLLLDY
PPSGNQIHFHVFPDESATLVLYFSNNDIECFVFERDIPSQYRFFKMFTNL
ALVIDQLTEANKIL
>ECs4427 putative fimbrial protein precursor
MKAAIALSLLGCVFGFSGKAFAGDAWGPCTPADGTTYHYNVDVDVGIPDA
AKNVAGTVLPDVLNWSNGQNVSLICECPDSYKNEKDTLVQGVSMLPPSGR
TVDSMKYYTLTEELEVATNIRISTSVYGFVPFKNQQALQTTGCNKVITTP
YMGGAGLLSFAITKPFIGDSVIPLTLIAELYASKTNKDYGTIPISSVSIQ
GRVTVTQDCEIKPGTVLDVPFGEFPSSAFKNRQGQMPEGATEQEINLSFD
CNNISDGIKVALRLEGATNADDPRAVDMGNPDIGVLVKDSSGKILVPNDS
SSTTLLNLSSLDSKTHRNAAIRLLALPISTTGKAPKGGTFEGVTTIYLEM
E
>ECs2049 hypothetical protein
MTHICARFIHLAGRPYMSLYQHMLFFYAVMAAIAFLITWFLSHDKKRIRF
LSAFLVGATWPMSFPVALLFSLF
>ECs2981 putative DNA methylase
MTIKSNTPAHDKDCWQTPLWLFDALDIEFGFWLDSAASDKNALCAHWLTE
ADDALNSEWISHGAIWNNPPYSNIRPWVEKAAEQCIQQRQTVVMLVPEDM
SVGWFSKALESVDEVRIITDGRINFIEPSTGLEKKGNSKGSMLLIWRPFI
SPRRMFTTVSKAALMAIGLGVRRAA
>ECs2218 hypothetical protein
MVVFFLKIPNLDFAVYLRPFYVPYMPHRYPAAKINKMPKGSVPALQQEML
RRVSKRYDDVEVIIKSTSNDGLEPPRFSWRVFYL
>ECs5279 D-mannose specific adhesin
MKRVITLFAVLLMGWSVNAWSFACKTANGTAIPIGGGSANVYVNLAPAVN
VGQNLVVDLSTQIFCHNDYPETITDYVTLQRGSAYGGVLSNFSGTVKYSG
SSYPFPTTSETPRVVYNSRTDKPWPVALYLTPVSSAGGVAIKAGSLIAVL
ILRQTKNYNSDDFQFVWNIYANNDVVVPTGGCDVSARDVTVTLPDYPGSV
PIPLTVYCAKSQNLGYYLSGTTADAGNSIFTNTASFSPAQGVGVQLTRNG
TIIPANNTVSLGAVGTSAVSLGLTANYARTGGQVTAGNVQSIIGVTFVYQ
>ECs1245 MokW
MLNTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCE
VRIRTGQTEVAVFVDYESEK
>ECs2269 hypothetical protein
MRVLLRPVLVPELGLVIVKPGRESMSAFHNGRILVEPEPKSMRALPSGVV
PAVHQPLAEDKSLLPFFSDERVIRAAGGAGALSDWLLRHVKSCQWLHGDY
HHSETVIHRYGTGAMVLCWHCDNQLREQTSDSLDQLAQQNLAAWMIDIIR
HAMNGAQERELSLAELSWWAVRNQVADALPEAVLRRSLGLRAEKIRSVYR
ESDIIPGEQTATSILKQRTKNIALPPHTHQQQNPPQEKTVVSIAVDPESP
ESFMKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTK
SHDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFLDHAFATGVLG
>ECs2182 putative transcriptional regulator
MLHDHLAECLEKKGLYRRAAERWAKVMVQLSDDQKRKVAAQKRAECLRKA
RRTPVSPVNLTEIKQAVNRLHSELGMGFEERRVFRRYKGTGEQNTSGNAR
SKKC
>ECs0766 hypothetical protein
MIPSQISFNTLPVNATYASETTVDTQKFSDILYSAGCSLKDMAYLSRCLA
SIHPTLAKNIYETENLSDQKLLHIDCRSTNEIKINIFFGQQREGLIEINS
DTVIFSILSSLIDSAISKFQHQIPVNGSVNRELLYEDYTD
>ECs4998 putative DNA modification protein
MGNRKQVTSRIISTPELIRYNDNIVGYGSRELRVETISCWLARLVIVNKH
YSHRFVNNSYLHLGIFSERELVGVMQWGYALNPNSGARVVTGTQNREYME
LNRLWLHDCMPRNSESRAISYALKLIRQLYPQVQWVQSFADERCGCLGVV
YQASNFDYVGSHESIFYELDGEWYHEICRNAIKRGGQRGEHLRANIDRAS
VHKFRQFRYIRFLNKRARKRLNTKLFKVQPYPKPQTVKTGLKESE
>ECs0222 hypothetical protein
MSRLRVFRQTGWFIAGLMTGLPATAAPAEASSMAGVAVAVATTTPPDATA
TLQAMQSCRRESAALERLDCYDHLLAPLSPSGFDGALVKAGFVGEAWTRA
TEQEKRREGNTTELLVTQVPGERPTVVITTPAIGHVPPRPVLMFSCVDNI
TRMQVALMHPLDVHDIAVTLNADSRALRSHWFVRENGTLLESSRGLSGID
EIKQLFGAKTLTVDTGADNAAGKLTFNIDGLARAIAPLRDACHWAGE
>ECs2258 putative antirepressor protein
MNMMAVPFHGNSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKLRQRFAS
TITEIVMVAEDGKQRNMVSMPLRKLAGWLQTINPNKVKPEIRDKVIRYQE
ECDDVLYEYWTKGFVVNPRKMSVMEELNQACADMKRDKNIASVFATGLNE
WKQVKAAHVSKIRTLVNEANMLIDFVLADTGKGKITKAD
>ECs1672 hypothetical protein
MKIKSISKAVLLLALLTSTSFAAGKNVNVEFRKGHSSAQYSGEIKGYDYD
TYTFYAKKGQKVHVSISNEGADTYLFGPGIDDSVDLSRYSPELDSHGQYS
LPASGKYELRVLQTRNDARKNKTKKYNVDIQIK
>ECs3461 leader peptide of chorismate mutase-P-prephenate dehydratase
MKHTPFFFAFFFTFP
>ECs3978 hypothetical protein
MQIPRMSLRQLAWSGAVLLLVGTLLLAWSAVRQQESTLAIRAVHQGTTMP
DGFSIWHHLDAHGIPFKSITPKNDTLLITFDSSDQSAAAKAVLDRTLPQG
YIIAQQDNNSQAMQWLTRLRDNSHRFG
>ECs1217 Bor precursor
MKKMLLATALALLITGCAQQTFTVQNKQTAVAPKETITHHFFVSGIGQKK
TVDAAKICGGTENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ
>ECs4416 putative protease
MRDIVDPVFSIGISSLWDELRHMPAGGVWWFNVDRHEDAISLANQTIASQ
AETAHVAVISMDSDPAKIFQLDDSQGPEKIKLFSMLNHEKGLYYLARDLQ
CSIDPHNYLFILVCANNAWQNIPAERLRSWLDKMNKWSRLNHCSLLVINP
GNNNDKQFSLLLEEYRSLFGLASLRFQGDQHLLDIAFWCNEKGVSARQQL
SVQQQNGIWTLVQSEEAEIQPRSDEKRILSNVAVLEGAPPLSEHWQLFNN
NEVLFNEARTAQAATVVFSLQQNAQIEPLARSIHTLRRQRGSAMKILVRE
NTASLRATDERLLLACGANMVIPWNAPLSRCLTMIESVQGQKFSRYVPED
ITTLLSMTQPLKLRGFQKWDVFCNAVNNMMNNPLLPAHGKGVLVALRPVP
GIRVEQALTLCRPNRTGDIMTIGGNRLVLFLSFCRINDLDTALNHIFPLP
TGDIFSNRMVWFEDDQISAELVQMRLLAPEQWGMPLPLTQSSKPVINAEH
DGRHWRRIPEPMRLLDDAVERSS
>ECs4463 hypothetical protein
MFLDYFALGVLIFVFLVIFYGIIILHDIPYLIAKKRNHPHADAIHVAGWV
SLFTLHVIWPFLWIWATLYRPERGWGMQSHDSSVMQLQQRIAGLEKQLAD
IKSSSAE
>ECs2752 hypothetical protein
MRVLLRPVLVPELGLVIVKPGRESMPVFHNTRVLVEPEPKSMRNLPSGVV
PAVRQPLAEDKSLLPFFSDERVIRAAGGAGALSDWLLRHVKSCQWPHGDY
HHSETVIHRYGTGAMVLCWHCDNQLRDQTSESLEQLAHQNLSAWMIDVIR
HAMNGTQERELSLAELSWWAACNQVVDALPEAVARRSLGLPAEKIRSVYR
ESDIVPGEQTAISILKQRTKNIALPLHVHQQQNPPQKKTVVSIAVDPESP
ESFMRRPKRCRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTK
AHDIFTLPLCREHHNELHADPLEFEKKYGSQIELIFRFLDHAFATGVLG
>ECs3459 hypothetical protein
MSCRFFILSVVKLKRFSHYRSHQIWLALRYSSSKKTPLPAISHKKDSLTK
SDKIMRFPSHILTSGTVC
>ECs3003 hypothetical protein
MVMKHPHDNIRVGTITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNK
RGAVCTKHLLLN
>ECs5265 hypothetical protein
MTAPTGGAIRREATKAASEISFKFAFQFIVTEMIVLGNTASPGTIPKRLE
HLRGLWKWCS
>ECs3466 hypothetical protein
MRFSHRLFLLLILLLTGAPILAQEPSDVAKNVRMMVSGIVSYTRWPALSG
PPKLCIFSSSRFSTALQENAATSLPYLPVIIHTQQEAMISGCNGFYFGNE
SPTFQMELTEQYPSKALLLIAEQNTECIIGSAFCLIIHNNDVRFAVNLDA
LSRSGVKVNPDVLMLARKKNDG
>ECs0661 hypothetical protein
MNVSKYVAIFSFVFIQLISVGKVFANADEWMTTFRENIVQTWQQPEHYDL
YIPAITWHARFAYDKEKTDRYNERPWGGGFGLSRWDEKGNWHGLYAMAFK
DSWNKWEPIAGYGWESTWRPLADENFHLGLGFTAGVTARDNWNYIPLPVL
LPLASVGYGPVTFQMTYIPGTYNNGNVYFAWMRFQF
>ECs4244 putative transport
MDNVELSPATRWGMIATGLLQGLVCYLLIAWLSGKNHSWIVYGVPATVAF
SSVLLFSVISFKQKRLWGWLALVFIATLGMSGWLKWQTDGMTPWRAEKAL
WDFGCYLLLMAMLLLPWIQQSLRIRNDSSRYRYFYQSVWHNVLILLVIFL
ANGLTWLVLLLWSELFKLVGITFFKTLFFATDWFIYLTLGLVTALAVILA
RTQSRLIDSIQKLFTLIATGLLPLVSLLTLMFIITLPFTGLSAISRHISA
AGLLLTLAFLQLILMAIVRDPQKASLPWTGPLRCLIKTALLVAPLYVFIA
AWALWLRVAQYGWTVDRLQGALAVLVLLVWSLGYFVSIVWRKGQNPLVLQ
GKVNLAVSLLVLVILVLLNSPVLDSMRISVNSHMARYQSGKNTPDQVTIY
MLEHSGRYGRAALESLKSDAEYMKDPKRARDLLMALDGEQHLQQQISEKV
LAENVLIAPGSGKPDATFWSALIQDRYNVMTCIEKDACVLVEQDLNSDGR
AERILFAFDDERYIVYGFDPDKKEWQELTMSLLPRDITKEKLLTAAKDGK
LGTKPKAWRDLVVDGERLDVNLNE
>ECs0307 hypothetical protein
MSLYIKLILSIVREISVNTICSLIVVVALSLLSFSSVAKTITAVGSTINS
TEKEISLQAEKQGKSYKILGAFFKNRVYMIAKLTPVSKNDAS
>ECs3612 hypothetical protein
MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRVSSQSLKRAWRT
SALFEQALAGHIGIRSGRIAREAATILIEKGIEEKKAIEWAAKIADYLGK
AKNDKKPKDPLTNAETEQLVHISPAEFDAVKALAHQLAEEKRAPKEEDLA
LLRKDRMAVDIAMFGRMLANKPEFNVEAACQVAHAFGVSETIVEDDFFTA
VDDLRQASEDAGAGHLGETGFGSALFYTYICIDKDLLVENLGGDEALANQ
TLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTEQPRSLAAAFYEPI
NGTRQLDVAVQRITTLRENMNTVYEQKTECASFDVMNKQGSMKDVLDFIC
A
>ECs1548 hypothetical protein
MTEADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVM
GGQAESSVSVQIDVYAGTVTQARQIRQDAREAIMLLAPGSVSEMQDYIPE
NRCYRATLEFQVTV
>ECs3239 hypothetical protein
MEKDLKELREYLLLSPVYSEINDCLASLSMEDVTNEEFAERLGIAPDWMD
AIDTSQLA
>ECs1940 hypothetical protein
MQQLVPPPGVKGIDKMVHYEVVQYLMDCCGITYNQAVQALRSNDWDLWQA
EVAIRSNKM
>ECs1898 outer membrane protein
MKKLLPCTALVMCAGMACAQAEEKNDWHFNIGAMYEIENVEGYGEDMDGL
AEPSVYFNAANGPWRISLAYYQEGPVDYSAGKRGTWFDRPELEVHYQFLE
SDDFSFGLTGGFRNYGYHYVDEPGKDTANMQRWKIAPDWDVKLTDDLRFN
GWLSMYKFANDLNTTGYADTRVETETGLQYTFNETVALRVNYYLERGFNM
DDSRNNGEFSTQEIRAYLPLTLGNHSVTPYTRIGLDRWSNWDWQDDIERE
GHDFNRVGLFYGYDFQNGLSISLEYAFEWQDHDEGDSDKFHYAGVGVNYS
F
>ECs5285 hypothetical protein
MMRQSLQAVLPEISGNKTSLLRKSVCSDLLTLFNSPHSTLPSLLVSGMPE
WQVHNPSDKHLQSWYCRQLRSALLFHEPRIAALQVNLKEAYCHTLAISLE
IMLYHDDEPLTFDLVWDNGGWRSATLENVS
>ECs0432 hypothetical protein
MKNLIAELLFKLAQKEEESKELCAQVEALEIIVTAMLRNMAQNDQQRLID
QVEGALYEVKPDASIPDDDTELLRDYVKKLLKHPRQ
>ECs5165 hypothetical protein
MVSRKRNSVIYRFASLLLVLMLSACSALQGTPQPAPPVTDHPQEIRRDQT
QGLQRIGSVSTMVRGSPDDALAEIRAKAVAAKADYYVVVMVDETIVTGQW
YSQAILYRK
>ECs0514 hypothetical protein
MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNE
LIEHIATFALNYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKW
RKSGNRLFRCFVNATKENPASLSC
>ECs1240 hypothetical protein
MGYGLLDIANQSRREALQGISDADRRREEIEAANKQMAAQQKAQNKQNIG
TGIGTGAAIGASVGGPVGAVAGAVIGGIAGSLF
>ECs3304 hypothetical protein
MKSTEFHPVHYDAHGRLRLPLLFWLVLLLQARTWVLFVIAGASREQGTAL
LNLFYPDHDNFWLGLIPGIPAVLAFLLSGRRATFPRTWRVLYFLLLLAQV
VLLCWQPWLWLNGESVSGIGLALVVADIVALIWLLTNRRLRACFNEVKE
>ECs1872 hypothetical protein
MNHDIPLKYFDIADEYATECAEPVADAERTPLAHYFQLLLTRLMNNEEIS
EEAQHEMAAEAGINPVRIDEIAEFLNQWGNE
>ECs3504 hypothetical protein
MAKPARRRCNRKREDLTVKRIFELLSFDKSTGVFRWKVPTQGRIALNSVA
GAFDSNGYSMIMIDGRRYKTHVLVFYITHNRWPAGQIDHVNGIRTDNRPE
NLRECLPIENSRNIRIRKNSKSGCRGVTWHKRQKKWNVRLGFHGKSKHFG
CFDDLELAVLVAEEARDKYYGDFSGNERSTYANLSKEM
>ECs0001 thr operon leader peptide
MKRISTTITTTITTTITITITTGNGAG
>ECs4361 hypothetical protein
MKIGTVAGTNDSTTTIATNDMVQEHVTNFTKELFGYIANGIGDDISSIAR
TMLGEVLEKVDDWQIERFQQSIQDDKISFTIQTDYSEKYSMLSGMRAHIL
RRNNHYQFIVTINSKNYGCPLDNTDINWCGIVYLLNNMTVNDNANDVAVT
ESYKPIWNWKISQYNVSDIKFETIIKPQFADRIYFSNCSPVDPTSTRPTY
FGDTDGSVGAVLYALFATGHLGIMAEGENFLSQLLNIEDEVLNVLLRENF
NEQVDTNVNTIISILNRRDIILESLQPYLVINKDAVTPCTFLGDQTGDRF
SNICGDQFIIDLLKRIMSINDNVHVLAGNHETNCNGNYMQNFTRMKPLDE
DTYDGIKDYPVCFYDSKYKIMANHHGITFDDQRKRYIIGPITVSIDEMTN
ALDPVELAAIINKKHHTIINSKKFKTSRAISCRSFNRYFSVSTDYRPKLE
ALLACSQMLGINQVVAHNGNGGRECIGETGTVLGLNARDSKHTGRMFSMH
NCQINPSAGPEITTPWKSYQHEKNRNGLMPLIRRRTMLQL
>ECs3608 hypothetical protein
MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQL
AGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ
>ECs0950 hypothetical protein
MIRVNFSKCLFHDCSMHGVKIKPWLPVKWTKELINDYLYGCLLSLYSICA
RDIYNMNAGNNVKVAADAFLEILYSLKNKYCIKLLSAQDRAFIYEFARMI
FAYINDKSIEILLLSCFAAADQKAIQRYIPQAQDGEDFRNHLQYKLTLPT
H
>ECs1463 hypothetical protein
MSSRVANLTVIVSSKRRRVSVARFSCGKTAQLSKKQTGYYSPEIFPSTGK
DCNPQPANCLKDQYVLRHCCVDDRSGKMGYSVKFLVLTRMDTETASLFHC
KPCYSKMTFTIYHPLTHSFFTSCW
>ECs1182 hypothetical protein
MEQTGRLFKQRRLSTTWLKSQITQPHKLWDAMPKQPSQEELRDCIAKVYS
GGIYVQKNRI
>ECs3291 hypothetical protein
MKKIICLVITLLMTLPAYAKLTAHEEARINAMLEGLAQKKDLIFVRNGDE
HTCDEAVSHLRLKLGNTRNRIDTAEQFIDKVASSSSITGKPYIVKIPGKS
DENAQPFYMR
>ECs4613 ilvB operon leader peptide protein
MATSMLNAKLLPTAPSAAVVVVRVVVVVGNAP
>ECs2728 putative minor tail protein
MFLKTEQFEYNGVSVTLSELSALQRIEHLALLKRRAEQAESSGNLQVSVE
DLVRTGAFLVAMSLWHNHPQKTQSPSMNEAVMKIEQEVLTTWPADAIARA
EDVVLCLSGMSGAVRPDTDITEVAKNNTLTDDDFSAGKSSTAS
>ECs4978 hypothetical protein
MKTIFIKPAPGRLIRDPDTMRPLAQEGEEKPFTPFWCRRLDDGDVIQAEK
AAEEAPAVSADATTATEKPVAQPASDKEKPQ
>ECs3813 hypothetical lipoprotein
MKKWKVRSALVALIVLLAGCSSNAQYNSSASGNVGTAWGGDVHSTVQGVS
AERAWRDPAEMIVISYSTNVPSGYDRVYSIRINELEYAIRDGNFNSLPIT
RVYDSSNNEPRYIVHARVGMNYQLYVRNYSRNTNYEIVATVDGLDVLNGK
QGSLNNNGYIVNAGDSLAIKGFRKDKHTEAAFQFANVADSYAANSAQGDV
RNTGVIGFAAFELQGQAQNALPPCSGQAFPADNNGYAPPPCRK
>ECs1312 putative complement resistance protein precursor
MRFSTIVSVVTLVWGISPRQPSGKNIIRWLLKKRTNGSVRYCQYTSTIWL
EPASERTVFLQIKNTSDKDMSGLQGKIADAVKARGYQIMTSPDKAYYWIQ
ANVLKADKMDLRESQGWLSRGYEGAAVGAALGGGITAYNTGSAGTTLGVG
LATGLIGMAADAMVEDINYTMITDVQIAERTRTSVRTDNVAALRQGTSGS
KIQTSTETGNQHKYQTRVVSSANQVNLKFEEAKPHLEDQLAKSIANIL
>ECs1394 hypothetical protein
MYNCGESYSEYDISLSDDENYDDDYYRERPCPGEVFGYDNTEDNDEAASD
YALKINENSD
>ECs2243 putative major tail subunit
MSALYERSQLTQVMISSAPATAETMDKAEYLRLDCTIKEVQFTAGQKQDI
DVTTLCSTEQENINGLGASSEISMSGNFYLNQAQNALRDAYDNDALYAFK
VLFPSGKGFKFLAEVRQHTWSSGTNGVVAATFSLRLKGKPVSFVVPLAFV
KNLDKTLTVNTGALLTMSVSANGGTPPYKYAWKKDGQPVDGQTTDTFSKP
GAQSADAGKYTCVVTDSAEKAQSVTSVECTVTVSAAAG
>ECs0581 hypothetical protein
MNFIPCHHVNAVGPMGGITSASMPMLVVENVTDGNRAYCNLNEGIGKVMR
FGAYGEDVLTRHRWMRDVLMPVLSAALGRMERGIDLTAMMAQGITMGDEF
HQRNIASSALLMRALAPQIARLDHDKQHIAEVMDFLSVTDQFFLNLAMAY
CKAAMDAGAMIRAGSIVTAMTRNGNMFGIRVSGLGERWFTAPVNTPQGLF
FTGFSQEQANPDMGDSAITETFGIGGAAMIAAPGVTRFVGASGMEAARAV
SEEMAEIYLERNMQLQIPSWDFQGACLGLDIRRVVETGITPLINTGIAHK
EAGIGQIGAGTVRAPLACFEQALEALAESMGIG
>ECs0938 hypothetical protein
MEDETLGFFKKTSSSHARLNVPALVQVAALAIIMIRGLDVLMIFNTLGVR
GIGEFIHRSVQTWSLTLVFLSSLVLVFIEIWCAFSLVKGRRWARWLYLLT
QITAASYLWAASLGYGYPELFSIPGESKREIFHSLMLQKLPDMLILMLLF
VPSTSRRFFQLQ
>ECs1624 lipoprotein Rz1 precursor
MRKLKMMLCMMMLPLVVVGCTSKQSVSQCVKPPPPPAWIMQPPPDWQTPL
NGIISPSERG
>ECs1143 hypothetical protein
MNKGKVMKHKLSAILMAFMLTTPAAFAAPEAANGTEATTGTTGTTTTTTG
ATTTAATTGGVAAGAVGTATVVGVATAVGVATLAVVAANDSGDGGSHNTS
TTTSTTR
>ECs0136 hypothetical protein
MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGS
FIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIA
AMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYN
SPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDE
GYTSESQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGKSMMTLAQWFEE
KGIEKGIQQGRQEERQEFAQRFLSKGMSREDVAEMTNLSLAEIDRLIN
>ECs4292 hypothetical membrane protein
MKAKYVILSVLNKQELFYLKSRYKEMKKLANKHEVSLVEQNFIDAIISMV
GYCSLGFGLVSIVNGILLVFSFFQDNMLKSFFLSSIVSIAMFAILIIHRK
LDTKFSYQLCLKMFYLKIKLFCYCIPAPVVVPDIFINN
>ECs4607 hypothetical protein
MDFSGRNVKEFIRLLSDHDQFEKDQISELTVAANALKLEVAKNNYNMKYS
FDTQTERRMIELIREQKDLIPEKYLHQSGIKKLKLHEDEFSSLLVDAERQ
VLEGSSFVLCCGEKINSTISELLSKKITDLTHPTESFTLSEYFSYDVYEE
IFKKVVFTPGECDLVRDSLVQDGIKPEKIAKIISCLSSRTGIRNLFYACS
QAFHVANDNAALDVRRAATSVGDTIDVCPSFVALNGRNYHQHVDFSEYGF
TQTLCVLGAVVNNANDTFDVANYIYTTKGENIIHNDAEDDSPYVHSDESY
RHKDDYWARDNDIRNLRRNFSGNITIATYK
>ECs2692 hypothetical protein
MMKTAKEYSDTAKREVSVDVDALLAAINEISESEVHRSQNDSEHVSVDGR
EYHTWRELADAFELDIHDFSVSEVNR
>ECs4389 hypothetical protein
MGYKINISSLRKAFIFMGAVAALSLVNAQSALAANESAKDMTCQEFIDLN
PKAMTPVAWWMLHEETVYKGGDTVTLNETDLTQIPKVIEYCKKNPQKNLY
TFKNQASNDLPN
>ECs4381 hypothetical protein
MQSKEKKKLSLLTTKFSDEAWGDNHKNKVVYRAQDRFHNPLFIVLMVVKK
KLDHKKLIDNNSHYHVCI
>ECs5006 hypothetical protein
MALPRITQKEMTEREQRELKTLLDRARIAHGRVLTNSETNSIKKEYIDKL
MVEREAEAKKARQLKKKQAYKPDPEASFSWSANTSTRGRR
>ECs3610 hypothetical protein
MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRRE
ELQGAFRFFVLSQERPAESETFTIECRSFVPELRTGQQLCFNLRANPTIC
KAGKRHDLLMEAKRQVRGQAEGSDVWLHQQQAALDWLAAQGERSGFTLLD
TSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKS
RAFGCGLMLIKPGAEV
>ECs0341 hypothetical protein
MMVFYMFKKSVLFATLLSGVMAFSTNADDKIILKHISVSSVSASPTVLED
AIADIARKYNASSWKVTSMRIDNNSTATAVLYK
>ECs5247 hypothetical protein
MRIMHLSRKVVTPRGRLDLHLARGTEQTYRLSSQMALGLICPASPPRKGL
HLKEQVMF
>ECs2210 hypothetical protein
MTAGFNFNNYAAGFCSATPALRGNEVNMDTLNLGNNESLVCGVFPNQDGT
FTAMTYTKSKTFKTEAGARRWLARNTD
>ECs0781 putative homeobox protein
MKMTKLATLFLTATLSLASGAALAADSGAQTNNGQANAAADAGQVAPDAR
ENVAPNNVDNNGVNTGSGGTMLHPDGSSMNNDGMTKDEEHKNTMCKDGRC
PDINKKVQTGDGINNDVDTKTDGTTQ
>ECs3932 putative oxidoreductase
MILFADYNTPYLFAISFVLLIGLLEIFALICGHMLSGALDAHLDHYDSIT
TGHISQALHYLNIGRLPALVVLCLLAGFFGLIGILLQHTCIMVWQSPLSN
LFVVPVSLLFTIIAVHYTGKIVAPWIPRDHSSAITEEEYIGSMALITGHQ
ATSGNPCEGKLTDHFGQIHYLLLEPEEGKIFTKGDKVLIICRLSATRYLA
ENNP
>ECs0823 hypothetical protein
MPFMQGLYQTFVMAFRLTAPPGGVACHGAGASRKKASFCIFIGHHHLCIL
LIINGYLFFVCRIECFLFDIERVFLKLFARCMFKALRRKYGS
>ECs4996 hypothetical protein
MKPSIAGTMLGQSVKTCINALLRARGVSYDIPRSPVKQRCIKRRSLSWMK
EPVCPACKTPLQLIYRRSRSQNGKTYKPKNASCFRMLICPACSPRRENTQ
NVH
>ECs1992 putative tail fiber protein
MYEDSQPGTLNDFLGAMSEDDVRPEALRRFELMVEEAARHAEEAKKNAGE
AETSARNAGISASQAEESAANADTSAGDASESARQAAESAAAAKQSEDAS
SSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESA
ESAQSAEQSRIAAEEAVNRIPTVVGPPGPKGEQGPAGPQGPKGDKGERGD
TGPVGATGERGPAGDAGPAGPQGPKGDRGERGETGLTGNAGPQGPKGDTG
AAGPAGPQGPKGETGAAGPVGATGPQGPKGDPGETQIRFRLGPGNIIETN
SNGWFPDTDGALITGLTFLAPKDATRVQGFFQHLQVRFGDGPWQDVKGLD
EVGSDTGRTGE
>ECs0283 putative tail fiber protein
MPFARYFCIFINVGLGEAAKRDVGTGENQIPDMASFASGNGWMKLPNGKI
LQYGRGAVTPTLSTQTMRITFSIPFPKKADCAMLTHSGDGGAPLGAGRGF
VMTAEGPTLTGFNSAYRTSSTSDTVSMNYSWWAVGE
>ECs4551 hypothetical protein
MVNDISANKILVWAAVAAANHKLPKYAEAILNVFPQIIPDKKDIAHLEFI
ILFGLNRKNDAVKALEDCMDDETSQLLYSLVHENGSGWVRGF
>ECs2245 hypothetical protein
MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAP
VRRGKLRRNVVVLSRRSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNA
FYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR
>ECs4291 hypothetical protein
MGTAAISHLRYDLNKYALSLRKTATLASTFFIESPLVRFEYLQEIENTIN
DITHRFNSSYDINEKARLINELKMESETARKEYQLFRQGNYDKYITTDIF
EEHGLIKYVNLSFDIVASVGQVVGGVGALKFGKVVHSNRIKGIGVTLVAH
GANNFYESLSPLFFNEYDSGPIRELYRVIAKKMGGDINSADYAYSIVDFS
ITAYGGYSGMKIVPKYNRLIRPSLGNRPGTGRLFHYTSVDFKNKFSLKPT
PLKIIQISSTVKKFKVTFYDENTNLRIKHGLNNHMPHQLLMNISGTTTGA
GIQ
>ECs0295 hypothetical protein
MDHTKHSILSSLQDKEDDVDELKYSAEDFDSLTVADLYDIEIAMQDFLND
INFENSKDNKVRFDEDTYDFNINGKRRGMFGKGTRAVMHAIFTICFAEFL
SRKGNPFIGFVVLDSPLVTHFDKDRGGSLSDVNSVSLSDSFYHALIKRDY
NFQIVILENKGPTFQIKINDANKIHNLNKNGSSGFYPV
>ECs1376 hypothetical protein
MQITEALISEPGEIRRFVQQAVDHWPNLLAFHFTLYSAEGIYGQQIQTFC
SSFHRRVHERITEHNHTVSPSAPVVLRWLREQHEGAQIRCLLLLSQTSIC
HPRVGVMADEECAQLVDLLQQTWSVISAGGQCRVERCFRVARPGSSGQYV
ALKTAVQSFMSQVIATIIR
>ECs2768 putative cell division inhibition protein
METLLPNVNTSEGCFDIGVLLSNKAFTEDAINMRKYEPYLLNDNSILSRI
ALIKLGIFGERQ
>ECs1511 hypothetical protein
METVSDALKALKRASSHVVAARLGISREEAVNELWKLKRRGEADNKGAIW
WLTQAGESEPVSPVPKVTAQMLTEAIEQHGPQTADELALMFGITSRRANS
SLAMAISKGCLIRVNQDGKFRYCIPGVDLPAEPKAASVAETEGKALPQPA
GVALPVQETAAQEEIKTEAVEDIVKLQQSFTEAKADDLILPSLHVANREL
RRAKSNVQKWERVCAALRELNKHRDILRDITATREQQR
>ECs1225 hypothetical protein
MSEKIAVVYIGPKPVKKDTITGSRTLFPRLEPVHVDSAMAWQLLGFPDVW
VRHEELDDVLKKQQQNEQLRQAQQAQERVLAALAEAENSFVVSVNGQEVD
LSKLTSARLATLCEAEELDIHKDPKETAEAFRIRVREAFRRRVAETEQHG
GTE
>ECs1221 putative portal protein
MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYD
GDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEP
DDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD
PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPG
MAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRER
RRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVG
RVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISR
AIPAQDEVNFRRIKLTWLLQAKRVIMDEDATQLSDNDLMEQIERPDGIIK
LNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG
QDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKK
RRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP
AFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALG
TPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHA
AAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQ
MLYTLQQRMNEMSL
>ECs5365 hypothetical protein
MLTGGHVEKYCELIRKRYAEIASGDLGYVPDALGCVLKVLNEMAADDALS
EAVREKAAYAAANLLVSDYVNE
>ECs1961 hypothetical protein
MTFKHYDVVRAASPSDLAEKLTHKLKEGWQPYGGPVAITPYTLMQAVAIE
GEPQVGPSSEPDWYYVIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLAR
RSTVTPGGAACRYNDIIPADHCLHDVQDMSTLNHPRADLSKGQYGCVGQG
LHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQDSARW
GVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDMSAATHAQQPALF
TAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYG
GYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGN
QVSSNRPTHFSSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPG
GNTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQS
NKTWSLTHPVDDAITLLTQGGRLNCKFRLSGALTNNQFGLGIYLYTDAPV
PDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQ
TLELVFTAGSATVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAY
GVEIESLVLEINAPASS
>ECs5457 hypothetical protein
METVLHALKAMGKANSVELAARLDISREEVLNELWELKKMALLIKRVTPG
FWLSKVKPG
>ECs2756 hypothetical protein
MTELTKEWLQNTITGIESSRDEIPFGLDEDQNNMLTALKIALASLASVSD
ERAAYELFMEKRFGESVDRRRAKNGDREYMAWDMALGWIIWCHRAAMLQA
GNFRENKGSSTNNFRIISETSTNSPAIPDEVLSAILKVARLRADFDASEV
DRRGIGSCLDEAEQELIVTINEYASQIAVEATQGENQ
>ECs4531 hypothetical protein
MKFIGKLLLYILIALLVVIAGLYFLLQTRWGAEHISAWVSENSDYHLAFG
AMDHRFSAPSHIVLENVTFGRDGQPATLVAKSVDIALSSRQLTEPRHVDT
ILLENGTLNLTDQTAPLPFKADRLQLRDMAFNSPNSEWKLSAQRVNGGVV
PWSPEAGKVLGTKAQIQFSAGSLSLNDVPATNVLIEGSIDNDRVTLTNLG
ADIARGTLTGNAQRNADGSWQVENLRMADIRLQSEKSLTDFFAPLRSVPS
LQIGRLEVIDARLQGPDWAVTDLDLSLRNMTFSKDDWQTQEGKLSMNASE
FIYGSLHLFDPIINAEFSPQGVALRQFTSRWEGGMVRTSGNWLRDGKTLI
LDDAAIAGLEYTLPKNWQQLWMETTPGWLNSLQLKRFSASRNLIIDIDPD
FPWQLTTLDGYGANLTLVTDHKWGVWSGSANLNAAAATFNRVDVRRPSLA
LTANSSTVNISELSAFTEKGILEATASVSQTPQRQTHISLNGRGVPVNIL
QQWGWPELPLTGDGNIQLTASGDIQANVPLKPTVSGQLHAVNAAKQQVTQ
TMNAGVVSSSEVTSTEPVQ
>ECs1936 hypothetical protein
MEIIMIAHHFGTDEIPRQCVTPGDYVLHEGRTYIASANNIKKRKLYIRNL
TTKTCITDRMIKVFLGRDGLPVKAESW
>ECs2819 his operon leader peptide
MTRVQFKQHHHHHHPD
>ECs3514 hypothetical protein
MKNWSVEHTREVKKWLDIDTYRGFEELPLIHLYHELLARTLFFKPYHEEF
EAEAVRLYIDRIFTGKPFLITEKHLGYLTREDTLYQPPHFFLTTTERLAQ
LSIVGLRNSLFFWDGSDEYSVNREFLDSPVSEVLPKLFRDTVMVEIDLAN
GTDEEIAESLKAALPQWRKVRGIEPDVTEAIRFGYGTIKKIINYRLLPMI
DILVWSKLYKVRISEDRLSRLLYTDDDDEINTRLNHQIRDTDKPLALKAA
SIPFIRQFNLFINKNSHLKKIRVSDVMKIADSD
>ECs5175 hypothetical protein
MFSRVLALLAVLLLSANTWAAIEINNHQARNMDDVQSLGVIYINHNFATE
SEARQALNEETDAQGATYYHVILMREPGSNGNMHASADIYR
>ECs1112 putative minor tail protein
MFLKTEQFEYNGVSVTLSELSALQRIEHLALLKRRAEQAESSGNLQVSVE
DLVRTGAFLVAMSLWHNHPQKTQSPSMNEAVMKIEQEVLTTWPADAIARA
EDVVLCLSGMSGAVRPDTDITEVAKNNTLTDDDFSAGKSSTAS
>ECs3509 hypothetical protein
MAKHINGYHEKKIVDVINTWSIDEKLTWNALCDRLVRVIGKRPSRQSLSS
HVRIAESFNVKKTTIKSGVIHTVKPANLKVASQRIKRLEAENEALKALNE
RYLEQFKVWLYNAHLKQITIEELNNPLPAKYL
>ECs0370 hypothetical protein
MSQSLFSQPLNVINVGIAMFSDDLKKQHVEVTQFDWTPPGQGNMQVVQAL
DNIADSPLADKIAAANQQALERIIQSHPVLIGFDQAINVVPGMTPKTILH
AGPPITWEKMCGAMKGAVTGALVFEGLAKDLDEAAELAASGEITFSPCHE
HDCVGSMAGVTSASMFMHIVKNKTYGNIAYTNMSEQMAKILRMGANDQSV
IDRLNWMRDVQGPILRDAMKIIGEIDLRLMLAQALHMGDECHNRNNAGTT
LLIQALTPGIIQAGYSVAQQREVFEFVASSDYFSGPTWMAMCKAAMDAAH
GIEYSTVVTTMARNGVEFGLRVSGLPGQWFTGPAQQVIGPMFAGYKPEDS
GLDIGDSAITETYGIGGFAMATAPAIVALVGGTVEEAIDFSRQMREITLG
ENPNVTIPLLGFMGVPSAVDITRVGSSGILPVINTAIAHKDAGVGMIGAG
IVHPPFACFEKAILGWCERYGV
>ECs1505 hypothetical protein
MTAGFNFNYAAGFCSATPALRGYEVSMDTLDLGNNESLVCGVFPNQDGTF
TAMTYTKSKTFKTESGARRWLARNSD
>ECs1080 prophage maintenance protein
MNGKSRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCE
VRVRTGQTEVAVFTAYEPEE
>ECs2740 hypothetical protein
MGKYKRILLRLTMKNGLELKAPVTDDVSRALAFAIKWVAVGIAVSPMLYG
LAKLVIALKS
>ECs0561 hypothetical protein
MLAISSNLSKMIIFIIAIIIIVVLCVITYLYLYKDESLVSKHYINYMAIP
ENDGVFTWLPDFFPHVAVDISIYTNVEDDYFFLIFP
>ECs1188 hypothetical protein
MTKQLSPYQDKIHKHILRDRFLSSFKQPGRFRAELEKVKLMQKEKGHE
>ECs0835 putative minor tail protein
MFLKQGTFNYEKQSVVLSELSGLQRIEYLAFVQQRTAKFDAEEGELPEAE
RQIAFLRMGMDINAWLVSRSLWNAEQSQDVETLCASVITTWSYDALGAGA
EMVLSLSGMGAIENAGDLEHEVLTPEKS
>ECs3815 hypothetical protein
MLNNVMKKKPVAQLERQHSLLENPCAYGLLSQFQAAIVVNCFTLNKII
>ECs2623 hypothetical protein
MKKNANNPYSKFRNGVERHVHHVATSASRSNSRYNLNETHATPDGHAVKQ
IGEHAWLIEKAGIVVHKCPRNPFTGNRIFALNCGDNHFGQDFTLYEALRT
VDRLLRGQSFIKQADL
>ECs2969 putative holin protein
MYQMEKITTGVSYTTSAVGTGYWFLQLLDRVSPSQWAAIGVLGSLLFGLL
TYLTNLYFKIKEDRRKAARGE
>ECs3705 hypothetical protein
MIIKKNLLFILVFISGFILFTVYSYTAEKMIYNETCTANWVIFNDQGRAN
LTIDFMYNKKNKTGTVALSGTWQQGNRESKSIRRNIEYTWVENYDTAHLT
SKKVNKFEIMDQVDDDRLAELIPDFYVFPEKSVSYNILKQGKHAFILSIG
NRAIMHCAR
>ECs2759 hypothetical protein
MSEINYQELRVELGAAKKRIEELEANRVVLEVENERLKHAMAVALEHISV
TDAGQAGVAAMIIYDAMYHSEKTDPYAFLAELRAQGVEMVREHPAIKLCS
LTHICDELVAELRKGGNK
>ECs1577 Icd-like protein
MVAVNHIPHLVHTQTAFVWRFLALSAGESQIIHVTAWTEREARSRCPSGC
VAVFAARIRQGVCHA
>ECs0604 hypothetical protein
MKEIDLLYENIYQLLIKPYLLDLSSQSGKKIELNYTCKIKDAADEIKGSM
IFNDVDGKQKATCTIRVLILKTFHDGRYRFVIESVIYDLINNYSGFILTG
RLFWQGEGFGHELFPVTNKYNAWRWKNKKIIDISW
>ECs3888 hypothetical protein
MNALSGLQTHEDSTCCNRFCRPDERSASGNSTLLRFGGFFADQTYLQVTR
LMQRVHYLHQRLVIDSFIRSEEDGGVFLAFG
>ECs2171 putative minor tail protein
MAIKGLDQAIENLSRVRKNAIPAASAMAINRVATTAINQSSSQVARETKV
RRKLVKERSRLKRATVRNPNARIIVKRGDLPVIKLGIRMLGRRPDSILKA
GQHRYQRAFIQRLKNGRWHVMQRVAGKNRYPIDVVKIPMAAPLKQAFDEN
IDRIRRERLPGELAYALKQQLRIAIKR
>ECs3010 hypothetical protein
MINRIKLEHILEYARQQRHIGQHCKIPPGDMVEIMEIAMRKAGNSPVTPD
GWISCSERMPNDKQYVWCWGKSYGWTECDTFEGYYDCSRNKWWAVTDNGE
EPASKVTHWMPLPEPPQEVK
>ECs4876 hypothetical protein
MPPALPLHTVRARIICGFRLVSETGSGSSSSDLTSASGASQWGNSSRSHW
LKRWCASCSSAPEISNWQLSPASVSVPISARALLALPLRSVRRLCHSICD
EKVLHSVSTWLAGRACTPLGSSQINSTIRAIASLPVRFSVYRIRAHGNGL
PCDKSAKSWTNVM
>ECs1781 hypothetical protein
MTFKHYDVVRAASPSDLAEKLTHKLKEGWQPFGSPVAITPYTLMQVITAE
GDVVVSGATEPDWYYVIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLAR
RSTVTPGGAACRYNDIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQG
LHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQDSARW
GVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSAATHAQQPALF
TAMLAQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYGTQYNTIYG
AYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGN
QVSSNRPTHFSSWARRSIIPDRMATAILNAAGRTSAFISGKAPEIKPSPG
GNTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQS
NKTWSLTHPVDDAITLLTQGGRLTCKFRLSGALTNNQFGLGIYLYTDAPV
PDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQ
TLELVFTAGSATVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAY
GVEIESLVLEINAPASS
>ECs1589 hypothetical protein
MAHYRVFVTVRHSYYLKQTDTTEEKTMPMKFDEILKQRDKYHADNMETMS
INDYRAFLETGALIEKDQHGFVRCALSGEMLAVNPEQIDALIEFLKGIRD
>ECs1235 hypothetical protein
MYGLSIMKPDGSVWISPGFTPQCLINKGTIPATEKSFFKTSIPSGKSCFF
FIRTEKKADVMYTHEQIDGYHALRLHVIVRGTNPGVTTVYAFANMVTPPS
EYGIAMYNPDGEMIYHGEMMLLDAKLIPVDIKFEKDLGYPCAIMPALVGY
YNWKRTPYDRPIYTTSTGATGNKIYSCEHYSGGATWDIRKPYIDKVLVIN
TSVYD
>ECs2802 putative antirestriction protein
MKTVSQNTPTIYSATTPENNPPQLVASLVPDEQRISFWPQHFGLIPQWVT
LEPRVFGWMDRLCEDYCGGIWNLYTLNNGGAFMAPEPDDDDDETWVLFNA
MNGNRAEMSPEAAGIAACLMTYSHHACRTENYAMTVHYYRLRDYALQHPE
GSAIMRIID
>ECs2178 putative head-to-tail joining protein
MTRQEELAAARAALHDLMTGKRVATVQKDGRRVEFTTTSVSDLKKYIAEL
EVQTGMTQRRRGPAGFYV
>ECs3487 hypothetical protein
MPKISSVVSSCYHLFSEHQQLSNETTMTNPVSRRIVHKEYGISLKSVPVW
LATAKTPLALLNGRHTRSHSFIIAGTPGMGSRSGAQYYAINSDDKRSRID
IDSLFLKKLNNVRNQNKFPIDVKETVIKLQGQKFTCIEDFYKRYNETRLK
ANTNIQQEQIADEVKSLTYLIPSEKKEMWIYKNNGKDNAKPNLGERDVRM
FENISSDDTDKITGRKFSELGEYLYSGNVIKLSQLSIRYLPNISSISLIE
TKQSLLLHRLYSDEVLQRNGTLIPTPLHEEKSIPADNIKTMLNNIPTYKM
LPPFTETQGNCSSGAATFLRKSGAEEKDILACSPRNYGLHHNIKTWDPLV
RN
>ECs0836 putative minor tail protein
MQFVMRLAREFRRADWRRMLSEMSATELGEWGDYFRMQSFSDVWMDAQFA
SLKALIVRMVSGSSDAAVADFSLLPEENGIPERTDEELMHLGEGISGGVR
YGPDSQPGH
>ECs3005 hypothetical protein
MHFSGSRLHILCAYACRHGTCSMTPQQENALRSIARQANSEIKKARQQFP
DKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER
>ECs1760 hypothetical protein
MNTAFALVLTVFLVSGEPVDIAVSVHRTMQECVTAATEQKIPGNCYPVDK
VIHQDNNEIPAGL
>ECs0808 hypothetical protein
MKHPHDNIRVGTITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRE
AICTKHLLLS
>ECs3653 SecY interacting protein Syd
MDDLTAQALKDFTARYCDAWHEEHKSWPLSEELYGVPSPCIISTTEDAVY
WQPQPFTGEQNVNAVERAFDIVIQPTIHTFYTTQFAGDMHAQFGDIKLTL
LQTWSEDDFRRVQENLIGHLVTQKRLKLPPTLFIATLEEELEVISVCNLS
GEVCKETLGTRKRTHLAPNLAEFLNQLKPLL
>ECs2396 hypothetical protein
MFISCSHYTMNAYELQALRHIFAMTIDECATWIAQTGDSESWRQWENGRC
AIPDCVVEQLLAMRQQRKKHLHAIIEKINNRIGNNTMRFFPDLTAFQQVY
PDGNFIDWKIYQSVAAELYAHDLERLC
>ECs1125 hypothetical protein
MPEEYVQPYSGLVECHALQDAAPDAELHQVEADVKRFFLYLTEYKMWDVE
VLILYKNFLP
>ECs1942 regulatory protein
MNQKTLEDVIKTVRVSVVADVCGVSQRAIYKWMDNGKLPRTEYTGETNYA
EKIAHASNGLFSADAILTIGRNKTTTKKLMGVDS
>ECs1762 hypothetical protein
MLKSTLIAKCLYQNRMVSSISIGESAVKSIFEEYFPGHDFNKWNTKLPPA
VSTRILKATERASTIRVNYFIKDLWDL
>ECs1400 hypothetical protein
MGEMNEVYCLRSPSLANHVLSCIPASPVQLTFSIADNPPEICSLVVFVSS
PKIK
>ECs1405 hypothetical protein
MSDKLSGITHPDDNHDRPWWGLPCTVRPCFGARLVQEGNRLHYLADRAGI
RGRFSDADAYHLDQAFPLLMKQLELMLTSGELNPRYQHTVTLNAKGLTCE
ADTLGSCGYVYLAVYPTPAPATTS
>ECs2173 putative DNA-packaging protein
MTKDELIARLRSLGEQLNRDVSLTGTKEELALRVAELEEELDDTDETAGQ
DTPLSRENVLTGHENEVGSAQPDTVILDTSELVTVVALVKLHTDALHVTR
DEPVAFVLPGTAFRVSAGVAAEMTERGLARMQ
>ECs0815 anti-termination protein
MNTQYLQYVREQLMAATADLNGATKGQLEAWQEHAQFDTGTYKRKKPRIL
DVVTGKMITLDNTPTSGKQSYAKGSSIALVSPVEFSTSSWRRAVLSLDEH
QKAWLLWCYSESVRWGHQVTITQWAWSEFKDLLSNRKIAGKTLDRLKTLI
WLAAQDVKSELAGREAYEYQTLASLVGVTTKNWSETFTERWVAMKHIFLQ
LDSDALLLVTRTRSKQKAAFLQQNIAKLD
>ECs3257 hypothetical protein
MFYLWMFLALCIVCVSGYIGQVLNVVSAVSSFFGMVILAALIYYFTMWLT
GGNELVTGIFMFLAPACGLMIRFMVGYGRR
>ECs5591 hypothetical protein
MKEIAIRKAPFTEHQIIAVIKSLEYGRTVKDVAYLKPLTTTASPDTAVWN
FLILK
>ECs1506 putative phage repressor
MLSGKDLGRAIEQAINKKIASGAVKSKAEIARHFKVQPPSIHDWIKKGSI
SKDKLPELWRFFSDVVGPEHWGLNEYPIPTPSTSDTKSELLDINSLYQAA
SDEKRAIVAFLLSGNATEPSWVDHDVRAYIAAMEMKVANYLKNQESKRKS
QNITKTGT
>ECs3613 hypothetical protein
MSIVKEEHKATLRKWHEELQEKRGERASLRRSTTVNDVCLTDGFRLFLKN
RQIKWQDEPEWRITALALIAALSANVKAIDERLPFAAQLAAVMSKGRFTR
LSAVKTPDELLRQLRRVVRLLNGSVNLDSLAEGVFRWCQESDDLLNHHRR
HQRPTEFIRIRWALEYYQAGDVDNEQNQ
>ECs2910 putative outer membrane protein
MKKLLVTVIFVAMMPNAFALSHGDSATLKEGTFNCKKLTDFYEMISYIQD
NDQQAMLSLITSNKCRVLDESMTVEIQSVDDKGFVSFITPGGHGGWAVKQ
FFEN
>ECs2352 hypothetical protein
MKFMLNATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYA
GDIWHPLLMDLSLFAICVCLALAILIAW
>ECs5251 hypothetical protein
MSKTRLSKPGFSAKSVLTIMLIAASSASYAAESIDQLATRIGMDHFREAI
KKGERSAFKTLPAKGYKNGLTGLLIKTNTYVNPEYSFADCDTDTVIISVQ
EHKTLPGCRISLKLATKDADGSYHDNGIRVEYGIAIGSKKFIANNSAWAQ
HAISGDKVLEGALHPDKTNNNKDQMNNNKDQMCMGMAQFTANVTEGRNSG
MTKREYYQRLDNMNGKADVTQQMITLYKTFVDYVYEHPKQDAKISGTYIY
TNCMNNF
>ECs2993 putative regulatory protein
MVTIVWKESKGTAKSRYKARRAELIAERRSNEALARKIALKLSGCVRADK
AASLGSLRCKKADECSGSICLPNVAIYAAGYRKSKQLTAR
>ECs2231 putative tail fiber protein
MAVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYS
MDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRP
EALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAANADT
SAGDASESARQAAESAAAAKQSEEASSSSASAAAQKASESSQSAADAELS
KKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAEEAVNRIPTVV
GPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGERGPAGDAGPAGPAGP
AGPQGPKGETGAAGPVGATGPQGPKGDPGETQIRFRLGPGNIIETNSHGW
FPDTDGALITGLTFLDPKDATRVQGFFQHLQVRFGDGPWQDVKGLDEVGS
DTGRTGE
>ECs0428 hypothetical protein
MSRVNHLSSLSFLAVLVLAGCSSQAPQPLKKGEKAIDVASVVRQKMPASV
KDRDAWAKDLATTFESQGLAPTLENVCSVLAVAQQESNYQADPAVPGLSK
IAWQEIDRRAERMHIPAFLVHTALKIKSPNGKSYSERLDSVRTEKQLSGI
FDDLISMVPMGQTLFGSLNPVRTGGPMQVSIAFAEQHTKGYPWKMDGTVR
QEVFSRRGGLWFGTYHLLNYPASYSAPIYRFADFNAGWYASRNAAFQNAV
SKASGVKLALDGDLIRYDSKEPGKTELATRKLAAKLGMSDSEIRRQLEKG
DSFSFEETALYKKVYQLAEAKTGKSLPREMLPGIQLESPKITRNLTTAWF
AKRVDERRARCMKQ
>ECs2270 hypothetical protein
MLTGAFLCLPYGGMPEADSLKHPQQFYLTLALPQTVFTRYGNSHIVMNSV
P
>ECs0595 SfmH
MACLCLANISWATVCANSTGVAEDEHYDLSNVFNSTNNQPGQIVVLPEKS
GWVGVSAICPPGTLVNYTYRSYVTNFIVQETIDNYKYMQLNDYLLGAMSL
VDSVMDIQFPPQNYIRMGTDPNVSQNLPFGVMDSRLIFRLKVIRPFINMV
EIPRQVMFTVYVTSTPYDPLVTPVYTISFGGRVEVPQNCELNAGQIVEFD
FGDIGASLFSAAGPGNRPAGVMPQTKSIAVKCTNVAAQAYLTMRLEASAV
SGQAMVSDNQDLGFIVADQNDTPITPNDLNSVIPFRLDAAAAANVTLRAW
PISITGQKPTEGPFSALGYLRVDYQ
>ECs4657 hypothetical protein
MSVGNEEKYSITHKDITNLIAGDRQSFLKLTLWEFILDLFPMYHIKEAKE
TIVEFVTKTEDKVKRFERIKSLAIPEEKWRFSTTTEFTVDENKDIIVSRV
FKININSKDNHNNAEENTMRSVLSEERNYNSYKHDIHFDEKAFQSGILKM
QFPNDNEKLSVYLKNKIPFENMIIDLSALKKDVLVSLKQCCFKKMTFTGN
ISYENLNGPVFENCFFEECNFESVSLVGFDKVTCESYDVLCNVKPNNKIP
IYGMFKGCFLYQCEMKNFKIETSKIYSINQDPQRGDKKVGAYLFMQSFVY
ASNLQDGVCKEASVIASSLLSCNIAKLNGVGMDFIETSFYGKTRGGFRDS
NNFYDCDFRYVNMTCKRDGIREDLRDYDPRKYFGESENKNKLNLDKALLN
FIHDDIFQNIKTHYNIKDDIDKHLNLNHCCLYAACLGELTGGNPLYDCAI
DTHTTILSGNLAANTQVKIPVYMYHRGAIAEKDILPFMEKVTNIYSRIED
IYKYSNNMKNNYKKALADLQSLLVNLHNIMPGYDVKDFQLNDETIKYIID
NVLYNRKKDIASEIDTDKVNFMHKIYNNLYWYRHYDENGNEKTYRTANPN
GNRFYFDFAGLQVLYYDYENKNKPTKEKFLEKQNLFLTVKNFKAEEKINQ
EKYIEKPANVAHEIASFFDIEQYLLGGIDRNNNYINENFTYKFQDLANLS
GTRDAQLERKVYFNEKMLNLNDNDEKVKIENDIKKIDKKIEEIDNAIRDL
SSETERFKIKYQQENTLNDVVQNIRTWLKENQNMVKAIKKIDTE
>ECs1525 hypothetical protein
MLVLNSPGGTRHHALPQKCYFCFSQTIIVIPLFPAAHGAAFFLRPATGRW
SSCDLIPVPAF
>ECs4737 hypothetical protein
MVAALFGCQPYLVQRFLAVDNDFAAILKGNGQHAAVDFAVDIAVAIPVVQ
TLFNGQPQLISQAMKFTVVHCCILFLSDGGSIAAWGQGFKVCTSAAALRH
DSSVTGIMMSLCSERFVSSLGRSYTTPFPVSGKHYDQVYRASILTLTELK
KPTNKPHCDSA
>ECs1098 hypothetical protein
MLLVASLLSAIRRSNSDLTFSVISCKNLLRKLSLLFIYMASFVVICCVSR
DMFMRATMSVGLYT
>ECs5452 hypothetical protein
MSPSPPTENESKEKQKSRQCHPLTANAGSRDYGIQALLKMPDNIPATP
>ECs4324 hypothetical lipoprotein
MKWFAAIAVVGALAGCARTAPIDQVHSTVSAGHTQEQVKKAILKAGVERK
WIMSEAGQGIIKARQQSRDHSAEIRINYTASSYDINYENSQNLQASGGQI
HKNYNRWVRNLDKDIQLNLSAGADL
>ECs1146 Sfa
MHRWISQNNIRLPCGAFFISVLFFFNAVCIVSDNLLIIESFGEMAYNISY
LTRVPGTNTLLACCCLLRPEEVNSEY
>ECs0519 hypothetical protein
MSLENAPDDVKLAVDLIVLLEENQIPARTVLRALDIVKRDYEKKLQSDEA
SQSE
>ECs1100 putative holin protein
MEKITTGVSYTTSAVGTEYWLLQLLDKVSPSQWVAIGVLGSLLFGLLTYL
TNLYFKIREDRRKVARGE
>ECs2643 putative tail protein
MKETKNIDTENTVVTDTVKETSERGVKLTQPIERGGEKITYVEITGAIEQ
AGSLRDLSLSDVLNLKAESMFTLLSRVTSPRLDEVTIKKMASRDFIQLCV
VAVNFLSGADSGGKNEQATEA
>ECs2392 hypothetical protein
MTHIAFGHYLHSKRWYTVQVLSKSEDAMSTQLDPTQLAIEFLRRDQSNLS
PAQYLKRLKQLELEFADLLTLSSAELKEEIYFAWRLGVH
>ECs1347 hypothetical protein
MPFHSEFLEKIKMQPHETFTGSYQPGDVEFLLKPVVIEMTPVEKKEELIQ
SGKKHYSDMLSQEPAPTQWHLDLFHRALDRGAERLAKEVTQLAIALAERF
GDEPIVLASLVRAGVPLGRYAAPGPA
>ECs1539 hypothetical protein
MSSKNRPRRTTIRNIRFPNHMIEQINIALEHKGSGNFSAWVIEACRRRLS
TERSGMNYIIK
>ECs5497 hypothetical protein
MSHAVDGTYIKAQQDKMVNSSLCKKSSVDTYFNVMEVNLIIYNDDVIMMN
>ECs5428 hypothetical protein
MALYIDISAIAGQVRVIRAVTKRYAPLLQKVSGECTEDIVNDFVIELRGL
IFSYKVTTIFADGSRETVRALRLKGCVKDLATTFWARKLDCIHNQFPLE
>ECs3274 hypothetical protein
MFKERMTPDELARLTGYSRQTINKWVRKEGWTTSPKPGVQGGKARLVHVN
EQVREYIRNAERPEGQGEAPALSGDAPLEVLLVTLAKEMTPVEQKQFTSL
LLREGIIGLLQRLGIRDSK
>ECs0511 hypothetical protein
MTEIQRLLTETIESLNTREKRDNKPRFSISFIRKHPGLFIGMYVAFFATL
AVMLQSETLSGSVWLLVVLFILLNGFFFFDVYPRYRYEDIDVLDFRVCYN
GEWYNTRFVPAALVEAILNSPRVADVHKEQLQKMIVRKGELSFYDIFTLA
RAESTS
>ECs4814 hypothetical protein
MKLIFHHRPVSTATNRSLLAHRYNFVGNSSGNKEFLPEHPRHHPIALTSI
IGSTVSFFNRIIGWQRSARYGS
>ECs4220 hypothetical protein
MNKFIKVALVGAVLATLTACTGHIENRDKNCSYDYLLHPAISISKIIGGC
GPTAQ
>ECs4813 hypothetical protein
MLCAVQKNLHAARKVLSGIIGNVPKNNKPKTIIFDLRFLSAIKQKVFLGS
EQPLFVKHTAMMREACKELPQLVEYVPLACEAHSHRSVALAKTVAAPIMA
TGVGAALMYATSSGPYYAQSLSGSPDIEMIKKRMAELFNQSDTSVRNKYR
PAFDKWNKLVDKLNHERNTVPFACLTTIIATAINETPEGDTNVAVVMGCK
SAKDRTISIVLGNSMLQTLFEKRLADDGEIENLFDQQGYFSCDSLTAEEV
MMLKDLFDIRVLHLSNKFNVGLQGNINTDVLQDSFFKNVDFIHESNSYTV
GMGV
>ECs2629 hypothetical protein
MSTINHQKLRELAFALQRMATPQKLLAFRAMLSPSAVLALLDQLEHARTT
APAIRLTLHHEIADFCATLGAPGEPETPEAMQQELLQRIDNVFDFFLNQ
>ECs0846 hypothetical protein
MLSPIRTTFHNSVNIVQSSPCQTVSFAGKEYELKVIDEKTPILFQWFEPN
PERYKKDEVPIVNTKQHPYLDNVTNAARIESDRMIGIFVDGDFSVNQKTA
FSKLERDFENVMIIYREDVDFSMYDRKLSDIYHDIICEQRLRTEDKRDEY
LLNLLEKELREISKAQDSLISMYAKKRNHAWFDFFRNLALLKAGEIFRCT
YNTKNHGISFGEGCIYLDMDMILTGKLGTIYAPDGISMHVDRRNDSVNIE
NSAIIVNRSNHPALLEGLSFMHSKVDAHPYYDGLGKGVKKYFNFTPLHNY
NHFCDFIEFNHPNIIMNTSQYTCSSW
>ECs4991 putative tail fiber protein
MADSRYVQSIRRGSRSTIGMQYNIFEVPDGCVLTGLDVAGDGNATVTAYY
RPVQFLIDGSWKTASSA
>ECs4593 hypothetical protein
MARMAQFAPRRFFLAQTGELNIFNKQQANHCNTQHPYRQPEQRVVAPDFN
HNFVNRAGDNLAEA
>ECs2204 hypothetical protein
MSEEILMETVFDALKALKKASSQVVASRLGISREDAVNELWKLKRRGEAD
NKGSMWWLTQTGESEPVSPVPKVTAQMLTEAIEHHGPQTADELALMFGIT
SRRANSSLAMAISKGRLIRVNQGGKFRYCIPGADLPAEPKAASITETDGK
AFPQPAGVALPVGEAETQEEIKTESVAVTVQSQPSFTRKHPDGLILPSLH
VANRELRRAKGQVQKWERVCAALRELNKCRDILRDITATREQQR
>ECs1758 excisionase
MARLILLTEWAKEEFSEPVPTPSTLSKYAKAGMIFPLPKKVGRRWRVDPQ
ARFVGMVNKPEVIATDHPALKRILEDGAPAKI
>ECs3681 hypothetical protein
MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAI
VQSALAWGKMHSWQTQPAVQCLLYAATGARVCLRLLEDNEALLIAGYEGV
SLWRTGEVIDGKIVFSPRGWSDFCPLKEGALCQLP
>ECs4538 hypothetical protein
MLTSGELNPRHQHTVTLYAKGLTCKADTLGSRGYVYMAVYPTPETKK
>ECs1124 hypothetical protein
MNILKKIMQRLCGCGKHDDCEHGQSLTVQLRLGPADILESDENGIIPEQD
GVITQVVILDADKKQIQCVVRPLQILRADGRWENIGGMK
>ECs1168 hypothetical protein
MLRPTLIANMHRLTKSLWRFTSGFGNKVKPSAPLAFASKEVSNACGMYRL
VTVSDSFLGFCANCKEHHPDTKAPPYRYCSGTTSSKEMQESKTMKNRKAK
ILLVRRNAPGVWQWVRLSNRRMGLMKYYGMMDCGFCKKPSAEQNRWKNHL
RTKGE
>ECs1073 hypothetical protein
MRDYAKVSPRFWLGETGRELRKAGAEAQVVAFYLMTSPHANMLGLYYLPV
LYLAHETGLGLEGASKGLKRAVEAGFCSYDHDAEMVWVHEMAAWQVGETL
KPGDNRCAGVRNEYASLPENAFLSVFYDRYKTDFHLDVRRNNSRNSVRGF
EGAFKGLRSQEQEQEQEKEQEQDKNTMVHGKKNTTNQAGDVQTVNPGQPA
GTTPEADSGAVQQVMTAGSEQSHQLQQPEADSAIQREADRVVPESTGQSV
GRVDYPDVFEQVWREYPLRAGANPKKSAFSAWKARLREGVPPETMLDGVR
RYARYLAATGKAGTEFVQRATTFFGPDRNFENPWLLPVSGTNNQRCVNHI
SEPDTEIPPGFRG
>ECs2977 hypothetical protein
MAKPARRKCKICKEWFHPAFSNQWWCCPEHGTQLALKLQSKQRKKAEKAA
EKKRRREEQKQKDKLKIRKLALKPRSYWIKQAQQAVNAFIRERDRDLPCI
SCGTLMSAQWDAGHYRTTAAAPQLRFDERNIHKQCVVCNQHKSGNLVPYR
VELINRIGQEAVDEIESNHNRHRWTIEECRAIKAKYQQKLKDLRNSRSEA
A
>ECs2635 putative plasmid partition protein
MTTPTRRISFYLKPAAVKSEQEACNYLDSLPASERSRAQRAAFLAGLALI
KRNPAFAYWMAEWPEDNLPFNTVSIKETYQANQPSGVECNFYKIKKNIQT
LFPE
>ECs3138 hypothetical protein
MKKIALAGLAGMLLVSASVNAMSISGQAGKEYTNIGVGFGTESTGLALSG
NWTHNDDDGDVAGVGLGLNLPLGPLMATVGGKGVYTNPNYGDEGYAAAVG
GGLQWKIGNSFRLFGEYYYSPDSLSSGIKSYEEANAGARYTIMRPVSIEA
GYRYLNLSGKDGNRDNAVADGPYVGVNASF
>ECs1533 antirepressor protein
MNMMAVPFHGNSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKLRQRFAS
TITEIVMVAEDGKQRNMVSMPLRKLAGWLQTINPNKVKPEIRDKVIRYQE
ECDDVLYEYWTKGFVVNPRKMSVMEELNQACADMKRDKNIASVFATGLNE
WKQVKAAHVSKIRTLVNEANMLIDFVLADTGKGKITKAD
>ECs5314 hypothetical protein
MLNRDFDHLSTLLHLFFEREDICQKLNIIDQNNITGWEVWFQVEFANMLC
STDHEWWREQALSCDMRKKPERPTLRTDFLLRKKGWAQDSYIALEIKQNR
DATSCVKNMIADLEKSAKIKRSELDLRSFWTLGITPMIDRERLDSLIDCY
LDEKFYLTKSRKKHVLLKEINNTPYCYIVI
>ECs5030 hypothetical protein
MLKIIPGATGYFNKTLNSNQFDNEDAIKGKVNSIYGKSIDYSTMRHRDII
IAKIDLFIQRITHNLWTAREKNVTLIKQINDLKMCVNEYIGDCTDEELND
REFIASVVDRAIFHFAINSICNPEDNKDATLIERCTFDVETKNGLPSTVQ
LFYEESKDNEPLANIHLQAIGSGFLTFVNACQEHDDNSLKLFASLLISLS
YSSVYSDFAGRVNINEYNDNYLKAQFEELSQRDMKKYLGEMKRLADGGEM
NFDGYLDKMSHLVNEGTLAPDILSKMRDAAPKLIDFAKSFDPNSKEKI
>ECs2772 hypothetical protein
MSSIFLTENELQTLTGCKYASHQRNWLIKNGLPFYTNRSGKPIVSRELFT
CIKTLPPREAEPDFGAI
>ECs3488 hypothetical protein
MERRAVALERQLNGGVDFLRSVNNYFQSVMAEHRENKTSNKILMEKINSC
VFGTDSNHFSCPESFLTCPITLDTPANGVFMRNSQGAEICSLYDKDTLVQ
LVETGGAHPLSREPITESMIMRKDECHFDSKKESFVASDA
>ECs1776 hypothetical protein
MRVLLRPVPVPELGVVVLKPGRESMQVFHNPRVLVEPEPKSMRGLPSGIV
PAVRQPLAEDKSLLPFFSNERVIRAVGGAGALSDWLLRHVKSCQWPHGDY
HHSETVIHRYGTGAMVLCWHCDNQLRDQTSESLEQLAHQNLSAWMIDVIG
HAISGTQERELSLAELSWWAVRNQVADALPEAVLRRSLGLRAEKIRSMYR
ESDIVPGEQTATIILKQRTKNLAPLPHAHQQNPPQEKTVVSIAVDPESPE
SFMKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKS
HDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFLDHAFATGVLG
>ECs4234 hypothetical protein
MRVKRWLLAGIALCLLTGMRDPFKPPEDLCRISELSQWRYQGMVGRGERI
IGVIKDGQKKWRRVQQNDVLENGWTILQLTLDVLTLGTGTNCEPPQWLWQ
RQGDTNEAMDSRTTVDADTRRTGGKAAKSDADGG
>ECs1401 hypothetical protein
MRLASRFGYANQIRRDRPLTHEELMRHVPSIFGENRHTSRSEHYAYIPTI
TVLENLQQEGFQPFFACQTRVRDQSRREYTKHMLRLRRAGQITGQHVPEI
ILLNSHDGSSSYQMLPGYFRAICTNGLVCGQSLGELRVPHRGNVVDRVIE
GAYEVVGVFDRIEEKRDAMQSLVLPPPARQALAQAALTYRYGDEHQPVTT
ADILTPRRREDYGKDLWSAYQTIQENMLKGGISGRSAKGKRIHTRAIHSI
DTDIKLNRALWVMAETLLESMR
>ECs3961 hypothetical protein
MKLITAPCRALLALPFCYAFSAAGEEARPAEHDDTKTPAITSTSSPSFRF
YGELGVGGYMDLEGENKHKYSDGTYIEGGLEMKYGSWFGLIYGEGWTVQA
DHDGNAWVPDHSWGGFEGGINRFYGGYRTNDGTEIMLSLRQDSSLDDLQW
WGDFTPDLGYVIPNTRDIMTALKVQNLSGNFRYSVTATPAGHHDESKAWL
HFGKYDRYDDKYTYPAMMNGYIQYDLAEGITWMNGLEITDGTGQLYLTGL
LTPNFAARAWHHTGRADGLDVPGSESGMMVSAMYEALKGVYLSTAYTYAK
HRPDHADDETTSFMQFGIWYEYGGGRFATAFDSRFYMKNASNDPSDQIFL
MQYFYW
>ECs4111 hypothetical protein
MGHETKAQLKDYVEVKIMKIKTTVAALSVLSVLSFGAFAADSIDAAQAQN
REAIGTVSVSGVASSPMDMREMLNKKAEEKGATAYQITEARSGDTWHATA
ELYK
>ECs3366 hypothetical protein
MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEA
QQSTLSVESPVQR
>ECs3228 hypothetical protein
MLSSISINNTSAAYPESINENNNDEVNGLVQEFKSLFNGKEGISTCIKHL
LELIKNAIRGNDDPDRININNSSVTYINIGSNDTDHITIGIDNQEPIKLP
ASYKDKELVRTIINDNIVEKTHDDINNAEFTDGGIDDRSDIGIDSNDTDD
IDIDNQEPIKSPANDENKKVGQTIINENIVENTHDINNKKMIFSAIKEIY
DEDPSFIFDKISHKLRHTVTEFDESGKSEPTGLFTWYGKDKKGDSLAIVI
KNKNGNDYLSLGYYDQDGYHIQKGFRIDGDNLTQYCSKKARSASAWFEHS
KAIIAESFATGSDHQVVNELKGERLRELNEAFNPSNLARGYEY
>ECs2730 putative major head protein
MGLFTTRQLLGYTEQKVKFRALFLELFFRRTVNFHTEEVMLDKITGKTPV
AAYVSPVVEGKVLRHRGGETRVLRPGYVKPKHEFPWSR
>ECs1377 hypothetical protein
MRTRIDYLADKYCFTERNESPRLRRQWQDVLEECRQTEAGPEERLRIALL
NVDYVTSFELPFRLLLTRTPQLIAALREEWGISQKNVVFNDKRFGCVYSL
KASLSGVPDTYRYHLSHRIRRVVGNENTSLPYQQVAREVKAPRERLKYAL
EAGLLVTALDGLFRFGSQRIAADVLRLRKAGMPVVTTTVEVHDNLTGTTR
KVPAYHL
>ECs0701 hypothetical protein
MSTSHSTLSERRKAKSSPKRAFLMWLGYEDSNLGMPESESGALPLGDTPT
GCTYKVSVLNKLAGVRGFEPRNAGIRIRCLTAWRYPNKLVFEFAERIRHI
QNLVATTGFEPVTPSL
>ECs0810 NinE
MRRQRRSFTDIICENCKYLPTKRSRNKRKPIPKESDVKTFNYTAHLWDIR
WLRHCARKTR
>ECs4270 hypothetical membrane protein
MNIIRLYIYPLFLGFVIVPLLVWPTVVALAACVITLSFLGEILFSIPLMH
RRISLLQLQLWLSAEYALFFCAMVGVGWQFARRTPPELKSRLHCWLVFAP
VYFWLLLWNIIFYVAPAQIALLENMRSFFLTVVWLPLNFSPFWSQQWIDF
VGPISAQLGFALGYYCQWRSKNRSQIRKWSEWGTCLSFVTLGMMAWALR
>ECs1638 minor tail protein
MAIKGLEQAVENLSRISKTAVPGASAMAINRVASSAISQSASQVARETRV
RRKLVKERARLKRATVKNPQARIKVNRGDLPVIKLGNARVVLSRRRRRKK
GQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVV
KIPMAVPLTTAFKQNIERIRRERLPKELGYALQHQLRMVIKR
>ECs5487 hypothetical protein
MRKSEKLAEDRVSVWLTAVMHYSRMQRKARSIPVSSLWFQGYSALAGFNA
YVFSEGNRQSGEFAWNWRRLSMM
>ECs0366 hypothetical protein
MVWAFFAALAAGWLASVSGLSAFWASVITTVPFSAVVVWQGRFWLLSFIP
GGFLGMTLFFASGMNWTVTLLGFLAGNCVGIISEYGGQKLSEATTKRDGY
>ECs1065 hypothetical protein
MDTIDLGNNESLVYGVFPNQDGTFTAMTYTKSKTFKTENGARRWLERNSG
E
>ECs4235 hypothetical protein
MNMFFDWWFATSPRLRQFCWAFWLLMLVTLIFLSSTHHEERDALIRLRAS
HHQHWATLYRLVDTTPFSEEKTLPFSPLDFQLPGAQLVFWHPSAQGGELA
LKTLWEAVPSAFTRLAERNVSVSRFSLSVEGDDLLFTLQLETPHEG
>ECs4418 hypothetical protein
MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVF
AAFLLMPIPRYSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAG
FSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITVFVVAILLWLN
VLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP
PTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDI
EAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ
PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQTELMDQT
NLPVILLGFDGSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNH
YPGVSKTADYKARAQKFFDELDAFFTELEKSGRKVMVVVVPEHGGALKGD
RMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIDQPSSFLAISDL
VVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLN
GGDWVPYPQ
>ECs1537 hypothetical protein
MSDLIIHARYCRRIPALTETAARPGESSARVCGDNQKRYTPGFTALTERG
VVPSWSLVRCDGGRSRMFITIN
>ECs4631 putative replicase
MKLNFKGFFKAASLFPLALMLSGCISYALVSHTAKGSSGKYQSQSDTITG
LSQAKDSNGTKGYVFVGESLDYLITDGADDIVKMLNDPALNRHNIQVAND
ARFVLNAGKKKFTGTISLYYYWNNEEEKALATHYGFACGVQHCTRSLENL
KGTIHEKNKNMDYSKVMAFYHPFKVRFYEYYSPRGIPDGVSAALLPVTVT
LDIITAPLQFLVVYAVNQ
>ECs3120 InaA protein
MAVSAKHDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMT
HHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRA
LLVTEDMAGFISIADWYAQHAVTPYSDEVRQAMLKAVALAFKKMHSVNRQ
HGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEP
IPKADWEQVKAYYYAM
>ECs2630 hypothetical protein
MHTQKNRLPCRNRSGYISAAPHKTGAGILNPIQSKAHNRASGFFVRTVLP
RLFRVRIMAGRTGPTSVGPDSLLSGVENPVRLASPRFSTLDGELFLSPSK
EATPWQTANSTALSRSVVTSRLKSTADFPAHHASRKSCTSICCMSAATHY
QTFIPPLFSAIWRMICTSFNSSSSSKTNSINSCSGPFLHLAAGGLRTSVT
RGLPQ
>ECs1932 restriction alleviation and modification enhancement protein
MRYDNVKPCPFCGCPSVTVKAISGYYRAKCNGCESRTGYGGSEKEALER
>ECs1101 hypothetical protein
MATQTEVARHLSLTDRQLRRLQKLPGAPISNKRGQLDLDAWRDFYISYLR
RSKNDVPDGDSEDDYEEKLLIARWELTAEQAVTQQLKKRTA
>ECs4146 hypothetical protein
MNQAIQFPDREEWVENKKCVCFPALVNGMQLTCAISGDSLAYRFTGDTPE
QWLASFRQHRWDLEEEAENLIQEQSEDDQGWVWLP
>ECs1379 hypothetical protein
MLLSGSVIAGRDFVLFCFFPGFGFEFSQKVVHRSHAGMRIAVLHKHHPGF
LNTGQGDKRTMIVFVSLIDFR
>ECs2536 hypothetical protein
MKKFRWVVLVVVVLACLLLWAQVFNMMCDQDVQFFSGICAINQFIPW
>ECs1531 hypothetical protein
MKKKYELVVKGINNYPDKITVTVALEIGGYPSLLLPDVAISLDRTEGATL
EFYEAEAKKQAKQFFMDVAAGLCEGDGPLPEKRPVILEAQDVLITYRGKL
PGIITGSLKTPPLA
>ECs5539 hypothetical protein
MYFVHIIIFWAIFNKVNSIVFASKRDNGGIPYFWIYPMMALLHSGFKLNF
IYDIFIGYMKPSVEPVP
>ECs1527 hypothetical protein
MSIKHYDVVRAASPSDLAEKLTHKLKEGWQPFGSPVAITPYTLMQAIAAE
GDVTTPVVVPGTGDGGYPGVVTTEPDYYYVIPLAGQSNGMAYGEGLPLPQ
TYDRPDPRIKQLARRSTVTPDGAPCKYNDIIPADHCLHDVQDMSRLNHPK
ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGT
YSDVTGASESSTRWGVGRPLYKDLIGRTKAALAKNPKNVLLAVVWMQGEF
DFDGTPANHTARFTEVVEQYRTDLADMVGQCAGGSADGVPWICGDTTYFW
KQKSESTYQTVYGSYKNKTEKNIHFVPFMTDENGANVPTNKPEEDPDIPA
SGYYGAASRTSANWTSADRASHFSSWARRGIISDRLASAILLHAGRTAEL
VGGEQVVMPPDEKPSPDTPSTPSTDGKSVTTLLYYRATESGGLLNPQGWG
AEGGRALVVDDAGAAGGKALRWTKQTGSSSWFMQHDAGNGADLLEKGGLI
SCRFKVDGTLTANQYALALYWPVSSLPQGVTLEGNAGHNLLASFYVQSDA
TDLNVMYHKGNAGQNTKLGSFGAFDNEWHTLGFRFAGNNSIEVTPVIDGK
DGTPFMLSQSPVGTFTADKLRVTDITSGATYPVLIESITVEVNNP
>ECs0440 AroM
MSASLAILTIGIVPMQEVLPLLTEYIDEDNISHHSLLGKLSREEVMAEYA
PEAGEDTILTLLNDNHLAHVSRRKVERDLQGVVEVLDNQGYDVIILMSTA
NISSMTARNTIFLEPSRILPPLVSSIVEDHQVGVIVPVEELLTVQAQKWQ
ILQKPPVFSLGNPIHDSEQKIIDAGKELLAKGADVIMLDCLGFNQRHRDL
LQKQLDVPVLLSNVLIARLAAELLV
>ECs3614 hypothetical protein
MNSFSLLTTPWLPVRFKDGTTGKLAPVDLADENVVDIAAPRADLQGAAWQ
FLLGLLQSSFAPKDYRRWDDIWEDGLEAEKLREALLSLEHPFQFGPDSPS
FMQDFEVLMGDKVQVASLLPEIPGAQTTKFNKDHFIKRGVTEHVCSHCSA
LALFSLQLNAPSGGKGYRTGLRGGGPMTTLIELQEYQGNQQAPLWRKLWL
NVMPQDEADLPLPKKFDDLVFPWLGPTRTSELAGAVVTDDQVNKLQAYWG
MPRRIRIDFNTTTVGNCDICGEQSDALLSLMTTKNYGANYAMWQHPLTPY
RVPLKEGGEFYSVKPQPGGLIWRDWLGLIETGKSENNTELPALVVKLFNA
SSLKQAKVGLWGFGYDFDNMKARCWYEHHFPLLLNKKEGQIPKLRLAAQT
ASRILSLLRSALKEAWFSDPKGARGDFSFVDIDFWNKTQHRFLRLVRQIE
EGQDADELLGKWQKEIWLFARQDFDERVFTNPYEPVDLERVMTARKKYFT
TSAEKQSAKAAREKKQEAAE
>ECs1706 hypothetical protein
MRDMSGAFNNDGRGISPLIATSWERCNKLMKRETWNVPHQAQGLTPLPSP
TTSARIIKVLLPLRFSTFNVVNVPAVDPWLAKSTVEWLSASVSRLELLVT
FQTLLFIATSSFPPLTLSLREEPVQTDPGCISRLISPPLERTLPLISIRP
EALMRLAA
>ECs0826 hypothetical protein
MNQNDIEAMIQRYTEAEMAVLDGKSVTFNGQQMTMENLSEIRQGRQEWER
RLAALITRRRGHPGYRLARF
>ECs4947 hypothetical protein
MAVFYIPDIYGRFYLVNFDNVKVISLAENKECGDLLFEFNDRTRMVISAG
LDREGATDVYSGICRSVGAKQVS
>ECs1943 hypothetical protein
MKIKHEHIESVLLALAAEKGQAWVANAITEEYLRQGGGELPLVPGKDWNN
QQNIYHRWLKGETNAQREKIQKLIPAVLAILPRELRHRLCIFDTLERRAL
LAAQDALSTAIDAHDDAVQAVYRKAHFSGGGSPSDSVVVH
>ECs2526 hypothetical protein
MFNSRLTIMEYRAVARSMDRHRRHFSIRPFNACLSGTLCRTFRLHFVVTP
ALFLASNSYSLSRSLSWNS
>ECs2036 hypothetical protein
MRTTSFAKVAALCGLLALSGCASKITQPDKYSGFLNNYSDLKETTSATGK
PVLRWVDPSFDQSKYDSIVWNPITYYPVPKPSTQVGQKVLDKILNYTNTE
MKEAIAQRKPLVTTAGPRSLIFRGAITGVDTSKEGLQFYEVVPVALVVAG
TQMATGHRTMDTRLYFEGELIDAATNKPVIKVVRQGEGKDLNNESTPMAF
ENIKQVIDDMATDATMFDVNKK
>ECs2276 putative replication protein
MILSRVAVMARLADYSNDEGVSWPAIETIRRQIGARSESTVKSAIAELAK
EGWLTKEERKVGGRNVSNIYRLNVEKLEAAAAAARESYKPKRKISPAKND
PLTVDPSNIAPSTVDPSNFDGSTVDNKLPIRGAMIDPDPSVLKPDPSDKR
SSCPDASQPDPQTAEQNFLTRHPDAVVFSAKKRQWGSQEDLVCAQWIWGR
IVSLYEQAASDDGEITRPKEPNWTAWANDVRTMRMLDGRTHRQICEMFGR
LQRDSFWVKNIMSPAKLREKWDELVIRLGRSPAQRCVNHISEPDTEIPPG
FRG
>ECs2618 hypothetical protein
MKVKKVQLLVTFLSMFSFSAVAMPFKTIERESFNGVWPFNTDEVQLQCLD
GNPYVMNFDDNKLYALTGLARIKGKTFGALPLDNNNPFWLDNDAAPGLKK
SLGDVTKAAFDLCDK
>ECs3858 hypothetical protein
MINPVTNTQGVSPINTKYAEHVVKNIYPEIKHDYFNESPNIYDKKYISGI
TRGVAELKQEEFVNEKARRFSYMKTMYSVCPEAFEPISRNEASTPEGSWL
TVISGKRPMGQFSVDSLYNPDLHALCELPDICCKIFPKENNDFLYIVVVY
RNDSPLGEQRANRFIELYNIKRDIMQELNYELPELKAVKSEMIIAREMGE
IFSYMPGEIDSYMKYINNKLSKIE
>ECs1995 hypothetical protein
MPVTTLSIPSISQLSPAGVQSLQDAARLESGIRISIGSGQYSVHYVQLLD
GFSVEPVRGGLLDRLLGREHRMERRAVALERQLNGGVDFLSSVNNYFQSV
MAEHRENKTSNKILMEKINSCLFRPDSNHFSCPESFLTCPITLDTPETGV
FMRNSRGAEICSLYDKDALVQLVETGGAHPLSREPITESMIMRKDECHFD
TKREAFCCK
>ECs1183 hypothetical protein
MLTHMFKISMPYPFIGIHMNFSIELVIQIVCRHLCFLILAAALQPLEKMY
QTR
>ECs2984 hypothetical protein
MNDSYRQFENWWSKDKSQFTGDDELKEFAWVIWQASRSAIELDIDWPESN
DDLWKDGEEGAYAMGYEDGRDKTVIAVMKAIRAAGIKEKNFD
>ECs1295 hypothetical protein
MNMAFYGKWFACLWLATSCVQAASTDNKALEIIRRADEIRSPNKPFRYTL
TVTEYKAGATQPENKQVLDISMRFMKPQGNEKADARSLVRFIYPPRDKGK
IMLSDWYDLWFYTPELRRPMPISRQQRLIGQISNGDVIVTNFEYAYDSTL
MGEVTCAEKQCYKLALVRKSADITWPKVIYYVEKDGDNRPWKAAYYSQDD
QLIKEVLYQDFQPVLGKTRPMKITVTDVRHGNNYSVMEYSDVRLESLPEF
HFTKEYIQRGAK
>ECs4536 hypothetical protein
MTATQLSASVLLWKYTAAVTSTGENDAIRLIRRESGCAAKYAGRGTKVGA
LRGRARWLKC
>ECs2978 hypothetical protein
MLTSDFLMVKAMLSPSQSLQYQKESVERALTCANCGQKLHVLEVHVCEHC
CAELMSDPNSSMYEEEDDG
>ECs1438 hypothetical protein
MMEKNNEVIQTHPLVGWDISTVDSYDALMLRLHYQTPNKSEQEGTEVGQT
LWLTTDVARQFISILEAGIAKIESGDFPVNEYRRH
>ECs1934 RecE
MSTKPLFLLRKAKKSSGEPDVVLWASNDFESTCATLDYLIVKSGKKLSSY
FKAVATNFPVVNDLPAEGEIDFTWSERYQLSKDSMTWELKPGAAPDNAHY
QGNTNVNGEDMTEIEENMLLPISGQELPIRWLAQHGSEKPVTHVSRDGLQ
ALHIARAEELPAVTALAVSHKTSLLDPLEIRELHKLVRDTDKVFPNPGNS
NLGLITAFFEAYLNADYTDRGLLTKEWMKGNRVSHITRTASGANAGGGNL
TDRGEGFVHDLTSLARDVATGVLARSMDLDIYNLHPAHAKRIEEIIAENK
PPFSVFRDKFITMPGGLDYSRAIVVASVKEAPIGIEVIPAHVTEYLNKVL
TETDHANPDPEIVDIACGRSSAPMPQRVTEEGKQDDEEKPQPSGTTAVEQ
GEAETMEPDATEHHQDTQPLDAQSQVNSVDAKYQELRAELHEARKNIPSK
NPVDADKLLAASRGEFVDGISDPNDPKWVKGIQTRDCVYQNQPETEKTSP
DMNQPEPVVQQEPEIACNACGQTGGDNCPDCGAVMGDATYQETFDEESQV
EAKENDPEEMEGAEHPHNENAGSDPHRDCSDETGEVADPVIVEDIEPGIY
YGISNENYHAGPGVSKSQLDDIADTPALYLWRKNAPVDTTKTKTLDLGTA
FHCRVLEPEEFSNRFIVAPEFNRRTNSGKEEEKAFLRECASTGKTVITAE
EGRKIELMYQSVMALPLGQWLVESAGHAESSIYWEDPETAILCRCRPDKI
IPEFHWIMDVKTTADIQRFKTAYYDYRYHVQDAFYSDGYEAQFGVQPTFV
FLVASTTIECGRYPVEIFMMGEEAKLAGQLEYHRNLRTLADCLNTDEWPA
IKTLSLPRWAKEYAND
>ECs5161 hypothetical protein
MARKRKSRNNSKIGHGAISRIGRPNNPFEPRRNRYAQKYLTLALMGGAAF
FVLKGCGDSGDVDNDGDGTFYSTVQDCIDDGNNADICARGWNNAKAAFYA
DVPKNMTQQNCQSKYENCYYDNVEQSWIPVVSGFLLSRVIRKDRDEPFVY
NSGGSSFASRPVWRNTSGDYSWRSGSGKKESYSSGGFTTKKASTVSRGGY
GRSSSARGHWGG
>ECs0276 cII
MVRANKRNEALRIESALLNKIAMLGTEKTAEAVGVDKSQISRWKRDWIPK
FSMLLAVLEWGVVDDDMARLARQVAAILTNKKRPAATERSEQIQMEF
>ECs5484 hypothetical protein
MGDFIQPMLLGIGFIRLAAFAGAKSGGECLAGICVEGDVFPQSMSGAARR
TAKDACGAHGENEFAVGIRVAG
>ECs3877 HybE
MTEEIAGFQTSPKAQVQAAFEEIARRSMHALSFLHPSMPVYVSDFTLFEG
QWTGCVITPWMLSAVIFPGPDQLWPLRKVSEKIGLQLPYGTMTFTVGELD
GVSQYLSCSLMSPLSHSMSIEEGQRLTDDCARMILSLPVTNPDVPHAGRR
ALLFGRRSGENA
>ECs3159 hypothetical protein
MGMIGYFAEIDSEKINQLLESTEKPLMDNIHDTLSGLRRLDIDKRWDFLH
FGLTGTSAFDPAKNDPLSRAVLGEHSLEDGIDGFLGLTWNQELAATIDRL
ESLDRSELRKKFSIKRLNEMEIYPGVTFSEELEGQLFASIMLDMEKLISA
YRRMLRQGNHALTVIVG
>ECs0083 fruR leader peptide
MRNLQPNMSRWAFFAKSVGTWNKSSCRS
>ECs2241 putative tail protein
MRLALRLGRTLSELRHSLSASEAMMWMEFDRISPLGDERGDIRNAQIVKA
VFGAQGMNVALKDAMLCWGEDEDKPEVDPFAALEDALSLAAQS
>ECs1517 hypothetical protein
MNGQISIVRPGACDDREIRMIIRLAMGKTITALITPENLALALTGKSDMP
VELKLRNVEIKVK
>ECs1237 hypothetical protein
MAPVSGERMNDKILWYMQRVVRNSRNPEFMNEVKDACLKKQAFCFEAPDG
FLVLRSVLSADGIPYVLVLLGVCTGSNSVERYLPEVKTLTHLAGGRWAEF
HTARRGFIRLGKRLGFERMPDDEDGFMVFRIAV
>ECs3388 enhanced serine sensitivity
MSETKNELEDLLEKAATEPAHRPAFFRTLLESTVWVPGTAAQGEAVVEDS
ALDLQHWEKEDGTSVIPFFTSLEALQQAVEDEQAFVVMPVRTLFEMTLGE
TLFLNAKLPTGKEFMPREISLLIGEEGNPLSSQEILEGGESLILSEVAEP
PAQMIDSLTTLFKTIKPVKRAFICSIKENEEAQPNLLIGIEADGDIEEII
QVAGSVATDTLPGDEPIDICQVKKGEKGISHFITEHIAPFYERRWGGFLR
DFKQNRII
>ECs1163 hypothetical protein
MVLSKQLTGCRYQNRRRRLTVANLQLAVKGEYFDAMIRGEKTEEYRLFND
YWNKRIMFREYDRLIITKGYPKRDDSSRRIDVPYDGYEIKTITHPHFGDK
PVKVFAIKVNIGNE
>ECs2974 Shiga toxin I subunit A precursor
MKIIIFRVLTFFFVIFSVNVVAKEFTLDFSTAKTYVDSLNVIRSAIGTPL
QTISSGGTSLLMIDSGTGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYV
TGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRVAGISRTGM
QINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGF
RTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGS
INAILGSVALILNCHHHASRVARMASDEFPSMCPADGRVRGITHNKILWD
SSTLGAILMRRTISS
>ECs2181 hypothetical protein
MSSKNRTRRTTTRNIRFPNQMIEQINIALDLKGSGNFSAWVIEACRRRLI
NEKYSQFVPNKDKHDQRTCSDRFT
>ECs0101 SecA regulator SecM
MLWTSGFNDKICALNTFEYDRDGNNVSGILTRWRQFGKRYFWPHLLLGMV
AASLGLPALSNAAEPNAPAKATTRNHEPSAKVNFGQLALLEANTRRPNSN
YSVDYWHQHAIRTVIRHLSFAMAPQTLPVAEESLPLQAQHLALLDTLSAL
LTQEGTPSEKGYRIDYAHFTPQAKFSTPVWISQAQGIRAGPQRLS
>ECs2212 hypothetical protein
MPNKKRNPLIEKQIECLVNQLRQSGLLKTHSELRLTESAFDDKLNNVLYN
GIIDFNRSVGRRGPAGVSL
>ECs0016 Gef
MLNTCRVPLTDRKVKEKRAMKQHKVMIVALIVICITAVVAALVTRKDLCE
VHIRTGQTEVAVFTAYESE
>ECs1599 hypothetical protein
MTEAELLGLIRRVSGISQQADEQTTQPDSVTAENYARVVAEVMCRDGIQL
NDVDMRNIRIRVLEMLAYNRRVELHREKEKITYHWKKPERLRR
>ECs3717 EprJ
MSVSNMPPIDRAEQSTAHEIQQAKVIDLNDRVLNLDNPDDKMISAFANYA
VQTENWQQNALQALTSDKKGLTPEKLLVLQDHVLNYNVEVSLVGTLARKI
VAAVETLTRS
>ECs1173 hypothetical protein
MKHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRG
AVCTKHLPLS
>ECs1802 CHECK ME putative tail assembly chaperone
MRLALRLGRTLSELRHSLSASEAMMWMEFDRVSPLGDERGDIRNAQIVKA
VFGAQGMNVALKDAMLCWGEDEDKPEVDPFAALEDALSLAAMS
>ECs2965 lipoprotein Rz1 precursor
MRELKMKLCVLMLPLVVSACGSTPPAPVPCVKPPAPPAWIMQPAPDWQTP
LNGIISSSENG
>ECs1981 putative tail assembly chaperon
MAKDLKTLALARLSGFRHKTVKVPEWRNVSVVLREPSAEAWYLWQEVLNG
DGEDDDTLSVVAKTRRNLEADVTLFCDVLCDTDLQRVFAPDDREQVLAVY
GPVHARLLRQALELIADAESARKK
>ECs3727 EivJ
MQTKGAEQFSTKKLLNMTSRDQGINSELSNRTIQFKEKIHNGIHTEYITD
QKHSNNKDREKKYRDGDKINGPQAHSLDITNERRFADNRTMFTQHIEKQR
NVNTLNQNDINNSANNANVRENELTYQFQRWGQNHTVRILESSEGIRLKP
SDTLVSDRLHEAQHNDVTAQRWVLTEQDERQGQRHQPHEEQENEGKFEND
QKDES
>ECs4954 hypothetical protein
MIDAKVLEGVKNWLSIYGRLTCGILAEKMNMPPSSMVYFLRDAVDAGVLT
ECNGFYDIPRPRPVQPVRRKCSQEGAADDVQWCSFRKSLPWIEGHDIPSM
AWEFAQGVLTCETVYVVAEVDEQAMKEGVPQFVMAYIDIRLGVIICGLSG
WNITEHVLRYLIVDRTAAPAGISAEVA
>ECs5431 hypothetical protein
MAFSTEGPEVRLLITTTELKCNAVIEFTVTKVNHASTETTVSGIPPPDTT
FEGNPGTATDTTLFPPQDIYPPGVQSFDSAR
>ECs2988 regulatory protein CII
MAQASYSKPTQREIDRAETDLLINLSTLTQRGLAKMIGCHESKISRTDWR
FIASVLCAFGMASDISPISRAFKYALDGITKKKSPVAAGDSKQIDMQF
>ECs2765 putative cell division control protein
MRVDELVQFFGSVQRVADFYGITREAIYMWRKRPGEIVPKGRAAEAAAYS
KGKLSLNPELYKKKDTTQGEGKGDS
>ECs2016 hypothetical protein
MSRHDILLRSQFERIIEGDRVGQALISFYEKLPEENYRRALYILSIIYPI
KLNVGDDEFKFIFYIMSQKKFLRQQTISDFVRSINVIEFTETQKSVLREL
IKKNNDIIITQCTFELDCLLTRVSASSNQFRNSNGYLPENS
>ECs3898 hypothetical protein
MLESVAITETDADHADAVVRFRIFKDDKEKTTQTLKMVAENGRWVIDDIV
SNHGSVLQAVNSENEKTLAALASLQKEQPEAFVAELFEHIADYSWPWTWV
VSDSYRQAVNAFYKTTFKTANNPDEDMQIERQFIYDNPICFGEESLFSRV
DEIRVLEKTADSARIHIRFTLTNGNNEEQELVLQRREGKWEIADFIRPNS
GSLLKQIEAKTAARLKQ
>ECs5582 hypothetical protein
MVINFITPEGDDYMNISYVNSNKTTSLPIELDVQNNKDFSYAKDFFLYIE
TQLKIAKDFCRPEEEVSSSIASTVFHAFIDLVNKIMGKKDCMYICTLCCF
AEEVKDDYSHYRTFSFDISNQYKVKLTQRSSL
>ECs2942 putative outer membrane protein Lom precursor
MRKLCAVILSAVVWQVAAATPASAAEHQSTLSAGYLHASTNVPGSDDLNG
INVKYRYEFMDALGLITSFSYANAEDEQKTRYSDTRWHEDSVRNRWFSVM
AGPSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDD
GRHSNTSLAWGAGVQFNPTESVAIDIAYEGSGSGDWRTDGFIVGVGYKF
>ECs5292 hypothetical protein
MASTNAHQIKNNDQNSLCGLGDKIRRLTAGVCLFTQFFFPVMATAQNVVH
AKPQTTVSSAPPSTRK
>ECs1594 hypothetical protein
MTEELITLEEVKLHCRIDGDEEDQLISGYIAASLEACQIHIGRRFDDGLE
FTPAIKIGCMMFIAHLYENRQIVADNAKTRVPMTIGALWTAYRDVGVY
>ECs0240 hypothetical protein
MKIIRLLLLACSITIHFFIVKEHVFKIFNQHAYCFLFAFYVLIVFCFLLF
FKRSFIVDLIACIIIPYISSVISFLFLSMLISPHLRERSPNAAFLY
>ECs4902 hypothetical protein
METCEFMQLKNCIICFIGHILCDTMKKQPNGGQISTLFQHHCAQQCAPAK
P
>ECs2422 phenylalanyl-tRNA synthetase operon leader peptide
MNAAIFRFFFYFST
>ECs1419 CsgB
MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIG
QAGTNNSAQLRQGGSKLLAVVAQEGSSNRAKIDQTGDYNLAYIDQAGSAN
DASISQGAYGNTAMIIQKGSGNKANITQYGTQKTAIVVQRQSQMAIRVTQ
R
>ECs4995 hypothetical protein
MPEIKGTVTEELVKQALYSEEVNRVLKAQVRKDFEAQIDAYVDEVLARMV
GRSPAENSTENDPQPVEQPEPVQPGTDGTMM
>ECs5518 hypothetical protein
MKNFTVVQYQILSQFQIEHIQLFKEQFFIKLNKLFPQCSVKMLTYGIYFR
RIEISPYEPDGGFL
>ECs1109 putative head decoration protein
MVTKNITEQRAEVRIFAGNDPAHTATGSSGISSATPALTPLMLDEASGKL
VVWDGQKAGSAVGILVLPLEGTETVLTYYKSGTFATEAIRWPESVDEHKK
ANAFAGSALSHAALP
>ECs2263 hypothetical protein
MLCHKSVISVFLKLSSLSLYFRLRMARPFFYDQPLADGHPVI
>ECs2274 hypothetical protein
MGTQKNGVVDKTGHTWFLAVEGEAGVTEGQALQPEAPDVVTEEVAPKVTA
DMMVEFIGQDGAKTCEELAGKFGVSTRKVASTLAVVTATGRLARVNQNGK
FRYCMSGGNLPADPKAAPVTKNDGKAFPQPAGAALPVREAATQEEIKTET
VADIVQPLPSFTETQADELIFPSLRRANLALRRAKSDVQKWERVCAALRE
LNKHRDIVRQITDSSRRVVSEK
>ECs2107 putative adhesin
MGKTISIKVLFGIYLLLMAGKVFAFSCNVDGGSSIGAGTTSVYVNLDPVI
QPGQNLVVDLSQHISCWNDYGGWYDTDHINLVQGSAFAGSLQSYKGSLYW
NNVTYPFPLTTNTNVLDIGDKTPMPLPLKLYITPVGAAGGVVIKAGEVIA
RIHMYKIATLGSGNPRNFTWNIISNNSVVMPTGGCTVDSRNVTVNLPDFP
GSAEIPLGVYCSSEQKLSFYLSGTTTDSARQVFANTAPDATKASGVGVSL
MRNGKILATGENVSLGTVNKSKVPLGLSATYGQTGNKVSAGTVQSVIGVT
FIYE
>ECs2074 hypothetical protein
MITDLILHNHPRMKTITLNDNHIAHLNTKTTTKLEYLNLSNNNLLPTNDI
DQLISSKHLWHVLVNGINNDSLAQMQYWTAVRNIIDDTNEVTIDLSYNLA
ITNIDTSDEHLVEVSDNSEGNYIKDNDSMSIRYRSKYYSRDSALIEEETI
FSDAELKAILPMRRMYGVSDYKSNSSSLSSHSGLKDPTGTPVCYYIHNES
KPSLGYGPTSNNWLSQSFTTEL
>ECs1362 hypothetical protein
MPGNWTYNYSSVLKERSEMFKYAEMLMAEEDDKVCELIFKQQDNLILKIN
VRFPRIIIKKYRTLQGKVVWLWILPFVIEWLIVCTLGVNS
>ECs1994 hypothetical protein
MPLTSDIRSHSFNLGVEVVRARIVANGRGDITVGGETVSIVYDSTNGRFS
SSGGNGGLLSELLLLGFNSGPRALGERMLSMLSDSGEAQSQESIQNKISQ
CKFSVCPERLQCPLEAIQCPITLEQPEKGIFVKNSDGSDVCTLFDAAAFS
RLVGEGLPHPLTREPITASIIVKHEECIYDDTRGNFVIKGN
>ECs1174 exonuclease
MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKW
PDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSGVNV
IESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAI
KSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFD
EMVPEFIEKMDEALAEIGFVFGEQWR
>ECs2004 hypothetical protein
MKIVIAAFASSFMLVGCTPRIEVAAPKEPITINMNVKIEHEIIIKADKDV
EDLLKTRSDLF
>ECs2767 hypothetical protein
MDMLNLGNNESLVCGVFPNHDGTFTAMTYTRSKTFKTEAGAHRWLARNAN
>ECs2262 hypothetical protein
MSIKHYDVVRAASPSDLAEKLTHKLKEGWQPFGSPVAITPYTLMQAITAE
GDVVVSGATEPDWYYVIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLAR
RSTVTPGGAACRYNDIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQG
LHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQDSARW
GVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDMSAATHAQQPALF
TAMLAQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYG
GYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGN
QVSSNRPTHFSSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPG
GNTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQS
NKTWSLTHPVDDAITLLTQGGRLTCKFRLSGALTNNQFGLGIYLYTDAPV
PDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQ
TLELVFTAGSATVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAY
GVEIESLVLEINAPAA
>ECs0916 putative receptor
MFVDRQRIDLLNRLIDARVDLAAYVQLRKAKGYMSVSESNHLRDNFFKLN
RELHDKSLRLNLHLDQEEWSALHHAEEALATAAVCLMSGHHDCPTVITVN
ADKLENCLMSLTLSIQSLQKHAMLEKA
>ECs4865 hypothetical protein
MSFGVLPATVNLNGLATGTMKVNCDHETSLKASIISLIQLIGGITVYRSD
SLRKGRATTKQQTDYGGNYFFHHPLPRKQRPFFGNRGELWRIENGLW
>ECs1580 hypothetical protein
MKNNKLATSELRSLANSAISLNNAANEVMNMWLQSLVNNERDESTARLIL
AVSELLDSSTSKIESISEKINGLSGHAIGE
>ECs1628 hypothetical protein
MSTKNRTRRTTTRNIRFPNQMIEQINIALDQKGSGNFSAWVIEACRRRLT
SEKRAYTSIKSDEE
>ECs3037 hypothetical protein
MDVQQFFVVAVFFLIPIFCFREAWKGWRAGAIDKRVKNAPEPVYVWRAKN
PGLFFAYMVAYIGFGILSIGMIVYLIFYR
>ECs2196 hypothetical protein
MTQHIKSHNSEADPEIKQGRRFRAPQYGWFHYLFCTIDEADMLQEAYLRR
GVRVERSLNADRLTWTVSVYLPVRAHLPRTHACYRQRVWR
>ECs0620 hypothetical protein
MLTKYALVAVIVLCLTVPGFTLLVGDSLCEFTVKERNIEFRAVLAYEPKK
>ECs1058 hypothetical protein
MNTTFALVLTVYLVSGESLELVTGLYGSMKECMAAAAEQKIPGNCYPVDK
TTHTNNNEIPAGL
>ECs0005 hypothetical protein
MKKMQSIVLALSLVLVAPMATQAAEITLVPSVKLQIGDRDNRGYYWDGGH
WRDHGWWKQHYEWRGNRWHPHGPPPPPRHHKKAHHDHHGGHGPGKHHR
>ECs4659 hypothetical protein
MSWINCWREKTPKSSANCRPDKAFCRIWQSARSYIVTGLIHFLTEILPHK
LTSKLSPVH
>ECs1955 hypothetical protein
MTENTTPTADFSPSKRFFVSMLTRDIDLNDAILDLLDNCVDGALRTIKDT
KKTSKPYEGFYAKLTINKDVFIIEDNCGGIPKSFREYAFKMGRPHQKEEE
NEGTVGVYGIGMKRAIFKMGRDCSIQSNNPDGAFTVDITPDWIDGDGWKI
PMHESDYDNKNPTGTTIEIKKLHSNVAQKFNESTYLTDLFLQIKHSLSFI
IQKGFKIELNGVVVEHNPINIITDHSKIEPYIYKAKIDDVDIDLVVGFYK
NLEDDNDDVLEKRSSDDAGWTVICNDRVVLYCDKTHLTGWGFANVPRFHT
QFIAISGVVRFTSKNPEKLPITTTKRGVDLSSTLYNDVRNKMIEGMMHFI
RFTNQWKGEHLEEGKKLLKSAQSHEAQSLFEITPQSTPEDKKNNWSNPNR
NKNEWRYTPKLPTPVKKSSTVRIIFTREKEDVKILSKYFFGHEDASASDV
GMQCFDTVIEEVK
>ECs1092 putative lipoprotein Rz1 protein precursor
MRELKMKLCVLMLPLVVSACGSTPPAPVPCVKPPAPPAWIMQPAPDWQTP
LNGIISSSENG
>ECs5009 hypothetical protein
MKKVLYGIFAISALAATSAWAAPVQVGEAAGSAATSVSAGSSSATSVSTV
SSAVGVALAATGGGDGSNTGTTTTTTTSTQ
>ECs1243 hypothetical protein
MFTPYRRGTIPAIRIADGTIQAHDDIDEEFFQPVLDGFLISKYTPFDILH
ALKDGVLQRTG
>ECs2335 hypothetical protein
MRRYSGEPMTTTTPQRIGGWLLGPLAWLLVALLSTTLALLLYTAALSSPQ
TFQTLGAQALTTQILWGVSFITAIAMWYYTLWLTIAFFKRRRCVPKHYII
WLLISVLLAVKAFAFSPVEDGIAVRQLLFTLLATALIVPYFKRSSRVKAT
FVNP
>ECs1612 replication protein P
MKNIAAQMVNFDREQMRRIANNMPEQYDEKPQVQQVAQIINGVFSQLLAT
FPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFL
PSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKS
NAHYWLVTNLYQNMRANALTDAELRRKAADELTCMTARINRGETIPEPVK
QLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV
>ECs2953 putative minor tail protein U
MKHREIRAAVLSALKENISERVSWFDGRPVFIDEQELPAVAVYLTDASAA
DEFVDEGTWEATLHIEVFLRAKEPDSALDMWMEEKILPALEAVPGLSALL
LKMNLQGYDYRRDDEFMMWGSADLLWKITYEM
>ECs1203 antitermination protein Q
MRDIRQVLERWGAWAANNYEDVTWSPIAAGFKGLIPEKVKSRPQCCDDDA
MVICGCIARLYRNNRDLHDLLVDYYVLGETFMALARKHGCSDTCIGKRLH
KAEGIVEGMLMMLGVRLEMDRYVERELPGGRTSVFYQRKNSLRS
>ECs1091 putative transcriptional regulator
MLHDHVAECLEKKGLYRRAAERWAKVMVQLSDDQKRKVAAQKRAECLRKA
RRTPVSPVNLTEIKQAVNRLHSELGMGFEERRVFRRYKGTGEQNTSGNAR
SKKC
>ECs1127 hypothetical protein
MSGTSGSSSDAALATRYAAEYFCKTWTAPGLNQAEGYKAISDLSHHYFRA
EGSSPPQSFL
>ECs2024 hypothetical protein
MKKLALILFMGALVSFYADAGRKPCSGSKGGISHCTAGGKFVCNDGSISA
SKKTCTN
>ECs3233 putative DNA transfer protein precursor
MLVLSESFKNKLLPMNGYMKGGSDSGSKAQARATEKGIELQREMWQTNMQ
NLAPFTPLAQQYVSQLQNLSSLQGQGQALNQYYNSQQYKDLAGQARYQSL
AAAEATGGLGSTATGNQLAAIAPTLGQNWLSGQMNNYNNLANIGLGALTG
QANAGQNYANNVSQLYQQQAAASAANANKPSGLQSFATGAIGGAASGAMI
GSAVPVIGTGIGALAGGVIGGLGSLF
>ECs0790 hypothetical protein
MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH
>ECs1192 hypothetical protein
MNDSYRQFENWWSKDKSQFTGDDELKEFAWVIWQASRSAIELDIDWPESN
DDFWKDGEEGAYAMGYEDGRDKTVIAVMKAIRAAGIKEKNFD
>ECs3989 hypothetical protein
MMMSKKSAKKRQPVKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLA
EDEATA
>ECs0824 hypothetical protein
MDHELKNLVLNINQLAALSGLHRQTVVARLKNIRPAGGHDKLKLYRLTDI
LTEFMGLPPPVAEGEMDPHERKAWYQSERERLKFEQETAQLIPASDVRRE
FAIWAKAVVQVLETLPDILERDCGLQPAAVSRVQSIIDDLRDQIALRVTE
AGADDEEELQQEE
>ECs4564 hypothetical protein
MSLSGAVFKTFLTSEHASWNRFNRRLHIPNEDIVDEIQLKARMQQRHHRV
YPEIGDSTIVSFRGKDYAVHFIKDGPKDDYVYKVQRITPENGCFSTLFSV
FSGGVTKALERKLNERHITPLSSTWFPRTPLEGILAERGLSSLLRRVQST
ERLDNRAIATRASSYSVL
>ECs1979 hypothetical protein
MTEADLYPYLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVM
GGQAESSVSVQIDVYAGTVTQARQIRQDAREAIMLLAPGSVSEMQDYIPE
NRCYRATLEFQVTV
>ECs5536 hypothetical protein
MMMNSFFPAMALMVLVGCSTPSPVQKAQRVKVDPLRSLNMEALCKDQAAK
RYNTGEQKIDVTAFEQFQGSYEMRGYTFRKEQFVCSFDADGHFLHLSMR
>ECs1671 hypothetical protein
MNMMRIFYIGLSGVGMMFSSMASGHDAGGLQSPACGVVCDPYICVNSDGI
SPELTRKYLSEKAAENLQSLQGYDPSEFTFANGVFCDVKEKLCRDDRYFG
VDGKRSGKINQTTTKMLFMCRE
>ECs1248 hypothetical protein
MSKIDYQALREKAEKATKGSYIVGHTSVNQHGNLTGVFVCQKWKGEPGGV
IAECHVNCLVETDAQAYATLNS
>ECs0347 hypothetical protein
MDACLFHCKYPGISNARIFTEEEHNHDGTGLSQVVLNAIFNLVCLLQVYV
QTSYLSQQSSIIRYTAFTGP
>ECs2737 putative transcriptional regulator
MLHDHLAECLEKKGLYRRAAERWAKVMVQLSDDQKRKVAAQKRAECLRKA
RRTPVSPMNLTEIKQAVNRLHSELGMGFEERRVFRRYKGTGEQNTSGNAR
SKKC
>ECs5540 hypothetical protein
MMRAAKANYTYASFDAHLQKVAAMKPTMLLMITVFLIFPALSQAESPFSS
LQSAKEKTTVLQDLRKICTPQASLSDEAWEKLMLSDENNKQHIREAIVAM
ERNNQSNYWEALGKVECPDM
>ECs1378 hypothetical protein
MARSDYDIINLSLEHELNEWLAERGYAGLVDNRNRLAEVVTRKLQDSFYI
NVSRDALNTAYSEHPEWFSGLVSGDEN
>ECs2175 putative head decoration protein
MTSKETFTHYQPLGNSDPAHTATAPGGLSAKAPAMTPLMLDTSSRKLVAW
DGTTDGAAVGILAVAADQTSTTLTFYKSGTFRYEDVLWPEAASDETKKRT
AFAGTAISIV
>ECs2619 hypothetical protein
MAVNSNGSNNRVAGRDFHEKNIQIERYDGSHTVNIAIPSNNDDDDRPLLK
AQRKELNSLVAAIAEASNTEAFIIWQKVHAEIGVAGIDDMTVNQYKTAES
FLHAMLERCKDHDACKALVSLLLRNSEDCGLRQKLLRYCHINFGTGRLND
LTRSQLQSALSWLEQQSASSHIESSTLPEVRLRASELIRLYPKEIIFFIC
VGVLVGGVISRAFFNL
>ECs0348 hypothetical protein
MSEQIKQDIDLIEILFYLKKKILVILFIIAICMAMVLLFLYINKDNIKVS
YSLKINQTTPGILVNCDSNNNFACQTTMTEDVIQRITTFFHTSPDVKNRE
IKLEWSGDKRDLPTAEAEISRVQASIIKWYASEYHNGR
>ECs5537 hypothetical protein
MIYCNKDDSQEASSQTPFLPLGFAFTTLYICQPDYSGGEDESMDGDINKY
LVLAIICVGGLSGLVASQSTGRNFPPATANKTVGR
>ECs0296 putative Ogr family transcription activator
MALKCPECGTTAHARTSAYEAPSVKRSWYQCQNLECSYTFTALESVDTII
MKPRRNEQESEKAQMPEKQQQTLNRYGSASKLSSRQQIPV
>ECs4366 universal stress protein UspB
MISTVALFWALCVVCIVNMARYFSSLRALLVVLRNCDPLLYQYVDGGGFF
TSHGQPNKQVRLVWYIYAQRYRDHHDDEFIRRCERVRRQFILTSALCGLV
VVSLIALMIWH
>ECs3981 hypothetical protein
MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQL
LLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGG
IWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ
>ECs2626 hypothetical protein
MNAKEKNIINTLKIVSAEQDKLSRAAQKDNQHMAALYALTIAIATPEAAK
VIEEQSKEIDTLKTQSTVAAMNPSSIGRCIYILGSAMMLQYTIIAELHGK
YLITPYHTKESELLTNLRLIERSQAVFIDDAQRAVFNA
>ECs1176 Gam
MDINTETEIKQKHSLTPFPVFLISPAFRGRYFHSYFRSSAMNAYYIQDRL
EAQSWTRHYQQIAREEKEAELADDMGKGLPQHLFESLCIDHLQRHGASKK
AITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV
>ECs2926 hypothetical protein
MIVQKELVAIYDYEVPVPEDPFSFRLEIHKCSELFTGSVYRLERFRLRST
FHQRDREDADPLINDALIYIRDECIDERKLRGESPETVIAIFNRELQNIF
NQEIE
>ECs1631 head-to-tail joining protein
MTRQEELAAARAALHDLMTGKRVATVQKDGRRVEFTTTSVSDLKKYIAEL
EVQTGMTQRRRGPAGFYV
>ECs4213 hypothetical protein
MKKLTDKQKSRLWELQRNRNFQASRRLEGVEMPLVTLTAAEALARLEELR
SHYER
>ECs0702 putative RNA
MMCSNQLSYVARLFLRWLGYLDSNQGMPVSKTGALPLGDTPITGR
>ECs0686 putative tRNA ligase
MNKEEQYLLFALSAPMEILNQGCKPAHDSPKMYTGIKEFDLSSSWGINNR
DDLIQTIYQMTDDGHANDLAGLYLTWHRSSPEEWKALIAGGSERGLIYTQ
FVAQTAMCCGEGGIKAWDYVRMGFLSRVGVLNKWLTEDESLWLQSRVYER
AHHYYHSWMHYFSAYSLGRLYWQSSQCEDNASLREALTLYKYDSAGSRMF
EELAAGSDRFYATLPWQPLTVQPECPVTLKDVSDL
>ECs1367 hypothetical protein
MCHRAHQSCTTRDVALDRHLPDCEDILKEVIWAFSDFVRDNRGVYDPEAR
YPAGNPWYPVTGQF
>ECs2631 putative derepression protein
MADRKQHRAIAERRHIQTEINRRLSRASRVAQIMHINMLHERSHALSNIY
SASVFSYLADDLHEFQQLIQQQNKLH
>ECs5383 hypothetical protein
MHSHPLYAKSAIFPGSHCSLLLQNYSVSFVGITMAEAFYILIGFLIMAAI
IVMAVLYLENHS
>ECs2150 hypothetical protein
MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVE
GCEEKLAPLDLIRLGMN
>ECs4412 hypothetical protein
MKRKLFWICAVAMGMSAFPSFMTQATPATQPLINAEPAVAAQTEQNPQVG
QVMPGVQGADAPVVAQNGPSRDVKLTFAQIAPPPGSMVLRGINPNGSIEF
GMRSDEVVTKAMLNLEYTPSPSLLPVQSQLKVYLNDELMGVLPVTKEQLG
KKTLAQMPINPLFITDFNRVRLEFVGHYQDVCENPASTTLWLDVGRSSGL
DLTYQTLNVKNDLSHFPVPFFDPRDNRTNTLPMVFAGAPDVGLQQASAIV
ASWFGSRSGWRGQNFPVLYNQLPDRNAIVFATNDKRPDFLRDHPAVKAPV
IEMINHPQNPYVKLLVVFGRDDKDLLQAAKGIAQGNILFRGESVVVNEVK
PLLPRKPYDAPNWVRTDRPVTFGELKTYEEQLQSSGLEPAAINVSLNLPP
DLYLMRSTGIDMDINYRYTMPPVKDSSRMDISLNNQFLQSFNLSSKQEAN
RLLLRIPVLQGLLDGKTDVSIPALKLGATNQLRFDFEYMNPMPGGSVDNC
ITFQPVQNHVVIGDDSTIDFSKYYHFIPMPDLRAFANAGFPFSRMADLSQ
TITVMPKTPNEAQMETLLNTVGFIGAQTGFPAINLTVTDDGSTIQGKDAD
IMIIGGIPDKLKDDKQIDLLVQATESWVKTPMRQTPFPGIVPDESDRAAE
TQSTLTSSGAMAAVIGFQSPYNDQRSVIALLADSPRGYEMLNDAVNDSGK
RATMFGSVAVIRESGINSLRVGDVYYVGHLPWFERLWYALANHPILLAVL
AVLAVLAAISVILLAWVLWRLLRIISRRRLNPDNE
>ECs2172 putative tail attachment protein
MADFDNLFDAAIARADETIRGYMGTSATMTSGERSGAVIRGVFDDPENIS
YAGQGVRVEGSSPSLFVRTDDVRQLRRGDTLTINGEMFWVDRVSPDDGGS
CYLWLNRGQPPAVNRRR
>ECs0239 hypothetical protein
MESRKTLPVATTPTHKITFTTTATSSSKRKLKPKLASSSPVIITMYLADA
SKVAREKMKI
>ECs1404 hypothetical protein
MRIITRGEAMRIHQQHPASRLFPFCTGKYRWHGSAEAYTGREVQDIPGVL
AVFAERRKDSFGPYVRLMSVTLN
>ECs1567 hypothetical protein
MPFSIKNRFSSSQVHYPEISGPIKDKPASKNCILTSTTCNVDSYTVYQKK
ACSFDMRPPGAGERTPKLKLSVTEMTWLSKTIETEIHNTKE
>ECs3883 hypothetical protein
MNNHFGKGLMAGLKATHADSAVNVTKFCADYKRGFVLGYSHRMYEKTGDR
QLSAWEAGILTRRYGLDKEMVMDFFRENNSCSTLRFFMAGYRLEN
>ECs5465 hypothetical protein
MKQVLHDTRSRIPRNTATGPRLALRLSLEERAVIDEMAAKEQRSSSNMAR
MISLRGLELTQKEQNKSS
>ECs5481 hypothetical protein
MGDWRFFISEPGIISIEDLPPGWGLLHVVNGRVRKVHGWPKGNCCWGNPD
DKPFTGNKQVECDYMLSALRRMELRGHLNEIYDGVIVNKKEGNAA
>ECs3999 threonine dehydratase operon activator protein
MTGITIFYGDNIIRYVVNTKKGLRPYFKQLPDNYQAKFELNLMSKFSNFI
INKPFSAINNAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFF
SSDRTSFCECNRFP
>ECs1060 hypothetical protein
MSLKRMVNLTTRCPQRGKNNPAILTAAPFRGLPQPEAHGRIKFNDGSKGE
GLRRALSCYALTFQGYILSVNCQCRILICVRRTTTRSSVLVSIFNSALNG
G
>ECs5437 hypothetical protein
MAFSTEGPEVRLLITTTELKCNAVIEFTVTKVNHASTETTVSGIPPPDTT
FEGNPGTATDTTLFPPQDIYPPGVQSFDSAR
>ECs1070 hypothetical protein
MLDSTREKIRQKYTQAEIGRYMGVAQQTVWQWFSFGVPPKQVIPLCQLMK
WEVTPHEIRPDIYPNPTDGLPVGCKVNTSNAPELIHENQA
>ECs2376 hypothetical protein
MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESL
VEYAWKKWLADENFAHQEVSSMQKLATDPGERAFCSQFARSDDHARIGCC
EDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSR
ISSAGDSSRIANTGMRVRVCTLGKRCHVASNGDLVQIASFGANARIANSG
DNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGE
NNIRAGVRYRLNEQHQFVEC
>ECs1547 hypothetical protein
MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAP
VRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNA
FYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR
>ECs4980 hypothetical protein
MSGKQYQGTATIRVNGQEYETLEGATFSPSGFEREVVKGAKVYGYRQKPR
EATLDCKFPAGGEGSPAADEINNWTAVTIEFVADTGEVHMMTKAWSSEPA
SLDGGGEISVKFASASSTRVQ
>ECs1538 hypothetical protein
MDFMLTVSGVVILSIAYTADKYGCHLLSRIGAYCSLMLIFSSLFFE
>ECs0513 haemolysin expression modulating protein
MSEKPLTKTDYLMRLRRCQTIDTLERVIEKNKYELSDNELAVFYSAADHR
LAELTMNKLYDKIPSSVWKFIR
>ECs4666 putative fimbrial protein
MKKILSGLILLLCCPYGFAANGDGATHMSNLSFGPLTVAAANNHSGYNIF
EALSNTTGTYPVRCHCDDTHGGPGQQTAFFPIFYTGDAAPGLVLERTLNG
LNYYALNDYLSVGVTIFIINNQYAAIPFEHLSNQSTSPQHTCGAGNNGST
VNLDSGRSAKLSFYVRHSITGTVTIPTTEVAWLYAGMSDHFPKTTPVSKV
TIRGQLTAPQNCELTPNQSIDVDFQKINSAEFSSTAGSIIAERKIKTEVT
VSCTGMEDVRSTEVVSASMIAANRSADATMIVTSNPDVGIKIFDKNDRPV
NVDGGNLPADMGAISRLGKTDGSVTFYSAPASLTGAKPAPDNGFTATATL
VIEFTN
>ECs4253 hypothetical protein
MSKKQSSTPHDALFKLFLRQPETARDFLAFHLPAPIHALCDMKTLKLESS
SFIDDDLRESYSDVLWSVKTEQGPGYIYCLIEHQSTSNKLIAFRMMRYAI
AAMQNHLDAGYKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPKLARQLY
ASAFPLIDITVMPDDEIMQHRRMTLLELIQKHIRQRDLMGLVEQMACLLS
SGYANDRQIKGLFNYILQTGDAVRFNDFIDGVAERSPKHKESLMTIAERL
RQEGEQSKALHIAKIMLESGVPLADIMRFTGLSEEELAAASQ
>ECs3710 hypothetical protein
MTNPIGINNLSQSSNIANATGDEVVSLDKHINTSATDKVQIQAFIVSTWM
ASFQNDMYSEDNPISPYHKIEW
>ECs1884 phage shock protein
MNTRWQQAGQKVKPGFKLAGKLVLLTALRYGPAGVAGWAIKSVARRPLKM
LLAVALEPLLSRAANKLAQRYKR
>ECs1334 hypothetical protein
MYHHKGILFQATRILQSPEGVSLRLLKHWLAVTRRRPSQTAGTLAGRKSD
S
>ECs5266 hypothetical protein
MEWLFRQTTQTWGAERYLKDDWHGLQLFAIDGAQFRTPDEPELREYYGSA
NTSTERQSAYPVMRLVALMNLGITFY
>ECs2546 hypothetical protein
MESTKMKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWD
AETSIPALLNGEHSVLYRTRYDQQSDAWIMRLA
>ECs1949 hypothetical protein
MLNTQKAINAEKYNEWARKFSEQIFKITGDENAAKNELEPWTPEGADPNY
CWREVDPVDAANEAMSYHND
>ECs4520 DNA-damage-inducible protein
MEWAMNEHHQPFEEIRHYGTEGQEFWSARELAPLLDYRDWRNFQKVLARA
TQACEASNQAASDHFVETTKMVVLGSGAQRELEDVHLSRYACYLVVQNGD
PAKPVIAAGQTYFAIQTRRQELADDEAFKQLREDEKRLFLRNELKEHNKQ
LVEAAQQAAVATATDFAIFQNHGYQGLYGGLDQKAIHQLKGLKKSQKILD
HMGSTELAANLFRATQTEEKLKRDGVNSKQQANTTHFDVGSKVRQTIQEL
GGTMPEELPTPQVSIKQLENSVKITEKK
>ECs1973 hypothetical protein
MMPGPGISVMWHSRVRRCWFAWGRRSTSQRDVRQVRKRNVMRSMANVEPF
AANPKKPEIRT
>ECs3238 hypothetical protein
MSYALIDNASLTAVERTLGDILVKNPDTINGDLVAFENLIQAILFYDTLI
CVDNYKKEYRDKRIAKFDFIKFVSESDFQLSELDQLAQVESRQITSEIRG
GEFVDDDFRQLVEMLKLNMICTWDLRSSVYYLTMKMLGQPGTPEYAKYSE
LSASIFNELSDASDTKGYWSTDVKLVSSSGHEYTEREFELEREKSTRGKG
GMTRSLEMFIASLNWLAYKSIYYSVIAKHFKADSFIHPIRHAYQLHWMKK
TGAFGHDYTARLVQSLSDKISTARSEIVDHGRTSTISLDLPIFSAWLANE
TGNVSQVINSALELRTHDHFRTCRDVIREIHVTYDESGISAGNKKVTKLL
SDLDKISGDIKRSYGVPSNQGIQGSFLIKCINSLTGLVGIPGLPDKEFAL
STPEFMKSKQHKAFSTVFKDVTNELTSIERLGGLRDKLASNFTIDDSYYT
EPKTEDPRFRYVSSNWKQPM
>ECs1202 hypothetical protein
MTFTVKTIPDMLVEAYENQTEVARILNCSRNTVRKYTGDKEGKRHAIVNG
VLMVHRGWGKDTDA
>ECs2880 hypothetical protein
MSWRLVYASAVGTSHISADLPCQDACQMQVAWLNDQQPLLVMFLADGAGS
VSQGGEGAMLAINEAMAYVSQKVQHGEFGLNDILATDIVLTVRQRLFAEA
EAKELAVRDFACTFLGLISSANGTLIMQIGDGGVVVDFGHGLQLPLTPMV
GEYANMTHFITDEDVVSRLETFTSTERAHKVAAFTDGIQRLALNMLDNSP
HVPFFTPFFNGLAAATQEQLDLLPELLKQFLSSPAVNERTDDDKTLTLAL
WLP
>ECs3498 hypothetical protein
MTFKHYDVVRAASPSDLAEKLTHKLKEGWQPFGSPVAITPYTLMQVITAE
GDVVVSGATEPDWYYVIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLAR
RSTVTPGGAACRYNDIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQG
LHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQDSARW
GVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSAATHAQQPALF
TAMLAQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYGTQYNTIYG
AYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGN
QVSSNRPTHFSSWARRSIIPDRMATAILNAAGRTSAFISGKAPEIKPSPG
GNTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQS
NKTWSLTHPVDDAITLLTQGGRLTCKFRLSGALTNNQFGLGIYLYTDAPV
PDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQ
TLELVFTAGSATVTPKLNGVAGPAFQVIKDSMTLGLNALTLTDVTKNAAY
GVEIESLVLEINAPASS
>ECs0334 hypothetical protein
MNKNLILAFALFSLPVFAEEDLGPGKYVCDIRISSLDTATQILSKSATVL
DNGNNFIVQMPNGDQLYSPDLENVDDGIKQKATIGGVTFIRRPTFNDRFI
VEDGNTGFFYKMRNCEKK
>ECs5541 hypothetical protein
MEVAIGDLVGFVFNNIYTSTALQGRQWKIFQQGKIEGLITFCTVLQVTKN
PAGARFRQPPAAGNRKAHCIMSTQKVYVFRAKRAISARILTYLVHS
>ECs1785 antirepressor protein
MNMMAVPFHGNSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKLRQRFAS
TITEIVMVAEDGKQRNMVSMPLRKLAGWLQTINPNKVKPEIRDKVIRYQE
ECDDVLYEYWTKGFVVNPRKMSVMEELNQACADMKRDKNIASVFATGLNE
WKQVKAAHVSKIRTLVNEANMLIDFVLADTGKGKITKAD
>ECs1980 major tail protein
MSALYERSQLTQVMISSAPATAETMDKAEYLRLDCTIKEVQFTAGQKQDI
DVTTLCSTEQENINGLGASSEISMSGNFYLNQAQNALRDAYDNDTVYAFK
VQFPSGKGFKFLAEVRQHTWSSGTNGVVAATFSLRLKGKPVSYVVPLAFV
KNPEKTLTVNTGALLTMSVSVNGGTPPYKYAWKKDGQPVEGQTTDTFSKA
NAQSGDKGAYTCMVMDSAEQPQSITSDACTVTVNGAGG
>ECs0465 hypothetical protein
MNINVFRLILLGSLFSLSACVQQSEVRQMKHSVSTLNQEMTQLNQETVKI
TQQNRLNAKSSSGVYLLPGAKTPARLESQIGTLRMSLVNITPDTDGTTLT
LRIQGESNDPLPAFSGTVEYGQIQGTIDNFQEINVQNQLINAPASVLAPS
DVDIPLQLKGISVDQLGFVRIHDIQPVMQ
>ECs3669 hypothetical protein
MMKKTAAIISACMLTFALSACSGSNYVMHTNDGRTIVSDGKPQTDNDTGM
ISYKDANGNKQQINRTDVKEMVELDQ
>ECs1212 putative holin protein
MYQMEKITTGVSYTTSAVGTGYWFLQLLDRVSPSQWAAIGVLGSLLFGLL
TYLTNLYFKIKEDRRKAARGE
>ECs2704 hypothetical protein
MAFHLHLRHTQLPKLIDTPPYPAPPGTITSTSDSSSQHPNAFLYELIRNK
HASHIEIFAKIVHEISFFSTNPRYKYSSAPQYSTALNLSPPPRTSDNGLK
SLRPLATKLLNKENLA
>ECs3236 putativa replication protein
MEWAQHQSEPHVHAIIMVNERSINGQAGKIVSKWVTLANNAIPSNVPPQW
DLDLKGQQYSEPTKCIGKQIDYMAKPETKLNDFNREASKSLYDWSDANVW
GYSNGWARHEIKKETLSITGFHAVRRILKAVQASRKDNYKAKRQIQRTLK
RQDINRSPITPFQAPLISKHDWDLSVIAVTKEAETERRELLSSQIINSNK
VQTHPLLRMLRRARVRASRAKRESGGYDHYY
>ECs0027 hypothetical protein
MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGSQFLPVICAMTMTMKSHFF
LISVLNRRLTLTAVQGILGRFSLF
>ECs0403 2,3-dihydroxyphenylpropionate 1,2-dioxygenase
MHAYLHCLSHSPLVGYVDPAQEVLDEVNGVIASARERIAAFSPELVVLFA
PDHYNGFFYDVMPPFCLGVGATAIGDFGSAAGELPVPVELAEACAHAVMK
SGIDLAVSYCMQVDHGFAQPLEFLLGGLDKVPVLPVFINGVATPLPGFQR
TRMLGEAIGRFTSTLNKRVLFLGSGGLSHQPPVPELAKADAHMRDRLLGS
GKDLPASERELRQQRVISAAEKFVEDQRTLHPLNPIWDNQFMTLLEQGRI
QELDAVSNEELSAIAGKSTHEIKTWVAAFAAISAFGNWRSEGRYYRPIPE
WIAGFGSLSARTEN
>ECs1314 hypothetical protein
MNLSASMNDAGDGFELLMSGNMQDFNICCRRDNHYPEHKLIPLSELFDVN
YGLNLELNKLEKDSSGINFVSRTSKNNGVSARVKLKDGISPCQQLC
>ECs2844 O antigen polymerase
MKSAAKLIFLFLFTLYSLQLYGVIIDDRITNFDTKVLTSIIIIFQIFFVL
LFYLTIINERKQQKKFIVNWELKLILVFLFVTIEIAAVVLFLKEGIPIFD
DDPGGAKLRIAEGNGLYIRYIKYFGNIVVFALIILYDEHKFKQRTIIFVY
FTTIALFGYRSELVLLILQYILITNILSKDNRNPKIKRIIGYFLLVGVVC
SLFYLSLGQDGEQNDSYNNMLRIINRLTIEQVESVPYVVSESIKNDFFPT
PELEKELKAIINRIQGIKHQDLFYGERLHKQVFGDMGANFLSVTTYGAEL
LVFFGFLCVFIIPLGIYIPFYLLKRMKKTHSSINCAFYSYIIMILLQYLV
AGNASAFFFGPFLSVLIMCTPLILLHDTLKRLSRNENISYNCDL
>ECs5433 hypothetical protein
MVMTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAE
KDQMAHEL
>ECs0582 putative carboxylase
MTIIHPLLASSSAPNYRQSWRLAGVWRRAINLMTESGELLTLHRQGSGFG
PGGWVLRRAQFDALCGGLCGNERPQVVAQGIRLGRFTVKQPQRYCLLRIT
PPAHPQPLAAAWMQRAEETGLFGPLALAASDPLPAELRQFRHCFQAALNG
VKTDWRHWLGKGPGLTPSHDDTLSGMLLAAWYYGALDARSGRPFFACSDN
LQLVTTAVSVSYLRYAAQGYFASPLLHFVHALSCPKRTAVAIDSLLALGH
TSGADTLLGFWLGQQLLQGKP
>ECs5110 hypothetical protein
MEGKNKFNTYVVSFDYPSSYSSVFLRLRSLMYDMNFSSIVADEYGIPRQL
NENSFAITTSLAASEIEDLIRLKCLDLPDIDFDLNIMTVDDYFRQFYK
>ECs5406 hypothetical protein
MVTVAELQALRQARLDLLTGKRVVSVQKDGRRIEYTAASLDELNRAINDA
ESVLGTTRRRRRPLGVRL
>ECs4316 putative receptor
MSKPPLFFIVIIGLIIVAASFRFMQQRREKADNDMAPLQQKLVLVSNKRE
KPINDRRSRQQEVTPAGTSMRYEASFKPQSGGMEQTFRLDAQQYHALTVG
DKGTLSYKGTRFVSFVGEQ
>ECs5398 hypothetical protein
MIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDF
EQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHP
KRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPV
NDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKAL
GFLKQKATEQKVAA
>ECs5320 putative glycoprotein/receptor
MMMKTVKHLLCCAIAASALISTGVHAASWKDALSSAASELGNQNSTTQEG
GWSLASLTNLLSSGNQALSADNMNNAAGILQYCAKQKLASVTDVENIKNQ
VLEKLGLNSEEQKEDTNYLDGIQGLLKTKDGQQLNLDNIGTTPLAEKVKT
KACDLVLKQGLNFIS
>ECs0951 hypothetical protein
MPCWEEGFDQFRKSSHCVKNSYISLYNYLFRNRILVSLWSTAYLPYYVVP
ATLMSGKSQLIL
>ECs5581 hypothetical protein
MKVGVIYETVAVMFIDDDQSGQPATYCLNPQLTGAITNDIVVIANKFRRA
DDSGCDLLSVPHDRISLCMWGALWRSEM
>ECs4474 hypothetical protein
MFLNYFALGVLIFVFLVIFYGIIAIHDIPYLIAKKRNHPHADAIHTAGWV
SLFTLHVIWPFLWIWATLYQPERGWGMQSHVASQEKATDPEIASLSDRIS
RLEHQLAAEKKTDYSTFPEI
>ECs0339 hypothetical protein
MQVFFHCEYPYLYNTAVAVELLSIRIDVTFQDEALYFLAISAMASSRTVG
DADTDETLICFRIILSSALVENAITPDNSVANKTDFLNI
>ECs2519 hypothetical protein
MQIKVIYSLIDNMVNFKDKNMPAVIDKALDFIGAMDVSAPTPSSMNESTA
KGIFKYLKELGVPASAADITARADQEGWNPGFTEKMVGWAKKMETGERSV
IKNPEYFSTYMQEELKALV
>ECs1960 hypothetical protein
MTVKLRLAVAALLLFLVVMVDFTSRIMSVLADGVLVCGIVVLLWPVIKRN
SLHNA
>ECs1303 hypothetical protein
MCLLAPENPYPIYALPPLVRNAIIETQKNTQAPLAMVATSALTAISIACQ
NQIDVCRPGNLHGPVNLYSLILADSGERKTTVDKVFMKAFYLRDEALADE
YAKLVENYSTEKEIWEQKQKALESKFHKEIRAGKDYKATESELETHLNKP
PVPPQIRRTIFNETTIEGMLKYYSDSNRSFALVSSEGGVIFDSRAMSKLG
IINTLWDGGSLFIDRKSSPGINLKEPRLTISVMIQPDVYHKGFCTRKKEI
VKTSGHHARFLMCQPTSTQGTRIMINELIDESLAMSGERRCLHFSPQAAR
IWTDYYNDVESKLGGLGPLRHCREYAAKNAEYMARLAGLIYHSSGEEGEI
SPYTAEMARELAIWYGNEYVRLSNPLTFDNSALTVPVRLIPEELELFNWI
KSYCIEKGILCMKKNDILQRGPNRFRKKDKINWLLDLLYEQNRVVPVIEG
KTLCVAPNFDL
>ECs1545 hypothetical protein
MTALLTLEEIKAHLRVDHDADDEMLMDKVRQATAVLLAYIQGSRDKVISE
DGELIPGEALTRMKGAAMRLTGMLYRNPDLAEREDLVQGELPFSVSVLIY
DLRCPTVL
>ECs0142 putative fimbrial protein
MENSMKRILLTSALIGLGLPAVGSATDLNVDFTATVLATTCTITIVEDGG
PAVTGSNDEYSLTIPDVGLDKVATAAPEAQANFKLKASDCSNGYSKIFTT
LTGTTVSGKLIVNEATSGAAGVGMGIKRRDTADSTFFTPNNTDKFEWSAD
EKASGVPLTVALRETTAGAGRTGAFQAKATFNFTYE
>ECs2168 minor tail protein
MFLKTEQFEYNGVSVTLSELSALQRFDYIKFVSDAEQQETTKHDVVHINQ
RYLETASLLVAMSLWHSHSLKGTLASPETEMQQIRREVMLGWPADALNQA
TNRVLYLSGMLDNRHDADPEQTGKAEATESVTSKKHSKAS
>ECs2763 hypothetical protein
MANAWLRLWHDMPNDPKWRTIARVSGQPIATVMAVYIHLLVSASRNVTTC
HGVSLRGHIDVTTEDLASALDVTEDVIDSILHAMQGRVLDGDLISGWEKR
QVMKEDNGNVSQTAKSPAERKRAQRERERLRKQNTDCHDESRRVTHMSRQ
ITTDTDTDKELNPTHNARMRESATTGELNDASLQTAEPEYLDGLSEPIGK
FPMTGVWQPSPDFRQRAAVWGMALPEPEFTPAELAAFRDYWMAEGKVFTQ
VQWEQKFARHVQHVRAQVKPVSKGVSHAAPGGTASRAVQEIRAAREQWER
ENGFISNGNGLEAVGTYGGGVFEPLDPEERGCAVEALDCSDWRDD
>ECs1220 putative terminase large subunit
MTFRKNEPRCDEPSEMTEAEQRLFIMTKLSNPWWRLNHLYKIQNEKGELV
TFRMRPAQRQLFRSMHNKNIILKARQLGFSTAIDIYLLDQALFIPHLKCG
IVAQDKQAASEIFRTKIAVPFDHLPDWLRASFTIVERRSGASGGYILFGH
GSSIQVATSFRSGTVQRLHISEHGKICAKYPAKAKELRTGTLNAVSDECI
IFDESTAEGVGGDFYEMSNRAQEITASGLLLTAQDYKFHFYAWWQDPKYS
ARVPESGLKLSREKMTYFSAVEKAMNITLTDEQKQWYINKETEQREEMKQ
EFPSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYDIEPVTGAKTKAQ
SLREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSLDV
VKRSNGEQVAHWFGHLDAELFAHLISQVCRMYNNAFVGPERNNHGHAVIL
KLRELYPTRYIYNEQHLDQAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNN
GISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFDDQLMSYMIAQEMRARMP
VRVKQKTDKRRTTHWMAH
>ECs2633 putative phage replication protein
MADAAFSATPLGNLINKSLDAQEKQDKTITLAGDARKQARGAVDEAMASL
RLLPSYLRDPLIRHLSFLRKKQEADRQKGKKSWQAERYARGNLRKIFERL
ERTDHRWLTQGYRSLAGRERLDDLLYLPQLNKHQIQTLATMTAAMFSSTF
EKLCDGFGATDGELTMDVTLKAYQMLARMALHLHAMPPHYDALTTDKDRR
NEPDTELLPGAILRLTCAEWWKRKLWLLRCEWREEQLRAACLVSRKTSPY
LSQDALSEFRAQREKTRDFLKSFMLENEDGFTIDLETVYYAGVSNPVHRK
AEMMATMKGLELLAEARGDKAVFLTVTCPSKYHATTENGHPNPKWNGATM
RDSSDYLVNTFFAAVRKKLNRDGLRWYGIRTVEPHHDGTVHWHMMVFAHP
EEIDTIVSHTRDIAIQEDRHELGDDITPRFKAEYVDGSKGTPTSYIATYI
GKNLDSRAVDGIDPKTGKPRVDHETGKSMTESVERAIGWARLHRVRQFQF
FGIPSRQVWRELRRLASQMARNPEGPQRLKDDAMDAVLAAADAGCFTTYI
EKQGGVLVPRKDYLIRTAYDLADELNDYGEQSVQIYGIWSPLIGESSRVC
THPDNWKLVRRKPGVEDSARENGFDLQGGPAAPWTRGNNCPRVQETDNNG
TEQPEERPAPWPQLPDGVDVNEWMRSLKRHERRALMRSLRDKQAKNSSDE
MQSWTQSRKQQRPLPDNHELLAKEWRESAESLGLHIGEQQMQHLLRGGSL
YVDGSIIAPQGFEIVRKPDTRPDSRITQLWQRLSRNHGVSSTEIRHNPVA
SYLAQLGASDPEAAARLASTLQQDQNTMKTPVTVLSDMLRAIRDAEHAQR
ISETTERASRKADLLRGGLTSGNKKQTETGLTNPVNEQKTRRDI
>ECs2200 hypothetical protein
MKHYLEKNYPRKSRTTEFLFFILFIVLMIPISPLLLVWIIGRTFEPVIEL
YTDVTWESFSALHNKINPYKEN
>ECs2750 antiterminator
MNNHYLQFVRELLIIATADLSGATKGQLEAWQENAMFDTGRYRRKKIRYR
DEVTGKMITRDNPPIPGKQSLAKVTSIPLVSPVEFSTSSWRRAVLSLEEH
HKAWLLWCYSGSICWEYQIAITQWAWNEFNTQSGTRKIAGKTQERLKKLI
WLAAQAVKAELFGGEGYEYKELALLVGVTTKNWSKTFTRHWVAMKHIFHR
LDSEALLFVMRTRSKQKAAFSKQSVAKVD
>ECs5125 hypothetical protein
MASSSLIMGNNMHVKYLAGIVGAALLMAGCSSSNELSAAGQSVRIVDEQP
GAECQLIGTATGKQSNWLSGQHGEEGGSMRGAANDLRNQAAAMGGNVIYG
ISSPSQGMLSSFVPTDSQIIGQVYKCPN
>ECs2211 hypothetical protein
MEFKDLPPSIQEIAAHTLRHRLNELELESVTKKDTDNMARNVRDAFTGLY
FCASINKHDSESVANKIAETTAQNINTKPTEEEIDQFAHDAGLKNKKEKS
PYAGNMFVYDNLIRIRGEIPAEYLARVHQALLKNLETELFDGNTNGFFMV
SGLEKDWDAEKRWNVATWLFSNRAAALEASACICGLFLTDHKYNLDVYSY
IYAEHGPLWIDW
>ECs4835 2-keto-3-deoxygluconate permease
MMEMQIKRLIEKIPGGMMLVPLFLGALCHTFSPGAGKYFGSFTNGMITGT
VPILAVWFFCMGASIKLSATGTVLRKSGTLVVTKIAVAWVVAAIASRIIP
EHGVEVGFFAGLSTLALVAAMDMTNGGLYASIMQQYGTKEEAGAFVLMSL
ESGPLMTMIILGTAGIASFEPHVFVGAVLPFLVGFALGNLDPELREFFSK
AVQTLIPFFAFALGNTIDLTVIAQTGLLGILLGVAVIIVTGIPLIIADKL
IGGGDGTAGIAASSSAGAAVATPVLIAEMVPAFKPMAPAATSLVATAVIV
TSILVPILTSIWSRKVKARAAKIEILGTVK
>ECs0140 putative fimbrial protein
MLCRHHKIVHFLGLATALITPFAYSGQDVDLTAKIVPSTCQVEVSNNGVV
DLGTVTLDYFADNVTPTTDYAGGKTFNVNVVSCDNIQTTQSQMKLDFQPQ
AGSLAQANNQIFSNEYEQQATGAKNVGIVIFSAQPNQQTFNVRGTDGSST
AIYSVAPGNAVPSTWTFYSRMQRVNNALPPESGMVRSQVIVNVSYE
>ECs0804 hypothetical protein
MKKDSYPYLICMTVLGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMG
IAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTY
FLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDN
IIYVEKKINTLGAVVTYAQTARDDTE
>ECs4723 hypothetical protein
MQAKIAASNTGELDALQQLGFSLVEGEVDLALPVNNASDSGAVVAQETDI
PALRQLASAAFAQSRFRAPWYAPDASSRFYAQWIENAVRGTFDHQCLILR
AASGDIRGYVSLRELNATDARIGLLAGRGAGAELMQTALNWAYARGKTTL
RVATQMGNTAALKRYIQSGANVESTAYWLYR
>ECs5245 hypothetical protein
MEILLVKRQNMKIKIYQEKGHQLPHIHIDYGKQAHAASYAIQSGDRIEGD
LPKKYDSDVSSWLGRNRDKILEIWNALQAGAPYEPLIAELTGGV
>ECs3497 holin protein
MYQMEKITTGVSYTTSAVGTGYWLLQLLDKVSPSQWVAIGVLGSLLFGLL
TYLTNLYFKIKEDRRKAARGE
>ECs1799 hypothetical protein
MTEADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVM
GGQAESSVSVQIDVYAGTVTQARQIRQDAREAIMLLAPGSVSEMQDYIPE
NRCYRATLEFQVTV
>ECs2980 NinE
MRRQRRSITDIICENCKYLPTKRSRNKRKPIPKESDVKTFNYTAHLWDIR
WLRHRARK
>ECs1178 regulatory protein cIII
MQYAIAGWPVAGCPSESLLERITRKLRDGWKRLIDILNQPGVPKNGSNTY
GYPD
>ECs0021 hypothetical protein
MKWLLLITLSLYSFIVQSAPCGLTNVGEQRGTYILKPLSMKGNLTAEKIE
NTGNIIECADLQETTNAIKVHFLTQNALRFKVGSRHYIINFPFESDGKKN
SASREVIL
>ECs0549 hypothetical protein
MNEFEKIFNEMNLDRALLPILFRSNRSTVWKYLSGDSTAPASAMSLIMLL
QLIQKRNPDLLAEWLTLSDFTIPPEVYLDQPDYWKGWVYTQHKVNKNVLE
YLKKHYPDEDQKSMSKGREE
>ECs1801 putative tail assembly chaperone
MAKDLKTLALARLSGFRHKTVKVPEWRNVSVVLREPSAEAWYLWQEVLNG
DGEDDDTLSVVAKTRRNLEADVTLFCDVLCDTDLQRVFTPDDREQVLAVY
GPVHARLLRQALELIADAESARKK
>ECs1764 hypothetical protein
MQKRDPVIIAPDYTDDELYEWMHQKIKAVQDLKWANEARAKQAENLSALE
QDITNLEKAAALSIARMITYPR
>ECs5584 hypothetical protein
MKRHSTLFLFTLLTLTTVPAQADIIDDTIGNIQQAINDASNPDRGRDYED
SRDDGWQREVSDDRRRQYDDRRRQFEDRRRQLDDRQHQLNQERRQLEDEE
RRMEDEYGQ
>ECs3732 EivE
MAIHVEHVGVLERAREVSRLEDIITEDNEDIEAEMPKMRDDPAGKEARFL
QATDEMSAALTQFMKKKIYEEQLANFLDGEEYVLEDQPIEKTDKVMEALK
AATTHDYEVYSFAKKLFPDESDLVVVLRAILRKKQISENVRLNAEALLRK
VNQETTKKFINSGINSALKAKLFGQALSLNPKLLRASYRQFLMAEDDAVD
TYVEWIGSYGYQNRMLVTKFIKETLFSDINALDASCSSLEFGMFLNKLSQ
LLSLQSAEALFLKTLMNNPIIKKFISAEDYWIFFLISLIKFPETAEELLN
NALVTLPADANYKDKTLLLKAIYSGCTNLPFSLFINNEQLLEIRECCKQA
IKVTFAAELFDTQNCNKKQNKKPWKKVMFNV
>ECs5464 hypothetical protein
MSQTLNADQELVSDVVACQLVIKQILDVLDVIAPVEVREKMSSQLKNIDF
THHPAAADPVTMRAIQKAIALIELKFTPQGESH
>ECs4795 hypothetical protein
MVTINNARKILQRVDTLPLYLHAYAFHLNMRLERVLPADLLDIASENNLR
GVKIHVLDGERFSLGNMDDKELSAFGDKARRLNLDIHIETSASDKASIDE
AVAIALKTGASSVRFYPRYEGNLRDVLSIIANDIAYVREAYQDSGLTFTI
EQHEDLKSHELVSLVKESEMESLSLLFDFANMINANEHPIDALKTMAPHI
TQVHIKDALIVKEQGGLGHKACISGQGDMPFKELLTHLICLGDEEPQVTA
YGLEEEVDYYAPAFRFEDEDDNPWIPYRQMSETPLPEKHLLDERLRKEKE
DAINQINHVRNVLQQIKQEASHLLNH
>ECs1211 hypothetical protein
MSEITSLVTAEAVKDVLRSEEVRSALKQKLRHNLEARLDAEVDAILDELL
GVQAEPPTEAGDTTAESGEVQPESPVADATEPQPESVMML
>ECs0439 hypothetical protein
MPTKPPYPREAYIVTIEKGKPGQTVTWYQLRADHPKPDSLISEHPTAQEA
MDAKKRYEDPDKE
>ECs1273 FidL-like protein
MTQRYFLFAGIILCAFIAAILSHIAFHHANEPAEQNISCNAHVINFTDHT
KMDQYFSLNMISENNTGNMYLTGAYTVDGKKIGFIRRYVSFTYKKFRDTV
YFTTVKIQKIQKDDNISDEILENITSDFFIKVDTSINFYVTRQNDYNYVF
STGRTPRFICISH
>ECs2202 hypothetical protein
MKILYQDYGPVGQVVISSTVMEFRKHNRVVDAVLLTCPGISASRAGVFIM
KTKLYGSKAWIKKAYRVALQEVNSE
>ECs5376 hypothetical protein
MLTVSGSEQVTIARNTSGVILKLSGISPAVYQCIHSTFISEIERVSLEFA
GEVVTMKHESYPYVLYNDSVPIPGTVTLNLIGS
>ECs5456 hypothetical protein
MLDTCRLASYAPKGKEKQAMKQQKAMLIALIVICITVIVTALVTRKDLCE
VRIRTGQTEVAVFTAYEPEE
>ECs1218 hypothetical protein
MSGAMNRHCIPGLTMSTALTGNFSAGVCGDNQKRCTPGFLIFHEMGAISR
EAACPVRWWKKPDKTTALCKYRSNMVLLCEI
>ECs1635 major capsid protein
MSMYTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNM
ALYVSPIVSGEVIRSRGGSTSEFTPGYVKPKHEVNPQMTLRRLPDEDPQN
LADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPV
EVDMGRSAANNITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVF
DPKGWALFRSFKAVREKLDTRRGSHSELETAVKDLGKAVSYKGMYGDVAI
VVYSGQYVENGVKKNFLPDNTMVLGNTQARGLRTYGCIQDADAQREGINA
SARYPKNWVTTGDPAREFTMIQSAPLMLLADPDEFVSVQLA
>ECs1807 putative outer membrane protein precursor
MRKLCAVILSAVVWQVAAATPASAAEHQSTLSAGYLHASTNVPGSDDLNG
INVKYRYEFMDALGLITSFSYANAEDEQKTRYSDTRWHEDSVRNRWFSVM
AGPSVRVNEWFSAYAMAGVAYSRVSTFYGDYLRVTDNKGKTHDVLTGSDD
GRHSNTSLAWGAGVQFNPTESVTIDLAYEGSGSGDWRSDAFIVGIGYRF
>ECs5564 hypothetical protein
MSKACQKVNQTLKNKKRLAITCGYMLGVKKAEEIRLILLTKCALSMPDAA
RTPYPAYKSL
>ECs2653 hypothetical protein
MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEQQLA
LIEDETQAAVFSKTVKQIKQAYRQ
>ECs1165 hypothetical protein
MATLQELIDLTPEQEKAWNRLVKAVKDFRAAGGKFYSVLDTLSAYNGEHV
ASIDNDKGYHTASVYMPSIDAPGLTSWADDWHGITLKDGVEVDKD
>ECs2848 hypothetical protein
MPFKKLSRRTFLTASSALAFLHTPFARALPARQSVNINDYNPHDWIASFK
QAFSEGQTVVVPAGFVCDNINTGIFIPPGKTLHILGSLRGNGRGRFVLQD
GSQVTGEEGGSMHNITLDVRGSDCTIKGLAMSGFGPVTQIYIGGKNKRVM
RNLTIDNLTISHANYAILRQGFHNQIIGANITNCKFSDLQGDAIEWNVAI
NDRDILISDHVIERINCTNGKINWGIGIGLAGSTYDNNYPEDQAVKNFVV
ANITGSDCRQLIHVENGKHFVIRNIKARNITQDFSKKAGIDNATVAIYGC
DNFVIDNIEMINSAGMLIGYGVIKGNYLSIPQNFRVNNIQLDNTHLAYKL
RGLQISAGNAVSFVALTNIEMKRASLELHNKPQHLFLRNIKVMQESSVGP
ALIMNFDMRKDVRGVFMAKEETLLSLANVHAVNEKGQSSVDIDRINHHIV
NVEKINFRLPERRE
>ECs2466 hypothetical protein
MSRALFAVVLAFPLIALANPHYGPDVEVNVPPEVVSSGGQSAQPCTQCCV
YQDKNYSEGAVIKAEGILLQCQRDDKTLSTNPLVWRRVKP
>ECs1522 hypothetical protein
MRVLLRPVLVPELGLVIVKPGRESMSAFHNGRILVEPEPKSMRALPSGVV
PAVHQPLAEDKSLLPFFSDERVIRAAGGAGALSDWLLRHVKSCQWLHGDY
HHSETVIHRYGTGAMVLCWHCDNQLREQTSDSLDQLAQQNLAAWMIDIIR
HAMNGAQERELSLAELSWWAVRNQVADALPEAVLRRSLGLRAEKIRSVYR
ESDIIPGEQTATSILKQRTKNIALPSHTHQQQNPPQEKTVVSIAVDPESP
ESFRKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTK
SHDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFLDHAFATGVLG
>ECs1409 hypothetical protein
MSDITISRPEVVNGHTDVICSTSIRHILAVRKSTLLQIDTLIRQLAEISV
LTESIGGKTAPDWAMKQDFRCGCWLMEKPETAMKAITRNLDREIWRDLMQ
RSGMLSLMDTQARDTWYRSLEYDNFPEISEANILSTFEQLHQNKDEVFER
GVINVFRGLNWNYKTNCPCKFGSKIIVNNLVRWDRWGFHLITGQQADRLA
DLERMLHLFSGKPIPDNRGNITLRLDDHIQSVQGKESYEDEMFSIRYFKK
GSAHITFRKPELVDRLNDIIAKHYPDMLAV
>ECs1364 hypothetical protein
MNENKIKRLEQLLQARQREFATKWGRAALPHERLTYFPLEDLIKNNGKKI
RSDKKHFLECNEDK
>ECs2287 excisionase
MSEVIMIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHD
GMPWDNSPCFYNLEEIDRWIERQASARPRRHLT
>ECs2157 hypothetical protein
MNILKKIMQRLCGCGKHDDRENGELLTAQLRLGPADILESDENGIIPEQD
RVITQVVILDADKKQIQCVVRPLQILRADGTWENIGGMK
>ECs4158 hypothetical protein
MWLLDQWAERHIAEAQAKGEFDNLAGSGEPLILDDDSHVPPELRAGYRLL
KNAGCLPPELEQRREAIQLLDILKGIRHDDPQYQEVSRRLSLLELKLRQA
GLSTDFLRGDYADKLLNKINDN
>ECs4570 hypothetical protein
MNLLVKRNVEEFLRLLGNDFYLFDNRVEIDFNGFSFFIEIIDNNVFVTFA
LEYNENAFFSFFSALAPERTQGVIEHIFVYDNKLCLSCLLTNIDVFFLMN
TFQQHVQIIERVRRMTS
>ECs2382 hypothetical protein
MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRI
QRSELEKQAMETVINALVK
>ECs1618 hypothetical protein
MADLRKAARSRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIA
TIACSACHDEIDRRTHFVDAAYAKECALEGMARTQVIWLKEGVIKA
>ECs1159 hypothetical protein
MPTQEAKAHHVGEWASLRNTSPEIAEAIFEVAGYDEKMAEKIWEEGSDEV
LVKAFAKTDKDSLFWGEQTIERKNV
>ECs2437 hypothetical protein
MRLVKPVMKKPLRQQNRQIISYVPRTEPAPPEHAIKMDSFRDVWMLRGKY
VAFVLMGESFLRSPAFTVPESAQRWANQIRQEGEVTE
>ECs1164 hypothetical protein
MTTFTDKEMIKEIKERIGSLDVRDNIERRAYEIALTALTTEPFATIDTVG
IELVKYGCNTFICPDNSMEPGNVPLYIGLPRIDPASQTAKLSFQEWLSEQ
KEKIDVDCGCVSIETLTHWMKSAYEAGNSPVTPDSWISCSDRMPEKGQNV
LISVNFDSSLVEPLICSARYTGSTFRRGDATIKPGNGIEQATHWMPLPEP
PQEVNRG
>ECs1759 putative exonuclease
MSKVFICAAIPDEQAIKEEGAVAVATAIEAGDERRARAKFHWQFLEHYPA
AQDCAYKFLVCEDKPGIPRPALDSWDAEYMQENRWDEGAASFVPVETESD
PMNVAFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHI
VEALMKMPEVNAMYPELKLHAIGWVKHKCKPGAKWPEIQAEMRIWKKRRE
GERKETGKYTSVVDLARARVHRQHTENSAEKIPPVTAVIRREYKQTWKTL
DDELAYALWPGDVDAGNIDGSIHRWAKNEVIDNDREDWKRISASMRKQPD
ALRYDRQTIFGLVRERPIDIHKDPVALNKYITEYLTTKGVFEDEGTNQSA
TDTLSSPVPETDAVETAIPDNEKTECKVEVEPSVEREGPFYFLFTDKDGE
KYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNANTAQNSEQ
PEPVKVTADEVKKIMQAANISQPDANQLLAASRGEFVAGISDPNDPKWVK
GIETRDSVNQNQQETEQNDQKAEQNSPNTQQNEPETKQPEPVAQQEPEKV
CTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHP
HKENTDGNQHHDSDNETGETADHSIKVNGHQEITSTSRTCDHLMIDLETM
GKNPDAPIISIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLK
QSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVQVWGNGANFDNTIL
RRSYERQGIPCPWRYYNDRDVRTIVELGKAIDFDARTAIPFEGERHNALD
DARYQAKYVSVIWQKLIPNQADF
>ECs3581 transcriptional repressor of hyc and hyp operons
MTIWEISEKADYIAQRHRRLQDQWHIYCNSLVQGITLSKARLHHAMSCAP
DKELCFVLFEHFRIYVTLADGFNSHTIEYYVETKDGEDKQRIAQAQLSID
GMIDGKVNIRDREQVLEHYLEKIAGVYDSLYTAIENNVPVNLSQLVKGQS
PAA
>ECs4951 hypothetical protein
MSKVRVIFEFKHVSHDEKPAGNDCVEVHEKIGVDVKTERDTNNRPTSLCD
VYASILQYHSPEIIQFLSAEFQASVQAFGADAIIKRHRVHKASGTLQ
>ECs4959 hypothetical protein
MDNSSLLFWCLYITSFFGAFVITRWLCRKIICFFDKRHPVERAADALIQQ
AIVLYSGEFFCRITTRDGWHIMIIPPTHHARWDEAEKAFHVRKKVNTV
>ECs3931 glycogen synthesis protein GlgS
MDHSLNSLNNFDFLARSFARMHAEGRPVDILAVTGNMDEEHRTWFCARYA
WYCQQMMQTRELELEH
>ECs0683 hypothetical protein
MDMESQKILFALSTPMEIRNECCLPSHSSPKMYLGTRFFDLSSSWGIDDR
DDLLRTIHRMIDNGHAARLAGFYHRWFRYSPCEWRDYLAELNEQGQAYAQ
FVASTAECCGEGGIKAWDYVRMGFLSRMGVLNNWLSEEESLWIQSRIHLR
ALRYYSNWRQYFAGYTFGRQYWQSPEDDNLPLLREFLARKEYDDSGNDMF
YQLFASDDAYYATLPWQPLADYPTCPETLKDMSDL
>ECs5422 hypothetical protein
MYLHNKNNMRHFCITNKKKQKTSIFIFGQWVFWSIVVIVLLVIIFILPGS
HVITLFGDFICGILPFR
>ECs1232 hypothetical protein
MMPIAILANSIINPLIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEH
SAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVV
DVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAP
TTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTL
RTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLS
YTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEA
YLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGS
KIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQ
WQSQFNPASIVAYPWRGEYIACYTKPDGKQDVFVFSPVNMDIRYLSTPFD
CAWVDLAKDMMRVVTGDKMSVLAGGALPSTIRWHSKIFSLPERTSFSCIR
VKSPAPERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGF
GQVERITLSTSMSEMPV
>ECs2973 Shiga toxin I subunit B precursor
MKKTLLIAASLSFFSASALATPDCVTGKVEYTKYNDDDTFTVKVGDKELF
TNRWNLQSLLLSAQITGMTVTIKTNACHNGGGFSEVIFR
>ECs2170 putative minor tail protein
MKHTDIRAAVLDALELHEHGATLFDGRPVVFDEEDFPAVAVYLTDAEYTG
EELDADTWRATLHIEVFLPAQVPDSELDQWMESRIYPAVTAIPALADLIT
TMVTQGYEYRRDDDMALWSSADLTYSITYEM
>ECs0282 hypothetical protein
MDEYVYSARHNAFFPVDMIDKYKSEGWDLSDAKEVNQNIISEFMAEPPQG
KIRIAGDDGLPAWADIPPPTHEELIEITESERQLLINQANEYMNSKQWPG
KAAIGRLKGDELTQYNLWLDYLDALELVDTSGAPDIEWPTPPAVQAR
>ECs2972 hypothetical protein
MAFKHYDVVRAASPSDLADALAQKIREGWQPYGGPFSSYTDDGAALIQAI
VAEGDVSTPVVVKLTGGEGAVISATSDPGYYFVVVLAGQSNGMSYGEGLP
LPETYDRPDPRIKQLARRSTVTPGGVACKYNDIIPADHCLHDVQDMSRLN
HPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGA
DGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQ
GEFDFGGTPANHAAQFGALVDKFRADLADMAGQCVGGSADGVPWICGDTT
YFWKQKNESSYQTVYGSYKNKTEKNIHFVPFMTDENGVNVPTNKPEEDPD
IPGIGYYGSKWRDSSATWTSQDRASHFSAWARRGIISDRLATAILRHAGR
VALNAGASSTVSEVRPSSPSGAEATGVTTLLSYLASESEGSLKVQGWSAS
GGRAEVVSDAEGTGGKAVKLTKEAGKSSWVLEYAAGNGAALLQKGGQIRC
RFKVSGVLAANQYVMAFYWPVSSLPQGVALTGDGGNNLLAAFYIQTDAKD
LNVMYHNAKVATNNLKLGTFGAFDNEWHTLAFRFAGNNSLQVTPVIDGQD
GTPFTLTQSPVSAFAADKLHVTDITKGATYPVLIDSIAVEVNNTDTAA
>ECs5322 primosomal protein I
MSSRVLTPDVVGIDALVHDHQTVLAKAEGGVVAVFANNAPAFYAVTPARL
AELLALEEKLARPGSDVDLDDQLYQEPQAAPVAVPMGKFAMYPDWQPDAD
FIRLAALWGVALREPVTTEELASFIAYWQAEGKVFHHVQWQQKLARSLQI
GRASNGGLPKRDVNTVSEPDSQIPPGFRG
>ECs3483 putative DNA damage-inducible protein
MRIEICIAKEKMTKMPNGAVDALKEELTRRISKRYDDVEVIVKATSNDGL
SVTRTADKDSAKTFVQETLKDTWKSADEWFVH
>ECs1365 hypothetical protein
MELSSTKGDEARLYRNGSMALMPVCCSIMASLSIRCCSSRKNFASSGDKS
IVITRKAMRCSALRYRADGSYPFY
>ECs3073 hypothetical protein
MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPD
GDNAWSVMYKLSQALGDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNF
GVAIAEPFSNYNPF
>ECs3303 hypothetical protein
MKSLRLMLCAMPLMLTGCSTMSSVNWSAANPWNWFGSSTKVSEQGVGELT
ASTPLQEQAIADAVDGDYRLRSGMKTANGNVVRFFEVMKGDNVAMVINGD
QGTISRIDVLDSDIPADTGVKIGTPFSDLYSKAFGNCQKADGDDNRAVEC
KAEGSQHISYQFSGEWSGPEGLMPSDDTLKNWKVSKIIWRR
>ECs1951 hypothetical protein
MTTITKERIELFIKNPVENGLTRGEQMELARIALASLEAEPVGDFYEYKP
DDW
>ECs2996 putative single-stranded DNA binding protein
MSNIKKYIIDYDWKASIEIEIDHDVMTEEKLHQINNFWSDSEYRLNKHGS
VLNAVLIMLAQHALLIAISSDLNAYGVVCEFDWNDGNGQEGWPSMDGSEG
IRITDIDTSGIFDSDDMTIKAA
>ECs1585 hypothetical protein
MCGLCGLLGYLSVLTIFYPFISACCKTPCLLFAQHVTRKTLKITRKRGFF
CGLRMTVRCYFSYSPAGYNRAFAGYKCPFAGYVRVIDFLYIHTLNTLIHR
ITRITRNFFPHTGD
>ECs4585 hypothetical protein
MTIFNKIDYNYYKLINYPIMMVHDEWLGDLTGVNQVSFRRLRETSSTRNQ
LNKILRQEIHDKISGVELSDINKEGFLYQSIGKIRLLALSSALFDIQCPD
YIFSRLYRETLIREIGYQNVKQLSFYWQGGQCKPEYGEERFCAELIKYGA
GNLEWLFADNPLWTIVKYLLPKSGEIKPTHINDLFLNRLNKILLPYETL
>ECs1249 hypothetical protein
MNINTTITIDTALNTGLALLGYFYIMFCSGRWLSLLFMKKWNKRRKQEQR
QKAIDAFFEAFGIDGMEPGDPARAISRGGVVILVYRSEEKNEQD
>ECs1063 hypothetical protein
MEFKDLLKEIQEIAAHALHQRLNEVELESATKKYIDNMARNVRDAFTGLY
SVSVTNNQNTEETAKRIASVMGFHVEEKYSKKEFWKTTKKLQSENCHLLR
QSLLSMRKVIQMTQDYRNCSHYLKNSELS
>ECs1518 hypothetical protein
MTTITKDRLLTIQHWRETYGPGSNVVLPAEEAEELARIALASLAAVSDER
AAYELFMEKRFGESVDRRRAKNGDRDYMVWDMALGWIIWCHRAAMLQAGN
FRENKGSSTNNFREISETSTNYPVTPDGWISCSERMPDDGQHVIILCDGA
FVLYAQYRDGEFFDVVRNGDEFFETQSRNVTDWMPLPEPPQEVRQ
>ECs0349 hypothetical protein
MPVYLCKTSLSGMLRDPQLLMLWLRLGDESVNASIDGAAFRVGAGVQADI
TKNMGAYASLDYTKGDDIENPLQGVVGINVTW
>ECs3800 putative transporter
MARSHFSSQALVLIVISIAINMIGGQLASMVKLPIFLDSIGTLISAVLLG
PVIGMLTGLLTNLLWGLLTDPIAAAFAPVAMVIGLVAGWLARAGWFRTLP
KVVVSGVIITLAVTVVAVPLRTALFGGVTGSGADLFVAWMHSMGQNLVES
VAITVIGANLVDKTLTAVIVWLLLRQLPIRTTRHFPAMAAVR
>ECs0806 hypothetical protein
MTPQQENALRSIARQANSEIKKARQPFPDKNVDDICRSVLKKHRETVTLM
GFTPTHLSLAIGMLNGVFKER
>ECs5259 hypothetical protein
MKTPVNPLLQWLNMFFSRRSLSGADGRALYAYRCTDTEYESLAELLRTYA
PRSYPRTIFISYSDVLFSIYAAEFIRRTHTVGHPKWDTILDSINWKVPYV
HRQKLVNDGIRYWKRKIRNLGQASGYLHTLACEGGLPIRMIENESGYLIT
YFRRIYQALRGQSSQYPAAKIAQELGDTIPVTMQNELVYEIAGEFCETLC
RLLSEHPPHSSDPVSALRKLSPDWHLQLPLVLPEANAAEIVRRLLSQSSE
IRSASSLQVERIWVDVDDSWYCDARFRFPATMRTEQLTSLFECHIQPEQT
RLIISGKWKNGGARLAMLSRYEQQDWRVELLPIAMQKLSGADAMAEISLS
LHEGPILLGHTIPKGGYELTEELPWVFEAMNESESQLKLVGMGSVSSRLN
ALFISLPKNSHLDISGEGEFDIPRLLKNSERSLTKISGVFSVVLHDGAVC
TIRTQQLYDSAIEYYIKSTEVELVKSDYPVHRAWPKIGWKKDLQYGIVPE
KELFWRSIRSGNNAWYSVASEMPKGQIEVRRIVNDEVLFSGKVVVLPADF
DINIIPESAQQGIIMLSGITDTRIDKYSNNEKVTLKSDYSQNECAIYYNS
SLMLENTVDLRVSWKDGSNLKLLLPKPVSGGRFVTNDGSVHFDGVASIAH
LHGIDAELLTISCAGRGYLNIELLDENPVAEKFRYLHADLPLLSGRNDKL
QQISLYENYNLLNAMLACAWNSNSTLCVDFYSDRFGKDKATLNIKRYDGS
FIEHDQGLLVDIKNSVVFPANRIDELVVDAISLKNPGLHISLLKKDEFAY
DLSALNVQDSPWLIVGKLDGTARIAPVIKWMLPVLQTNDLLLNALCEADP
EQRKKNFNELIFEIDNNPLQNYCCLLTEYIKKYKMNNGLSLLDLDLFRGI
SSNYRVVVQLLISSCLSGDSDTIYDIQEELPFSWGWIPVSIWKDVFQKCW
TYLEKQINDKTLALHILQPFIAFMNHRAHIDRRLAPIANMLLTYSESLPT
GCDVLPTVSREQFNEAKQMLLRNPDSFGRISIFPKELWSSAITPELKSVF
NKLWIEDKYHSRLEKRFNLMLVAALLTQKDNNLIHQLSALFEFHYQQAPQ
QLGVIYQYYFEQAGVCH
>ECs2156 hypothetical protein
MPLTSDIRSHSFNLGVEVVRARIVANGRGDITVGGETVSIVYDSTNGRFS
SSGGNGGLLSELLLLGFNSGPRALGERMLSMLSDSGEAQSQESIQNKISQ
CKFSVCPERLQCPLEAIQCPITLEQPEKGIFVKNSDGSDVCTLFDAAAFS
RLVGEGLPHPLTREPITASIIVKHEECIYDDTRGNFIIKGN
>ECs5020 periplasmic protein of mal regulon
MKMNKSLIALCLSAGLLASAPGISLADVNYVPQNTSDAPAIPSAALQQLT
WTPVDQSKTQTTQLATGGQQLNVPGISGPVAAYSVPANIGELTLTLTSEV
NKQTSVFAPNVLILDQNMTPSAFFPSSYFTYQEPGVMSADRLEGVMRLTP
ALGQQKLYVLVFTTEKDLQQTTQLLDPAKAYAKGVGNSIPDIPDPVARHT
TDGLLKLKVKTNSSSSVLVGPLFGSSAPAPVTVGNTAAPAVAAPAPVPVK
KSEPMLNDTESYFNTAIKNAVAKGDVDKALKLLDEAERLGSTSARSTFIS
SVKGKG
>ECs5032 hypothetical protein
MLELLFVIGFFVMLMVTGVSLLGIIAALVVATAIMFLGGMLALMIKLLPW
LLLAIAVVWVIKAIKAPKVPKYQRYDRWRY
>ECs2667 hypothetical protein
MMKKLAIAGALMLLAGCAEVENYNNVVKTPAPDWLAGYWQTKGPQRALVS
PEAIGSLIVTKEGDTLDCRQWQRVIAVPGKLTLMSDDLTNVTVKRELYEV
ERDGNTIEYDGMTMERVDRPTAECAAALDKAPLPTPLP
>ECs3421 putative alpha helix protein
MRHIFQRLLPRRLWLAGLPCLALLGCVQNHNKPAIDTPAEEKIPVYQLAD
YLSTECSDIWALQGKSTETNPLYWLRAMDCADRLMPAQSRQQARQYDDGS
WQNTFKQGILLADAKITPYERRQLVARIEALSTEIPAQVRPLYQLWRDGQ
ALQLQLAEERQRYSKLQQSSDSELDTLRQQHHVLQQQLELTTRKLENLTD
IERQLSTRKPAGNFSPDTPHESEKPAPSTHEVTPDEP
>ECs5291 hypothetical protein
MLIASVFHWRSFVVLISSVLLLAALITCARVKNWMFRQQPRRKAMSNKMP
YRLRMAKTRWRIK
>ECs0141 putative fimbrial protein
MKLKVIATLIATVAVGVSFNSNFASASTTSASLTVNSNLTMGTCSAQIMD
NSNKVINEVVFGNVYISELGAKSKVQQFKIRFSNCSGLPQNSAQIVLAPN
GISCAGSQSSSAGFSNKFTDASAATRTAVEVWTTDTPESNGSTQFHCAQK
IPVPVTLPADTTTQPYDYPLSARMTVAEGRLVTDVRPGNFRSPTTFTITY
Q
>ECs2201 hypothetical protein
MSEIKEMPVVRDGYGYWTHPEYEKFCDGREYISTEEFNAWMEENNLQYVL
CFRDEGCADLDACDADISAWEPERPEGDGWFIGSIHDTEDGPVCVWLRNK
AEA
>ECs4618 hypothetical protein
MGIIAQNKISSLGMLFGAIALMMGIIHFSFGPFSAPPPTLESIVADKTAE
IKRGLLAGIKGEKITTVERKEDVDVDKILDQSGIALAIAALLCAFIGGMR
KENRWGIRGARVFGGGTLAFHTLLFGIGIVCSILLIFLIFSFLTGGSLV
>ECs2185 putative antirepressor protein
MNMMAVPFHGNSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKLRQRFAS
TITEIVMVAEDGNQRNMVSMPLRKLAGWLQTINPNKVKPEIRDKVIRYQE
ECDDVLYEYWTKGFVVNPRKMSVMEELNQACADMKRDKNIASVFATGLNE
WKQVKAAHVSKIRTLVNEANMLIDFVLADTGKGKITKAD
>ECs3891 hypothetical protein
MLVTFLLRKRKEKKAKVRQYANSNENDYQFDVVLILLCADFVTCVLEIHS
G
>ECs1710 hypothetical protein
MKRKNASLLGNVLMGLGLVVMVVGVGYSILNQLPQFNMPQYFAHGAVLSI
FVGAILWLAGARVGGHEQVCDRYWWVRHYDKRCRRSDNRRHS
>ECs0213 hypothetical protein
MRKITIILLLTFSCTKGLAFDISMSKECSGVDELEFIDCVKNSFIMSTIT
LKKAEANIKEKIQKWNSDEDEAIRKEAVISLKRLESSNREFIRYRFAQCS
FTLTWGARVGRLAFSRLYPCYTELNKIRAMQLNNAYISDFQNPDEISSK
>ECs2105 hypothetical protein
MHATTVKNKITQRDNYKEIMSVIVVILLLTLTLIAIFSAIDQLGISEMGR
IARDLTHFIVNSLQD
>ECs2891 hypothetical protein
MPLLYLNTRECRWYLMGEGEMKKIAAISLISVFLMSGCAVHNDETSIGKF
GLAYKSNIQRKLDNQYYTEAEASLARGRISGAENIVKNDAAHFCVTQGKK
MQIVDLKTEGAGLHGVARLTFKCGE
>ECs2214 putative cell division inhibitor
METLLPNVNTSEGCFDIGVLLSNREFTEDAINMRKYEPYLLNDNSILSRI
ALLELGIFGERQ
>ECs2192 hypothetical protein
MLKQQDMTETARVVFNELSVTEPATVGEIAQNTYLSRERCQLILTQLVMA
GLADYQCGCYRRIQS
>ECs3531 hypothetical protein
MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIP
VNILIAWPYGMYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAIL
LVVGADWHQIMAAVSSNIVVSMLMGAVYGYFLDYCRRLFKVSRYQQVKA
>ECs2807 hypothetical protein
MTLEADSVNVQALDMGHIVVDIDGVNITELINKAAENGYSLRVVDDRDST
ETPATYASPHQLL
>ECs1204 hypothetical protein
MPASPSGWLFYVRSVKAAMALGRRAIGVELESGRFEQTVREVQNVVSQNG
>ECs2951 putative minor tail protein
MFLKTEQFEYNGVSVTLSELSALQRFDYIKFVSDAEQQETTKHDVVHINQ
RYLETASLLVAMSLWHSHSLKGTLASPETEMQQIRREVMLGWPADALNQA
TNRVLYLSGMLDNRHDADPEPTGKTEATEPVTSKKHSKAS
>ECs1952 hypothetical protein
MDLLTCHLDGVTETYAEGWNACRAALLQGKGEPGKQVRELTMLVKQLVSQ
LKKAKPGCKLPDKVMDYLERSGLISVEDVSR
>ECs0923 hypothetical protein
MIMKNCLLLGALLMGFTGVAMAQSVTVDVPSGYKVVVVPDSVSVPQAVSV
ATVPQTVYVAPAPAPAYHPHPYVRHLASVGEGMVIEHQIDDHHH
>ECs3235 hypothetical protein
MVVFNENKTLFFKLSIVGTWPSGTANRSMQLTFSGSVPDTLVSSRNSATT
TDNILLATFFSVDKDGFLATNGSTLTIQSNGAAFTATTIKIIAEQ
>ECs2198 MokW
MLNTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCE
VRIRTGQTEVAVFVDYESEK
>ECs2377 hypothetical protein
MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALL
RARGVKKSATDHGERIYLYSKAVRLWHWSNALLFVLLLASGLINHFALVG
ATAVKSLVAVHEVCGFLLLACWLGFVLINGVGGNGHHYRIRRQGWLERAA
KQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGL
LCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHET
FKSMVDGYHRH
>ECs2992 hypothetical protein
MNAAQSSLDEGKLVIAYSDKNGSTVGLEFSSVASSQATLLMKACSVAASD
KEKRIVTSVVMDDTEIIQTTSDDGGDDMDKRLAVLEAEVAHIKSSMAGIK
EDTRKISSDSTDAKRDTAVLLQKSLDFDASLSKKPSVDYFEAKFSALETK
IADVKVWMLGVLLASLAMPTIFFLINLYLKKGQ
>ECs5479 hypothetical protein
MRKVLQLHKGNEDKKGWILFEGFPEQVHIILRLKLDRNAFCFPILKSELL
PVNVMYQLHRHRLYILETQKKLLRDS
>ECs3680 prepilin peptidase dependent protein C
MSASLKNQQGFSLPEVMLAMVLMVMIVTALSGFQRTLMNSLTSRNQYQQL
WRHGWQQTQLRAISPPANWQVNRMQTSQAGCVSISVTLVSPGGRAGEMTR
LHCPNRQ
>ECs4854 hypothetical protein
MRYVILLILWIITVFLSHLQHSNLLTGDPGMVGEYIGSVLLGPLLLPVLV
SGILCTFTKKTRNFASFTRGCCWVLGVLLLSNVGNTFRLFTPWHYTFEKA
AISVTVPNRHWNTVSISTDKTIDIRSEDNSVFISAFRLPAGRSADDSLEE
LKKMQRDNLKDQYNEETFQFHDCNAKHFTCKYQDVLINFDGQQKRTISVY
LEDTPRAVGIIALMEPDTADKYRQQAMEIMLSAKNTVK
>ECs1136 phosphoanhydride phosphorylase
MKAILIPFLSLLIPLTPQSAFAQSEPEPELKLESVVIVSRHGVRAPTKAT
QLMQDVTPDAWPNWPVKLGWLTPRGGELIAYLGHYQRQRLVADGLLTKKG
CPQPGQVAIIADVDERTRKTGEAFAAGLAPDCAITVHTQADTSSPDPLFN
PLKTGVCQLDNANVTDAILSRAGGSIADFTGHRQTAFRELERVLNFPQSN
LCLNREKQDESCSLTQALPSELKVSADNVSLTGAVSLASMLTEIFLLQQA
QGMPEPGWGRITDSHQWNTLLSLHNAQFYLLQRTPEVARSRATPLLDLIM
IALTPHPPQKQAYGVTLPTSVLFIAGHDTNLANLGGALELNWTLPGQPDN
TPPGGELVFERWRRLSDNSQWIQVSLVFQTLQQMRDKTPLSLNTPPGEVK
LTLAGCEERNAQGMCSLAGFTQIVNEARIPACSL
>ECs3367 putative outer membrane lipoprotein
MMKFKKCLLPVAMLASFTLAGCQSNADDHAADVYQTDQLNTKQETKTVNI
ISILPAKVAVDNSQNKRNAQAFGALIGAVAGGVIGHNVGSGSNSGTTAGA
VGGGAVGAAAGSMVNDKTLVEGVSLTYKEGTKVYTSTQVGKECQFTTGLA
VVITTTYNETRIQPNTKCPEKS
>ECs2247 hypothetical protein
MTALLTLEEIKAHLRVDHDADDDMLMDKVRQATAVLLAYIQGSRDKVIRE
DGELIPGEALTRMKGAAMRLTGMLYRNPDLAEREELIQGELPFSVSVLIY
DLRCPTVL
>ECs5460 hypothetical protein
MQVNCFFNSEIFYSLVIKITMLWQCKNIARIRLKSFKFTHATNIDLQNIT
LAIFKKLDYLTRKNAK
>ECs0807 hypothetical protein
MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLATSLSQKLEM
MVAKAEADERDLV
>ECs0648 hypothetical protein
MLLPWLFPQTKRNNFAAKVNSENQEKLLILHKEDASASSGQSELSLVQMG
LKQVVASSFWLVFEFQHFKVGCNAARKFPVNGIANAQSQQCGTYRSHNGK
LSIAIGHFCRIHQRAHTHFAIAKIAEFNPAVHCHHVVWHLFWRTHLGAIQ
LCV
>ECs3140 aluminum-inducible protein
MLAFCRSSLKSKKYFIILLALAAIAGLGTHAAWSSNGLPRIDNKTLGRLA
QQHPVVVLFRHAERCDRSTNQCLSDKTGITVKGTQDARELGNAFSADIPD
FDLYSSNTVRTIQSATWFSAGKKLTVDKRLLQCGNEIYSAIKDLQSKAPD
KNIVIFTHNHCLTYIAKDKRDATFKPDYLDGLVMHVEKGKVYLDGEFVNH
>ECs3368 hypothetical protein
MKKVFLCAILASLSYPAIASSLQDQLSAVAEAEQQGKNEEQRQHDEWVAE
RNREIQQEKQRRANAQAAANKRAATAAANKKARQDKLDAEATADKKRDQS
YEDELRSLEIQKQKLALAKEEARVKRENEFIDQELKHKAAQTDVVQSEAD
ANRNMTEGGRDLMKSVGKAEENKSDSWFN
>ECs3674 putative amidase
MSGSNTAISRRRLLQGAGAMWLLSVSQVSLAAVSQVVAVRVWPASSYTRV
TVESNRQLKYKQFALSNPERVVVDIEDVNLNSVLKGMAAQIRADDPFIKS
ARVGQFDPQTVRMVFELKQNVKPQLFALAPVAGFKERLVMDLYPANAQDM
QDPLLALLEDYNKGDLEKQVPPAQSGPQPGKAGRDRPIVIMLDPGHGGED
SGAVGKYKTREKDVVLQIARRLRSLIEKEGNMKVYMTRNEDIFIPLQVRV
AKAQKQRADLFVSIHADAFTSRQPSGSSVFALSTKGATSTAAKYLAQTQN
ASDLIGGVSKSGDRYVDHTMFDMVQSLTIADSLKFGKAVLNKLGKINKLH
KNQVEQAGFAVLKAPDIPSILVETAFISNVEEERKLKTATFQQEVAESIL
AGIKAYFADGATLARRG
>ECs3715 hypothetical protein
MNLALRKIIYAPISYIHPQRVSLNNTPINNPVLRSITNEMILLQYNLSVE
HFNLNSSLIYYINNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDY
LSAIPLEINEKARYKPGIANYHNIITCGFSTLLPYIRQQPLAMQQRFNLL
FPDFVDHILSPLPLASTLLERITFYAKKNRDELDKISCKWCCD
>ECs5441 hypothetical protein
MNEAKEKDLGTYKKSTLKTEKITRGLFSNDEITLIYFSEYSKRIVQEVFV
FNVEDKKVKLKGYRYDSIN
>ECs5451 hypothetical protein
MAKIRYLQGTHDARAGDIRDVAQPCAEVLVRLGKAEYITARRPAGQKKKR
DAEHGECGTFCGEPEKTRNQDVT
>ECs4439 hypothetical protein
MILPGRLRRKGILQACPGLSLSRQTRVCRCALFLGERSKKMATGKSCSRW
FAPLAALLMVVSLSGCFDKEGDQRKAFIDFLQNTVMRSGERLPTLTADQK
KQFGPFVSDYAILYGYSQQVNQAMDSGLRPVVDSVNAIRVPQDYVTQSGP
LREMNGSLGVLAQQLQNAKLQADAAHSALKQSDDLKPVFDQAFTKVVTTP
ADALQPLIPAAQTFTQQLVMVGDYIAQQGTQVSFVANGIQFPTSQQASEY
NKLIAPLPAQHQAFNQAWTTAVTATQ
>ECs5415 hypothetical protein
MSEDVPLPKVNQRYKDDHGALVTLTSVEETRVVFMQDGYPHPCMRPMYNF
LGKFKPEPREETE
>ECs0037 transcriptional regulator of cai operon
MKLHTIDISTILIWPCLIASKRVIARLLICETGVRMCEGYVEKPLYLLIA
EWMMAENRWVIAREISIHFDIEHSKAVNTLTYILSEVTEISCEVKMIPNK
LEGRGCQCQRLVKVVDIDEQIYARLRNNSREKLVGVRKTPRIPAVPLTEL
NREQKWQMMLSKSMRR
>ECs1582 hypothetical protein
MKLAPNVKKQPRGIKHKDTEVIIFAGSDAWAHAKQWQEQDGPASGDNVPP
VWLGEQQLSELDKLQIVPEGRKSVRIFRAGHLAPVMIKAIGQKLAAAGVQ
DANFYPEGMHGQKVENWREYLARERQNLSDGLVIEFPVKKKDTGSHSDDE
LKPRVESRADGVFWVTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHY
RVMRWKKTANHEVITMAIPCGGIGDRDGWRLLKDHGLNVTTNGKYRAILA
DWMQLSGSHEEWQLSTTTGWHFGAYIMPDGSIIGESEKPILFTGKSAAIN
GYSVAGTADGWRDKRGAAGWR
>ECs4553 hypothetical protein
MVDTFNDEVFNYYLEQKGYTIQKEFLCGSAFFIGWRIETPFFSLAYRLDE
QELILCSFEARNQTGLNGPVLSLTHLLEELYHHFSGIKKISAMKSKIGSD
SERQKREELFNYFIRKGAVQQETEDGIWFVMNVNS
>ECs1201 NinG
MAKPARRKCKICKEWFHPAFSNQWWCCPEHGTQLALERRSKEREKAEKAA
EKKRRREEQKQKDKLKIRKLALKPRSYWIKQAQQAVNAFIRERDRDLPCI
SCGTLTSAQWDAGHYRTTAAAPQLRFDERNIHKQCVVCNQHKSGNLVPYR
VELINRIGQEAVDEIESNHNRHRWTVEECRAIKAKYQQKLKDLRNSRSEA
A
>ECs0818 holin protein S
MYQMEKITTGVSYTTSAVGTGYWLLQLLDKVSPSQWVAIGVLGSLVFGLL
TYLTNLYFKIKEDKRKAARGE
>ECs1642 minor tail protein
MFDGELSFALKLAREMGRPDWRAMLAGMSSTEYADWHRFYSTHYFHDVLL
DMHFSGLTYTVLSLFFSDPDMHPLDFSLLNRREADEEPEDDVLMQKAAGL
AGGVRFGPDGNEVIPASPDVADMTEDDVMLMTVSEGIAGGVRYG
>ECs4578 hypothetical protein
MIMKDGIYSIIFISNEDSCGEGILIKNGNMITGGDIASVYQGVLSEDEDI
ILHVHRYNYEIPSVLNIEQDYQLVIPKKVLSNDNNLTLHCHVRGNEKLFV
DVYAKFIEPLVIKNTGMPQVYLK
>ECs2167 putative minor tail protein
MGRPDWRAMLAGMTSTEYADWRRFYCTHYFQDTQLDAHFSGLMYAVLSLF
FGDPDMHPADFSLLAPACEEEQTEMPDEEEMLMQKATGVAGGVRFGGDGG
RDISPSADVVDVSEDDVALMMASAGISGGVRYVPASG
>ECs5426 hypothetical protein
MSLRIWIHLANKNAGKALVSVGGGGIMTILLLHRVVNMKRSRTEVGRWRM
QRQASRRKSRWLEGQSRRNMRIHSIRKCILNKQRNSLLFAIYNI
>ECs1774 hypothetical protein
MRIEELREIFSEDGLYTVRVEKGAIVSHCRIKCLQSQQRKSGAALIHFVD
GLVTDGFILRANEFVTSLPSLKEAGIKAGFSAFED
>ECs2283 hypothetical protein
MGEVKHQNGRHPGIGKEAAMALYIDISAIAGQVRVIRAVTKRYAPLLQKV
SGECTEDIVNDFVIELRGLIFSYKVTTIFADGSRETVRALRLKGCVKDLA
TTFWARKLDCIHNQFPLE
>ECs0811 NinF
MIDQNRSYEQESVERALTCANCGQKLHVLEVHVCEHCCAELMSDSNSSMH
EEKDDG
>ECs2970 hypothetical protein
MSEITSLVTAEAVKEVLRSEEVRSALKQKLRHNLEARLDAEVDAILDELL
GAPAAPEPEGIAGEGSASDSGDPTPDSDMMM
>ECs1939 hypothetical protein
MQKIDLGNNESLVCGVFPNQDGTFTAMTYTKSKTFKTETGARRWLEKHTV
S
>ECs3372 hypothetical protein
MELHCPQCQHVLDQDNGHAYCPSCGKVIEMKALCPDCHQPLQVLKACGAV
DYFCQHGHGLISKKRVEFVLA
>ECs1937 phage superinfection exclusion protein
MLDVFTPLLKLFANEPLERLMYTIIIFGLTLWLIPKEFTVAFNAYTEIPW
LFQIIVFAFSFVVAISFSRLRAHIQKHYSLLPEQRVLLRLSEKEIAVFKD
FLKTGNLIITSPCRNPVMKKLERKGIIQHQSDSANCSYYLVTEKYSHFMK
LFWNSRSRRFNR
>ECs2549 hypothetical protein
MMKKSILAFLLLTSSAAALAAPQVITVSRFEVGKDKWAFNREEVMLTCRP
GNALYVINPSTLVQYPLNDVAQKEVASGKTKAQPISVIQIDDPNNPGEKM
SLAPFIERAEKLC
>ECs1837 trp operon leader peptide
MKAIFVLKGWWRTS
>ECs1935 RacC
MITNYEATVVTTDDIVHEVNLEGKRIGYVIKTENKETPFTVVDIDGPSGN
VKTLDEGVKKMCLVHIGKNLPTEKKAEFLATLIAMKLKGEI
>ECs4726 4-alpha-L-fucosyltransferase
MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSC
PALSVQFFPGKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIK
PSQFYWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFA
KTHPKVRGELLYFPTRMDASLNTMANDRQREGKMTILVGNSGDRSNEHIA
ALRAVHQQFGDTVKVVVPMGYPPHNEAYIEEVRQAGLELFSEENLQVLSE
KLEFDAYLTLLRQCALGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQ
DMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRA
LAIAAGEVA
>ECs5480 hypothetical protein
MPPYKDVEQKTTTIIIFLILTALKTTSYHHDDDDDKITKMRLFPRPPAPC
SGPPHQEDPQNDNGYHLQQNPVSSTIAPDWRL
>ECs1072 hypothetical protein
MYFLKSLYQAHVLNVAATNRWCNSPEMLPDYRAWLRAETYLRLDILISEL
QKETASIHNLQGIDAVRILVSRHSALSIIEVRHLSFSELIFLLQPALESA
NIPPEVIQYPPHVDEQLQDVPYNQRAGLTPCSEAEWDHSLLKKYQDLYNP
Q
>ECs2755 hypothetical protein
MSDDISLVMEGALAVIAVVGVYCLVVFLMERLGN
>ECs0322 hypothetical protein
MKKHLLLLALLLSGISPAQALDVGDISSFMNSDSSTLSKTIKNSTDSGRL
INIRLERLSSPLDDGQVISMDKPDELLLTPASLLLPAQASEVIRFFYKGP
ADEKERYYRIVWFDQALSDAQRDNANRSAVATASARIGTILVVAPRQANY
HFQYANGTLTNTGNATLRILAYGPCLKAANGKECKENYYLMPGKSRRFTR
VDTADNKGRVALWQGDKFIPVK
>ECs3219 putative fimbrial protein
MKGAYSAWAWLLLCSIVSTPVMAGSKSVAMVLRVLVDAPPPCTVTGASVE
FGNVFISKIDGVSYKRPIDYSLVCNNLAMDDLRLNMQATTTVINGETVID
TGIAGFGIRIQKVSDHSILDLTPGAWLPFNFSSGALALEAVPVVQSGVSL
TAAEFSASATIVVDYQ
>ECs3520 hypothetical protein
MNALTAVQNNAVDSDQDYSGFTLIPSAQSPRLLELTFTEQTTKQFLEQVA
EWPVQALEYKSFLRFRVGKILDDLCANQLQPLLLKTLLNRAEGALLINAV
GVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYL
RQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLD
HYFRHPMARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQP
KDFEEGVWLSELSDAIEISKGILSVPVPVGKFLLINNLFWLHGRDRFTPH
PDLRRELMRQRGYFAYATHHYQTHQ
>ECs1335 hypothetical protein
MHTNDRSLTMVDLFCCAGATARLVSEAGSEELRYCSGELILTHFSGEPLA
AHTIDHRLTMVDLFCCTGATARLVSEAGSEELSRVIASGREVLTRVLMAG
DEVQPREWLRADSGSDSLSARVVAALAHFLSSGVQLPTAGYGPLHEHLMA
GITHPEKVNPSRGSNSSSRSSGHSGRRVRTSVSQACGSRPLRFAVSRRLI
ICAARFPAISEPANNHAFRPMTTGRIARSQILLSRGTAPSLRKQLSSSRR
FRI
>ECs0380 hypothetical protein
MISLKAPHNNLMPYTQQSILNTVKNNQLPEDIKSSLVSCVDIFKVLIKQY
YDYPYDCRDDLVDDDKLIHLMAAVRDCEWSDDNALTINVQFNDFPGFYDW
MDYPDHPVKFVFRILENQKGTVWVYDQDDAFLDIKANVQAGRFTGLKKLV
QFINSVRTDCKCILLEHHMPLLRLFPKGKECMHVEKWLREMSSIPETDAP
IKQALAHGLLLHLKNIYPVFPESLVMLLLSVLDVKTYRDDARLNEGISNR
VQELGDRYYPVNKHVKIRYTL
>ECs1576 hypothetical protein
MKIEYTPERGRGFVRPGETEKQQNWGFSGIKKAAPKWSRLSEQITRCAVC
VCDPKHKHGDDSRYQAGGQCNQSGSVRCHTCNERFSLYSLRNCSRAKAHG
ANLSDSCSIFLRRLFRAGDNVLVNSLSVIVLSCIAMRRNTSHHGADSFYP
LGSIPRRFSSVRPNSLSQLARLMPSLCASLSNCSFSSGEIRILNCGDCPS
PLGLLSRLIVDKWSPIELAFLLLGGHLNTRALKKAKPRKCHYHSQGF
>ECs5458 hypothetical protein
MQKREPVIIAPDYTDDELYEWMHQKINAAQDLKWANEARAKQAENLSALE
QDITNLEKAAALSIARMITYPR
>ECs0247 hypothetical protein
MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQM
VQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWS
EKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENH
PDGFNFK
>ECs2931 hypothetical protein
MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENGALIAT
FSDGVRTQLANGQALKEAQCTCGASGMCRHRVMLVLSYQRLCATTRPTEK
EEEWQPAIWLEELATLPDATRKRAQVLVAKGITIELFCAPGEIPSARLPM
SDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKTQQAEFTHLIWQ
MRSEHVTSSDDPFASEEGNACRQYVQQLSQALWLGGISQPLIHYEAAFSR
AQQAAERCNWRWVSESLRQLRASVDAFHARASHYHAGECLRQLAALNSRL
NCAQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDI
EHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGG
QIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWRMLSAPLRQPGIVALR
EYLRQRPPSCIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGE
DNLLTLSLPASASAPYAVERMAALLQQTDDPVCLVSGFVSFVDGQLTLEP
QVMMTKTRAWALDAETAPVVASLPSASVLPVPSTAHQLLMRCQALLIQLL
HNGWRYQEQSAIGQAELLANDLTAVGFYRLAHVLGQFRNTESEARVEAMN
NGVLLCEQLFPLLQQQGLNRPGFPGECFICELRLPDHRFRWKHNKLFLLL
PEEYGPAFPAIVDCYTSPPTLVWPVPLLHGFSSSYGVLPPLCKDH
>ECs1281 hypothetical protein
MNKNLLTRVIIISDDLYYLQGLSAKLGHHYRFFIECFFCSQNACNNDLST
LRFDNRFTKNVLLVAVDDVSILAKLPDLARFLMIVSLRKMKNRNIFFSLS
EQTFISRYIKLNKLITLIKSTDKDQQVRQVTLTRYEWDIYYLYFKIDNKR
ILHTLLKKDVKYISHYKISVFNKFEIACDSDGFYIFKALIQLRRFTGFKN
VRLYQLH
>ECs4990 putative tail fiber assembly protein
MELINVKRYYPEHKPYGEDVQYFQSEDGRDFYESIPLFTKKYKLCISPVT
GIICSVAEDVSALYPAGFTVVEVDELPEGVNIDGNWQFSDGLISKVPVNW
KTVAEKRRSSLLQEANETVDDWKTELKLDMISDENKLQLTRWMAYIRQLK
EMHFNDIASEGHYQAIPWPEKPE
>ECs2743 putative holin protein
MEKITTGVSYTTSAVGTGYWLLQLLDKVSPSQWVAIGVLGSLLFGLLTYL
TNLYFKIREDRRKAARGE
>ECs4237 hypothetical protein
MAFKIWQIGLHLQQQEAVAVAIVRGAKECFLQRWWRLPLAHDIIKDGRIV
DAQRLAKTLLPWSRELPQRHHIMLAFPASRTLQRSFPRPSMSLGEREQTA
WLSGTMARELDMDPDSLRFDYSEDSLSPAYNVTAAQSKELATLLTLAERL
RVHVSAITPDASALQRFLPFLPSHQQCLAWRDNEQWLWATRYRWGRKLAV
GMTSAKELAAALSVDPESVAICGEGGFDPWEAVSVRQPPLPPPGGDFAIA
LGWRLGRRTDEPAN
>ECs1808 putative tail fiber protein
MTVKISGVLKDGTGKPVQNCTIVLKARRTSSTVVVNTVASENPDEAGRYS
MDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRP
EALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEENAANADT
SAGDASESARQAAESAAAAKQSEEASSSSASAAAQKASESLQSATDAELS
KKTAESAAGNAARDATTAAEKARESAESAQSAEQSRIAAEEAVNRIPTVV
GPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGERGPAGDAGPAGPQGP
KGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGATG
PQGPKGDPGETQIRFRLGPASIIETNSHGWFPGTDGALITGLTFLAPKDT
TRVQGFFQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
>ECs3270 hypothetical protein
MFRSLFLAAALMAFTPLAANAGEITLLPSIKLQIGDRDHYGNYWDGGHWR
DRDYWHRNYEWRKNRWWRHDNGYHRGWDKRKAYERGYREGWRDRDDHRGK
GRGHGHRH
>ECs2155 hypothetical protein
MPVTTLSIPSISQLSPARVQSLQDAARLESGIRISIGSGQYSVHYVQLLD
GFSVEPVRGGLLDRLLGREHRMDRRAVALERQLNGGVDFLSSVNNYFQSV
MAEHRENKTGNKILMEKINSCVFGTDSNHFSCPESFLTCPITLDTPETGV
FMRNSRGAEICSLYDKDALVQLVETGGTHPLSREPITESMIMRKDECHFD
AKREAFCCK
>ECs0833 putative minor tail protein
MNRHTQIRQVVLARLREQCGDSATFFDGLPAFVDAQELPAVAVWLSDAQY
TGKMTDEDDWQAVLHIAVFIRAQAPDSELDMWMESTIFPALNDVSALSGL
IDTLIPLGFNYQRDNEMATWAMAEITYQITYTN
>ECs1267 hypothetical protein
MNNLIITTRQSPVRLLVDYVATTILWTLFALFIFLFAMDLLTGYYWQSEA
RSRLQFYFLLAVANAVVLIVWALYNKLRFQKQQHHAAYQYTPQEYAESLA
IPDELYQQLQKSHRMSVHFTSQGQIKMVVSEKALVRA
>ECs1375 hypothetical protein
MTTSPFILEFHDNDRDNHYQLIVSVILTYITLSWFNQVIDLMKRVCLWMY
CVYVLPGICSYIN
>ECs2557 hypothetical protein
MEKNMKKRGAFLGLLLVSACASVFAANNETSKSVTFPKCEGLDAAGIAAS
VKRDYQQNRVARWADDQKIVGQADPVAWVSLQDIQGKDDKWSVPLTVRGK
SADIHYQVSVDCKAGMAEYQRR
>ECs3616 hypothetical protein
MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK
>ECs0884 hypothetical protein
MHVIDLSYFVKIYRSKNKHISVNFLRHLHRAGDLHLSGTTRCIYEDVIMK
KCLTLLIATVLSGISLTAYAAQPMSNLDSGQLRPAGTVSATGASNLSDLE
DKLAEKAREQGAKGYVINSAGGNDQMFGTATIYK
>ECs2195 hypothetical protein
MPATVSASGGNVRVLLRPVLVPELGLVVLKPGRESLPVFHRGRVLVEPEP
KNMRALPSGAVPAVRQPLAEDKSLLPFFSDERVIRAAGGAGALSDWLLRH
VKSCQWPHGDYHHSETVIHSYGAGAMVLCWHCDNQLRDQTSESLEQLTQQ
NLTAWMIDVIRHVMNGTQERELSLAELSWWAVCNQVVDALPEAVSRRSLG
LPAEKIRSVYRESDIIPGEQTATSILKQRTKNIALPPHTHQQQNPPQEKT
VVSIAVDPESPESFMKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHL
IGHGQGGMGTKSHDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFL
DHAFATGVLG
>ECs3256 hypothetical protein
MKVNLILFSLFLLVSIMACNVFAFSNSGGGSETSYKEPEKTSAMTTTHST
KFQPSQAILFKMREDAPPLNLTEETTPPFPTKANYLIHPVR
>ECs5580 hypothetical protein
MIRRLVFFAIIVPVWGVGFIFAVTGNLSMMPDIWFFIRMSLFLFIMNLLI
DIYIRITGKHK
>ECs1773 putative prophage maintenance protein
MLVTYRLPLTDRKVKEKQAMKQQKAMLIALIVICLIVIVTALVTRKDLCE
VRIRTGQTEVAVFTAYEPEE
>ECs1787 lipoprotein Rz1 precursor
MRELKMKLCVLMLPLVVSACGSTPPAPVPCVKPPAPPAWIMQPAPDWQTP
LNGIISSSENG
>ECs0371 hypothetical protein
MNALSGLRMAQESVGLISVAHQAFVTIAGCGVNALSGLQVAQEFVGLISV
AHQAFITIAGCGVNALSGLRMAQESVGLISVAHQAFVTIAGCGVNALSGL
RMAREL
>ECs1140 hypothetical protein
MKKNSYLLSCLAIAVSSACHAEVLTYPDPLGSSQSDFGGTGLLQMPNARI
APEGEFSVNYRDNDQYRFYSTSVALFPWLEGTIRYTDVRTRKYSQWEDFS
GDQSYKDKSFDFKLRLWEEGYWLPQVAFGKRDIAGTGLFDGEYLVASKQA
GPFDFTLGMAWGYAGNAGNITNPFCRVSDKYCHRAESHDAGDISFSDIFR
GPASIFGGIEYQTPWNPLRLKLEYDGNNYQNDFAGKLPQASHFNVGAVYR
AASWADLNLSYERGNTLMFGFTLRTNFNDLRPALRDTPKPAYQPAPESEG
LQYTTVANQLTALKYNAGFDAPEIQLRDKTLYMSGQQYKYRDSREAVDRA
NRILVNNLPQGVEKISVTQKREHMAMVTTETDVASLRKQLAGTAPGQSEP
LQQQRVEAEDLSAFGRGYRIREDRFSYSFNPTLSQSLGGPEDFYMFQLGL
MSSARYWFTDHLLLDGGIFTNIYNNYDKFKSSLLPADSTLPRVRTHIRDY
VRNDVYLNNLQANYFADLGNGFYGQVYGGYLETMYAGVGSELLYRPLDAS
WALGVDVNYVKQRDWDNMMRFTDYSTPTGFVTAYWNPPTLNGVLMKLSVG
QYLAKDKGATIDVAKRFDSGVAVGVWAAISNVSKDDYGEGGFSKGFYISI
PFDLMTIGPNRNRAVVSWTPLTRDGGQMLSRKYQLYPMTAEREVPVGQ
>ECs2264 hypothetical protein
MWRWPLRRQTAKLSRKSARCWKRLPVFWVFVWRITCDGKTAPDCGCTPAV
SGGDGGFHQQNHVGAGGWGAGLRHCGIAVAGDKKKQPA
>ECs2954 putative minor tail protein
MAIKGLAQAMKNLDAIDRRAVPRASATTLNRVAGAIIAKTASSVARELAV
PRRLIRARIRLSPARPDKVYAKVYINTGNLPAIKLGEARVRLSRRKRRKK
GQRAALKGGGSVLIVGKRRIPDAFITRLANGRWHVMQRMPWASSSTGADS
KGRPKRHRLPIEVVKITTAGPLAETFERERDRMYREKLPAQMMKAMTHQL
RLVLKRK
>ECs1978 hypothetical protein
MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAP
VRRGKLRRNVVVLSRRSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNA
FYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR
>ECs0442 hypothetical protein
MLQSRNDHLRQTALRNAHTPASLLTTLTESQDRSLAINNPQLAADVKTVW
LKEDPSLLLFVDKPDLSQLRDLVKTGATRKIRNEARHRLEEKQ
>ECs1422 hypothetical protein
MFRPFLNSLMLGSLFFPFIAIAGSTAQGGVIHFYGQIVEPACDVSTQSSP
VEMNCPQNGSVPGKTYSSKALMSGNVKNAQIASVKVQYLDKQKKLAVMNI
EYN
>ECs5412 hypothetical protein
MIGKAIIKAQYNKQVPLTLVAMSVLTAMSIACQNQVDVCSPGNLRGAVNI
YTMVLEKGKRPGERY
>ECs1441 hypothetical protein
MNKFLFAAALIVSGLLVGCNQLTQYTITEQEINQSLAKHNNFSKDIGLPG
VADAHIVLTNLTSQIGREEPNKVALTGDANLDMNSLFGSQKATMKLKLKA
LPVFDKEKGAIFLKEMEVVDATVQPEKMQTVMQTLLPYLNQALRNYFNQQ
PAYVLREDGSQGEAMAKKLAKGIEVKPGEIVIPFTD
>ECs0759 hypothetical protein
MKQSIVSLAQVIRSKNAGPYELVLDILFKTKENYERVKSSGQLTPELIAR
LYHVEPDFIHRIVWFDPSNAVKIVMPRDIISGNVGDNDVYGAQQHAPLLN
MSFDL
>ECs1536 hypothetical protein
MKKLLVTVKPFQGTIPFRILQRGRVLVEGSFSGKCTQLHSRTFQVNATNE
ELTVECTMNAAKCRMVSAALQPVC
>ECs1579 hypothetical protein
MKEITLHEAAERAHQTEIICRLLEVYPDKLNDGDISALAGLLAKLSGSVA
LWLIDERAERYEK
>ECs5482 hypothetical protein
MHTAFEFWVRKTFGNRYDLTRDVDGFYCREVVKRMFDVWCHCRG
>ECs1214 putative antirepressor protein
MNMMAVPFHGNSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKLRQRFAS
TITEIVMVAEDGKQRNMVSMPLRKLAGWLQTINPNKVKPEIRDKVIRYQE
ECDDVLYEYWTKGFVVNPRKMSVMEELNQACADMKRDKNIASVFATGLNE
WKQVKAAHVSKIRTLVNEANMLIDFVLADTGKGKITKAD
>ECs5268 hypothetical protein
MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHT
HPGGPSCHFNDIIPLTHCPHDVQDMQSYHHPLATNHQTQYGTVGQALHIA
RKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDT
PLYQDLVSRTRAALVKNPQNKFLGVCWMQGEFDLMTSDYASHPQHFNHMV
EAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHAYEAIYGNYQNNI
LANIIFVDFQQQGARGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS
HFSSAARRGIISDRFVEAILQFWRER
>ECs3854 hypothetical protein
MLDEHSNYSLAVLSLKSDIIRHFFCCVAQLFLVGINQIITDDGQRVLGKI
RQNATGSHRRVRMDLSLYAENKCHK
>ECs0914 hypothetical protein
MNVVNHLQTFYFAKTSGPFIANVFKIRSAGPLGFISLRSHFETLSFDIPI
VCASTDCDMPYRVRMALISSPVIDLTLAIRALTSRALTMTAMASTSLSDI
RPSAFSSFASSSIICNLSLIFHLNQEASFDQILYLLIAKGQKCICHFFVG
CQFIIFDFFASFTA
>ECs1402 hypothetical protein
MTTVSHNSTTPSVSVTTASGNNPPQLVATPVPDEQRISFWPQHFGLIPQW
VTLEPRVFGWMDRLCEDYCGGIWNLYTLNNGGAFMAPEPDDDDDETWVLF
NAMNGNRAEMSPEAAGIAACLMTYSHHACRTECYAMTVHYYRLRDYALQH
PECSAIMRIID
>ECs4337 hypothetical lipoprotein
MIKSTFWRAFALTATLILTGCSHSQPEQEGRPQAWLQPGTRITLPAPGIS
PAVNSQQLLTGSFNGKTQSLLVMLNADDQKITLAGLSSVGIRLFLVTYDA
QGLRAEQSIVVPQLPPASQVLADVMLSHWPISAWQPQLPAGWTLRDNGDK
RELRNASGKLVTEITYLNRKGKRVPISIEQHVFKYHITIQYLGD
>ECs1535 putative lipoprotein Rz1 precursor
MRELKMKLCVLMLPLVVSACGSTPPAPVPCVKPPAPPAWAMMPPSNSLRL
LDETFSVSETESSATRQH
>ECs2085 hypothetical protein
MFTYYQAENSTAEPALVNAIEQGLRAEHGVVTEDDILMELTKWVEASDND
ILSDIYQQTINYVVSGQHPTL
>ECs2753 hypothetical protein
MAHDTKFYNSDNSAAPASRHGRRSHAFKSDWYQHDPCTEEQAEWLIQCYR
RRGYEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR
>ECs4115 hypothetical protein
MPFSAATDAENNNACSLYTTKVNMSLFPVIVVFGLSFPPIFFELLLSLAI
FWLVCRVLVPTGIYDFVWHPALFNTALYCCLFYLISRLFV
>ECs2256 lipoprotein Rz1 precursor
MRELKMKLCVLMLPLVVSACGSTPPAPVPCVKPPAPPAWIMQPAPDWQTP
LNGIISSSENG
>ECs2956 hypothetical membrane protein
MAKNFVQDGTTIELVNAGDQTILSGAAVVVGSMVAVAITDIPAGEAGDGF
AEGVFLLPKQSADDIQSGAVVYLKDGVVQLAADGAVAAGVAWENAPANSA
TVAVKINV
>ECs3279 hypothetical protein
MATISSTSIPSIQTQSSNRASQGSDVASQIARISQQIIKLTQHIKEIVDT
SGSAEDKQKQAELIQQQITLLETQLAQLQRQQAEKAQEKEQRLSLNVSLL
NPVENTTHIGIYI
>ECs2987 phage replication protein O
MTNTAKILNFGRGNFAEQERNVADLDDGYARLSNMLIEAYSGADLTKRQF
KVLLAILRKTYGWNKPMDRITDSQLSEITKLPVKRCNEAKLELVRMNIIK
QQGGMFGPNKNISEWCIPQNEGGSPKMRDIPQNEGKSPKTRDKTSLKLGD
CYPSKQGDTKDTITKEKRKDYSSENSGESSDQPENDLSVVKPDAAIQSGS
KWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRD
MCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTASKPKLD
LTNTDWIYGVDL
>ECs2497 hypothetical protein
MHANNNVKTAISRGYAMNDQMFVETLIITSSFFAIAVVLVLSVLLIERTG
>ECs4953 hypothetical protein
MAYFYFKLDRVQTNKYFTKYQQTVLPLRNSILRALLKNTGAAGLRLKPFA
MDVISEFYFSGALPAGWRKRDDVAFIGDGPCFIARPDESCPEGPAIAAMI
ETAERELRKRPDFLVWLCEKLGVMRIPSMFNTDSWWTPSLSRDALCVVFK
VGAYGREIKGCIPEECQEIKHSEYVALTEE
>ECs1166 hypothetical protein
MPTLFRKEYPRKSRATEFLFLILFIVLMTPISPLIFVWAIGKIIELVIEL
YNDVVWASFNTLHNKINPYKEN
>ECs0563 hypothetical protein
MLHTANPVIKHKAGLLNLAEELSNVSKACKIMGVSRDTFYRYRELVAEGG
VDAQINRSRRAPNLKNRTDEATEQAVVDYAVAFPTHGQHRASNELRKQGV
FISDSGVRSVWLLHNLENLKRRY
>ECs1388 putative transcriptional regulator
MVAQKLEAAGCWRRASDRWLFVMGNVECTEAQREWLLLRRNYCLAQISSP
PLPETLDISEVAKAADATLRRMGIATPSGEVFRKGTPVC
>ECs1976 hypothetical protein
MTALLTLEEIKAHLRVDHDADDDMLMDKVRQATAVLLAYIQGSRDKVIRE
DGELIPGEALTRMKGAAMRLTGMLYRNPDLAEREDLVQGELPFSVSVLIY
DLRCPTVL
>ECs2971 hypothetical protein
MTFLNQLMLYFCTVVCVLYLLSGGYRAMRDFWRRQIDKRAAEKISASQSA
GSKPEEPLI
>ECs1700 hypothetical protein
MCIKLALLFFRHEMPPILRSMLLFSYIREDFNFILFYQFVNQYTSIYIFR
HSAILPDNFLVEQSVI
>ECs3728 hypothetical protein
MDEVKKIEHCKILNTDTPDNSASDLDSFLKKNKKNKEMSEIVAGTILQLL
RQTQDNQFSTLGKKNGHMKSIDNKSSL
>ECs4557 SepL
MANGIEFNQNPASVFNSNSLDFELESQQLTQKNSSNISSPLINLQNELAM
ITSSSLSETIEGLSLGYRKGSARKEEEGSTIEKLLNDMQELLTLTDSDKI
KELSLKNSGLLEQHDPTLAMFGNMPKGEIVALISSLLQSKFVKIELKKKY
ARLLLDLLGEDDWELALLSWLGVGELNQEGIQKIKKLYEKAKDEDSENGA
SLLDWFMEIKDLPEREKHLKVIIRALSFDLSYMSSFEDKVKTSSIISDLC
RVIIFLSLDNYADIISISIKKDKDIILNEVLSIIEHVWLTEDWLLESPSR
VSIVEDKHIYYFHLLKDFFTSLPDACFIDSEQRENALLMIGKVIDYKEEI
I
>ECs1344 hypothetical protein
MGFRFRKSINIIPGVRLNLSNGAPSLSVGPRGASVSFGSRGTYANLGLPG
TGLSYRTRLDRAARSRGENRTATDPGLRQALEQKAAELMSAVTAIRNIHE
LTPDPKTGISWAELEAVYLHNRTSPFQVPAPVRPEKPDYLVLPEKPAESE
GISFLGKWFESESAKAERHAENLRRWQQELIDVERENTLRQHRYQQQRTA
WAEQYANWKFEAEEHEKRLATAQADARQQFRTDAAFFESYLAGVLAETEW
PRETLVAFEVKPELSAVLLDVDLAEIEDFPDKIYGVNARGTELTEKAMTQ
KAVRENYAHHVHGCLFRLVGIVLHTLPFDNVIVSGFTQRVSKRTGYLEDE
YILSCKCTRSQMSSVNFAGIKHIDPVEALGDQPVIRKMSSTFIFQPIEPL
TL
>ECs4845 hypothetical protein
MKPGCTLFFLLCSALTVTTTAHAQTPDTATTAPYLLAGAPTFDLSISQFR
EDFNSQNPSLPLNEFRAIDSSPDKANLTRAASKINENLYASTALERGTLK
IKSIQMTWLPIQGPEQKAAKAKAQEYMAAVIRTLTPLMTKTQSQKKLQSL
LTAGKNKRYYTETEGALRYVVADNGEKGLTFAVEPIKLALSESLEGLNK
>ECs1030 hypothetical protein
MRIKPDDNWRWYYDEEHDRMMLDLANGMLFRSRFARKMLTPDAFSPAGFC
VDDAALYFSFEEKCRDFNLSKEQKAELVLNALVAIRYLKPQMPKSWHFVS
HGEMWVPMPGDAACVWLSDTNEQVNLLVVESGENAALCLLAQPCVVIAGR
AMQLGDAIKIMNDRLKPQVNVDSFSLEQAV
>ECs3216 hypothetical protein
MKKWTIFLTSLILLALSLETPKCYAGDKLMSASFSSTKIYYAMKNVTASG
SLYFYVTVVTPGEVSYGQYNSNARKGDTLKLISWSGSGPAPTLVLTDYRR
TDTSNCPGINTRVFVCAYMTFNVTVESDNYGCPWIASFYSVSEAFGFGTY
TSPTVHDSICPTIPVASFDISWSDNYVSHNKALRLQSDGSTITTTLSTYL
MESGKLCDGSIFDSRGAYCRAVSDLLTFTSYGCDNAKVTVTPSRQPLTDR
KLHDIVVQVNTSSREPIDSTCRFQYVLNEL
>ECs1239 hypothetical protein
MEYGKYETLARAGYSGADRPQGDWQTSAALTRQQYDDWRTRYLPRVARLA
DLGENNSLMNAQLARVGGLATSSLRTAQMAQDNQMARYGVNRPDNPDSNT
LGLRNALAIAGAKNGIREAEQDRQMNILTGASAPARQKLSVGGQLVAA
>ECs1320 hypothetical protein
MSFRLNNTNVCFRHEAFRLPGLYIARKAMHIVGIIQRIRQKDPTMGCDYA
IWLRMISRCYGDNLLIQ
>ECs1110 putative major head protein
MGLFTTRQLLGYTEQKVKFRALFLELFFRRTVNFHTEEVMLDKITGKTPV
AAYVSPVVEGKVLRHRGGETRVLRPGYVKPKHEFPWSR
>ECs2160 putative outer host membrane protein precursor
MRKLYAAILSAAICLAVSGAPAWASEQQATLSAGYLHARTSAPGSDNLNG
INVKYRYEFTDTLGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVM
AGPSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDD
GRHSNTSLAWGAGVQFNPTESVAIDIAYEGSGSGDWRTDGFIVGVGYKF
>ECs0618 hypothetical protein
MKHPLETLTTAAGILLMAFLSCLLLPAPALGLTLAQKLVTTFHLMDLSQL
YTLLFCLWFLVLGAIEYFVLRFIWRRWFSLAD
>ECs0820 putative endopeptidase
MSRVTAIISALVICIIVCQSWAVNHYRDNAIAYKEQRDKKVSELKQATAT
ITDMQQRQRAADALDAKYTKELADAKAKNDALRRKLDNGGRVFVKGKCPV
PSSDETSSASGMGNDATVELSPVAGRNVLGIRDGIIRDQTALRTLQEYIR
TQCLR
>ECs3855 putative enterotoxin
MPIINKSASNYVEYISKNNPPYLSKKRDASINLNGKVSDCNGEIIWCRHI
ASYWSEFFCSNSGKIDYETFSSPQLLSKAIVIQENKGTNNIKGDVYFVEN
ESWGSVIYNLFLQLEKENKSHTSLEVHSPGHAMALGIKIKNDKENKFVIN
FYDPNQTATHKRVFFCTNNICDIINLTAYDFLSEQCLKCYGLKEDTLSLF
VDKTKSNDNNNVFIKKLPDNILQGVVINFAMGAGLREIIKKVYNDTRFTD
LTKSQMKILCESKNVNNVPGLLLALQNGHDNVIDEYGTLIKKSNLNKEEL
IHILSARTLDGTIPGLYQALQNGHAQAIKSYGNLVLDTINKNIDLEYLLS
AFKYEAHSSNKYTPGLFSAFQNGHADAIKAYCGVLGNSNLKRGEIIRMLE
ARNYDGAPGLLLAYQNGDINTIQSFFDSLIMLDISKDFIEELLTAKHYDF
TGLSLAISHRHDHVVKLYGKLFKKLDTSPYKMSIILALAIDCERNNANII
IDSEYKSNKAVKEYVEILKEFNICPEKVAEYLSEFSGKHFLDVYNYYSN
>ECs3352 putative protein processing element
MPTIHVSIVSFSNSFAYSGGYMTEECGEIVFWTLRKKFVASSDEMPEHSS
QVMYYSLAIGHHVGVIDCLNVTFRCPLTEYEDWLALVEEEQARRKMLGVM
TFGEIVIDASHTALLTRAFAPLADDATSVWQARSIQFIHLLDEIVLEPAI
YLMARKIA
>ECs1550 putative tail assembly chaperone
MAKDLKTLALARLSGFRHKTVKVPEWRNVSVVLREPSAEAWYLWQEVLNG
DGEDDDTLSVVAKTRRNLEADVTLFCDVLCDTDLQRVFTPDDREQVLAVY
GPVHARLLRQALELIADAESARKK
>ECs1374 hypothetical protein
MKIYWTRKRIPELSALPPSLRKKNFTDAYNAASSHIEYWIGAGVSFISMM
ILSRVYDFLLPAQDTFPGDIIRSLCVVCPSILIWFQFSVYVMRKYYRHIL
VRGKETETISERLIREADTREYELWRPVRRFFSIVFLIVLLGCIHSLITT
IK
>ECs5224 pyrBI operon leader peptide
MVQCVRHFVLPRLKKDAGLPFFFPLITHSQPLNRGAFFCPGVRR
>ECs0351 hypothetical protein
MRFVLFCPSGIVPAQFAALSTGSVGNVTCIGTEEELRNKLRHRPQSVVIS
AGRPAECAEMWFRFYRDHSFVVVLCVAPFFLPPDVSISGVLKNLRLLKPG
MSVEHVISIANSSGGLSGLKHAENLPVMDSYSVFMKEVNNRTKTIVMSER
FPEKQKKVLSLLLAGHSWEYSAQFLKTGIRQIWLAEQSLKKRWGIPDSMS
LREALLLSSNKFHGDGNALEKTNAMTLRENGNTNRYSVVVNAGTQVLSLH
KYK
>ECs3532 hypothetical protein
MYLRPDEVARVLEKVGFTVDVVTQKAYGYRRGENYVYVNREARMGRTALV
IHPTLKERSSTLAEPASDIKTCDHYQQFPLYLAGERHEHYGIPHGFSSRV
ALERYLNGLFGEAS
>ECs0300 putative CI repressor
MMIPAYIKYLYSGLLTVVISRYSFSAVAKSAAGIGVPYNLLATIDAPCVF
FYVVAQAQPFSGLWCLCLHHGSIEIMVVRAGQPSGWPVSNKAGYANPVRA
ATSEIGVSGGSNNRYLLEAAIMATILTPSHPQYVFVFAAIRRADTHPRIC
MLRTVSCDERSARRLLVRDYVLSLSARLPAGEVTL
>ECs4747 hypothetical protein
MNNTLLPSINIPCTLFETISLFDNYSADDMQYGDMVEQDFLSLGLSDISA
KVDPYRLIKYHFPGPGSINVAFSASSSGTKISQRECTDILFAEMKELAKM
FSFFGQYKTLIEDLIEHFRYGNGSNFHSQQLNLSFHEKINKYGYNSPIRI
IKECIENGINSTPSTGYQPLILQSIKTKLLSSRLNKFNDFEDSFNGLGIS
VHDISAQKISLLSFQKYAIGWSATIHFVAQDHFGLDVTDIKNKTYSKYRF
FRIWFFLQRHKDFAFKPFFTNFNTIERIENYL
>ECs1216 putative lipoprotein Rz1 precursor
MRELKMKLCVLMLPLVVSACGSTPPAPVPCVKPPAPPAWIMQPAPDWQTP
LNGIISSSERG
>ECs2134 hypothetical protein
MVTPVSISNYISLPDDFPVRNIAPQVKEVLKDFIDALSTIICNEEWRTSL
NINSATKKIFNNLDNLSYIQRTSFRGNDTLYNEKVQFKLTYPARNGRHKE
NIEFQVVINLSPIYLDNFRHDGEINIFCAPNPKPVTMGRVFQTGVERVLF
LFLNDFIEQFPMINPDVPIKRAHTPHIEPLPPDHHTAADYLRQFDLLVLN
FISRGNFVILPRLWNNSEVHRWFVNKDPNLITAILDITDSELKEDLLQSL
MDSLGSNKHVLP
>ECs4797 hypothetical protein
MSIAAISILVGGGLPTFKNGLIMKNINAIILLSSLTSASVFAGAYVENRE
AYNLASDQGEVMLRVGYNFDMGAGIMLTNTYTFQREDELKHGYNEIEGWY
PLFKPTDKLTIQPGGLINDKSIGSGGAVYLDVNYKFTPWFNLTVRNRFNH
NNYSSTDLNGDLDNNDSYEIGNYWNFIITDKFSYTFEPHYFYNINDFNSG
NGTKHHWEITNTFRYRINEHWLPYIELRWLDRNVEPYHREQNQIRIGTKY
FF
>ECs1530 putative holin
MYQMEKITTGVSYTTSAVGTGYWLLQLLDKVSPSQWVAIGVLGSLLFGLL
TYLTNLYFKIREDRRKAARGE
>ECs0728 hypothetical protein
MELYREYPAWLIFLRRTYAVAAGVLALPFMLFWKDRARFYSYLHRVWSKT
SDKPVWMDQAEKATGDFY
>ECs1620 antitermination protein
MNTQYLQYVREQLIVATADLSGATKGQLEAWLEHAQFDTGTYKRKKPRIL
DEVTGRMITLDNPPISGKQSYAKGSSIALVSQVEFSTSSWRRAVLSLEEH
QKAWLLWSYSESVRWEHQVTITQWAWSEFKTLLGTRKIAGKTLERLKKLI
WLAAQDVKNELAGRKTYEYQELASLVGVTSKNWSETFTERWVAMKHIFLQ
LDSQALLLLTKTRSKQKTTFSQQDIAKLD
>ECs1947 hypothetical protein
MVRPFLSPQVLRYQYRRLQHRKILKQKRWRTLCSRCHRLLKRERMTWFYH
RCIWQTANCVGRKIMSRSGSESAPRCGS
>ECs5227 hypothetical protein
MLSVLLKLLIILVLTPRASTRLLAVRLGYLAVYSMNMSVRKLFRLSIFFV
MIVR
>ECs2271 hypothetical protein
MSDDISLAMEGALAVIAVVGVYCLVVFLMDRLGN
>ECs5164 hypothetical protein
MELTMKQLLASPSLQLVTYPASATAQSAEFASADCVTGLNEIGQISVSNI
SGDPQDVERIVALKADEQGASWYRIITMYEDQQPDNWRVQAILYA
>ECs1099 hypothetical protein
MSAKNANFQLTITKAQVAIKSCPSWHTGFQWNHNDQLLHSCIPDSPTTQI
FSICCYLKRKRTNARVFHHVLLLIACDLFLPHGTFFQLEIFILQPVRIVY
SSTTLPAPPYDGPL
>ECs0801 excisionase
MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKV
DLNRPVTGSLLKRIRNGKKAKS
>ECs1236 putative outer membrane precursor
MKSIATLVVCAISGIACVNLSAHAAEGEHTISLGYAHFQFPGLKDFVKDA
TAHNRETFSHFVNRNYFSSLGEYTDGRVSGYEGKDKNPQGINIRYRYEIT
DDFGVITSFTWTRSLTNSQTFIDVQSADHTRKIKNPAASARTDIRANYWS
LLAGPSWRVNQYMSLYAMAGMGVAKVSADLKIKDNINSSGGFSESNSTKK
TSLAWAAGAQFNLNESVTLDVAYEGSGSGDWRTSGVTAGIGLKF
>ECs3012 putative excisionase
MQTIIYQITPSKWCTERVLIASTGLKPGTIERARRKSWMQGKEYRHYAVE
GDPGHYSECLYNIEEIMRWIENQKQPGAKNASSG
>ECs5562 hypothetical protein
MSVPVYQIMGKGCHRQDKNMSFSPLFVKIFCRSSGSLANRHVDNFVHKLF
TSLLIYYHAFYSMMFVFVFHAVKNKSIPNN
>ECs0727 KdpF protein of high-affinity potassium transport system
MSAGVIAGVLLVFLLLGYLVYALIHAEAF
>ECs2255 hypothetical protein
MSSKNRPRRTTTRNIRFPNQMIEQINIALDQKGSENFSAWVIESCRRELA
ADIKYARQLTIKKNDTQYALRWLFI
>ECs2746 hypothetical protein
MSIKHYDVVRAASPSDLAEKLTHKLKEGWQPYGGPVAITPYTLMQAVAIE
GEPQVGPSSEPDWYYVIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLAR
RSTVTPGGAACRYNDIIPADHCLHDVQDMSTLNHPRADLSKGQYGCVGQG
LHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQDSARW
GVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDMSAATHAQQPALF
TAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYG
GYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGN
QVSSNRPTHFSSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPG
GNTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQS
NKTWSLTHPVDDAITLLTQGGRLTCKFRLSGALTNNQFGLGIYLYTDAPV
PDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQ
TLELVFTAGSAMVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAY
GVEIESLMLEINAPAA
>ECs1761 DicB
METLLPNVNTSEGCFEIGVTISNPVFTEDAINKRKHERELLNKICILSML
ARLRPIQKGCWQ
>ECs2806 hypothetical protein
MKTLPVLPGQAASSRPSPVEIWQILLSRLLDQHYGLTLNDTPFADERVIE
QHIEAGISLCDAVNFLVEKYALVLPTSRDSAPVPALS
>ECs2738 putative lipoprotein precursor
MRELKMKLCVLMLPLVVSACASTPTVQAPCVKPPSPPAWIMQPVPDWQKP
LNGIISSSENG
>ECs0011 positive regulator for sigma 32 heat shock promoters
MTALTPFSAAPTGPPSPAPRSKPCPSTLIAAWVRKMRVSWLESRCDTPFA
NNLSFISSGSSSSSSFTLASTACRNSCLCSSSIFFQVLRRNCSSNCCSIS
NVDISLSAFSFNRFETSSKMARYNLPCPRSLLAILSPPKCCNSPAISCQL
RRCCSGCPSIDLNSSLRISTLERRVLPFSLWVSSRAKFANCSSLQCWRKS
RSESFR
>ECs1503 hypothetical protein
MVKSRHTTRKHTARDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFID
ENSGEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELG
KAIDFDARTAIPFEGERHNALDDARYQAKYVSVIWQKLIPSQADF
>ECs4963 hypothetical protein
MRERDEARQVLANQQHTLQLISQISEAATNEKQQNIQHSEGQQSVVRRSL
AAVPAASAPVPDDVADRVRRAVCEIRACEAGADPR
>ECs2206 hypothetical protein
MANAWLRLWHDMPNDPKWRTIARVSGQPIATVMAVYIHLLVSASRNVTTC
HGVSLRGHIDVTTEDLASALDVTEDVIDSILHAMQGRVLDGDLISGWEKR
QVLKEDNGNVSQTAKSPAERKRAQREREKLRKHNADCHDESRRVTHLSRQ
VTTDKDTDKDTDTELNPTHNARESIPTSESNGAPLQTAEPEYLDGLSEPI
GKFSMTTVWQPSPDFRQRAAVWGMALPEPEFTPAELAAFRDYWMAEGKVF
TQVQWEQKFARHVQHVRAQVKPVSKGGSHAASGGTASRAVQEIRAAREQW
ERDNGFISNGNGLEAVGAHGGGVFEPLDSEERGRTFEALDCPDWCDD
>ECs5201 hypothetical protein
MHRVTYPCLTSRRFQLALIHRRVVDKRTSMHSRTASESTGARIHRPWCAR
HQVRPAWRCQYDKLHRVPFRSPELRLDSGPGYTTGSYRY
>ECs2762 putative phage replication protein
MQHQVAPHHGQFRKFGQHVNSGNVKTDLSATETAWKLWELMGEVYSNRWT
QKNGAAPSKLWIAQIGAMTEQQIRQVCRQCMDRCRAGETWPPDLAEFVAL
ISESGANPFGLTVDAVMEEYRRWRNESWRYDGSDKYPWPQPVLYHICLEM
RSKGIERQMTEGELKRLVERQLTKWAKHVGNGLSVPPVRRQLAAPKRPPG
PTPIELLKQEYERRKAAGFV
>ECs1778 hypothetical protein
MIYPEITGKSGEHLRLNTLEAVWIQGKLRMWGRWSYIGGGKSGNMFNRLL
VSKKLTKTAVNEVLRSMKKSGLEKPELEAFFRDMTRGKQKSWLSHCTDTE
ALIIDRVISEVLGEYPGLINILRQRYEGRGMSKRKMAECLNRTHPEWCFS
TCEKRIAGWLAVAEHMLYVPMHDSFR
>ECs2997 regulatory protein cIII
MQYAIAGWPVAGCPSESLLERITRKLRDGWKRLIDILNQPGVPKNGSNTY
GYPD
>ECs1291 hypothetical protein
MYERGLYGTRANGWGLKRFGVENDIYAPERIRNPQYMPYLEAMEQVTDSA
LQQMPTKERMKLMKSRTAFIYADSWGESGSFEKISSALHIATIDTLPKNL
LKKFSITEPTCKIRGEKQAFVQALRMAQDYLAWDVFDNVVICAAYRAIPV
LVFSEEDVGKPPKRLFPRRADEVNLSVERVGCFILSQRESAIRLQCGEYH
SGTAPQHATDIDVMAFSERQQMPSRPAAGVIDLVAHYGESGCLTPALSMD
YLTRHIQPGGKMRTVIADKQFGYHYFDLEYLEG
>ECs1210 hypothetical protein
MTFLNQLMLYFCTVVCVLYLLSGGYRAMRDFWRRQIDKRAAEKISASQSA
GSKPEEPLI
>ECs1944 hypothetical protein
MASNWIKLEVITPDKPEIFRLAEILNIDPDAALGKVIRFWAWADQQMIDG
NAECNARGVTKSAIDRITFMAGFADALIQVGWLVETNGVLSLPNFERHNG
KSSKKRAVTNERVTKIRELKRKGNAASVTKTDQKALPEEEKEEDINTYLP
LNPPRQKRASKKFEPEAIELPDWLPETLWHEWVQFRQALRKPIRTEQGAN
GAIRELEKFRQQGFSPEQVIRHSIANEYQGLFAPKGVRPETLLRQINTVS
FPDSAIPPGFRG
>ECs2273 hypothetical protein
MAKPFTHEQREELKARIIGLVRKNERMTISQLERATGAGWHSVRRCLVDV
LACGDLYMPGKYGVFTSEQVYRVWRKAAEKATDQTLIRKLPDGEIRRYDR
QQNIICGECRKSEVMLRVLAFYQGNFQEAVL
>ECs2995 putative superinfection exclusion protein
MNNSWWQELMHFFLQGMTLKQLIHMLIILIILIIVMPVSVKEWINLHNPE
ILPHYWMYYILLFCVSYVLNGVVNSAYHAVTERIEIFAAQKRKSKEEKYV
QDLFDSLTLGERAYLAFAVAANNQLKTEKGSPEAISLLEKGLLIRVPSAT
GYPEIDRFVIPERYRNECYIRFAGKKDSLMDELIAQDKHGKNK
>ECs1097 hypothetical protein
MADNNEATSSILADPYGKISHKTLDIITTTLTPLMLQRLKHNINAWVNEE
LSPPCLWDSRYACQQKMRIFNLLSPKLR
>ECs1782 putative holin protein
MYQMEKITTGVSYTTSAVGTGYWLLQLLDKVSPSQWVAIGVLGSLLFGLL
TYLTNLYFKIKEDRRKAARGE
>ECs1319 hypothetical protein
MTEHWSGHADDNFPFSWGSDSIFMHACLQSYGESICDTVHTFVADVAPES
LTGLQTVYLYTAGLLCDIVFHIEGGPPEMNGIIIFPDDIIP
>ECs1296 hypothetical protein
MKRTLKISSLLCVALPLTVQADCLSGDEVAQNSDVTLYEQVSYVNKQASA
WQIAGKNPYTRHNGYQEAGIGINSGCSIIDNTLDLKLNLYGINEYALKPA
GKFETDDSRTRALINRLSLVYSASDSVQFEAGKFAAPSGMFFLRSPSDLQ
THYYTGFQSTRLHDPKMTSAYQASSWGAKMSVDTRDYAFSASVIPKLATI
DKRYVTSGNWSANQQGNSDEAYLLSFSDHRFGEHTPTVNVRLGPSPSLAL
SDSYHYTPQLTLNVDAAYHRSQQWRHLSHRETAQVEEYQFPDSLYETKDE
SGVELALGGQYTSDNFSVFGVEYYFQSEGYSRAEQRQQRELIDFLNTTTG
YAPLDQAFDSYKYLMASEISNTANQGMLQGKHYLNAWASLPLAGESTLQP
SLVVNLVDGSTLLGLHYSTPLSAISNQLEAYAGGYSALGSRYSEFALFGD
TLGLYLGFKYYL
>ECs5470 hypothetical protein
MAKNDFCDKYFEYYLQINEVVRMDGNITIEYEVYVRIVWAEKAKTR
>ECs2229 hypothetical protein
MAASGGGEITVGGQTVRITYSETDGRFLASGGNNSLLSGLLLTGLNGGPE
ALRDIMLRMVSGSGNTQSHGDIEGKISQCKFSVNTESLQCPSEAVRCPII
LDKPEEGVFVKNSEGSLVCTLFDSVSFSHLVRDGGKHPLTREPITSSMIV
SQEQCIYDQTKGNFVIKDK
>ECs0235 hypothetical protein
MLFLYYAVERKVVSKGLRSREMAAAGNFGGSVPERKRSTVLSQPL
>ECs4948 hypothetical protein
MNMQSCGNKMNLFDSLNSARRLTELAGAVLERSKRYPQRFALKTTPPVGN
VQGTGEIEITIQTNGLRRRVKATRISGCTVYWEV
>ECs2282 hypothetical protein
MNMTAGFNFNNYAAVFCSATPALRGNRGFLSGRSQTSEWKASRDRQRSSN
GALY
>ECs4950 hypothetical protein
MSKVVRIIFEYKEHVIHKNADGTVRMGVSLDIRSTGIKQKGDGPAMIFGV
VMLAESRDFAELVAMKASALMKDMSMRSGVIKGNEFNQQG
>ECs1673 hypothetical protein
MNCYNITIPYGRIPVAFLRSVKSRRGIFAGGITYKGAWTQQTVEIYSWLA
CGAGFIDVSLCQNLAS
>ECs4958 hypothetical protein
MNSLPAGWARPLMARKHHFFKTGENISICGRWLYLAHNREPDTFESPDDC
AECRRRVNKEKDNGQ
>ECs2154 hypothetical protein
MPVDLTPYILPGVSFLSDIPQETLSEIRNQTIRGEAQVRLGELMVSIRPM
QVNGYFMGSLNQDGLSNDNIQIGLQYIEHIERTLNHGSLTSREVTVLREI
EMLENMELLSNYQLEELLDKIEVCAFNVEHAQLQVPESLRTCPVTLCEPE
DGVFMRNSMNSNVCMLYDKMSLIYLVKTRAAHPLSRESIAVSMIVGRDNC
AFDSDRGNFVLKN
>ECs4571 SepZ
MEAANLSPSGAVMPLATSLSGNNSVDEKTGVIKPENGTNRTVRVIAGLAL
TTTALAALGTGIAAACSETSSTEYLALGITSGVLGTLTAVGGALAMKYA
>ECs1075 hypothetical protein
METVSDALKALKKASSHVVAARLGISREEAVNELWELKINGVVDKTGHTW
FLAGEGESRVTEERPVKSEAQDMLTGEVEQKVTADMMIEFIGQDGAKTCE
ELAGKFGVSTRKVASTLAVVTATGRLARVNQNGKFRYCMPGDNLPAEPKA
ALVTESDGKAFPQPAGAALPVREAATQEEIKTETVADIVQPLPSFTETQA
DELIFPSLRRANLALRRAKSDVQKWERVCAALRELNKHRDIVRQITDSSR
RVVSEK
>ECs4008 hypothetical protein
MDFPQRVNGWALYAHPCFQETYDALVAEVETLKGKDPENYQRKAATKLLA
VVHKVIEEHITVNPSSPAFRHGKSLGSGKNKDWSRVKFGAGRYRLFFRYS
EKEKVIILGWMNDENTLRTYGKKTDAYTVFSKMLKRGHPPADWETLTRET
EETH
>ECs0949 hypothetical protein
MEYRFSWVIFNMINNISDQASSFPCTQLNQSDNFLDSLREFFAILNSSRK
GELSTWDTIYLHLILVINADSDLIKNDVLLAENIPTADYQFNTFFSNAFE
TDVKKYLGKSEDNEVEIKTGNERISIGIRNRSNGKLERQKFLFPLDYENK
LQHQLDKHFTIESHPLLYRHTMGSKIANVIFEKLYSRIDFNKEQYISFIK
DAFLHFYDYSRRYAINENIAKDAVTNNIALMSTFYDSDNTSGEVLNNDFT
EESFETALDVEHTIVLGFADDNFETKPVHYQDLLTRFSAFQDTVFNLFPE
MHSSHYNDICSVSVDMTKGTQCMIHLMVNEEVFMSLPVPVATMVREDASN
LINLKTLLNDGCFIKYSHFNDVALIKQNISNLYLSHTVVNESILKKCCFE
NGSLGDVKITNNK
>ECs2227 hypothetical protein
MDFSWEGRELEFLIYFNCINILNGLHRVNVSSTPYSLAYNTCCNLHLSAM
KKHHLSLYEILDLPSANLSFQSTFKYCIYLPTRSYFRKLNMNDNIPTARN
HKQSTCITEKTCLYF
>ECs0053 CcdB-like protein
MQFTVYRSRSRNAAFPFVIDVTSDIIGVINRRIVIPLTPIERFSRIRPPE
RLNPILLLVDGKEYVLMTHETATVPVNALGTKFCDASAHRTLIKGALDFM
LDGI
>ECs1407 hypothetical protein
MMKLALTLEADSVNVQALNMGRIVVDVDGIELAELINMVCDNGYSLRVVD
ESDQTSAECTPPFATLTGIRCSTAHITETDNAWLYSLSHQTSDFGESEWI
HFTGNGYLLRTDAWSYPVLRLKRLGLSKTFRRLVVTLIRRYGVSLIHLDA
SAECLPGLPTFDW
>ECs2964 hypothetical protein
MDGELKNMKLNINQLAALSGLHRQTVAARMADVPLAPGSNEKKKLYLLTD
LITSLLEKPPSSEDEDMDPHARKAWYQSERERLKFQHETVQLVPVSDVRR
SFSVVVKAIVQVLETWPDRLERDRGWTASQLNEVQIVVDEIRDTLEKAVI
DCCDEADM
>ECs2445 activator of ntrL
MNKNMAGILSAAAVLTMLAGCTAYDRTKDQFVQPVVKDVKKGMSRAQVAQ
IAGKPSSEVSMIHARGTCQTYILGQRDGKAETYFVALDDTGHVINSGYQT
CAEYDTDPQAAK
>ECs2009 hypothetical protein
MHMFKPRLCSWIGLLPLFMLSLPVQAELRCVANAVDIEPFFSAATAEDKQ
QVEQAINSSVNLVPFGLSASDWKVHRGDLVVEGNIESNQKLIVLGNLTVK
GNISTFSLSNPWVILGNVTATNIVTDSPLLITGSINASGLVFIDSYYDNP
STIKGSINARGIFINDIIAPVVASSTNSEFMVRASDKNDTENVKKALMII
NPDAYYWGLINDEDALKEIFKRSNIRMAGNVCNQMKKEALFRLKPSPELV
QELQMLDEGNVAAFEGRDIATFDLAIMRTLPRLKGISANLRKQLINSNDG
QTIESMARYMPDNEILELTDQQLGYQPVVLGLLDREPLSVEIMTRMSHLP
DGVGPLNLALRENLPLDIVMTLAKRDWDMIIQELYKDAWLLPESIIDGYI
RSDDSSIRQVGAGGQLTYNQAMQLANDSSNDVVTSLALKLAEMKHHGQLL
RMTPQESDKIAVYLYQKFENDDDLIGALFLALPDNLQFNFVKRMEKKSPA
YFCCRDMQIIHSDAALQRLLTRFNDPEGWSNLAKNQYLSTSMKQKIWQRA
LSHRKNNPKADSDAYETSADMILSELISYGEVDDQMLLNATSLIRSDDWD
FLESALISWDNLPAVVLKELQQNTPRNDIWAKFFLRQENSSRAQVNEALR
VYYALDPDALAQLDVLAKQPDRIWWSTLAKSNLTFFKFGALNNRHTPPAV
LAAEIDPEWWIVAMNNPRFPVDVLKARLKRDPLLALKLVNPELDLVRQLA
LNGKTRAIREQAMRKLDELY
>ECs2427 hypothetical protein
MSQNDIIIRTHYKSPHRMHIDSDIPTPSSEPINQFAPQLITLLDTSDLSS
MLSYCVTQEFTANCRKISQNCYSTALFTINFATSPIHAENIFITLHYKKE
IISLLLETTPIKANHLRSILDYIEQEQLTAENRNHCMKLSKKIHREKTIQ
PTVNLNGSAFFSQSPSDAIFCRHLSLQYALDSLRNGKGKVNLIKHYSSVE
SIQQHVPLVRDAEFRALLRHPPAGSRVIASKDFGFALDIFFCRMMANNVS
HMSAILYIDNHTLSVRLRIKQSAYGQLNYVVSVYDPNDTNVAVRGTHRTA
RGFLSLDKFISSGPDAQTWADRYVRNCAIAILPLLPEGVPGAIFTGIATR
MPFAPIHPSAMLLIMATGQTQQLITLFRQLPILPEKEIIEIITAQNSIGT
PALFLAMMNGHTDNVKIFMQEIQSLVDNHIIHEDNLVKLLQTKSANETPG
LYISMLYGFDEIIDIFLNALTTPIAQELLNKKLVMSILAMKIHDGEPGLY
AAMENNHPLCVTRFLSKINGIAFKYKLSKANIMDLLKGATAQGTPALYIA
MSKGNEDVVLSYISTLGAFAKKHSFSQHQLFTLLAAKNHDNMSAVHIAIH
HNHYKTVETYYAAINVISQSMSFSADELKTYL
>ECs1528 hypothetical protein
MRFVQLILLYFCTVVCTLYLVSGGYKVIRNYIRKKIDAAAAEKISASQSA
GTKPEEPLIS
>ECs4989 tail fiber
MFHVDNNSGVANMPALAPAQSNTTTWFTEGDGQKGISWIGQDWLNILQAE
LLNILAEASIQPDKAQLNQLTLSIKAIIAANAFSRKNNLKEIADAGAEAQ
RLARGYLGLGALATKNSLGPGDVNALAKDQNLADLENAGTARNNLDVYSK
SEGDNRYLRREQNGADIPDKGAFIDNVGLRETVNKAADALPSGGTAVAAN
RLATPRNINGVPFDGTLDINITSGMTQSTADGRYVQNVQLGAQSYHSPGG
NEMSWNYSAPSGCMLSGINVQETGSRSADNIGGVYYRPVQIYINNAWRTV
SSV
>ECs1234 putative outer membrane protein
MEYGFAIYNRNNVNVTGVLTPVFFLDRFTAESGSKTYTNKPDGKSLQAVC
CLFPWNNVFADRKVPKITINDNTVTWSNLEQGMGSYIYTFWG
>ECs3500 hypothetical protein
MKDGALLRSSSLFIAYMGCLGWGSAYFYGWGTSFYYGFPWWIVGAGVDDV
ARSLFFAVIVIAIFLIGWGIGVVFFFAVKRKHSMQELNVFRLYFAVELLF
VPAIIEFSILRQKIQVPLLLLSAAIALAVTISIRSYGRFLSVSCFYDKPF
IKKHFFEIVMIAFVAYFWLFSFLTGYYKPQFKKEYEMINYNDGWYYVLAR
YDNCLVLSTSFNAGSKRFVIYQSAQDKNLQVDIVRTRI
>ECs4994 hypothetical protein
MTFVHTMLLYFCAVVSALYLVSGGYKVIRNYIRRKIDDAAAEKLSKTAQA
PSSPNDPTPL
>ECs2267 putative antitermination protein
MNNHYLQFVREQLIIATADLSGATKGQLEAWQENAMFDTGRYRRKKIRYR
DEVTGKMITRDNPPIPGKQSLAKVTSIPLVSPVEFSTSSWRRAVLSLEEH
HKAWLLWCYSGSICWEYQIAITQWAWNEFNTQSGTRKIAGKTQERLKKLI
WLAAQAVKAELFGGEGYEYKELALLVGVTTKNWSKTFTRHWVAMKHIFHR
LDSEALLFVMRTRSKQKAAFSKQSVAKVD
>ECs0730 hypothetical protein
MIFVSFGVIADCEIQAKDHDCFTIFAKGTIFSAFPVLNNKAMWRWYQNED
IGEYYWQTELGTCKNNKFTPSGARLLIRVGSLRLNENHAIKGTLQELINT
AEKTAFLGDRFRSYIRAGIYQKKSSDPAQLLAVLDNSIMVKYFKDEKPTY
ARMTAHLPNKDESYECLTKIQHELLRSEEK
>ECs0126 hypothetical protein
MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNL
PNGQIRRALVFFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKC
KALARDSLSLLAYVK
>ECs4644 tryptophanase leader peptide
MNILHICVTSKWFNIDNKIVDHRP
>ECs5010 hypothetical protein
MKRPAFILICLLLQACSATTKELGNSLWDSLFGTPGVQLTDDDIQNMPYA
SQYMQLNGGPQLFVVLAFAEDGQQKWVTQDQATLVTQHGRLVKTLLGGDN
LIEVNNLAADPLIKPAQIIDGATWMRTMGWTEYQQVRYATARSVFKWDGT
GTVKVGSDETAVRVLDEEVSTDQARWHNRYWIDSGGQIRQSEQYLGADYF
PVKTTLIKAAKQ
>ECs3574 formate hydrogenlyase maturation protein
MSEKVVFSQLSRKFIDENDATPAEAQQVVYYSLAIGHHLGVIDCLEAALT
CPWDEYLAWIATLEAGSEARRKMEGVPKYGEIVIDINHVPMLANAFDKAR
AAQTSQQQEWSTMLLSMLHDIHQENAIYLMVRRLRD
>ECs3485 chaperone-like protein
MPMNTTGMSFSSFGISCHRENSFRNSFRGKNDEVVKCSIGERTISFSVRK
FSGNILETVRRQSTKDIDEWIKDERIVYPSRVINQEIDNYCFQKNAKIST
EERQRVFFLVSQENQLTLDVKAAQSSINHVIMGSASFGKKMDALCDGMSR
DVKNRTSDTIANLLADKFYQKHIDSDIDIVKLRNDIPDYLMRAIQG
>ECs1061 hypothetical protein
MPNKPCPACNALSGFLEKGGYYIFNCPEHVEFHISKLDSIITKPNQYQSD
LLNKELNAARD
>ECs5391 hypothetical protein
MWYFAWILGTLLACSFGVITALALEHVESGKAGQEDI
>ECs0822 hypothetical protein
MPSMIAIILLIILHIWLCRQGGAHFWSESWLNISLLMLDIEHLARGKGLR
>ECs1142 putative regulator
MRPLILSIFALFLAGCTHSQQSMVDTFRASLFDNQDITVADQQIQALPYS
TMYLRLNEGQRIFVVLGYIEQEQSKWLSQDNAMLVTHNGRLLKTVKLNNN
LLEVTNSGQDPLRNALAIKDGSRWTRDILWSEDNHFRSATLSSTFSFAGL
ETLHIAGRDVLCNVWQEEVTSTLPEKQWQNTFWVDSATGQVRQSRQMLGA
GVIPVEMTFLKPAP
>ECs2758 hypothetical protein
MSKTNPGWARPLMAKKHHYFAEGEITSICGGWMYFGNEREPDTFESPDDC
KKCRRKLNKGGK
>ECs2468 hypothetical protein
MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKTMLLCKWGFYLTCVV
AVMFVFAAITSNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHY
YQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQY
HVLPFDSIDIISKRRESLEEQWGIEDSESYCALMEHFLSGDHGANTFKAN
MEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISI
SSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSLMGFLYWHIC
CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS
>ECs2727 putative minor tail protein
MHFSGLTYAVLSLFFCDPDMHPSDFSLLVPRHEEEQVERPDEDKMLMQKA
AGLAGGVRFGGDGGRDILSSADVADVMVDDAALMMASAGIPGGVRYVPAG
W
>ECs1526 hypothetical protein
MAKHGAGCLPVNSVPAPESAIIPYTWLLIAPPHRGIRHAVVFLINSKQKN
QALCRLFLFITGHSNTTMPQTSPPSASTDMILLVKSTITTRNSRSAATAR
RSFTVTGNSPDEDPEHRQSLPAPCALPAQFRRLPPQWPPPHPAHLPLHAY
RVSHPRSHVTPHDDQF
>ECs4866 hypothetical protein
MKIRLLILSLLVSVPAFAWQPQTGDIIFQISRSSQSKAIQLATHSDYSHT
GMLVMRNKKPYVFEAVGPVKYTPLKQWIAHGEKGKYVVRRVEGGLSVEQQ
QKLAQTAKRYLGKPYDFSFSWSDDRQYCSEVVWKVYQNALGMRVGEQQKL
KEFDLSNPLVQAKLKERYGKNIPLEETVVSPQAVFDAPQLTTVAKEWPLF
SW
>ECs0844 putative tail fiber protein
MAAVQISGVLKDGAGKPIQNCTIQLKARRNSTTVVVNTVASENPDEAGRY
SMDVEYGQYSVTLLVEGFPPSHAGTISVYEDSQPGTLNDFLGAMTEDDAR
PEALRRFEQMVEEAARHAEEAKKNAGEAETSARNAGISASKAEASAANAD
TSAEDASESARQAAESAASAKKSEEASSSSASEAAQKASESLQSATDAEL
SKKTAESAAGNAARDATTSTEKARESAESAQSAEQSRIAAEDAVNRIPTV
VGPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGERGPGGDTGPAGPQG
PKGDRGERGETGLTGSTGPQGPKGDTGATGPAGPQGPKGETGAAGPVGAT
GPQGAKGDPGETQIRFRLGPMRIIETNSYGWFPGTDGALITGLTFLDPKD
ATQVQGMFQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
>ECs5533 hypothetical protein
MRDLQTSGIVGLSASKVGYRHSDRHGEVNDKKNVAVLPTVPASIRATKGS
TGGYTYQGNKVVLASPLVITGGNEISICIPRHASHQFQLLMS
>ECs2801 hypothetical protein
MRLASRFGYAANQIRRDRPLTHEELIRHVPSIFGEDRHTSRSERYAYIPT
ITVLENLQREGFQPFFACQTRVRDPGRRGYTKHMLRLRRAGEINGEHVPE
IILLNSHDGTSSYQMLPGYFRFVCQNGCVCGQSLGEVRVPHRGNVVEKVI
EGAYEVVGVFDRIEEKRDAMQSLVLPQPARQALAQAALTYRYGDEHQPVT
TADILTPRRREDYGKDLWSAYQTIQENMLKGGISGRSAKGKRIHTRAIHS
IDTDIKLNRALWVMAETMLESLR
>ECs0267 transcriptional regulator of cryptic csgA gene for curli surface fibers
MTLPSGHPKSRLIKKFTVLGPYIREGKCEDNRFFFDCLAVCVNVKPAPEV
REFWGWWMELEAQESRFTYSYQFGLFDKAGDWKSVPVKDTEVVERLEHTL
REFHEKLRELLTTLNLKLEPADDFRDEPVKLTA
>ECs1383 hypothetical protein
MVTICVDGENKTHKITDWTLWAGNDDREVMLTCHFRSGRKYTRPLSVCQI
TPTVNLRNVFLERKGNAVTSRAELVIIYGDKYAAVYYREGERPYIMKTTG
LDFQQCSAFTEHAVFNYLCRVANERIFYARGNNRNIDENILRQIKKIVPH
PDTALHAYCSGQSKNVIRHGV
>ECs1621 putative holin protein
MAIHMHEKESLAGAFWLVLLIIAGWGGLVRYLIDVKQSKATWSWINALAQ
IVVSGFTGVIGGLISIESGFSIYMILATAGISGAMGSVALTYFWERLTGV
KNAKS
>ECs3499 hypothetical protein
MFACTFFFLHGAGCLPVNSVSAPESAIITYTWLLIAPPHRGIHHAVVFLI
NSKQKNQALCRLFLFITGHSNTTMPQTRAPSVSTNIILLVKSIITITKSR
ITTASRHSL
>ECs2213 hypothetical protein
MSLKRMVNLTTRCPQWGRNNPAILTAAPFRGVPQPEAHVRRNLTTSLLRE
RASPYAFALCTDFSGKYPFSKLSVPDSYPCPAHDHT
>ECs2950 putative minor tail protein
MGRADWRAMLAGMTSTEYADWRRFYCTHYFQDTQLDMHFSGLMYAVLSLF
FCDPDMHPADFSLFAPEAEEGQAETPDENDVLMQKAAGLAGGVRFGEEGR
RL
>ECs1076 hypothetical protein
MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEAKTGATRYLMSVLAREL
VASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKKPVVVDPDLIWSLP
DGEIRRYDRQLNIICSECRNSEVMQRVLIFYTGVMME
>ECs1641 minor tail protein
MFLKTESFEHNGVTVTLSELSALQRIEHLALMKRQAEQAESDSNRKFTVE
DAIRTGAFVVAMSLWHNHPKKTQMPSMNEAVKQIEQEVLTTWPAEAISHA
ENVVYRLSGMYEFVVNDAPEQAEDAGPAEPVSAGKCSTVS
>ECs1509 hypothetical protein
MANAWLRLWHDMPNDPKWRTIARVSGQPIATVMAVYIHLLVSASRNVTKC
HGESLRGHIDVTTEDLASALDVTEDVIDSILHAMQGRVLDGDLISGWEKR
QVMKEDNGNVSQTAKSPAERKRAQRERERLREQNTDCHDESRRVTHMSRQ
VTTDTDTDKELNPTHNARMRESAPASESNGAPLQTAEPEYPDGLSEPIGK
FPMTGVWRPSLDFRQRAALWGMALPEPEFTPAELAAFRDYWMAEGKVFTQ
VQWEQKFARHVQHVRAQVKPVSKGVSHAAPGGTASRAVQEIRAAREQWER
ENGFISNGNGLEAVGTYGGGVFEPLDPEERGCAVEALDCSDWRDD
>ECs5180 hypothetical protein
MTLPTTIYSFPAYLSRFSSTDKPVKLKFHQYARATLLSNRGRDHNCDGRR
TVEIHKLDLSDWQAFNKLATKCNAYDGVTMNGDNSFGWNHEATLDNIHAQ
KYNKAYAGARLTAELKYLLQDVESFEPNSKYTIHEVVLGPGYGTPDYTGQ
TIGYVVTLPAQMPNCWSSELPTIDLYIDQLRTVTGVSNALGFIIAALLNA
YSDLPHDLKIGLRSLSSSAAIYSGLGFERVPQERDISCARMYLTPANHPD
LWTQENGEWIYLRN
>ECs0546 hypothetical lipoprotein
MMNSSIKSFSLLAVILLAGCSSPTSRIADCQAQGVSHDTCYLAEQQRQAA
ILSASEAQAFKNAEAAQHAQAAKKAIYKGFGMTFRMSSKNFAYLNDSLCA
IDEDNKDATVYQSGLYNVIVYHHTGKVALMKEGQFVGYLK
>ECs2041 hypothetical protein
MFQDCFTQWFFNLIAGVTRHDTAPPMKPEFQMVAAIRYIDALRFEPTSEL
ALFHIASLRLSISTK
>ECs3205 cell division protein
MIQPISGPPPGQPPGQGDNLPSGAGNQPLSSQQRTSLESLMTKVTSLTQQ
QRAELWAGIRHDIGLSGDSPLLSRHCPAAEHNLAQRLLAAQKSHSARQLL
AQLGEYLRLGNNRQAVTDYIRHNFGQTPLNQLSPEQLKTILTLLQEGKMV
IPQPQQREATDRPLLPAEHNALKQLVTKLAAATGEPSKQIWQSMLELSGV
KDGELIPAKLFNHLVTWLQARQTLSQQNTPTLESLQMTLKQPLDASELAA
LSAYIQQKYGLSAQSSLSSAQAEDILNQLYQRRVKGIDPRDMQPLLNPFP
PMMDTLQNMATRPALWILLVAIILMLVWLVR
>ECs2764 hypothetical protein
MKIKHEHIRMAINAWAYPDGEKVPAAEIARTYFELGMTFPELYDDSHPEA
LARNTQKIFRWLDKDTPDAVEKIQALLPAIEKAMPPLLVARMRSHSSEYY
REIVERRDRLVKDVDDFVAAAIAWGTLTNSGGQPGNAVVVH
>ECs1520 prophage maintenance protein
MLDTCRLASYAPKGKEKQAMKQQKAMLIALIVICITVIVTALVTRKDLCE
VRIRTGQTEVAVFTAYEPEE
>ECs0849 hypothetical protein
MELVTNCVVGNSWRITEMGNVAEDWQMAMIRFIIGFGDRLDGHF
>ECs2731 putative head decoration protein
MVTKNITEQRAEVRIFAGNDPAHTATGSSGISSATPALTPLMLDEASGKL
VVWDGQKAGSAVGILVLPLEGTETVLTYYKSGTFATEAIRWPESVDEHKK
ANAFAGSALSHAALP
>ECs1418 hypothetical protein
MSAEIFLITHPRVVFFFFCSYQSIKTHVNTLLSTQKIFCFLLNLRFPLAK
NTRQSFFLPVADCCDMSCAQAVTNLVFLQ
>ECs1195 hypothetical protein
MKQTFLLRNEAIRNNAIDAILSLPIDDKSPHEVHVKEPKRSKAQNDRMWP
MLNDVSRQVLWHGQRLAPEDWKDLFTALWLKTKKLEQRSVPGIDGGVVML
GVRTSKMRKASMTELIEIMFWFGSERNVRWSDDSRREYEWSQRKGRAA
>ECs0930 hypothetical protein
MRAIGKLPKSVLILEFIGMMLLAVALLSVSDSLSLPEPFSRPEVQILMIF
LGVLLMLPAAVVVILQVAKRLAPQLMNRPPQYSRSEREKDNDANH
>ECs3714 hypothetical protein
MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEE
KREEKFIQGYYDGYTKGIIDVMDNFIPLISLAMLRT
>ECs2975 antitermination protein
MRDIRQVLECWGAWAANNHEDVTWSPIAAGFKGLIPEKVKSRPQCCDDDA
MVICGCIARLYRNNRDLHDLLVDYYVLGGTFMALARKHGCSDTCIGKRLH
KAEGIVEGMLMMLGVRLEMDRYVERELPGGRTSVFYQRKNSLRS
>ECs2130 hypothetical protein
MQSLDPLFARLSRSKFRSRFRLGMKERQYCLEKGAPVIEQHAADFVAKRL
APALPANDGKQTPMRGHPVFIAQHATATCCRGCLAKWHNIPHGVSLSEEQ
QRYIVAVIYHWLVVQMNQP
>ECs1389 hypothetical protein
MMKQGMANSISMTWQLLCDEHGFTREHYNQLKKFSPETLREIIAEIASCH
PSTSVLLRNKWLTPPEDILEQITREYERRIQNCPPFRSEKEAESWLNELS
FAVIAPLYRVAPEQTEQVESFVLQLIAEQERVWELILTNDGYSWHCALYD
ILFHLVLRNMEDQPEHQKQRITKIFYEPDARLVAEEIKFRIQSLYDDEQK
RALSELVNDFTSKE
>ECs1113 putative minor tail protein
MLAGMTSTEYADWHRFYRTHYFQDTQLDMHFSGLTYAVLSLFFCDPDMHP
SDFSLLVPRHEEEQVERPDEDKMLMQKAAGLAGGVRFGGDGGRDILSSAD
VADVMVDDAALMMASAGIPGGVRYVPAGW
>ECs5542 hypothetical protein
MEKKTKNRVKALYRDIKNERFFTVFIGGLVIIVANQHPGMLFKTNRQCIM
LGRINIVFRRNDYGYIII
>ECs2760 hypothetical protein
MKIRYHDFGPVSHMLISSTVLETRKHNHILDMLRLADPYLVINTSGIFFL
RSTVSGKTSHVLRAYKTAVREEGE
>ECs5446 hypothetical protein
MSPSPPTENESKGKQKSRQCHPLTANAGSRDYGIQALLKMPDNIPASPDS
GYK
>ECs2701 hypothetical protein
MTGSSNVEKCDFYHVIVLSLNFPGYLKMEYGSTKMEERLSRSPGGKLALW
AFYTWCGYFVWAMARYIWVMSRIPDAPVSGFESDLGSTAGKWLGHWSDFY
LWLWSGHC
>ECs1634 major capsid protein
MTSKETFTHYQPLGNSDPAHTATAPGGLSAKAPAMTPLMLDTSSRKLVAW
DGTTDGAAVGILAVAADQTSTTLTFYKSGTFRYEDVLWPEAASDETKKRT
AFAGTAISIV
>ECs2716 hypothetical protein
MNILKKLMQRLCGCGKHDDREHGELLTAQLRLGPADILESDENGIIPEQD
RVITQVVILDADKKQIQCVVRPLQILRADGTWENIGGMK
>ECs1962 putative holin protein
MYQMEKITTGVSYTTSAVGTGYWLLQLLDKVSPSQWVAIGVLGSLLFGLL
TYLTNLYFKIREDRRKTARGD
>ECs3816 hypothetical protein
MGITSAGMQSRDAECGERIFTRTVRQVKQQTTVHYFVSPPRPPVKTNPQA
KTLISTRLEVATRKKRRVLFI
>ECs1636 DNA packaging protein
MTKDELIARLRSLGEQLNRDVSLTGTKEELALRVAELEEELDDTDETAGQ
DTPLSRENVLTGHENEVGSAQPDTVILDTSELVTVVALVKLHTDALHVTR
DEPVAFVLPGTAFRVSAGVAAEMTERGLARMQ
>ECs1733 hypothetical protein
MVVGEGLLSAARFALRVVACGNALSLALKSNLGRSFSSFPAWAEYLITDS
FESSGTFESDGGGGRITQRCALRPSGRCLRQRSLAGARVEP
>ECs3002 exonuclease
MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEIHNVIAKPRSGKKW
PDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSGVNV
TESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAI
KSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVIERNEKYMASFD
EMVPEFIEKMDEALAEIGFVYGEQW
>ECs1002 hypothetical protein
MEQLRAELSHLLGEKLSRIECVNEKADTALWALYDSQGNPMPLMARSFST
PGKARQLAWKTTMLARSGTVRMPTIYGVMTHEEHPGPDVLLLERMRGVSV
EAPARTPERWEQLKDQIVEALLAWHRQDSRGCVGAVDNTQENFWPSWYRQ
HVEVLWTTLNQFNNTGLTMQDKRILFRTRECLPALFEGFNDNCVLIHGNF
CLRSMLKDSRSDQLLAMVGPGLMLWAPREYELFRLMDNSLAEDLLWSYLQ
RAPVAESFIWRRWLYVLWDEVAQLVNTGRFSRRNFDLASKSLLPWLA
>ECs5012 hypothetical protein
MKKRHLLSLLALGISTACYGETYPAPIGPSQSDFGGVGLLQTPTARMARE
GELSLNYRDNDQYRYYSASVQLFPWLETTLRYTDVRTRQYSSVEAFSGDQ
TYKDKAFDLKLRLWEESYWLPQVAVGARDIGGTGLFDAEYLVASKAWGPF
DFTLGLGWGYLGTSGNVKNPLCSASDKYCYRDNSYKQAGSIDGSQMFHGP
ASLFGGVEYQTPWQPLRLKLEYEGNNYQQDFAGKLEQKSKFNVGAIYRVT
DWADVNLSYERANTFMFGVTLRTNFNDLRPSYNDNARPQYQPQPQDAILQ
HSVVANQLTLLKYNAGLADPQIQAKGDTLYVTGEQVKYRDSREGIVRANR
IVMNDLPDGIKTIRVTENRLNMPQVTTETDVASLKNHLAGEPLGHETKLA
QKRVEPVVPKSTEQGWYIDKSRFDFHIDPVLNQSVGGPENFYMYQLGVMG
TADLWLTDHLLTTGSLFANLANNYDKFNYTNPPQDSHLPRVRTHVREYVQ
NDVYVNNLQANYFQHLGNGFYGQVYGGYLETMFGGAGAEVLYRPLDSNWA
FGLDANYVKQRDWRSAKDMMKFTDYSVKTGHLTAYWTPSFAQDVLVKASV
GQYLAGDKGGTLEIAKRFDSGVVVGGYATITNVSKEEYGEGDFTKGVYVS
VPLDLFSSGPTRSRAAIGWTPLTRDGGQQLGRKFQLYDMTSDRSVNFR
>ECs3004 hypothetical protein
MHKASPVELRTSIGMAHSLAQIGVRFVPIPVETDEEFHTLATSLSQKLEM
MAAKAEANERDPA
>ECs3864 hypothetical protein
MPERYLTADTEGFRVLCERFWLCEPPRKSWRLNSLRKR
>ECs1304 hypothetical protein
MYNANPNYEMDFMILKDVNEHMEGLFQRFSKLLPFRIDFAYRKDTPSFGH
SCRHSMCIEMHRLLSETQTMLAGYYWVMEYTSNKGLHIHFIGYLDGQRHN
KSYRISRQLGDIWRRITEGEGYFHLCRAKDKYPVRIDHVIHYSDKSAVDD
LRYALSYLAKQDQKEHGIILRRSRLPEKSNRGRPRHN
>ECs2244 hypothetical protein
MTEADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVM
GGQAESSVSVQIDVYAGTVTQARQIRQDAREAIMLLAPGSVSEMQDYIPE
NRCYRATLEFQVTV
>ECs1510 putative replication protein
MKTDLSATETAWKLWELMGEVYSNRWTQKNGAAPSKLWIAQIGAMTEQQI
RQVCRQCMDRCRAGETWPPDLAEFVALISESGANPFGLTVDAVMEEYRRW
RNESWRYDGSDKYPWPQPVLYHICLEMRDRGIERQMTEGELKRLAERQLT
KWAKQVGNGMSIPPIRRQLASPKCPQGPTPIELLKQEYERRKAAGFV
>ECs2303 acid shock protein precursor
MKKQIEGMTMKKVLALVVAAAMGLSSAAFAAETATTPAPTATTTKAAPAK
TTHHKKQHKAAPAQKAQAAKKHHKNTKAEQKAPEQKAQAAKKHAKKHSHQ
QPAKPAAQPAA
>ECs2216 putative exonuclease
MSTDKEEFALYCEAKNDKVRKRLGIKGGFYWTTAKKLSVAISRCITAMDD
NDYDEDDFKKPVRVHLPVVNDLPPEGVFDTEFCNRYEKGGEDGITMVFIA
PSPSVQEKPASTDNTNVNGEDMTEIEENMLLPVSGQELPIRWLAQHGSEK
PVTHVAREELQALHIARAEELPAVTALAISHKTKLLDPLEIRDLHKLVRD
TDKVFPNPGNSDLGLITAFFEAYLDADYTDRGLLTKEWMKGNRVSRITRT
ASGANAGGGNKTDRNPNLVHTFDTLDVEIAAATLPMDFNIYEIPGSVYRR
AKEIVLKRESPFKEWSAALRATPGILDYSRAAIFALIRSAHPEFYHYPGR
LQGYINAYLTETDHENPSKETLTAARHTPEKDILEEVNRELSAKQETEEE
NDEEKPQPSCAMAE
>ECs0880 hypothetical protein
MKTINTVVAAMALSALSFGVFAAEPVTASQAQNMNKIGVVSADGASTLDA
LEAKLAEKAAAAGASGYSITSATNNNKLSGTAVIYK
>ECs4590 EspG
MILVAKLFITNQIGESLMINGLNNDSASLVLDAAMKVNSGFKKSWDEMSC
AEKLFKVLSFGLWNPTYSRSERQSFQELLTVLEPVYPLPNELGRVSARFS
DGSSLRISVTNSELVEAEIRTANNEKITVLLESNEQNRLLQSLPIDRHMP
YIQVHRALSEMDLTDTTSMRNLLGFTSKLSTTLIPHNAQTDPLSGPTPFS
SIFMDTCRGLGNAKLSLNGVDIPANAQKLLRDALGLKDTHSSPTRNVIDH
GISRHDAEQIARESSGSDKQKAEVVEFLCHPEAATAICSAFYQSFNVPAL
TLTHERISKASEYNAERSLDTPNACINISISQSSDGNIYVTSHTGVLIMA
PEDRPNEMGMLTNRTSYEVPQGVKCIIDEMVSALQPRYAASETYLQNT
>ECs0292 hypothetical protein
MVRIYNSSLEVACRMAKVLVAIYPSSLSLERLICFDFILVNLKDFLPEEI
SLHPPIPRRDAQLALKREIVLESLALLQGYELASKIYTHRGFVYKASEKT
YAFTNSLHNEYVAQMEHNINLVVKLYSDIPDEQLQSIIKNKIGKYDMEFN
YE
>ECs1616 hypothetical protein
MNLPQDGIKLHRGNFTAVGQQIQTYLEDGKCFRMVLKPWRERRSLSQNAL
SHMWYSEISEYLISRGKSFATSAWVKDALKHTYLGYETKDLVDVVTGEIT
TIQSLRHTSDLGTGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQE
A
>ECs2757 hypothetical protein
MSVIKEMPVERNEYGCWTHPEYEKFCDGREYISTEEFNAWMEKNNLQWAI
RSMDEDDFNLDADGPDISAWEPERPEGEGWFIGSIHDTEDGPVCVWLRNK
AEA
>ECs0326 hypothetical protein
MQAASQREETEWRVQSKRGLMPAYRGEAGQQVNINIMEYSERNVRQLTLN
EQEDTSPGKLMLV
>ECs4701 ilvGEDA operon leader peptide
MTALLRVISLVVISVVVIIIPPCGAALGRGKA
>ECs1425 mdoC, glucans biosynthesis protein
MNPVPAQREYFLDSIRAWLMLLGIPFHISLIYSSHTWHVNSAEPSLWLTL
FNDFIHSFRMQVFFVISGYFSYMLFLRYPLKKWWKVRVERVGIPMLTAIP
LLTLPQFIMLQYVKGKAESWPGLSLYDKYNTLAWELISHLWFLLVLVVMT
TLCVWIFKRIRNNLENSDKTNKKFSMVKLSVIFLCLGIGYAVIRRTIFIV
YPPILSNGTFNFIVMQTLFYLPFFILGALAFIFPHLKALFTTPSRGCTLA
AALAFVAYLLNQRYGSGDAWMYETESVITMVLGLWMVNVVFSFGHRLLNF
QSARVTYFVNASLFIYLVHHPLTLFFGAYITPHITSNWLGFLCGLIFVVG
IAIILYEIHLRIPLLKFLFSGKPVVKRENDKAPAR
>ECs0434 psiF, phosphate starvation-induced protein
MKRDGAMKITLLVTLLFGLVFLTTVGAAERTLTPQQQRMTSCNQQATAQA
LKGDARKTYMSDCLKNSKSAPGEKSLTPQQQKMRECNNQATQQSLKGDDR
NKFMSACLKKAA
>ECs1882 pspB, phage shock protein B
MSALFLAIPLTIFVLFVLPIWLWLHYSNRSGRSELSQSEQQRLAQLADEA
KRMRERIQALESILDAEHPNWRDR