TitleGenColors Logo

Gene list

Applied filters:

COG category: NOT ANALYZED
Gene type: CDS

Number of genes found: 875

Free access
Sort by:

 



# Buchnera aphidicola str. Cc (Cinara cedri), Cc

>BCc_196 aspS, AspS
MRTKYCGEIKKKDINKIITICGWVHKIRSLKNIIFIDIKDITGIIQVIFL
KKYNSNFSLVKTIRNDYCIKIKGLVILKKKLTYFNTQEKENFFEIIAFKL
KIFNPSLPIPFDYKKENKEDIRFKFRYLDLRRFYMFHKLQTRSKITTIIR
NFFIKKKFLEIETPILTRSTPEGAKDYLVYSRLYPGKYFALPQSPQLFKQ
LLMISGIDRYFQIAKCFRDEDLRSDRQPEFTQIDIEIAFKKKKFLQKLIN
KLIKKLWLKINNYTIKKIKNITYKESILKYGTDKPDLRNPIQFVELYDIF
NNFYKKFFPWLLIKNINRIISIKIPEGIKKIRDNKIYKYKKFLEKLNFNY
FIYVKIINIKKYIIKDNRNIFSKIDKKTLKEFFLKTSAINGDIIFIIAEK
YNIANNISNLLRKKIGIDLKITNLKKICPVWITKFPLFKLDKKNNLKSVH
HPFTSPEKNSKNIELLNPLKIISSAFDLVINGYEIGGGSERIYNYQLQIK
IFNILKINKNKQKKKFGFFINSLQYGAPPHLGLALGLDRLVMLLTNCNTI
SDVIAFPKTNSAICLMTGAPC
>BCc_290 cyoB, CyoB
MFGKLTINSIPYHEPIIIGTCFLIFLFSLLVCVYITYVKKWKYLWNEWFT
SVDHKKISVMYFILSFVMFLRGFSDAIIMRLQQFFSSLPEHTSIFSSDHY
DQVFTAHGVIMIFFVAMPLVIGLMNLVIPLQIGARDLAFPVLNNISFWLT
ASSSVLLMISLGVGEFAKTGWLGYPPLSGIIYSPGVGVDYWIWVLQISGI
GTILTSINFIVTIINLRAPGMSMFKLPVFTWTSLCTNILILISFPVLFIT
LLFLSLDRYAGFHFFTNDLGGNMMLYVNLIWIWGHPEVYILILPIFGIFS
EVVSTFSEKSLFGYISLIWATIVITVLSFIVWLHHFFTMGSGANVNSFFS
ITTMIIAIPTGVKIFNWLFTMFRGRIRFHSSMLWTIGFLISFTIGGMAGV
LLSIPPIDFVLHNSLFLVAHFHNVIIGGVVFGCFAGITYWFPKIFGFKLN
EYWGKCAFVFWITGFFTAFMPLYILGCMGMTRRLSQNIDCEYHGLLTISA
IGVFFIFLGILFQIIQIIVSIKNRNTLEYKDNTGDPWNGRTLEWSIPSPP
PIYNFAIIPQIKTIDNFWFDKVNNKGNIKNNIFTKIHMPSNSYFGIAIGC
CSTFFGFSIVWHIWWLVWVSLFGIFFILLKKFFTFDKGYFISIKKIKDIE
NKHLLKK
>BCc_289 cyoC, CyoC
MNNKKDYVSYSSLKNNSILGFWIYLMSDCIIFSVLFIVHIIMSRHGYNGF
LRENKLFSKFVLFLETLILLFCSLSFSLIKYFLKKSYKKCILFFLFITLI
LGSTFISLEFFEFLNIIEKGFFPNTNGYFSSFYVLLCIHGLHVIFGLLWI
IISIFQFFFLKFSNFLYISLICCCLFWHFLDIIWIILIFCVYFN
>BCc_266 engB, EngB
MNNINFYNMKFITSIINYSNLGTFSGIEVAFLGYSNVGKSTMINCLSGNN
KIARISKLPGRTKTINFFHVLSDFRIVDFPGYGYSKINYLDKKLLEKSLF
FYLKNRECLKGIVLLSDIRFFLKSFDEFILQFLKKRNLFVLIILTKSDKI
SKREKKKIELLKYNKISHLHMNITICSFSKFYKNNIQFIRKILSKWYFLS
KI
>BCc_182 fldA, FldA
MKKIGIFYGSDTGNTEKIAYKIHKQLGKNNSKIFDISDSSLKKIKKFNIL
FFGIPTWYYGELQCDWPDSLSILKKIDLKNKIIALFGCGDQKDYSEYFCD
SIGIMYDILKKKNAKLVGKWSTKNYKYEYSKAQKNKSYFFGLPIDEDNQS
KLTNKRIKKWLQLVIQEINNILKNNFLHK
>BCc_043 fliF, FliF
MNFIKKLFFQIKKKLNYLFFCILKKIKYIFFVSLISIISIISIFLWSKKV
HYVVLYNNLSQEDSKWIISRLKILHIPYRCNNSFKTLLVPEDKINILNLS
LLKNNNILKKNFGFELLDKEKFGISQFHEHINYHRGLEGELSKTLEQIFP
IQCARVHLVCKKDTDFFENNQIPSASVIITLFPNMHLTYEQINAIILLIS
GSVPDLSSDNIVLVDQFGNILNKYDLNHTKFFNNSQYKKITILEEYYSQR
IKDILLPILDSKDFVVQVTTKLKKDNHASTFSDKNYNVNQKIDNSFISPN
FKELNIKNIKITVLINYKKNNSGKMVPLSKNELNNIENLVKSVINLSKNK
LNNIENLVKSVINLSKNKGDRINLINYMFLDTPSVIDASYSLKNKNLNFS
LILFYFFVFFLFFLILKNIFDSYIFNKNKKKNIINIPVKDNKKKKIINDI
NNQLNNQEKNKNYFLSKKNIILKKSNLIKDIIQYWIKKK
>BCc_046 fliI, FliI
MNFLLNSWFYNTNIFEKNLLNISNSIYIGKVLSVHSLIIEVTGIYSSIGE
YCWVECFYKGIQSTIICKVMGFKKKIFFLIPIQNSYGIFPGAKVFSENYI
FNKDIKFQYFPFGSKLLGRVLNGFGHPLDNLGDLNLKKKLFNFFKKKPIN
PLNRKPITEILDTGVCAINSLLTVGRGQRMGIFSQAGIGKSMLLGMISRH
TDADIIVVSLVGERGREVKDFIDNILGKDSLKKSVVIVSSADVSPMFKIQ
SVEYATAVAEYFCNKGNNVLLIVDSLTRYAMAYREVSNSLYEIPVKRYPA
SIFSNIPYLIERTGNIDNKSGSITSFYTILTEGDEYNDPILDITKSVLDG
HIILSNVLSESGHYPAINIEKSISRLMSSIVDHDHYQYSIYIKKLISCYY
KNYDIINLGVYTSGKNKLLDQAITLWPFLEKFLQQKFLDCCTYNQSILKL
KNLLKII
>BCc_186 gpmA, GpmA
MKKKKIILMRHGESKWNKLNKFTGWQDIGLTKNGKKEAKLAAKLIKKNNF
IFDIAYTSILKRAIYTLWIILKKTNQIWIPVYKSWKLNERNYGALEGLNK
EKIKKKYGDEQVQLWRRSFTVCPPNLNISNKYHPIYDIKYKKLKKHELPT
SESLEMTFNRVIPFWEFKILPQLEKNKNILIVAHGNSLRALIKYLGNISD
SDIIDLDISTGKPLVYEFSNTNKPLKYYYL
>BCc_011 groEL, GroEL
MAAKDVKFGNEARIKMLHGVNVLADAVKVTLGPKGRNVVLDKSFGPPSIT
KDGVSVAREIELEDKFENMGAQMVKEVASKANDAAGDGTTTATLLAQSIV
NEGLKAVAAGMNPMDLKRGIDKAVIDAVDELKKLSVPCADSKAITQVGTI
SANADEKVGSLIAEAMEKVGNDGVITVEEGTGLQNELEVVKGMQFDRGYL
SPYFINQPETGLVELENPYILMVDKKISNIRELLPILEAVAKSSKPLLII
SEDLEGEALATLVVNSMRGIVKVAAVKAPGFGDRRKAMLQDISILTGGSV
ISEELAMDLEKSSLEDLGQAKRVVINKDTTTIIDGNGNKEAIKSRISQIR
QEINEATSDYDKEKLNERLAKLSGGVAVLKVGAATEVEMKEKKARVEDAL
HATRAAVEEGVVPGGGVALVRVAEKISRINGQNEDQNVGIRVALRAMEAP
LRQIVANSGEEPSVVTNNVKDGHGNYGYNAATDEYGDMISFGILDPTKVT
RSALQYAASVAGLMITTECMVTDLPKDEKSSSELNSAPGNGMGGGMGGMM
>BCc_119 grpE, GrpE
MIEKNEKKKILKKDKKKKITIKKIEKKISLLIKEKKNIRLRHYANIENII
KKNASEIKFIKTNMFENFLNSIFSIINKIDLLTINLKNMSSTQKSLFEGI
KLTKNIFEKNLKNWKIKKINKINIPFNEKIHKIKKNEKKNSISKNKKIKN
IIKPGYILKNKVIKKAIVLL
>BCc_116 gyrA, DNA gyrase, A subunit
MKDIAKEIKQVNIEEELKRSYLDYAMSVIIGRALPDVRDGLKPVHRRILF
AMYILNNEWNKPYKKSARIVGDVIGKYHPHGDSAVYDSIVRMAQKFSLRY
TLIDGQGNFGSIDGDPAAAMRYTEIRMSRIAHEMLSELEKNTIDFVLNYD
GTEKIPEILPTKIPNLLINGSSGIAVGMATNIPPHNLKEVINGCIAYLYN
PSISLKELMLYIPGPDFPTSGIIYGKNGIKEAYKTGKGKIIVRSKYNIEI
NKKNKKESLIIYELPYQVNKSKLIEKIAILVKEKKINGITNIRDESDKDG
MRIVIDIKKDFISQIILNQLYTLTSLQTSFGINMVALSSGKPKTMNLKKI
LKEFIKHRKKIIKRKCLFKLKKSNKKMHLLEGFAIALNNINTIIHLIKNS
ENHTIAKKKLKKINWKKENNSSKKNKKYFYFSNKQSLAILNIKLNKLTSL
EKKKINLEYNKLIKKTIKLKKILSNKNILEKKIEKELNEIKKKFSDTRKT
KIISKMSEINIEDMIVKETVVVTLSYSGYVKYQPISDYNAQKRGGRGKSA
AKTKEEDYITNLLVANSHDTILCFSSRGLLYWMKVYQLPETSRNARGKPI
VNLLPLTSKERITTILPISKYKSSINIFMATALGFVKKTSLIQFKKPRNS
GIIAINLRKNDKLIGVSLTTGNNNICMFTSKGKVVQFSEKTIRKMGRTAS
GIRGMKITNNDRLVSLLVPNKKDDILIVTANGYGKRTNINQFPIKSRATK
GVLSIRVTQKNGVVIGAIQVKQNDQIMIITNAGTLVRTRVSEIAILKRNT
QGVILIRTIKKEKVVGLQKLSNKPIIS
>BCc_377 hlsU, ATPase component of ATP-dependent protease
MSEMTPRKIVKELNKYIIGQNNAKRAVAIALRNRWRRMQLNSELRNEITP
KNILMIGPTGVGKTEIARRLAKLANAPFIKVEATKFTEVGYVGKEVDSII
RDLTDLAIKMIRLQIIKKNKKHAKKRAEERILKILIPVPKDNWNEENLKE
KPEKTIQIFRKKLQEGKLDNKEIEIQIAATPIGIEIMSPPGMEELTNQLQ
SLFQNLSGKKKNLRKLKIKDAMKIIIEEEAAKLINLEELKEKAIYSVEQN
SIVFIDEIDKICKHHSSASNSDVSREGVQRDLLPLIEGCTVSTKHGSVKT
DHILFIASGAFQTSTPSDLIPELQGRLPIRVELNALTVDDFERILTEPNA
SITTQYKALIKTEGVDIIFTKKGIRKIAEASWKINESMENIGARRLYTVL
EKLMEDISFNSNEKFGQKIYIDEKYVNLHLDKLIENEDLSRFIL
>BCc_371 miaA, MiaA
MNKKKFLFFLMGPTAIGKSSLALEIKKKFPLIELISVDSKLVYKGLNIGT
DKPNKSDLKNFSYKLVNIVKPKNIYTVINFYNDVLKEIKNILKSGKIPLL
VGGTMLYFKILLNGFANLPPSNSIIRKYIFKNICLKKKKNLFNLLKKIDP
ISSKKIHINDVQRVLRAVEVFFVSGGFPLSELIKFFHNKLPYKVFQFGLI
PDNKEHLYKKIEKRFFFMLKSGFKKEVQNLYNQKFLDPKLPSMNSIGYKQ
MLLYLKNKYTYFQMIKETIKSTHKLVKHQLTWLKKWPNIIFIKDNKKDLL
ITKIYKILNRNL
>BCc_098 nuoB, NADH dehydrogenase I chain B
MKYTLTRVNTKNSISKKYPLRSLKKTSDPIKKQIKNNIFFGTISKYMQYI
MNWGRKNSLWPYNFGLSCCYVEMVSAFTSIHDISRFGSEVLRTSPRQADF
MVIAGTPFIKMAPVIQRLYDLMLEPKWVISMGSCANSGGMYDIYSVVQGV
DKFLPVDIYIPGCPPRPEAYIHAITLLQKAISKERRPLSWIVGEQGVYKY
NMLSEKEEKNKKRISIINIDSEDSF
>BCc_102 nuoG, NADH dehydrogenase I chain G
MVKIFIDGKIFFVKSSYNLLQACLSKGFNVPFFCWHPALGSIGACRQCAV
KVYQNKEDDVGSIVMSCMSVVQKNMRISLIDSDVKKFQKDIIELMMLNHP
HDCPVCAEGGSCHLQDMTVLNKHHIRRYIFKKRIFKNQYLGPLISHNMNR
CITCYRCIRYYKDYSGGKDFGVYGSNNKIYFGRLEDGFLESEYSGNLIDI
CPTGVFTDKTNINNFHRKWDLQYTPSICHNCSIGCNISIGERLGKVCRID
NRYNLSINKYFLCDLGRFGYNYINYNKVYQPFEKKNNIIKFLKYSKIISL
IINIFRNSSNKILGIGSDRASIESNTLLCQLVGEKNFSNGMLPQLNQCIS
IITKILKSNEFIIPSISEIKNYDVILIISEDITQTASLAALAVRQAINGF
HSFILKDTSIPSWNSNAIKNILQNKKNYLFIIQGDKSKLDDISNINYYGS
IEEQLQFCLLLLDRINNSSNLFSYQNHKICKKIKFIAKILCQAKKPLIIS
GTSYNNLNILKISCNIARSLKKRGLPVGLMLFPPAANSIGISLIPSISLD
SMFNIINSNKVDILIILENNLHKLYEATVINKIFEKVNKIIVLDHYNTKI
VNKSDFFLPTTNFAESSGNILNYEGRLQRFFKVYNPNFYKKKIYKLEGWR
WIYAILNNIKSFKLIQSISIDKIIKFCSKKNSYFKYLKNSSPSAKFRINN
QKIARSSLRYSGRTAMFADINMHEPKSPIDVDSMFSFSIEGSQQTEKSFS
FSPFLWSPNWNSQQSLYKLSIQSKNQFFLYNEGILIFKKYKEKHLSNFFI
KKNILNKKISVNSFKIIPYYKLLGSEEISHNYFVNILKIKFFYAVLNRTD
AKKMNIYNNDILQFQINDNIFKFPVKLSSYINSCHIGLPVGIGNIPMIFL
NKIAIKLMKYNI
>BCc_232 nusA, NusA
MNKEILSVVDAVSHEKSIPREKIFQALESALEIATKKKYNQDINIRVCIN
RSTGSFNTYRRWLVVNNVFNPTKEITLEAARFENKKIQLCDYIEDCIDSV
TFDRIATQIAKQVIIQKVREAEKEIILNKFDKKKGQIIIGIVKKISRDYI
ILDVGNNIEGIIMREDMLPRENFRINDRVRGILYNISYEKQGAQLFISRS
KSDMLIELFRIEVPEIREKFIEIKAIARDPGLRSKIAVTTYDERIDPVGA
CVGMRGSRVQAVSNELCGERIDIILWDNNPKKFVINSMAPADVSSIFLDN
TNHVINIEVKLCNLAQAIGRNGQNVRLASQLTGWELNIITSNPIIKIKKS
KKNNFFDILKNKFNFTENDILLVIHSGFSSIKSIAYASINQLLSIKGIKT
DVVLNMQKKAIFILENDKKKSIKIYKKKYLNSEFANLKNINKFIIQQLIE
KKICTLEHLAEQSIDDLHDISFLFFRTSWSLIMEARNICWFNENS
>BCc_331 rplN, 50S ribosomal protein L14
MIQVQTILNVADNSGARLVMCIKVLGGSRRRYANIGDIIKVAIKEAIPRG
KVKKGEVVKAVVVRTKKGIRRTDGSMICFDNNSCVIVHDTTNQPIGTRIF
GPVTRELRVEKFMKIISLAPEVL
>BCc_323 rpmD, 50S ribosomal subunit protein L30
MKTILITQIKSQIGRLPKHKATMKGLGLRNIGDTVERKDTAAIRGMIKKV
YYMISIK
>BCc_346 rpsL, 30S ribosomal protein S12
MSTINQLVRFSRVRKVTKSNVPALSKCPQKRGVCTRVYTTTPKKPNSALR
KVCRVRLTNGHEVTAYIGGEGHNLQEHSVILIRGGRVKDLPGVRYHVIRG
ALDCSGVKDRKKGRSKYGVKKLKV
>BCc_332 rpsQ, 30S ribosomal protein S17
MNEKKNLLNGYIVSNKMNKSAVVIVERKIKHSIYKKFIKKRTKLCIHDEK
NICNIGDIVTIRECRPISKTKSWILVNILEKSIV
>BCc_025 secE, SecE
MHIQNIHKIYINHAEKIKWLCIFLIILSIIVNYYFFVYKFLKITKIIYFS
TLLILIVSIFINTNIGQHTLKFIHSIKIELSHIIWPNYKETLKITGIILL
LTILTSIFLWLLDGIILRIISWVLTPRL
>BCc_008 thdF, GTP-binding protein
MKFFDTIVSPATVIGRSGIGIIRISGISVLKIIKKFLKISMKERFAYFSS
FYDVKNNLLDQGIALFFLAPKSFTGENILEFHSHGNPIILDLLIKNILTI
KNVRIANPGEFSKRAFLNNKIDLVQAEAINDIINAESHLSVKAALSSLRG
TFSKKINKILFNLKDIYSEIEAIINFPEELNDLNIQKNIKKKLSFIIKMI
TNLLDETHKNYIFSNTIKIVIAGPPNVGKSSLLNFLSKEKVSIVTNIPGT
TRDVIHKNIWFNGVCCEFLDTAGLQKSQDIIEVIGIKLAKKHIKSCNHIF
LMFDVTKKKMINNNFIKNIVNNLKKNQNITFIFNKIDLINKKPYISIIYK
KFECIYLSLKKNIGIEYLKNKILEITTLHNNVESTFLAKKRHISALKKSL
MYLINGKRNWMKNLYLELLSDDIRLSIKYLLKITGKFNSEDLLDKIFSKF
CIGK
>BCc_137 yabC, S-adenosyl-dependent methyl transferase
MSHIPVLLKETINALNIKKNGIYIDGTFGNGGHTKEILKYLGQSGKLYSI
DQDIKSVNKGKKIKDKRFSIIFGKFSKKIPYIYKKNKKKKIDGILLDLGI
SSNQINQNDRGFSFMKNGLLDMRMNNTTGIPAWKWLKKSSQKEIEKVLRK
YGEERYSKKISYAIYNRNKKKTITQTLDLVQIIKKAIPKIDKYKHPATRT
FQAIRIHINNEIKELKKTLKISIKILKKKRRLVIICFHSLENRIVKNFFK
KHGKTFFIPRGLPITEKKIKKIENKKIKIIKKIKPKKTEIQKNKRSRSAI
LRIAEML
>BCc_268 ygfZ, YgfZ
MKNNVLNNNKIYLSKNLSSTFIELDQWSIIRVKGKDKRNYLNNQFTININ
TINKNKYKIGAHCNINGKVLAIFFIFKYKDSFFYIINNSVCDKHLIELKK
YSLFYKIKIFKEKKFHLFGLCGSNSYYLLKNFFFIHFKKKNMVTKIKNII
FLKINYPVKRFLILTKGNMLHNFLNDNKKKILFSNNKQWISLDIESSFPI
VNKTISGRFILQTLDLKKWNAISFTKGCYYGQEMLCKYENKKINKFIICA
LIGRIGNTIPINNENVKYKDKEGNKYISGIILSWVKVYKNKILLQVRMKE
KFFNKKNNFYLSSNKENFYKIYII
>BCc_241 yhbZ, GTP-binding protein, Obg family
MKFVDSVIINVSAGKGGDGCISFRREKFVPKGGPDGGNGGDGGNIWIVSN
TNINTLTDYRIKKIFQSENGKNGLNSNRSGKNGKDIYIPVPLGTRIIDND
TKKIIIEILKINQKFLVAKGGKRGLGNTNFKSSINQTPRKKTYGTLGENK
NILLELILIADIGTLGLPNSGKSTLITSMSNAKTKIDIYPFTTLIPILGT
VKAKKKKFIIADIPGIIKNASLGIGLGIKFLKHLSRCKLLLHIIDITINK
KKINKIRYIILKELKNFNKKLFQKTRWLIFNKIDLLKPKKISQIKKYIKK
KIEKNKKYYFISAKKNIGIKKLSKDIIKYLYKKNKEYI
>BCc_007 yidC, YidC
MLQKYFFTTRILKYLKLYTFYSHQLNDFFSSFNFISKLSYFKSYILYNIF
SNFLCKLEMQINLSKISDNLSLINRYGKLHIIAYPLFHLLNFFYKFFNNW
GIAIIFVTILIKIIIYPLTKLQYTSVLQMKLLQPKIDILKNKYADNKDKM
NKKILELYSSKKFNPFNSFFSFLIQTPIFLAFYSVLSSSVELKNAPFFLW
IKDLSSYDPYHVLPLLMGISILLTQISEIDDKTTRKRKFLSFFSVFFAAF
FLWFPSGLILYYITSNIVTLIQHWFIRIQFFKKNT
>BCc_055 yraL, YraL
MVPTPIGNILDITYRSINILKKVDFIISENKKYTSILLKKFFIIKKMFSY
HLVNEKLKTKKYIKMLKDGKNLALVSNAGTPLINDPGYLLVKQAYSNNIR
VVPLPGACAAISALISSGLKTNRFCYEGFLPKKKKKLKKKIYSLNNEKRT
TILYESPKRLINTIKLIKKIMGPKKKISISKELTKKWEKIKTGTTQKFLK
KIKKNNYWKKGEILIIINGIKNKKKKISSKILQTLSILKKEMSLSKAIKI
TAKICRFNKNILYNKIIKKKNDKVS


# Escherichia coli 536, 536

>ECP_0628 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase
MDFSGKNVWVTGAGKGIGYATALAFVEAGAKVTGFDQAFTQEQYPFATEV
MDVADAAQVAQVCQRLLAETERLDVLVNAAGILRMGATDQLSKEDWQQTF
AVNVGGAFNLFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAA
LKSLALSVGLELAGSGVRCNVVSPGSTDTDMQRTLWVSDDAEEQRIRGFG
EQFKLGIPLGKIARPQEIANTILFLASDLASHITLQDIVVDGGSTLGA
>ECP_0038 crotonobetainyl-CoA:carnitine CoA-transferase
MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIENVAWAD
TIRVQPNYPQLSRRNLHALSLNIFKDEGREAFLKLMETTDIFIEASKGPA
FARRGITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGY
LIQNGDVDQPMPAFPYTADYFSGLTATTAALAALHKVRETGKGESIDIAM
YEVMLRMGQYFMMDYFNGGEMCPRMTKGKDPYYAGCGLYKCADGYIVMEL
VGITQITECFKDIGLAHLLGTPEIPEGTQLIHRIECPYGPLVEEKLDAWL
AAHTIAEVKERFAELNIACAKVLTVPELESNPQYVARESITQWQTMDGRT
CKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDIQELVSKGL
AKVED
>ECP_0281 phospho-2-dehydro-3-deoxyheptonate aldolase, Trp-sensitive
MQSVSKSIYRGKLLGSLPAVGEIHKEIAVSEETVTWISLQREIIANILLG
KDPRLLVIVGPCSIHDVQAAVEYAKRLSVLQNKYLSQMYIVMRTYFEKPR
TRKGWKGIMHEPDLNGSYNVEKGIRYARQCLSSITTMRVATATEFLDPFL
TPYIADLICWGAVGARTTESQTHRQLASGLHCPVGFKNSTDGNINLAIDA
ILAAREQHVVYMTSLTKCISTLLTDGNPHGHLILRGGREPNYGLSDITKA
VKLMHDEGINHRLIIDCSHGNSGKVAERQISVAREVIDNRKKMPGYVAGI
MLESFLQGGKQSDSLPREYGQSVTDECLSWQQTEQLLSTLAAQL
>ECP_3477 protein transport protein HofQ precursor
MKQWIAALLLMLIPGVQAAKPQKVTLMVDDVPVAQVLQALAEQEKLNLVV
SPDVSGTVSLHLTDVPWKQALQTVVKSAGLITRQEGNILSVHSVAWQNDN
IARQEAEQTRAQANLPLENRNITLQYADAGELAKAGEKLLSAKGSMTVDK
RTNRLLLRDNKTALSTLEQWVSQMDLPVGQVELSAHIVTINEKSLRELGV
KWTLADAQQAGGVGQVTTLGSDLSVATATTHIGFNIGRINGRLLDLELSA
LEQKQQLDIIASPRLLASHLQPASIKQGSEIPYQVSSGESGATSVEFKEA
VLGMEVTPTVLQKGRIRLKLHISQNVPGQVLQQADGEVLAIDKQEIETQV
EVKSGETLALGGIFTRKNKSGQDSVPLLGDIPWFGQLFRHDGKEDERREL
VVFITPRLVSSE
>ECP_3750 ATP-dependent DNA helicase RecG
MKGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLHLPLRYEDRTHL
YPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGSGILTMRFFNFN
AAMKNSLATGRRVLAYGEAKRGKYGAEMIHPEYRVQGDLSTPELQETLTP
VYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQGMMTLPEALRT
LHRPPPTLQLSDLDTGQHPAQRRLILEELLAHNLSMLALRAGAQRFHAQP
LSANDALKNKLLAALPFKPTGAQARVVAEIERDMALDAPMMRLVQGDVGS
GKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRNWFAPLGIEVGW
LAGKQKGKARLAQQEAIASGQVQMIVGTHAIFQEQVQFNGLALVIIDEQH
RFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTAYADLDTSVIDE
LPPGRTPVTTVAIPDTRRTDIIDRVRHACITEGRQAYWVCTLIEESELLE
AQAAEATWEELKLALPELNVGLVHGRMKPADKQAVMASFKQGELHLLVAT
TVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGAVASHCVLLYKT
PLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTRQTGNAEFKVAD
LLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETERYSNA
>ECP_2354 FolC
MIIKRTPQAASPLASWLSYLENLHSKTIDLGLERVSQVAARLGVLKPAPF
VFTVAGTNGKGTTCRTLESILMAAGYKVGVYSSPHLVRYTERVRVQGQEL
PESAHTASFAEIESARGDISLTYFEYGTLSALWLFKQAQLDVVILEVGLG
GRLDATNIVDADVAVVTSIALDHTDWLGPDRESIGREKAGIFRSEKPAIV
GEPEMPSTIADVAQEKGALLQRRGVEWNYSVTDHDWAFSDAHGTLANLPL
PLVPQPNAATALAALRASGLEVSENAIRDGIASAILPGRFQIVSESPRVI
FDVAHNPHAAEYLTGRMKALPKNGRVLAVIGMLHDKDIAGTLAWLKSVVD
DWYCAPLEGPRGATAEQLLEHLGNGKSFDSVAQAWDAAMADAKAEDTVLV
CGSFHTVAHVMEVIDARRSGGK
>ECP_1503 trans-aconitate 2-methyltransferase
MSDWNPSLYLHFAAERSRPAVELLARVPLENIEYIADLGCGPGNSTALLD
QRWPAARITGIDSSPAMIAEARSALPDCLFVEADIRNWQPEQALDLIFAN
ASLQWLPDHYELFPHLVSLLSPLGVLAVQMPDNWLEPTHVLMREVAWEQN
YPDRGREPLAGVHAYYDILSEAGCEVDIWRTTYYHQMPSHQAIIDWVTAT
GLRPWLQDLTESEQQHFLTRYHQMLEEQYPLQENGQILLAFPRLFIVARR
TE
>ECP_2268 acetyl-CoA acetyltransferase
MKNCVIVSAVRTAIGCFNGSLASTSAIDLGATVIKAAIERAKIDSLHIDE
VIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALATQ
AIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLM
CATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEI
VPVNVVTRKKTFVFSQDEFPKADSTTEALGALRPAFDKAGTVTAGNASGI
NDGAAALVIMEESAALAAGLNPLARIKSYASGGVPPALMGMGPVPATQKA
LQLAGLQLADIDLIEANEAFAAQFLAVGKTLGFEPEKVNVNGGAIALGHP
IGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN
>ECP_2623 hypothetical protein YgaF (putative GAB DTP gene cluster repressor)
MYDFVIIGGGIIGMSTAMQLIDVYPDARIALLEKESGPACHQTGHNSGVI
HAGVYYTPGSLKAQFCLAGNRATKVFCDQNGIRYDNCGKMLVATSELEME
RMNALWKRTAANGIEREWLNAEELREREPNITGLGGIFVPSSGIVSYREV
TAAMAKIFPARGGEIIYNAEVSALSEHKNGVVIRTRQGGEYEASTLISCS
GLMADRLVKMLGLEPGFIICPFRGEYFRLAPEHNQIVNHLIYPIPDPAMP
FLGVHLTRMIDGSVTVGPNAVLAFKREGYRKRDFSFSDTLEILGSSGIRR
VLQNHLRSGLGEMKNSLCKSGYLRLVQKYCPRLSLSDLQPWPPGVRAQAV
SPDGKLIDDFLFVTTPRTIHTCNAPSPAATSAIPIGAHIVSKVQTLLASQ
SNPGRTLRAARSVDALHAAFNQ
>ECP_1289 hypothetical protein
MRFLLANTFCKIVKQFSFIDRIRIYSLHPCLNKHLLIQLNRDWLSMSLFS
DYSSSSEMHNNLTIDYYLALSSTKGSGITNIISIILQQAQDYDVAKIT
>ECP_2465 ethanolamine utilization protein EutG
MQSELQTALFQAFDTLNLQRVKTFSVPPVTLCGPGAVSSCGQQAQTRGLK
HLFVMADSFLHQAGMTAGLTRSLAVKGIAMTLWPCPVGEPCITDVCAAVA
QLRESGCDGVIAFGGGSVLDAAKAVALLVTNPDSTLAEMSETSVLQPRLP
LIAIPTTAGTGSETTNVTVIIDAVSGRKQVLAHASLMPDVAILDAALTEG
VPSHVTAMTGIDALTHAIEAYSALNATPFTDSLAIGAIAMIGKSLPKAVG
YGHDLVARESMLLASCMAGMAFSSAGLGLCHAMAHQPGAALHIPHGLANA
MLLPTVMEFNRMVCRERFSQVGRALRTKKSDDRDAINAVSELIAEVGIGK
RMGDVGATSAHYGAWAQAALEDICLRSNPRTASLEQIVGLYAAAQ
>ECP_2504 exopolyphosphatase
MPIHDKSPRPQEFAAVDLGSNSFHMVIARVVDGAMQIIGRLKQRVHLADG
LGPDNMLSEEAMTRGLNCLSLFAERLQGFSPASVCIVGTHTLRQALNATD
FLKRAEKVIPYPIEIISGNEEARLIFMGVEHTQPEKGRKLVIDIGGGSTE
LVIGENFEPILVESRRMGCVSFAQLYFHGGVINKENFQRARMAAAQKLET
LTWQFRIQGWNVAMGASGTIKAAHEVLMEMGEKDGIITPERLEKLVKEVL
RHRNFASLSLPGLSEERKTVFVPGLAILCGVFDALAIRELRLSDGALREG
VLYEMEGRFRHQDVRSRTASSLANQYHIDSEQARRVLDTTMQMYEQWREQ
QPKLAHPQLEALLRWAAMLHEVGLNINHSGLHRHSAYILQNSDLPGFNQE
QQLMMATLVRYHRKAIKLDDLPRFTLFKKKQFLPLIQLLRLGVLLNNQRQ
ATTTPPTLTLITDDSHWTLRFPHDWFSQNALVLLDLEKEQEYWEGVAGWR
LKIEEESTPEIAA
>ECP_2668 putative phosphosugar binding protein
MSEALLNAGRQTLMLELQEASHLPERLGDDFVRAANIILHCEGKVVVSGI
GKSGHIGKKIAATLASTGTPAFFVHPAEALHGDLGMIESRDVMLFISYSG
GAKELDLIIPRLEDKSIALLAMTGKPTSPLGLAAKAVLDISVEREACPMH
LAPTSSTVNTLMMGDALAMAVMQARGFNEEDFARSHPAGALGARLLNKVH
HLMRRDDAIPQVALTASVMDAMLELSRTGLGLVAVCDDQRLVKGVFTDGD
LRRWLVGGGALTTPVNEAMTTGGTTLQAQSRAIDAKEILMKRKITAAPVV
DENGKLTGAINLQDFYQAGII
>ECP_0894 macrolide-specific ABC-type efflux carrier
MTPLLELKDIRRSYPAGDEQVEVLKGISLDIYAGEMVAIVGASGSGKSTL
MNILGCLDKATSGTYRVAGQDVATLDADALAQLRREHFGFIFQRYHLLSH
LTAEQNVEVPAVYAGLERKQRLLRAQELLQRLGLEDRTEYYPAQLSGGQQ
QRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILHQLRDRGHTVIIV
THDPQVAAQAERVIEIRDGEIVRNPPAVEKVNATGGTEPVVNTASGWRQF
VSGFNEALTMAWRALAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMV
LADIRSIGTNTIDVYPGNDFGDDDPQYQQALKYDDLIAIQKQPWVASATP
AVSQNLRLRYNNVDVAASANGVSGDYFNVYGMTFSEGNTFNQEQLNGRAQ
VVVLDSNTRRQLFPHKADVVGEVILVGNMPARVIGVAEEKQSMFGSSKVL
RVWLPYSTMSGRVMGQSWLNSITVRVKEGFDSAEAEQQLTRLLSLRHGKK
DFFTWNMDGVLKTVEKTTRTLQLFLTLVAVISLVVGGIGVMNIMLVSVTE
RTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGITLSLLIAFTLQL
FLPGWEIGFSPLALLLAFLCSTVTGILFGWLPARNAARLDPVDALARE
>ECP_3691 putative iron-containing alcohol dehydrogenase
MAASTFFIPSVNVIGADSLNDAMNMMADYGFTRTLIVTDNMLTKLGMAGD
VQKALEERNIFSVIYDGTQPNPTTENVAAGLKLLKENNCDSVISLGGGSP
HDCAKGIALVAANGGDIRDYEGVDRSAKPQLPMIAINTTAGTASEMTRFC
IITDEARHIKMAIVDKHVTPLLSVNDSSLMVGMPKSLTAATGMDALTHAI
EAYVSIAATPITDACALKAVTMIAENLPLAVEDGSNAKAREAMAYAQFLA
GMAFNNASLGYVHAMAHQLGGFYNLPHGVCNAVLLPHVQVFNSKVAAARL
RDCAAAMGVNVTGKNDAEGAEACINAIRELAKKVDIPAGLRDLNVKEEDF
AVLATNALKDACGFTNPIQATHEEIVAIYRAAM
>ECP_0240 Vgr-like protein
MSLKGLRFTLEVDGQEPDTFAVVNFRLIQNQSYPFVMSVDVASDSFMQTA
EMLLEKKATLTIWQGVIPQRYVTGVVAGFGMQENNGWQMRYHLRIEPPLW
RCGLRQNFRIFQQQDIRTISATLLNENGVTEWTPLFYEDHPAREFCVQYG
ESDLAFLARLWAEEGIFFFERFAADSPEQKLTLCDDVAGLSQAGEFPFNP
DASTGAETECVSMFRYEAHVRPSSVQSQDYTFKVPDWPGMYEQQGESLNG
QLEQYEIFDYPGRFKDEQHGKDFTLYQMESLRSDAEKATGQSNSPKLWPG
TRFTLTGHPQKMLNREWQVVQSILSGDQPQALHGSQGRGTTLGNQLEVIP
ADRTWRPRLQSKPKVDGPQSAIVTGPAGEEIFCDEHGRVRVKFHWDRYNP
ATEASSCWVRVSQAWAGPGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRT
YHEDNRSPGSLPGTKTQMTIRSKTYKGSGFNELRFEDATGGEQVYIHAQK
NMDTEVLNNRTTDVKADHTETIGNDQKITVGLGQTVNVGSKKEGGHDQKV
TVANDQHLTIKNDRHKVVNNNQTSKVTGTDTEEVVKKQSIKIGDNYELKV
EHGTNIISGDSIELICGQGESGTCSIKLEKTGKIIIRGTEFLFEATGPVD
IKGKDIHLNG
>ECP_2627 hypothetical transcriptional regulator YgaE
MTITSLDGYRWLKNDIIRGNFQPDEKLRMSLLTSRYALGVGPLREALSQL
VAERLVTVVNQKGYRVASMSEQELLDIFDARANMEAMLVSLAIARGGDEW
EADVLAKAHLLSKLEACDASEKMLDEWDLRHQAFHTAIVAGCGSYYLLQM
RERLFDLAARYRFIWLRRTVLSVEMLEDKHDQHQTLTATVLARDTARASE
LMRQHLLTPIPIIQQAMAGN
>ECP_0819 probable tonB-dependent receptor YbiL precursor
MENNRNFPARQFHSLTFFAGLCIGITPVAQALAAEGQANADDTLVVEAST
PSLYAPQQSADPKFSRPVADTTRTMTVISEQVIKDQGATNLTDALKNVPG
VGAFFAGENGNSTTGDAIYMRGADTSNSIYIDGIRDIGSVSRDTFNTEQV
EVIKGPSGTDYGRSAPTGSINMISKQPRNDSGIDASASIGSAWFRRGTLD
VNQVIGDTTAVRLNVMGEKTHDAGRDKVKNERYGVAPSVAFGLGTANRLY
LNYLHVTQHNTPDGGIPTIGLPGYSAPSAGTAALNHSGKVDTHNFYGTDS
DYDDSTTDTATMRFEHDINDNTTIRNTTRWSRVKQDYLMTAIMGGASNIT
QPTSDVNSWTWSRTANTKDVSNKILTNQTNLTSTFYTGAIGHDVSTGVEF
TRETQTNYGVNPVTLPAVNIYHPDSSIHPGGLTRNGANANGQTDTFAIYA
FDTLQITRDFELNGGIRLDNYHTEYDSATAYGGSGRGAITCPAGVAKGSP
VTTVDTAKSGNLVNWKAGALYHLTENGNVYINYAVSQQPPGGNNFALAQS
GSGNSANRTDFKPQKANTSEIGTKWQVLDKRLLLTAALFRTDIENEVEQN
DDGTYSQYGKKRVEGYEISVAGNITPAWQMIGGYTQQKATIKNGKDVAQD
GSSSLPYTPEHAFTLWSQYQATDDISVGAGARYIGSMHKGSDGAVGTPAF
TEDYWVADAKLGYRVNRNLDFQLNVYNLFDTDYVASINKSGYRYHPGEPR
TFLLTANMHF
>ECP_2626 GabA permease
MGQSSQPHELGGGLKSRHVTMLSIAGVIGASLFVGSSVAIAEAGPAVLLA
YLFAGLLVVMIMRMLAEMAVATPDTGSFSTYADKAIGRWAGYTIGWLYWW
FWVLVIPLEANIAAMILHSWVPGIPIWLFSLVITLALTGSNLLSVKNYGE
FEFWLALCKVIAILAFIFLGAVAISGFYPYADVSGISRLWDSGGFMPNGF
GAVLSAMLITMFSFMGAEIVTIAAAESDTPEKHIVRATNSVIWRISIFYL
CSIFVVVALIPWNMPGLKAVGSYRSVLELLNIPHAKLIMDCVILLSVTSC
LNSALYTASRMLYSLSRRGDAPAVMGKINRSKTPYVAVLLSTGAAFLTVV
VNYYAPAKVFKFLIDSSGAIALLVYLVIAVSQLRMRKILRAEGSEIRLRM
WLYPWLTWLVIGFITFVLVVMLFRPAQQLEVLSTGLLAIGIICTVPIMAR
WKKLIMWQKTPIHNTR
>ECP_0974 putative acylphosphatase
MSKVCIIAWVYGRVQGVGFRYTTQYEAKKLGLTGYAKNLDDGSVEVVACG
DEGQVEKLMQWLKSGGPRSARVERVLSEPHHPSGELTDFRIR
>ECP_1574 electron transport complex protein RnfC
MLKLFSAFRKNKIWDFNGGIHPPEMKTQSNGTPLRQVPLAQRFVIPLKQH
IGAEGELCVSVGDNVLRGQPLTRGRGKMLPVHAPTSGTVTAIAPHSTAHP
SALAELSVIIDADGEDCWIPRDGWADYRSRSREELIERIHQFGVAGLGGA
GFPTGVKLQGGGDKIETLIINAAECEPYITADDRLMQDCAAQVVEGIRIL
AHILQPREILIGIEDNKPQAISMLRAVLADSHDISLRVIPTKYPSGGAKQ
LTYILTGKQVPHGGRSSDIGVLMQNVGTAYAVKRAVIDGEPITERVVTLT
GEAIARPGNVWARLGTPVRHLLNDAGFCPSADQMVIMGGPLMGFTLPWLD
VPVVKITNCLLAPSANELGEPQEEQSCIRCSACADACPADLLPQQLYWFS
KGQQHDKATTHNIADCIECGACAWVCPSNIPLVQYFRQEKAEIAAIRQEE
KRAAEAKARFEARQARLEREKAARLERHKSAAVQPAAKDKDAIAAALARV
KEKQAQATQPIVIKAGERPDNSAIIAAREARKAQARAKQAELQQTNDAAT
VADPRKTAVEAAIARAKARKLEQQQANAEPEQQVDPRKAAVEAAIARAKA
RKLEQQQANAEPEEQVDPRKAAVEAAIARAKARKLEQQQANAEPEEQIDP
RKAAVEAAIARAKARKLEQQQQANAEPEEQVDPRKAAVEAAIARAKARKL
EQQQQANAEPEEQVDPRKAAVEAAIARAKARKLEQQQANAEPEEQIDPRK
AAVAAAIARAQAKKAAQQKVVNED
>ECP_3730 lipopolysaccharide core biosynthesis glycosyl transferase WaaQ
MRFHGDMLLTTPVISSLKKNYPDAKIDVLLYQDTIPILSENPEINALYGI
KNKKAKASEKIANFFHLIKVLRANKYDLIVNLTDQWMIAILVRLLNARVK
ISQDYHHRQSAFWRNSFTHLVPLQGGNVVESNLSVLTPLGLESLVKQTTM
SYPPASWKRMRRELDHAGVGQNYVVIQPTARQIFKCWDNAKFSAVIDALH
ARGYEVVLTSGPDKDDLACVNEIAQGCQTPPVTALAGKVTFPELGALIDH
AQLFIGVDSAPAHIAAAVNTPLISLFGATDHIFWRPWSNNMIQFWAGDYR
EMPTRDQRDRNEMYLSVIPAADVIAAVDKLLPSSTTGTSL
>ECP_2849 AAS bifunctional protein
MLFSFFRNLCRVLYRVRVTGDTKALKGERVLITPNHVSFIDGILLALFLP
VRPVFAVYTSISQQWYMRWLKSFIDFVPLDPTQPMAIKHLVRLVEQGRPV
VIFPEGRITTTGSLMKIYDGAGFVAAKSGATVIPVRIEGAELTHFSRLKG
LVKRRLFPQITLHILPPTQVEMPDAPRARDRRKIAGEMLHQIMMEARMAV
RPRETLYESLLSAMYRFGAGKKCVEDVNFTPDSYRKLLTKTLFVGRILEK
YSVEGERIGLMLPNAGISAAVIFGAIARRRIPAMMNYTAGVKGLTSAITA
AEIKTIFTSRQFLDKGKLWHLPEQLTQVRWVYLEDLKADVTTADKVWIFA
HLLMPRLAQVKQQPEEEALILFTSGSEGHPKGVVHSHKSILANVEQIKTI
ADFTTNDRFMSALPLFHSFGLTVGLFTPLLTGAEVFLYPSPLHYRIVPEL
VYDRSCTVLFGTSTFLGHYARFANPYDFYRLRYVVAGAEKLQESTKQLWQ
DKFGLRILEGYGVTECAPVVSINVPMAAKPGTVGRILPGMDARLLSVPGI
EEGGRLQLKGPNIMNGYLRVEKPGVLEVPTAENVRGEMERDWYDTGDIVR
FDEQGFVQIQGRAKRFAKIAGEMVSLEMVEQLALGVSPDKVHATAIKSDA
SKGEALVLFTTDNELMRDKLQQYAREHGVPELAVPRDIRYLKQMPLLGSG
KPDFVTLKSWVDEAEQHDE
>ECP_3849 putative transposase
MIKTRWTKRTFSPEFKLEAIEQVVKYPRDVREVALALELNPDHLRKWIRL
YKQEFQGIESAGNAITPEQREIQQLKAQIKRVEMAKEILKQAAVLMSEIP
GKLPR
>ECP_2284 anaerobic glycerol-3-phosphate dehydrogenase subunit B
MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSH
LPDGQPVTEIHSGLESLRQQAPAHPYTLLGPQRVLDLACQAQALIAESGA
QLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDF
QAHLAAASLRELDLKVETAEIELPELDVLRNNATEFRAVNIARFLDNEEN
WPLLLDALIPVANTCEMILMPACFGLANDKLWHWLNEKLPCSLMLLPTLP
PSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADI
PLRPRFAVLASGSFFSGGLVAERDGIREPILGLDVLQTATRGEWYKGDFF
APQPWQQFGVTTDEALRPSQAGQTIENLFAIGSVLGGFDPIAQ
>ECP_1781 hypothetical protein YebW
MFALVLFVCYLDGGCEDIVVDVYNTEQQCLYSMSDQRIRHGGCFPIEDFI
DGFWRPAQEYGDF
>ECP_3024 capsule polysaccharide export protein KpsC
MIGIFSSGIWRIPHLEKFLAQPCQKLSLLRPVPQEVDAIAVWGHRPSAAK
PVAIAKAAGKPVIRLEDGFVRSLDLGVNGEPPLSLVVDDCGIYYDASKPS
ALEKLVQDKAGNTALISLAREAMHTIVTGDLSKYNLAPAFVADESERSDI
VLVVDQTFNDMSVTYGNAGPHEFAAMLEAAMAENPQAEIWVKVHPDVLEG
KKTGYFADLRATQRVRLIAENVSPQSLLRHVSRVYVVTSQYGFEALLAGK
PVTCFGQPWYAGWGLTDDRHPQSALLSARRGSATLEELFAAAYLRYCRYI
DPQTGAVSDLFTVLQWLQLQRNHQQQRNGYLWAPGLTLWKSAILKPFLQT
ATNRLSFSRRCTAASACVVWGVKGEQQWRAEAQRKSLPLWRMEDGFLRSS
GLGSDLLPPLSLVLDKRGIYYDATRPSDLEVLLNHSQLTLAQQMRAEKLR
QRVKLQMCDVEYLFEIEMNLMKTLLLYINIYP
>ECP_2487 hypothetical protein YpfH
MKHDHFVVQSPDKPAQQLLLLFHGVGDNPVAMGEIGSWFAPLFPDALVVS
VGGAEPSGNPAGRQWFSVQGITEDNRQARVNAIMPTFIETVRYWQKQSGV
GANATALIGFSQGAIMALESIKAEPGLASRVIAFNGRYASLPETASTATT
IHLIHGGEDPVIDLAHAVAAQEALISAGGDVTLDIVEDLGHAIDNRSMQL
ALDHLRYTIPKHYFDEALSGGKPGDDDVIEMM
>ECP_3304 hypothetical protein YhcH
MIMGEVQSLPSAGLHPALQDALTLALAARPQEKAPGRYELQGDNIFMNVM
TFNTQSPVEKKAELHEQYIDIQLLLNGEERILFGTAGTARECEEFHHEDD
YQLCSAIENEQAIILKPGMFAVFMPGEPHKPGCVVGEPDEIKKVVVKVKA
DLMA
>ECP_2391 D-serine dehydratase
MENAKMNSLIAQYPLVKDLVALQETTWFNPGTTSLAEGLPYVGLTEQDVQ
DAHARLSRFAPYLAKAFPETAATGGIIESELVAIPAMQKRLEKEYHQPIA
GQLLLKKDSHLPISGSIKARGGIYEVLAHAEKLALEAGLLTLEDDYSKLL
SPEFKQFFSQYSIAVGSTGNLGLSIGIMSARIGFKVTVHMSADARAWKKA
KLRSHGVTVVEYEQDYGVAVEEGRKAAQSDPNCFFIDDENSRTLFLGYSV
AGQRLKAQFAQQGRIVNADNPLFVYLPCGVGGGPGGVAFGLKLAFGDHVH
CFFAEPTHSPCMLLGVHTGLHDQISVQDIGIDNLTAADGLAVGRASGFVG
RAMERLLDGFYTLSDQTMYDMLGWLAQEEGIRLEPSALAGMAGPQRVCAS
VSYQQLHGFSAEQLRNATHLVWATGGGMVPEEEMNQYLAKGR
>ECP_2716 hypothetical aldolase class II protein YgbL
MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTG
SCLGNLDPQRLSKVTADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLH
STWSTALSCLQGLDSSNVIRPFTPYVVMRMGNIPLVPYYRPGDKRIAQDL
AELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIR
YLTAGEIAELRS
>ECP_3362 hypothetical protein
MRQNVAHVSFSHHCGALFLLFPVYTQLIATRNRAQKLALLLLIRGWPQTG
KRRFPNVL
>ECP_2471 ethanolamine utilization cobalamin adenosyltransferase
MKDFITEAWLRANHTLSEGAEIHLPADSRLTPSARELLESRHLRIKFIDE
QGRLFVDDEQQQPQPVHGLTSSDEHPQACCELCRQPVAKKPDTLTHLSAE
KMVAKSDPRLAFRAVLDSTIALAVWLQIELAEPWQPWLADIRSRLGNIMR
ADALGEPLGDQAIVGLSDEDLHRLSHQPLRYLDHDHLVPEASHGCDAALL
NLLRTKVRETETVAAQVFITRSFEVLRPDILQALNRLSSTVYVMMILSVT
KQPLTVKQIQQRLGETQ
>ECP_0007 putative cytoplasmic protein YaaA
MLILISPAKTLDYQSPLTTTRYTLPELLDNAQQLIHEARKLTPPQISSLM
RISDKLAGINAARFHDWQPNFTPENARQAILAFKGDVYTGLQAETFSEDD
FDFAQQHLRMLSGLYGVLRPLDLMQPYRLEMGIRLENARGKDLYQFWGDI
ITNKLNEALAAQGDNVVINLASDEYFKSVKPKKLNAEIIKPVFLDEKNGK
FKIISFYAKKARGLMSRFIIENRLTKPEQLTGFNSEGYFFDEASSSNGEL
VFKRYEQR
>ECP_4377 transcriptional activator CadC
MQQPVVRVGEWLVTPSINQISRNGRQLTLEPRLIDLLVFFAQHSGEVLSR
DELIDNVWKRSIVTNHVVTQSISELRRSLKDNDEDSPVYIATVPKRGYKL
MVPVIWYSEEEGEEIMLSSPPPIPEAVPATDSPSHSLNIQNTTTPPEQSP
VKSKRFTTFWVWFFFLLSLGICVALVAFSSLETRLPMSKSRILLNPRDID
INMVNKSCNSWSSPYQLSYAIGVGDLVATSLNTFSTFMVHDKINYNIDEP
SSSGKTLSIAFVNQRQYRAQQCFMSVKLVDNADGSTMLDKRYVITNGNQL
AIQNDLLQSLSKALNQPWPQRMQEMLQQILPHRGALLTNFYQAHDYLLHG
DDKSLDRASELLGEIVQSSPEFTYARAEKALVDIVRHSQHPLDEKQLAAL
NTEIDNIVTLPELNNLSIIYQIKAVSALVKGKTDESYQAINTGIDLEMSW
LNYVLLGKVYEMKGMNREAADAYLTAFNLRPGANTLYWIENGIFQTSVPY
VVPYLDKFLASE
>ECP_2879 putative electron transport protein YgfS
MKSLIIVNPADCIGCRTCEVACVVAHPSEQELNADIFLPRLKVQRLDSIS
APVMCHQCENAPCVGACPVGALTMGEQVVQANSARCIGCQSCVSACPFGM
ITIQSLPGDTRQQIVKCDLCEQREEGPACVESCPTQALQLLTERELRRVR
QQRIVASGENPL
>ECP_2524 penicillin-binding protein 1C
MPRLLTKRGCWIMLAAAPFIIILAAWAADKLWPLPLQEVNPARVVVAQDG
TPLWRFADADGIWRYPVTIEDVSPRYLEALINYEDRWFWKHPGVNPFSVA
RAAWQDLTSGRVISGGSTLTMQVARLLDPHPKTFGGKIRQLWRALQLEWH
LSKREILTLYLNRAPFGGTLQGIGAASWAYLGKSPANLSYSEAAMLAVLP
QAPSRLRPDRWPERAEAARNKVLERMAVQGVWSHERVKESREEPIWLAPR
QMPQLAPLFSRMMLGKSKSDKIVTTLDAGLQRRLEELAQNWKGRLPPRSS
LAMIVVDHTDMRVRGWVGSVDLNDDSRFGHVDMVNAIRSPGSVLKPFVYG
LALDEGLIHPASLLQDVPRRTGDYRPGNFDSGFHGPISMSEALVRSLNLP
AVQVLEAYGPKRFAAKLRNVGLPLYLPNGAAPNLSLILGGAGAKLEDMAA
AYTAFARHGKAGKLRLQPDDPLLERPLMSSGAAWIIRRIMADEAQPLPDG
ALPRVAPLAWKTGTSYGYRDAWAIGVNARYVIGIWTGRPDGTPVVGQFGF
ASAIPLLNQVNNILLSRSVNFPEDPRPDSVSRGVICWPGGQSLPEGDGNC
RRRLATWLLDGSQPPTLLLPEQEGINGIRFPIWLDENGKRVAADCPQARQ
EMINVWPLPLEPWLPASERRAVRLPPASTICPPYGHDAQLPLQLTGVRDG
AIIKRLPGAAAATLPLQSSGGAGERWWFLNGEPLTERGRNVTLHLTDKGD
YQLLVMDDVGQIATVKFVMQ
>ECP_3299 hypothetical protein YhcC
MQLQKLVNMFGGDLTRRYGQKVHKLTLHGGFSCPNRDGTIGRGGCTFCNV
ASFADEAQQHRSIAEQLAHQANLVNRAKRYLAYFQAYTSTFAEVQVLRSM
YQQAVSQANIVGLCVGTRPDCVPDAVLDLLCEYKDQGYEVWLELGLQTAH
DKTLHRINRGHDFACYQRTTQLARERGLKVCSHLIVGLPGEGQAECLQTL
ERVVETGVDGIKLHPLHIVKGSIMAKAWEAGRLNGIELEDYTLTAGEMIR
HTPPEVIYHRISASARRPTLLAPLWCENRWTGMVGLDRYLNEHGVQGSAL
GRPWLPPTA
>ECP_1485 hypothetical protein YddU (hypothetical sensor protein)
MKLTDADNAADGIFFPALEQNMMGAVLINENDEVMFFNPAAEKLWGYKRE
EVIGNNIDMLIPRDLRPAHPEYIRHNREGGKARVEGMSRELQLEKKDGSK
IWTRFALSKVSAEGKVYYLALVRDASVEMAQKEQTRQLIIAVDHLDRPVI
VLDPERHIVQCNRAFTEMFGYCINEASGMQPDTLLNIPEFPADNRIRLQQ
LLWKTARDQDEFLLLTRTGEKIWIKASISPVYDVLAHLQNLVMTFSDITE
ERQIRQLEGNILAAMCSSPPFHEMGEIICRNIESVLNESHVSLFALRNGM
PIHWASSSHGAEVQNAQSWSATIRQRDGAPAGILQIKTSSGAETSAFIER
VADISQHMAALALEQEKSRQHIEQLIQFDPMTGLPNRNNLHNYLDDLVDK
AVSPVVYLIGVDHIQDVIDSLGYAWADQALLEVVNRFREKLKPDQYLCRI
EGASFVLVSLENDVSNITQIADELRNVVSKPIMIDDKPFPLTLSIGISYD
VGKNRDYLLSTAHNAMDFIRKNGGNGWQFFSPAMNEMVKERLVLGAALKE
AISNNQLKLVYQPQIFAETGELYGIEALARWYDPLHGHVPPSRFIPLAEE
IGEIENIGRWVIAEACRQLAEWRSQNIHIPALSVNLSALHFRSNQLPNQV
SDAMHAWGIDGHQLTVEITESMMMEHDTEIFKRIQILRDMGIGLSVDDFG
TGFSGLSRLVSLPVTEIKIDKSFVDRCLTEKRILALLEAITSIGQSLNLT
VVAEGIETKEQFEMLRKIHCRVIQGYFFSRPLPAEEIPGWMSSVLPLKI
>ECP_1417 acyl carrier protein phosphodiesterase
MSKVLVLKSSILAGYSQSNQLSDYFVEQWREKHSADEITVRDLAANPIPV
LDGELVGALRPSDAPLTPRQQEALALSDELIAELKAHDVIVIAAPMYNFN
ISTQLKNYFDLVARAGVTFRYTEKGPEGLVTGKKAIVITSRGGIHKDGPT
DLVTPYLSTFLGFIGITDVKFVFAEGIAYGPEMAAKAQSDAKAAIDSIVA
E
>ECP_1403 probable pyruvate-flavodoxin oxidoreductase
MITIDGNGAVASVAFRTSEVIAIYPITPSSTMAEQADAWAGNGLKNVWGD
TPRVVEMQSEAGAIATVHGALQTGALSTSFTSSQGLLLMIPTLYKLAGEL
TPFVLHVAARTVATHALSIFGDHSDVMAVRQTGCAMLCAANVQEAQDFAL
ISHIATLKSRVPFIHFFDGFRTSHEINKIVPLADDTILELMPQAEIDAHR
ARALNPEHPVIRGTSANPDTYFQSREATNPWYNAVYDHVEQAMNDFAAAT
GRQYQPFEYYGHPQAERVIILMGSAIGTCEEVVDELLTRGEKVGVLKVRL
YRPFSAKHLLQALPDSVRTVAVLDRTKEPGAQAEPLYLDVMTALAEAFNN
GERETLPRVIGGRYGLSSKEFGPDCVLAVFAELNAAKPKARFTVGIYDDV
TNLSLPLPENTLPNSAKLEALFYGLGSDGSVSATKNNIKIIGNSTPWYAQ
GYFVYDSKKAGGLTVSHLRVSEQPIRSAYLISQADFVGCHQLQFIDKYQM
AERLKPGGIFLLNTPYSAAEVWSRLPQEVQAVLNQKKARFYVINAAKIAR
ECGLAARINTVMQMAFFHLTQILPGDSALAELQGAIAKSYSSKGQDLVER
NWQALALARESVEEVPLQPVNPHSANRPPVVSDAAPDFVKTVTAAMLAGL
GDALPVSALPPDGTWPMGTTRWEKRNIAEEIPIWKEELCTQCNHCVAACP
HSAIRAKVVPPEAMENAPASLHSLDVKSRDMRGQKYVLQVAPEDCTGCNL
CVEVCPAKDRQNPEIKAINMMSRLEHVEEEKINYDFFLNLPEIDRSKLER
IDIRTSQLITPLFEYSGACSGCGETPYIKLLTQLYGDRMLIANATGCSSI
YGGNLPSTPYTTDANGRGPAWANSLFEDNAEFGLGFRLTVDQHRVRVLRL
LNQFADKIPTELLTALKSDATPEVRREQVAALRQQLNDVAEAHELLRDAD
ALVEKSIWLIGGDGWAYDIGFGGLDHVLSLTENVNILVLDTQCYSNTGGQ
ASKATPLGAVTKFGEHGKRKARKDLGVSMMMYGHVYVAQISLGAQLNQTV
KAIQEAEAYPGPSLIIAYSPCEEHGYDLALSHDQMRQLTATGFWPLYRFD
PRRADEGKLPLALDSRPPSEALEETLLHEQRFRRLNSQQPEVAEQLWKDA
AADLQKRYDFLAQMAGKAEKSNTD
>ECP_0217 membrane-bound lytic murein transglycosylase D precursor
MKAKAILLASVLLVGCQSTANVQQHAQSLSAAGQGEAAKFTSQARWMDDG
TSIAPDGDLWAFIGDELKMGIPENDRIREQKQKYLRNKSYLHDVTLRAEP
YMYWIAGQVKKRNMPMELVLLPIVESAFDPHATSGANAAGIWQIIPSTGR
NYGLKQTRNYDARRDVVASTTAALNMMQRLNKMFDGDWLLTVAAYNSGEG
RVMKAIKTNKARGKSTDFWSLPLPQETKQYVPKMLALSDILKNSKRYGVR
LPTTDESRALARVHLSSPVEMAKVADMAGISVSKLKTFNAGVKGSTLGAS
GPQYVMVPKKHADQLRESLASGEIVAVQSTLVADNTPLNSRVYTVRSGDT
LSSIASRLGVSTKDLQQWNKLRGSKLKPGQSLTIGAGSSAQRLANNSDSI
TYRVRKGDSLSSIAKRHGVNIKDVMRWNSDTANLQPGDKLTLFVKNNNMP
DS
>ECP_0099 secretion monitor precursor protein SecM
MSGILTRWRQFGKRYFWPHLLLGMVAASLGLPALSNAAEPNAPAKATTRN
HEPSAKVNFGQLALLEANTRRPNSNYSVDYWHQHAIRTVIRHLSFAMAPQ
TLPVAEESLPLQAQHLALLDTLSALLTQEGTPSEKGYRIDYAHFTPQAKF
STPVWISQAQGIRAGPQRLS
>ECP_3163 probable aminotransferase
MITTFVFIPIFAIAAGVAQSLQYLNRYHVIREPPEHILNRLPSSASALAY
SAHALNLIEKRTLDHEEMKALNREVIEYFKEHVNPGFLEYRKSVTAGGDY
GAVEWQAGGLNTLVDTQGQEFIDCLGGFGIFNVGHRNPVVVSAVQNQLAK
QPLHSQELLDPLRAMLAKTLAALTPGKLKYSFFCNSGTESVEAALKLAKA
YQSPRGKFTFIATSGAFHGKSLGALSATAKSTFRKPFMPLLPGFRHVPFG
NIEAMRTALSECKKTGDDVAAVILEPIQGEGGVILPPPGYLTAVRKLCDE
FGALMILDEVQTGMGRTGKMFACEHENVQPDILCLAKALGGGVMPIGATI
ATEEVFSVLFDNPFLHTTTFGGNPLACAAALATINVLLEQNLPAQAEQKG
DMLLDGFRQLAREYPDLVQEARGKGMLMAIEFVDNEIGYNFASEMFRQRV
LVAGTLNNAKTIRIEPPLTLTIEQCEQVIKAARKALAAMRVSVEEA
>ECP_2521 hypothetical protein YfgA (putative DNA-binding protein)
MNTEATHDQNEALTTGARLRNAREQLGLSQQAVAERLCLKVSTVRDIEED
KAPADLASTFLRGYIRSYARLVHIPEEELQPGLEKQAPLRAAKVAPMQSF
SLGKRRKKRDGWLMTFTWLVLFVVIGLSGAWWWQDHKAQQEEITTMADQS
SAELNNNQSQSVPLDTSTTTDQAMATTPTSPVDTTATNTQTPAVTAPAPA
VDPQQNAVVPPSQANVDTAATPAPAATTTPDGAAPLPTDQAGVTTPAVDP
NALVMNFTADCWLEVTDATGKKLFSGMQRKDGNLNLTGQAPYKLKIGAPA
AVQIQYQGKPVDLSRFIRTNQVARLTLNAEQSPAQ
>ECP_0249 possible hydrolase
MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAM
EAAASSLAQDDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGG
TVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWS
RNLNDYDLALYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNG
CHYRGDSRVINPQGEIIATADAHQATRIDAELSMMALREYREKFPAWRDA
DEFRLW
>ECP_1298 putative potassium channel protein
MSHWTTFKQTATKLWVTLRHDILALAVFLNGLLIFKTIYGMSVNLLDIFH
IKAFSELDLSLLANAPLFMLGVFLVLNSIGLLFRAKLAWAISIILLLIAL
IYTLHFYPWLKFSIGFCIFTLVFLLILRKDFSHSSAAAGTIFAFISFTTL
LFYSTYGALYLSEGFNPRIESLMTAFYFSIETMSTVGYGDIVPVSESARL
FTISVIISGITVFATSMTSIFGPLIRGGFNKLVKGNNHTMHRKDHFIVCG
HSILAINTILQLNQRGQNVTVISNLPEDDIKQLEQRLGDNADVIPGDSND
SSVLKKAGIDRCRAILALSDNDADNAFVVLSAKDMSSDVKTVLAVSDSKN
LNKIKMVHPDIILSPQLFGSEILARVLNGEEINNDMLVSMLLNSGHGIFS
DNDEQETKADSKESAQK
>ECP_2120 hypothetical protein YegP
MAGWFELSKSSDNQFRFVLKAGNGETILTSELYTSKASAEKGIASVRSNS
PQEERYEKKTASNGKFYFNLKAANHQIIGSSQMYATAQSRETGIASVKAN
GTSQTVKDNT
>ECP_2526 3-mercaptopyruvate sulfurtransferase
MSTTWFVGADWLAEHIDDPEIQIIDARMASPGQEDRNVAQEYLNGHIPGA
VFFDIEALSDHTSPLPHMLPRPETFAVAMRELGVNQDKHLIVYDEGNLFS
APRAWWMLRTFGVEKVSILGGGLAGWQRDDLLLEEGAVELPEGEFNAAFN
PEAVVKVTDVLLASHENTAQIIDARPATRFNAEVDEPRPGLRRGHIPGAL
NVPWTELVREGELKTTDELDAIFFGRGVSYDKPIIVSCGSGVTAAVVLLA
LATLDVTNVKLYDGAWSEWGARADLPVEPVK
>ECP_1486 hypothetical protein
MVDFGLRFNHKGRHYFKAANGEDIALSLSIGAAMFNGHPDYERLIQIADE
ALYIAKRRGRNRVELCKASL
>ECP_2566 pyridoxal phosphate biosynthetic protein PdxJ
MAELLLGVNIDHIATLRNARGTAYPDPVQAAFIAEQAGADGITVHLREDR
RHITDRDVCILRQTLDTRMNLEMAVTEEMLAIAVETKPHFCCLVPEKRQE
VTTEGGLDVAGQREKMRDACKRLADAGIQVSLFIDADEEQIKAAAEVGAP
FIEIHTGCYADAKTDAEQAQELARIAKAATFATSLGLKVNAGHGLTYHNV
KAIAAIPEMHELNIGHAIIGRAVMTGLKDAVAEMKRLMLEARG
>ECP_0142 pantoate-beta-alanine ligase
MLIIETLPLLRQQIRRLRMEGKRVALVPTMGNLHDGHMKLVDEAKARADV
VVVSIFVNPMQFDRPEDLARYPRTLQEDCEKLNKRKVDLVFAPSVKEIYP
NGTETHTYVDVPGLSTMLEGASRPGHFRGVSTIVSKLFNLVQPDIACFGE
KDFQQLALIRKMVADMGFDIEIVGVPIMRAKDGLALSSRNGYLTAEQRKI
APGLYKVLSSIADKLQAGERDLDEIIAIAGQELNEKGFRSDDIQIRDADT
LLEISENSKRAVILVAAWLGDARLIDNKLVELA
>ECP_1514 multiple antibiotic resistance protein MarA
MSRRNTDAITIHSILDWIEDNLESPLSLEKVSERSGYSKWHLQRMFKKET
GHSLGQYIRSRKMTEIAQKLKESNEPILYLAERYGFESQQTLTRTFKNYF
DVPPHKYRMTNMQGESRFLHPLNHYNN
>ECP_4300 putative membrane protein YjcH
MNGTIYQRIEDNAHFRELVEKRQRFATILSIIMLAVYIGFILLIAFAPGW
LGTPLNPNTSVTRGIPVGVGVIVISFVLTGIYIWRANGEFDRLNNKVLHE
VQAS
>ECP_2262 sensor protein AtoS
MHYMKWIYPRRLRNQMILMAILMVIVPTLTIGYIVETEGRSAVLSEKEKK
LSAVVNLLNQALGDRYDLYIDLPREERIRALNAELAPITENITHAFPGIG
AGYYNKTLDAIITYAPSALYQNNVGVTIAADHPGREVMRTNTPLVYSGRQ
VRGDILNSMIPIERNGEILGYIWANELTEDIRRQAWKMDVRIIIVLTAGL
LISLLLIVLFSRRLSANIDIITDGLSTLAQNIPTRLPQLPGEMGQISQSV
NNLAQALRETRTLNDLIIENAADGVIAIDRQGDVTTMNPAAEVITGYQRH
ELVGQPYSMLFDNTQFYSPVLDTLEHGTEHVALEISFPGRDRTIELSVTT
SRIHNTHGEMIGALVIFSDLTARKETQRRMAQAERLATLGELMAGVAHEV
RNPLTAIRGYVQILRQQTRDPIHQEYLSVVLKEIDSINKVIQQLLEFSRP
RHSQWQQVSLNALVEETLVLVQTAGVQARVDFISELDNELSPINADRELL
KQVLLNILINAVQAISARGKIRIRTWQYSDSQQAISIEDNGSGIDLSLQK
KIFDPFFTTKASGTGLGLALSQRIINAHQGDIRVASLPGYGATFTLILPI
NPQGNQTV
>ECP_3160 hypothetical protein YqjH (hypothetical siderophore-interacting protein)
MNNSPRYPQRVRNDLRFRELTVLRVERISAGFQRIVLGGEALDGFTSRGF
DDHSKLFFPQPDAHFVPPTVTEEGIVWPEGPRPPSRDYTPLYDELRHELA
IDFFIHDGGVASGWAMQAKPGDKLTVAGPRGSLVVPEDYAYQLYVCDESG
MPALRRRLETLSKLAVKPQVSALVSVRDNACQDYLAHLDGFNIEWLAHDE
QAVDARLAQMQIPADDYFIWITGEGKVVKNLSRRFEAEQYDPQRVRAAAY
WHAK
>ECP_3186 putative usher protein
MVLYVLIQDAVILNTETVEIIYNDEESSATLFLNPQWSSAFNSKSLYLNP
DKNTVNAFIHQQDINVLVQDDYQSLSIQGNGALGITENSYIGAHWNFDGY
DADDVSDNNVDVSDLYYRYDFLRRYYVQAGRMDNRTLFNAQGGNFTFNFL
PLGAIDGMRIGSTLSYLNQTQSQQGTPVMILLSRNSRVDAYRNEQLLGSF
YLNSGSQFIDTSSFPPGSYSVALKVYENNQLTRTELVPFTKTGGLTDGNA
QWFLQAGKTTSQVSDDESSAYQLGVRLPLHPQYELYAGLANADDVSAFEL
GNNWTADLGGAGNLAISASVFRNDDGGKGDMQQANWSHPGWPTLGFYRTN
SDGDACTTDNRESYNALSCYESISATVSQNFVGWNMMLGYTRTQNNTDDS
LRWDKQQSFENNYLRQTSAQSISETVQLSASRAFVMRDWILSTSLGVFHR
NDNGGDNDDNGLYLSFSLSDTPTMDSNNNSHSTNVSTDYRYSDQDGDQTS
WQLSHTFYNDSFSHKELGVTVGGLNTDTINSAVNGRWDGQYGNVYATVSD
SYDRQNHDHLSAFTGTYSSTLAVSRYGINVGASGSDDLLGAVLVDVKGFS
EQDEQSQGLQLEARVAGSRTLQLGQSDSVLFPYPGFQSGFVEVNDSNQGN
QQGTTNIINGAGNRELMLLPGKLRYREVSASFNYNYIGRLLLPASVEKFP
LVGLNSAMLLVAEDGGFTLEISSGEKELYLLSGQQFLKCPLNVLKKRASI
RYSGDVNCSVVSYSQLPESIQVQAQLKQPKLRGNVQTAQREVAP
>ECP_1271 nitrate/nitrite sensor protein NarX
MLKRCLSPLTLVNQVALIVLLSTAIGLAGMAVSGWLVQGVQGSAHAINKA
GSLRMQSYRLLAAVPLSEKDKPLIKEMEQTAFSAELTRAAERDGQLAQLQ
GLQDYWRNELIPALMRAQNRETVSADVSQFVAGLDQLVSGFDRTTEMRIE
TVVLVHRVMAVFMALLLVFTIIWLRARLLQPWRQLLAMASAVSHRDFTQR
ANISGRNEMAMLGTALNNMSAELAESYAVLEQRVQEKTAGLEHKNQILSF
LWQANRRLHSRAPLCERLSPVLNGLQNLTLLRDIELRVYDTDDEENHQEF
TCQPDMTCDDKGCQLCPRGILPVGDRGTTLKWRLADSHTQYGILLATLPQ
GRHLSHDQQQLVDTLVEQLTATLALDRHQERQQQLIVMEERATIARELHD
SIAQSLSCMKMQVSCLQMQGDALPESSRELLSQIRNELNASWAQLRELLT
TFRLQLTEPGLRPALEASCEEYSAKFGFPVKLDYQLPPRLVPSHQAIHLL
QIAREALSNALKHSQASEVVVTVAQNDNQVKLTVQDNGCGVPENAIRSNH
YGMIIMRDRAQSLRGDCRVRRRESGGTEVVVTFIPEKTFTDVQGDTHE
>ECP_0152 2-amino-4-hydroxy-6- hydroxymethyldihydropteridine pyrophosphokinase
MTVAYIAIGSNLASPLEQVNAALKALGDIPESRILAVSSFYRTPPLGPQD
QPDYLNAAVALETTLAPEELLNHTQRIELQQGRVRKAERWGPRTLDLDIM
LFGNEVINTERLTVPHYDMKNRGFMLWPLFEIAPELVFPDGLSLVEALQA
KGFNELDKW
>ECP_0024 Isoleucyl-tRNA synthetase
MSDYKSTLNLPETGFPMRGDLAKREPGMLARWTDDDLYGIIRAAKKGKKT
FILHDGPPYANGSIHIGHSVNKILKDIIIKSKGLSGYDSPYVPGWDCHGL
PIELKVEQEYGKPGEKFTAAEFRAKCREYAATQVDGQRKDFIRLGVLGDW
SHPYLTMDFKTEANIIRALGKIIGNGHLHKGAKPVHWCVDCRSALAEAEV
EYYDKTSPSIDVAFQAVDQDALKTKFGVSNVNGPISLVIWTTTPWTLPAN
RAISIAPDFDYALVQIDGQAVILAKDLVESVMQRIGVSDYTILGTVKGAE
LELLRFTHPFMDFDVPAILGDHVTLDAGTGAVHTAPGHGPDDYVIGQKYG
LETANPVGPDGTYLPGTYPTLDGVNVFKANDIVIALLQEKGALLHVEKMQ
HSYPCCWRHKTPIIFRATPQWFVSMDQKGLRAQSLKEIKGVQWIPDWGQA
RIESMVANRPDWCISRQRTWGVPMSLFVHKDTEELHPRTLELMEEVAKRV
EVDGIQAWWDLDAKEILGDEADQYVKVPDTLDVWFDSGSTHSSVVDVRPE
FAGHAADMYLEGSDQHRGWFMSSLMISTAMKGKAPYRQVLTHGFTVDGQG
RKMSKSIGNTVSPQDVMNKLGADILRLWVASTDYTGEMAVSDEILKRAAD
SYRRIRNTARFLLANLNGFDPAKDMVKPEEMVVLDRWAVGCAKAAQEDIL
KAYEAYDFHEVVQRLMRFCSVEMGSFYLDIIKDRQYTAKADSVARRSCQT
ALYHIAEALVRWMAPILSFTADEVWGYLPGEREKYVFTGEWYEGLFGLAD
SEAMNDAFWDELLKVRGEVNKVIEQARADKNVGGSLEAAVTLYAEPELAA
KLTALGDELRFVLLTSGATVADYNDAPADAQQSEVLKGLKVALSKAEGEK
CPRCWHYTQDVGKVAEHAEICGRCVSNVAGDGEKRKFA
>ECP_0070 thiamine transport system permease protein ThiP
MATRRQPLIPGWLIPGVSAATLVVAVALAAFLALWWNAPQGDWVAVWQDS
YLWHVVRFSFWQAFLSAQLSVVPAIFLARALYRRHFPGRQMLLRLCAMTL
ILPVLVAVFGILSVYGRQGWLASLWQSLGLEWTFSPYGLQGILLAHVFFN
LPMASRLLLQALENIPGEQRQLAAQLGMRGWHFFRFVEWPWLRRQIPPVA
ALIFMLCFASFATVLSLGGGPQATTIELAIYQALSYDYDPARAAMLALIQ
MVCCLGLVLLSQRLSKAIAPGTTLLQGWRDPDDRLHSRICDTALIVLALL
LLLPPLLAVIVDGVNRQLPEVLAQPVLWQALWTSLRIALAAGVLCVVLTM
MLLWSSRELRARQKMLAGQALEMSGMLILAMPGIVLATGFFLLLNNTIGL
PQSADGIVIFTNALMAIPYALKVLENPMRDITARYSMLCQSLGIEGWSRL
KVVELRALKRPLAQALAFACVLSIGDFGVVALFGNDDFRTLPFYLYQQIG
SYRSQDGAVTALILLLLCFLLFTVIEKIPGRNVKTD
>ECP_2877 guanine deaminase
MSGEHTLKAVRGSFIDVTRTVDNPEEIASALRFIEDGLLLIKQGKVEWFG
EWEDGKHQIPDTIRVRDYRGKLIVPGFIDTHIHYPQSEMVGAYGEQLLEW
LNKHTFPTERRYEDLEYAREMSAFFIKQLLRNGTTTALVFGTVHPQSVDA
LFEAASHINMRMIAGKVMMDRNAPDYLLDTAESSYHQSKELIERWHKNGR
LLYAITPRFAPTSSPEQMAMAQRLKEEYPDTWVHTHLCENKDEIAWVKSL
YPDHDGYLDVYHQYGLTGKNCVFAHCVHLEEKEWDRLSETKSSIAFCPTS
NLYLGSGLFNLKKAWQKKVKVGMGTDIGAGTTFNMLQTLNEAYKVLQLQG
YRLSAYEAFYLATLGGAKSLGLDDLIGNFLPGKEADFVVMEPTATPLQQL
RYDNSVSLVDKLFVMMTLGDDRSIYRTYVDGRLVYERN
>ECP_0946 putative aliphatic sulfonates transport permease protein
MATPVKKWLLRVAPWFLPVGIVAVWQLASSVGWLSTRILPSPEGVVMAFW
TLSASGELWQHLAISSWRALIGFSIGGSLGLILGLISGLSRWGERLLDTS
IQMLRNVPHLALIPLVILWFGIDESAKIFLVALGTLFPIYINTWHGIRNI
DRGLVEMARSYGLSGIPLFIHVILPGALPSIMVGVRFALGLMWLTLIVAE
TISANSGIGYLAMNAREFLQTDVVVVAIILYALLGKLADVSAQLLERLWL
RWNPAYHLKEATV
>ECP_1215 hypothetical protein
MLKGKILGQKKARQLRRETSSYGRTMNNEIAGLSSPRIPTLTIRLFTQPG
VNILFCVWEQP
>ECP_3411 probable general secretion pathway protein A
MSTRREVILSWLREKRQTWRLCYLLGEAGSGKTWLAQQLQKDKHRRVITL
SLVVSWQGKAAWIVTDDNAAEQGCRDSAWTRDEMAGQLLHALHRTDSRCP
LIIIENAHLNHRRILDDLQRAISLIPDGQFLLIGRPDRKVERDFKKQGIE
LVSTGRLTEHELKASILEGQNIDQPDLLLTARVLKRIALLCRGDRRKLAL
AGETISLLQQAEQTRVFTAKQWRMIYRVLGDKRPRKMQLAVVMSGTILAL
TCGWLLLSSFTAPLPVPAWLIPVTPVVKQDMTKDIAHVVMRDSEALSVLY
GVWGYEVPADSAWCDQAVRAGLVCKSGNASLQTLVDQNLPWIASLKVGDK
KLPVVVVRVGDATVDVLVGQQTWTLTHKWFELVWTGDYLLLWKMSPEGES
TITRDSSEEEILWLETMLNRALHISTESSAEWRPLLVEKIKQFQKSHHLK
TDGVVGFSTLVHLWQVAGESAYLYRDEANISPETTVKGK
>ECP_0194 lysine decarboxylase, constitutive
MNIIAIMGPHGVFYKDEPIKELESALVAQGFQIIWPQNSVDLLKFIEHNP
RICGVIFDWDEYSLDLCSDINQLNEYLPLYAFINTHSTMDVSVQDMRMAL
WFFEYALGQAEDIAIRMRQYTNEYLDNITPPFTKALFTYVKERKYTFCTP
GHMGGTAYQKSPVGCLFYDFFGGNTLKADVSISVTELGSLLDHTGPHLEA
EEYIARTFGAEQSYIVTNGTSTSNKIVGMYAAPFGSTLLIDRNCHKSLAH
LLMMNDVVPVWLKPTRNALGILGGIPRREFTRDSIEEKVAATTQAQWPVH
AVITNSTYDGLLYNTDWIKQTLDVPSIHFDSAWVPYTHFHPIYQGKSGMS
GERVAGKVIFETQSTHKMLAALSQASLIHIKGEYDEEAFNEAFMMHTTTS
PSYPIVASVETAAAMLRGNPGKRLINRSVERALHFRKEVQRLREESDGWF
FDIWQPPQVDEAECWPVAPGEQWHGFSDADANHMFLDPVKVTILTPGMDE
QGNMSEEGIPAALVAKFLDERGIVVEKTGPYNLLFLFSIGIDKTKAMGLL
RGLTEFKRSYDLNLRIKNMLPDLYAEDPDFYRNMRIQDLAQGIHKLIRKH
DLPGLMLRAFDTLPEMIMTPHQAWQRQIKGEVETIALEQLVGRVSANMIL
PYPPGVPLLMPGEMLTEESRTVLDFLLMLCSVGQHYPGFETDIHGAKQDE
DGVYRVRVLKMAG
>ECP_0145 hypothetical protein
MKTFFRYFLFLALCSCCYTASAGTDDDVGYIVGNNYGVGPSDQKWRETGP
NGDVTVKFRYGSGTNNLVFYKPTQLGPTGVSLKWAQLDSASGGGFLYCNR
SSNSSGAPMSIEHKMVDSGKSYGGHKLFKTSVPGLYYTLAISNIWSTLTS
TDINPSGMYIGDSTSQSFNWRGESEQTLYWSCNNANSSKKYWAVGGVMQT
LTIEFYTDTDFNPTTNQRVTLSRTDSYLYSFKAYNAGGSIKSYFLKIDFD
LTDIVLTNPTCFTAALSGPSVSGSTVKMGDL
>ECP_1558 hypothetical protein YdgA
MNKSLVAVGVIVALGVVWTGGAWYTGKKIETHLEDMVAQANAQLKLTAPE
SNLEVSYQNYHRGVFSSQLQLLVKPIAGKVNPWIKSGQSVIFNESVDHGP
FPLAQLKKLNLIPSMASIQTTLVNNEVSKPLFDMAKGETPFEINSRIGYS
GDSSSDISLKPLNYEQKDEKVAFSGGEFQLNADRDGKAISLSGEAQSGRI
DAVNEYNQKVQLTFNNLKTDGSSTLASFGERVGNQKLSLEKMTISVEGKE
LALLEGMEISGKSDLVNDGKTVNSQLDYSLNSLKVQNQDLGSGKLTLKVG
QIDGEAWHQFSQQYNAQTQALLAQPEIANNPELYQEKVTEAFFSALPLML
KGDPVITIAPLSWKNSQGESALNLSLFLNDPATTKEAPQTLAQEVDRSVK
SLDAKLTIPVDMATELMTQVAKLEGYQEDQAKKLAKQQVEGASAMGQMFR
LTTLQDNTITTSLQYANGQITLNGQKMPLEDFVGMFAMPALNVPAVPAIP
QQ
>ECP_1177 major tail protein V
MPVPNPVMPVKGAGTTLWVYNGSGDPYANPLSDVDWSRLAKVKDLTPGEL
TAESYDDSYLDDEDADWTATGQGQKSAGDTSFTLAWMPGEQGQQALLAWF
NEGDTRAYKIRFPNGTVDVFRGWVSSIGKAVTAKEVITRTVKVTNVGRPS
MAEDRSTVTAATGMTVTPASSSVVKGQSTTLTVAFQPEGATDKSFRAVSA
DKTKAIVSVSGMTITVKGVAAGKVNIPVVSGNGELAAVAEITVTDS
>ECP_2375 hypothetical fimbrial chaperone YfcS precursor
MSYKLSCPMLVSTALMALLTTASLTAHASVTPDRTRLVFNESDKSISVTL
RNNNEKLPYLAQSWLEDEKGNKITSPLAVLPPVQRIDAMMNGQVKIQALP
DIHTLPSDRESLFYYNVREIPPKSGKANTLQIALQTRIKLFWRPKALEKI
DMRKPWQFKVTLTRSGQDYTVNNPTPYHVIISDASTQKKGLTAAGFKPLV
MPPKTSQPLKAKMASAPVLTYINDYGARMPLIFRCEGNTCKVDEDQSSKG
>ECP_0259 putative chemotaxis membrane protein
MIVNSVSKSERESIIAALHGQSIFNGGGLSPLNKISPSHPSKPATVAVPE
ETEKKARDVNEKTALLKKKSATELGELATSINTIARDAHMEANLEMEIVP
QGLRVLIKDDQNRNMFERGSAKIMPFFKTLLVELAPVFDSLDNKIIITGH
TDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPENKVMQVSAMADQMLL
DAKNPQSAGNRRIEIMVLTKSASDTLYQYFGQHGDKVVQPLVQKLDKQQV
LSQRAR
>ECP_2271 hypothetical protein YfaS precursor
MDTQRFQSQFHWHLSFKFSGAIAACLSLSLVGTGLANADDSLPSSNYAPP
AGGTFFLLADSSFSRSEEAKVRLEAPGRDYRRYQMEEYGGVDVRLYRIPD
PMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSS
QSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQA
KPVEPQQGVKLEGASSNFISPQPGNIYIPLGKQEPGLYLVEAMVGGYRAT
TVVFVSDTVALSKVSGNELLVWTAGKKQGEAKTGSEILWTDGLGVMTRGV
TDGSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTD
RPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVNV
TLDARNGGQGSFRLPENAVAGGYELRLAYRKQVYSSSFRVANYIKPHFEI
GLALDKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDL
RYAGRFPVSLEGSETVSDDNGHVALNLPAADKPSRYLLTVSASDGAAYRV
TTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWL
RLEDRTSHSGELQSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSG
KGSTSHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQ
SLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQN
AGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVD
EMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGA
TNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWR
ITARGMNGDGLVGQGRAYLRSEKSLYMKWSMPTVYRMGDKPAAGLFIFSQ
QDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNG
QVQDSISTKLSFVDNSWPVEQQKNVMFGGGDNALTLPEQASNIRLQSSET
PQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQ
MIQDNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQALGVT
QQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAI
ARRGTKTKDFSEADTSDINDSMILDTPESPLADAVANVLTMTLLKKAQLK
STVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQS
TIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVP
DILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLIPGEEEMSF
TLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTW
GISVNKPNAAKQQGQLLEKARNEMGELAYMVPVKELTGTVTFRHLLRFSQ
KGQFVLPPARYVRSYAPAQQSVAPGSEWIGMQVK
>ECP_4146 hypothetical protein YiiX precursor
MKIRLLILSLLVSVPAFAWQPQTGDIIFQISRSSQSKAIQLATHSDYSHT
GMLVIRNKKPYVFEAAGPVKYTPLKQWIAHGEKGKYVVRRVEGGLSVEQQ
QKLAQTAKRYLGKPYDFSFSWSDDRQYCSEVVWKVYQNALGMRVGEQQKL
KEFDLSNPLVQAKLKERYGKNIPLEETVVSPQAVFDAPQLTTVAKEWPLF
SW
>ECP_3809 hypothetical protein
MAVLFSGGFFSLIGGILKSYLFRKWSFTLPQFSTVSISIIVLFLV
>ECP_3261 hypothetical membrane protein
MVQRLLFFVLTILVVKRILSLPLRLLIAVPFVLLTAADMSISLYSWCIFG
TTFNDGFAISVLQSDPDEVVKMLGMYIPYLCAFTFLSLLFLAVIIKYDVS
LPTKKVTGILLLIVISGSLFSACQFAYKDAKNKEAFSPYILASRFATYTP
FFNLNYFALAAKEHQRLLSIANTVPYFQLSVRDTGIDTYVLIVGESVRVD
NMSLYGLHALRHRKSKHKESRSNCLIKQ
>ECP_1772 hypothetical transport protein YebQ
MPKVQADGLPLPQRYGAILTIVIGISMAVLDGAIANVALPTIATDLHATP
ASSIWVVNAYQIAIVISLLSFSFLGDMFGYRRIYKCGLVVFLLSSLFCAL
SDSLQMLTLARVIQGFGGAALMSVNTALIRLIYPQRFLGRGMGINSFIVA
VSSAAGPTIAAAILSIASWKWLFLINVPLGIIALLLAMRFLPPNGSRASK
PRFDLPSAVMNALTFGLLITALSGFAQGQSLTLIGAELVVMVVVGIFFIR
RQLSLPVPLLPVDLLRIPLFSLSICTSVCSFCAQMLAMVSLPFYLQTVLG
RSEVETGLLLTPWPLATMVMAPLAGYLIERVHAGLLGALGLFIMAAGLFS
LVLLPASPADINIIWPMILCGAGFGLFQSPNNHTIITSAPRERSGGASGM
LGTARLLGQSSGAALVALMLNQFGDNGTHVSLIAAAILAVIAACVSGLRI
TQPRSRA
>ECP_1933 hypothetical protein
MKRSEIRKALEAWFDVERYEAIEKLTLQHFYVEVERRILAYRMLLSRNTI
PTFNRLMLDDYRNKILSGEIFFSGDTATLGHELARTYAVNPTTRSHAQFY
AKTLALTEATPELSGLSQSEFLSEYLKETSLNNLSRITVDIHLEEASTEE
IIEHLKVLIPRWKRQLKMKSPAPREYRFGKSTFRKIIEYRLIPMMDLIFW
GEDNGIKIPLSLISSLLHEDSDNDRDEGMLKATDYPLAMAFLTDASYLKS
LEDYMMENNHLKDSPVEKHVEDDRNKKK
>ECP_1434 tellurite resistance protein TehB
MIIRDENYFTDKYELTRTHSEVLEAVKVVKPGKTLDLGCGNGRNSLYLAA
NGYDVDAWDKNAMSIANVERIKSIENLDNLHTRVVDLNNLTFDGQYDFIL
STVVLMFLEAKTIPGLITNMQRCTKPGGYNLIVAAMDTADYPCTVGFPFA
FKEGELRRYYEGWEMVKYNEDVGELHRTDANGNRIKLRFATMLARKK
>ECP_0031 carbamoyl-phosphate synthase small chain
MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSR
QIVTLTYPHIGNVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSS
YLKRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAF
PGLNGMDLAKEVTTAEPYSWTQGSWTLTGGLPEAKKEDELPYHVVAYDFG
AKRNILRMLVDRGCRLTIVPAQTSAEDVLKMNPDGIFLSNGPGDPAPCDY
AITAIQKFLETDIPVFGICLGHQLLALASGAKTIKMKFGHHGGNHPVKDV
EKNVVMITAQNHGFAVDEATLPANLRVTHKSLFDGTLQGIHRTDKPAFSF
QGHPEASPGPHDAAPLFDHFIALIEQYRKTAK
>ECP_3099 putative regulatory protein, GntR family
MEQVITKRRYYDIGLQIEELLYSGVFKAGERLPSERELSERFNTSRTTIR
EAIIMLELKGVLNVKQGSGIFFVDSTDKLNQKSLMPYSEIGPFELLQARQ
VIESNITGFAASQISFNELQELKKIISLQENAIAAESDKFEDLDHRFHSI
IAEATQNRVLIKQAAELWRAVRTENPRWKKLNYKYLHEKHLRLQWLEDHR
AIFLALQQKDSELAREASWRHLENSKNELIKIFKQDASISDFDDFFFAR
>ECP_1369 putative beta-phosphoglucomutase
MKLQGVIFDLDGVITDTAHLHFQAWQQIAAEIGISIDAQFNESLKGISRD
ESLRRILQHGGKEGDFNSLEKAQLAYRKNLLYVHSLRELTVNAVLPGIRN
LLADLRAQQIPVGLASVSLNAPTILAALELREFFTFCADASQLKNSKPAP
EIFLAACAGLGVPPQACIGIEDAQAGIDAINASGMRSVGIGAGLTGAQLL
LPSTESLTWPRLSAFWQNV
>ECP_0019 putative cytoplasmic protein
MFTNVNVDCCKTPGCKNLGLLNSQDYVAQGKNILCRECGYLFPVISEQSL
NIYRNIVNHSWRGLICQCSTCGGTSLKKYGYSAQGQRRMYCHHCEKTFIT
LEHVITTPRGALLALMIEQGEALADIRKSLRLNSTGLSRELLKLAREANY
KESRQCFPASDITLSTRAFRVKYNGSNNSLYALVTAEEQSGRVVAISTNY
SPSAVEQHYQYTSNYEERMSPGTLAHHVQRKELLTMRRDTLFDIDYGPAV
LHQNDPGMLVKPVLPAYRHFELVRILTDEHSNNVQHYLDHECFILGGCLM
ANLQHIHQGRCHISFVKERGVAPATIDFPPRLFLSGGVRNNVWRAFSNRN
YSMAVCNLTGSKKVREMRHATLNSATRFIHFVENHPFLISLNRMSPANVV
STLDILKHLWNKKLEHGTI
>ECP_2503 polyphosphate kinase
MGQEKLYIEKELSWLSFNERVLQEAADKSNPLIERMRFLGIYSNNLDEFY
KVRFAELKRRIIISEEQGSNSHSRHLLGKIQSRVLKADQEFDGLYNELLL
EMARNQIFLINERQLSVNQQNWLRHYFKQYLRQHITPILINPDTDLVQFL
KDDYTYLAVEIIRGDTIRYALLEIPSDKVPRFVNLPPEAPRRRKPMILLD
NILRYCLDDIFKGFFDYDALNAYSMKMTRDAEYDLVHEMEASLMELMSSS
LKQRLTAEPVRFVYQRDMPNALVEVLREKLTISRYDSIVPGGRYHNFKDF
INFPNVGKANLVNKPLPRLRHIWFDKAQFRNGFDAIRERDVLLYYPYHTF
EHVLELLRQASFDPSVLAIKINIYRVAKDSRIIDSMIHAAHNGKKVTVVV
ELQARFDEEANIHWAKRLTEAGVHVIFSAPGLKIHAKLFLISRKENGEVV
RYAHIGTGNFNEKTARLYTDYSLLTADARITNEVRRVFNFIENPYRPVTF
DYLMVSPQNSRRLLYEMVDREIANAQQGLPSGITLKLNNLVDKGLVDRLY
AASSSGVPVNLLVRGMCSLIPNLEGISDNIRAISIVDRYLEHDRVYIFEN
GGDKKVYLSSADWMTRNIDYRIEVATPLLDPRLKQRVLDIIDILFSDTVK
ARYIDKELSNRYVPRGNRRKVRAQLAIYDYIKSLEQSE
>ECP_1889 hypothetical protein YedQ
MQHETKMENQSWLKKLARRLGPGHVVNLCFIVVLLFSTLLTWREVVVLED
AYISSQRNHLENVANALDKHLQYNVDKLIFLRNGMREALVAPLDFTSLRN
AVTEFEQHRDEHAWQIELNRRRTLPVNGVSDALVSEGNLLSRENESLDNE
ITAALEVGYLLRLAHNSSSMVEQAVYVSRAGFYVSTLPTLFTRNVPTRYY
GYVTQPWFIGHSQRENRHRAVRWFTSQPEHASNTEPQVTVSVPVDSNNYW
YGVLGMSIPVRTMQQFLRNAIDKNLDGEYQLYDSKLRFLTSSNPDHPTGN
IFDPRELALLAQAMEHDTRGGIRMDSRYVSWERLDHFDGVLVRVHTLSEG
VRGDFGSISIALTLLWALFTTMLLLSWYVIRRMVSNMYVLQSSLQWQAWH
DTLTRLYNRGALFEKARPLAKLCQTHQHPFSVIQVDLDHFKAINDRFGHQ
AGDRVLSHAAGLISSSLRAQDVAGRVGGEEFCVILPGANLTQAAEVAERI
RLKLNEKEMLIAKSTTIRISASLGVSSSEETGDYDFEQLQSLADRRLYLA
KQAGRNRVFASDNA
>ECP_2782 lactaldehyde reductase
MMANRMILNETAWFGRGAVGSLTDEVKRRGYQKALIVTDKTLVQCGVVAK
VIDKMDAAGLAWAIYDGVVPNPTITVVKEGLDVFQNSGADYLIAIGGGSP
QDTCKAIGIISNNPEFADVRSLEGLSPTNKPSVPILAIPTTAGTAAEVTI
NYVITDEEKRRKFVCVDPHDIPQVAFIDADMMDGMPPALKAATGVDALTH
AIEGYITRGAWALTDALHIKAIEIIAGALRGSVVGDKDAGEEMALGQYVA
GMGFSNVGLGLVHGMAHPLGAFYNTPHGVANAILLPHVMRYNADFTGEKY
RDIARVMGVKVEGMSLEEARNAAVEAVFALNRDVGIPPHLRDVGVRKEDI
PALAQAALDDVCTGGNPREATLEDIVELYHTAW
>ECP_1678 catalase HPII
MSQHNEKNPHQHQSPLHDSSEAKPGMDSLAPEDGSHRPAAEPTPPGAQPT
APGSLKAPDTRNEKLNSLEDVRKGSENYALTTNQGVRIADDQNSLRAGSR
GPTLLEDFILREKITHFDHERIPERIVHARGSAAHGYFQPYKSLSDITKA
DFLSDTNKITPVFVRFSTVQGGAGSADTVRDIRGFATKFYTEEGIFDLVG
NNTPIFFIQDAHKFPDFVHAVKPEPHWAIPQGQSAHDTFWDYVSLQPETL
HNVMWAMSDRGIPRSYRTMEGFGIHTFRLINAEGKATFVRFHWKPLAGKA
SLVWDEAQKLTGRDPDFHRRELWEAIEAGDFPEYELGFQLIPEEDEFKFD
FDLLDPTKLIPEELVPVQRVGKMVLNRNPDNFFAENEQVAFHPGHIVPGL
DFTNDPLLQGRLFSYTDTQISRLGGPNFHEIPINRPTCPYHNFQRDGMHR
MGIDTNPANYEPNSINDNWPRETPPGPKRGGFESYQERVEGNKVRERSPS
FGEYYSHPRLFWLSQTPFEQRHIVDGFSFELSKVVRPYIRERVVDQLAHI
DLTLAQAVAKNLGIELTDDQLNITPPPDVNGLKKDPSLSLYAIPDGDVKG
RVVAILLNDEVRSADLLAILNALKAKGVHAKLLYSRMGEVTADDGTVLPI
AATFAGAPSLTVDAVIVPCGNIADIANNGDANYYLMEAYKHLKPIALAGD
ARKFKATIKVADQGEEGIAEADSADGSFMDELLTLMTAHRVWSRIPKIDK
IPA
>ECP_2512 putative outer membrane protein (RatA-like protein)
MSLTLNKILVLCALLISAMLPGWSWAESAWQDSSDTVGEFNGTVPTADSA
SIPVYQGSVFLDPAKTHDVAFTAKPSEFSADVSVSKLLVTNPQDREGDII
ATPRWENQTPPAISLVWADAATPGTLLDPQPVADRSFCAQGLAGRSLVAW
AQPDPQQTMPLLYLLTSTGYPYESVLTLADQKVTLKIAPAQGDLISVSAA
GYDESSGAAKMTVGGSITLTVTTKDCVGNVVGNIPFVIKRKDAENRQGVV
NNTAPVKLGTTELTTTATEYRGTTDANGVATVTVTQANGPGVKTPLVASL
AGIAQASETAVIFTVLTSPDVPQATMWGHMPDTLKARDYTFSRPKLAAEV
DNEDGTVNDHNETWSTFTWSGADKHCDILPGMRQFGALATVVPTSVQDVA
GWPMQGNFYWSSLAGMSGQHHAADVSNRSEAQKPDDTTFIVSCVDKEAPD
VEPKLVLTPGSYDSTIKAMKVKVGEEASLRLTITDSKNNDQPLAYYYFSL
HLDDGINRKNQTDAAWETHPVQIDGGSNVRKVDAHTYEGITDANGEATLT
LTQPGGVGVKTHITARMRSDFTASDEKDVIFTVITSPDTDKARMWGHMLG
IIEANNIFKRPRLADETDNELGSVRENNEDWALFDQNSSMQAECGLGHIP
SQSSLHSLFAAHPANAIGTEYGWPTLQKAYLSAVEETSHASVNLATGNID
TYSGFKQNYLSCSGNEMVAKIAATTDRDVSAGSRAQAKVGDTITMTVRTF
NALNNAPVPYTAFTITKDMGKNRQGQTTGFDDPTRGAIEMNGTLYGTSQP
SLVYAGTTDAQGFATVEIKQSQGVGLSTPLNIVPVNSYIPNTVNYNVIFT
TLTSPDAVGAQMWGHMDETITVDALTFARPRLAAEVFSPDGTLTENNEVW
SRVSQANASSTSKGGCGANMLPRRSQLSALYDANNGNGVQTVHGWPTQRQ
PYWSSSPADQVPHYYTIALNDGARTVGGSTAVYVSCLTTANNPASSITLE
VVDPAQWNAAANAAKLKKGETLQVKVTVKDAQGNPLGDIPFTLKRGDGYT
RSEEKHVAGSGDALVAPVVVNGGLADETSLNNTAAAYSAITGSDGTKILT
ITRPDTHGTKTSLTATLYSDTTKKATLDTIFTVVTSPDSDKAKMWGHMPE
TVTAADGTVFKRPLLLKELSSTSGRTAIAEENEDWAQFTQTQAISTSSNG
CGSEYVPSQAGLESLYEANRGNAMKTVQGWPVASSYLSSTTGSSSLEQRD
FKAVNLSSGTSSIIPSATKELLTCQTTPIVKASQIVLEAADPTKFDSTNN
VVKAKKGEEAVLRVTTKDAQGNPVGNTAFTLKRNTSVNRANVSTTTSIAS
LAVTDAWGNTQDDFLSTTLVIYGVTGADGITTFTLKQDQTTGLKTELTAA
LDSSSSTKSTLPVVFTVLTSPDSPKAKFWGHMAETATGDDGLIYRRPLLR
DENSATTSIGTLVEEGEAWSTFPSGQANDTSINGCGAEYVPTDNELRAIY
AHQGSSALHDAIGWPVSRFYISNTVADTFTQTFTYDVVSLKTGDETQMPS
SGGALLSCRTTPVAVASQIIVEANDTAQFVKVDDTLSALKVKKGEDAVIR
VVTKNAQGNSVPNVPFILRREGSKNRQNAEMINKSITVINAAGASARMNS
SSSLLYGVTGADGTTSFTVKQDDSMGLVTNMYAQLYQLTIESNKLPVMFT
VITSPDTPLASYWGHMSETFTTRSGIAFKRPLLTAEHPAGQSTMANNESW
LSLNTAAKNDVSKSDCGEPYQPLLSEFQELYSEHPNGAIGTDLGLPLTNT
WWAYDKIAYANVWYDQSINLSNGSSSRALSNTVAFVSCLVNPHAVAASIK
MTSTALDAEKTASNDGRPSATAAKGTAIRMTVIVRDSGGNLLPGANFNLI
RGTALDRAKNRLDSTYDDLTIVPVTPAGVNMSLYNNGAQALLTTGSDGKA
TFDVTQNETYGLATPLTATLMRDTTKSATMDVIFTVITSPNSPKAKYWGH
MPDTFTSRAGVTFKRPLLAAEATLGSSVSNNNESWSYLFYTNKVTPDCPV
EYQPRLNELQGLYNDHPGGTILTDLGLPITAGSGNWWTYEMSTTDALTWY
YGVINLKTGQSTTTINGYALMLCLTQPHSAPASLTLSSTAYDEGRTASNG
GTPTSSVKKGEMLPIVVTIKDANGNPVGGEGVTLKRVQAKSRSGISVSSN
TVDDLILDEVTPTSARISFNQNTSAWSGFTGSDGTITFNVTQNNTVGLVT
PFTASLARNPQVTANQDLIFTVVTSPDSAKANYWGHMPATLTAVNGAVFE
RPKLWSELTSTSGVGKINNNNEDWPYFTPTQKSDASVSPCEVARQPLFND
LSSLSARYPNNTFVTETGWPAYYTWWAEDKSADGKDQSVDLRNGTLYTGS
TKSFQPCLANARSTVSSVTLTSTAFDADSQAAKVKKGEAMSVTVTVKDSA
GNTVPNVEFTLKRGEASPRNAGATLYGNVVAMDDLIVQPLSGSAITLSES
GNTISGMTGADGTASFSLRQDNTPGYKMPLTVTLANYASATDTLDAIFTV
PTSPNVSSAHFWGHMADTVVVNSKSLHRPLLTTELPSGANPVSSPIINYE
NWASAHIIDASKWDIARQCGSIENAPTYNELELLHTVFNSLGWPSSPSFP
YLSSQQCGMDEGTGAQDCSITLINKPGLVTCFQ
>ECP_1144 hypothetical phage associated protein
MKVLLSIKPEFAEKILNGTKRFEFRKGIFKNPQISTVVIYATMPLGKVVG
QFRIESILSDEPESLWKKTEKHAGISKQFYDSYYSGREKAYAIKIGEVER
YKEPIPISALGSNIKPPQSYLYLPA
>ECP_1191 iron transport protein, ATP-binding component
MMQSAGIVVNDVAVTWRNGHTALRDASFTVPSGSIAALVGVNGSGKSTLF
KAIMGFVRLTSGKISVLGIPTRQALQKNLVAYVPQSEEVDWSFPVLVEDV
VMMGRYGHMGMLRIAKKRDRQIVTDALERVDMVNFRHRQIGELSGGQKKR
VFLARAIAQQGDVILLDEPFTGVDVKTEAKIISLLRELRAEGKTMLVSTH
NLGSVTTFCDYTVMVKGTVLASGPTDTTFTAENLELAFSGVLRHVTLNGS
EESIITDDERPFVAHRPSAVQREER
>ECP_2337 hypothetical membrane protein YfcC (hypothetical C4-dicarboxylate anaerobic carrier)
MSAITESKPTRRWAMPDTLVIIFFVAILTSLATWVVPVGMFDSQEVQYQV
DGQTKTRKVVDPHSFRILTNEAGEPEYHRVQLFTTGDERPGLMNFPFEGL
TSGSKYGTAVGIIMFMLVIGGAFGIVMRTGTIDNGILALIRHTRGNEILF
IPALFILFSLGGAIFGMGEEAVAFAIIIAPLMVRLGYDSITTVLVTYIAT
QIGFASSWMNPFCVVVAQGIAGVPVLSGSGLRIVVWVIATLIGLIFTMVY
ASRVKKNPLLSRVHESDRFFREKQADVEQRPFTFGDWLVLIVLTAVMVWV
IWGVIVNAWFIPEIASQFFTMGLVIGIIGVVFRLNGMTVNTMASSFTEGA
RMMIAPALLVGFAKGILLLVGNGEAGDASVLNTILNSIANAISGLDNAVA
AWFMLLFQAVFNFFVTSGSGQAALTMPLLAPLGDLVGVNRQVTVLAFQFG
DGFSHIIYPTSASLMATLGVCRVDFRNWLKVGATLLGLLFIMSSVVVIGA
QLMGYH
>ECP_0104 dephospho-CoA kinase
MRYIVALTGGIGSGKSTVANAFADLGINVIDADIIARQVVEPGAPALHAI
ADHFGANMIAADGTLQRRALRERIFANPEEKNWLNALLHPLIQQETQHQI
QQATSPYVLWVVPLLVENSLYKKANRVLVVDVSPETQLKRTMQRDDVTRE
HVEQILAAQATREARLAVADDVIDNNGAPDAIASDVARLHAYYLQLASQF
VSQEKP
>ECP_3850 hypothetical protein
MPFSNSSLSAQVKSYLTFLPEEIRQKILEHLHGVIHYEPVIGIMGKSGTG
KSSLCNAIFQSRICATHPLNGCTRQAHRLTLQIGERRMTLVDLPGIGETP
QHDQEYRTLYRQLLPELDLIIWILRADERAYAADIAMHQFLLNEGADPSR
FLFVLSHADRVFPAEEWNDTEKCPSRQQELSLATVTARVATLFPSSFPVL
SVAAPVGWNLPAFVSLMIHALPPQATSAVYSHIRGENRSEQTQKHAQQTF
GDAIGKSFDDTVARFTFPVWMLQLLRKARDRIIHLLITLWERLF
>ECP_3482 penicillin-binding protein 1A
MKFVKYFLILAVCCILLGAGSIYGLYRYIEPQLPDVATLKDVRLQIPMQI
YSADGELIAQYGEKRRIPVTLDQIPPEMVKAFIATEDSRFYEHHGVDPVG
IFRAASVALFSGHASQGASTITQQLARNFFLSPERTLMRKIKEVFLAIRI
EQLLTKDEILELYLNKIYLGYRAYGVGAAAQVYFGKTVDQLTLNEMAVIA
GLPKAPSTFNPLYSMDRAVARRNVVLSRMLDEGYITQQQFDQTRTEAINA
NYHAPEIAFSAPYLSEMVRQEMYNRYGESAYEDGYRIYTTITRKVQQAAQ
QAVRNNVLDYDMRHGYRGPANVLWKVGESAWDNNKITDTLKALPTYGPLL
PAAVTSANPQEATAMLADGSTVVLSMEGVRWARPYRSDTQQGPTPRKVTD
VLQTGQQIWVRQVGDAWWLAQVPEVNSALVSINPQNGAVMALVGGFDFNQ
SKFNRATQALRQVGSNIKPFLYTAAMDKGLTLASMLNDVPISRWDAGAGS
DWQPKNSPPQYAGPIRLRQGLGQSKNVVMVRAMRAMGVDYAAEYLQRFGF
PAQNIVHTESLALGSASFTPMQVARGYAVMANGGFLVDPWFISKIENDQG
GVIFEAKPKVACPECDIPVIYGDTQKSNVLENNDVEDVAISREQQNVSVP
MPQLEQANQALVAKTGAQEYAPHVINTPLAFLIKSALNTNIFGEPGWQGT
GWRAGRDLQRHDIGGKTGTTNSSKDAWFSGYGPGVVTSVWIGFDDHRRNL
GHTTASGAIKDQISGYEGGAKSAQPAWDAYMKAVLEGVPEQPLTPPPGIV
TVNIDRSTGQLANGGNSREEYFIEGTQPTQQAVHEVGTTIIDNGEAQELF
>ECP_3493 protein YhgF (S1 RNA binding domain)
MMNDSFCRIIAGEIQARPEQVDAAVRLLDEGNTVPFIARYRKEITGGLDD
TQLRNLETRLSYLRELEERRQAILKSISEQGKLTDDLANAINATLSKTEL
EDLYLPYKPKRRTRGQIAIEAGLEPLADLLWSDPSHTPEVAAAQYIDADK
GVADTKAALDGARYILMERFAEDAALLAKVRDYLWKNAHLVSTVVNGKEE
EGAKFRDYFDHHEPLSTVPSHRALAMFRGRNEGILQLSLNADPQFEEPPK
ESYCEQIIMDHLGLRLNNAPADSWRKGVVSWTWRIKVLMHLETELMGTVR
ERAEDEAINVFARNLHDLLMAAPAGLRATMGLDPGLRTGVKVAVVDATGK
LVAADTIYPHTGQAAKAAMTVAALCEKHNVELVAIGNGTASRETERFYLD
VQKQFPKVTAQKVIVSEAGASVYSASELAAQEFPDLDVSLRGAVSIARRL
QDPLAELVKIDPKSIGVGQYQHDVSQTQLARKLDAVVEDCVNAVGVDLNT
ASVPLLTRVAGLTRMMAQNIVAWRDENGQFQNRQQLLKVSRLGPKAFEQC
AGFLRINHGDNPLDASTVHPEAYPVVERILAATQQALKDLMGNSSELRNL
KASDFTDEKFGVPTVTDIIKELEKPGRDPRPEFKTAQFADGVETMNDLQP
GMILEGAVTNVTNFGAFVDIGVHQDGLVHISSLSNKFVEDPHTVVKAGDI
VKVKVLEVDLQRKRIALTMRLDEQPGETNARRGGGNDRPQNNRPAAKPRG
REAQPAGNSAMMDALAAAMGKKR
>ECP_4546 hypothetical protein
MADSADIAYENEQFSMSIRLKNRIRNRLPETGFCYNCGEPVKTGLFCDGD
CREDYEKRERFGKINTDNV
>ECP_3823 putative ABC-transporter membrane protein
MGEKCRYTPFPHGCSHATAAGAAAMMLLHLLCEPFGDFGFMRRALVGCLA
LTLSAAPLGCFLLLRRMSLIGDALSHAVLPGVAIGYLVSGMSLVAMGVGG
FIAGLSVAMLSGVVSRRTGLREDASFAGFYLGSLALGVTLVSLRGSSVDL
LHVLFGSILAIDANALITIGIISSGSVLVLALIYRVLVIESFDVTFLKVL
SRRSRALIHCLFLSMVVLNLVAGFQLLGTLMTVGIMMLPAASARFWSQRL
SIMLLAAVGIGTCASLVGLTWSYYADLPAGPAVILTSTLFFCFSVLFGSN
GGMLCRCR
>ECP_3173 hypothetical protein YgjM
MIAIADILQAGEKLTAVAPFLAGIQNEEQYTQALELVDHLLLNDPENPLL
DLACAKITAWEESAPEFAEFNAMAQAMPGGIAVIRTLMDQYGLTLSDLPE
IGSKSMVSRVLNGKRKLTLEHAKKLATRFGISPALFID
>ECP_3086 hypothetical protein YghZ (aldo/keto reductase)
MVWLANPERYGQMQYRYCGKSGLRLPALSLGLWHNFGHVNALESQRAILR
KAFDLGITHFDLANNYGPPPGSAEENFGRLLREDFAAYRDELIISTKAGY
DMWPGPYGSGGSRKYLLASLDQSLNRMGLEYVDIFYSHRVDENTPMEETA
SALAHAVQSGKALYVGISSYSPERTQKMVELLREWKIPLLIHQPSYNLLN
RWVDKSGLLDTLQNNGVGCIAFTPLAQGLLTGKYLNGIPQDSRMHREGNK
VRGLTPKMLTEANLNSLRLLNEMAQQRGQSMAQMALSWLLKDERVTSVLI
GASRAEQLEENVQALNNLTFSTEELAQIDQHIADGELNLWQASSDK
>ECP_4114 rhamnulokinase
MTFRNCVAVDLGASSGRVMLARYERECRSLTLREIHRFNNGLHSQNGYVT
WNVDRLESAIRLGLNKVCEEGIRIDSIGIDTWGVDFVLLDQQGQRVGLPV
AYRDSRTNGLMAQAQQQLGKRDIYQRSGIQFLPFNTIYQLRALTEQQPEL
IPHIAHALLIPDYFSYRLTGKMNWEYTNATTTQLVNINSDDWDESLLAWS
GANKAWFGRPTHPGNVIGHWICPQGNEIPVVAVASHDTASAVIASPLNGS
RAAYLSSGTWSLMGFESQTPFTNDTALAANITNEGGAEGRYRVLKNIMGL
WLLQRVLQERQINDLPALIAATQALPACRFIINPNDDRFINPDEMCSEIQ
AACRETAQPIPESDAELARCIFDSLALLYADVLHELAQLRGEDFSQLHIV
GGGCQNALLNQLCADACGIRVIAGPVEASTLGNIGIQLMTLDELNNVDDF
RQVVSTTANLTTFTPNPDSEIAHYVAQIHSTRQTKELCA
>ECP_2556 hypothetical protein Yfha (putative transcriptional regulator)
MSHKPAHLLLVDDDPGLLKLLGLRLTSEGYSVVTAESGAEGLRVLNREKV
DLVISDLRMDEMDGMQLFAEIQKVQPGMPVIILTAHGSIPDAVAATQQGV
FSFLTKPVDKDALYQAIDDALEQSAPATDERWREAIVTRSPLMLRLLEQA
RLVAQSDVSVLINGQSGTGKEIFAQAIHNASPRNSKPFIAINCGALPEQL
LESELFGHARGAFTGAVSNREGLFQAAEGGTLFLDEIGDMPAPLQVKLLR
VLQERKLRPLGSNRDIDIDVRIISATHRDLPKAMTRGEFREDLYYRLNVV
SLKIPALAERTEDIPLLANHLLRQAAERHKPFVRAFSTDAMKRLMTASWP
GNVRQLVNVIEQCVALTSSPVISDALVEQALEGENTALPTFVEARNQFEL
NYLRKLLQITKGNVTHAARMAGRNRTEFYKLLSRHELDANDFKE
>ECP_2679 hypothetical protein
MCAFLLSLVLPAQATSFTEYLPMSDSEYAQKRALKPLLTMPYDADQNWHF
RKVSV
>ECP_3359 acriflavine resistance protein E precursor
MTKHARFFLLPSFILISAALIAGCNDKGEEKAHVGEPQVTVHIVKTAPLE
VKTELPGRTNAYRIAEVRPQVSGIVLNRNFTEGSDVQAGQSLYQIDPATY
QASYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADAR
QADATVIAAKATVESARINLAYTKVTAPISGRIGKSTVTEGALVTNGQTT
ELATVQQLDPIYVDVTQSSNDFMRLKQSVEQGNLHKENATSNVELVMENG
QTYPLKGTLQFSDVTVDESTGSITLRAIFPNPQHTLLPGMFVRARIDEGV
QPDAILIPQQGVSRTPRGDATVLIVNNKSQVEARPVVASQAIGDKWLISE
GLKSGDQVIVSGLQKARPGEQVKATTDTPADTASK
>ECP_1405 hypothetical protein
MKKVAALVALSLLMAGCVSSDKIAVTPEQLQHHRFVLESVNGKPVTSDKN
PPEISFGEKMMISGSMCNRFSGEGKLSNGELTAKGLAMTRMMCANPQLNE
LDNTISEMLKEGAQVDLTANQLTLATAKQTLTYKLADLMN
>ECP_2412 putative PTS system IIC component YpdG
MAIKKRSATVVHGASGAAAAVKNPQASKSSFWVELPQHVMSGISRMVPTL
IMGGVILAFSQLIAYSWLKIPADIGIMDALNSGKFSGFDLSLLKFAWLSQ
SFGGVLFGFAIPMFAAFVANSIGGKLAFPAGFIGGLMSTQPTQLLNFDPS
TMQWATSSPVPSTFIGALIISIVAGYLVKWMNQKIQLPDFLLAFKTTFLL
PILSAIFVMLAMYYVITPFGGWINGGIRTVLTAAGEKGALMYAMGIAAAT
AIDLGGPINKAAGFVAFSFTTDHVLPVTARSIAIVIPPIGLGLATIIDRR
LTGKRLFNAQLYPQGKTAMFLAFMGISEGAIPFALESPITAIPSYMVGAI
VGSTAAVWLGAVQWFPESAIWAWPLVTNLGVYMAGIALGAVITALMVVFL
RLMMFRKGKLLIDSL
>ECP_0618 enterobactin synthetase component F
MSQHLPLVAAQPGIWMAEKLSDLPSAWSVAHYVELTGEVDAPLLARAVVA
GLAQADTLRMRFTEDNGEVWQWVDDAQTFELPEIIDLRTNIDPHGTARAL
MQADLQQDLRVDSGKPLVFHQLIQVADNRWYWYQRYHHLLVDGFSFPAIT
RQIANIYCALLRGEPTPASPFTPFADVVEEYQQYRESEAWQRDAAFWAEQ
RRQLPPPASLSPAPLPGRSASADILRLKLEFTNGEFRQLATQLSGVQRTD
LALALAALWLGRLCNRMDYAAGFIFMRRLGSAALTATGPVLNVLPLGIHI
AAQETLPELATRLAAQLKKMRRHQRYDAEQIVRDSGRAAGEEPLFGPVLN
IKVFDYQLDIPGVQAQTHTLATGPVNDLELALFPDEHGDLSIEILANKQR
YDEPTLIQHAERLKMLIAQFAAAPSLLCGDVDIMLPGEYAQLAQINATQV
EIPETTLSALVAEQAAKTPDAPALADARYQFSYREMREQVVALANLLRER
GVKPGDSVAVALPRSVFLTLALHAIVEAGAAWLPLDTGYPDDRLKMMLED
ARPSLLITTDDQLPRFSDIPNLTSLCYNAPLTPQGSAPLQLSQPHHTAYI
IFTSGSTGRPKGVMVGQTAIVNRLLWMQNHYPLTGEDVVAQKTPCSFDVS
VWEFFWPFIAGAKLVMAEPEAHRDPLAMQHFFADYGVTTTHFVPSMLAAF
VASLTPQTARQSCATLKQVFCSGEALPADLCREWQQLTGAPLHNLYGPTE
AAVDVSWYPAFGEELAEVRGSSVPIGYPVWNTGLRILDAMMHPVPPGVAG
DLYLTGIQLAQGYLGRPDLTASRFIADPFAPGERMYRTGDVARWLDNGAV
EYLGRSDDQLKIRGQRIELGEIDRVMQALPDVEQAVTHACVINQAAATGG
DARQLVGYLVSQSGLPLDTSALQAQLRETLPPHMVPVVLLQLPQLPLSAN
GKLDRKALPLPELKAQAPGRAPKAGSETIIAAAFSSLLGCDVQDADADFF
ALGGHSLLAMKLAAQLSRQFARQVTPGQVMVASTVAKLATIIDGEEDSTQ
RMGFETILPLREGNGPTLFCFHPASGFAWQFSVLSRYLDPQWSIIGIQSP
RPHGPMQTSANLDEVCEAHLATLLEQQPHGPYYLLGYSLGGTLAQGIAAR
LRARGEQVAFLGLLDTWPPETQNWQEKEANGLDPEVLAEINREREAFLAA
QQGSTSTELFTTIEGNYADAVRLLTTAHSVPFDGKATLFVAERTLQEGMT
PEQAWAPWIAGLDIYRQDCAHVDIISPVAFEKIGPIIRATLNK
>ECP_2548 hypothetical ABC transporter ATP-binding protein YphE
MFTATEAVPVAKVVAGNKRYPGVVALDNVNFTLNKGEVRALLGKNGAGKS
TLIRMLTGSERPDSGDIWIGETRLEGDEATLTRRAAELGVRAVYQELSLV
EGLTVAENLCLGQWPRRNGMIDYLQMAQDAQRCLQALGVDVSPEQLVSTL
SPAQKQLVEIARVMKGEPRVVILDEPTSSLASAEVELVISAVKKMSALGV
AVIYVSHRMEEIRRIASCATVMRDGQVAGDVMLENTSTHHIVSLMLGRDH
VDIAPVAPQEIMDQAVLEVRALRHKPKLEDISFTLRRGEVLGIAGLLGAG
RSELLKAIVGLETYEQGEIVINGEKITRPDYGDMLKRGIGYTPENRKEAG
IIPWLGVDENTVLTNRQKISANGVLQWSTIRRLTEEVMQRMTVKAASSET
PIGTLSGGNQQKVVIGRWVYAASQILLLDEPTRGVDIEAKQQIYRIVREL
AAEGKSVVFISSEVEELPLVCDRILLLQHGTFSQEFHSPVNVDELMSAIL
SVH
>ECP_2810 putative VGR-related protein
MMNIPALNFDHSHHKLKIRGLKSPVDVLTFTGREQLSAPFRYDIEFTSTD
KTIEPESVLMQDGAFSLSAPPVQGMPVQTALRTLHGIITGFKLLSSSRDE
ARYEVRLEPRMALLTRSRQNAIYQNLTVPQIVEKILRERHQMRGQDFVFN
LKSEYPSREQVMQYGEDDLTFISRLLSEVGIWFRFATDARLKIEVIEFYD
DQSGYERGLTLPLRHPSGLHDGATEAVWGLNTAYSVVEKSVTTRDYNYRT
ATAEMMTEQHDATGGDNTTYGEAYHYADNFLQKGDKEAAESGAFYARIRH
ERYLNEQAILKGQSTSSLLMPGLEIKVQGDDAPAVFRKGVLITGVTASAA
RDRSYELTFTAIPYSERYGYRPALIPRPVMAGTLPARVTSTVKNDIYAHI
DKDGRYRVNLDFDRDTWKPGYESLWVRQSRPYAGDTYGLHLPLLAGTEVS
IAFEEGNPDRPYIAGVKHDSAHTDHVTIQNDKRNVLRTPANNKIRLDDER
GKEHIKVSTEYGGKSQLNLGHLVDAGKQQRGEGFELRTDLWGAVRAKKGI
FISADAQDKAQGQVREMAPAMAILDGAQSQMKSLSTDAQTANADPADLSS
QIALLQQSVKDLTQAAILLSAPKGVAIASGEHLQLAASKNLIANAGNHAD
IGVVKNMFIGVGQALSVFVRKAGIKLFANKGAISVQAQNDLMELLAQKSI
EITSTEDEIKITAKKKITLNGGGSYIRLDACGIEAGTPGEYNVKAGYYGR
KPKAKLTPELMAFPVIESGEFNAKFLFTDDDGLPYANTKYIACFSDGTQK
EGITDENGYTENFNTDSKQTIDVRLLNQNIDMILGGVHE
>ECP_4286 excinuclease ABC subunit A
MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT
ITEIHDYLRLLYARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML
LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI
EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN
FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL
SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSTNVHKVVLYG
SGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF
ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG
QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ
IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA
IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV
PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI
NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN
PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK
VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF
DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG
QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW
IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
>ECP_4565 hypothetical transposase-like protein
MVCRALRNALETRPRDGRLLFHSDQGVQYKSNKYRKLLWRYGVRQSMSCR
GNGLDNSPMERVFRSLKNEWLPKGGYGDFIHAVRDINQWINGYYNVYRPH
TNNDGLPPCLHEEKWKQVIPVP
>ECP_0996 chaperone protein TorD
MTTLTVQQIACVYAWLAQLFSRELDDEQLTQIASAQMAEWFSLLKSEPPL
TAAVNGLENSIATLTVRDDARLELAADFCGLFLMTDKQAALPYASAYKQD
EQEIKRLLVEAGMETSGNFNEPADHLAIYLELLSHLHFSLGEGTVPARRI
DGLRQKTLTALREWLPEFAARCRQYDSFGFYAALSQLLLVLVECDYQKR
>ECP_2553 hypothetical protein
MTKEFTQTITFRLRKALENKELATGNGFLFRFVMQIFHFITFFLKNTKEP
FTLQGYFL
>ECP_1518 hypothetical protein
MNSFSEIAHKSGKEAGSQIMKFEVQDASPTIATELNLVTGEQVYYIKRLR
FIEDNAAQLEETWMSVARFPDLTVSHMQKSKFSYIENECGIKIIGTFETF
SPTFPTPEIASILRISPRDPILKIQTQAVDSNSIPLDYSLLYSNIFEFQV
KYFFPR
>ECP_1208 part of hypothetical protein b1169 (putative adhesin)
MILGLWKITPVLKPQIKRTLLPMGKNAVGVLACSSPGESRTCVDAVDDEV
CDSNSYEVISRADLKMNGGSITTNGINSYGAYANGKKAYINLDYVVLETV
ADGSYAVAIRQGNIDIKKFYYNKWH
>ECP_2688 formate hydrogenlyase regulatory protein HycA
MTIWEISEKADYIAQRHRRLQDQWHIYCNSLVQGITLSKARLHHAMSCAP
DKELCFVLFEHFRIYVTLADGFNSHTIEYYVETKDGEDKQRIAQAQLSID
GMIDGKVNIRDREQALEHYLEKIAGVYDSLYTAIENNVPVNLSQLVKGQS
PVA
>ECP_4496 putative arginine repressor
MKEYDDYSAKEKKQLAVCQRLITEKSYLSQEEIRRDLQNHGFDSISQSTV
SRLLKLLGVIKIRNTKGQKIYSVNPQLLPTPDAGRSVAEMVLSVEHNGEF
ILIHTVAGYGRAVARILDFHALPEILGVIAGSNIVWVAPRVVKRTALVHK
QINYLLKLNIYS
>ECP_0144 hypothetical protein
MGSVSKELLANTLTGNDAAKGVGVLIEGLKNTKSAQMVLKPNDATSIYKD
YETENDTTGGIFPDNGNGGTSQPLHFQATLKQDGNIAIEPGDFKATSTFQ
VTYP
>ECP_2620 SsrA-binding protein
MTKKKAHKPGSATIALNKRARHEYFIEEEFEAGLALQGWEVKSLRAGKAN
ISDSYVLLRDGEAFLFGANITPMAVASTHVVCDPTRTRKLLLNQRELDSL
YGRVNREGYTVVALSLYWKNAWCKVKIGVAKGKKQHDKRSDIKEREWQVD
KARIMKNSHR
>ECP_1299 putative cytoplasmic protein YciI
MLYVIYAQDKADSLEKRLSVRPAHLARLQLLHDEGRLLTAGPMPAVDSND
PGTAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKVF
>ECP_2043 hypothetical protein
MRLASRFGYAANQIRRDRPLTHEELMHHVPGIFGEEKHTSRSQNYTYIPT
ITVLESLQREGFQPFFACQTRVRDPGRRGYTKHMLRLRRAGEINGEHVPE
IILLNSHDGTSSYQMLPGYFRFVCQNGCVCGQSLGEVRVPHRGDVVEKVI
EGAYEVVGVFDRIEEKRDAMQSLVLPPPARQALAQAALTYRYGDEHQPVT
TADILTPRRREDYGKDLWSAYQTIQENMLKGGISGRSAKGKRIHTRAIHS
IDTDIKLNRALWVMAETMLESLR
>ECP_1853 UvrC excinuclease ABC subunit C
MKLAHLGRQALMGVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQ
NLEFEEAARIRDQIQAVRRVTEKQFVSNTGDDLDVIGVAFDAGMACVHVL
FIRQGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILL
DFNLSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSK
LSQQSTVHQRLTALASVLKLPEVKRMECFDISHTMGEQTVASCVVFDANG
PLRAEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGK
GQLAQAKNVFAELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFS
LPPDSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLETIEGVGPKRR
QMLLKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH
>ECP_1896 hypothetical protein YedR (putative membrane protein)
MEYGSTKMEERLSRSPGGKLALWAFYTWCGYFVWAMARYIWVMSRIPDAP
VSGFESDLGSTAGKWLGALVGFLFMALVGALLGGIAWYTRPRPARCRRYE
>ECP_1190 iron transport protein, inner membrane component
MNVLLEPFSYEYMLNAMWVSAMVGGLCAFLSCYLMLKGWSLIGDALSHSI
VPGVAGAYMLGLPFSLGAFFSGGLAAGSMLFLNQRTRLKEDAIIGLIFSS
FFGLGLFMVSLNPTSVNIQTIVLGNILAIDPADILQLTIIGILSIIVLFF
KWKDLMVTFFDENHARAIGLHPGRLKLLFFTLLSVSTVAALQTVGAFLVI
CLVVTPGATAWLLTDRFPRLLMIAVTIGSVTSFLGAWVSYFLDGATGGII
VVAQTLLFLLAFVFAPTHGLLANRRRAHKALEDRS
>ECP_1747 ribonuclease D
MITTDDALASLCEAVRAFPAIALDTEFVRTRTYYPQLGLIQLFDGEHLAL
IDPLGISDWSPLKAILRDPSITKFLHAGSEDLEVFLNVFGELPQPLIDTQ
ILAAFCGRPMSWGFASMVEEYSGVTLDKSESRTDWLARPLTERQCEYAAA
DVWYLLPITAKLMVETEASGWLPAALDECRLMQMRRQEVVAPEDAWRDIT
NAWQLRTRQLACLQLLADWRLRKARERDLAVNFVVREEHLWSVARYMPGS
LGELDSLGLSGSEIRFHGKTLLALVEKAQALPEEALPQPMLNLMDMPGYR
KAFKAIKSLITDVSETHKISAELLASRRQINQLLNWHWKLKPQNNLPELI
SGWRGELMAEALHNLLQEYPQ
>ECP_2920 possible ABC-transport protein, ATP-binding component
MVTLEQFRYCPTHSTHPPFCYDFHYVKPGMVAIFGDNGSGKSTLAQLMAG
WYPDFLPGEITGTGTLLGTPIGRLPLNEQSATIQLVQQSPYLQLSGCTFS
VEEEVAFGPENLCLAEKEIMARIDAALALTECQPLRHRHPATLSGGETQR
VVIACAIAMQPKLLILDEAFSRLTPQAREMLLQRLQHWALERGSLIILFE
RHHTPFLNHCQQAWQLQNGALQPLC
>ECP_1834 chemotaxis MotA protein
MLILLGYLVVLGTVFGGYLMTGGSLGALYQPAELVIIAGAGIGSFIVGNN
GKAIKGTLKALPLLFRRTKYTKAMYMDLLALLYRLMAKSRQMGMFSLERD
IENPRESEIFASYPRILADSVMLEFIVDYLRLIISGHMNTFEIEALMDEE
IETHESEAEVPANSLALVGDSLPAFGIVAAVMGVVHALGSADRPAAELGA
LIAHAMVGTFLGILLAYGFISPLASVLRQKSAETSKMMQCVKVTLLSNLN
GYAPPIAVEFGRKTLYSSERPSFIELEEHVRAVKNPQQQTTTEEA
>ECP_3055 glycolate oxidase iron-sulfur subunit
MQTQLTEEMRQNARALEADSILRACVHCGFCTATCPTYQLLGDELDGPRG
RIYLIKQVLEGNDVTLKTQEHLDRCLTCRNCETTCPSGVRYHNLLDIGRD
IVEQKVKRPLPERMLREGLRQVVPRPAVFRALTQVGLVLRPFLPEQVRAK
LPAETVKAKPRPPLRHKRRVLMLEGCAQPTLSPNTNAATARVLDRLGISV
MPANEAGCCGAVDYHLNAQEKGLARARNNIDAWWPAIEAGAEAILQTASG
CGAFVKEYGQMLKNDALYADKARQVSELAVDLVELLREEPLEKLAIRGDK
KLAFHCPCTLQHAQKLNGEVEKVLLRLGFTLTDVPDSHLCCGSAGTYALT
HPDLARQLRDNKMNALESGKPEMIVTANIGCQTHLASAGRTSVRHWIEIV
EQALEKE
>ECP_2539 hypothetical protein YfhR
MALPVNKRVLKILFILFVVAFCVYLVPRVAINFFYYPDDKIYGPDPWSAE
SIEFTAKDGTRLQGWFIPSSTGPADNAISTIIHAHGNAGNMSAHWPLVSW
LPERNFNVFMFDYRGFGKSKGTPSQAGLLDDTQSAINVVRHRSDVNPQRL
VLFGQSIGGANILDVIGQGDREGIRAVILDSTFASYATIANQMLPGSGYL
LDESYSGENYIASVSPIPLLLIHGKADHVIPWQHSEKLYSLAKEPKRLIL
IPDGEHIDAFSDRHGDVYREQMVDFILSALNPQN
>ECP_1571 hypothetical membrane protein YdgK
MTTTTPQRIGGWLLGPLAWLLVALLSTTLALLLYTAALSSPQTFQTLGGQ
ALTTQTLWGVSFITAIAMWYYTLWLTIAFFKRRRCVPKHYIIWLLISVLL
AVKAFAFSPVEDGIAVRQLLFTLLATALIVPYFKRSSRVKATFVNP
>ECP_1141 bacteriophage recombination protein
MSTAHATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIV
ANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQD
NESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRM
LRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDE
TMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFL
KQKATEQKVAA
>ECP_1337 putative regulatory protein YciT, DeoR family
MNSRQQTILQMVIDQGQVSVTDLAKATGVSEVTIRQDLNTLEKLSYLRRA
HGFAVSLDSDDVETRMMSNYTLKRELAEFAASLVQPGETIFIENGSSNAL
LARTLGEQKKNVTIITVSSYIAHLLKDAPCEVILLGGVYQKKSESMVGPL
TRQCIQQVHFSKAFIGIDGWQPETGFTGRDMMRTDVVNAVLEKECEAIVL
TDSSKFGAVHSYSIGPVERFNRVITDSKIRASDLMHLEQSKLTVHVVDI
>ECP_1681 cell operon repressor
MQPVINAPEIATAREQQLFNGKNFHVFIYNKTESISGLHQHDYYEFTLVL
TGRYFQEINGKRVLLERGDFVFIPLGSHHQSFYEFGATRILNVGISKRFF
EQHYLPLLPYCFVASQVYRTNNAFLTYVETVISSLNFRETGLEEFVEMVT
FYVINRLRHYREEQVIDDIPQWLKSTVEKMHDKEQFSESALENMVTLSAK
SQEYLTRATQRYYGKTPMQIINEIRINFAKKQLEMTNYSVTDIAFEAGYS
SPSLFIKTFKKLTSFTPKSYRKKLTEFNQ
>ECP_2563 hypothetical protein YfhH
MNGLLRIRQRYQGLAQSDKKLADYLLLQPDTARHLSSQQLANEAGVSQSS
VVKFAQKLGYKGFPALKLALSEALASQPESPSVPIHNQIRGDDPLRLVGE
KLIKENTAAMYATLNVNTEEKLHECVTMLRSARRIILTGIGASGLVAQNF
AWKLMKIGFNAAAVRDMHALLATVQASSPDDLLLAISYTGVRRELNLAAD
EMLRVGGKVLAITGFTPNALQQRASHCLYTIAEEQATNSASISACHAQGM
LTDLLFIALIQQDLELAPERIRHSEALVKKLV
>ECP_2096 putative colanic acid polymerase
MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERI
SVKKLMIALGIGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWS
IRFKTISPHNHRKILRFFYLVVGLVVVLAAVEMAQIILTGGSSIMESISK
YLIYSNSYVLNFIKFGGKRTTALYFEPAFFALALISIWLSIKQFGIKTPK
TDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIKKKLPLALISL
AVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV
RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVLLSLWYMGKV
IKMMINAFGDNRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKA
LNITR
>ECP_2590 putative transposase
MSKAGHVSLRRPLYMPAMVATSKTEWGRALAANGKKGKVILGSIMRKLAQ
VAYGVLKSGVPFDASRHNPVAA
>ECP_3467 hypothetical protein YhfY
METRLNLLCDAGVIDKDICKGMMQVVNVLETECHLPVRSEQGTMAMTHMA
SALMRSRRGEEIEPLDDELLAELAQSSHWQAVVQLHQVLLKEFALEVNPC
EEGYLLANLYGLWMAANEEV
>ECP_3793 hypothetical protein
MTAMMSEICEAPHTPAFSFIRHHALQLMVGRVHYISPPLNIWQRA
>ECP_1394 hypothetical protein YdaM
MITHNFNTLDLLTSPVWIVSPFEEQLIYANSAARLLMQDLTFSQLRTGPY
SVSSQKELPKYLSDLQNQHDIIEILTVQRKEEETALSCRLVLRELTETEP
VIIFEGIEAPATLGLKASRSANYQRKKQGFYARFFLTNSAPMLLIDPSRD
GQIVDANLAALNFYGYNHETMCQKHTWEINMLGRRVMPIMHEISHLPGGH
KPLNFIHKLADGSTRHVQTYAGPIEIYGDKLMLCIVHDITEQKRLEEQLE
HAAHHDAMTGLLNRRQFYHITEPGQMQHLAIAQDYSLLLIDTDRFKHIND
LYGHSKGDEVLCALARTLESCARKGDLVFRWGGEEFVLLLPRTPLDTALS
LAETIRVSVAKVSISGLPRFTVSIGVAHHEGNESIDELFKRVDDALYRAK
NDGRNRVLAA
>ECP_2525 hypothetical lipoprotein YfhM precursor
MKKLRVAACMLMLALAGCDNNDNAPTAVKKDAPSEVTKAASSENVSSAKL
SAPERQKLAQQSAGKALTLLDLSEVQLDGAATLVLTFSIPLDPDQDFSRV
IHVVDKKSGKVDGAWELSHNLKELRLRHLEPKRDLIVTIGKEVKALNNAT
FSKDDEKTITTRDIQPSVGFASRGSLLPGKVVEGLPVMALNVNNVDVNFF
RVKPESLPAFISQWEYRNSLANWQSDKLLQMADLVYTGRFDLNPARNTRE
KLLLPLGDIKPLQQAGVYLAVMNQAGRYDYSNPATLFTLSDIGVSAHRYH
NRLDIFTQSLENGAAQQGIEVSLLNEKGQTLTQATSDAQGHVQLENDKNA
ALLLARKDGQTTLLDLKLPALDLAEFNIAGAPGYSKQFFMFGPRDLYRPG
ETVILNGLLRDADGKALPDQPIKLDVIKPDGQVLRSVVSQPENGLYHFTW
PLDSNAATGMWHIRANTGDNQYRMWDFHVEDFMPERMALNLTGEKTPLTP
KDEVKFSVVGYYLYGAPANGNILQGQLFLRPLREAVSALPGFEFGDIAAE
NLSRTLDEVQLTLDDKGRGEVSTESQWKETHSPLQVIFQGSLLESGGRPV
TRRAEQAIWPADALPGIRPQFASKSVYDYRTDSTVKQPIVDEGSNAAFDI
VYSDAQGVKKAVSGLQVRLIRERRDYYWNWSEYEGWQSQFDQKDLIENEQ
TLDLKADETGKVSFPVEWGAYRLEVKAPNEAVSSVRFWAGYSWQDNSDGG
GAVRPDRVTLKLDKASYRPGDTIKLHIAAPTAGKGYAMVESSEGPLWWQE
IDVPAQGLDLTIPVDKTWNRHDLYLSTLVVRPGDKSRSATPKRAVGVLHL
PLGDENRRLDLALETPTKMRPNQPLTVKIKASNKNGEMPKQVNVLVSAVD
SGVLNITDYVTPDPWQAFFGQKRYGADIYDIYGQVIEGQGRLAALRFGGD
GDELKRGGKPPVNHVNIVAQQPLPVTLNEQGEGSVTLPIGDFNGELRVMA
QAWTADDFGSNESKVIVAAPVIAELNMPRFMASGDTSRLTLDITNLTDKP
QKLNVALTASGLLELVSNSPAPVELAPGVRTTLFIPVRALPGYGDGDIQA
TISGLALPGETVADQHKQWKIGVRPAFPAQTVNYGTALQPGETWALPADG
LQNFSPVTLEGQLLLSGKPPLNIARYIKELKAYPYGCLEQTTSGLFPSLY
TNAAQLQALGIKGDSDEKRRASVDIGISRLLQMQRDNGGFALWDKNGDEE
YWLTAYVMDFLVRAGEQGYSVPTDAINRGNERLLRYLQDPGMISIPYADN
LKASKFAVQSYAALVLARQQKAPLGALREIWEHRADAASGLPLLQLGVAL
KIMGDATRGEEAIALALKTPRNSDERIWLGDYGSPLRDSALMLSLLEENK
LLPDEQYSLLNTLSQQAFGERWLSTQESNALFLAARTLQDLPGKWQAQTT
FSAEPLTGEKAQTSNLNSDQLATLQVTNSGDQPLWLRVDASGYPQSAPLP
ASNVLQIERHILGTDGKSKSLDSLRSGDLVLVWLQVKASNSVPDALVVDL
LPAGLELENQNLANGSASLEQSGGEVQNLLNQMQQASIKHIEFRDDRFVA
AVAVDEYQPVTLVYLARAVTPGTYQVPQPMVESMYVPQWRATGAADDLLI
VRP
>ECP_3421 probable general secretion pathway protein L
MPESLMVIRSFSTLRKHWEWMTFSADSVSSVHTLTDDLPLESLADQPGAG
NVHLLIPPEGLLYRSLTLTNAKYKLTAQTLQWLAEETLPDASQNWHWTVV
DKQNESVEVIGIQSEKLSRYLERLHTAGLNVTRVLPDGCYLPWEVDSWTL
VNQQTSWLIRSAAHAFNELDEHWLQHLANQFPPENMRCYGVAPHGVAAAN
PLIQHPEIPSLSLYSADIAFQRYDMLHGVFRKQKTVSKSGKWLARLAVSC
LVLAILSFVGSRGIAFWQTLKIEDQLQQQQQETWQRYFPQIKHTHNFRFY
FKQQLAQQYPEAVPLLYHLQTLLLEHPELQLMEANYSQRQKSLTLKMSAK
SEANIDRFCELTQSWLPMEKTEKDPVSGVWTVRNSGK
>ECP_0109 nicotinate-nucleotide pyrophosphorylase
MPPRRYNPDTRRDELLERINLDIPGAVAQALREDLGGTVDANNDITAKLL
PENSRSHATVITRENGVFCGKRWVEEVFIQLAGDDVTIIWHVDDGDVINA
NQPLFELEGPSRVLLTGERTALNFVQTLSGVASKVRHYVELLEGTDTQLL
DTRKTLPGQRSALKYAVLCGGGANHRLGLSDAFLIKENHIIASGSVRQAV
EKASWLHPDAPVEVEVENLEELDEALKAGADIIMLDNFETEQMREAVKRT
NGKALLEVSGNVTDKTLREFAETGVDFISVGALTKHVQALDLSMRFR
>ECP_4167 hypothetical transcriptional regulator YijO
MYHDVSYLLSRLINGPLSLRQIYFASSNGPVPDLAYQVDFPRLEIVLEGE
FIDTGAGAALVPGDVLYVPAGGWNFPQWQAPATTFSVLFGKQQLGFSVVQ
WDGKQYQNLAKQHVARRGPRIGSFLLQTLNEMQMQSQEQQTARLIVTSLL
SHCRDLLGSQIQTASRSQALFEAIRDYIDERYASALTRESVAQAFYISPN
YLSHLFQKTGAIGFNEYLNHTRLEHAKTLLKGYDLKVKEVAHACGFVDSN
YFCRLFRKNTERSPSEYRRQYHSQLTEKPTTPE
>ECP_2806 putative CLPA/B-type chaperone protein
MTGNHPAALLRRLNPYCARALDAAASLCQTRAHAEITIEHWLLKLLEQGE
GDITVIARRYEWDIDTLWQSLLAHLDTLPRPVRERPQLSEPLAALIRQAW
LIASLEGDDPQIRSQHLLMALTEKPMLPACNDLWVLLSLSRVQLERLRPL
LDAQSDECPARQPQVTEPLTSALPETATADAPAKTLTEKQDDALLAVLNR
FTEDVTEKARSGRIDPVFGRDTEIRQMVDILSRRRKNNPILVGEPGVGKT
ALVEGLALRIAEGNVPDSLKTVHIRTLDLGLLQAGAGVKGEFEQRLKNVI
DAVQKSPEPVLLFIDEAHTIIGAGNQAGGADAANLLKPALARGELRTIAA
TTWSEYKQYFERDAALERRFQMVKVDEPDDDTACLMLRGLKARYAQHHGV
HMLDSAIQTAVRLSRRYLTGRQLPDKAVDLLDTAGARVRMSLDTLPEPLT
QLHARLAALDIEREAIEQDCVFYPEASPERLAELTDLRDELQAEAGHLEA
QYQQEKALAQQIMTLRQEGTDSTELQQQLRTHQGFAPLLALDVDARAVAT
VVADWTGIPLSSLLKDEQSDLLSMEKSLENRVVGQSPALCAIAQRLRAAK
TGLTPENGPQGVFLLTGPSGTGKTETALTLADTLFGGEKSLITINLSEYQ
EPHTVSQLKGSPPGYVGYGQGGVLTEAVRKRPYSVVLLDEVEKAHRDVMN
LFYQVFDRGFMRDGEGREIDFRNTVILMTANLGSDHIMQLLEEKPDATDA
DLHELLYPLLRDHFQPALMARFQTVIYRPLGQEAMRAIVEMKLAQVARRL
HQHYGLETEISNSLYDALTAACLLPDTGARNIDSLLNQQILPVLSQQLLA
QQAVHHKPARLRLDWDDEDGIVLEFDEK
>ECP_1573 electron transport complex protein RnfB
MNAIWIAVAAVSLLGLAFGAILGYASRRFAVEDDPVVEKIDEILPQSQCG
QCGYPGCRPYAETISCNGEKINRCAPGGEAVMLKIAELLNVEPQPLDGEA
QELTPARMVAVIDENNCIGCTKCIQACPVDAIVGATRAMHTVMSDLCTGC
NLCVDPCPTHCISLQPVAETPDSWKWDLNTIPVRIIPVEHHA
>ECP_2786 L-fuculokinase
MLSGYIAGAIMKQEVILVLDCGATNVRAIAVNRQGKIVARASTPNASDIA
MENNTWHQWSLDAILQRFADCCRQINSELTDCHIRGIAVTTFGVDGALVD
KQGNLLYPIISWKCPRTAAVMDNIERLISAQQLQAISGVGAFSFNTLYKL
VWLKENHPQLLERAHAWLFISSLINHRLTGEFTTDITMAGTSQMLDIQQR
DFSPQILQATGIPRRLFPRLVEAGEQIGTLQNSAAAMLGLPVGIPVISAG
HDTQFALFGAGAEQNEPVLSSGTWEILMVRSAQVDTSLLSQYAGSTCELD
SQAGLYNPGMQWLASGVLEWVRKLFWTAETPWQILIEEARLIAPGSDGVK
MQCDLLSCQNAGWQGVTLNTTRGHFYRAALEGLTTQLQRNLQMLEKIGHF
KASELLLVGGGSRNTLWNQIKANMLDIPVKVLDDAETTVAGAALFGWYGV
GEFNSPEEARAQIHYQYRYFYPQTEPEFIEEV
>ECP_2482 probable aminoglycoside efflux pump
MANFFIDRPIFAWVLAILLCLTGTLAIFSLPVEQYPDLAPPNVRVTANYP
GASAQTLENTVTQVIEQNMTGLDNLMYMSSQSSGTGQASVTLSFKAGTDP
DEAVQQVQNQLQSAMRKLPQAVQNQGVTVRKTGDTNILTIAFVSTDGSMD
KQDIADYVASNIQDPLSRVNGVGDIDAYGSQYSMRIWLDPAKLNSFQMTA
KDVTDAIESQNAQIAVGQLGGTPSVDKQALNATINAQSLLQTPEQFRDIT
LRVNQDGSEVRLGDVATVEMGAEKYDYLSRFNGKPASGLGVKLASGANEM
ATAELVLNRLDELAQYFPHGLEYKVAYETTSFVKASIEDVVKTLLEAIAL
VFLVMYLFLQNFRATLIPTIAVPVVLMGTFSVLYAFGYSVNTLTMFAMVL
AIGLLVDDAIVVVENVERIMSEEGLTPREATRKSMGQIQGALVGIAMVLS
AVFVPMAFFGGTTGAIYRQFSITIIAAMVLSVLVAMILTPALCATLLKPL
KKGEHHGQKGFFAWFNQMFNRNAERYEKGVAKILHRSLRWIVIYVLLLGG
MVFLFLRLPTSFLPLEDRGMFTTSVQLPSGSTQQQTLKVVEQIEKYYFTH
EKDNIMSVFATVGSGPGGNGQNVARMFIRLKDWSVRDSKTGTSFAIIERA
TKAFNKIKEARVIASSPPAISGLGSSAGFDMELQDHAGAGHDALMAARNQ
LLALAAENPELTRVRHNGLDDSPQLQIDIDQRKAQALGVAIDDINDTLQT
AWGSSYVNDFMDRGRVKKVYVQAAAPYRMLPDDINLWYVRNKDGGMVPFS
AFATSRWETGSPRLERYNGYSAVEIVGEAAPGVSTGTAMDIMESLVKQLP
NGFGLEWTAMSYQERLSGAQAPALYAISLLVVFLCLAALYESWSVPFSVM
LVVPLGVIGALLATWMRGLENDVYFQVGLLTVIGLSAKNAILIVEFANEM
NQKGHDLFEATLHACRQRLRPILMTSLAFIFGVLPMATSTGAGSGGQHAV
GTGVMGGMISATILAIYFVPLFFVLVRRRFPLKPRPE
>ECP_0149 outer membrane usher protein HtrE precursor
MTIKSTNHLTHIATFCALLYSNSALCAELVEYDHTFLMGKDASNIDLSRY
TEGNPTLPGIYDVSVYVNDQPIMSQSIAFAVIEGKKNAQACITQKNLLQF
HISSPDKNSEKAILLKRDEDLGDCLNLAEMIPQSSIRYDVNDQRLDIDVP
QAWIMKNYQNYVDPSLWENGINAAMLSYNLNGYHSESPGRTNDSIYAAFN
GGINLGAWRLRASGNYNWMTNVHSDYDFQNRYLQRDLASLRSQLVIGESY
TTGETFDSVSIRGIRLYSDSRMLPPVLASFAPIIHGVANTNAKVTVMQNG
YKIYETTVPPGAFAIDDLSPSGYGSDLIVTIEEADGTKRTFSQPFSSVVQ
MLRPGVGRWDISAGQVLKDSIQDEPNLFQASYYYGLNNYLTGYTGIQLTD
NNYTAGLLGLGMNTPVGAFSVDVTHSNVSIPDDKTYQGQSYRISWNKLFE
NTSTSLNIAAYRYSTQHYLGLNDALTLIDEVEHPEQELEPKSMRNYSRMK
NQVTVSINQPLKFEKKDYGSFYLSGSWSDYWASGQNSTNYSIGYSNSASW
GSYSISAQRSLNEDGQTDDSIYLSFTIPIENLLGTEHRSSGFQSIDTQLN
SDFKGNNQLNISSSGYSDTNRISYSVNTGYMMNKSSDDLSYIGGYASYES
PWGTLSGSASASSDNSRQFSLNTDGGFVLHSGGLTFSNDSFSDSDTLAVI
QAPGAKGARINYGNSTVDRWGYGVTSALSPYHENRIALDINDLENDVELK
STSTVAVPRQGAVVFADFETVQGQSAIMNIVRSDGKNIPFAADIYDEQNN
IIGNVGQGGQAFVRGIGQEGNIRITWIEEGKPVSCFAHYQQNTTSEKIAQ
SIILNGLRCQIQ
>ECP_2516 probable GTP-binding protein EngA
MVPVVALVGRPNVGKSTLFNRLTRTRDALVADFPGLTRDRKYGRAEIEGR
EFICIDTGGIDGTEDGVETRMAEQSLLAIEEADVVLFMVDARAGLMPADE
AIAKHLRSREKPTFLVANKTDGLDPDQAVVDFYALGLGEIYPIAASHGRG
VLSLLEHVLLPWMEDLAPQEEVDEDAEYWAQFEAEENGEEEEEDDFDPQS
LPIKLAIVGRPNVGKSTLTNRILGEERVVVYDMPGTTRDSIYIPMERDGR
EYVLIDTAGVRKRGKITDAVEKFSVIKTLQAIEDANVVMLVIDAREGISD
QDLSLLGFILNSGRSLVIVVNKWDGLSQEVKEQVKETLDFRLGFIDFARV
HFISALHGSGVGNLFESVREAYDSSTRRVGTSMLTRIMTMAVEDHQPPLV
RGRRVKLKYAHAGGYNPPIVVIHGNQVKDLPDSYKRYLMNYFRKSLDVMG
SPIRIQFKEGENPYANKRNTLTPTQMRKRKRLMKHIKKSK
>ECP_2678 putative HTH-type transcriptional regulator YgjM
MTANAVRAVKATRELVNAVPFLGGSDSEDDYREALELVEYLIEEDDTNPL
IDFLASRIAEYENNNEKFAEFDKAVAAMPVGVALLRTLIDQHNLTYADLK
NEIGSKSLVSQILSGQRSLTISHIKALSARFGVKPEWFL
>ECP_3042 hypothetical type II secretion protein GspH
MPERGFTLLEIMLVIFLIGLASAGVVQTFATDSESPAKKAAQDFLTRFAQ
FKDRAVIEGQTLGVLIDPPGYQFMQRRQGQWLPVSATRLSAQVTVPKQVQ
MLLQPGSDIWQKEYALELQRRRLTLHDIELELQKEAKKKTPQIRFSPFEP
ATPFTLRFYSAAQNACWAVKLAHDGALSLNQCDERMP
>ECP_2513 putative outer membrane protein (SinI-like protein)
MKQDKRRGLTRIALALALAGYCVAPVALAEDSAWVDSGETNIFQGTIPWL
YSEGGSATTDADRVTLTSDLKGARPQGSETDKRLYSGDKLTVSWEIGDTE
GDVDLGGLGDNAKTIDTIRWMSYKDAQGGDPKELATKVTSYTLTDADRGR
YIGIEITPTTQTGTPNVGTALHLYDVSTASGGGSDSDNVAPGPVVNQNLK
VAIFVDGTSINLINGSTPIELGKTYVAKLYSDENKNGKFDAGTDADVTAN
YDFRWVLSGSSQQLGTSGGIVNSSFDNNNLVIPATNDEARTNLNGPARDG
KEALSIPTNGDGVQGYKLHIIYKHK
>ECP_0251 putative acyl-CoA dehydrogenase
MMILSILATVVLLGALFYHRVSLFISSLILLAWTAVLGVAGLWSAWVLVP
LAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEG
DLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELAD
LPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAIT
VGVPNSLGPGELLQHYGTDEQKNHYLPRLARGQEIPCFALTSPEAGSDAG
AIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEK
LLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPID
YIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIR
RQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAI
VKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGAN
ILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNK
VRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGS
LKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALY
QAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQV
PNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNL
PFTRLDELAHNALAKVLIDKDEAAILVKAEESRLRSINVDDFDPEELATK
PVKLPEKVRKVEAA
>ECP_2041 hypothetical protein
MQLPVKLLMSLISLVSVIARAGKYKNYIRDEIKYWRYTSYKGGEFPEGFT
DEKFSSAIYNGRIFTMKRLHTLMLFLAVLFTGFNVEAASVKQALSCDPNA
RAEQPGACPTTYELYEGDAAYKAALDKALKPVGLSGMFGKGGYMDGPGGN
VTPVTINGTVWLQGDGCKANTCGWDFIVTLYNPKTHEVVGYRYFGLDDPA
YLVWFGEIGVHEFAYLVKNYVAAVN
>ECP_1975 putative acyl-CoA dehydrogenase
MCTENYELAQQEAVLFAKQHLALAAQNIERQQFIVPDIISCVAQAGYLGA
SIPQKYGGRGYDSYQLCALHEVMAGVHGSLENLITVTGMVSTLLQRVGSA
AQKAHYLPKLATGELIGAIALTEPNIGSDLVNVETELQQDGDGWRLNGKK
KWITLGQIADFFIVLIHCGNQLATVLIDRNTDGFTITPLNDMLGLRGNML
AELHFNDCRLKEDALLGPLTPGVPLAVNFALNEGRFTTACGSLGLCRAAV
DVAARYIRQRKQFKRRLFSHGIVQHLFATMLTQTRSAQLMCFSAAEYRET
LHPAMINQILMAKYVASKAAVDVAGKAVQLLGANGCHADYAVERYYRDAK
IMEIIEGTSQIHEIQIAMNYMMGSEA
>ECP_3047 hypothetical type II secretion protein GspC
MARVIFRDARIYLIQWLTKIRHTLNQRQSLNTDKEHLRKIARGMFWLMLL
IISAKVAHSLWRYFSFSAEYTAVSPSANKPLRADAKAFDKNDVQLISQQN
WFGKYQPVATPVKQPEPAPVAETRLNVVLRGIAFGARPGAVIEEGGKQQV
YLQGETLGSHNAVIEEINRDHVMLRYQGKIERLSLAEEERSTVAVTNKKA
VSDEAKQAVAEPAVSAPVEIPAAVRQALAKDPQKIFNYIQLTPVRKEGIV
GYAVKPGADRSLFDASGFKEGDIAIALNQQDFTDPRAMIALMRQLPSMDS
IQLTVLRKGARYDISIALR
>ECP_3463 hypothetical protein YhfU
MKKIGVAGLQREQIKKTIEATAPGCFEVFIHNDMEAAMKVKSGQLDYYIG
ACNTGAGAALSIAIAVIGYNKSCTIAKPGIKAKDEHIAKMVAEGKVAFGL
SVEHVEHAIPMLINHLK
>ECP_3564 IS putative transposase
MKHSFEIKLAAVNHYLAGHAGIISTAKLFQLSHTSLSHWINLFLLHGPRA
LDCRHKRSYSPEDKLCVVLYALGHSESLPRVAARFNIPSHNTVKNWIKGY
RKSGNEAFIRRRKEKSMTRSDDTHENEANMTPEEMKNELRYLRAENAYLK
AMQEHLLEKKRQELEKKRKSSRA
>ECP_2551 hypothetical protein YphH
MRACINNQQIRHHNKCVILELLYRQKRANKSTLARLAQISIPAVSNILQE
LESEKRVVNIDDESQTRGHSSGTWLIAPEGDWTLCLNVTPTSIECQVANA
CLSPKGEFERFQIDAPTPQALLSEIEKCWHRHRKLWPDRTINLALAIHGQ
VDPVTGVSQTMPQAPWATPIEVKYLLEEKLGIRVMVDNDCVMLALAEKWQ
NNSQVRDFCVINVDYGIGSSFVINEQIYRGSLYGSGQIGHTIVNPDGVVC
DCGHYGCLETVASLSALKKQARVWLKSQPVNTQLDPEKLTTAQLIAAWQS
GEPWITSWVDHSANAIGLSLYNFLNILNINQIWLYGRSCAFGENWLNTII
RQTGFNPFDRDEGPSVKATQIGFGQLSRAQQVLGIGYLYVEAQLRQI
>ECP_2796 membrane-bound lytic murein transglycosylase A precursor
MRTILCSRFHATFFLPRSRIWHDFEQKQITKAYFLCKICVRERLLISLNA
VVCYLRCALFFNLKKRTMKGRWVKYLLMGTVVAMLAACSSKPTDRGQQYK
DGKFTQPFSLVNQPDAVGAPINAGDFAEQINHIRNSSPRLYGNQSNVYNA
VQEWLRAGGDTRNMRQFGIDAWQMEGVDNYGNVQFTGYYTPVIQARHTRQ
GEFQYPIYRMPPKRGRLPSRAEIYAGALSDKYILAYSNSLMDNFIMDVQG
SGYIDFGDGSPLNFFSYAGKNGHAYRSIGKVLIDRGEVKKEDMSMQAIRH
WGETHSEAEVRELLEQNPSFVFFKPQSFAPVKGASAVPLVGRASVASDRS
IIPPGTTLLAEVPLLDNNGKFNGQYELRLMVALDVGGAIKGQHFDIYQGI
GPEAGHRAGWYNHYGRVWVLKTAPGAGNVFSG
>ECP_2000 hypothetical protein
MDLLPFLLDANLSATNPPAIPHWWKRQPLIPNLLSQELKNYLKLNAKEKN
VQIADQVIIDESAGEVVIGANTRICHGAVIQGPVVIGANCLIGNYAFIRP
GTIISNGVKIGFATEIKNAVIEAEATIGPQCFIADSVVANQAYLGAQVRT
SNHRLDEQPVSVRTPEGIIATGCDKLGCYIGKRSRLGVQVIILPGRIISP
NTQLGPRVIVERNLPSGTYSLRQELIRTGD
>ECP_2718 hypothetical protein
MSTITLLCIALAGVIMLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEV
GKVMIAGMGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFSRKLGDKR
TIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGI
MLTVHVAVPPHPGPVAAAGLLHADIGWLTIIGIAISIPVGIVGYFCSENN
Q
>ECP_3104 putative C4-dicarboxylate transport system
MRTLTHILNKILAGCCCIILAIMVFCVTWQVIARFIFNAPSTVLDEFTQI
LFMWMILLGGVYTAGLKKHLAIDLLAQKLPAASVLTLDSFIQIIITVFAV
IFMIYGGNIVVEKAQHVGQISPVLKWPMDKVYWVMPASGLILVWYSVMNI
IDNYRKWNSH
>ECP_1410 putative autotransporter/adhesin
MQRKTLLSACIALALSGQGWAADITEIETTTGEKKNTNVTCPADPGKLSP
EELKRLPSECSSVVEQNLMPWLVTGAATALITTLAIVELNDDDDHHRNNS
PLPPTPPDDDSDDTPVPPTPGGDEIIPDDGPDDTPAPPKPIAFNNDVTLD
KTAKTLTIRDSVFSYTENADGTISLQDSNGRKATINLWQIDETNNTVALE
GMSADGATKWQYNHNGELVITGDNTTVNNTGKTIVDGKGATGTEIAGNNA
VVNQDGELDVSGGGHGIDITGDSATVDNKGGMTVTDPDSIGIQIDGDKAV
VNNDGDNAISNGGTGTQVNGDEATVNNNGSTTVDGKDSTGTEINGDKAIV
NNDGDSTILDGGTETRITGDDATANNSGNTTVDGQGSTGTEIAGNNAVVN
QDGELDVSGGGHGIDITGDSATVDNKGGMTVIDPDSIGIQIDGDKAVVNN
DGDNAISNGGTGTQINGDEATVNNNGSTTVDGKDSTGTEINGDKAIVNND
GDSTILDGGTGTRITGDDATANNSGNTTVDGQGSTGTEIAGNNAVVNQDG
ELDVSGGGHGIDITGDSATVDNKGGMTVTDPDSIGIQIDGDKAVVNNDGD
NAISNGGTGTQVNGDEATVNNNGNTTVDGKDSTGTEINGDKAIVNNDGDS
TILDGGTGTRITGDDATANNSGNTTVDGQGSTGTEIAGNNAVVNQDGELD
VSGGGHGIDITGDSATVDNKGGMTVADADSIGIQIDGDKAVVNNDGDNAI
SNGGTGTQVNGDEAIVNNNGNTTVDGKDSTGTEINGDKAIVNNDGDSTIL
DGGTGTRITGDDATANNSGNTTVDGQGSTGTEIAGNNAVVNQDGELDVSG
GGHGIDITGDSATVDNKGGMTVADADSIGIQIDGDKAVVNNDGDSTILDG
GTGTRITGDDATANNSGNTTVDGQGSTGTEIAGNNAVVNQDGKLDVSGGG
HGIDITGDSATVDNKGGMTVADADSIGIQIDGDKAVVNNDGDNAISNGGT
GTQVNGDEATVNNNGNTTVDGKDSTGTEIAGNNATVTQEGELTVSSGGRG
IDITGNNAKVDTKGKMTITGTDSVGVSINGDSATLTNTGDIDVSNSATGF
SLVTNEGIISLAGSMKVGDFSTGMALSGDNNSVTLAAKDINVTGQKATGV
NISGDSNTVDITGNILVDKDQTADNAVDYFYDPSVGVNISGNSNTVSLDG
KLTVIADSELTSRKYMEFDGSQENISGLTVSGDGNTVNLNGGIQFVGEKN
ALADGSTIADKRSYFGKTPLVSVDGQSKVYLNGDSTISGSLPLGYANILQ
LSNKAALEIGSDATFSMQDISVYEHYFTQTPQIIKVDTGSQVVNNGDVDI
WNISFAGIWGENSTGINNGNITLSQYDYSSPETSFSEPDHMAFLSSSGGS
AVNNGTITAKVMEQHSVLNMGSAAGVADPRVFNNSVSSMMGMEAYGKGTV
LNSESGVIDMHGRGNIGMLAVDDSAADNAGKITLDTLWVDQNDTTTLRTD
LPSSTAIDYGVGMATGTNSGGGARSNGVATNQQGGVITVYNAGAAMAAYG
ASNMVINQGIINLEKNGNYDGGLGANMLVGMAVYNRGTAINDKTGVININ
VDTGQAFYNDGTGTILNYGEINLLGSPMDSADSHMGAIPENLDLLTALTG
SGETDMRTASSGGFVTTKALANYGNETLNSNVAAKAWLYNQDKANLTING
ELSIGQGLENSGLLDSDTISAAANVYNRASGSIITDQLSLTGSNSFFNEG
NFSGSVAGSSYKQNVVNTGTMAVMADGKSLISGSFLLYNEAGATLSNSSS
AVSGGENAIVNVTRTGDSLAQVNRGTITAINGYSAIKTASTGSNSNGKWI
WNTDTGVISGVNPNAPLIDLGRGYNFANAGTINVQGDGAVAISGGTTSYT
VQLVNSGTINVGAAQGKADGTNGTGLIGIKGNGSDTTINNAQSGVINVYA
DNSWAFGGKTKAIINNGEINLLCDTGCDIYAPGTTGTLNDHNSTTDIIVP
AATSTPTQGSVPTVPADSSAQQKLTNYTIGTNSDGTSGMLKANNLVISDN
VKVNTGFSAGTADTTVVINDVFKGENISGAENISSSTVMWNAQGSTDASG
NVDVTMTKNAYTDVVTDSSVNNVAQVLDSGYTNNDLYTSLNVGTTAELNS
ALKQISGSQATTVFNEARVLSNRFSMLSDAAPEVANGLAFNVVAKGDPRA
ELGNDTQYDMMALRKSLTLTEHQNLSLEYGIARLEGNGSDTAGDNGVTGG
YSQFFGLKHQMAFDNGMSWNNALRYDVHNLDSSRSIAYGDVNKTADANVK
QQYLEFRSEGAKTFELREGLNVTPYAGVKLRHTLENGYQERNAGDFNLSM
NSGSETAVDSIVGLKLDYAGKEGWSANATLEGGPNLSYVKSQRTASISGA
GSQRFNIDDGQSGGGFNSLATMGVKYSSQESALQLDAFHWKEDGISDKGV
MLNFKKTF
>ECP_2116 putative resistance protein
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSAS
LPGASPETMASSVATPLERSLGRIAGVSEMTSSSSLGSTRIILQFDFDRD
INGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYS
QGELYDFASTQLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVS
LDDVRTAISNANVRKPQGALEDDTHRWQIQTNDELKTAAEYQPLIIHYNN
GGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSI
RARLPELQSTIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVF
LFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLL
LMGGLPGRLLREFAVTLSVAIGISLLVSLTLTPMMCGWMLKASKPREQKR
LRGFGRMLVALQQGYGKSLKWVLNHTRLVGAVLLGTIALNIWLYISIPKT
FFPEQDTGVLMGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTG
GSRVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGANLFLMAVQDI
RVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQEDN
GAEMNLIYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVM
EVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMN
SQVILIIAAIATVYIVLGILYESYVHPLTILSTLPSAGVGALLALELFNA
PFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLR
FRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYT
TPVVYLFFDRLRLRFSRKPKQAVTE
>ECP_1103 putative TetR-family regulatory protein
MATDSTQCVKKSRGRPKVFDRDAALDKAMKLFWQHGYEATSLADLVEATG
AKAPTLYAEFTNKEGLFRAVLDRYIDRFAAKHEAQLFCEEKSVESALADY
FAAIANCFTSKDTPAGCFMINNCTTLSPDSGDIANTLKSRHAMQERTLQQ
FLCQRQARGEIPTHCDVTHLAEFLNCIIQGMSISAREGASLEKLMQIART
TLRLWPELLK
>ECP_1942 yersiniabactin biosynthetic protein
MISGAPSQDSLLPDNRHAADYQQLRERLIQELNLTPQQLHEESNLIQAGL
DSIRLMRWLHWFRKNGYRLTLRELYAAPTLAAWNQLMLSRSPENAEEETP
PDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCL
TASQLEQAITTLLQRHPMLHIAFRPDGQQVWLPQPYWNGVTVHDLRHNDA
ESRQAYLDALRQRLSHRLLRVEIGETFDFQLTLLPDNRHRLHVNIDLLIM
DASSFTLFFDELNALLAGESLPAIDTRYDFRSYLLHQQKINQPLRDDARA
YWLAKASTLPPAPVLPLACEPATLCEVRNTRRRMIVPATRWHAFSNRAGE
YGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLHPAVGAMLADFT
NILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYP
HGAPVVFTSNLGRSLYSSRAESPLGEPEWGISQTPQVWIDHLAFEHHGEV
WLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKPFADMMPASQR
AIRERVNATGAPIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYHELTDY
ARRCAGRLIECGVQPGDNVAITMSKGAGQLVAVLAVLLAGAVYVPVSLDQ
PAARREKIYADASVRLVLICQHDASAGSDDIPALAWQQAIEAEPIANPVV
RAPTQPAYIIYTSGSTGTPKGVVISHRGALNTCCDINTRYQVGPHDRVLA
LSALHFDLSVYDIFGVLRAGGALVMVMENQRRDPHAWCELIQRHQVTLWN
SVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQ
FIAMGGATEASIWSNACEIHDVPAHWRSIPYGFPLTNQRYRVVDEQGRDC
PDWVPGELWIGGIGVAEGYFNDPLRSEQQFLTLPDERWYRTGDLGCYWPD
GTIEFLGRRDKQVKVGGYRIELGEIESALSQLAGVKQATVLAIGEKEKTL
AAYVVPQGEAFCVTDHRNPALPQAWHTLAGTLPCCAISPEISAEQVADFL
QHRLLKLKPGHTAGADPLPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPA
AEGYQVCAGEEREDEHPHFSGHDLTLSQILRGARNELSLLNDAQWSPESL
AFNHPASAPYIQELATICQQLAQRLQRPVRLLEVGTRTGRAAESLLAQLN
AGQIEYVGLEQSQEMLLSARQRLAPWPGARLSLWNADTLAAHAHSADIIW
LNNALHRLLPEDPGLLATLQQLAVPGALLYVMEFRQLTPSALLSTLLLTN
GQPEALLHNSADWAALFSAAGFNCQHGDEVAGLQRFLVQCPDRQVRRDPR
QLQAALAGRLPGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPEAENPA
EADLPQGDIEKQVAALWQQLLSTGNVTRETDFFQQGGDSLLATRLTGQLH
QAGYEAQLSDLFNHPRLADFAATLRKTDVPVEQPFVHSPEDRYQPFALTD
VQQAYLVGRQPGFALGGVGSHFFVEFEIADLDLTRLETVWNRLIARHDML
RAVVRDGQQQVLEQTPRWVIPAHILHTPEEALQVREKLAHQVLNPEVWPV
FDLQVGYVDGMPARLWLCLDNLLLDGLSMQILLAELEHGYQYPQQLPPPL
PVTYRDYLQQPAIQSLNADSLAWWQAQLDDIPPAPALPLRCMPQDVETPR
FARLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPEFTL
NLTLFDRRPLHPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQRLSQNL
NHRDVSAIRVMRQLAQRQNVSAVPMPVVFTSALGFEQDNFLARRNLLKPV
WGISQTPQVWLDHQVYESEGELRFNWDFVAALFPAGQVERQFEQYCTLLN
RMTEDENSWHLPLAALVPPVKQAEQGTERTSRVCPEHSQSHIAADESTVS
LICDAFREVVGESVTPAQNFFEAGATSLNLVQLHVLLQRHEFSTLTLLDL
FTHPSPAALAAYLTSVATVEKTKRSRPVRRRQRRI
>ECP_2557 hypothetical protein YfhG
MRHIFQRLLPRRLWLAGLPCLALLGCVQSHNKPAIDTPAEEKIPVYQLAD
YLSTECSDIWALQGKSTETNPLYWLRAMDCADRLMPAQSRQQARQYDDGN
WQNTFKQGILLADAKITPYERRQLVARIDALSTEIPAQVRPLYQLWRDGQ
ALQLQLAEERQRYSKLQQSSDSELDTLRQQHHVLQQQLELTARKLENLTD
IERQLSTRKPAGNFSPDTPHESEKPAPSTDEVTPDEP
>ECP_3856 putative radC-like protein
MQQLSFLPGEMTPGERSLILRALKTLDRHLHEPGVAFTSTRAAREWLILN
MAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRALYHNA
AAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGNQVF
SFAEHGLL
>ECP_1176 minor tail protein U
MKHTELRAAVLDALEKHDTGATLFDGRPAVFDEADFPAVAVYLTGAEYTG
EELDSDTWQAELHIEVFLPAQVPDSELDSWMEFRIYPVMSDIPALSDLIT
SMVASGYDYRRDDDAGLWSSADLTYVITYEM
>ECP_3341 biotin carboxylase
MLDKIVIANRGEIALRILRACKELGIKTVAVHSSADRDLKHVLLADETVC
IGPAPSVKSYLNIPAIISAAEITGAVAIHPGYGFLSENANFAEQVERSGF
IFIGPKAETIRLMGDKVSAIAAMKKAGVPCVPGSDGPLGDDMDKNRAIAK
RIGYPVIIKASGGGGGRGMRVVRSDAELAQSISMTRAEAKAAFSNDMVYM
EKYLENPRHVEIQVLADGQGNAIYLAERDCSMQRRHQKVVEEAPAPGITP
ELRRYIGERCAKACVDIGYRGAGTFEFLFENGEFYFIEMNTRIQVEHPVT
EMITGVDLIKEQLRIAAGQPLSIKQEEVHVRGHAVECRINAEDPNTFLPS
PGKITRFHAPGGFGVRWESHIYAGYTVPPYYDSMIGKLICYGENRDVAIA
RMKNALQELIIDGIKTNVDLQIRIMNDENFQHGGTNIHYLEKKLGLQEK
>ECP_3073 glutathionylspermidine synthetase/amidase
MSKGTTSQDAPFGTLLGYAPGGVAIYSSDYSSLDPQEYEDDAVFRSYIDD
EYMGHKWQCVEFARRFLFLNYGVVFTDVGMAWEIFSLRFLREVINDNILP
LQAFPNGSPRAPVAGALLIWDKGGEFKDTGHVAIITQLHGNKVRIAEQNV
IHSPLPQGQQWTRELEMVVENGGYTLKDTFDDTTILGWMIQTEDTEYSLP
QPEIAGELLKISGARLENKGQFDGKWLDEKDPLQNAYAQANGQVINQDPY
HYYTITESAEQELIKATNELHLMYLHATDKVLKDDNLLALFDIPKILWPR
LRLSWQRRRHHMITGRMDFCMDERGLKVYEYNADSASCHTEAGLILERWA
EQGYKGNGFNPAEGLINELAGAWKHSRARPFVHIMQDKDIEENYHAQFME
QALHQAGFETRILRGLDELGWDAAGQLIDGEGRLVNCVWKTWAWETAFDQ
IREVSDREFAAVPIRTGHPQNEVRLIDVLLRPEVLVFEPLWTVIPGNKAI
LPILWSLFPHHRYLLDTDFSVNDELVKTGYAVKPIAGRCGSNIDLVSHHE
EVLDKTSGKFAEQKNIYQQLWCLPKVDGKYIQVCTFTVGGNYGGTCLRGD
ESLVIKKESDIEPLIVVKK
>ECP_1489 glutamate decarboxylase beta
MDKKQVTDLRSELLDSRFGAKSISTIAESKRFPLHEMRDDVAFQIINDEL
YLDGNARQNLATFCQTWDDDNVHKLMDLSINKNWIDKEEYPQSAAIDLRC
VNMVADLWHAPAPKNGQAVGTNTIGSSEACMLGGMAMKWRWRKRMEAAGK
PTDKPNLVCGPVQICWHKFARYWDVELREIPMRPGQLFMDPKRMIEACDE
NTIGVVPTFGVTYTGNYEFPQPLHDALDKFQADTGIDIDMHIDAASGGFL
APFVAPDIVWDFRLPRVKSISASGHKFGLAPLGCGWVIWRDEEALPQELV
FNVDYLGGQIGTFAINFSRPAGQVIAQYYEFLRLGREGYTKVQNASYQVA
AYLADEIAKLGPYEFICTGRPDEGIPAVCFKLKDGEDPGYTLYDLSERLR
LRGWQVPAFTLGGEATDIVVMRIMCRRGFEMDFAELLLEDYKASLKYLSD
HPKLQGIAQQNSFKHT
>ECP_1213 hypothetical protein
MSRRYSLATITLVILFCGRPHCRKISVRQYYQKRVIFYPTTLQRTAFKKY
LFLIRDIIISS
>ECP_0248 hypothetical protein
MLKMSLYVIILLFSLQFSAAITGKESEVVSPLLMDVNPSLTMENISELST
SSEPSQQGVFPVICTRLHPGSVMKRQLLTGWGPVFIIGDDPFSLRWMSEH
LEILKSLNALGLVVNVESVERMEVLQQRADGLLLLPVICDNFVQALQLNA
YPVLITEMEISQ
>ECP_3105 putative C4-dicarboxylate transport system
MDIEYIYPVLILFGSFVIMLAIGVPITFAIGLSSLLSIITALPPDAAISV
ISQKMTVGLDGFTLLAIPFFVLAGNIMNTGGIARRLVNLAQALVGRLPGS
LAHCNILANTLFGAISGSAVASAAAVGGIMSPLQEKEGYDPAFSAAVNIA
SAPIGLMIPPSNVLIVYSLASGGTSVAALFLAGYLPGILTAAALMFVAAL
YARRNHYPVAERINFHQFLQVFRESIPSLMLIFIIIGGIIAGVFTPTEAS
AIAVIYSLVLAMIYREITIKKLNDILLDSVVTSSIVLLLVGCSMGMSWAM
TNADVPELINELITRVSDNKWVILFIINIILLIVGTFMDITPAILIFTPI
FLPIAQHLGIDPIHFGIIMVFNLTIGLCTPPVGTILFVGCSIGKVSIDRA
IKPLLPMFLALFVVMAIICYFPQLSLMLPGLFST
>ECP_0006 putative periplasmic protein YaaX
MQSIVLALSLVLVAPMAAQAAEITLVPSVKLQIGDRDNRGYYWDGGHWRD
HGWWKQHYEWRGNRWHPHGPPSSPRHNKHNDHRGDHRPGPDKHHR
>ECP_2001 hypothetical protein YaiO
MIKHTLLVPFFFSALPAYAGLTSITAGYDFTDYSGEHGNRNLAYAELVAK
VENATLLFNLSQGRRDYETEHFNATRGQGAVWYKWNNWLTTRTGIAFADN
TPVFARQDFRQDINLALLPKTLFTTGYRYTKYYDDVEVNAWQGGVSLYTG
PVITSYRYTHYDSSDAGSSYSNMISVRLKDPRGAGYTQLWLSRGTGAYTY
DWTPETRYGSMKSISLQRIQPLTEQLNLGLTAGKVWFNTPTDDYNGLQLA
AHLIWKF
>ECP_2422 glutamyl-tRNA synthetase
MKIKTRFAPSPTGYLHVGGARTALYSWLFARNHGGEFVLRIEDTDLERST
PEAIEAIMDGMNWLNLEWDEGPYYQTKRFDRYNAVIDQMLEEGTAYKCYC
SKERLEALREEQMAKGEKPRYDGRCRHSHEHHADDEPCVVRFANPQEGSV
VFDDQIRGPIEFSNQELDDLIIRRTDGSPTYNFCVVVDDWDMEITHVIRG
EDHINNTPRQINILKALKAPVPVYAHVSMINGDDGKKLSKRHGAVSVMQY
RDDGYLPEALLNYLVRLGWSHGDQEIFTREEMIKYFALNAVSKSASAFNT
DKLLWLNHHYINALPPEYVATHLQWHIEQENIDTRNGPQLADLVKLLGER
CKTLKEMAQSCRYFYEDFAEFDADAAKKHLRPVARQPLEVVRDKLTAITD
WTAENVHHAIQATADELEVGMGKVGMPLRVAVTGAGQSPALDVTVHAIGK
TRSIERINKALAFIAERENQQ
>ECP_2800 putative cytoplasmic protein
MADSFQNEVPAARVNIKLDLHTGNAKKKVELPLKLLAVGDYSNGKEQRPL
SERDKVDINKNNFNSVMAEFSPAVNLTVEDTLNGNGNEQNFALEFKSLKD
FEPEQVAKNIPQLRVLLAMRNLLRDLKSNLLDNATFRRELENILKDPTLS
SELRDELAKIAPQENV
>ECP_3449 hypothetical protein YhfK
MWRRLIYHPDINYALRQTLVLCLPVAVGLMLGELRFGLLFSLVPACCNIA
GLDTPHKRFFKRLIIGASLFATCSLLTQVLLAKDVPLPFLLTGLTLVLGV
TAELGPLHAKLLPASLLAAIFTLSLAGYMPVWEPLLIYALGTLWYGLFNW
FWFWIWREQPLRESLSLLYRELADYCEAKYSLLTQHTDPEKALPPLLVRQ
QKAVDLITQCYQQMHMLSAQNNTDYKRMLRIFQEALDLQEHISVSLHQPE
EVQKLVERSHAEEVIRWNAQTVAARLRVLADDILYHRLPTRFTMEKQIGA
LEKIARQHPDNPVGQFCYWHFSRIARVLRTQKPLYARDLLADKQRRMPLL
PALKSYLSLKSPALRNAGRLSVMLSVASLMGTALHLPKSYWILMTVLLVT
QNGYGATRLRIVNRSVGTVVGLIIAGVTLHFKIPEGYTLTLMLITTLASY
LILRKNYGWATVGFTITAVYTLQLLWLNGEQYILPRLIDTIIGCLIAFGG
TVWLWPQWQSGLLRKNAHDALEAYQDAIRLILSEDPQPTPLAWQRMRVNQ
AHNTLYNSLNQAMQEPAFNSHYLADMKLWVTHSQFIVEHINAMTTLAREH
RALPPELAQEYLQSCEIAIQRCQQRLEYDEPGSSGDANIMDAPEMQPHEG
AAGTLEQHLQRVIGHLNTMHTISSMAWRQRPHHGIWLSRKLRDSKA
>ECP_1516 6-phospho-beta-glucosidase
MSGFKEDFLWGGAVAAHQLEGGWNEGGKGISIADVMTAGAHGVPREVTEG
VIDGLNYPNHEAIDFYHRYKTDIQLFAGMGFKCFRTSIAWTRIFPQGDEQ
EPNEEGLQFYDDLFDECLKQGMEPVVTLSHFEMPYHLVTKYGGWRNRKLI
DFFIHFASTVFTRYKEKVKYWMTFNEINNQVNFSESLCPFTNSGILYSPE
EDLNEREQIMYQAVHYELVASALAVQTGKLINPEFNIGCMIAMCPIYPLT
CAPNDMMMATKAMHRRYWFTDVHARGYYPQHMLNYFARKGFNLDITPDDN
AILARGCVDFIGFSYYMSFTTQFSPDNPQLDYVEPRDLVSNPYIDTSEWG
WQIDPAGLRYSLNWFWDHFQLPLFIVENGFGAVDQRQADGTVNDHYRIDY
FSSHIREMKKAVVEDGVDLIGYTPWGCIDLVSAGTGEMKKRYGMIYVDKD
NEGKGTLERIRKASFYWYRDLIANNGENI
>ECP_2325 NADH dehydrogenase I chain C/D
MTDLTAQEPAWQTRDHLDDPVIGELRNRFGPDAFTVQATRTGVPVVWIKR
EQLLEVGDFLKKLPKPYVMLFDLHGMDERLRTHREGLPAADFSVFYHLIS
IDRNRDIMLKVALAENDLHVPTFTKLFPNANWYERETWDLFGITFDGHPN
LRRIMMPQTWKGHPLRKDYPARATEFSPFELTKAKQDLEMEALTFKPEEW
GMKRGTENEDFMFLNLGPNHPSAHGAFRIVLQLDGEEIVDCVPDIGYHHR
GAEKMGERQSWHSYIPYTDRIEYLGGCVNEMPYVLAVEKLAGITVPDRVN
VIRVMLSELFRINSHLLYISTFIQDVGAMTPVFFAFTDRQKIYDLVEAIT
GFRMHPAWFRIGGVAHDLPRGWDRLLREFLDWMPKRLASYEKAALQNTIL
KGRSQGVAAYGAKEALEWGTTGAGLRATGIDFDVRKARPYSGYENFDFEI
PVGGGVSDCYTRVMLKVEELRQSLRILEQCLNNMPEGPFKADHPLTTPPP
KERTLQHIETLITHFLQVSWGPVMPANESFQMIEATKGINSYYLTSDGST
MSYRTRIRTPSYAHLQQIPAAIRGSLVSDLIVYLGSIDFVMSDVDR
>ECP_3004 hypothetical protein
MRSSPQQQGRCLDRLVILHTIYEEKDSKYGGYSGEHPTKFLRKTIEKTIR
FDSFLYAFWARGDDELLHVQRRMHPEAPESSRKAERLRIKPGDSNIRITY
>ECP_0531 DNA polymerase III subunit tau
MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG
KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK
VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV
KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP
RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL
SLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA
ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE
MTLLRALAFHPRMPLPEPEVPRQSFAPVAQTAVMTPTQVPPQPQSAPQQA
PTVPLPETTSQVLAARQQLLRVQGATKAKKSEPAAATRARPVNNAALERL
ASVTDRVQARPVPSALEKAPAKKEAYRWKATTPVMQQKEVVATPKALKKA
LEHEKTPELAAKLAAEAIERDPWAAQVSQLSLPKLVEQVALNAWKEESDN
AVCLHLRSSQRHLNNRGAQQKLAEALSTLKGSTVELTIVEDDNPAVRTPL
EWRQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEESIRPI
>ECP_2273 hypothetical protein YfaA
MSGEKKAKGWRFYGLVGLGAIVLLSTGVWALQYAGSGPEKTLSPLVVHNN
LQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLG
IEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQR
SGLSKLLEPLLFAATSDSQLSKTEISSIKLNSETIPVYQLRYNGNNALMF
ATYQDKMLVFSSTDMLFKDDQQDTEAAAIASDLLSGKKRWETSFGLEERT
AEKTPVRQRIVVSARLLGFGYQRLVPSFAGMRFEMGNDGWHSFLALNDES
ASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGA
LDGGAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAP
EGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLA
MQNKTLLFSLDDTLVNNALQALNKTRPAMVDVIPTDGIVPLYINPQGMAK
LLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGA
AWQWLPITWQPL
>ECP_2545 hypothetical protein YphB (aldose 1-epimerase)
MTIYTLSHGPLKLDVSDQGGVIEGFWRDTTPLLRPGKKSGVATDASCFPL
VPFANRVSGNRFVWQGREYQLQPNVEWDAHYLHGDGWLGEWQCVSRSDDS
LCLVYEHLSGVYHYRVSQAFHLTADTLTVTLAVTNQGAETLPFGTGWHPY
FPLSPQTRIQAQASGYWLEREQWLAGEFCEQLPQELDFNQLAPLPRQWVN
NGFAGWNGKARIEQPQEGYAIIMETTPPAQCYFIFVSDPAFDKGYAFDFF
CLEPMSHAPDDHHRPEGGDLIALAPGESTISEMSLRVALL
>ECP_1101 NADH dehydrogenase
MTTPLKKIVIVGGGAGGLEMATQLGHKLGRKKKAKITLVDRNHSHLWKPL
LHEVATGSLDEGVDALSYLAHARNHGFQFQLGSVIDIDREAKTITIAELR
DEKGELLVPERKIAYDTLVMALGSTSNDFNTPGVKENCIFLDNPHQARRF
HQEMLNLFLKYSANLGANGKVNIAIVGGGATGVELSAELHNAVKQLHSYG
YKGLTNEALNVTLVEAGERILPALPPRISAAAHSELTKLGVRVLTQTMVT
SADEGGLHTKDGEYIEADLMVWAAGIKAPDFLKDIGGLETNRINQLVVEP
TLQTTRDPDIYAIGDCASCPRPEGGFVPPRAQAAHQMATCAMNNILAQMN
GKPLKNYQYKDHGSLVSLSNFSTVGSLMGNLTRGSMMIEGRIARFVYISL
YRMHQIALHGYFKTGLMMLVGSINRVIRPRLKLH
>ECP_0148 hypothetical protein YadM precursor
MIKITPHKITILMGLLLSPSVFATDVNVDFTATVKATTCNITLTGTNVTD
NGNDKYTLVIPSMGMDKIANKTAQSEANFKLVANGCSSGISWIDTTLTGN
QSGSSPALIIPLASDTTSTTSYIGMGFKRKATSGDTFLKPNSAEYIRWSA
SEISTDGLEMTVALRETSVGKGVPGKFRALATFNFSYQ
>ECP_4281 two component response regulator
MNNKSSEITIVYIEDSDDVRFACEQTLTLAGYRVISCCDAEHSISLIQSQ
ANIIILTDVRLPGISGLELLSYINEMDSKIPVILITGHGDVEMAVDAMRN
GAFDFIEKPSSSDKLLSIIARAVEKRRLVLENQQLLANLQQENGPVLIGR
SPQMQQLRKMILNVADTGADVLIYGETGCGKEVVARMLHHWSTRRQGQFV
ALNCAGLPETLFESEIFGHEAGAFTGAVKKRIGKIEHANGGTLFLDEIEG
MPSGMQVKLLRVLQERTIERLGANQLIPVNCRVIAATKEDLLRRSEEHLF
RLDLYYRLNVVSLNIPPLRQRREDIPELFYWFASQAAQKYNRPLPDISPM
LLAWLQSQSWPGNVRELKHNAERFVLGLLTHHQPVPMTQQEESGLTACID
AFEKKLIEDMLRQTEGQVSLTARLLQLPRKTLYDKLNKHQIQPQVYRPES
SS
>ECP_0964 cell division inhibitor
MYTSGYAHRSSSFSSAASKIARVSTENTTAGLISEVVYREDQPMMTQLLL
LPLLQQLGQQSRWQLWLTPQQKLSREWVQASGLPLTKVMQISQLSPCHTV
ESMVRALRTGNYSVVIGWLADDLTAEEHAELVDAANEGNAMGFIMRPVSA
SSHATRQLSGLKIHSNLYH
>ECP_3059 hypothetical protein YghO
MECDLLMIKIEKVINKNDLKAFIAFPSSLYPDDPNWIPPLFIERNEHLSA
KNPGTDHIIWQAWVAKKAGQIVGRITAQIDTLHRERYGKDTGHFGMIDAI
DDPQVFAALFGAAEAWLKSQGASKISGPFSLNINQESGLLIEGFDTPPCA
MMPHGKPWYAAHIEQLGYHKGIDLLAWWMQRTDLTFSPALKKLMDQVRKK
VTIRCINRQRFAEEMQILREIFNSGWQQNWGFVPFTEHEFATMGDQLKYL
VPDDMIYIAEIDSAPCAFIVGLPNINEAIADLNGSLFPFGWAKLLWRLKV
SGVRTARVPLMGVRDEYQFSRIGPVIALLLIEALRDPFARRKIDALEMSW
ILETNTGMNNMLERIGAEPYKRYRLYEKQI
>ECP_2579 hypothetical transcriptional regulator YfiE
MDLRRFITLKTVVEEGSFLRASQKLCCTQSTVTFHIQQLEQEFSVQLFEK
IGRRMCLTREGKKLLPHIYELTRVMDTLREAAKKESDPDGELRVISGETL
LSYRMPQVLQRFRQRAPKVRLSLQSLNCYVIRDALLNDEADVGVFYRVGN
DDALNRRELGEQPLALVASPQIADVDFTEPGRHNACSFIINEPQCVFRQI
FESTLRQRRITVENTIELISIESIKRCVAANIGVSYLPRFAVVKELECGE
LIELPFGEQSQTITAMCAHHAGKAVSPAMHTFIQCVEECFLPG
>ECP_1779 hypothetical protein YebU
MLVAQHTVYFPDAFLTQMREAMPSTLSFDDFLAACQRPLRRSIRVNTLKI
SVADFLQLTAPYGWTLTPIPWCEEGFWIERYNEDALPLGSTAEHLSGLFY
IQEASSMLPVAALFANGNAPQRVMDVAAAPGSKTTQIAARMNNEGAILAN
EFSASRVKVLHANISRCGISNVALTHFDGRVFGAAVPEMFDAILLDAPCS
GEGVVRKDPDALKNWSPESNQEIAATQRELIDSAFHALRPGGTLVYSTCT
LNREENEAVCLWLKETYPDAVEFLPLGDLFPGANKALTEEGFLHVFPQIY
DCEGFFVARLRKTQAIPALPTPKYKVGNFPFSPVKDREAGQIRQAAASVG
LNWDENLRLWQRDKELWLFPVGIEALIGKVRFSRLGIKLAETHNKGYRWQ
HEAVIALASPDNVNAFELTPQEAEEWYRGRDVYPHAAPVADDVLVTFQHQ
PIGLAKRIGSRLKNSYPRELVRDGKLFTGNA
>ECP_3049 putative prepilin peptidase
MFFDVFQQYPAAMPVLATVGGLIIGSFLNVVIWRYPIMLRQQMAEFHGEM
PSTQSKISLALPRSHCPHCQQTIRVRDNIPLLSWLMLKGRCRDCQAKISK
RYPLVELLTALAFLLASLVWPESGWGLAVMILSAWLIAASVIDLDNQWLP
DVFTQGVLWTGLIAAWAQQSPLTLQDAVTGVLVGFITFYSLRWIAGIVLR
KEALGMGDVLLFAALGGWVGALSLPNVALIASCCGLIYAVITKRGSTTLP
FGPCLSLGGIATLYLQALF
>ECP_2823 hypothetical protein
MASNANFISQFVMGGDPCTYKESGELQAEMSKLTHPARPDVDWRQVEKLS
LALFRQNGVELQTLVCYVLAITRRQGLAGMADGLGSLDILLQRWADFWPV
QVHSRISLLGWVTEKMQQALRTLDIQYQDLPQIYRCVQHLSAIETTLQQC
ELWHMTKLDLLAGQFRNTALRLERLAPQGAETTITPPELPRREMNQPKKS
EESPQPVFATRSVQQNDKDASPPVPSPEISRQRTWPIFMAGMVVMAGLGG
TGLWGWSQLNQPDALIQRIQLSVMPLPLSLESGELAKLDVKDKALLAQDR
TIAASQMQLEQLNKLPARWPLEQGYRQLRQLDALWPDNPQVRALNAQWRK
QRELSALSAEALNGYAQAQSQLQRLSAQLDALDERKGRYLTGSELKTAVY
GIRQSLKEPPLEELLRQLEEQKQTGEVSPTLLTQIDTRLNQLLNRYVILL
DTKVEQSQ
>ECP_4547 hypothetical protein
MLLYSLIGTCKLNDEDPENYLRHVFGVIADWPVNRVSELLPWRIALPAG
>ECP_2538 inositol-1-monophosphatase
MHPMLNIAVRAARKAGNLIAKNYETPDAVEASQKGSNDFVTNVDKAAEAV
IIDTIRKSYPQHTIITEESGELEGTDQDVQWVIDPLDGTTNFIKRLPHFA
VSIAVRIKGRTEVAVVYDPMRNELFTATRGQGAQLNGYRLRGSTARDLDG
TILATGFPFKAKQYATTYINIVGKLFNECADFRRTGSAALDLSYVAAGRV
DGFFEIGLRPWDFAAGELLVREAGGIVSDFTGGHNYMLTGNIVAGNPRVV
KAMLANMRDELSDALKR
>ECP_2695 putative molybdenum-pterin-binding protein
MAVSARNQLTGTVSAVAMGAVNDEVELTLAGGAKLVAIVTHSSQQALGLA
KGKEAIALIKAPWVTLATEDCGLKFSARNQFAGSVSTITEGAVNATVHIK
TDAGFEIVAVVTNESQDEMKLTTGSRVIALIKASAILIATKA
>ECP_2097 putative colanic acid biosynthesis glycosyl transferase WcaC
MNILQFNVRLAEGGAAGVALDLHQRALQQGLASHFVYGYGKGGKESVSHQ
NYPQVIKHTPRMTAMANIALFRLLNRDLFGNFNELYRTITRTPGPVVLHF
HVLHSYWLNLKSVVRFCEKVKNHKPDVTLVWTLHDHWSVTGRCAFTDGCE
GWKTGCQKCPTLNNYPPVKIDRAHQLVAGKRQLFREMLALGCQFISPSQH
VADAFNSLYGPGRCRIINNGIDMATEAILVDLPPVCETQGKPKIAVVAHD
LRYDGKTKQQLVREMMALGDKIELHTFGKFSPFTAGNVVNHGFETDKRKL
MSVLNQMDALVFSSRVDNYPLILCEALSIGVPVIATHSDAAREVLQKSGG
KTVSEEEVLQLVQLSKPEIAQAIFGTTLAEFSQRSRAAYSGQQMLEEYVN
FYQNL
>ECP_3358 hypothetical protein
MFRHYYSSKNQKRVICPKRQCVNSQKEKYSSLSYKLSFVDVRNSYEYWRL
KAHKHKKRLISYCFILTRI
>ECP_2852 transcriptional activator protein LysR
MAAVNLRHIEIFHAVMTAGSLTEAAHLLHTSQPTVSRELARFEKVIGLKL
FERIRGRLHPTVQGLRLFEEVQRSWYGLDRIVSAAESLREFRQGELSIAC
LPVFSQSFLPQLLQPFLARYPDVSLNIVPQESPLLEEWLSAQRHDLGLTE
TLHTPAGTERTELLSLDEVCVLPPGHPLAVKKVLTPDDFHGENYISLSRT
DSYRQLLDQLFTENQVKRRMIVETHSAASVCAMVRAGVGVSVVNPLTALD
YAASGLVVRRFSIAVPFTVSLIRPLHRPSSALVQAFSEHLQAGLPKLVTS
LDAILSSATTA
>ECP_3179 hypothetical protein YgjT (integral membrane protein TerC family)
MNTVGTPLLWGGFAVVVTIMLAIDLLLQGRRGAHAMTMKQAAAWSLVWVT
LSLLFNAAFWWYLVQTEGRAVADPQALAFLTGYLIEKSLAVDNVFVWLML
FSYFSVPAALQRRVLVYGVLGAIVLRTIMIFTGSWLISQFDWILYIFGAF
LLFTGVKMALAHEDESGIGDKPLVRWLRGHLRMTDTIDNEHFFVRKNGLL
YATPLMLVLILVELSDVIFAVDSIPAIFAVTTDPFIVLTSNLFAILGLRA
MYFLLAGVAERFSMLKYGLAVILVFIGIKMLIVDFYHIPIAVSLGVVFGI
LVMTFIINAWVNYRHDKQRVG
>ECP_1650 hypothetical protein
MKTAFHFYGFVYTDGLCGNCRREQICATLGSVLKSKKIYLLERFTVFFIR
LNMQR
>ECP_1218 septum site-determining protein MinC
MSNTPIELKGSSFTLSVVHLHEAEPKVIHQALEDKIAQAPAFLKHAPVVL
NVSALEAPVNWSAMHKAVSATGLRVIGVSGCKDAQLKAEIEKMGLPILTE
GKEKAPRPAPAPQAPAQNTTPVTKTRLIDTPVRSGQRIYAPQCDLIVTSH
VSAGAELIADGNIHVYGMMRGRALAGASGDRETQIFCTNLMAELVSIAGE
YWLSDQIPAEFYGKAARLQLVENALTVQPLN
>ECP_2706 putative transposase protein
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHNISHGSAGAR
SIATMATLRGFRMGRWLAGRLMKELGLVSCQQPAHRYKRGGREHVTIPNH
LGRQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTIKALKMAWEIRSKPAGVMFHSDQGSHYTSRQFRQLLWRYQIKQSL
SRRGNCWDNSPMERFFRSLKNEWIPVTGYMNFSDAAHEITDYIVGYYNAL
RPHEYNGGLPPNESENRYWKNSKAVASFC
>ECP_2598 hypothetical lipoprotein YfiO precursor
MTRMKYLVAAATLSLFLAGCSGSKEEVPDNPPNEIYATAQQKLQDGNWRQ
AITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFIRLNP
THPNIDYVMYMRGLTNMALDDSALQRFFGVDRSDRDPQHARAAFSDFSKL
VRGYPNSQYTTDATKRLVFLKDRLAKYEYSVAEYYTERGAWVAVVNRVEG
MLRDYPDTQATRDALPLMENAYRQMQMNAQAEKVAKIIAANSSNT
>ECP_0625 Isochorismate synthase EntC
MDTSLAEEVQQTMATLAPNRFFFMSPYRSFTTSGCFARFDEPAVNGDSPD
SPFQQKLAALFADAKAQGIKNPVMVGAIPFDPRQPSSLYIPESWQSFSRQ
EKQASARRFTRSQSLNVVERQAIPEQTTFEQMVARAAALTATPQVDKVVL
SRLIDITTDAAIDSGVLLERLIAQNPVSYNFHVPLADGGVLLGASPELLL
RKDGERFSSIPLAGSARRQPDEVLDREAGNRLLASEKDRHEHELVTQAMK
EVLRKRSSELHVPSSPQLITTPTLWHLATPFEGKANSQENALTLACLLHP
TPALSGFPHQAATQVIAELEPFDRELFGGIVGWCDSEGNGEWVVTIRCAK
LRENQVRLFAGAGIVPASSPLGEWRETGVKLSTMLNVFGLH
>ECP_3199 hypothetical protein YhaH
MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAG
GEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIV
FNCQAGTPGENRFGPDPKLEQE
>ECP_4649 type-1 fimbrial major subunit
MKIKTLAIVVLSALSLSSTAALAAATTVNGGTVHFKGEVVNAACAVDAGS
VDQTVQLGQVRTASLAQEGATSSAVGFNIQLNDCDTTVADKAAIAFLGTA
IDGNHPNVLALQSSAAGSATSVGVQILDRTGNALTLDGATFSAQTTLNNG
TNTIPFQARYYAIGEATPGAANADATFKVQYQ
>ECP_2784 L-fucose permease
MGNTSIQTQSYRAVDKDAGQSRSYIIPFALLCSLFFLWAVANNLNDILLP
QFQQAFTLTNFQAGLIQSAFYFGYFIIPIPAGILMKKLSYKAGIITGLFL
YAFGAALFWPAAEIMNYTLFLVGLFIIAAGLGCLKTAANPFVTVLGPESS
GHFRLNLAQTFNSFGAIIAVVFGQSLILSNVPHQSQDVLDKMSPEQLSAY
KHSLVLSVQTPYMIIVAIVLLVALLIMLTKFPALQSDNHSDAKQGSFSAS
LSRLARIRHWRWAVLAQFCYVGAQTACWSYLIRYAVEEIPGMTAGFAANY
LTGTMVCFFIGRFTGTWLISRFAPHKVLAAYALIAMALCLISAFAGGHVG
LIALTLCSAFMSIQYPTIFSLGIKNLGQDTKYGSSFIVMTIIGGGIVTPV
MGFVSDAAGNIPTAELIPALCFAVIFIFARFRSQTATN
>ECP_4210 uroporphyrinogen decarboxylase
MTELKNDRYLRALLRQPVDVTPVWMMRQAGRYLPEYKATRAQAGDFMSLC
KNAELACEVTLQPLRRYPLDAAILFSDILTVPDAMGLGLYFEAGEGPRFT
SPVTCKADVDKLPIPDPEDELGYVMNAVRTIRRELKGEVPLIGFSGSPWT
LATYMVEGGSSKAFTVIKKMMYADPQALHALLDKLAKSVTLYLNAQIKAG
AQAVMIFDTWGGVLTGRDYQQFSLYYMHKIVDGLLRENDGRRVPVTLFTK
GGGQWLEAMAETGCDALGLDWTTDIADARRRVGNKVALQGNMDPSMLYAP
PARIEEEVATILAGFGHGEGHVFNLGHGIHQDVLPEHAGVFVEAVHRLSE
QYHR
>ECP_4279 putative sodium:sulfate symporter transmembrane protein
MSSISHGAPQKRRIIPNPGLWLAIIAGIIITLLPLGDTLPVAGQNMIAIL
VFAIIVWISEAMDYTASAIVISALIIFMVGFAPDMNHPDTILGTAKALKM
TLSGFSNSALALVAAAMFIAAAMTITGLDKRIALFTMSKIGASSRSIIIG
AIVVTIVLSLVVPSATARTACVVPIMMGVIAAFKVDKHSRLAASMMIVIA
QATSIWNVGIQTSAAQNLLSIGFINKTFGAGHSVSWLDWLLAGAPWSLTM
SAILYFLARKLLPPETEAVEGGSEAIKKALAELGPTTGKEKRLIGISLLL
LLFWSTGGKLHSIDTTSVTLAGLAIMLLPGIGVMSWKEVEKRVQWGTLLM
FGIGISLGSTLLDTQAASWMANYVVKGFGLDGLPSLAIFAILAAFLIIIH
LGFASATALTAALLPILISLLSSLPPELGVNPVGMTILLAFSVSFGFILP
INAPQNMVCMGTDTFTPRQFTRVGLYLTVIGYLLLLLFAATWWKILGLM
>ECP_3051 hypothetical protein
MSGWLFLRNFNKQKINNFSQRNYINLLVVLTIKIAKATDRTVFAHQANII
NTDLFCVIKKQMFSRF
>ECP_3910 putative membrane transport protein
MSRFLICSFALVLLYPAGIDMYLVGLPRIAADLNASEAQLHIAFSVYLAG
MAAAMLFAGKVADRSGRKPVAIPGAALFIIASVFCSLAETSALFLAGRFL
QGLGAGCCYVVAFAILRDTLDDRRRAKVLSLLNGITCIIPVLAPVLGHLI
MLKFPWQSLFWTMATMGIAVLMLSLFILKETRPAAPAASDKPRENSESLL
NRFFLSRVVITTLSVSVILTFVNTSPVLLMEIMGFERGEYATIMALTAGV
SMTVSFSTPFALGIFKPRTLMITSQVLFLAAGITLAVSPSHAVSLFGITL
ICAGFSVGFGVAMSQALGPFSLRAGVASSTLGIAQVCGSSLWIWLAAVVG
IGAWNMLIGILIACSIVSLLLIMFVAPGRPVAAHEEIHHHA
>ECP_2283 anaerobic glycerol-3-phosphate dehydrogenase subunit A
MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRN
HGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDD
LSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDP
FRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVHVRNHLTGETQAL
HAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRK
PSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAP
VMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITI
TGGKLMTYRLMAEWATDAVCRKLGNSRPCTTADLALPGSQEPAEATLRKV
ISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVEN
LNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLST
FLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL
>ECP_0418 lactose operon repressor
MKPVTLYDVAEYAGVSYQTVSRVVNQACHVSAKTREKVEAAMAELNYIPN
RVAQQLAGKQSLLIGVATSSLALHAPSQIVAAIKSRADQLGASVVVSMVE
RSGVEACKAAVHNLLAQRVSGLIINYPLDDQDAIAVEAACANVPALFLDV
SDQTPINSIIFSHEDGTRLGVEHLVALGHQQIALLAGPLSSVSARLRLAG
WHKYLTRNQIQPIAEREGDWSAMSGFQQTMQMLNEGIVPTAMLVANDQMA
LGAMRAITESGLRVGADISVVGYDDTEDSSCYIPPLTTIKQDFRLLGQTS
VDRLLQLSQGQAVKGNQLLPVSLVKRKTTLPPNTQTASPRALADSLMQLA
RQVSRLESGQ
>ECP_2986 hypothetical protein
MIPWSGVTRRQEIRSMQGCEMNSNRLSAPVIFEDSSGCYPVCIKNPDIMD
NTTDSITGRSLFIIRA
>ECP_1142 host-nuclease inhibitor protein Gam
MNAYYIQDRLEAQSWVRHYQQIAREEKEAELADDMEKGLPQHLFESLCID
HLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV
>ECP_2360 hypothetical protein
MIQPISGPPPGQPPGQGDNLPSGAGNQPLSSQQRTSLESLMTKVTSLTQQ
QRAELWAGIRHDIGLSGDSPLLSRHFPAAEHNLAQRLLAAQKSHSARQLL
AQLGEYLRLGNNRQAVTDYIRHNFGQTPLNQLSPEQLKTILTLLQEGKMV
IPQPQQREATDRPLLPAEHNALKQLVTKLAAATGEPSKQIWQSMLELSGV
KDGELIPAKLFNHLVTWLQARQTLSQQNTPTLESLQMALKQPLDASELAA
LSAYIQQKYGLSAQSSLSSAQAEDILNQLYQRRVKGIEPRDMQPLLNPFP
PMMDTLQNMATRPALWILLVAIILMLVWLVR
>ECP_3187 hypothetical protein
MDYRTAMDKKLLALLILASLSPAKATLTKIPAGFEVIAQGQQEYIEVYFA
GKSLGKYYAMVNLDTVTFLDSSSLYSKLELSTDDQKIAHTVKEKLSQPLA
RHGELACGFVRTDSGCGYS
>ECP_2582 uracil-DNA glycosylase
MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF
TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIATPPSLLNMYKELENTI
PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS
LINQHRKGVVFLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGC
NHFVLANQWLEQRGETPIDWMPVLPAESE
>ECP_2443 hypothetical protein YfdK precursor
MLEGLAQKKDLIFVRDGDEHTCDEAVSHLRLKLGNTRNRIDTAEQFIDKV
ASSSSITGKPYIVKMPGKSDENAQPFLHALIAQTDKTVPAQ
>ECP_3568 hypothetical protein
MPEPVAEPALNGLRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHDVMGF
SAFWAGLVISLQYFATLLSRPHAGRYADLLGPKKIVVFGLCGCFLSGLGY
LTAGLTASLPVISLLLLCLGRVILGIGQSFAGTGSTLWGVGVVGSLHIGR
VISWNGIVTYGAMAMGAPLGVVFYHWGGLQALALIIMGVALVAILLAIPR
PTVKASKGKPLPFRAVLGRVWLYGMALALASAGFGVIATFITLFYDAKGW
DGAAFALTLFSCAFVGTRLLFPNGINRIGGLNVAMICFSVEIIGLLLVGV
ATMPWMAKIGVLLAGAGFSLVFPALGVVAVKAVPQQNQGAALATYTVFMD
LSLGVTGPLAGLVMSWAGVPVIYLAAAGLVAIALLLTWRLKKRPPVEIPE
AASSS
>ECP_3780 hypothetical protein
MPDQFASLGTAACVVDKAGNGMALSSWSASDATGAVTVGVVAKGTHQNSM
AQGEFSCTTRENEVYIGYDSGVINPVSPRGPDKIRGPGGISDGAWDTEAA
TIRQLNPLTDEVYSGISGRITA
>ECP_3851 hypothetical protein
MSDNRSRHDRLAVRLSLIISRLMAGESLSLKTLSDEFGVTERTLQRDFHQ
RLVHLDLEYRNGRYSLRRQSSPGAIPEMLSFIQNTGIARILPLRNGRLIT
CLTDNQEPSPCLIWLPAPDITATFPECFSQLILAIRQCNHISLMTERWYP
SLEPCRLIYYSGSWYLIALQKGKLQVFPLADIKSVSLTSERFERRGHIHS
LVAEERFISALPHFSFIHKLINTFNL
>ECP_2511 exodeoxyribonuclease VII large subunit
MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT
LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYEPRGDYQIIVE
SMQPAGEGLLQQKYEQLKAKLQAEGLFDQQYKKTLPSPAHCVGVITSKTG
AALHDILHVLKRRDPSLPVIIYPTAVQGDDAPGQIVRAIELANQRNECDV
LIVGRGGGSLEDLWSFNDERVARAIFASLIPVVSAVGHETDVTIADFVAD
LRAPTPSAAAEVVSRNQQELLRQVQSAQQRLEMAMDYYLANRTRRFTQIH
HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRAGQQQQRLTRQLVQ
QNPQSRIHRAQTRIQQLEYRLAETLRAQLSATRERFGNAVTHLEAVSPLS
TLARGYSVTSAADGALLKQVKQVKVGETLTTRLGDGVVISEVSAVTKTRK
SRKKTSNP
>ECP_1843 hypothetical protein
MVKKKTSRMGTKDNKDFQANLMDYSAVHSIICLTNSLFLMSFPAKFMHVL
TVILRCDIALLWRINFSLKLCQHSHLSFS
>ECP_2576 L-aspartate oxidase
MNTLPEHSCDVLIIGSGAAGLSLALRLADQHQVIVLSKGPVTEGSTFYAQ
GGIAAVFDETDNIDSHVEDTLIAGAGICDRHAVEFVASNARSCVQWLIDQ
GVLFDTHIQPNGEESYHLTREGGHSHRRILHAADATGREVETTLVGKAQN
HPNIRVLERSNAVDLIVSDKIGLPGTRRVVGAWVWNRNKETVETCHAKAV
VLATGGASKVYQYTTNPDISSGDGIAMAWRAGCRIANLEFNQFHPTALYH
PQARNFLLTEALRGEGAYLKRPDGTRFMPDFDERGELAPRDIVARAIDHE
MKRLGADCMFLDISHKPADFIRQHFPMIYEKLLGLGIDLTKEPVPIVPAA
HYTCGGVMVDDHGRTDVEGLYAIGEVSYTGLHGANRMASNSLLECLVYGW
SAAEDISRRIPYAHGVSTLPPWDESRVENPDERVVIQHNWHELRLFMWDY
VGIVRTTKRLERALRRITMLQQEIDEYYAHFRVSNNLLELRNLVQVAELI
VRCAMMRKESRGLHFTLDYPELLTHSGPSILSAGNHYINR
>ECP_3190 exu regulon transcriptional regulator
MEITEPRRLYQQLAADLKERIEQGVYLVGDKLPAERFIADEKNVSRTVVR
EAIIMLEVEGYVEVRKGSGIHVVSNQPRHQQSADNNMEFANYGPFELLQA
RQLIESNIAEFAATQVTKQDIMKLMAIQEQARGEQCFRDSEWDLQFHIQV
ALATQNSALAAIVEKMWTQRSHNPYWKKLHEHIDARTVDNWCDDHDQILK
ALIRKDPHAAKLAMWQHLENTKIMLFNETSDDFEFNADRYLFAENPVVHL
DTATSGSK
>ECP_1870 EmrE methyl viologen resistance protein C
MNSFVSLGFLLIIIVPAFISCHARAPWIHIHQDENGELCSNCSTILSSMN
RKEYAMNPYIYLGGAILAEVIGTTLMKFSEGFTRLWPSVGTIICYCASFW
LLAQTLAYIPTGIAYAIWSGVGIVLISLLSWGFFGQRLDLPAIIGMMLIC
AGVLVINLLSRSAPH
>ECP_0285 ABC transporter ATP-binding protein
MVYVLIRLVTYGYYRQISEETLVRGARASSYFMESLYGIATVKIQGMVGI
RGTHWLNLKIDAINSGIKLTRMDLLFGGINTFVAACDQMAILWLGASLVI
DNQMTIGMFVAFGSFRGQFSDRVASLTSFLLQLRIMSLHNERIADIALHE
KEEKKPEIEIVADMSPVSLETTDLSYRYDSQSAQVFSGLNLSVAPGESVA
ITGASGAGKTTLMKVLCGLFEPDSGKVLVNGTDIRQLGINNYHRMIACVM
QDDRLFSGSIRENICGFAEETDDEWMTECARASHIHDVIMKMPMGYETLI
GELGEGLSGGQKQRIFIARALYRKPGILFMDEATSSLDTESERFVNAAIK
K
>ECP_3512 hypothetical membrane protein
MIKFRLYIPPVILGFVIVPLLVWPTVIALAVLIFTLTFLAEIIFSFPLLV
VRISLQELQLELLVEYALFFSVMCGIGWQFSRRTPPELKNRLHCWLVFSP
VYFWLILSNFILYISPEKSALLENIRNFFLTFVWLPLNFSPFWPQPWTDF
VGPISAQLGFALGYYCQWRSKNRSHRKKWGDWVTCLSLAILSLGPLFNYS
Q
>ECP_3913 hypothetical protein YieF
MSEKLQVVTLLGSLRKGSFNGMVARTLPKIAPASMEINALPSIADIPLYD
ADVQQEDGFPATVEALAEQIRQADGVVIVTPEYNYSVPGGLKNAIDWLSR
LPDQPLAGKPVLIQTSSMGVIGGARCQYHLRQILVFLDAMVMNKPEFMGG
VIQNKVDPQTGEVIDQGTLDHLTGQLTAFGEFIQRVKI
>ECP_3905 ribonuclease P protein component
MLTPSQFTFVFQQPQRAGTPQITILGRLNSLGHPRIGLTVAKKNVRRAHE
RNRIKRLTRESFRLRQHELPAMDFVVVAKKGVADLDNRALSEALEKLWRR
HCRLARGS
>ECP_3581 hypothetical protein YhiM
MNIYIGWLFKLIPLLMGIICIALGEFVLTGSGQSEYFVAGHVLISLSAIC
LALFTTAFIIISQLTHGMNKFYNRLFPVIGYAGSATTMIWGWSLLASNNV
MADEFVAGHVIFGVGMIAACVSTVAASSGHFLLIPKNASGSKSDGTPLQA
YSSTIGNSLIAVPVLLTLFGFIWSVILLRSADITPHYVAGHVLMGLTAIC
ACLIGLVATIVHQTRNTFSVKEHWLWCYWVILLGSLTIIFGIYVLISSDA
SARLAPGIILICLGMICYSIFSKVWLLALVWRRTCSLANRIPMIPVFTCL
FCLFLAAFLAEIAQTDMAYFIPSRVLVGLGAVCFTLFSIVSILEAGSAKK
>ECP_3499 putative amidophosphoribosyltransferase
MLTVPGLCWLCRMPLALGHWGICSVCSRAARTDKTLCPQCGLPATHSHLP
CGRCLQKPPPWQRLVTVADYAPPLSPLIHQLKFSRRSEIASALSRLLLLE
VLHARRTTGLQLPDRIISVPLWQRRHWRRGFNQSDLLCQPLSRWLHCQWD
SEAVTRTRATATQHFLNARLRKHNLKNAFRLELPVQGRHMVIVDDVVTTG
STVAEIAQLLLRNGAATVQVWCLCRTL
>ECP_1414 putative phosphatidate cytidylyltransferase
MTLTFFALISFLALKEYCTLISVHFPRWLYWGIPLNYLLIGFNCFELFLL
FIPLAGFLILATWRVLVGDPSGFLHTVSAIFCGWIMTVFTLSHAAWLLML
PTINIHGGALLVLFLLALTESNDIAQYLWGKSCGRRKVVPKVSPGKTLEG
LVGGVITTMIASLIIGPLLTPLNTLQVLLAGLLIGISGFCGDVVMSATKR
DVGVKDSGKLLPGHGGLLDRIDSLIFTAPVFFYFIRYCCY
>ECP_0640 putative universal stress protein
MYKTIIMPVDVFEMELSDKAIRHAEFLAQDDGVIHLLHVLPGSASLSLHR
FAADVRRFEEHLQHEAEERLQTMVSHFTIDPSRIKQHVRFGSVRDEVNEL
AKELDADVVVIGSRNPSISTHLLGSNASSVIRHANLPVLVVR
>ECP_1099 beta-hexosaminidase
MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR
NHLVVAVDQEGGRVQRFREGFTRLPAAQSFAALLGMEEGGKLAQEAGWLM
ASEMIAMDIDISFAPVLDVGHISAAIGERSYHADPQKALAIASRFIDGMH
EAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIREN
KLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEG
AAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY
HKGSFSRQELMDSARWKAISARLNQLHERWQEEKAGH
>ECP_2480 AegA protein
MNRFIMANSQQCLGCHACEIACVMAHNDEQHVLSQHHFHPRITVIKHQQQ
RSAVTCHHCEDAPCARSCPNGAISHVDDSIQVNQQKCIGCKSCVVACPFG
TMQIVLTPVAVGKVKATAHKCDLCAGRENGPACVENCPADALQLVTDAAL
SGMAKSRRLRTARQEQQPWHASTAAQEMPVMSKVEQMQATPARGEPDKLA
IEARKTGFDEIYLPFRADQAQREASRCLKCGEHSVCEWTCPLHNHIPQWI
ELVKAGNIDAAVELSHQTNTLPEITGRVCPQDRLCEGACTIRDEHGAVTI
GNIERYISDQALAKGWRPDLSHVTKVDKRVAIIGAGPAGLACADVLTRNG
VAVTVYDRHPEIGGLLTFGIPSFKLDKSLLARRREIFSAMGIHFELNCEV
GKDVSLNSLLEQYDAVFVGVGTYRSMKAGLPNEDAPGVYDALPFLIANTK
QVMGLEELPEEPFINTAGLNVVVLGGGDTAMDCVRTALRHGASNVTCAYR
RDEANMPGSKKEVKNAREEGANFEFNVQPVALELNEQGHVCGIRFLRTRL
GEPDAQGRRRPVPVEGSEFVMPADAVIMAFGFNPHGMPWLESHGVTVDKW
GRIIADVESQYRYQTTNPKIFAGGDAVRGADLVVTAMAEGRHAAQGIIDW
LGVKSVKSH
>ECP_3360 acriflavine resistance protein F
MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYP
GADAQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDP
DIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVPGFVSDNPGTT
QDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTP
VDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRLKNPEEFGKVT
LRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL
DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIQEVVKTLFEAIML
VFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVL
AIGLLVDDAIVVVENVERVMMEDKLPPREATEKSMSQIQGALVGIAMVLS
AVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPV
SAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAG
MVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN
EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERSGDENSAEAVIHRA
KMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ
LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTIS
TALGGTYVNDFIDRGRVKKVYVQADAKFRMLPEDVDKLYVRSANGEMVPF
SAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPRTSSGDAMALMENLASKL
PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV
MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD
LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN
AVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFKG
>ECP_2540 hypothetical protein CsiE
MMPTLAPPSVLSAPQRRCQILLTLFQPGLTATTATFSELNGVDDDIASLD
ISETGREILRYHQLTLTTGYDGSYRVEGTVLNQRLCLFHWLRRGFRLCPS
FITSRFTPALKSELKRRGIARNFYDDTNLQALVNLCSRRLQKRFETRDIH
FLCLYLQYCLLQHHAGITPQFNPLQRRWAESCLEFQVAQEIGRHWQRRAL
QPVPPDEPLFMALLFSMLRVPDPLRDAHQRDRQLRQSIKRLVNHFRELGN
VRFYDEQGLCDQLYTHLAQALNRSLFAIGIDNTLPEEFARLYPRLVRTTR
AALAGFESEYGVHLSDEESSLVAVIFGAWLMQENDLHEKQIILLTGNDSE
REAQIEQQLRELTLLPLNIKHMSVKVFLQTGAPRGAALIIAPYTMPLPLF
SPPLIYTDLTLTTHQQEQIRKMLESA
>ECP_3364 hypothetical amino-acid ABC transporter permease protein YhdX
MSHRRSTVKGSLSFANPTVRAWLFQILAVVAVVGIIGWLFHNTVTNLSNR
GITSGFAFLDRGAGFGIVQHLIDYQQGDTYGRVFIVGLLNTLLVSALCIV
FASVLGFFIGLARLSDNWLLRKLSTIYIEIFRNIPPLLQIFFWYFAVLRN
LPGPRQAVSALDLVFLSNRGLYIPSPQLGDGFLAFILAVVIAIVLSVGLF
RFNKTHQIKTGQLRRTWPIAAVLIIGLPLLAQWLFGAALHWDVPALRGFN
FRGGMVLIPELAALTLALSVYTSAFIAEIIRAGIQAVPYGQHEAARSLGL
PNPVTLRQVIIPQALRVIIPPLTSQYLNIVKNSSLAAAIGYPDMVSLFAG
TVLNQTGQAIETIAMTMSVYLIISLTISLLMNIYNRRIAIVER
>ECP_0691 PTS system, N-acetylglucosamine-specific IIABC component
MNILGFFQRLGRALQLPIAVLPVAALLLRFGQPDLLNVAFIAQAGGAIFD
NLALIFAIGVASSWSKDSAGAAALAGAVGYFVLTKAMVTINPEINMGVLA
GIITGLVGGAAYNRWSDIKLPDFLSFFGGKRFVPIATGFFCLVLAAIFGY
VWPPVQHAIHAGGEWIVSAGALGSGIFGFINRLLIPTGLHQVLNTIAWFQ
IGEFTNAAGTVFHGDINRFYAGDGTAGMFMSGFFPIMMFGLPGAALAMYF
AAPKERRPMVGGMLLSVAVTAFLTGVTEPLEFLFMFLAPLLYLLHALLTG
ISLFVATLLGIHAGFSFSAGAIDYALMYNLPAASQNVWMLLVMGVVFFAI
YFVVFSLVIRMFNLKTPGREDKEDEIVTEEANSNTEEGLTQLATNYIAAV
GGSDNLKAIDACITRLRLTVADSARVNDAMCKRLGASGVVKLNKQTIQVI
VGAKAESIGDAMKKVVARGPVAAASAEATPATAAPVAKPQAVPNAVSIAE
LVSPITGDVVALDQVPDEAFASKAVGDGVAVKPTDKIVVSPAAGTIVKIF
NTNHAFCLETEKGAEIVVHMGIDTVALEGKGFKRLVEEGAQVSAGQPILE
MDLDYLNANARSMISPVVCSNIDDFSGLIIKAQGHVVAGQTPLYEIKK
>ECP_1488 amino acid antiporter
MATSVQTGKAKQLTLLGFFAITASMVMAVYEYPTFATSGFSLVFFLLLGG
ILWFIPVGLCAAEMATVDGWEEGGVFAWVSNTLGPRWGFAAISFGYLQIA
IGFIPMLYFVLGALSYILKWPALNEDPITKTIAALIILWALALTQFGGTK
YTARIAKVGFFAGILLPAFILIALAAIYLHSGAPVAIEMDAKTFFPDFSK
VGTLVVFVAFILSYMGVEASATHVNEMSNPGRDYPLAMLLLMVAAICLSS
VGGLSIAMVIPGNEINLSAGVMQTFTVLMSHVAPEIEWTVRVISALLLLG
VLAEIASWIVGPSRGMYVTAQKNLLPAAFAKMNKNGVPVTLVISQLVITS
IALIILTNTGGGNNMSFLIALALTVVIYLCAYFMLFIGYIVLVLKHPDLK
RTFNIPGGKGVKLVVAIVGLLTSIMAFIVSFLPPDNIQGDSTDMYVELLV
VSFLVVLALPFILYAVHNRKGKGNTGVTLEPINSQNAPKGHFFLHPRARS
PHYIVMNDKKH
>ECP_1874 flagellar assembly protein FliH
MSDNLPWKTWMPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQ
AHEQGYQAGIAEGRQQGHEQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQ
LVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTMDNSALIKQIQQL
LQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCK
VSADEGDLDASVATRWQELCRLAAPGVV
>ECP_3562 lead, cadmium, zinc and mercury transporting ATPase
MSTPDNHGKKAPQFAAFKPLTTVQNTNDCCCDGACSSTPTLSENVSGTRY
SWKVSGMDCAACARKVENAVRQLAGVNQVQVLFATEKLVVDADNDIRAQV
ESAVQKAGYSLRDEQASDEPQESRLKENLPLITLIVMMAISWGLEQFNHP
FGQLAFIATTLVGLYPIARQALRLIKSGSYFAIETLMSVAAIGALFIGAT
AEAAMVLLLFLIGERLEGWAASRARQGVSALMALKPETATRLRNGEREEV
AINSLRPGDVIEVAAGGRLPADGKLLSPFASFDESALTGESIPVERATGD
KVPAGTTSVDRLVTLEVLSEPGASAIDRILKLIEEAEERRAPIERFIDRF
SRIYTPAIMAVALLVTLVPPLLFAASWQEWIYKGLTLLLIGCPCALVIST
PAAITSGLAAAARRGALIKGGAALEQLGRVTQVAFDKTGTLTVGKPRVTA
IHPATGISESELLTLAAAVEQGATHPLAQAIVREAQVAELAIPTAQSQRA
LVGSGIEAQVNGERVLICAAGKHPADAFAGLINELESAGQTVVLVVRNDD
VLGVIALQDTLRADAATAISELNALGVKGVILTGDNPRAAAAIAGELGLE
FKAGLLPEDKVKAVTELNQHAPLAMVGDGINDAPAMKAAAIGIAMGSGTD
VALETADAALTHNHLRGLVQMIELARATHANIRQNITIALGLKGVFLVTT
LLGMTGLWLAVLADTGATVLVTANALRLLRRK
>ECP_2781 exodeoxyribonuclease IX
MAVHLLIVDALNLIRRIHAVQGSPCVETCQHALDQLIMHSQPTHAVAVFD
DENRSSGWRHQRLPDYKAGRPPMPEELHNEMPALRAAFEQRGVPCWSASG
NEADDLAATLAVKVTQAGHQATIVSTDKGYCQLLSPTLRIRDYFQKRWLD
APFIDKEFGVQPQQLPDYWGLAGISSSKVPGVAGIGPKSATQLLVEFQSL
EGIYENLDAVAEKWRKKLENHKEMAFLCRDIARLQTDLHIDGNLQQLRLV
R
>ECP_4653 FimF protein precursor
MRNKPFYLLCAFLWLAVSHALAADSTITIRGYVRDNGCSVAAESTNFTVD
LMENAVKQFNNIGATTPVVPFRILLSPCGNAVSAVKVGFTGVADSHNANL
LALENTVSAAAGLGIQLLNEQQNEIPLNAPSSAISWTTLTPGKPNTLNFY
ARLMATQVPVTAGHINATATFTLEYQ
>ECP_1315 putative membrane protein YciQ
MMAGIYRCILLLIVGLFFSSLSYAKNTEIPSYEEGISLFDVEATLQPNGV
LDIKENIHFQARNQQIKHGFYRDLPRLWMQPDGDAALLNYHIVGVTRDGI
PEPWHLDWHIGLMSIVVGDKQRFLPQGDYHYQIHYQVKNAFLREGDSDLL
IWNVTGNHWPFEIYKTLFSLKLPDIAGNPFSEIDLFTGEEGDTYRNGRIL
EDGRIESRDPFYREDFTVLYRWPHALLGNAPAPQTTNIFSHLLLPSMSSL
LICFPSLFLACGWLYLWKRRPQFTPVDVIETDVIPPDYTPGMLRLDAKLV
YDDKGFCADIVNLIVKGKIHLEDHYDKNQQILIRVNEGATRNNAVLLPAE
QLLLEALFRKGDKVVLTGRRNRVLRKAFLRMQKFYLPRKKSSFYRPDAFL
QWGGMAILAVILYGNLSPVGWAGMSLVGDMFIMICWLLTFLFCSLDLLFA
RDDDKPCVNRVIITLFLPLICSGVAFYSLYINVGDVFFYWYMPAGYFSAV
FLTGYLTGMGYIFLPKFTQTGQQRYAHGEAIVNYLARKEAATHSGRRRKG
ETRKLDYALLGWAISANLGREWALRIAPSLSSAICAPEIARNGVLFSLQT
HLSCGVYTSLLGRSYSGGGSGAGGGGGGGW
>ECP_2003 hypothetical protein
MFFGGERSALLYGLIGTCLLNDIDPEAYLRHILSVLPE
>ECP_0137 putative PTS system IIA component Yadi
MLGWVITCHDDRAQEILDALEKKHGALLQCRAVNFWRGLSSNMLSRMMCD
ALHETDSGEGVIFLTDIAGAPPYRVASLLSHKHSRCEVISGVTLPLIEQM
MACRETMTSSAFRERIVELGAPEVSSLWHQQQKNPPFVLKHNLYEY
>ECP_2880 hypothetical protein YgfT (putative pyridine nucleotide-disulphide oxidoreductase)
MNKFIAAEAAECIGCHACEIACAVAHNQENWPLSHSDFRPRIHIVGKGQA
ANPVACHHCNNAPCVTACPVYALTFQSDSVQLDEQKCIGCKRCAIACPFG
VVEMVDTIAQKCDLCNQRSSGTQACIDVCPTQALRLMDDKGLQQIKVARQ
RKTAAGKASSDAQPSRSAALLPVNSRKGADKISASERKTHFGEIYCGLDP
QQATYESDRCVYCAEKANCNWHCPLHNAIPDYIRLVQEGKIIEAAELCHQ
TSSLPEICGRVCPQDRLCEGACTLKDHSGAVSIGNLERYITDTALAMGWR
PDVSKVVPRSEKVAVIGAGPAGLGCADILARAGVQVDVFDRHPEIGGMLT
FGIPPFKLDKTVLSQRREIFIAMGIDFHLNCEIGRDISFNELTAEYDAVF
LGVGTYGMMRADLPHEDAPGVIQALPFLTAHTRQLMGLPESAEYPLTDVE
GKRVVVLGGGDTTMDCLRTSIRLNAASVTCAYRRDEVSMPGSRKEVVNAR
EEGVEFQFNVQPQYIACDEDGRLTAVGLIRTAMGEPGPDGRRRPRPVAGS
EFELPADVLIMAFGFQAHTMPWLQGSGIKLDKWGLIQTGDVGYLPTQTHL
KKVFAGGDAVHGADLVVTAMAAGRQAARDMLTLFDTKAS
>ECP_0966 probable membrane protein YccS
MAFMLSPLLKRYTWNSAWLYYARIFIALCGTTAFPWWLGDVKLTIPLTLG
MVAAALTDLDDRLAGRLRNLIITLFCFFIASTSVELLFPWPWLFAIGLTL
STSGFILLGGLGQRYATIAFGALLIAIYTMLGTSLYEHWYQQPMYLLAGA
VWYNVLTLIGHLLFPVRPLQDNLARCYEQLARYLELKSRMFDPDIEDESQ
APLYDLALANGQLMATLNQTKLSLLTRLRGDRGQRGTRRTLHYYFVAQDI
HERASSSHIQYQTLREHFRHSDVLFRFQRLMSMQGQACQQLSRCILLRQP
YQHDPHFERAFTHIDAALERMRDNGAPADLLKTLGFLLNNLRAIDAQLAT
IESEQAQALPHNNDENELADDSPHGLSDIWLRLSRHFTPESALFRHAVRM
SLVLCFGYAIIQITGMHHGYWILLTSLFVCQPNYNATRHRLKLRIIGTLV
GIAIGIPVLWFVPSLEGQLVLLVITGVLFFAFRNVQYAHATMFITLLVLL
CFNLLGEGFEVALPRVIDTLIGCAIAWAAVSYIWPDWKFRNLPRMLERAT
EANCRYLDAILEQYHQGRDNRLAYRIARRDAHNRDAELASVVSNMSSEPN
VTPQIREAAFRLLCLNHTFTSYISALGAHREQLTNPEILAFLDDAVCYVD
DALHHQPADEERVNQALAGLKQRMQQLEPRADSKEPLVVQQVGLLIALLP
EIGRLQRQITQVPQETPVLA
>ECP_0003 homoserine kinase
MVKVYAPASSANMSVGFDVLGAAVTPVDGALLGDVVTVEAAETFSLNNLG
RFADKLPSEPRENIVYQCWERFCQELGKQIPVAMTLEKNMPIGSGLGSSA
CSVVAALMAMNEHCGKPLNDTRLLALMGELEGRISGSIHYDNVAPCFLGG
MQLMIEENDIISQQVPGFDEWLWVLAYPGIKVSTAEARAILPAQYRRQDC
IAHGRHLAGFIHACYSRQLELAAKLMKDVIAEPYRERLLPGFRQARQAVA
EIGAVASGISGSGPTLFALCDKPDTAQRVADWLGKNYLQNQEGFVHICRL
DTAGARVLEN
>ECP_0075 3-isopropylmalate dehydrogenase
MSKNYHIAVLPGDGIGPEVMTQALKVLDAVRNRFAMRITTSHYDVGGAAI
DNHGQPLPTATVEGCEQADAVLFGSVGGPKWEHLPPDQQPERGALLPLRK
HFKLFSNLRPAKLYQGLEAFCPLRADIAANGFDILCVRELTGGIYFGQPK
GREGSGQYEKAFDTEVYHRFEIERIARIAFESARKRRHKVTSIDKANVLQ
SSILWREIVNEIATEYPDVELTHMYIDNATMQLIKDPSQFDVLLCSNLFG
DILSDECAMITGSMGMLPSASLNEQGFGLYEPAGGSAPDIAGKNIANPIA
QILSLALLLRYSLDADDAASAIERAINRALEEGIRTGDLARGAAAVSTDE
MGDIIARYVAEGV
>ECP_1905 hypothetical protein
MTLDHDDLTRRFPLEERIYRMRCVEAWSMVVPWIGFPLHKLLALAEPTSN
AKYVAFETIYAPEQMPGQQDRFIGGGLKYPYVEGLRLDEAMHPLTLMTVG
VYGKALPPQNGAPVRLIVPWKYGFKGIKSIVSIKLTRERPPTTWNLAAPD
EYGFYANVNPHVDHPRWSQATERFIGSGGILDVQRQPTLLFNGYADQVAS
LYRGLDLRENF
>ECP_0132 glucose dehydrogenase
MAINNTGSRRLLVTLTALFAALCGLYLLIGGGWLVAIGGSWYYPIAGLVM
LGVAWMLWRSKRAALWLYAALLLGTMIWGVWEVGFDFWALTPRSDILVFF
GIWLILPFVWRRLVIPASGAVAALVVALLISGGILTWAGFNDPQEISGTL
SADTTPAEAISPVADQDWPAYGRNQEGQRFSPLKQINADNVHKLKEAWVF
RTGDVKQPNDPGEITNEVTPIKVGDTLYLCTAHQRLFALDAASGKEKWHY
DPELKTNESFQHVTCRGVSYHEAKAETASPEVMADCPRRIILPVNDGRLI
AINAENGKLCETFANKGVLNLQSNMPDTKPGLYEPTSPPIITDKTIVMAG
SVTDNFSTRETSGVIRGFDVNTGELLWAFDPGAKDPNAIPSDEHTFTFNS
PNSWAPAAYDAKLDLVYLPMGVTTPDIWGGNRTPEQERYASSILALNATT
GKLAWSYQTVHHDLWDMDLPAQPTLADITVNGQKVPVIYAPAKTGNIFVL
DRRNGELVVPAPEKPVPQGAAKGDYVTPTQPFSELSFRPTKDLSGADMWG
ATMFDQLVCRVMFHQMRYEGIFTPPSEQGTLVFPGNLGMFEWGGISVDPN
REVAIANPMALPFVSKLIPRGPGNPMEQPKDAKGTGTESGIQPQYGVPYG
VTLNPFLSPFGLPCKQPAWGYISALDLKTNEVVWKKRIGTPQDSMPFPMP
VPVPFNMGMPMLGGPISTAGNVLFIAATADNYLRAYNMSNGEKLWQGRLP
AGGQATPMTYEVNGKQYVVISAGGHGSFGTKMGDYIVAYALPDDVK
>ECP_1178 minor tail protein G
MFLKTESFEHNGVTVTLSELSALQRIEHLALMKRQAEQAESDSNRKFTVE
DAIRTGAFVVAMSLWHNHPQKTKQPSMNEAVKQIEQEVLTTWPTEAISHA
ENVVYRLSGMYEFVMNDAPEQAEDVGPAEPVSAGKCSTVS
>ECP_1407 hypothetical protein YdbH
MLGKYKAVLALLLLIILVPLTLLMTLGLWVPTLAGIWLPLGTRIALDESP
RITRKGLIIPDLRYLVGDCQLAHITNASLSHPSRWLLNVGTVELDSACLA
KLPQAEQSPAAPKTLAQWQSMLPNTWINIDKLIFSPWQEWQGKLSLALTS
DIQQLRYQGEKVKFQGQLKGQQLTVSELDVAAFENQPPVKLVGEFTMPLV
PDGLPVSGHVTATLNLPQEPSLVDVELDWQENSGQLIVLARDNGDPLLDL
PWQITRQQLTVSDGRWSWPYAGFPLSGRLGVKVDNWQAGLENALVSGRLS
VLTQGQAGKGNAVLNFGPGKLSMDNSQLSLQLTGEAKQADLILYARLPAQ
LSGSLSDPTLTFEPGALLRSKGRVIDSLDIDEIRWPLAGVKVTQRGVDGR
LQAILQAHENELGDFVLHMDGLANDFLPDAGRWQWRYWGKGSFTPMNATW
DVAGKGEWHDSTITLTDLSTGFDQLQYGTMTVEKPRLILDKPVVWGRDAQ
HPSFSGALSLDAGQTLFTGGSVLPPSTLKFSVDGRDPTYFLFKGDLHAGE
IGPVRVNGRWDGIRLRGNAWWPKQSLTVFQPLVPPDWKMNLRDGELYAQV
AFSAAPEQGFRAGGHGVLKGGSAWMPDNQVNGVDFVLPFRFADGAWHLGT
RGPVTLRIAEVINLVTAKNITADLQGRYPWTEEEPLLLTDVSVDVLGGNV
LMKQLRMPQHDPALLRLNNLSSSELVSAVNPKQFAMSGAFSGALPLWLNN
EKWIVKDGWLANSGPMTLRLDKDTADAVVKDNMTAGSAINWLRYMEISRS
STKINLDNLGLLTMQANITGTSRVDGKSGTVNLNYHHEENIFTLWRSLRF
GDNLQAWLEQNARLPGNDCPQGKECEDKQ
>ECP_0244 H repeat-associated protein of Rhs element
MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRGF
GETHLDFLK
>ECP_1235 muramoyltetrapeptide carboxypeptidase
MSLFHLIAPSGYCIKQHAALRGIQRLTDAGHQVNNVEVIARRCERFAGTE
TERLEDLNSLARLTTPNTIVLAVRGGYGASRLLADIDWQALVARQQHDPL
LICGHSDFTAIQCGLLAQGNVITFSGPMLVANFGADELNAFTEHHFWLAL
RNKTFTIEWQGEGPTCQTEGTLWGGNLAMLISLIGTPWMPKIENGILVLE
DINEHPFRVERMLLQLYHAGILPRQKAIILGSFSGSTPNDYDAGYNLESV
YAFLRSRLSISLITGLDFGHEQRTVTLPLGAHAILNNTQEGTQLTISGHP
VLKM
>ECP_2537 hypothetical tRNA/rRNA methyltransferase YfhQ
MLQNIRIVLVETSHTGNMGSVARAMKTMGLTNLWLVNPLVKPDSQAIALA
AGASDVIGNAHIVDTLDEALAGCSLVVGTSARSRTLPWPILDPRECGLKS
VAEAANTPVALVFGRERVGLTNEELQKCHYHVAIAANPEYSSLNLAMAVQ
VIAYEVRMAWLATQENGEQVEHEETPYPLVDDLERFYGHLEQTLLATGFI
RENHPGQVMNKLRRLFTRARPESQELNILRGILASIEQQNKGNKAE
>ECP_0255 putative lipoprotein
MSFMSSFLHGRFLHPGVFSLCVLLPLLASATTSHISFSYAARQRMQNRAR
LLKQYQTHLKKQASYIVEGNAKSRRALRQHNREQIKQHPEWFPAPLKASD
RRWQALVENNHFLSSDHLHNITEVAIHRLEQQLGKPYIWGGTWPDKGFDC
SGLVFYAYNKILEAKLPRTANEMYHYRRATIVANNDLRRGDLLFFHIHSR
EIADHMGVYLGDGQFIESPRTGETIRVSRLAEPFWQDHFLGARRILTEET
IL
>ECP_2926 PTS system, mannitol (cryptic)-specific IIBC component
MENKSARAKVQAFGGFLTAMVIPNIGAFIAWGFITALFIPTGWLPNEHFA
KIVGPMITYLLPVMIGSTGGHLVGGKRGAVMGGIGTIGVIVGAEIPMFLG
SMIMGPLGGLVIKYIDKSLEKRIPAGFEMVINNFSLGIAGMLLCLLGFEV
IGPAVLIANTFVKECIEALVHAGYLPLLSVINEPAKVLFLNNAIDQGVYY
PLGMQQASVNGKSIFFMVASNPGPGLGLLLAFTLFGKGMSKRSAPGAMII
HFLGGIHELYFPYVLMKPLTIIAMIAGGMSGTWMFNLLDGGLVAGPSPGS
IFAYLALTPKGSFLATIAGVTVGTLVSFAITSLILKMEKTVETKSEDEFA
QSANAVKAMKQEGAFSLSRVKRIAFVCDAGMGSSAMGATTFRKRLEKAGL
AIEVKHYAIENVPADADIVVTHASLEGRVKRVTDKPLILINNYIGDPKLD
TLFNQLTAEHKH
>ECP_2233 nitrate/nitrite response regulator protein NarP
MPEATPFQVMIVDDHPLMRRGVRQLLELDSGFEVVAEAGDGASAIDLANR
LDIDVILLDLNMKGMSGLDTLNALRRDGVTAQIIILTVSDASSDVFALID
AGADGYLLKDSDPEVLLEAIRAGAKGSKVFSERVNQYLREREMFGAEEDP
FSVLTERELDVLHELAQGLSNKQIASVLNISEQTVKVHIRNLLRKLNVRS
RVAATILFLQQRGAQ
>ECP_2415 hypothetical protein YfdO
MLHPRARTMLLLSLPAVAIGIASSLILIMVMKIASVLQNLLWQRLPGTLG
IAQDSPLWIIGVLTLTGIAVGLVIRFSQGHAGPDPACEPLIGAPVPPSAL
PGLIVALILGLAGGVSLGPEHPIITVNIALAVAIGARLLPRVNRMEWTIL
ASAGTIGALFGTPVAAALIFSQTLNGSNEVPLWDRLFAPLMAAAAGALTT
GLFFHPHFSLPIAHYGQMEMTDILSGAIVAAIAIAAGMVAVWCLPRLHAM
MHQMKNPVFVLGIGGLILGILGVIGGPVSLFKGLDEMQQMVANQAFSTSG
YFLLAVIKLAALVVAAASGFRGGRIFPAVFVGVALGLMLHEHVPAVPAAI
TVSCAILGIVLVVTRDGWLSLFMAAVVVPNTTLLPLLCIVMLPAWLLLAG
KPMMMVNRQKQQPPHDNV
>ECP_2550 hypothetical protein YphG
MTPVKVWQERVEIPTYETGPQDIHPMFLENRVYQGSSGAVYPYGVTDTLS
EQKTLKSWQAVWLENDYIKVMILPELGGRVHRAWDKVKQRDFVYHNEVIK
PALVGLLGPWISGGIEFNWPQHHRPTTFMPVDFTLDAHDDGAQTVWVGET
EPMHGLQVMTGFTLRPDRAALEIASRVYNGNATPRHFLWWANPAVKGGEG
HQSVFPPDVTAVFDHGKRAVSAFPIATGTYYKVDYSAGVDISRYKNVPVP
TSYMAEKSQYDFVGAWCHDEDGGLLHVANHHIAPGKKQWSWGHSEFGQAW
DKSLTDNNGPYIELMTGIFADNQPDFTWLDAYEEKRFEQYFLPYHSLGMV
QNASRDAVIKLQRSERGIEWGLYAISPLNGYRLAIREIGKCNALLDDAVA
LTPATAIQGVLHGINPERLTIELSDADGNIVLSYDEHQPQALPLPDVAKA
PLAAQDITSTDEAWFIGQHLEQYHHASRSPFDYYLRGVALDPLDYRCNLA
LAMLEYNRADFPQAVAYATQALKRAHALNKNPQCGQASLIRASAYERQGQ
YQQAEEDFWRAVWSGNSKAGGYYGLARLAARNGNFDTGLDFCQQSLRACP
TNQEVLCLHNLLLVLSGRQDNARLQREKLLRDYPLNATLWWLNWFDGRSE
SALAQWRGLCQGRDVNALMTAGQLINWGMPALAAEMLNALDCQRTLPLYL
QASLLPKAERGELVAKAIDAFPQFVRFPNTLEEVAALESIEECWFARHLL
ACFYYNKRSYGKAIALWQRCVEMSPEFADGWRGLAIHAWNKQHDYELAAR
YLDNAYQLALQDARLLFERDLLDKLSGVTPEKRLARLENNLEIALKRDDM
TAELLNLWHLTGQADKAADILATRKFHPWEGGEGKVTSQFILNQLLRAWQ
HLDARQPQQASELLHAALHYPENLSEGRLPGQTDNDIWFWQAVCANAQGD
ETEATRCLRLAATGDRTINIHSYYNDQPVDYLFWQGMALRLLGEQHTAQQ
LFSEMKQWAKEMAKTSIEADFFAVSQPDLLSLYSDLQQQHKEKCLMVAML
AAAGLGEVAHYESARAELMAINPAWPKAALFTTVMPFIFSYVH
>ECP_3088 hypothetical oxidoreductase YghA
MSHLKDPTTQYYTGEYPKQKQPTPGIQAKMTPVPDCGEKTYVGSGRLKDR
KALVTGGDSGIGRAAAIAYAREGADVAISYLPVEEEDAQDVKKIIEECGR
KAVLLPGDLSDEKFARSLVHEAHKALGGLDIMALVAGKQVAIPDIADLTS
EQFQKTFAINVFALFWLTQEAIPLLPKGASIITTSSIQAYQPSPHLLDYA
ATKAAILNYSRGLAKQVAEKGIRVNIVAPGPIWTALQISGGQTQDKIPQF
GQKTPMKRAGQPAELAPVYVYLASQESSYVTAEVHGVCGGEHLG
>ECP_3252 polyribonucleotide nucleotidyltransferase
MRRRSGINTSTVRYCLRKRKDITLLNPIVRKFQYGQHTVTLETGMMARQA
TAAVMVSMDDTAVFVTVVGQKKAKPGQDFFPLTVNYQERTYAAGRIPGSF
FRREGRPSEGETLIARLIDRPIRPLFPEGFVNEVQVIATVVSVNPQVNPD
IVAMIGASAALSLSGIPFNGPIGAARVGYINDQYVLNPTQDELKESKLDL
VVAGTEAAVLMVESEAELLSEDQMLGAVVFGHEQQQVVIQNINELVKEAG
KPRWDWQPEPVNEALNARVAALAEARLSDAYRITDKQERYAQVDVIKSET
IATLLAEDETLDENELGEILHAIEKNVVRSRVLAGEPRIDGREKDMIRGL
DVRTGVLPRTHGSALFTRGETQALVTATLGTARDAQVLDELMGERTDTFL
FHYNFPPYSVGETGMVGSPKRREIGHGRLAKRGVLAVMPDMDKFPYTVRV
VSEITESNGSSSMASVCGASLALMDAGVPIKAAVAGIAMGLVKEGDNYVV
LSDILGDEDHLGDMDFKVAGSRDGISALQMDIKIEGITKEIMQVALNQAK
GARLHILGVMEQAINAPRGDISEFAPRIHTIKINPDKIKDVIGKGGSVIR
ALTEETGTTIEIEDDGTVKIAATDGEKAKHAIRRIEEITAEIEVGRVYNG
KVTRIVDFGAFVAIGGGKEGLVHISQIADKRVEKVTDYLQMGQEVPVKVL
EVDRQGRIRLSIKEATEQSQPAAAPEAPAAEQGE
>ECP_3923 phosphate transport system protein PhoU
MDSLNLNKHISGQFNAELESIRTQVMTMGGMVEQQLSDAITAMHNQDSDL
AKRVIEGDKNVNMMEVAIDEACVRIIAKRQPTASDLRLVMVISKTIAELE
RIGDVADKICRTALEKFSQQHQPLLVSLESLGRHTIQMLHDVLDAFARMD
IDEAVRIYREDKKVDQEYEGIVRQLMTYMMEDSRTIPSVLTALFCARSIE
RIGDRCQNICEFIFYYVKGQDFRHVGGDELDKLLAEKDSDK
>ECP_3964 dihydroxy-acid dehydratase
MPKYRSATTTHGRNMAGARALWRATGMTDADFGKPIIAVVNSFTQFVPGH
VHLRDLGKLVAEQIEAAGGVAKEFNTIAVDDGIAMGHGGMLYSLPSRELI
ADSVEYMVNAHCADAMVCISNCDKITPGMLMASLRLNIPVIFVSGGPMEA
GKTKLSDQIIKLDLVDAMIQGADPKVSDSQSDQVERSACPTCGSCSGMFT
ANSMNCLTEALGLSQPGNGSLLATHADRKQLFLNAGKRIVELTKRYYEQD
DESALPRNIASKAAFENAMTLDIAMGGSTNTVLHLLAAAQEAEIDFTMSD
IDKLSRKVPQLCKVAPSTQKYHMEDVHRAGGVIGILGELDRAGLLNRDVK
NVLGLTLPQTLEQYDIIVTQDDAVKNMFRAGPAGIRTTQAFSQDCRWDTL
DDDRSNGCIRSLEHAYSKDGGLAVLYGNFAENGCIVKTAGVDDSILKFTG
PAKVYESQDDAVEAILGGKVVAGDVVVIRYEGPKGGPGMQEMLYPTSFLK
SMGLGKACALITDGRFSGGTSGLSIGHVSPEAASGGSIGLIEDGDLIAID
IPNRGIQLQVSDAELAARREAQEARGNKAWTPKNRERQVSFALRAYASLA
TSADKGAVRDKSKLGG
>ECP_3794 hypothetical protein
MDETFWCENLKETTCWVERTIQMVNYRLISLALTLMEIPGVGPVITTAAV
STMGESSAFCGLCWSGA
>ECP_1212 hypothetical protein
MSYFSFEIILIPVKNIIPIITVTLILNYLNNSERSLVKQILIEDEIIVCA
TYLIPDI
>ECP_1746 hypothetical protein
MSLTQNALDTTTSKYRFLNDDFILCSFIYSTAWRGIFAFGVFPDHVKIDV
ARLFPASGHLVQVLPEYYQPANVWAVYVLKLSTSAKVRITVEFLRQYFAE
HYPNFSLEHA
>ECP_1225 truncated hemolysin E
MKNKLKSVQSFFTTLSNTVKQANKDIDAAKLKLTTEIAAIGEIKTETETT
RFYVDYDDLMLSLLKEAAKKIINTCNEYQKRHGKKTLFEVSEV
>ECP_4206 thiamine-phosphate pyrophosphorylase
MYQPEFPPVPFRLGLYPVVDSVQWIERLLDAGVRTLQLRIKDRRDEEVEA
DVVAAIALGRRYNARLFINDYWRLAIKHQAYGVHLGQEDLQATDLSAIRA
AGLRLGVSTHDDMEIDVALAERPSYIALGHVFPTQTKQMPSAPQGLEQLA
RHVERLADYPTVAIGGISLARAPAVIATGVGSIAVVSAITQAADWRLATA
QLLEIAGVGDE
>ECP_1013 PutA
MGTTTMGVKLDDATRERIKSAATRIDRTPHWLIKQAIFSYLEQLENSDTL
PELPALLSGAANESDEAPTPAEEPHQPFLDFAEQILPQSVSRAAITAAYR
RPETEAVSMLLEQARLPQPVAEQAHKLAYQLADKLRNQKNASGRAGMVQG
LLQEFSLSSQEGVALMCLAEALLRIPDKATRDALIRDKISNGNWQSHIGR
SPSLFVNAATWGLLFTGKLVSTHNEASLSRSLNRIIGKSGEPLIRKGVDM
AMRLMGEQFVTGETIAEALANARKLEEKGFRYSYDMLGEAALTAADAQAY
MVSYQQAIHAIGKASNGRGIYEGPGISIKLSALHPRYSRAQYDRVMEELY
PRLKSLTLLARQYDIGINIDAEEADRLEISLDLLEKLCFEPELAGWNGIG
FVIQAYQKRCPLVIDYLIDLATRSRRRLMIRLVKGAYWDSEIKRAQMDGL
EGYPVYTRKVYTDVSYLACAKKLLAVPNLIYPQFATHNAHTLAAIYQLAG
QNYYPGQYEFQCLHGMGEPLYEQVTGKVADGKLNRPCRIYAPVGTHETLL
AYLVRRLLENGANTSFVNRIADTSLPLDELVADPVTAVEKLAQQEGQTGL
PHPKIPLPRDLYGHGRDNSAGLDLANEHRLASLSSALLNSALQKWQSLPM
LEQSVAAGEMSPVINPAEPKDIVGYVREATPREVEQALESAVNNAPIWFA
TPPAERAAILHRAAVLMESQMQQLIGILVREAGKTFSNAIAEVREAVDFL
HYYAGQVRDDFANETHRPLGPVVCISPWNFPLAIFTGQIAAALAAGNSVL
AKPAEQTPLIAAQGIAILLEAGVPPGVVQLLPGRGETVGAQLTGDDRVRG
VMFTGSTEVATLLQRNIASRLDAQGRPIPLIAETGGMNAMIVDSSALTEQ
VVIDVLASAFDSAGQRCSALRVLCLQDEIADHTLKMLRGAMAECRMGNPG
RLTTDIGPVIDSEAKANIERHIQTMRSKGRQVFQAVRENSEDAREWQSGT
FVAPTLIELDDFAELQKEVFGPVLHVVRYNRNQLPELIEQINASGYGLTL
GVHTRIDETIAQVTGSAQVGNLYVNRNMVGAVVGVQPFGGEGLSGTGPKA
GGPLYLYRLLANRPESALAVTLARQDAEYPVDAQLKAALTQPLNALREWA
ANRPELQALCTQYGELAQAGTQRLLPGPTGERNTWTLLPRERVLCIADDE
QDALTQLAAVLAVGSQVLWPDDALHRQLVKALPSAVSERIQLAKAENITA
QPFDAVIFHGDSDQLRALCEAVAARDGAIVSVQGFARGESNILLERLYIE
RSLSVNTAAAGGNASLMTIG
>ECP_3060 AMP-dependent synthetase (putative polyketide synthase)
MSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQ
TLKARAEAGAKRLLSLNLKKGDRVALIAETSSGFVEAFFACQYAGLVAVP
LAIPMGVGQRDSWSAKLQGLLASCQPAAIITGDEWLPLVNAATHNNPELH
VLSHVWFKALPEADVALQRPVPNDIAYLQYTSGSTRFPRGVIITHREVMA
NLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLTPVATQLSVDYLRTQ
DFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLSCWRV
AGIGAEPISAEQLHQFAECFRQVNFDDKTFMPCYGLAENALAVSFSDEAS
GVVVNEVDRDILEYQGKAVAPDAETRAVSTFVNCGKALPEHGIEIRNEAG
IPVAERVVGHICISGPSLMSGYFGDQISQDEIAATGWLDTGDLGYLLDGY
LYVTGRIKDLIIIRGRNIWPQDIEYIAEQEPEIHSGDAIAFVTAQEKIIL
QIQCRISDEERRGQLIHALAARIQSEFGVTADIDLLPPHSIPRTSSGKPA
RAEAKKRYQKAYAASLHVQESLA
>ECP_2889 hypothetical protein
MVLWQSDLRVSWRAQWLSLLIHGLVAAVILLIPWPLSYTPLWMVLLSLVV
FDCVRSQRRINARQGEIRLLMDGRLRWQGQEWSIVKAPWMIKSGMMLRLR
SDSGKRQHLWLAADSMDEAEWRDLRRILLQQETQR
>ECP_2559 phosphoribosylformylglycinamidine synthase
MMEILRGSPALSAFRINKLLARFQAARLPVHTIYAEYVHFADLNAPLNDD
EHAQLERLLKYGPALASHAPQGKLLLVTPRPGTISPWSSKATDIAHNCGL
QQVNRLERGVAYYIEAGTLTNEQWQQVTAELHDRMMETVFFALDDAEQLF
AHHQPTPVTSVDLLGQGRQALIDANLRLGLALAEDEIDYLQDAFTKLGRN
PNDIELYMFAQANSEHCRHKIFNADWVIDGEQQPKSLFKMIKNTFETTPD
HVLSAYKDNAAVMEGSEVGRYFADHETGRYDFHQEPAHILMKVETHNHPT
AISPWPGAATGSGGEIRDEGATGRGAKPKAGLVGFSVSNLRIPGFEQPWE
EDFGKPERIVTALDIMTEGPLGGAAFNNEFGRPALNGYFRTYEEKVNSHN
GEELRGYHKPIMLAGGIGNIRADHVQKGEINVGAKLVVLGGPAMNIGLGG
GAASSMASGQSDADLDFASVQRDNPEMERRCQEVIDRCWQLGDANPILFI
HDVGAGGLSNAMPELVSDGGRGGKFELRDILSDEPGMSPLEIWCNESQER
YVLAVAADQLPLFDELCKRERAPYAVIGEATEELHLSLHDRHFDNQPIDL
PLDVLLGKTPKMTRDVQTLKAKGDALAREGITIADAVKRVLHLPTVAEKT
FLVTIGDRSVTGMVARDQMVGPWQVPVANCAVTTASLDSYYGEAMAIGER
APVALLDFAASARLAVGEALTNIAATQIGDIKRIKLSANWMAAAGHPGED
AGLYEAVKAVGEELCPALGLTIPVGKDSMSMKTRWQEGNEECEMTSPLSL
VISAFARVEDVRHTITPQLSTEDNALLLIDLGKGNNALGATALAQVYRQL
GDKPADVRDVAQLKGFYDAIQALVAQRKLLAYHDRSDGGLLVTLAEMAFA
GHCGIDADIATLGDDRLAALFSEELGAVIQVRAADREAVEAVLAQHGLAD
CVHYVGQAVSGDRFVITANGQTVFSESRTTLRVWWAETTWQMQRLRDNPE
CADQEHQAKSNDADPGLNVKLSFDINEDVAAPFIATGARPKVAVLREQGV
NSHVEMAAAFHRAGFDAIDVHMSDLLAGRTGLEGFHALVACGGFSYGDVL
GAGEGWAKSILFNDRVRDEFATFFHRPQTLALGVCNGCQMMSNLRELIPG
SELWPRFVRNTSDRFEARFSLVEVTQSPSLLLQGMVGSQMPIAVSHGEGR
VEVRDAAHLAALESKGLVALRYVDNFGKVTETYPANPNGSPNGITAVTTE
SGRVTIMMPHPERVFRTVSNSWHPENWGEDGPWMRIFRNARKQLG
>ECP_0884 hypothetical protein YbjT
MPQRILVLGASGYIGQHLVRTLSQQGHQILAAARHVDRLAKLQLANVSCH
KVDLNWPDNLPALLQNIDTVYFLVHSMGEGGDFIAQERQVALNVRDALRE
VPVKQLIFLSSLQAPPHEQSDHLRARQATADILREAGVPVTELRAGIIVG
AGSAAFEVMRDMVYNLPMLTPPRWVRSRTTPIALENLLHYLVALLDHPAC
EHRIFEAAGPEVLSYQQQFEHFMAVSGKRRWLLPIPLPTRWISVWFLNVI
TSVPPTTARALIQGLKHDLLADDTALRALIPQRLIAFDDAVRSTLKEEEK
LVNSSDWGYDAQAFARWRPEYGYFAKQAGFTVKTSASLAALWQVVNQIGG
KERYFFGNILWQTRALMDRAIGHKLAKGRPEREYLQTGDAVDSWKVIVVE
PQKQLTLLFGMKAPGLGRLCFTLEDKGDYRTIDVRAFWHPHGMPGLFYWL
LMIPAHLFIFRGMAKRIARLAEQSTD
>ECP_3912 hypothetical protein
MERKMATHFARGILTEGHLISVRLPSLCHQEARNIPPHRQSRFLASRGLL
AELMFMLYGIGELPEIVTLPKGKPVFSDKNLPSFSISYAGNMVSVALTTE
GECGLDMELQRATRGFHSPHAPDNHTFSSNESLWISKQNDPNEARAQLIT
LRRSVLKLTGDVLNDDPRDLQLLPIAGRLKCAHVNHVEALCDAEDVLVWS
VAVTPTIEKLSVWELDGKHGWKSLPDIHSRANNPTSRMMRFAQLSTVKAF
SPN
>ECP_2685 formate hydrogenlyase subunit 4
MSVLYPLIQALVLFAVAPLLSGITRVARARLHNRRGPGVLQEYRDIIKLL
GRQSVGPDASGWVFRLTPYVMVGVMLTIATALPVVTVGSPLPQLGDLITL
LYLFAIARFFFAISGLDTGSPFTAIGASREAMLGVLVEPMLLLGLWVAAQ
VAGSTNISNITDTVYHWPLSQSIPLVLALCACAFATFIEMGKLPFDLAEA
EQELQEGPLSEYSGSGFGVMKWGISLKQLVVLQMFVGVFIPWGQIETFTV
GGLLLALVIAIVKLVVGVLVIALFENSMARLRLDITPRITWAGFGFAFLA
FVSLLAA
>ECP_2593 IS putative transposase
MQQLGLKSPVRLKKYRSYRGNMGLAAENILQRQFKAEAPCEKWVTDITEF
RAGGQKLYLSPILDLFNGEIVAWETACRPTEELVKRMLNKGLESLAEGEK
PLLHSDQGWHYRIKSYQSALADRGLVQSMSRKGNCLDNAVMENFFGHLKE
EMYYRRDYRNVEELENAVNEYITYWNQKRIKLSLGGLSPVEYRTEYQKAG
>ECP_1273 nitrite extrusion protein
MSHSSAPERATGAVITDWRPEDPAFWQQRGQRIASRNLWISVPCLLLAFC
VWMLFSAVAVNLPKVGFNFTTDQLFMLTALPSVSGALLRVPYSFMVPIFG
GRRWTAFSTGILIIPCVWLGFAVQDTSTPYSVFIIISLLCGFAGANFASS
MANISFFFPKQKQGGALGLNGGLGNMGVSVMQLVAPLVVSLSIFAVFGSQ
GVKQPDGTELYLANASWVWVPFLAIFTIAAWFGMNDLATSKASIKEQLPV
LKRGHLWIMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFI
GALARSAGGALSDRLGGTRVTLVNFILMAIFSGLLFLTLPTDGQGGSFMA
FFAVFLALFLTAGLGSGSTFQMISVIFRKLTMDRVKAEGGSDERAMREAA
TDTAAALGFISAIGAIGGFFIPKAFGSSLALTGSPVGAMKVFLIFYIACV
VITWAVYGRHSKK
>ECP_0035 carnitine operon protein CaiE
MSYYAFEGLIPVVHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDYGRL
IVQAGANIQDGCIMHGYCDTDTIVGENGHIGHGAILHGCVIGRDALVGMN
SVIMDGAVIGEESIVAAMSFVKAGFRGEKRQLLMGTPARAVRSVSDDELH
WKRLNTKEYQDLVGRCHAALHETQPLRQMEENRPRLQGTTDVTPKR
>ECP_0021 transcriptional activator protein NhaR
MSMSHINYNHLYYFWHVYKEGSVVGAAEALYLTPQTITGQIRALEERLQG
KLFKRKGRGLEPSELGELVYRYADKMFTLSQEMLDIVNYRKESNLLFDVG
VADALSKRLVSSVLDAAVVEDEQIHLRCFESTHEMLLEQLSQHKLDMIIS
DCPIDSTQQEGLFSMKIGECGVSFWCTNPLPEKPFPACLEERRLLIPGRR
SMLGRKLLNWFNSQGLNVEILGEFDDAALMKAFGATHNAIFVAPSLYAND
FYNDDSVVEIGRVENVMEEYHAIFAERMIQHPAVQRICNTDYSALFTPAS
K
>ECP_2683 formate hydrogenase-3 component F
MFTFIKKVIKTGTATSSYPLEPIAVDKNFRGKPEQNPQQCIGCAACVNAC
PSNALTVETDLATGELAWQFNLGRCIFCGRCEEVCPTAAIKLSQEYELAV
WKKEDFLQQSRFALCNCRVCNRPFAVQKEIDYAIALLKHNGDSRAENHRE
SFETCPECKRQKCLVPSDRIELTRHMKEAI
>ECP_2624 succinate-semialdehyde dehydrogenase
MKLNDSNLFRQQALINGEWLDANNGEVIDVTNPANGDKLGSVPKMGADET
RAAIDAANRALPAWRALTAKERANILRNWFNLMMEHQDDLARLMTLEQGK
PLAEAKGEISYAASFIEWFAEEGKRIYGDTIPGHQADKRLIVIKQPIGVT
AAITPWNFPAAMITRKAGPALAAGCTMVLKPASQTPFSALALAELAIRAG
VPAGVFNVVTGSAGAVGNELTSNPLVRKLSFTGSTEIGRQLMEQCAKDIK
KVSLELGGNAPFIVFDDADLDKAVEGALASKFRNAGQTCVCANRLYVQNG
VYDRFAEKLQQAVSKLHIGNGLEKGVTIGPLIDEKAVAKVEEHIADALEK
GARVVCGGKAHERGGNFFQPTILVDVPANAKVSKEETFGPLAPLFRFKDE
ADVIAQANDTEFGLAAYFYARDLSRVFRVGEALEYGIIGINTGIISNEVA
PFGGIKASGLGREGSKYGIEDYLEIKYMCIGL
>ECP_2269 hypothetical protein YfaP precursor
MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPAEGEDASFSQTINY
PASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDG
SFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTLGAGAIRARLRLVLSWD
TDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPV
HGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGEL
TLVKSFDW
>ECP_1310 tryptophan biosynthesis protein trpCF
MQTVLAKIVADKAIWVETRKQQQPLASFQNEIQPSTRHFYDALQGARTAF
ILECKKASPSKGVIRDDFDPARIAAIYKHYASAISVLTDEKYFQGSFDFL
PIVSQIAPQPILCKDFIIDPYQIYLARYYQADACLLMLSVLDDEQYRQLA
TVAHSLEMGVLTEVSNEEELERAIALGAKVVGINNRDLRDLSIDLNRTRE
LAPKLGHNVTVISESGINTYAQVRELSHFTNGFLIGSALMAHDDLNAAVR
RVLLGENKVCGLTREQDAKAAYDAGAIYGGLIFVATSPRCVNVEQAQEVM
AAAPLQYVGVFRNHDIADVVDKAKVLSLAAVQLHGNEDQLYIDTLREALP
AHVAIWKALSVGETLPARDFQHIDKYVFDNGQGGSGQRFDWSLLNGQSLG
NVLLAGGLGADNCVEAAQTGCAGLDFNSAVESQPGIKDARLLASVFQTLR
AY
>ECP_1261 hypothetical protein
MTSFSTLLSVHLISIALSVGLLTLRFWLRYQKHPRAFARWTRIVPPVVDT
LLLLSGIALMAKAHIQPFSGQAQWLTEKLFGVIIYIVLGFIALDYRRMHS
QQTRIIAFPLALVVLYIIIKLATTKVPLLG
>ECP_2625 4-aminobutyrate aminotransferase
MSSNKELMQRRSQAIPRGVGQIHPIFADRAENCRVWDVEGREYLDFAGGI
AVLNTGHLHPKVVAAVEAQLKKLSHTCFQVLAYEPYLELCEIMNQKVPGD
FAKKTLLVTTGSEAVENAVKIARAATKRSGTIAFSGAYHGRTHYTLALTG
KVNPYSAGMGLMPGHVYRALYPCPLHGISEDDAIASIHRIFKNDAAPEDI
AAIVIEPVQGEGGFYAATPAFMQRLRALCDEHGIMLIADEVQSGAGRTGT
LFAMEQMGVAPDLTTFAKSIAGGFPLAGVTGRAEVMDAVAPGGLGGTYAG
NPIACVAALEVLKVFEQENLLQKANDLGHKLKDGLLAIAEKHPEIGDIRG
LGAMIAIELFEDGDHSKPDAKLTAEIVARARDKGLILLSCGPYNNVLRIL
VPLTIEDAQIRQGLEIISQCFAEAKQ
>ECP_0612 hypothetical protein
MPLPDFHVSEPFTLGIELEMQVVNPPGYDLSQDSSMLIDTVKNQITAGEV
KHDITESMLELATDVCRDINQAAGQFSAMQKVVLQAAADHHLEICGGGTH
PFQKWQRQEVCDNERYQRTLENFGYLIQQATVFGQHVHVGCASGDDAIYL
LHGLSRFVPHFIALSAASPYMQGTDTRFASSRPNIFSAFPDNGPMPWVSN
WQQFEALFRCLSYTTMIDSIKDLHWDIRPSPHFGTVEVRVMDTPLTLSHA
VNMAGLIQATAHWLLTERPFKHQEKDYLMYKFNRFQACRYGLEGVITDPH
TGDRRSLTEATLRLLEKIAPSAHKIGASSAIEALHRQVVSGLNEAQLMRD
FVADGGSLIGLVKKHCEIWAGE
>ECP_1196 hypothetical protein
MIFNDKNHFVVFLLAKINFIQPFDFIDVRFYFGERFSLPFISKLQVYILL
TSQYM
>ECP_2261 sensor protein RcsC
MKYLASFRTTLKASRYMFRALALVLWLLIAFSSVFYIVNALHQRESEIRQ
EFNLSSDQAQRFIQRTSDVMKELKYIAENRLSAENGVLSPRGRETQTDVP
AFEPLFADSDCSAMSNTWRGSLESLAWFMRYWRDNFSAAYDLNRVFLIGS
DNLCMANFGLRDMPVERDTALKALHERINKYRNAPQDDSGSNLYWISEGP
RPGVGYFYALTPVYLANRLQALLGVEQTIRMENFFLPGTLPMGVTILDEN
GHTLISLTGPESKIKGDPRWMQERSWFGYTEGFRELVLKKNLPPSSLSIV
YSVPVDKVLERIRMLILNAILLNVLAGAALFTLARMYERRIFIPAESDAL
RLEEHEQFNRKIVASAPVGICILRTADGVNILSNELAHTYLNMLTHEDRQ
RLTQIICGQQVNFVDVLTSNNTNLQISFVHSRYRNENVAICVLVDVSSRV
KMEESLQEMAQAAEQASQSKSMFLATVSHELRTPLYGIIGNLDLLQTKEL
PKGVDRLVTAMNNSSSLLLKIISDILDFSKIESEQLKIEPREFSPREVMN
HITANYLPLVVRKQLGLYCFIEPDVPVALNGDPMRLQQVISNLLSNAIKF
TDTGCIVLHVRADGDYLSIRVRDTGVGIPAKEVVRLFDPFFQVGTGVQRN
FQGTGLGLAICEKLISMMDGDISVDSEPGMGSQFTVRIPLYGAQYPQKKG
VEGLSGKRCWLAVRNASLCQFLETSLQRSGIVVTTYEGQEPTPEDVLITD
EVVSKKWQGRAVVTFCRRHIGIPLEKAPGEWVHSVAAPHELPALLARIYL
IEMESDDPANALPSTDKAVSDNDDMMILVVDDHPINRRLLADQLGSLGYQ
CKTANDGVDALNVLNKNHIDIVLSDVNMPNMDGYRLTQRIRQLGLTLPVI
GVTANALAEEKQRCLESGMDSCLSKPVTLDVIKQTLTVYAERVRKSRES
>ECP_2454 hypothetical protein YfdZ
MKSTEFHPVHYDAHGRLRLPLLFWLVLLLQARTWVLFVIAGASREQGTAL
LNLFYPDHDNFWLGLIPGIPAVLAFLLSGRRASFPRIWHVLYFLLLLAQV
VLLCWQPWLWLNGESVSGIGLALVVADIVALIWLLTNRRLRACFNEEKE
>ECP_2634 hypothetical protein YgaC
MYLRPDEVARVLEKVGFTVDVVTQKAYGYRRGENYVYVNREARMGRTDLV
IHPTLKERSSTLAEPASDIKTCDHYQQFPLYLAGERHEHYGIPHGFSSRV
ALERYLNGLFGEAS
>ECP_4077 oxygen-independent coproporphyrinogen III oxidase
MSVQQIDWDLALIQKYNYSGPRYTSYPTALEFSEDFGEQAFLQAVARYPE
RPLSLYVHIPFCHKLCYFCGCNKIVTRQQHKADQYLDALEQEIVHRAPLF
AGRHVSQLHWGGGTPTYLNKAQISRLMKLLRENFQFNADAEISIEVDPRE
IELDVLDHLRAEGFNRLSMGVQDFNKEVQRLVNREQDEEFIFALLNHARE
IGFTSTNIDLIYGLPKQTPESFAFTLKRVAELNPDRLSVFNYAHLPTIFA
AQRKIKDADLPSPQQKLDILQETIAFLTQSGYQFIGMDHFARPDDELAVA
QREGVLHRNFQGYTTQGDTDLLGMGVSAISMIGDCYAQNQKELKQYYQQV
DEQGNALWRGIALTRDDCIRRDVIKSLICNFRLDYAPIEQQWDLLFADYF
AEDLKLLAPLAKDGLVDVDEKGIQVTAKGRLLIRNICMCFDIYLRQKARM
QQFSRVI
>ECP_0030 hypothetical protein
MIFGPFIFSVMILNLMQMLKITCFVFCFCCFNVNFDHLVHFFLLILISCN
LLAAQAFSRTCYMIFLSLNACKTCMSHKII
>ECP_0242 hypothetical protein
MGNYHIGDGECRFRVVCSSPDVCEVGGYKVPFDSYQTLDSERQYSSTVWA
RGCRALNVGSVIAGTQSNAGKGVISGTSQGTGDCVILTGSPTVTIEGKPV
AYHGSVVGINNHNCLGKLYTKIKSPMISVIDRTFNYERTAEVIHDLLLLK
DLLSVGNIFDGDISPEVKNDLFQIKDPDQSWGEFFSIKNIRESLRNGIEG
DKSQIREWFGENTLTQMGNGAITTLHGVADLALVTFDALLDTATATVACP
IGEDGLCEQANINLNEKEQALFNISNSLINGQAWDALKKMIMDTNNGDQI
ALEHFASFLWGFMIPAKIPEENISGKVFVEPVVLEGGAGGNWTVFDEVLD
SNVIKQLTLTGCGAACGEMLLRDRYIFVTQNVIGTELTSMTSLANKLNKF
DVGWEGNAVNESSLYALSNTGSWGAMMWDSGSKVGHWVLVKGVDDAGNVI
IYDPYQGSRYLMTEQEFKEVWNGHSVYKP
>ECP_3013 putative radC-like protein YeeS
MQQLSFLPGEMTPGERSLILRALKTLDRHLHEPGVAFTSTRAAREWLILN
MAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRALYHNA
AAVVLAHNHPSGEVTPSKADRLITERLVQALALVDIRVPDHLIVGGSQVF
SFAEHGLL
>ECP_0257 hypothetical protein
MRRERLIRPAGEHGFVGMIRRVAAHLAPVHNCRMRRERLIRPTGERGFVG
MIRRVCVASGIGAQPPDAA
>ECP_2828 putative transcriptional antiterminator protein
MIIEKVMNNNCVQASMNGQEVIISGPGVGYNKKYGMSVPEHPANRIFYVR
NEQKNKLYKLIEHVDIEYVFVAEKIVQYAEKNLEKNLNPSLLLILADHIS
NAISRVVSGIQINNVFLDEIKALYKAEYAISRDALTIINEQFSVQLPDDE
IGFIALHILNNYENSVDYESVRIIELSQIITGLIEVVYNRKVDRSSFNYS
RFMMHLKYFSSRVLCNEKIKQKDIGDIYEQFLEKDILLQRAIHEIERYLY
ATFKYELILEEKLYLSIRTKVLMD
>ECP_2985 hypothetical protein
MWAFSRRPFNVEVIFCLPIQDRDKSHIRLLVNDDQRLE
>ECP_2444 cysteine synthase B
MNTLEQTIGNTPLVKLQRMAPDNGSEVWLKLEGNNPAGSVKDRAALSMIV
EAEKRGEIKPGDVLIEATSGNTGIALAMIAALKGYRMKLLMPDNMSQERR
AAMRAYGAELILVTKEQGMEGARDLALEMANRGEGKLLDQFNNPDNPYAH
YTTTGPEIWQQTGGRITHFVSSMGTTGTITGVSRFMREQSKPVTIVGLQP
EEGSSIPGIRRWPAEYLPGIFNASLVDEVLDIHQRDAENTMRELAVREGI
FCGVSSGGAVAGALRVAKANPGAVVVAIICDRGDRYLSTGVFGEEHFSQG
AGI
>ECP_3090 biopolymer transport ExbB protein
MQTDLSVWGMYQHADIVVKCVMIGLILASVVTWAIFFSKSVEFFNQKRRL
KREQQLLAEARSLNQANDIAADFGSKSLSLHLLNEAQNELELSEGSDDNE
GIKERTSFRLERRVAAVGRQMGRGNGYLATIGAISPFVGLFGTVWGIMNS
FIGIAQTQTTNLAVVAPGIAEALLATAIGLVAAIPAVVIYNVFARQIGGF
KAMLGDVAAQVLLLQSRDLDLEASAAAHPVRVAQKLRAG
>ECP_2596 hypothetical protein YfiH
MSKLIVPQWPLPKGVAACSSTRIGGVSLPPYDSLNLGAHCGDNPDHVEEN
RKRLFAAGNLPSKPVWLEQVHGKDVLKLTGEPYASKRADASYSNTPGTVC
AVMTADCLPVLFCNRAGTEVAAAHAGWRGLCAGVLEETVSCFADNPENIL
AWLGPAIGPRAFEVGAEVREAFMVVDAKSSTAFIQHGDKYLADIYQLARQ
RLASVGVEQIFGGDRCTYTENETFFSYRRDKTTGRMASFIWLI
>ECP_4569 hypothetical protein
MSSRQILEHYNALTYPLHQSILLQIMTSNLLSVCTGKSIYEDISGSSWNI
IHFNIPLPISRARLSIFSYCVRIKPWMSMDYM
>ECP_3143 glutamate-ammonia-ligase adenylyltransferase
MKPLSSPLQQYWQTVVERLPEPLAEESLSAQAKSVLTFSDFVQDSVIAHP
EWLTELESQPPQADEWQHYAAWWQEALSNVSDEAGLMRELRLFRRRIMVR
IAWVQTLALVTEESILQQLSHLAETLIVAARDWLYDACCREWGTPCNAQG
EAQPLLILGMGKLGGGELNFSSDIDLIFAWPEHGCTQGGRRELDNAQFFT
RMGQRLIKVLDHQRRMASSIAWICDCVRLAKVARWC
>ECP_0524 acriflavin resistance protein A
MTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETARINLAY
TKVTSPISGRIGKSNVTEGALVQNGQATALATVQQLDPIYVDVTQSSNDF
LRLKQELANGTLKQENGKAKVSLITSDGIKFPQDGTLEFSDVTVDQTTGS
ITLRAIFPNPDHTLLPGMFVRARLEEGLNPNAILVPQQGVTRTPRGDATV
LVVGADDKVETRPIVASQAIGDKWLVTEGLKAGDRVVISGLQKVRPGVQV
KAQEVTADNNQQAASGAQPEQSKS
>ECP_2605 hypothetical protein YfiN
MMDNDNSLHKRPTFKRALRNISMTSIFITMTLIWLLLSVTSVLTLKQYAQ
KNLALTAATMTYSLEAAVVFADGPAATETLAALGQQGQFSTAEVRDKQQN
ILASWHYTHKEPGDTFSNFISHWLFPAPIIQPIRHNGETIGEVRLTARDS
SISHFIWFSLAVLTGCILLASGIAITLTRHLHNGLVEALKNITDVVHDVR
SNRNFSRRVSEERIAEFHRFALDFNSLLDEMEEWQLRLQAKNAQLLRTAL
HDPLTGLANRAAFRSGINTLMNNSDARKTSALLFLDGDNFKYINDTWGHA
TGDRVLIEIAKRLAEFGGLRHKAYRLGGDEFAMVLYDVQSESEVQQICSA
LTQIFNLPFDLHNGHQTTMTLSIGYAMTIEHASAENLQELADHNMYQAKH
QRAEKLVR
>ECP_2365 hypothetical protein YfcM
MNSTHHYEQLIEIFNSCFADEFNTRLIKGDDEPIYLPADAEVPYNRIVFA
HGFYASAIHEISHWCIAGKARRELVDFGYWYCPDGRDAQTQSQFEDVEVK
PQALDWLFCVAAGYPFNVSCDNLEGDFEPDRVVFQRRVHAQVMDYLANGI
PERPARFIKALQNYYHTPELTAEQFPWPEALN
>ECP_3302 glutamate synthase [NADPH] small chain
MSQNVYQFIDLQRVDPPKKPLKIRKIEFVEIYEPFSEGQAKAQADRCLSC
GNPYCEWKCPVHNYIPNWLKLANEGRIFEAAELSHQTNTLPEVCGRVCPQ
DRLCEGSCTLNDEFGAVTIGNIERYINDKAFEMGWRPDMSGVKQTGKKVA
IIGAGPAGLACADVLTRNGVKAVVFDRHPEIGGLLTFGIPAFKLEKEVMA
RRREIFTGMGIEFKLNTEVGRDVQLDDLLSDYDAVFLGVGTYQSMRGGLE
NEDADGVYAALPFLIANTKQLMGFGETREEPFVSMEGKRVVVLGGGDTAM
DCVRTSVRQGAKHVTCAYRRDEENMPGSRREVKNAREEGVEFKFNVQPLG
IEVNGNGKVSGVKMVRTEMGEPDAKGRRRAEIVAGSEHIVPADAVIMAFG
FRPHSMEWLAKHSVELDSQGRIIAPEGNDNAFQTSNPKIFAGGDIVRGSD
LVVTAIAEGRKAADGIMNWLEV
>ECP_0041 probable flavoprotein subunit, carnitine metabolism
MKIITCYKCVPDEQDIAVNNADGSLDFSKADAKISQYDLNAIEAACQLKQ
QAVEAQVTALSVGGKALTNAKGRKDVLSRGPDELIVVIDDQFEQALPQQT
ASVLAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSKII
SLTADTLTVERELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAK
KPVQVWSAADIGFNAEAAWSEQQVAAPKQRERQRIVIEGDGEEQIAAFAE
NLRKVI
>ECP_2147 hypothetical outer membrane usher protein YehB precursor
MLRMTPLASAIVALLIGIEAYAAEETFDTHFMIGGMKDQQVSNIRLEDNQ
PLPGQYDIDIYVNKQWRGKYEIIVKDNPQETCLSREMIKRLGINTDNFAS
GKQCLTFKQLIQGGSYTWDIGVFRLDFSVPQAWVEELESGYVPPENWERG
INAFYTSYYVSQYYSDYKASGNSKSTYVRFNSGLNLLGWQLHSDASFSKT
NNNPGGWKSNTLYLERGFAQLLGTLRVGDMYTSSDIFDSVRFSGVRLFRD
MQMLPNSKQNFTPRVQGIAQSNALVTIEQNGFVVYQKEVPPGPFAITDLQ
LAGGGADLDVSVKEADGSVTTYLVPYAAVPNMLQPGVSKYDFAAGRSHIE
GASKQSDFVQAGHQYGFNNLLTLYGGSMVANNYYAFTLGTGWNTRIGAIS
VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRT
FNDHVWANNKDNYRRDENDIYDIADYYQNDFGRKNSFSANMSQSLPEGWG
SVSLSTLWRDYWGRSGSSKDYQLSYSNNLRRISYTLAASHAYDENHHEEK
RFNIFISIPFDWGDDVTTPRRQIYMSKSTTFDDQGVASNNTGLSGTVGSR
DQFNYGVNLSYQYQGNETTAGANLTWNAPVATVNGSYSQSSAYRQAGASV
SGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYRTTNRNGVVVY
DGMTPYRENYLMLDVSQSDSEAELRGNRKIAAPYRGAVVLVNFDTDQRKP
WFIKALRADGQPLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEVPPSVNVA
IDKQQGLSCTITFGKEIDESKNYICQ
>ECP_2466 ethanolamine utilization protein EutJ
MAHDEQWLTPRLQTAATLCNQTPAATESPLWLGVDLGTCDVVSMVVDRDG
QPVAVCLDWADVVRDGIVWDFFGAVTIVRRHLDTLEQQFGRRFSHVATSF
PPGTDPRISINVLESAGLEVSHVLDEPTAVADLLQLDNAGVVDIGGGTTG
IAIVKKGKVTYSADEATGGHHISLTLAGNRRISLEEAEQYKRGHGDEIWP
AVKPVYEKMADIVARHIEGQGITDLWLAGGSCMQPGVAALFCKQFPALQV
HLPQHSLFMTPLAIASSGREKAEGIYAK
>ECP_3792 hypothetical protein
MRLADNAGAGLSDNKKHHRSHPCLTHRSSERPEPILSGMVGLAIPVCFIH
PAALLIVMESSSPPGADTVILL
>ECP_2518 hypothetical protein YfgM (putative membrane protein)
MEIYENENDQVEAVKRFFAENGQALAVGVILGVGALIGWRYWNSHQVDSA
RSASLAYQNAVTAVSEGKPDSIPAAEKFAAENKNTYGALASLELAQQFVD
KNELEKAAAQLQQGLADTSDENLKAVINLRLARVQVQLKQADAALKTLDT
IKGEGWAAIVADLRGEALLSKGDKQGARSAWEAGVKSDVTPALSEMMQMK
INNLSI
>ECP_3870 putative xanthine/uracil permeases family
MIIITEPLLSFVLQKQGIKSPPMDKKMNNDNTDYVSNESGTLSRLFKLPQ
HGTTVRTELIAGMTTFLTMVYIVFVNPQILGAAQMDPKVVFVTTCLIAGI
GSIAMGVFANLPVALAPAMGLNAFFAFVVVGAMGISWQTGMGAIFWGAVG
LFLLTLFRIRYWMISNIPLSLRIGITSGIGLFIALMGLKNTGVIVANKDT
LVMIGDLSSHGVLLGILGFFIITVLSSRHFHAAVLVSIVVTSCCGLFFGD
VHFSGVYSIPPDISGVIGEVDLSGALSLELAGIIFSFMLINLFDSSGTLI
GVTDKAGLIDSNGKFPNMNKALYVDSVSSVAGAFIGTSSVTAYIESTSGV
AVGGRTGLTAVVVGVMFLLVMFFSPLVAMVPPYATAGALIFVGVLMTSSL
ARVNWDDFTESVPAFITTVMMPFTFSITEGIALGFMSYCIMKVCTGRWRD
LNLCVVVVATLFALKIILVD
>ECP_2554 flavohemoprotein
MLDAQTIATVKATIPLLVETGPKLTAHFYDRMFTHNPELKEIFNMSNQRN
GDQREALFNAIAAYAGNIENLPALLPAVEKIAQKHTSFQIKPEQYNIVGE
HLLATLDEMFSPGQEVLDAWGKAYGVLANVFINREAEIYNENASKAGGWE
GTRDFRIVAKTPRSALITSFELEPVDGGAVAEYRPGQYLGVWLKPEGFPH
QEIRQYSLTRKPDGKGYRIAVKREEGGQVSNWLHNHANVGDVVKLVAPAG
DFFMAVADDTPVTLISAGVGQTPMLAMLDTLAKAGHTAQVNWFHAAENGD
VHAFADEVKELGLSLPRFTAHTWYRQPNEADRAKGQFDSEGLMDLSKLEL
AFSDPTMQFYLCGPVGFMQFAAKQLVDLGVKQENIHYECFGPHKVL
>ECP_4785 sensor protein CreC
MRIGMRLLLGYFLLVAVAAWFVLAIFVKEVKPGVRRATEGTLIDTATLLA
ELARPDLLSGDPTHGQLAQAFNQLQHRPFRANIGGINKVRNEYHVYMTDA
QGKVLFDSANKAVGQDYSRWNDVWLTLRGQYGARSTLQNPADPESSVMYV
AAPIMDGSRLIGVLSVGKPNAAMAPVIKRSERRILWASAILLGIALVIGA
GMVWWINRSIARLTRYADSVTDNKPVPLPDLGSSELRKLAQALESMRVKL
EGKNYIEQYVYALTHELKSPLAAIRGAAEILREGPPPEVVARFTDNILTQ
NARMQALVETLLRQARLENRQEVVLTAVDVAALFRRVSEARTVQLAEKNI
TLHVMPTEINVAAEPALLEQALGNLLDNAIDFTPESGRITLSAEVDQEYV
TLKVLDTGSGIPDYALSRIFERFYSLPRANGQKSSGLGLAFVSEVARLFN
GEVTLRNVQEGGVLASLRLHRHFT
>ECP_2406 hypothetical protein YpdA
MLLAVFDRAALMLICLFFLIRIRLFRELLHKSAHSPKELLAVTAIFSLFA
LFSTWSGVPVEGSLVNVRIIAVMSGGILFGPWVGIITGVIAGIHRYLIDI
GGVTAIPCFITSILAGCISGWINLKIPKAQRWRVGILGGMLCETLTMILV
IVWAPTTALGIDIVSKIGIPMILGSVCIGFIVLLVQSVEGEKEASAARQA
KLALDIANKTLPLFRHVNSESLRKVCEIIRDDIHADAVAMTNTDHVLAYV
GVGEHNYQNGDDFISPTTRQAMNYGKIIIKNNDEAHRTPEIHSMLVIPLW
EKGVVTGTLKIYYCHAHQITSSLQEMAVGLSQIISTQLEVSRAEQLREMA
NKAELRALQSKINPHFLFNALNAISSSIRLNPDTARQLIFNLSRYLRYNI
ELKDDEQIDIKKELYQIKDYIAIEQARFGDKLTVIYDIDEEVNCCIPSLL
IQPLVENAIVHGIQPCKGKGVVTISVAECGNRVRIAVRDTGHGIDPKVIE
RVEANEMPGNKIGLLNVHHRVKLLYGEGLHIRRLEPGTEIAFYIPNQHTP
VASQATLLL
>ECP_1506 hypothetical protein YneF
MHVQPISTFRLFQEGHLLRNSIAIFVLTTLFYFIGAELRLVHELSLFWPL
NGVMAGVFARYVWLNRLHYYAISYVAMLVYDAITTEWGLVSLAINFSNMM
FIVTVALLVARDKRLGKNKYEPVSALRLFNYCLLAALLCAIVGAIGSVSI
DSLDFWPLLADWFSEQFSTGVLIVPCMLTLAIPGVLPRFKAEQMIPAIAL
IVSVIASVVIGGAGSLAFPLPALIWCAVRYTPQVTCLLTFVTGAVEVVLV
ANSVIDISVGSPFSIPQMFSARLGIATMAICPIMVSFSVAAINSLMKQVA
LRADFDFLTQVYSRSGLYEALKSPSLKQTQHLTVMLLDIDYFKSINDNYG
HECGDKVLSVFAQHIQKIVGDKGLVARMGGEEFAVAVPSVNPVDGLLMAE
KIRKGVELQPFTWQQKTLYLTVSIGVGSGCASYRTLTDDFNKLMVEADTC
LYRSKKDGRNRTSTMRYGEEVV
>ECP_0596 hypothetical protein YbcY precursor
MKKNTDDGAKIYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFLEHLG
NNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIKHKIS
HDVFDPYPAALHGQFDSISMFYLLHCLPGNISTKSCVIRNAAQALTDDGT
LYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFEN
VKTKVKGTVVMFSASGKK
>ECP_3336 hypothetical protein YhdA
MRLTTKFSAFVTLLTGLTIFVTLLGCSLSFYNAIQYKFSHRVQAVATAID
THLVSNDFSTLRPQITELMMSADIVRVDLLHGDKQVYTLARNGSYRPVGT
NDLFRELSVPLIKHPGMSLRLVYQDPMGNYFHSLMTTAPLTGAIGFIILM
LFLAVRWLQRQLAGQELLETRATRILNGERGSNVLGTIYEWPPRTSSALD
TLLCEIQNAREQHSRLDTLIRSYAAQDMKTGLNNRLFFDNQLATLLEDQE
KVGTHGIVMMIRLPDFNMLSDTWGHSQVEEQFFSLTNLLSTFMMRYPGAL
LARYHRSDFAALLPHRTLKEAESIASQLIKAVDTLPNNKMLDRDDMIHIG
ICAWRSGQDTEQVMEHAESATRNAGLQGGNSWAIYDDSLPEKGRGNVRWR
TLIEQMLSRGGPRLYQKPAVTREGRVHHRELMCRIFDGNEEVSSAEYMPM
VLQFGLSEEYDRLQISRLIPLLRYWPEENLAIQVTVESLIRPRFQRWLRD
TLMQCEKSQRRHIIIELAEADVGQHISRLQPVIRLVNALGVRVAVNQAGL
TLVSTSWIKELNVELLKLHPGLVRNIEKRTENQLLVQSLVEACSGTSTQV
YATGVRSRSEWQTLIQRGVTGGQGDFFASSQPLDTNVKKYSQRYSV
>ECP_3227 putative N-acetylgalctosamine-6-phosphate deacetylase
MTHVLRARRLLTEEGWLDDHQLRIADGVIAAIEPIPVGVTERDAELLCPA
YIDTHVHGGAGVDVMDDAPDVLDKLAMHKAREGVGSWLPTTVTAPLNTIH
AALERIAQRCQRGGPGAQVLGSYLEGPYFTPQNKGAHPPELFRELEIAEL
DQLIAVSQHTLRVVALAPEKEGALQAIRHLKQQNVRVMLGHSAATWQQTR
AAFDAGADGLVHCYNGMTGLHHREPGMVGAGLTDKRAWLELIADGHHVHP
AAMSLCCCCAKERIVLITDAMQAAGMPDGHYTLCGEEVQMRGGVVRTASG
GLAGSTLSVDAAVRNMVELTGVTPAEAIHMASLHPARMLGVDGVLGSLKP
GKRASVVALDSGLHVQQIWIQGQLASF
>ECP_0032 carbamoyl-phosphate synthase large chain
MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSN
PATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCAL
ELERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAH
SMEEALAVAAEVGFPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSP
TKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAMGIHTGDSITV
APAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEM
NPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEP
SIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALR
GLEVGATGFDPKVSLDDPEALTKIRRELKDAGAERIWYIADAFRAGLSVD
GVFNLTNIDRWFLVQIEELVRLEEKVAEVGITGLHAEFLRQLKRKGFADA
RLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEE
CESNPSDRDKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVNC
NPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLA
RALEAAGVPVIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTTIEMA
VEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQTAVSVSNDAPVL
LDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYTL
SQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVP
FVSKATGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKFP
GVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVRE
GDKERVVDLAAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHI
QDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQYKVHYDTTLNGGFATA
MALNADATEKVISVQEMHAQIK
>ECP_1475 formate dehydrogenase, nitrate-inducible, major subunit
MTNHWVDIKNANVVMVMGGNAAEAHPVGFRWAMEAKNNNDATLIVVDPRF
TRTASVADIYAPIRSGTDITFLSGVLRYLIENNKINAEYVKHYTNASLLV
RDDFAFEDGLFSGYDAEKRQYDKSSWNYQFDENGYAKRDDTLTHPRCVWN
LLKAHVSRYTPDVVENICGTPKADFLKVCEVLASTSAPDRTTTFLYALGW
TQHTVGAQNIRTMAMIQLLLGNMGMAGGGVNALRGHSNIQGLTDLGLLST
SLPGYLTLPSEKQVDLQSYLEANTPKATLADQVNYWSNYPKFFVSLMKSF
YGDAAQKENNWGYDWLPKWDQTYDVIKYFNMMDEGKVTGYFCQGFNPVAS
FPDKNKVVSCLSKLKYMVVIDPLVTETSTFWQNHGESNDVDPASIQTEVF
RLPSTCFAEEDGSIANSGRWLQWHWKGQDAPGEARNDGEILAGIYHHLRE
LYQAEGGKGVEPLMKMSWNYKQPHEPQSDEVAKENNGYALEDLYDANGVL
IAKKGQLLSSFAHLRDDGTTASSCWIYTGSWTEQGNQMANRDNSDPSGLG
NTLGWAWAWPLNRRVLYNRASADINGKPWDPKRMLIQWNGSKWTGNDIPD
FGNAAPGTPTGPFIMQPEGMGRLFAINKMAEGPFPEHYEPIETPLGTNPL
HPNVVSNPVVRLYEQDALRMGKKEQFPYVGTTYRLTEHFHTWTKHALLNA
IAQPEQFVEISETLAAAKGIANGDRVTVSSKRGFIRAVAVVTRRLKPLNV
NGQQVETVGIPIHWGFEGVARKGYIANTLTPNVGDANSQTPEYKAFLVNI
EKA
>ECP_2549 ABC transporter periplasmic binding protein YphF precursor
MPKKMTRTRNLLLMATLLGSALFARASDKEMTIGAIYLDTQGYYAGVRQG
VQDAAKDSSVQVQLIETNAQGDISKESTFVDTLVARNVDAIILSAVSENG
SSRTVRRASEAGIPVICYNTCINQKGVDKYVSAYLVGDPLEFGKKLGNAA
ADYFIANKIDQPKIAVINCEAFEVCVQRRKGFEEVLKTRVPGAQIVANQE
GTVLDKAISVGEKLIISTPDLNAIMGESGGATLGAVKAVRNQNQAGKIAV
FGSDMTTEIAQELENNQVLKAVVDISGKKMGNAVFAQTLKVINKQADGEK
VIQVPIDLYTKTEDGKQWLATHVDGLP
>ECP_2853 hypothetical protein YgeA (putative aspartate/glutamate racemase
MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEE
CQRRGEWDKTGDILAEAALGLQRAGAEGIVLCTNTMHKVADAIESRCSLP
FLHIADATGCAITGAGMTRVALLGTRYTMEQDFYRGRLTEQFSINCLIPE
ADERAKINQIIFEELCLGQFTEASRAYYAQVIARLAEQGAQGVIFGCTEI
GLLVPEERSVLPVFDTAAIHAEDAVAFMLS
>ECP_3911 putative transcriptional regulator, LysR family
MTKAAKRMSVTPSAVSKSLAKLRMWFDDPLFVNSPLGLSPTPLMVSMEQN
LAEWMQMSNQLLDKPLHETPRGLKFELAAESPLMMIMLNALSKRIYQRYP
QATIKLRNWDYDSLDAITRGEVDIGFSGRESHPRSRELLSSLPLAIDYEV
LFSDVPCVWLRKDHPALHEAWNLDTFLRYPHISICWEQSDTWALDNVLQE
LGRERTIAMSLPEFEQSLFMAAQPDNLLLATAPRYCQYYNQLHQLPLVAL
PLPFDESQQKKLEVPFTLLWHKRNSRNPKIVWLRETIKNLYASMA
>ECP_2234 cytochrome c-type biogenesis protein CcmH precursor
MIATDLRQKVYELMQEGKSKKEIVDYMVARYGNFVTYDPPLTPLTVLLWV
LPVVAIGIGGWVIYARSRRRVRVVPEAFPEQSVPEGKRAGYVVYLPGIVV
ALIVAGVSYYQTGNYQQVKIWQQATAQAPALLDRALDPKADPLNEEEMSR
LALGMRTQLQKNPGDIEGWIMLGRVGMALGNASIATDAYATAYRLDPKNS
DAALGYAEVLTRSSDPNDNRLGGELLRQLVRTDHSNIRVLSMYAFNAFEQ
QRFGEAVAAWEMMLKLLPANDTRRAVIERSIAQAMQHLSPQESK
>ECP_1668 hypothetical protein
MQSPSDTIFCRHLSLKYALDSLRNGKGKVNLIKHYSSVESIQQHVSLVRD
AEFRALLRHPPARSRVIASQDFGFALDIFFCRMMANNVSHMSAILYIDNH
TLSVSKDVVLSYISTLGTFEKNILLANVSYLHC
>ECP_3495 ferrous iron transport protein B
MKKLTIGLIGNPNSGKTTLFNQLTGSRQRVGNWAGVTVERKEGQFSTTDH
QVTLVDLPGTYSLTTISSQTSLDEQIACHYILSGDADLLINVVDASNLER
NLYLTLQLLELGIPCIVALNMLDIAEKQNIRIEIDALSARLGCPVIPLVS
TRGRGIEALKLAIDRYKANENVELVHYAQPLLNEADSLAKVMPSDIPLKQ
RRWLGLQMLEGDIYSRAYAGEASQHLDAALARLRNEMDDPALHIADARYQ
CISAICDVVSNTLTAEPSRFTTAVDKIVLNRFLGLPIFLFVMYLMFLLAI
NIGGALQPLFDVGSVALFVHGIQWIGYTLHFPDWLTIFLAQGLGGGINTV
LPLVPQIGMMYLFLSFLEDSGYMARAAFVMDRLMQALGLPGKSFVPLIVG
FGCNVPSVMGARTLDAPRERLMTIMMAPFMSCGARLAIFAVFAAAFFGQN
GALAVFSLYMLGIVMAVLTGLMLKYTIMRGEATPFVMELPVYHVPHVKSL
IIQTWQRLKGFVLRAGKVIIIVSIFLSAFNSFSLSGKIVDNINDSALASV
SRVITPVFKPIGVHEDNWQATVGLFTGAMAKEVVVGTLNTLYTAENIQDE
EFNPAEFNLGEELFSAVDETWQSLKDTFSLSVLMNPIEASKGDGEMGTGA
MGVMDQKFGSAAAAYSYLIFVLLYVPCISVMGAIARESSRGWMGFSILWG
LNIAYSLATLFYQVASYSQHPTYSLVCILAVILFNIVVIGLLRRARSRVD
VELLATRKSVSSCCAASTTGDCH
>ECP_0902 thioredoxin reductase
MGTTKHSKLLILGSGPAGYTAAVYAARANLQPVLITGMEKGGQLTTTTEV
ENWPGDPNDLTGPLLMERMHEHATKFETEIIFDHINKVDLQNRPFRLTGD
SGEYTCDALIIATGASARYLGLPSEEAFKGRGVSACATCDGFFYRNQKVA
VIGGGNTAVEEALYLSNIASEVHLIHRRDGFRAEKILIKRLMDKVENGNI
ILHTNRTLEEVTGDQMGVTGVRLRDTQNSDNIESLDVAGLFVAIGHSPNT
AIFEGQLELENGYIKVQSGIHGNATQTSIPGVFAAGDVMDHIYRQAITSA
GTGCMAALDAERYLDGLADAK
>ECP_2544 IS putative transposase
MQQLGLKSPVRLKKYRSYRGNMGLAAENILQRQFKAEAPCEKWVTDITEF
RAGGQKLYLSPILDLFNGEIVAWETACRPTEELVKRMLNKGLESLAEGEK
PLLHSDQGWHYRIKSYQSALADRGLVQSMSRKGNCLDNAVMENFFGHLKE
EMYYRRDYRNVEELENAVNEYITYWNQKRIKLSLGGLSPVEYRTEYQKAG


# Escherichia coli APEC O1, APEC O1

>APECO1_519 hypothetical protein
MIPYSKVESLAACRMTAQQIADVLDVDLNRLKENREAMTDFYAAIRKGRA
KGEAELRAALFKLARKGDAFALRELLRVDKNQD
>APECO1_417 unknown protein encoded by prophage CP-933X
MVKSMKKNTDDGAKIYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFL
EHVGNNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIK
HKISHDVFEPYPAALHGQFDSISMFYLLHCLPGNISTKSCVIRNAAQALT
DDGTLYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSE
HFENVKTKVQGTVVMFSASGKK
>APECO1_4214 hypothetical protein
MLHTIHFLCPVNTATVGQLQNHCLTALSQGATELNIHISSQGGETAAGFT
AYNFLKSLPVTVRTHNISNVESIANIVFLAGSERFANPLSRFLLHPLLWC
FASPAADHARLREYGKCLDNDLDRFVETFNIDIGTHIRWASLIADSTILD
ANKALEHGIINSIKTARLASNQANWWVV
>APECO1_450 acriflavine resistance protein B
MFSRFFVRRPVFAWVIAILIMLAGILAIRTLPVAQYPDVAPPTIKVSATY
TGASAETLENSVTQVIEQQLTGLDNLLYFSSTSSSDGSVSINVTFEQGTD
PDTAQVQVQNKIQQAESRLPSEVQQTGVTVEKSQSNFLLIAAVYDTTDKA
SSSDIADWLVSNVQDPLARVEGVGSLQVFGAEYAMRIWLDPAKLASYSLM
PSDVQSAIEAQNVQVTAGKIGALPSPNTQQLTATVRAQSRLQTVDQFKNI
IVKSQSDGAVVRIKDVARVEMGSEDYTAIGKLNGHPSAGVAVMLSPGANA
LNTATLVKDKIAEFQRNMPQGYDIAYPKDSTEFIKISVEDVIQTLFEAIV
LVVCVMYLFLQNLRATLIPALAVPVVLLGTFGVLALFGYSINTLTLFAMV
LAIGLLVDDAIVVVENVERIMRDEGLPAREATEKSMGEISGALVAIALVL
SAVFLPMAFFGGSTGVIYRQFSITIISAMLLSVVVALTLTPALCGSVLQH
VPPHKKGFFGAFNRFYRRTEDKYQRGVIYVLRRAARTMGLYVVLGGGMAL
MMWKLPGSFLPTEDQGEIMVQYTLPAGATAARTAEVNRQIVDWFLINEKA
NTDVIFTVDGFSFSGSGQNTGMAFVSLKNWSQRKGAENTAQAIALRATKE
LGTIRDATVFAMTPPAVDGLGQSNGFTFELLANGGADRETLLQMRNQLIE
KANQSPELHSVRANDLPQMPQLQVDIDSNKAVSLGLSLNDVTDTLSSAWG
GTYVNDFIDRGRVKKVYIQGDSEFRSAPSDLGKWFVRGSDNAMTPFSAFA
TTRWLYGPERLVRYNGSAAYEIQGENATGFSSGDAMTKMEELANSLPAGT
TWAWSGLSLQEKLASGQALSLYAVSILVVFLCLAALYESWSVPFSVILVI
PLGLLGAALAAWMRDLNNDVYFQVALLTTIGLSSKNAILIVEFAEAAVAE
GYSLSRAALRAAQTRLRPIIMTSLAFIAGVMPLAIATGAGANSRIAIGTG
IIGGTLTATLLAIFFVPLFFVLVKRLFAGKPRRQE
>APECO1_1724 hypothetical protein
MARLREVLNAIFSAPRSGKSRIPKYQQGIFLRNQHRFYLLNHYTESLSVM
SLTKLSLNTRMIGFMSDQMLETAPRLTRAVSDETSVYAGTGQNIGQNPFN
IIIVICTKEHIERLELMYQGKSDGFFT
>APECO1_1619 hypothetical protein
MNCEKCDADPLWRHSESLPSSLMRSLFVRVLIAFFRIHRFLSGRMFTDQR
VWFRMIGSELVPGYRLSWLSFINRHYICFTRIRRFRWHRSSLFHGMNVKY
RRSKINN
>APECO1_1029 putative phage lysozyme
MNAKIRYGLSAAVLALIAVGAPAPDILDQFLDEKEGNHTTAYRDGAGIWT
ICRGATMVDGKPVFPGMKLSKEKCDQVNAIERDKALAWVERNIKVPMTEP
QKAGIASFCPYNIGPGKCFPSTFYKRLNAGDRKGACEAIRWWIKDGGRDC
RIRSNNCYGQVIRRDQESALACWGIDQ
>APECO1_597 hypothetical protein
MMIFSQWKDTMDDNNQTSGQPKPEPEECVKEQKITDHFKIMIDKARKAQK
LVLIKRADDLLRWGAQEEYDFSKIFGVKGNKEVNIRKYGHNTGRRMNARF
LMMDGVRRLMIIANDLTMSSFINYTGCNEFAAFVSPSKDMPYIINIGAKF
EYRDGKKNPVTGKDSHVATLCHEMSHIQWYYGDNKKGGMWSQDYTTTDKY
STCKEDEVSYDEHIRIATKLISKQKDQIFENAYNIERYFEIRLIESEIDS
IDDEILSNSVKKK
>APECO1_1719 hypothetical protein
MIIDLDALFPSRKTFIDLVSYCTDPFASVENKVFASLPADVECGDLITST
GAKYESGDDIYVVMSEFVTAGENKPVDVLRSNAGLVCIKADALNAVSEAA
KTALIKKGFQLEGFNTVFTS
>APECO1_1387 putative inner membrane protein
MMMEKGITKPSILVVADDFTGANDAGVSLAQVGHTVDVAFEMHYRGDASV
WVINSDSRAMDPKLAAMKITSLMSHLPLANNPPLVIKKIDSTLRGNIGAE
IEALMKACGITGAVVAPAFPQAGRTTVAGECWVNGVRITETEFASDPKTP
VLSARIADIIRLQTAIPCQPVTVSQLSHLSYEQPWIGVIDAQTDSDLDRI
AAAVMQAKQPLLLVGSAGICDAVARRSAIMSPPTVLAIIGSMSEIAQRQI
ATLHSHPRITQIYVDVEHILAGNASDYDARIVQALQKGDHCIVHTCNDSV
ARHQIDTLCQRWQMSRAALGEKICRFLGELTRQVLLRTMPDALYLSGGDV
AMATASALGATGFRITGKVAQCVPYGHFLGGVWSRSVMTKAGGFGDETTL
HQVLNFIEEKCSE
>APECO1_231 hypothetical protein
MCIFTCIRKVHIVFFIRVYYGGHMSNLRKYRESLNISQTTLAKAVGCTQG
AIGHWESGRRFPDLKTCRALVACLNKLGAKVSLDDVFPPEHKAA
>APECO1_1642 putative transferase
MPSGLFMDLLPFLLDANLSATNPPAIPHWWKCQPLIPTLLSQELKNYLKL
NVKEKNIQIADQVIIDESAGEVVIGANTRICHGAVIQGPVVIGANCLIGN
YAFIRPGTIISNGVRIGFATEIKNAVIEAEATIGPQCFIADSVVANQAYL
GAQVRTSNHRLDEQPVSVRTPDGIIATGCDKLGCYIGQRSRLGVQVIILP
GRIISPNTQLGPRVIVERNLPTGTYSLRQELIRTGD
>APECO1_1407 hypothetical protein
MSASSVKPLNVQLPAITLILFALCVGIFCYLAQWMSYEEVDQSALIHLGA
NVASLTLSDESWRLLSSVFLHSSFSHLLMNMFALLVVGTVAERILGKWRL
LIIWLFSGIFGGLISACYTLRESEQIVISIGASGAIMGIAGAAIATQLAS
GAGTHHKNQRRVFPLLGMVALTLLYGTRQTGIDNACHIGGLIAGGALGWL
SARLVGQNRLVTEGGIIVAVTLLLTGTIWFVQQQIDESVLQVGQSLREAF
YPQEIEQERRQKKQQLVEERNALRETLSAPVSREQASGDLLAEIADIHDM
AISRDGNTLYAAIENTNSIVVFDLGQKKILHTFTAPIAKEKSVKHCGGCK
DQGVRSLALSLDEKLIYATSFEANALSVINVATGEIIQSITTGAHPDSFI
LSRDGTKAWVMNRTSNSVSAIDLVAYQHVADIPLEKYDGTGMSGKPGAWV
MALSPDEKTLLVPGAGRGNIVRINTITHQKEDFPAGNARGVVSAMGFRPK
NGEIIFADSQGISRIRAEDQQASIMTQWCSRSVYSVEGISPDGQYLALVS
YGLQGYVILLNINAGQIIGVYPASYVNHLRFSADDRKIFVMAKNRLIQMD
RTRSLDPQAIIRHLQYGDVACIPEP
>APECO1_1031 putative inner membrane protein
MDDSTLLRHSSLFVAYMGCLGWGSAYFYGWGTSFYYGFPWWVVGAGVDDV
ARSLFYAVTVIVIFLIGWGTGVFFFLGIKQKNNVQDLSFIRLFLAILLLF
VPPALEFSVIHRRVAPDALVLCVIAALIITFFVRSGRRFISVKFFSDMSF
IRHHWIECMMAGFMIYFWGFSLIAGWYKPQFKDEYQMIRYESVWYYVLAR
YDDRLILSESYSNGSSTFVILNSGQIDDFKINVVRVR
>APECO1_1019 hypothetical protein
MMDFSLDFSGLADIARDLETLSRAENNKVLRDATRAGAEVMRDAVVERAP
ERTGKLKKNVVVLTQRSKRRGEIISGVHIRGRNLRTGNSDNSMKASDPRN
AFYWRFVELGTINMPAHPFIRPAFDTTEELAAQVAIQRMNQAIDEVLSK
>APECO1_443 putative DEOR-type transcriptional regulator
MNSRQQTILQMVIDQGQVSVTDLAKATGVSEVTIRQDLNTLEKLSYLRRA
HGFAVSLDSDDVETRMMSNYTLKRELAEFAASLVQPGETIFIENGSSNAL
LARTLGEQKKNVTIITVSSYIAHLLKDAPCEVILLGGVYQKKSESMVGPL
TRQCIQQVHFSKAFIGIDGWQPETGFTGRDMMRTDVVNAVLEKECEAIVL
TDSSKFGAVHSYSIGPVERFNRVITDSKIRASDLMHLEQSKLTVHVVDI
>APECO1_1662 putative permease component of transport system
MESASEPESAMKKSWRNNVEFYLIGLLVLTVAAFSITMPEIFWSISNFQS
VASQMPVLGILALAMAVTMLCGGINLSIIATANACSLVMAWVATQYPPGI
ATVVATLLAGAGAAVIIGLCNGVLIAGIRVSPILATLGMMTLLKGVNILV
TGGSAIANYPSWVLWLNHAQWFGIPLPMWLFTAVALGLWILLEKTPLGKS
IYLIGSNERATLYSGINTRRVLIWVYVISALLCAVAAFLMMSKLNSAKAS
YGESYLLVSILAAVLGGVNPDGGSGRIIGMVLALFLLQIIESGFNILGIS
PYLTMALWGTLLLCFIQARGMLGLDRVV
>APECO1_2018 putative minor tail protein
METFHWKVRPDMNVVSEPKVVTVKLGDGYEQRRAAGLNNQLSTYSVTIRV
RKCDHPSLKAFLERHGGVRAFQWTPPYDWKPIRVVCRKWSASVGALWVTI
TADFEQVVA
>APECO1_1664 putative ATP-binding component of transport system, probable substrate ribose
METFLSLRHINKTFHATRALRDVSLDFMSGEVHCLAGQNGCGKSTLIKIM
SGVYRPDEGAEITLGGKNWSKLTPAASVAQGIQVIYQDLSLFPNLSVWEN
IAVNHYHHGLFVNRRRLREVAQAAMTSINVTLPLDTLVSELSIARCQLVA
ICRALAQDARLIVMDEPTASLTHQEVQGLLQVVHQLRERGICVVFVSHRL
EEVMEVSDRISVLKDGELVGTFPAAEMTTKQLGFLMTGQEFEYQVRELWQ
GKSSTPVLEVRNLSRHGEYLNINLRVEAGEVVSIVGLLGAGRTELCLSLF
GMTRPDAGEILINGQLVTLHSNQDAIRHGIGYVSEDRMSRGLVMAQSIED
NIISTVFHKVKDRFGFLSEAKVCDLVDRLIKALTIKVSDPHLPVNTLSGG
NAQRVSIAKWLAIGPRLLILDSPTVGVDIANKAGIYGIISDLAAHGIAVL
MICDEIEEAWYQSHRILVMQKGQITHSFLPDSSSQARIAEVVNG
>APECO1_418 hypothetical protein
MVRAVADFADITTGGRRTIEPWPPEQIRAKITTTIESSQTNRSTFLHTTC
LRHNPNDYSLTGFAGHSISGAVDVSTRFNNTRYLFHASSNDLSSSVAISR
STASCSGAIYSLNSWM
>APECO1_1083 hypothetical protein
MTMYAKSFIALDGNGRLTGARTAQAAPYANYTCHLCGSALRYHPQYDTEL
PWFEHTDDRLTEHGQQCPYVRPERREIQLIKRLQQFVPDALPVVRKASWH
CRQCHHDYYGERYCTHCQTGGFSIPRTTQEEICEF
>APECO1_3884 hypothetical protein
MINAVEYRLPLGNISPENLDALSELIVKYSDKFNNYVLSSISDDDIRYSY
DYYNFEITEIDEYGFHFIAPYSYYEGCVDNNFSGEVEGYAEYEIIDNELV
FSLEELPWDVK
>APECO1_2626 putative aldolase
MVSGVIPPTGNIGTDCGITARQALVTSGVICSAGNSLMALAPCSMAAKAS
VGVATPGAQYIPSARARRITAVSQCGITIIWPPASLTATTCSTESTVPAP
TRQSLGRASRSKLILCNASGEFIGISIRRNPAAYSFSPMATISSGFTPRK
MATNPVFCKSVLNGMGTFPGVGGQ
>APECO1_2124 hypothetical protein
MLRISSIYYNRFSVQTEVEYNNTFKKCINELVNSFPDNIELISEFESQCK
IMHDINNSFFAATKCAGIWRKNTKKSHALILNLIMFCEMFLSSLSSLVVV
NDPKKMKRIFPLFIVSENINIHAEPDIECFRVIDRAFDEVANYTGRIFSY
LRTHGPLKLTSCVNKQTLLSMGGYLNEWNVFDSLSRVRDFFRLSSAVFTK
LDNNIYSLEVDSFCLYRDYEIARNRLMMRASHLYSEVHEFSNKHFHLNSW
VKDHMPSYLNSDGVFSSFHLSELENMSPDDLHEEYGNISLFNWVHAYQCL
VELSKEEMSKRFSSTKPIPLQLDRWLIIKSRESWLSFFQRKGIAADAAKK
LIDYFTFNSKSHDLNDCPFIPCMDGLCLMPALIANSSVTRSLMSLFGSKK
ISQASKGRFHEQQFIKRVRDAGIKASPIDAHANYQCDCVILLDDCLIFTE
LKSNGQPIYYGKYYQQVCNIVGDSSLIHDHNNKFMRSYFQQINRISEHYL
NHLDVIIKEFELPSTWQPKGVYKLIVTTTMLGGKYHVDDTYVADKYALSS
FFQRIPGVIYQTNENGKMAKNIIDGFECCEGEITIDKFIDYLSSLPSINA
VRKNIKKLTYSVRFNEKLVHHPYYDSWAFGPYIRKGND
>APECO1_515 hypothetical protein
MGSAMNRVTAIISALVICIIVCLSWAVNHYRDNAITYKAQRDNVKEKLNQ
ATAIITDMQIRQRDVAALDAKYLKELADAKAENDALRDDVAAGRRRLHIK
AVCPSVREATIASSVDNAVSPRLADTAERDYFTLRERLVMMQAQLEGAQQ
YITEQCLK
>APECO1_1085 hypothetical protein
MSRNTTLVFRYRGSGRTATTGEVFLVRAVSRGISQPLTHRLTLRAEALIH
SVHVLYRNGHRGSWRVTADFTHQRSCICTLIQNASQRSVTGEIYSLRLFW
QILYVTL
>APECO1_526 hypothetical protein
MITYVTCDDVDNAFGNAWTSENAKNKAVLMANAWLNGFGLKINPSRIPEE
VKLAGAYAARIASVGKLFQQKNDSGVVISKAVSADGVSVSKSFAELPANS
TALLEPDLQLAIALLKPYGLSRSQVRVVRGG
>APECO1_6004 tail assembly protein
MTETESAILAHARRCAPAESCGFVVRAPEGERYNHYVNISGEPEDYFRMA
PEDWLQAEMQGEIVALVHSHPGGLPWLSEADRRLQVQSDLPWWLVCRGAI
HKFRCVPHLTGRRFEHGVTDCYTLFRDAYHLAGIEMPDFHREDDWWRNGQ
NLYLDNLEATGLYQVPLSAAQPGDVLLCCFGSSVPNHAAIYCGDGELLHH
IPEQLSKRERYTDKWQRRTHSLWRHRAWRASAFTGIYNDLAAASTFV
>APECO1_2416 putative transcriptional regulator, LacI family
MAKSDTGRKRVTLTDVARAAGVSKSTVSLVLNDSSLIKKETQQKVQQAIE
QLGYVYNRFAANLRSQKSLTIGVVIDDLINPFFAEFTMGLEMTLAEHGFI
TVMSNTSQRSDRQKQVLDTLLEHHVAGIVLCPVNSTSEADLQRYANSSTP
LLITMRPLDWQQLPVDYVGVDSHAGVREATEYLIQQGHRDIVFIGGLTHH
MRYQGYLEAMNHHGLQPWSTDAFSLRAEPTQANGYQLMQQLLDMPSPPTA
VICYNDLMAFGAESALGERGLFAGEDISLIGNDGVAACAYSNPPLTTIAV
EPLALGKQAAQQILRRIAQPDAPLSHYIYRPTLQIRASTGKRGR
>APECO1_1530 hypothetical protein
MAQKKNIRSFRDAWLADFFVHSTPHRKIPAEIHTTLSRKLDIINAATSHW
DLRSPPGNRYEELSGKLQEYSSIRVNKQYRLIFKWVNGKAEELFLDPHNY
>APECO1_345 putative transposase
MPARQVCQNFFRGALAPFHKYRQNALLDATIALINGASLTLTSIGRYLPG
NAQVKNKIKRVDRLLGNESLHHDIPLIFRNIISMLTSKLSLCVIAVDWSG
YPSQEYHVLRASLICYGHSIPLLSWIVPSEKQQNAKIQQAFLNTLSEAVN
PKARVIIVTDAGFQNAWFRHIKSLGWDFIGRIRGNKQLHLARKGECWFRR
QELQASNKPEYLGPGTLSRAEYARCDGHFYLHKKEPKGRRNKRSRCGIAR
PSQIKDARSAAKEPWLIFSSTDDFKPREIMKLYSRRMQIEQNFRDEKSER
FGFGLRASYSCSAGRMLVLSLLATLSTIVLWLIGYHAENQGLHLRYQANS
IKTRRVISYLTLAENVLRHSPLILKRTVLSTILNHLTRTYQNMVLVYYR
>APECO1_1725 hypothetical protein
MDETVLELDSFPIKLGFEEVGFNPLLHLKLFPSFSLFILCNVSRRVASLD
TAPGVEHLAKSFGRRFRDGGTSSSFKGTCYEVEPVNYSPIRVTLPKPITT
YRRL
>APECO1_511 hypothetical bacteriophage protein
MAHIQLVKQTSSGLLLPATPESCDFLHQIKIGEWIHADFKRVRNYAFHKR
FFKLLQLGFDYWTPVGGAITPRERKLVSGFVDYLCESVGREHTPALSEAA
EQYLNTVATRRTRDTALLKSFEAFREWVTIQAGFYTEHIYPDGSRGRRAK
SISFANMDETEFQQVYKSVLNVLWNWILFRKFSSPEQVENVAAQLLEFA
>APECO1_2143 carbamate kinase
MTVSKEEIMENKPTLVIALGGNALLKRGEPLEAEIQRKNIDLAAKTIAQL
TQHWRVVLVHGNGPQVGLLALQNSAYAHVAPYPLDILGAESQGMIGYMLQ
QALKNQLPQREISVLLTQVEVDANDPAFSNPTKYIGPIYDHAQTQVLQAE
KGWVFKADGHSFRRVVPSPQPKRIVERDAIQTLIAHDHLVICNGGGGVPV
VEKADGYHGIEAVIDKDLSAALLASQIHAEALLILTDADAVYLDWDKPTQ
RPLAQVTPELLNEMQFDAGSMGPKVTACAKFVSQCRGIAGIGSLADGPEI
LAGDKGTLIRLDTPITTLDPFL
>APECO1_3354 putative transcriptional regulator YgiP
MVMFCEIHCVYQNCGYLTMLNSWPLAKDLQVLVEIVHSGSFSAAAATLGQ
TPAFVTKRIQILENTLATTLLNRSARGVALTESGQRCYEHALEILTQYQR
LVDDVTQIKTRPEGMIRIGCSFGFGRSHIAPAITELMRNYPELQVHFELF
DRQIDLVQDNIDLDIRINDAIPDYYIAHLLTKNKRILCAAPEYLQKYPQP
QSLQELSRHDCLVTKERDMTHGIWELGNGQEKKSVKVSGHLSSNSGEIVL
QWALEGKGIMLRSEWDVLPFLESGKLVRVLPEYAQSANIWAVYREPLYRS
MKLRVCVEFLAAWCQQRLGKPDEGYQVM
>APECO1_1680 hypothetical protein
MWFWSDLFLYSLQEPFSYMLTVSGNEQVTIARNISGVILKLSGISSVVYL
CMHSTFISQIEQVSLEFAGEIVTMKHESYLSVLYKDSVLIVER
>APECO1_4469 putative major capsid protein
MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSD
FLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSKLASNK
YECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGV
KRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIR
VGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVN
KEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMD
DSHRRVIEENPKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAK
ATAEPGA
>APECO1_541 hypothetical protein
MDITPFLHALCAVAAQVLVGLFTGNWAYGAIAGCTFFIAREHTQAEYRWI
GMFGHGKRINMPWWGGFDPRAWDMASMMDFAVPVVACLLIWLLVNR
>APECO1_3714 putative outer membrane protein
MVLLLVLWSVFISPSGVLRWAGAAAIVLAVAALLIYRRRQAWTEMTGDAG
LSSLPPETYRQPVVLVCGGLSAHLFTDSPVRQVSEGLYLHVPDEEQLVAQ
VERLLTLRPAWASQLAVAYTIMPGIHRDVAVLAGRLRRFAHSMATVRRRA
GVNVPWLLWSGLSGSPLPERASSPWFICTGGEVQVATSTENAMPAQWIAQ
SGVQERSQRLSYLLKAESLMQWLDLNVLAELNGPEAKCPPLAMAVGLVPS
LPAVDNNLWQLWITARTGLTPDIADTGTDDALPFPDALLRRLPRQSGFTP
LRRACVTMLGVTTVAGIAALCLSATANRQLLRQVGDDLHRFYAVPVEEFI
TKARHLSVLKDDATMLDGYYREGEPLRLGLGLYPGERIRQPVLRAIRDWR
PPEQKMEVTASLQVQTVRLDSMSLFDVGQARLKDGSTKVLVDALVNIRAK
PGWLILVAGYTDATGDEKSNQQLSLRRAEAVRNWMLQTSDIPATCFAVQG
LGESQPAATNDTPQGRAVNRRVEISLVPRSDACQDVK
>APECO1_1101 hypothetical protein
MFGKGENLVATDYQMVRHPDIHQAQCLYQTFRDKAVCLAGCDFTGRVIVC
QYHGSGIVVQGAFNHFPGMDFRAVDGAGEEGFTGNQLILVVQIQHPELFT
LQSRHVQNQPFSCRTGGGEGHAGFMKMAVQGFQGPLNEAALAGRHLSGQK
GKLLHCTSLHSVDDAHNGAAFRMLQGVIPQPVIMDRHGITLRTTGMMTVR
HQTGSNAGGFRAHFSAVTVHGIEQYPCFVIIVIRFGCHKCPAVVQGVQIP
DTTAVVFAQTVHPAEDTGLQGHPLWNEAKMLRPEADALFIRDEAGNQLRR
IILWCCS
>APECO1_4213 hypothetical protein
MQVTVEYNQDSFDYFFSPVFVEFPDLKQTLVDDFIIYKSTGTLPSYFGRD
TSYHRPPDIEDAGLMHLHLAIGENKFEPIKNGTDISTPQKLQWHKTSNTA
LVYAQNLFDENRYSLIALFHPVAHMSANNHNRMRVLAGYARDFRNTMFD
>APECO1_1489 hypothetical protein
MSPRLLANGHAFNDNVFFWHVLVHTATTGCHTFDFVYHVHAFNHFSKYAV
APTLQAFAREVQEVVINHVDEELGSCRVRCLSTGHCQRTTGIFQTVVRFV
FDRVFGGFLFHARFKTAALNHKAVDNTVENGVVVKTFAAVVQEVFNCFRC
FIVKSFDDNIAVIGVESNHFCILFRLIGASTRFGSYIGWCYSITARSDHL
AMYADSRGKYGYYTQLNYPHMSKRNLRC
>APECO1_1172 hypothetical protein
MKELSLAHLHLQQVQEEANKGTLEAMITLSRLYGNKEDEKLFNMKLSARW
THFAESLYPDNEIIADCLYHLHFSSLWKRCRYAWYTFRIPASELPGQVNS
MV
>APECO1_1064 hypothetical protein
MTPIFFLNGTNCCVEGDNIMSKKYQPLLITHYMSTWVTITEAVEITTKAI
KQKITPSDIYRHALSGNILLSVYFQSPVILKKIQTFNGKIKFRQFVGDLL
DKLCMLDRDGFIYGQNLRLCTEARYICPVQQIIDTPLLRKLNQFRTFVRN
ARPGDELDVQAQVSEKNLTPPPGNSSGNLEQQIASTSQLIGSLLAEDMNS
EQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDFSLKNSRDGNPR
KWRFDATNSELCAERHEC
>APECO1_399 putative capsid assembly protein of prophage
MRRNLSHIIAAAFNEPLLLEPAYARVFFCALGREMGAASLSVPQQQVQLD
APGMLAETDEYMAGGKRPARVYRVVNGIAVLPVTGTLVHRLGGMRPFSGM
TGYDGIVACLQQAMADSQVRGILLDIDSPGGQAAGAFDCADMIYRLRQQK
PVWALCNDTACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQ
AGVDITLIYSGAHKVDGNQFEALPAEVRQDMQQRIDAARRMFAEKVAMYT
GLSVDAVTGTEAAVFEGQSGIEAGLADELINASDAISVMATALNSNVRGG
TMPQLTATEAAVQENQRVMGILTCQEAKGREQLATMLAGQQGMSVEQARA
ILAAAAPQQPVASAQSEADRIMACEEANGREQLAATLAAMPEMTVEKARP
ILAAAPLADAGPSLRDQIMALDEAKGAEAQAEKLAACPGMTVENARAVLA
AGSGKAEPVSASTTALFEHFMANHSPAAVRGGVSQTSADGDADVKMLMAM
P
>APECO1_3495 hypothetical protein
MKQYKGGYNVNRYDWKNKTTEHLLNDNNLADNHIPEWNRKNEIAAFWFWL
IMSHTVPEKLKLNNESEFNSALKGVIVMKDNPMTIIPSSHETRLNVIGKW
INALILYGEADRIISIMHQNWLGLEKRNEAKWLKKNTEQLKWAWEYIDAR
ISPTYLSWFNPVNDNERYIAIVVLLKMLFPTDTPCLLTPKEFHLGFAAKE
RFFDKMHNAFRKQFIDGKKDKRVQINVKISPSAKSALDRLTRERKTTQQA
ILEQLILYGRLD
>APECO1_1541 adk, adenylate kinase
MVSFIAFSKNSTHFKGIFAMRIILLGAPGAGKGTQAQFIMEKYGIPQIST
GDMLRAAVKSGSELGKQAKDIMDAGKLVTDELVIALVKERIAQEDCRYGF
LLDGFPRTIPQADAMKEAGINVDYVLEFDVPDELIVDRIVGRRVHAPSGR
VYHVKFNPPKVEGKDDVTGEELTTRKDDQEETVRKRLVEYHQMTAPLIGY
YSKEAEAGNTKYAKVDGTKPVAEVRAALEKILG
>APECO1_1770 aec30, hypothetical protein
MFKFPTSRLFSTLKSALRPAMPRFKVSATWLLTLAWIFLLVWIWWQGPKW
TLYEQHWLAPLANRWLATAVWGLIALVWLTWRVMKRLQKLEKQQKQQREE
EKDPLTVELHRQQQYLDHWLLRLRRHLDNRRYLWQLPWYMVIGPAGSGKS
TLLREGFPSDIVYTPESIRGVEYHPLITPRVGNQAVIFDVDGVLTTPGGD
DLLRRRLREHWLGWLMQTRARQPLNGLILTLDLPDLLTADKSRRETLVQN
LRQQLQEIRQSLHCRLPVYVVLTRLDLLNGFAALFHSLDKKDRDAILGVT
FTRRAHESDGWRSELGAFWQTWVQQVNLALSDLVLAQTGAAPRSAVFSFS
RQMQGTGEIVTALLAALLDGENMDVMLRGVWLTSSLQRGQVDDIFTQSAA
RQYGLGNSSLATWPLVETTPYFTRRLFPEVLLAEPNLAGENSVWLNSSRR
RLTAFSTCGAALAALMVGSWHHYYNQNWQSGVNVLAQAKAFMDVPPPQGT
DEFGNLQLPLLNPVRDATLAYGDYRDHGFLADMGLYQGARVGPYVEQTYI
QLLEQRYLPSLMNGLIRDLNIAPPESEEKLAVLRVVRMMEDKSGRNNEAV
KQYMARRWSNEFHGQRDIQAQLMVHLDYALEHTDWHAQRQSSDSDAVSRW
TPYDKPIINAQQELSKLPIYQRVYQTLRTKALSVLPADLNLRDQVGPTFD
NVFVAGNDEKLVIPQFLTRYGLQSYFVKQREGLVELTALDSWVLNLTQSV
AYSEADREEIQRHITEQYISDYTATWRAGMDNLNVRDYEAMSALTDALEQ
IISGDQPFQRALTALRDNTHALTLSGKLDDKAREAAINEMDYRLLSRLGH
EFAPENSALEEQKDKASTLQAVYQQLTELHRYLLAIQNSPVPGKSALKAV
QLRLDQNSSDPIFATRQMAKTLPAPLNRWVGKLADQAWHVVMVEAVRYME
VDWRDNVVKPFNEQLADNYPFNPRATQDASLDSFERFFKPDGILDNFYKN
NLRLFLENDLTFGDDGRVLIREDIRQQLDTVQKIRDIFFSQQNGLGAQFA
VETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTL
IGTSGRAPRSIAFSGPWAQFRLFGAGQLTNVTSDTFNVRFNVDGGAMVYQ
VHVDTEDNPFTGGLFSLFRLPDTLY
>APECO1_1546 apt, adenine phosphoribosyltransferase
MSTTVTSRHTLMTATAQQLEYLKNSIKSIQDYPKPGILFRDVTSLLEDPK
AYALSIDLLVERYKNAGITKVVGTEARGFLFGAPVALGLGVGFVPVRKPG
KLPRETISETYDLEYGTDQLEIHVDAIKPGDKVLVVDDLLATGGTIEATV
KLIRRLGGEVADAAFIINLFDLGGEQRLEKQGITSYSLVPFPGH
>APECO1_1962 aslA, putative arylsulfatase
MISNFHQGIIMQKTLMASLIGLAVCTGNAFNPVVAAETKQPNLVIIMADD
LGYGDLATYGHQIVKTPNIDRLAQEGVKFTDYYAPAPLSSPSRAGLLTGR
MPFRTGIRSWIPTGKDVALGRNELTIANLLKAQGYDTAMMGKLHLNAGGD
RTDQPQAKDMGFDYSLVNTAGFVTDATLDNAKERPRYGMVYPTGWLRNGQ
PTPRADKMSGEYVSSEVVNWLDNKKDSKPFFLYVAFTEVHSPLASPKKYL
DMYSQYMSAYQKQHPDLFYGDWADKPWRGVGEYYANISYLDAQVGKVLDK
IKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEG
GIRVPAIIKYGKHLPQGMVSDTPVYGLDWMPTLAKMMNFKLPTDRTFDGE
SLVPVLEQKALKREKPLIFGIDMPFQDDPTDEWAIRDGDWKMIIDRNNKP
KYLYNLKSDRYETLNLIGKKPDIEKQMYGKFLKYKTDIDNDSLMKARGDK
PEAVTWG
>APECO1_4340 atoS, sensor protein AtoS
MHYMKWIYPRRLRNQMILMAILMVIVPTLTIGYIVETEGRSAVLSEKEKK
LSAVVNLLNQALGNRYDLYIDLPREERIRALNAELAPITENITHAFPGIG
AGYYNKTLDAIITYAPSALYQNNVGVTIAADHPGREVMRTNTPLVYSGRQ
VRGDILNSMIPIERNGEILGYIWANELTEDIRRQAWKMDVRIIIVLTAGL
LISLLLIVLFSRRLSANIDIITDGLSTLAQNIPTRLPQLPGEMGQISQSV
NNLAQALRETRTLNDLIIENAADGVIAIDRQGDVTTMNPAAEVITGYQRH
ELVGQPYSMLFDNTQFYSPVLDTLEHGTEHVALEISFPGRDRTIELSVTT
SRIHNTHGEMIGALVIFSDLTARKETQRRMAQAERLATLGELMAGVAHEV
RNPLTAIRGYVQILRQQTRDPIHQEYLSVVLKEIDSINKVIQQLLEFSRP
RHSQWQQVSLNALVEETLVLVQTAGVQARVDFISELDNELSPINADRELL
KQVLLNILINAVQAISARGKIRIRTWQYSDSQQAISIEDNGSGIDLSLQK
KIFDPFFTTKASGTGLGLALSQRIINAHQGDIRVASLPGYGATFTLILPI
NPQGNQTV
>APECO1_2878 avtA, valine--pyruvate transaminase
MTFSLFGDKFTRHSGITLLMEDLNDGLRTPGAIMLGGGNPAQIPEMQDYF
QTLLTDMLESGKATDALCNYDGPQGKTELLTLLAGMLREKLGWDIEPQNI
ALTNGSQSAFFYLFNLFAGRRADGRVKKVLFPLAPEYIGYADAGLEEDLF
VSARPNIELLPEGQFKYHVDFEHLHIGEETGMICVSRPTNPTGNVITDEE
LLKLDALANQHGIPLVIDNAYGVPFPGIIFSEARPLWNPNIVLCMSLSKL
GLPGSRCGIIIANEKIITAITNMNGIISLAPGGIGPAMMCEMIKRNDLLR
LSETVIKPFYYQRVQETIAIIRRYLPEDRCLIHKPEGAIFLWLWFKDLPI
TTEQLYQRLKARGVLMVPGHNFFPGLDKPWPHTHQCMRMNYVPEPEKIEA
GVKILAEEIERAWAESH
>APECO1_2916 bcsB, regulator of cellulose synthase, cyclic di-GMP binding protein BcsB
MKRKIFWICAVALGMSAFPSFMTQATPATQPLINAEPAVTAQAEQNPQVG
QVMPGVQGADAPVVAQNGPSRDVKLTFAQIAPPPGSMVLRGINPNGSIEF
GMRSDEVVTKAMLNLEYTPSPSLLPVQSQLKVYLNDELMGVLPVTKEQLG
KKTLAQMPINPLFITDFNRVRLEFVGHYQDVCENPASTTLWLDVGRSSGL
DLTYQTLNVKNDLSHFPVPFFDPRDNRTNTLPMVFAGAPDVELQQASAIV
ASWFGSRSGWRGQNFPVLYNQLPDRNAIVFATNDKRPDFLRDHPAVKAPV
IEMINHPQNPYVKLLVVFGRDDKDLLQAAKGIAQGNILFRGESVVVNEVK
PLLPRKPYDAPNWVRTDRPVTFGELKTYEEQLQSSGLEPAAINVSLNLPP
DLYLMRSTGIDMDINYRYTMPPVKDSSRMDISLNNQFLQSFNLSSKQEAN
RLLLRIPVLQGLLDGKTDVSIPALKLGATNQLRFDFEYMNPMPGGSVDNC
ITFQPVQNHVVIGDDSTIDFSKYYHFIPMPDLRAFANAGFPFSRMADLSQ
TITVMPKTPNEAQMETLLNTVGFIGAQTGFPAINLTVTDDGSTIQGKDAD
IMIVGGIPDKLKDDKQIDLLVQATESWVKTPMRQTPFPGIVPDESDRAAE
TQSTLTSSGAMAAVIGFQSPYNDQRSVIALLADSPRGYEMLNDAVNDSGK
RATMFGSVAVIRESGINSLRVGDVYYVGHLPWFERLWYALANHPILLAVL
AAISVILLAWVLWRLLRIISRRRLNPDNE
>APECO1_3626 bglA, 6-phospho-beta-glucosidase BglA
MIVKKLTLPKDFLWGGAVAAHQVEGGWNKGGKGPSICDVLTGGAHGVPRE
ITKEVVPGKYYPNHEAVDFYGHYKEDIKLFAEMGFKCFRTSIAWTRIFPK
GDEAQPNEEGLKFYDDMIDELLKYNIEPVITLSHFEMPLHLVQQYGSWTN
RKVVDFFVRFAEVVFERYKHKVKYWMTFNEINNQRNWRAPLFGYCCSGVV
YTEHENPEETMYQVLHHQFVASALAVKAAHRINPEMKVGCMLAMVPLYPY
SCNPDDVMFAQESMRERYVFTDVQLRGYYPSYVLNEWERRGFNIKMEDGD
LDVLREGTCDYLGFSYYMTNAVKAEGGTGDAISGFEGSVPNPYVKASDWG
WQIDPVGLRYALCELYERYQKPLFIVENGFGAYDKVEDDGSINDDYRIDY
LRAHIEEMKKAVTYDGVDLMGYTPWGCIDCVSFTTGQYSKRYGFIYVNKH
DDGTGDMSRSRKKSFNWYKEVIASNGENL
>APECO1_1812 cdsA, phosphatidate cytidylyltransferase
MLIPVVIAALFLLPPVGFAIVTLVVCMLAAWEWGQLSGFTTRSQRVWLAV
LCGLLLALMLFLLPEYHRNIHQPLVEISLWASLGWWIVALLLVLFYPGSA
AIWRNSKTLRIIFGVLTIVPFFWGMLALRAWHYDENHYSGAIWLLYVMIL
VWGADSGAYMFGKLFGKHKLAPKVSPGKTWQGFIGGLVTAAVISWGYGMW
VNLDVAPVTLLICSIVAALASVLGDLTESMFKREAGIKDSGHLIPGHGGI
LDRIDSLTAAVPVFACLLLLVFRTL
>APECO1_1213 clpA, ATP-dependent clp protease ATP-binding subunit ClpA
MPMLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACS
VDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSS
GRNEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDVVNFISHGTRKDEPT
QSSDPGSQPNSEEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERA
IQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSL
DIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASG
GQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITE
PSIEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKA
IDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDT
LKNLGDRLKMLVFGQDKAIEALTEAIKMARAGLGHEHKPVGSFLFAGPTG
VGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGL
LTDAVIKHPHAVLLLDEIEKAHPDVFNILLQVMDNGTLTDNNGRKADFRN
VVLVMTTNAGVRETERKSIGLIHQDNSTDAMEEIKKIFTPEFRNRLDNII
WFDHLSTDVIHQVVDKFIVELQVQLDQKGVSLEVSQEARNWLAEKGYDRA
MGARPMARVIQDNLKKPLANELLFGSLVDGGQVTVALDKEKNELTYGFQS
AQKHKAEAAH
>APECO1_1473 cusA, copper/silver efflux system, membrane component
MIEWIIRRSVANRFLVLMGALFLSIWGTWTIINTPVDALPDLSDVQVIIK
TSYPGQAPQIVENQVTYPLTTTMLSVPGAKTVRGFSQFGDSYVYVIFEDG
TDPYWARSRVLEYLNQVQGKLPAGVSAELGPDATGVGWIYEYALVDRRGK
HDLADLRSLQDWFLKYELKTIPDVAEVASVGGVVKEYQVVIDPQRLAQYG
ISLAEVKSALDASNQEAGGSSIELAEAEYMVRASGYLQTLDDFNHIVLKA
SENGVPVYLRDVAKVQVGPEMRRGIAELNGEGEVAGGVVILRSGKNAREV
IAAVKDKLETLKSSLPEGVEIVTTYDRSQLIDRAIDNLSGKLLEEFIVVA
VVCALFLWHVRSALVAIISLPLGLCIAFIVMHFQGLNANIMSLGGIAIAV
GAMVDAAIVMIENAHKRLEEWQHQHPDATLDNKTRWQVITDASVEVGPAL
FISLLIITLSFIPIFTLEGQEGRLFGPLAFTKTYAMAGAALLAIVVIPIL
MGYWIRGKIPPESSNPLNRFLIRVYHPLLLKVLHWPKTTLLVAALSVLTV
LWPLNKVGGEFLPQINEGDLLYMPSTLPGISAAEAASMLQKTDKLIMSVP
EVARVFGKTGKAETATDSAPLEMVETTIQLKPQDQWRPGMTMDKIIEELD
NTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTVLADIDTMAEQI
EEVARTVPGVASALAERLEGGRYINVEINREKAARYGMTVADVQLFVTSA
VGGAMVGETVEGIARYPINLRYPQSWRDSPQALRQLPILTPMKQQITLAD
VADVKVSTGPSMLKTENARPTSWIYIDARDRDMVSVVHDLQKAIAEKVQL
KPGTSVAFSGQFELLERANHKLKLMVPMTLMIIFVLLYLAFRRVGEALLI
ISSVPFALVGGIWLLWWMGFHLSVATGTGFIALAGGAAEFGVVMLMYLRH
AIEAEPSLNNPQTFSEQKLDEALYHGAVLRVRPKAMTVAVIIAGLLPILW
GTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKLMWLHRHRVRK
>APECO1_1203 cydC, Transport ATP-binding protein cydC
MRALLPYLALYKRHKWMLSLGIVLAIVTLLASIGLLTLSGWFLSASAVAG
VAGLYSFNYMLPAAGVRGAAITRTAGRYFERLVSHDATFRVLQHLRIYTF
SKLLPLSPAGLARYRQGELLNRVVADVDTLDHLYLRVISPLVGAFVVIMV
VTIGLSFLDFTLAFTLGGIMLLTLFLMPPLFYRAGKSTGQNLTHLRGQYR
QQLTAWLQGQAELTIFGASDRYRTQLENTEIQWLEAQRRQSELTALSQAI
MLLIGALAVILMLWMASGGVGGNAQPGALIALFVFCALAAFEALAPVTGA
FQHLGQVIASAVRITDLTDQKPEVTFPDTQTRVADRVSLTLRDVQFTYPE
QSQQALKGIYLQVNAGEHIAILGRTGCGKSTLLQLLTRAWDPQQGEILLN
DSPIASLNEAALRQTISVVPQRVHLFSATLRDNLLLASPGSSDEALAEIL
RRVGLEKLLEDAGLNSWLGEGGRQLSGGELRRLAIARALLHDAPLVLLDE
PTEGLDATTESQILELLAEMMCEKTVLMVTHRLRGLSRFQQIIVMDNGQI
IEQGTHAELLARQGRYYQFKQGL
>APECO1_1579 cyoA, cytochrome o ubiquinol oxidase subunit II
MRLRKYNKSLGWLSLFAGTVLLSGCNSALLDPKGQIGLEQRSLILTAFGL
MLIVVIPAILMAVGFAWKYRASNKDAKYSPNWSHSNKVEAVVWTVPILII
IFLAVLTWKTTHALEPSKPLAHDEKPITIEVVSMDWKWFFIYPEQGIAAV
NEIAFPANTPVYFKVTSNSVMNSFFIPRLGSQIYAMAGMQTRLHLIANEP
GTYDGISASYSGPGFSGMKFKAIATPDRAAFDQWVAKAKQSPNTMSDMAT
FEKLAAPSEYNQVEYFSNVKPDLFADVINKFMAHGKSMDMTQPEGEHSAH
EGMEGMEGMDMSHAESAH
>APECO1_1582 cyoD, cytochrome O ubiquinol oxidase protein CyoD
MPEPVLALPGCGLDLCVHCCLSDGGDVMSHSNASSGASHGSVKTYMTGFI
LSIILTVIPFWMVMTGAASPAVILGTILAMAVVQILVHLVCFLHMNTKSD
EGWNMTAFVFTVLIIAILVVGSIWIMWNLNYNMMMH
>APECO1_659 dcp, putative oxidoreductase YdfG
MTTMNPFLVQSTLPYLAPHFDQIANHHYRPAFDEGIQQKRAEIAAIALNP
QTPDFKNTILALEQSGELLTRVTSVFFAMTAAHTNDELQRLDEQFSAELA
ELANDIYLNGELFARVDAVWQRRESLGLDSESIRLVEVIHQRFVLAGAKL
EQADKAKLKVLNTEAATLTSQFNQRLLAANKSGGLVVNDIAQLAGMSEQE
IALAAEAAREKGLDNNWLIPLLNTTQQPVLAELRDRATREKLFTAGWTRA
EKNDANDTRAIIQRLVEIRVQQAKLLGFPHYAAWKIADQMAKTPEAALNF
MREIVPAARQRASDELASIQAVIDKQQGGFSAQPWDWAFYAEQVRREKFD
LDESQLKPYFELNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVWEI
FDHNGVGLALFYGDFFARDSKSGGAWMGNFVEQSTLNETHPVIYNVCNYQ
KPAAGEPALLLWDDVITLFHEFGHTLHGLFARQRYATLSGTNTPRDFVEF
PSQINEHWATHPQVFARYARHYQSGAAMPDELQQKMRNASLFNKGYEMSE
LLSAALLDMRWHCLEENEAMQDVDDFELRALVAENMDLPAIPPRYRSSYF
AHIFGGGYAAGYYAYLWTQMLADDGYQWFVEQGGLTRENGRRFREAILSR
GNSEDLERLYRQWRGKAPQIMPMLQHRGLNV
>APECO1_3837 emrB, multidrug resistance protein B
MQQQKPLEGAQLVIMTIALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQ
GTWVITSFGVANAISIPLTGWLAKRVGEVKLFLWSTIAFAIASWACGVSS
SLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAKRSIALALWSMTVIVA
PICGPILGGYISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETRTERRR
IDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVVAICFLIV
WELTDDNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYG
YTATWAGLASAPVGIIPVILSPIIGRFAHKLDMRRLVTFSFIMYAVCFYW
RAYTFEPGMDFGASAWPQFIQGFAVACFFMPLTTITLSGLPPERLAAASS
LSNFTRTLAGSIGTSITTTMWTNRESLHHAQLTESVNPFNPNAQAMYSQL
EGLGMTQQQASGSIAQQITNQGLIISANEIFWMSAGIFLVLLGLVWFAKP
PFGAGGGGGGAH
>APECO1_975 emrE, multidrug efflux protein
MNSFVSLGFLLIIIVPAFISCHARAPWIHIHQDENGELCSNCSTILSSMN
RKEYAMNPYIYLGGAILAEVIGTTLMKFSEGFTRLWPSVGTIICYCASFW
LLAQTLAYIPTGIAYAIWSGVGIVLISLLSWGFFGQRLDLPAIIGMMLIC
AGVLVINLLSRSAPH
>APECO1_6023 erfK, conserved protein with NAD(P)-binding Rossmann-fold domain
MMRRVNILCSFALLFASHTSLAVTYPLPPEGSRLVGQSLTVTVPDHNTQP
LETFAAQYGQGLSNMLEANPGADVFLPKSGSQLTIPQQLILPATVRKGIV
VNVAEMRLYYYPPDSNTVEVFPIGIGQAGRETPRNWVTTVERKQEAPTWT
PTPNTRREYAKRGESLPAFVPAGPDNPMGLYAIYIGRLYAIHGTNANFGI
GLRVSQGCIRLRNDDIKYLFDNVPVGTRVQIIDQPVKYTTEPDGSKWLEV
HEPLSRNRAEYESDRKVPLPVTPSLRAFINGQEVDVNRANAALQRRSGMP
VQISSGSRQMF
>APECO1_1462 fepE, ferric enterobactin transport protein FepE
MNVAFSGSNLQTSASDKSHQLVEWGPWRFTLIGLIFRLYPMSSLNIKQGS
EAHFPEYPLASPSNNEIDLLSLIEVLWRAKKTVMAVVFAFACAGLLISFI
LPQKWTSSAVITPAEAIQWQDLEKTFTKLRVLDLDINIDRGGAFNLFIKR
FQSVSLLEEYLRSSPYVMDQLKEAKIDELDLHRAIVALSEKMKAVDDNAS
KKKDEPSLYTSWTLSFTAPTSEEAQKVLAGYIDYISALVVKESIENVRNK
LEIKTQFEKEKLAQDRIKTKNQLDANIQRLNYSLDIANAAGIKKPVYSNG
QAVKDDPDFSISLGADGIERKLEIEKAVTDVAELNGELRNRQYLVEQLTK
ANINDVNFTPFKYQLRPSLPVKKDGPGKSIIVILSALIGGMVACGGVLLR
HAMASRKQDAMMADHLV
>APECO1_155 flgB, cell-proximal portion of basal-body rod FlgB
MLDKLDAALRFQQEALNLRAQRQEVLAANIANADTPGYQARDIDFASELK
KVMQRGRDATSVVALTMTSTKHIPAQALTPLSAELQYRIPDQPSLDGNTV
DMDRERTQFADNSLQYQMSLSALSGQIKGMMNVLQSGN
>APECO1_977 fliF, Flagellar M-ring protein
MNATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRT
LFSNLSDQDGGAIVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQ
GLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELSRTIETIGPVKGA
RVHLAMPKPSLFVREQKSPSASVTVNLLPGRALDEGQISAIVHLVSSAVA
GLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEGRIQRRIEAIL
SPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQ
SGSGYPGGVPGALSNQPAPANNAPISHASGKSK
>APECO1_3464 gspJ, putative type II secretion protein GspJ
MRRARAGFTLLEMLVAIAIFASLALMAQQVTNGVTRVNSAVAGHDQKLNL
MQQTMSFLTHDLTQMMPRPVRGDQGQREPALLAGAGVLVSESGGMRFVRG
GVVNPLMRLPRSNLLTVGYRIHDGYLERLAWPLTDAAGSVKPTTQKLIPA
DSLRLQFYDGTRWQESWSSVQAIPVAVRITLHSPQWGEIERIWLLRGPQL
S
>APECO1_1553 hha, Hha protein
MKTISSLSRSTNIWMTPLCCSVVMVLICRIFRNGGSQVIDYSVVLSMRRK
RILRVYLVRIITTMGRSMSEKPLTKTDYLMRLRRCQTIDTLERVIEKNKY
ELSDNELAVFYSAADHRLAELTMNKLYDKIPSSVWKFIR
>APECO1_1846 htrE, outer membrane usher protein HtrE precursor
MTIKSTNHLTHIATFCALLYSNSALCAELVEYDHTFLMGKDASNIDLSRY
TEGNPTLPGIYDVSVYVNDQPIMSQSIAFAVIEGKKNAQACITQKNLLQF
HISSPDKNSEKAILLKRDDDLGDCLNLAEMIPQSSIRYDVNDQRLDIDVP
QAWIMKNYQNYVDPSLWENGINAAMLSYNLNGYHSESPGRTNDSIYAAFN
GGINLGAWRLRASGNYNWITNVHSDYDFQNRYLQRDLASLRSQLVIGESY
TTGETFDSVRIRGIRLYSDSRMLPPVLASFAPIIHGVANTNAKVTVMQNG
YKIYETTVPPGAFAIDDLSPSGYGSDLIVTIEEADGTKRTFSQPFSSVVQ
MLRPGVGRWDISAGQVLKDSIQDEPNLFQASYYYGLNNYLTGYTGIQLTD
NNYTAGLLGLGMNTPVGAFSVDVTHSNVSIPDDKTYQGQSYRISWNKLFE
NTSTSLNIAAYRYSTQHYLGLNDALTLIDEVEHPEQDLEPKSMRNYSRMK
NQVTVSINQPLKFEKKDYGSFYLSGSWSDYWASGQNSTNYSIGYSNSASW
GSYSISAQRSLNEDGQTDDSIYLSFTIPIENLLGTEHRSSGFQSIDTQLN
SDFKGNNQLNISSSGYSDTNRISYSVNTGYMMNKSSDDLSYIGGYASYES
PWGTLSGSASASSDNSRQFSLNTDGGFVLHSGGLTFSNDSFSDSDTLAVI
QAPGAKGARINYGNSTVDRWGYGVTSALSPYHENRIALDINDLENDVELK
STSTVAVPRQGAVVFADFETVQGQSAIMNIVRSDGKNIPFAADIYDEQNN
IIGNVGQGGQAFVRGIGQEGNIRITWIEEGKPVSCFAHYQQNTTSEKIAQ
SIILNGLRCQIQ
>APECO1_2257 insB, IS1 InsB protein
MPGNCPHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMGEQWGYV
GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>APECO1_1485 intD, prophage DLP12 integrase
MIKQASITNVGVHTRAAKRADGTSMPALRKMRIDSKTAWLSACRRAGVEN
FRFHDLWHTRASRLIPSGVPLSVLQEMGGWESREMVRRYAHLAPNHLTEH
ARKIDDIFGDNVPLWNYRRNKEGVTD
>APECO1_1484 intD, prophage DLP12 integrase
MVFNEMPRCLITVENNHEASRNIWRCNISTVCRQTCMRWNGTRDLQKNQE
KRLIDESPKPLKSVVKFALVTVLRKSNIINLEWQQIDMQRQVARVNPEDS
KSNRAIGVALNDTASKVLHDQTGKHHKCGCTYQGG
>APECO1_3480 kpsD, polysialic acid transport protein KpsD precursor
MKLFKSILLIAACHAAQASAAIDINADPNLTGAAPLTGILNGQQSDTQNM
SGFDNTPPPSPPVVMSRMFGAQLFNGTSADSGATVGFNPDYILNPGDSIQ
VRLWGAFTFDGALQVDPKGNIFLPNVGPVKVAGVSNSQLNALVTSKVKEV
YQSNVNVYASLLQAQPVKVYVTGFVRNPGLYGGVTSDSLLNYLIKAGGVD
PERGSYVDIVVKRGNRVRSNVNLYDFLLNGKLGLSQFADGDTIIVGPRQH
TFSVQGDVFNSYDFEFRESSIPVTEALSWARPKPGATHITIMRKQGLQKR
SEYYPISSAPGRMLQNGDTLIVSTDRYAGTIQVRVEGAHSGEHAMVLPYG
STMRAVLEKVRPNSMSQMNAVQLYRPSVAQRQKEMLNLSLQKLEEASLSA
QSSTKEEASLRMQEAQLISRFVAKARTVVPKGEVILNESNIDSVLLEDGD
VINIPEKTSLVMVHGEVLFPNAVSWQKGMTTEDYIEKCGGLTQKSGNARI
IVIRQNGAAVNAEDVDSLKPGDEIMVLPKYESKNIEVTRGISTILYQLAV
GAKVILSL
>APECO1_1563 mdlA, multidrug resistance-like ATP-binding protein MdlA
MRLFAQLSWYFRREWRRYLGAVALLVIIAMLQLVPPKVVGIVVDGVTEQH
FTTGQILMWIATMVLIAVVVYLLRYVWRVLLFGASYQLAVELREDYYRQL
SRQHPEFYLRHRTGDLMARATNDVDRVVFAAGEGVLTLVDSLVMGCAVLI
MMSTQISWQLTLFALLPMPVMAIMIKRNSDALHERFKLAQAAFSSLNDRT
QESLTSIRMIKAFGLEDRQSALFAADAEDTGKKNMRVARIDARFDPTIYI
AIGMANLLAIGGGSWMVVQGSLTLGQLTSFIMYLGLMIWPMLALAWMFNI
VERGSAAYSRIRTMLAEAPVVIDGSDKVPEGRGELDVNIRQFTYPQTDHP
ALENVNFALKPGQMVGICGPTGSGKSTLLSLIQRHFDVSEGDIRFHDIPL
TKLQLDSWRSRLAVVSQTPFLFSDTVANNIALGCPNAIQQEIEHVARLAS
VHDDILRLPQGYDTEVGERGVMLSGGQKQRISIARALLVNAEILILDDAL
SAVDGRTEHQILHNLRQWGQGRTVIISAHRLSALTEASEIIVMQHGHIGQ
RGNHDVLVQQSGWYRDMYRYQQLEAALDDAPEIREEAIDA
>APECO1_1308 moaA, molybdenum cofactor biosynthesis protein A
MTMLVKIATRAKEEMTSPPVSGKVYMASQLTDAFARKFYYLRLSITDVCN
FRCTYCLPDGYKPSGVTNKGFLTVDEIRRVTRAFASLGTEKVRLTGGEPS
LRRDFTDIIAAVRENDAIRQIAVTTNGYRLERDVANWRDAGLTGINVSVD
SLDARQFHAITGQDKFNQVMAGIDAAFEAGFEKVKVNTVLMRDVNHHQLD
TFLNWIQHRPIQLRFIELMETGEGIELFRKHHISGQVLRNELLRRGWIHQ
LRQRSDGPAQVFCHPDYAGEIGLIMPYEKDFCATCNRLRVSSIGKLHLCL
FGEGGVNLRDLLEDDTQQQALEARISAALREKKQTHFLHQNNTGITQNLS
YIGG
>APECO1_1895 murC, UDP-N-acetylmuramate--L-alanine ligase
MNTQQLAKLRSIVPEMRRVRHIHFVGIGGAGMGGIAEVLANEGYQISGSD
LAPNPVTQQLMNLGATIYFNHRPENVRDASVVVVSSAISADNPEIVAAHE
ARIPVIRRAEMLAELMRFRHGIAIAGTHGKTTTTAMVSSIYAEAGLDPTF
VNGGLVKAAGVHARLGHGRYLIAEADESDASFLHLQPMVAIVTNIEADHM
DTYQGDFENLKQTFINFLHNLPFYGRAVMCVDDPVIRELLPRVGRQTTTY
GFSEDADVRVEDYQQIGPQGHFTLLRQDKEPMRVTLNAPGRHNALNAAAA
VAVATEEGIDDEAILRALESFQGTGRRFDFLGEFPLEPVNGKSGTAMLVD
DYGHHPTEVDATIKAARAGWPDKKLVMLFQPHRFTRTRDLYDDFANVLTQ
VDTLLMLEVYPAGEAPIPGADSRSLCRTIRGRGKIDPILVPDPAQVAEML
APVLTGNDLILVQGAGNIGKIARSLAEIKLKPQTPEEEQHD
>APECO1_190 ndh, NADH dehydrogenase
MIVGGGAGGLEMATQLGHKLGRKKKAKITLVDRNHSHLWKPLLHEVATGS
LDEGVDALSYLAHARNHGFQFQLGSVIDIDREAKTITIAELRDEKGELLV
PERKIAYDTLVMALGSTSNDFNTPGVKENCIFLDNPHQARRFHQEMLNLF
LKYSANLGANGKVNIAIVGGGATGVELSAELHNAVKQLHSYGYKGLTNEA
LNVTLVEAGERILPALPPRISAAAHSELTKLGVRVLTQTMVTSADEGGLH
TKDGEYIEADLMVWAAGIKAPDFLKDIGGLETNRINQLVVEPTLQTTRDP
DIYAIGDCASCPRPEGGFVPPRAQAAHQMATCAMNNILAQMNGKPLKNYQ
YKDHGSLVSLSNFSTVGSLMGNLTRGSMMIEGRIARFVYISLYRMHQIAL
HGYFKTGLMMLVGSINRVIRPRLKLH
>APECO1_3475 neuE, polysialic acid biosynthesis protein NeuE
MTSRNVPPVMTRKKVLCFVFRYDSHFLALKNIFEQIDVDSYDLFFCCLDN
SLQEFVKKNLDEKIVVFYPDDFVCFFTFINIEFIFCSTGGKDLHEIVNTV
RTKDTIIISCFPGIVLTSQIEAFISKSNSHYLLINSPKDIKTYKKICKII
GVPFNGILFGPPWIKNVNINAKSENSCLIVDQVNEPLTPIKRIEYARFLI
RVIQKHPHMNFIFKTRNPLISPDSIVFDIKEYIERFDLKNITFSDDNIDS
LISKVEYCITISSSVAIYCLANKIKVYLINGFNHTCNGQCYFSRSGLIVD
YNKFNFKHIPRIKKKWMEENFYYSRDIQHKILNDILKMPPNVNVRTFGIK
RSTLIILFLIFFNFFFSLGPKKIKTLKKIHKVLLRYKKDDI
>APECO1_2955 prlC, oligopeptidase A
MTNPLLTPFELPPFSKILPEHVVPAVTKALNDCRENVERVVAQGAPYTWE
NLCQPLAEVDDVLGRIFSPVSHLNSVKNSPELREAYEQTLPLLSEYSTWV
GQHEGLYKAYRDLRDGDHYATLNTAQKKAVDNALRDFELSGIGLPKEKQQ
RYGEIATRLSELGNQYSNNVLDATMGWTKLVTDEAELAGMPESALAAAKA
QAEAKELEGYLLTLDIPSYLPVMTYCDNQALREEMYRAYSTRASDQGPNA
GKWDNSKVMEEILALRHELAQLLGFENYAFKSLATKMAENPQQVLDFLTD
LAKRARPQGEKELAQLRAFAKAEFGVDELQPWDIAYYSEKQKQHLYSISD
EQLRPYFPENKAVNGLFEVVKRIYGITAKERKDVDVWHPDVRFFELYDEN
DELRGSFYLDLYARENKRGGAWMDDCVGQMRKADGSLQKPVAYLTCNFNR
PVNGKPALFTHDEVITLFHEFGHGLHHMLTRIETAGVSGISGVPWDAVEL
PSQFMENWCWEPEALAFISGHYETGEPLPKELLDKMLAAKNYQAALFILR
QLEFGLFDFRLHAEFRPDQGAKILETLAEIKKLVAVVPSPSWGRFPHAFS
HIFAGGYAAGYYSYLWADVLAADAFSRFEEEGIFNRETGQSFLDNILSRG
GSEEPMELFKRFRGREPQLDAMLEHYGIKG
>APECO1_1622 proC, pyrroline-5-carboxylate reductase
MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVTVLHDQFG
INAAESAQEVAQIADIIFAAVKPGIMIKVLSEITSSLNKDSLVVSIAAGI
TLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNI
FRCFGEAEVIAEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQA
YKFAAQAVMGSAKMVLETGEHPGTLKDMVCSPGGTTIEAVRVLEEKGFRA
AVIEAMTKCMEKSEKLSKS
>APECO1_1658 prpR, DNA-binding transcriptional activator
MAHPPRLNDDKPVIWTVSVTRLFELFRDISLEFDHLANITPIQLGFEKAV
TYIRKKLASERCDAIIAAGSNGAYLKSRLSVPVILIKPSGYDVLQALAKA
GKLTSSIGVVTYQETIPALVAFQKTFNLRLDQRSYITEEDARGQINELKA
NGTEAVVGAGLITDLAEEAGMTGIFIYSAATVRQAFSDALDMTRMSLRHN
THDATRNALRTRYVLGDMLGQSPQMEQVRQTILLYARSSAAVLIEGETGT
GKELAAQAIHREYFARHDARQGKKSHPFVAVNCGAIAESLLEAELFGYEE
GAFTGSRRGGRAGLFEIAHGGTLFLDEIGEMPLPLQTRLLRVLEEKEVTR
VGGHQPVPVDVRVISATHCNLEEDMRQGQFRRDLFYRLSILRLQLPPLRE
RVTDILPLAESFLKVSLAALSAPFSAALRQGLQASETVLVHYDWPGNIRE
LRNMMERLALFLSVEPTPDLTPQFLQLLLPELARESAKTPAPRLLTPQQA
LEKFKGDKTAAANYLGISRTTFWRRLKN
>APECO1_308 prrA, putative outer membrane receptor, probably TonB dependent
MRLKKRYLCTVLTLAFTQQAVAAQESDTLTVWSSPVSSTTTTVLDQPTMK
ALDKQNVAQALSVVPGVVLQKSGSRNEEQVKVRGFDSRQVPVYFDGVPIY
VPYDGNLDLARILTNNLGAVEVSKGYSSLLQGPNQMGGAINITTQKPTKP
LEASLGYRQGWSRSQDNAYDMHASFAASSDLGYLQVSGSQLKQDFLGLPH
GVNNDIAGKHGKMINSSADDKRSIVKLGFTPRENDEYTLTYIKQDGEKDN
PPYSGNSGQKSRYWQWPEYDKESFYYQGTTQLNDRFTLKSRLYRDTFENT
LMMYNSLADLKNKKGSYSHYSDYSDGAGLQLAADVRENDLLSFAVNWKDD
VHREKGAPHAAYDRYEDRTWSLASEYQWAAADNVDVVAGISYDWRDSVEA
KKHEKDGSITHYDDNNQSAFNWQVMGKYHFANEDTLALSYYDRTRFPTLK
ERYTTSKPAYNQIAIVNPQLKPERARGVDLTWNGAFTHDWGFEVSVYYNR
VSDAILSHNIDADTIQNQNSGTVDYSGLDAGIKGKISNILDVGLSYALIH
ADAKRKDIGKITDLPTQTMTAWMTLKPWEPLSVTLSEEARSSSYSNSDGS
QKAAGFAVTHIRADYTLGHGFSVNASVNNLFDTKYAYSEGFIEEGRNFWA
GIEYTF
>APECO1_3186 rbsA, putative ribose transport ATP-binding protein RbsA
MTDVILDVSHIAKTFGHVQALKDITLSLRKGRVHTLLGENGAGKSTLMKI
LAGVYPPTQGTITLRGETITINNPQHSRQLGIAIIFQELSLSNNMTVAEN
IYANNEPRRFGIINDKKMLADCQNLLADLGIPLDPLEMVGNMSMAHRQLV
EIAKALSYAADVVIMDEPTSSLSDNEAEILFNIIEKLKQRGCAVIYISHR
MDEIMRISDDISVIRDGEYIATHEKKNSDIQHLIAQMVGREMKNIWPARL
GEKPDENVPAKLEVKNLSHPSLFKEVSFAVRPGEVLGFFGLVGAGRSDVM
KALFGLVSYHGTVLIDGKEVRIANPKQAIDHGIAFVTENRKEEGLVLMHD
VNMNTHHVAFQYNASRMGLINHRQEEAKTLQSIARMNTKVSSVHQAVGAL
SGGNQQKIVLSKWLEKTPRILLLDEPTRGVDVGAKFEIYNVIRQLAAAGT
AIILVSSELPEVMALSDRLVVMRNKTIADIYSCENLTQIQVMTAATGVR
>APECO1_4341 rcsC, hybrid sensory kinase in two-component regulatory system with RcsB and RojN
MFRALALVLWLLIAFSSVFYIVNALHQRESEIRQEFNLSSDQAQRFIQRT
SDVMKELKYIAENRLSAENGVLSPRGRETQTDVPAFEPLFADSDCSAMSN
TWRGSLESLAWFMRYWRDNFSAAYDLNRVFLIGSDNLCMANFGLRDMPVE
RDTALKALHERINKYRNAPQDDSGSNLYWISEGPRPGVGYFYALTPVYLA
NRLQALLGIEQTIRMENFFLPGTLPMGVTILDENGHTLISLTGPESKIKG
DPRWMQERSWFGYTDGFRELVLKKNLPPSSLSIVYSVPVDKVLERIRMLI
LNAILLNVLAGAALFTLARMYERRIFIPAESDALRLEEHEQFNRKIVASA
PVGICILRTADGVNILSNELAHTYLNMLTHEDRQRLTQIICGQQVNFVDV
LTSNNTNLQISFVHSRYRNENVAICVLVDVSSRVKMEESLQEMAQAAEQA
SQSKSMFLATVSHELRTPLYGIIGNLDLLQTKELPKGVDRLVTAMNNSSS
LLLKIISDILDFSKIESEQLKIEPREFSPREVMNHITANYLPLVVRKQLG
LYCFIEPDVPVALNGDPMRLQQVISNLLSNAIKFTDTGCIVLHVRADGDY
LSIRVRDTGVGIPAKEVVRLFDPFFQVGTGVQRNFQGTGLGLAICEKLIS
MMDGDISVDSEPGMGSQFTVRIPLYGAQYPQKKGVEGLSGKRCWLAVRNA
SLCQFLETSLQRSGIVVTTYEGQEPTPEDVLITDEVVNKKWQGRAVVTFC
RRHIGIPLEKAPGEWVHSVAAPHELPALLARIYLIEMESDDPANALPSTD
KAVSDNDDMMILVVDDHPINRSLLADQLGSLGYQCKTANDGVDALNVLNK
NHIDIVLSDVNMPNMDGYRLTQRIRQLGLTLPVIGVTANALAEEKQRCLE
SGMDSCLSKPVTLDVIKQTLTVYAERVRKSRES
>APECO1_499 recT, recombinational DNA repair protein
MNGQLLRHYHCPAGLRNMQMTKQPPIAKADLQKTQGNRAPAAVKNSDVIS
FINQPSMKEQLAAALPRHMTAERMIRIATTEIRKVPALGNCDTMSFVSAI
VQCSQLGLEPGSALGHAYLLPFGNKNEKSGKKNVQLIIGYRGMIDLARRS
GQIASLSARVVREGDEFSFEFGLDEKLIHRPGENEDAPVTHVYAVARLKD
GGTQFEVMTRKQIELVRSLSKAGNNGPWVTHWEEMAKKTAIRRLFKYLPV
SIEIQRAVSMDEKEPLTIDPADSSVLTGEYSVIDNSEE
>APECO1_2565 rhaA, L-rhamnose isomerase
MTTQLEQAWELAKQRFAAVGIDVEEALRQLDRLPVSMHCWQGDDVSGFEN
PEGSLTGGIQATGNYPGKARNASELRADLEQAMRLIPGPKRLNLHAIYLE
SDTPVSRDQIKPEHFKNWVEWAKANQLGLDFNPSCFSHPLSADGFTLSHA
DDRIRQFWIDHCKASRRVSAYFGEQLGTPSVMNIWIPDGMKDITVDRLAP
RQRLLAALDEVISEKLNPAHHIDAVESKLFGIGAESYTVGSNEFYLGYAT
SRQTALCLDAGHFHPTEVISDKISAAMLYVPQLLLHVSRPVRWDSDHVVL
LDDETQAIASEIVRHDLFDRVHIGLDFFDASINRIAAWVIGTRNMKKALL
RALLEPTAELRKLEAAGDYTARLALLEEQKSLPWQAVWEMYCQRHDTPAG
SEWLENVRTYEKEILSRRG
>APECO1_2566 rhaD, rhamnulose-1-phosphate aldolase
MQNITQSWFVQGMIKATTDAWLKGWDERNGGNLTLRLDDADIAPYHDNFH
QQPRYIPLSQPMPLLANTPFIVTGSGKFFRNVQLDPAANLGIVKVDSDGA
GYHILWGLTNEAVPTSELPAHFLSHCERIKATNGKDRVIMHCHATNLIAL
TYVLENDTAVFTRQLWEGSTECLVVFPDGVGILPWMVPGTDEIGQATAQE
MQKHSLVLWPFHGVFGSGPTLDETFGLIDTAEKSAQILVKVYSMGGMKQT
ISREELIALGQRFGVTPLASALAL
>APECO1_1129 rmlA, glucose-1-phosphate thymidylyltransferase
MKTRKGIILAGGSGTRLYPVTMVVSKQLLPIYDKPMIYYPLSTLMLAGIR
DILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFI
GGDDCALVLGDNIFYGHDLPKLMDAAVNKESGATVFAYHVNDPERYGVVE
FDKNGTAISLEEKPLQPKSNYAVTGLYFYDNDVVEMAKNLKPSARGELEI
TDINRIYMEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLK
VSCPEEIAYRKGFVDAEQVKVLAEPLKKNAYGQYLLKMIKGY
>APECO1_445 rnb, exoribonuclease II
MFQDNPLLAQLKQQLHSQTPRAEGVVKATEKGFGFLEVDAQKSYFIPPPQ
MKKVMHGDRIIAVIHSEKERESAEPEELVEPFLTRFVGKVQGKNDRLAIV
PDHPLLKDAIPCRAARGLNHEFKEGDWAVAEMRRHPLKGDRSFYAELTQY
ITFGDDHFVPWWVTLARHNLEKEAPDGVATEMLDEGLVREDLTALDFVTI
DSASTEDMDDALFAKALPDGKLQLIVAIADPTAWIAEGSKLDKAAKIRAF
TNYLPGFNIPMLPRELSDDLCSLRANEVRPVLACRMTFSTDGTIEDNIEF
FAATIESKAKLVYDQVSDWLENTGDWQPESEAIAEQVRLLAQICQRRGEW
RHNHALVFKDRPDYRFILGEKGEVLDIVAEPRRIANRIVEEAMIAANICA
ARVLRDKLGFGIYNVHMGFDPANADALAALLKTHGLHVDAEEVLTLDGFC
KLRRELDAQPTGFLDSRIRRFQSFAEISTEPGPHFGLGLEAYATWTSPIR
KYGDMINHRLLKAVIKGETATRPQDEITVQMAERRRLNRMAERDVGDWLY
ARFLKDKAGTDTRFAAEIVDISRGGMRVRLVDNGAIAFIPAPFLHAVRDE
LVCSQENGTVQIKGETAYKVTDVIDVTIAEVRMETRSIIARPVA
>APECO1_862 rnd, Ribonuclease D
MITTDDALASLCEAVRAFPAIALDTEFVRTRTYYPQLGLIQLFDGEHLAL
IDPLGITDWSPLKAILRDPSITKFLHAGSEDLEVFLNVFGELPQPLIDTQ
ILAAFCGRPMSWGFASMVEEYSGVTLDKSESRTDWLARPLTERQCEYAAA
DVWYLLPITAKLMVETEASGWLPAALDECRLMQMRRQEVVAPEDAWRDIT
NAWQLRTRQLACLQLLADWRLRKARERDLAVNFVVREEHLWSVARYMPGS
LGELDSLGLSGSEIRFHGKTLLALVEKAQALPEEALPQPMLNLMDMPGYR
KAFKAIKSLITDVSETHKISAELLASRRQINQLLNWHWKLKPQNNLPELI
SGWRGELMAEALHNLLQEYPQ
>APECO1_712 rnfC, electron transport complex protein RnfC
MLKLFSAFRKNKIWDFNGGIHPPEMKTQSNGTPLRQVPLAQRFVIPLKQH
IGAEGELCVSVGDKVLRGQPLTRGRGKMLPVHAPTSGTVTAIAPHSTAHP
SALAELSVIIDADGEDCWIPRDGWADYRSRSREELIERIHQFGVAGLGGA
GFPTGVKLQGGGDKIETLIINAAECEPYITADDRLMQDCAAQVVEGIRIL
AHILQPREILIGIEDNKPQAISMLRAVLADSHDISLRVIPTKYPSGGAKQ
LTYILTGKQVPHGGRSSDIGVLMQNVGTAYAVKRAVIDGEPITERVVTLT
GEAIARPGNVWARLGTPVRHLLNDAEFCPSADQMVIMGGPLMGFTLPWLD
VPVVKITNCLLAPSANELGEPQEEQSCIRCSACADACPADLLPQQLYWFS
KGQQHDKATTHNIADCIECGACAWVCPSNIPLVQYFRQEKAEIAAIRQEE
KRAAEAKARFEARQARLEREKAARLERHKSAAVQPAAKDKDAIAAALARV
KEKQAQATQPIVIKAGERPDNSAIIAAREARKAQARAKQAELQQTNDAAT
VADPRKTAVEAAIARAKARKLEQQQANAEPEEQIDPRKAAVEAAIARAKA
RKLEQQQANAEPEEQIDPRKAAVEAAIARAKARKLEQQQANAEPEEQIDP
RKAAVEAAIARAKARKLEQQQANAEPEEQIDPRKAAVAAAIARVQAKKAA
QQKVVNED
>APECO1_1109 sbcB, exonuclease I
MIFDTLADSRNNGFNLMMNDGKQQSTFLFHDYETFGTHPALDRPAQFAAI
RTDNEFNVIGEPEVFYCKPADDYLPQPGAVLITGITPQEARAKGENEAAF
AARIHSLFTVPKTCILGYNNVRFDDEVTRNVFYRNFYDPYAWSWQHDNSR
WDLLDVMRACYALRPEGINWPENDDGLPSFRLEHLTKANGIEHSNAHDAM
ADVYATIAMAKLVKTRQPRLFDYLFTHRNKHKLMALIDVPQMKPLVHVSG
MFGAWRGNTSWVAPLAWHPENRNAVIMVDLAGDISPLLELDSDTLRERLY
TAKTDLGDNAAVPVKLVHINKCPVLAQANTLRPEDADRLGINRQHCLDNL
KILRENPQVREKVVAIFAEAEPFTPSDNVDAQLYNGFFSDADRAAMKIVL
ETEPRNLPALDITFVDKRIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQV
FTPEFLQGYADELQMLAQQYADDKEKVALLKALWQYAKEIV
>APECO1_1865 speD, S-adenosylmethionine decarboxylase
MSEEPVDPKLIDKTEHPGPLPETVVAHLDKSHICVHTYPESHPEGGLCTF
RADIEVSTCGVISPLKALNYLIHQLESDIVTIDYRVRGFTRDINGMKHFI
DHEINSIQNFMSEDMKALYDMVDVNVYQENIFHTKMLLKEFDLKHYMFHT
KPEDLTDSERQEITAALWKEMREIYYGRNMPAV
>APECO1_2483 thiG, thiazole biosynthesis protein ThiG
MAQPGFAPRQWLPGMRRKQCRSCLTINRCSAPPGKRFTNYWSNSTNNKRA
RHWRLISKSSRVSSGRNISCRMATRSCFFRLLQGVEMLRIADKTFDSHLF
TGTGKFASSQLMMEAIRACGSQLVTLAMKRVNLRQHNDAILEPLIAAGVT
LLPNTSGAKTAEEAIFAAHLAREALGTNWLKLEIHPDARWLLPDPIETLK
AAEKLVQQGFVVLPYCGADPVLCKRLEEVGCAAVMPLGAPIGSNQGLETR
AMLEIIIQQATVPVVVDAGIGVPSHAAQALEMGADAVLVNTAIAVADDPV
NMAKAFRLAVEAGLLARQSGPGSRSHFAHATSPLTGFLEASE
>APECO1_1916 thiQ, thiamine transport ATP-binding protein ThiQ
MLKLTDITWLYHHLPMRFSLTVERGEQVAILGPSGAGKSTLLNLIAGFLT
PASGLLTIDDVDHTTTPPSRRPVSMLFQENNLFSHLTVAQNIGLGLNPGL
KLNAAQQKKMHAIAHQMGIDNLMARLPGELSGGQRQRVALARCLVREQPI
LLLDEPFSALDPALRQEMLTLVSSSCQQQKMTLLMVSHSVEDAARIATRS
VVVADGRIAWQGKTDELLSGKASASALLGIKG
>APECO1_368 tonB, TonB protein
MIMTSMTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPI
SVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK
PKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV
TSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSA
KPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ
>APECO1_832 topB, DNA topoisomerase III
MNLFPPVVVVPWLRFVKSMRLFIAEKPSLARAIADVLPKPHRKGDGFIEC
GNGQVVTWCIGHLLEQAQPDAYDSRYARWNLADLPIVPEKWQLQPRPSVT
KQLNVIKRFLHEASEIVHAGDPDREGQLLVDEVLDYLQLAPEKRQQIQRC
LINDLNPQAVERAIDRLRSNSEFVPLCVSALARARADWLYGINMTRAYTI
LGRNAGYQGVLSVGRVQTPVLGLVVRRDEEIENFVAKDFFEVKAHIVTPA
DERFTAIWQPSEACEPYQDEEGRLLHRPLAEHVVNRISGQPAIVTSYNDK
RESESAPLPFSLSALQIEAAKRFGLSAQNVLDICQKLYETHKLITYPRSD
CRYLPEEHFAGRHAVMNAISVHAPDLLPQPVVDPDIRNRCWDDKKVDAHH
AIIPTARSSAINLTENEAKVYNLIARQYLMQFCPDAVFRKCVIELDIAKG
KFVAKARFLAEAGWRTLLGSKERDEENDGTPLPVVAKGDELLCEKGEVVE
RQTQPPRHFTDATLLSAMTGIARFVQDKDLKKILRATDGLGTEATRAGII
ELLFKRGFLTKKGRYIHSTDAGKALFHSLPEMATRPDMTAHWESVLTQIS
EKQCRYQDFMQPLVGTLYQLIDQAKRTSVRQFRGIMAPGGREGKKKDSPR
KRAPKKSPPSEEAGNGVIT
>APECO1_89 torC, cytochrome c-type protein TorC
MRKLWNALRRPSARWSVLALVATGIVIGIALIVLPHVGIKVTSTTEFCVS
CHSMQPVYEEYKQSVHFQNASGVRAECHDCHIPPDIPGMVKRKLEASNDI
YQTFIAHSIDTPEKFEAKRAELAEREWARMKENNSATCRSCHNYDAMDHA
KQHPEAARQMKVAAKDNQSCIDCHKGIAHQLPDMSSGFRKQFDELRASAN
DSGDTLYSIDIKPIYAAKGDKEASGSLLPASEVKVLKRDGDWLQIEITGW
TESAGRQRVLTQFPGKRIFVASIRGDVQQQVKTLEKTTVADTNTEWSKLQ
ATAWMKKGDMVNDIKPIWAYADSLYNGTCNQCHGAPEISHFDANGWIGTL
NGMIGFTSLDKREERTLLKYLQMNASDTAGKAHGDKKEEK
>APECO1_1599 tsx, nucleoside-specific channel-forming protein Tsx
MKKTLLAAGAVLALSSSFTVNAAENDKPQYLSDWWHQSVNVVGSYHTRFG
PQLRNDTYLEYEAFAKKDWFDFYGYADAPVFFGGNSDAKGIWNHGSPLFM
EIEPRFSIDKLTNTDLSFGPFKEWYFANNYIYDMGRNKDGRQSTWYMGLG
TDIDTGLPMSLSMNVYAKYQWQNYGAANENEWDGYRFKVKYFVPITDLWG
GQLSYIGFTNFDWGSDLGDDSGYANNGIKTRTNNSIASSHILALNYDHWH
YSVVARYWHNGGQWNDDAELNFGNGNFNVRSTGWGGYLVVGYNF
>APECO1_4072 uraA, uracil permease transporter
MTRRAIGVSERPPLLQTIPLSLQHLFAMFGATVLVPVLFHINPATVLLFN
GIGTLLYLFICKGKIPAYLGSSFAFISPVLLLLPLGYEVALGGFIMCGVL
FCLVSFIVKKAGTGWLDVLFPPAAMGAIVAVIGLELAGVAAGMAGLLPAE
GQTPDSKTIIISITTLAVTVLGSVLFRGFLAIIPILIGVLVGYALSFAMG
IVDTTPIINAHWFALPTLYTPRFEWFAILTILPAALVVIAEHVGHLVVTA
NIVKKDLLRDPGLHRSMFANGLSTVISGFFGSTPNTTYGENIGVMAITRV
YSTWVIGGAAIFAILLSCVGKLAAAIQMIPLPVMGGVSLLLYGVIGASGI
RVLIESKVDYNKAQNLILTSVILIIGVSGAKVNIGAAELKGMALATIVGI
GLSLIFKLISMLRPEEVVLDAEDADITDK
>APECO1_956 uvrY, response regulator
MRAGIRRILEDIKGIKVVGEASCGEDAVKWCRANAVDVVLMDMSMPGIGG
LEATRKIARSTADVKIIMLTVHTENPLPAKVMQAGAAGYLSKGAAPQEVV
SAIRSVYSGQRYIASDIAQQMALSQIEPEKTESPFASLSERELQIMLMIT
KGQKVNEISEQLNLSPKTVNSYRYRMFSKLNIHGDVELTHLAIRHGLCNA
ETLSSQ
>APECO1_1151 wzb, tyrosine phosphatase
MFNNILVVCVGNICRSPTAERLLQRYHPELKVESAGLGALVGKGADPTAI
SVAAEHQLSLEGHCARQISRRLCRNYDLILTMENRHIERLCEMAPEMRGK
VMLFGHWDNECEIPDPYRKSRETFAAVYTLLERSARQWAQALNAEQV
>APECO1_1863 yacC, hypothetical protein
MLTFIFFKPLVEAMKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAV
FVFLKNDCGYQNLPNGQIRRALVFFAQQNQWDLSNYDTFDMKALGEDSYR
DLSGIGIPVAKKCKALARDSLSLLAYVK
>APECO1_1566 ybaE, hypothetical protein
MRLLNRLNQYQRLWQPSAGKPQTVTVSELAERCFCSERHVRTLLRQAQEA
GWLEWQAQSGRGKRGQLRFLVTPESLRNAMMEQALETGKQQDVLELAQLA
PGELRTLLQPFMGGQWQNDTPTLRIPYYRPLEPLQPGFLPGRAEQHLAGQ
IFSGLTRFDNNTQRPIGDLAHHWETSTDGLRWDFYLRSTLHWHNGDAVKA
SHLHQRLLMLLQLPALDQLFISVKRIEVTHPQCLTFFLHRPDYWLAHRLA
SYCSHLAHPQFPLIGTGPFRLTQFTAELVRLESHDYYHLRHPLLKAVEYW
ITPPLFEKDMGTSCRHPVQITIGKPEELQRVSQVSSGISLGFCYLTLRKS
LRLSLWQARKVISIIHQSGLLQTLEVGENLITASHALLPGWTIPHWQVPD
EVKLPKTLTLVYHLPIELHTMAERLQATLAAEGCELTIIFHNAKNWDDTT
LLAHADLMMGDRLIGEAPEYTLEQWLRCDPLWPHVFDAPAYAHLQSTLDA
VQVMPDEENRFNALKAVFSQLMADATLTPLFNYHYRISAPPGVNGVRLTP
RGWFEFTEAWLPAPSQ
>APECO1_1220 ybjE, hypothetical protein
MSSRISNLLCKDWVFMFSGLLIILVPLIVGYLIPLRQKAALRVINQLLSW
MVYLILFFMGISLAFLDNLASNLLAILHYSAVSITVILLCNIAALMWLER
GLPWRNHHHQEKLPSRIAMALESLKLCGVVVIGFAIGLSGLAFLQHATEA
SEYTLILLLFLVGIQLRNNGMTLKQIVLNRRGMIVAVVVVASSLIGGLIN
AFILDLPINTALAMASGFGWYSLSGILLTESFGPVIGSAAFFNDLARELI
AIMLIPGLIRRSRSTALGLCGATSMDFTLPVLQRTGGLDMVPAAIVHGFI
LSLLVPILIAFFSA
>APECO1_315 ycgC, putative phosphoenolpyruvate kinase (PTS system EI component in bacteria)
MMVNLVIVSHSSRLGEGVGELARQMLMSDSCKIAIAAGIDDPHNPIGTDA
VKVMEAIESVADADHVLVMMDMGSALLSAETALELLAPEIAAKVRLCAAP
LVEGTLAATVSAASGADIDKVIFDAMHALEAKREQLGLPSSDTEISDTCP
PYDEEARSLSVVIKNRNGLHVRPASRLVYTLSTFNADMLLEKNGKCVTPD
SINQIALLQVRYNDTLRLIAKGPEAEEALIAFRQLAEDNFGETEEVAPPT
LRPVPPVSGKAFYYQPVLCTVQAKSTLTVEEEQERLRQAIDFTLLDLMTL
TAKAETCGLDDIAAIFSGHHTLLDDPELQAAASELLQHEHCTAEYAWQQV
LKELSQQYQQLDDEYLQARYIDVDDLLHRTLVHLTQTKEELPQFNSPTIL
LAENIYPSTVLQLDPAVVKGICLSAGSPLSHSALIARELGIGWICQQGEK
LYAIQPEETLTLDVKTQRFNRQG
>APECO1_440 yciM, hypothetical protein
MLELLFLLLPVAAAYGWYMGRRSAQQNKQDEANRLSRDYVAGVNFLLSNQ
QDKAVDLFLDMLKEDTGTVEAHLTLGNLFRSRGEVDRAIRIHQTLMESAS
LTYEQRLLAIQQLGRDYMAAGLYDRAEDMFNQLTDETDFRIGALQQLLQI
YQATSEWQKAIDVAERLVKLGKDKQRVEIAHFYCELALQHMASDDLDRAM
TLLKKGAAADKNSARVSIMMGRVFMAKGEYAKAVESLQRVISQDRELVSE
TLEMLQTCYQQLGKTAEWAEFLQRAVEENTGADAELMLADIIEARDGSEA
AQVYITRQLQRHPTMRVFHKLMDYHLNEAEEGRAKESLMVLRDMVGEKVG
SKPRYRCQKCGFTAYTLYWHCPSCRAWSTIKPIRGLDGL
>APECO1_580 ydcP, putative protease YdcP precursor
MAKIAAIFQLLDKKVTVSSHRLELLSPARDAAIAREAILHGADAVYIGGP
GFGARHNASNSLKDIAELVPFAHRYGAKIFVTLNTILHDDELEPAQRLIT
DLYQTGVDALIVQDMGILELDIPPIELHASTQCDIRTVEKAKFLSDVGFT
QIVLARELNLEQLRAIHQATDATIEFFIHGALCVAYSGQCYISHAQTGRS
ANRGDCSQACRLPYTLKDDQGRVVSYEKHLLSMKDNDQTANLGALIDAGV
RSFKIEGRYKDMSYVKNITAHYRQMLDAIIEERGDLARASSGRTEHFFVP
STEKTFHRGSTDYFVNARKGDIGAFDSPKFIGLPVGEVLKVAKDHIDVAV
TEPLANGDGLNVMIKREVVGFRANTVEKTGENQYRVWPNEMPADLHKIRP
HHPLNRNLDHNWQQALTKTSSERRVAVDIELGGWQEQLILTLTSEEGVSI
THTLDGQFDEANNAEKAMNNLKDGLAKLGQTIYYARNVQINLPGALFVPN
SLLNQFRREAADMLDAARLASYQRGSRKPVADPAPVYPQTHLSFLANVYN
QKAREFYHRYGVQLIDAAYEAHEEKGEVPVMITKHCLRFAFNLCPKQAKG
NIKSWKATPMQLVNGDEVLTLKFDCRPCEMHVIGKIKNHILKMPLPGSVV
ASVSPDDLLKTLPKRKG
>APECO1_6592 ydfG, putative oxidoreductase YdfG
MIVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYI
AQLDVRNRAAIEEMLASLPAEWSNIDILVNNAGLALGMEPAHKASVEDWE
TMIDTNNKGLVYMTRAVLPGMVERNHGHIINIGSTAGSWPYAGGNVYGAT
KAFVRQFSLNLRTDLHGTAVRVTDIEPGLVGGTEFSNVRFKGDDGKAEKT
YQNTVALTPEDVSEAVWWVSTLPAHVNINTLEMMPVTQSYAGLNVHRQ
>APECO1_746 ydhS, conserved hypothetical protein with FAD/NAD(P)-binding domain
MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDDENS
KLMLANIASIEIPPIYCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRI
LLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMIATNQDL
PSETFDLAVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTS
LSGLDAAMAVAIQHGSFIEDDKQHVIFHRDNASEKLNITLMSRTGILPEA
DFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWS
QRIALESLNVDSFAQA
>APECO1_1103 yeeV, toxin of the YeeV-YeeU toxin-antitoxin system
MKTLPVLPGQAASSRPSPVEIWQILLSRLLDQHYGLTQNDTPFADERVIE
QHIEAGISLCDAVNFLVEKYALVLPTSRDSAPVPALS
>APECO1_1153 yegH, hypothetical protein
MTFVIQRIGFVLTMEWIADPSIWAGLITLIVIELVLGIDNLVFIAILAEK
LPPKQRDRARVTGLLLAMLMRLLLLASISWLVTLTQPLFSFRSFTFSARD
LIMLFGGFFLLFKATMELNERLEGKDSNNPTQRKGAKFWGVVTQIVVLDA
IFSLDSVITAVGMVDHLLVMMAAVVIAISLMLMASKPLTQFVNSHPTIVI
LCLSFLLMIGFSLVAEGFGFVIPKGYLYAAIGFSVMIEALNQLAIFNRRR
FLSANQTLRQRTTEAVMRLLSGQKEDAELDAETASMLVDHGNQQIFNPQE
RRMIERVLNLNQRTVSSIMTSRHDIEHIDLNAPEEEIRQLLERNQHTRLV
VTDGDDAEDLLGVVHVIDLLQQSLRGEPLNLRVLIRQPLVFPETLPLLPA
LEQFRNARTHFAFVVDEFGSVEGIVTLSDVTETIAGNLPNEVEEIDARHD
IQKNADGSWTANGHMPLEDLVQYVPLPLDEKREYHTIAGLLMEYLQRIPK
PGEEVQVGDYLLKTLQVESHRVQKVQIIPLRKDGEMEYEV
>APECO1_4382 yeiQ, putative dehydrogenase, NAD-dependent
MNTIASVTLPHHVHAPRYDRQQLQSRIVHFGFGAFHRAHQALLTDRVLNA
QGGDWGICEISLFSGDQLMSQLRAQNHLYTVLEKGADGNQAIIVGAVHEC
LNAKLDSLAAIIEKFCEPQVAIVSLTITEKGYCIDPAIGALDTSNPRIIH
DLQNPEEPHSAPGILVEALKRRRERGLTPFTVLSCDNIPDNGHVVKNAVL
GMAEKRSPELAGWIKEHVSFPGTMVDRIVPAATNESLAEISQHLGVNDPC
AISCEPFIQWVVEDNFVAGRPAWEVAGVQMVNDVLPWEEMKLRMLNGSHS
FLAYLGYLSGFAHISDCMQDRAFRHAARTLMLDEQAPTLRIKDVDLTQYA
DKLIARFANPALKHKTWQIAMDGSQKLPQRMLAGIRIHLGRETDWSLLAL
GVAGWMRYVSGVDDAGNAIDVRDPLSDKIRELVAVSSSEQRVTALLSLRE
IFGDDLPDNPHFVQAIEQAWQQIAQFGAHQALLNTLKI
>APECO1_4380 yeiU, undecaprenyl pyrophosphate phosphatase
MIKNLPQIVLLNIVGLALFLSWYIPVNHGFWLPIDADIFYFFNQKLVESK
AFLWLVALTNNRAFDGCSLLAMGMLMLSFWLKENALGRRRIVIIGLVMLL
TAVVLNQLGQALIPVKRASPTLTFTDINRVSELLSVPTKDASRDSFPGDH
GMMLLIFSAFMWRYFGKVAGLIALIIFVVFAFPRVMIGAHWFTDIIVGSM
TVILIGLPWVLLTPLSDRLITFFDKSLPGKNKHFQNK
>APECO1_4332 yfaS, hypothetical protein
MDTQRFQSQFHWHLSFKFSGAIAACLSLSLVGTGLANADDSLPSSNYVPP
AGGTFFLLADSSFSSSEEAKVRLEAPGRDYRRYQMEEYGGVDVRLYRIPD
PMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSS
QSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQA
KPVEPQQGVKLEGASSNFISPQPGNIYIPLGKQEPGLYLVEAMVGGYRAT
TVVFVSDTVALSKVSGNELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGV
TDGSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTD
RPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVNV
TLDARNGGQGSFRLPENAVAGGYELRLAYRKQVYSSSFRVANYIKPHFEI
GLALDKKEFKTGEAVSGKLQLLYPDGEPVKDARVQLSLRAQQLSMVGNDL
RYAGRFPVSLEGSETVSDDNGHVALNLPAADKPSRYLLTVSASDGAAYRV
TTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWL
RLEDRTSHSGELQSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSG
KGSTSHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQ
SLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQN
AGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVD
EMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGA
TNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWR
ITARGMNGDGLVGQGRAYLRSEKSLYMKWSMPTVYRMGDKPAAGLFIFSQ
QDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNG
QVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALTLPEQASNIRLQSSET
PQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQ
MIQDNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQALGVT
QQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAI
ARRGTKTEDFSEEDTSDINDSLILDTPESPLADAVANVLTMTLLKKAQLK
STVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQS
TIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEDWRWVGQGVP
DILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLIPGEEEMSF
TLLPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTW
GISVNKPNPAKQQGQLLEKARNEMGELGYMVPVKELTGTVTFRHLLRFSQ
KGQFVLPPARYVRSYAPAQQSVAPGSEWIGMQVK
>APECO1_4126 yfeK, hypothetical protein
MTLPAYAKLTAHEEARINAMLEGLAQKKDLIFVRNGDEHTCDEAVSHLRL
KLGNTRNRIDTAEQFIDKVASSSSITGKPYIVKMPGKSDENAQPFLHALI
AQTDKTVPAQ
>APECO1_4090 yffH, putative NUDIX hydrolase
MTQQITLVKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATI
LLYNAKKKTVVLIRQFRVATWVNGNESGQLIETCAGLLDNDEPEVCIRKE
AIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDSQRANAGGGVED
EDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQMSHLMD
>APECO1_3649 ygfJ, hypothetical protein
MSAIDCIITAAGLSSRMGQWKMMLPWQQGTILDTSIKNALQFCSRIILVT
GYRGNELHERYANQSNITIIHNPDYAQGLLTSVKAAVPAVQTEHCFLNHG
DMPTLTIDIFRKIWSLRNDGAILPLHNGIPGHPILVSKPCLMQAIQRPNV
TNMRQALLMGEHYSVEIENAEIILDIDTPDDFITAKKRYTEI
>APECO1_3565 yggM, hypothetical protein
MLMTGNVWADGEPPTENILKDQFKKQYHGILKLDVITLKNLDAKGNQATW
SAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASG
WSVNFYSFQAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNNQ
NNSIPALKKDVKALDKQMVAAQKAADAYWGKDANGKQMTREDAFKKIHQQ
RDDFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQKRDFDIN
EQRRQTFLQSQKLSRKLQDDWITLEKGQYPLTMKVSEINSKKVAILMKID
DINQANERWKKDTEQLRRNGVIK
>APECO1_3360 ygiF, putative adenylate cyclase
MAQEIELKFIVNHSAVEALRDHLNTLDGEHHDPVQLLNIYYETPDNWLRG
HDMGLRIRGENGRYEMTMKVAGRVTGGLHQRPEYNVALSAPTLDLAQLPT
EVWPNGELPADLASRVQPLFSTDFYREKWLVEVDGSQIEIALDQGEVKAG
EFAEPICELELELLSGDMRAVLKLANQLVSQTGLRQGSLSKAARGYHLAQ
GNPAREIKPTTILHVAAKADVEQGLEAALELALAQWQYHEELWVRGNDAA
KEQVLAAIGLVRHTLMLFGGIVPRKASTHLRDLLTQCEATIASAVSAVTA
VYSTETAMAKLALTEWLVSKAWQPFLDAKAQSKMSDSFKRFADIHLSRHA
AELKSVFCQPLGDRYRDQLPRLTRDIDSILLLAGYYDPVVAQDWLENWQG
LRHAIATGQRIEIEHFRNEANNQEPFWLHSGKR
>APECO1_2749 yidZ, hypothetical protein
MKKSITTLDLNLLLCLQLLMQERSVTKAAKRMNVTPSAVSKSLAKLRAWF
DDPLFVNSPLGLSPTPLMVSMEQNLAEWMQMSNQLLDKPLHETPRGLKFE
LAAESPLMMIMLNALSKRIYQRYPQATIKLRNWDYDSLDAITRGEVDIGF
SGRESHPRSRELLSSLPLAIDYEVLFSDVPCVWLRKDHPALHEAWNLDTF
LRYPHISICWEQSDTWALDNVLQELGRERTIAMSLPEFEQSLFMAAQPDN
LLLATAPRYCQYYNQLHQLPLVALPLPFDESQQKKLEVPFTLLWHKRNSR
NPKIVWLRETIKNLYASMA
>APECO1_1689 ykgD, putative DNA-binding transcriptional regulator
MDALSRLLMLNAPQGTIDKNCVLGSDWQLPHGAGELSVIRWHALTQGAAK
LEMPTGEIFTLRPGNVVLLPQNSAHRLSHVDNESTCIVCGTLRLQHSARY
FLTSLPETLFVAPVNHSVEYNWLREAIPFLQQESRSAMPGVDALCSQICA
TFFTLAVREWIAQVNTEKNILSLLLHPRLGAVIQQMMEMPGHAWTVESLA
SIAHMSRASFAQLFRDVSGTTPLAVLTKLRLQIAAQMFSREMLPVVVIAE
SVGYASESSFHKAFVREFGCTPGEYRERVRQLAP
>APECO1_532 ynaA, Rac prophage; predicted tail protein
MAEQTSRLAIIIDSTGAKNNADNLASSLVKMTQAGETAANSAGKVTKATE
DEKNALAKLKAAIDPVGAAIDTVGRRYSELKKFFDKGLIDKEEYEFLVRK
LNETTEELSGVAQAQREAEKAGKLAAAQQEAQAQVFQRMLDKIDPLAAAL
RNLEQQQDELNAAFASGKINGSQFENYSRKIQETRRELTGEAQAEREAAK
AHDEQVAALQRLIAQLDPVGTAFNRLVEQQKQLNEAKAKGMLSPEMYEEL
SGKLRAMRSELEVTQSQLSKTGMSAKQTAFAMRMLPAQMTDIVVGLSTGQ
SPFMVLMQQGGQLKDMFGGIGPAIKGVGSYVLGLINPFTLAAAAVGVLGL
AYYKGSQEQDEFNKSLILTGNQLGTTSGQLADIAQRAGNAADSTTGAAAA
VLNQLVRSGKVASSSLEQVTTAIVKTSEVTGISTEQLVNDFNEIAKDPVS
AISKLNDQYHFLTLATYNQIKALQDEGNQQEAARIATEAYSSSMIQRTNQ
IKENLGYLETAWKAVADSAKWAWDSMLDIGREASLDQKISDVLRQIDEIE
KNTRPGVFGLGGIGDGGAQNKKLARLKQQLGVLQAEKIAQDVLNSSINDY
NKRQQEGIELRQRADAFSKQYQTREQQRASELAKLEKLKNQYSKEEYNNL
IAQINERYKDPKQPKAKGYSDDAAQRMIDHLNQQNALLSSQAELTVKLSS
SEQELVKWRQQIADLESRPSSKLTQDQKSLLLHREEITALMEKNVAIEKN
NRLIKESAEIAAWRDSLQASIDNRQQGYDIQIAGYGVGDKNQQRQQELLR
IEREYNNQRLQLERDYADKSRGMSNHVFQEKMQALNDALEREKEIVSQKN
EQLDIQAGDWISGASQGFNNWLDDTKDIGAQIKSTTTQMFDGMTDALGDF
VTTGKANFRSFATSVISDLSRIALKASITGIFDSISNSSSGGILGTIGSA
ISKFIPNAKGGVYESPSLSTYSNGIYDSPQFFAFAKGAGVFGEAGPEAIM
PLTRTSDGSLGVRAINSKSGNGGRDITYAPVYQITIQNDGQNGEIGPQAI
KALMGMVDQRVQGALLNMRRDGGMLSG
>APECO1_561 ynbC, putative methylase
MENSRIPGEHFFTTSDNTALFYRHWPTLLPGAKKVIVLFHRGHEHSGRLQ
HIVDELAMPDTAFYAWDARGHGQTSGPRGYSPSLARSVQDVDEFVRFAAS
DSQVGLEEVVVIAQSVGAVMVATWVHDYAPAIRGLVLASPAFKVKLYVPL
ARPGLALWHRLRGLFFINSYVKGRYLTHDRQRVASFNNDPLITRAIAVNI
LLDLYKTSERIVSDAAAITLPTQLLISGDGYVVHRQPQIDFYQRLRSPLK
ELHLLPGFYHDTLGEENRAQAFEKMQSFISRLYANKSQKFDYQHEDRTGP
SADRWRLLSGGPVPLSPVDLAYRFMRKAMKLFGAHSAGLHLGMSTGFDSG
SSLDYVYQNQPQGSNAFGRFIDKIYLNSVGWRGIRQRKTHLQMLIKQAVA
HLHAKGLAVRVVDIAAGHGRYVLDALANEPAVSDILLRDYSELNVAQGQE
MIAQRGMSGRVRFEQGDAFNLEELSALTPRPTLAIVSGLYELFPENEQVK
NSLAGLANAIDPGGILIYTGQPWHPQLELIAGVLTSHKDGKPWVMRVRSQ
GEMDSLVHDAGFDKCTQRIDVWGIFTVSMAVRRDN
>APECO1_755 ynhG, hypothetical protein
MKRASLLTLTLIGAFSAIQAAWAVDYPLPSTGSRLVGQNQTYTVQEGDKN
LQAIARRFDTAAMLILEANNTIAPVPKPGTTITIPSQLLLPDAPRQGIIV
NLAELRLYYYPPGENIVQVYPIGIGLQGLETPVMETRVGQKIPNPTWTPT
AGIRQRSLERGIKLPPVVPAGPNNPLGRYALRLAHGNGEYLIHGTSAPDS
VGLRVSSGCIRMNAPDIKALFSSVRTGTPVKVINEPVKYSVEPNGMRYVE
VHRPLSAEEQQNVQTMPYTLPAGFTQFKDNKAVDQKLVDKALYRRAGYPV
AVSSGATPAASNAPSVESAQNGEPEQGNMLRATQ
>APECO1_3283 yraM, hypothetical protein
MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFY
LQQMQQSSDDTRINWQLLAIRALVKEGKTGQAVELFNQLPQELNDTQRRE
KTLLAAEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRP
SIDLLRALIAQEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENI
LQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKMLPTQLVNVKA
FKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVVAQVAAAPA
ADVAEQPQPQTVDGVASPAQASVSDLTGEQPAAQSVPVSAPATSTAAVSA
PANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNT
PLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIP
RSSLGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGS
PITPRETTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFI
KPMIAMRNGSQSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGN
LPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQVQGFEINGNTG
SLTANPDCVINRKLSWLQYQQGQVVPAS


# Escherichia coli W3110, K-12

>gid:76135  yaiT  hypothetical protein
MHSWKKKLVVSQLALACTLAITSQANAANYDTWTYIDNPVTALDWDHMDK
AGTVDGNYVNYSGFVYYNNTNGDFDQSFNGDTVNGTISTYYLNHDYADST
ANQLDISNSVIHGSITSMLPGGYYDRFDADGNNLGGYDFYTDAVVDTHWR
DGDVFTLNIANTTIDDDYEALYFTDSYKDGDVTKHTNETFDTSEGVAVNL
DVESNINISNNSRVAGIALSQGNTYNETYTTESHTWDNNISVKDSTVTSG
SNYILDSNTYGKTGHFGNSDEPSDYAGPGDVAMSFTASGSDYAMKNNVFL
SNSTLMGDVAFTSTWNSNFDPNGHDSNGDGVKDTNGGWTDDSLNVDELNL
TLDNGSKWVGQAIYNVAETSAMYDVATNSLTPDATYENNDWKRVVDDKVF
QSGVFNVALNNGSEWDTTGRSIVDTLTVNNGSQVNVSESKLTSDTIDLTN
GSSLNIGEDGYVDTDHLTINSYSTVALTESTGWGAD


# Salmonella enterica subsp. enterica serovar Newport str. SL254, SL254

>SNSL254_pSN254_0116 hypothetical protein
MMEDAMFKSSVCSYENRGPYGNNKYRGNCSGFIVKDFIESYMRKPNGLVA
DPSVGGGSSIDVANELGVRFKGTDLHQGFNLLRDDFLSFLGEPAHLIWWH
PPYWDMIQYSGKQWGEPNKWDMSRMNLPEFVEALELAVMNIHDACERGGH
YGILMGNLRRDGDYFNLSSLVERIAPGKLVDEIIKTQHNCVSDRTQYSGK
LVRIAHEKLLVFRRNDVASSLCLLAAVHRRATNMVSTTWKAAIRRTLQGK
TLKLEQIYKEIEPYAKHRENNHWQAKVRQVLQDARFFIRIEVGVYALAE


# Shigella flexneri 5 str. 8401, 8401

>SFV_2500 hypothetical protein
MNAPYGNRSADDIISLTKKMLTFLQSRNVKLVAVACNTISSTLESEEYSG
YAKSFPFPILSIIEPAVEDVIRQQYKNVGIIATEFTIKTGCHKELIKKLN
STINVFGEPSKNLAMLIEEGNLNAPAILNNIKKHVNHLISLHPVNEIILG
CTHYPIVQNFFEAVAPDIKFINPAHDQAISIKNHLGQLNLLNSSNIGTLQ
INTSGSMEIYKTVLGELSITKPHTFSIRQF
>SFV_1444 hypothetical protein
MKKIPLGTTDITLSRMGLGTWAIGGGPAWNGDLDRQICIDTILEAHRCGI
NLIDTALGYNFGNSEVIVGQALKKLPREQVVVETKCGIVWERKGSLFNKV
GDRQLYKNLSPESIREEVEASLQRLGIDYIDIYMTHWQSVPPFFTPIAET
VAVLNELKAEGKIRAIGAANVDADHIREYLQYGELDIIQAKYSILDRAME
NELLPLCRDNGIVVQVYSPLEQGLLTGTITRDYVPGGARANKVWFQRENM
LKVIDMLEQWQPLCARYQCTIPTLALAWILKQSDLISILSGATAPEQVRE
NVAALNINLSDADATLMREMAEALER
>SFV_1737 putative ATP-binding component of a transport system
MTQPVLDIQQLHLSFPGFNGDVHALNNVSLQINRGEIVGLVGESGSGKSV
TAMLIMRLLPTGSYCVHRGQISLLGEDVLNAREKQLRQWRGARVAMIFQE
PMTALNPTRRIGLQMMDVIRHHQPISRREARAKAIALLEEMQIPDAVEVM
SRYPFELSGGMRQRVMIAPAFSCEPQLIIAGEPTTALDVTVQLQVLRLLK
HKARASGTAVLFISHDMAVVSQLCDSVYVMYAGSVIESGGTANVIHHPRH
PYTIGLLQCAPEHGVPRQPLPAIPGTVPNLTHLPDGCAFRDRCYAAGAQC
ENVPALTACGDNNQHCACWYPQQEVISV
>SFV_4104 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMKNELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SFV_1396 hypothetical protein
MALNTPQITPTKKITVRAIGEELPRGDYQRCPQCDMLFSLPEINSHQSAY
CPRCQAKIRDGRDWSLTRLAAMAFTMLLLMPFAWGEPLLHIWLLGIRIDA
NVMQGIWQMTKQGDAITGSMVFFCVIGAPLILVSSIAYLWFGNRLGMNLR
PVLLMLERLKEWVMLDIYLVGIGVASIKVQDYAHIQAGVGLFSFVALVIL
TTVTLSHLNVEELWERYYPQRPATRRDEKLRVCLGCHFTGYPDQRGLCPR
CHIPLRLRRRHSLQKCWAALLASIVLLLPANLLPISIIYLNGGRQEDTIL
SGIMSLASSNIAVAGIVFIASILVPFTKVIVMFTLLLSIHFKCQQGLRTR
ILLLRMVTWIGRWSMLDLFVISLTMSLINRDQILAFTMGPAAFYFGAAVI
LTILAVEWLDSRLLWDAHESGNARFDD
>SFV_1464 hypothetical protein
MWREGKDFPPSPARMDALLKAGTLRLSLTFNPAHAQQKIASGDLPASSYS
FGFREGMIGNVHFVTIPANANASAAAKVVANFLLSPDAQLRKADPVVWGD
PSVLDPQKLPDGQREILQSRMPQDLPPVLAEPHAGWVNALEQEWLRRYGT
H
>SFV_0017 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYLQGLMRRYL
>SFV_1418 hypothetical protein
MFAGLPSLTHEQQQKAVERIQELMAQGMSSGQAIALVAEELRANHSGELI
VARFEDEDE
>SFV_2301 hypothetical protein
MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNN
LQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLG
IEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQR
SGLSKLLEPLLFAATSDS
>SFV_1750 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYLCHL
>SFV_2042 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYPNMQATSENG
>SFV_2119 hypothetical protein
MTEKVKQHAAPVTGSDEIDIGRLVGTVIEARWWVIGITVVFALCAVVYTF
FATPIYSADALVQIEQNSGNSLVQDIGSALANKPPASDAEIQLIRSRLVL
GKTVDDLDLDIAVSKNTFPIFGAGWDRLMGRQNETVKVTTFNRPKEMADQ
VFTLNVLDDKNYTLSSDGGFSARGQAGQMLKKEGVTLMVEAIHARPGSEF
TVTKYSTLGMINQLQNSLTVTENGKDAGVLSLTYTGEDREQIRDILNSIA
RNYQEQNIERKSAEASKSLAFLAQQLPEVRSRLDVAENKLNAFRQDKDSV
DLPLEAKAVLDSMVNIDAQLNELTFKEAEISKLYTKVHPAYRTLLEKRQA
LEEEKAKLNGRVTAMPKTQQEIVRLTRDVESGQQVYMQLLNKEQELKITE
ASTVGDVRIIDPAITQPGVLKPKKGLIILGAIILGLMLSIVGVLLRSLFN
RGIESPQVLEEHGISVYASIPLSEWQKARDSVKTIKGVKRYKQSQLLAVG
NPTDLAIEAIRSLRTSLHFAMMQAQNNVLMMTGVSPSIGKTFVCANLAAV
ISQTNKRVLLIDCDMRKGYTHELLGTNNVNGLSEILIGQGDITTAAKPTS
IAKFDLIPRGQVPPNPSELLMSERFAELVNWASKNYDLVLIDTPPILAVT
DAAIVGRHVGTTLMVARYAVNTLKEVETSLSRFEQNGIPVKGVILNSIFR
RASAYQDYGYYEYEYKSDAK
>SFV_0778 hypothetical protein
MKKPVVIGLAVVVLAAVVAGGYWWYQSRQDNGLTLYGNVDIRTVNLSFRV
GGRVESLAVDEGDAIKAGQVLGELDHKPYEIALMQAKAGVSVAQAQYDLM
LAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRTISANDLENA
RSSRDQAQATLKSAQDKLRQYRSGNREQDIAQAKASLEQAQAQLAQAELN
LQDSTLIAPSDGTLLTRAVEPGTVLNEGGTVFTVSLTRPVWVRAYVDERN
LDQAQPGRKVLLYTDGRPDKPYHGQIGFVSPTAEFTPKTVETPDLRTDLV
YRLRIVVTDADDALRQGMPVTVQFGDEAGHE
>SFV_1823 hypothetical protein
MGWRKRLASPCQQRALAAPTMPHKFVATGNVLVKTVLVSEAESETLGTVC
FMTGISPLLRELLIAINQLPPSQSTTDKQQLRFSALETLILQEIKMVVKM
SLELPWPNDERLQQLCENLLNNQGYLPTLDNLADKINVSSRTLMRLFVKE
TGLTFRHWVQQMHVISAVTLLDDGYSLTKIAHRLGYASAESFGNMFKRRT
GYSPGKFTRRLTMHNYAITRQMI
>SFV_1716 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYR
>SFV_1474 putative aldehyde dehydrogenase
MTLWINGDWITGQGASRVKRNPVSGEVLWQGNDADAAQVEQACRAARAAF
PRWARLSLAERQVVVERFAGLLESNKAELTAIIARETGKPRWEAATEVTA
MINKIAISIKAYHVRTGEQRSEMPDGAASLRHRPHGVLAVFGPYNFPGHL
PNGHIVPALLAGNTIIFKPSELTPWSGEAVMRLWQQAGLPPGVLNLVQGG
RETGQALSALEDLDGLLFTGSANTGYQLHRQLSGQPEKILALEMGGNNPL
IIDEVADIDAAVHLTIQSAFVTAGQRCTCARRLLLKSGAQGDAFLASLVA
VSQRLTPGNWDDEPQPFIGGLISEQAAQQVVTAWQQLEAMGGRTLLAPRL
LQAGTSLLTPGIIEMTGVGGVPDEEVFGPLLRVWRYDTFDEAIRMANNTR
FGLFCGLVSPEREKFDQLLLEARAGIVNWNKPLTGAASTAPFGGIGASGN
HRPSAWYAADYCAWPMASLESDSLTLPATLNPGLDFSDEVVR
>SFV_1874 putative membrane protein precursor
MRKLYAAILSAAICLAVSGAPAWASEHQSTLSAGYLHASTNAPGSDDLNG
INVKYRYEFTDTLGLVTSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVM
AGPSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDD
NRHSNTSLAWGAGVQFNPTESVAIDLAYEGSGSGDWRTDGFIVGVGYKF
>SFV_1344 hypothetical protein
MSITGVNIGRLFREGFSNYDPIGVLNAMASQRAKEARGGELQINELLPAS
LDEAKAHGLTERDVYEATDYYKTPRGQQPGGATKMLFSHAQKTLAWDAFA
FTEVLLTQPVMVVVGEKVGAFGAYRDGLEVYGRAMVSQDRQLVSLPDFSH
YELYDKPEAVQEALAKVIPFFNTHLG
>SFV_3418 IS2 ORF2
MSRAQLHVILRRTDDWMDGPRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRLMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SFV_3931 putative inner membrane lipoprotein
MNLKKIFFSAVTVSVLCALTGCDYIEEGKPESSLLKQQEEHNNKIVLLEK
QQAQLKSQLETIQKQQTGIINSTKTLTHVIKSVKDQQNTFIFTEFNPAKT
KYFILNNGSVALAGRVLSIDATENGSVIHISLVNLLSTPISNIGFNATWG
GEKPVDAKEFARWQQLLFNTSMKSTLKLLPGQWQDINLTLKGVSPNNLGY
LKLAINMENIQFDNLPSAENRQKRSKK
>SFV_2697 putative helicase
MTTPVWRNDDLEGAVIGAFFLRGADPEVMDILATLPADVFSVRAHQDIYT
GICRQARVSGVIDPVLLCNEMPELAPVITDTGRKTWVKSSLEHYVAALRR
NAALRDAEKTLNEALQKLRDAHTCEAAEDALKDAQNMMVTLSTGKGVIQP
VHIDDVLPEVVERVECRNQGLEKSRTLMTGIDELDAKTGGMEPGDLVFIA
ARPSMGKTELALDIIDKVTEQGHGVLLFTMEMANIQIGERMVSAAGGMPV
SRLKSVAHFEDEDWTRFSQGVGRMTGRNIWMVDQANLAIDEICATTKHHL
IKYPETALVVVDYLGLIKTRTTGRHDLAVGEISKGLKGLAKSGGFPLIAL
SQLSRGVESRPNKRPMNSDLKNSGEIEADADIILMLYRDEVYNPDTQARG
IAEINITKQRNGSLGTIYRRFYNGHFLPVDQESARVLSTPMKPGNPRRYS
NKRTDSSKMERFF
>SFV_1153 hypothetical bacteriophage protein
MEHYWLFVTARHSYYLKQTDTTEENAMPMKFDEILKQRDKYHADNMETMN
ITDYRAFLETGALIEKDHHGFVRCALSGEMLAVNPEQTDALIEFLKGIRD
>SFV_2889 hypothetical protein
MNVFNPAQFRAQFPALQDAGVYLDSAATALKPEAVVEATRQFYSLSAGNV
HRSQFAEAQRLTARYEAAREKVAQLLNAPDDKTIVWTRGTTESINMVAQC
YARPRLQPGDEIIVSVAEHHANLVPWLMVAQQTRAKVVKLPLNAQRLPDV
DLLPELITPRSRILALGQMSNVTGGCPDLARAITFAHSAGMVVMVDGAQG
AVHFPADVQQLDIDFYAFSGHKLYGPTGIGVLYGKSELLEAMSPWLGGGK
MIHEVSFDGFTTQSAPWKLEAGTPNVAGVIGLSAALEWLADYDINQAESW
SRSLATLAEEALAKRPGFRSFRCQDSSLLAFDFAGVHHSDMVTLLAEYGI
ALRAGQHCAQPLLAELGVTGTLRASFAPYNTKSDVDALVNAVDRALELLV
D
>SFV_4222 hypothetical protein
MWAFARRVLNVEDILCLPIQDCDKSHIWLLVKDDQRLE
>SFV_1381 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYLTRYLKQNSMH
>SFV_1021 putative acetyltransferase
MKLSLSSPPYADAPVVVLISGLGGSGSYWLPQLAVLEQEYQVVCYDQRGT
GNNPDTLAEDYSIAQMAAELHQALVAAGIERYAVVGHALGALVGMQLALD
YPASVTMLVSVNGWLRINAHTRRCFQVRERLLYSGGAQAWVEAQPLFLYP
ADWMAARAPRLEAEDALALAHFQGKNNLLRRLNALKRADFSHHADRIRRP
VQIICASDDLLVPTACSSELHAALPDSQKMVMPYGGHACNVTDPETFNAL
LLNGLASLLHHREAAL
>SFV_3919 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYPDLRFFFVVPRKIAFQLTRWLYLPHRAGILLSWFF
>SFV_3950 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYPILSVMPFMKASNFLMIVMF
>SFV_1473 hypothetical protein
MMVIRPVERSDVSALMQLASKTGDGLTSLPANEATLSARIERAIKTWQGE
LPKSEQGYVFVLEDSETGTVPGICAIEVAVGLNDPWYNYRVGTLVHASKE
LNVYNALPTLFLSNDHTGSSELCTLFLDPDWRKEGNGYLLSKSRFMFMAA
FRDKFNDKVVAEMRGVIDEHGYSPFWQSLGKRFFSMDFSRADFLCGTGQK
AFIAELMPKHPIYTHFLSQEAQDVIGQVHPQTAPARAVLEKEGFRYRNYI
DIFDGGPTLECDIDRVRAIRKSRLVEVAEGQPAQGDFPACLVANENYHHF
RVVLARTDPATERLILTAAQLDALKCHAGDRVRLVRLCAEEKTA
>SFV_1442 putative aldolase
MLADIRYWENDATNKYYAIAHFNVWNAEMLMGVKDAAEEAKSPVIISFST
GFVGNTSFEDFSHMMVSMAQKATVPVITHWDHGRSIEIIHNAWTHGMNSL
MRDASAFDFEENIRLTKEAVDFFHPLGIPVEAELGHVGNETVYEEALAGY
HYTDPDQAAEFVERTGCDSLAVAIGNQHGVYTSEPQLNFEVVKRVRDAVS
VPMVLHGASGISDADIKTAISLGIAKINIHTELCQAAMVAVKENQDQPFL
HLEREVRKAVKERALEKIELFGSDGKAE
>SFV_1023 putative synthetase
MMTTLTARPEAITFDPQQSAQIVVDMQNAYATPGGYLDLAGFDVSTTRPV
IANIQTAVTAARAAGMLIIWFQNGWDEQYVEAGGPGSPNFHKSNALKTMR
KQPQLQGKLLAKGSWDYQLVDELVPQPGDIVLPKPRYSGFFNTPLDSILR
SRGIRHLVFTGIATNVCVESTLRDGFFLEHFGVVLEDATHQAGPEFAQKA
ALFNIETFFGWVSDVETFCDALSPTSFARIA
>SFV_0398 hypothetical protein
MRSTPYDFIQLIKEVIMLGKTAALYDVDKTLKNARVELKTSPDAKNKLRE
AAQAVGVDLSAFILSAAMERAESVLDNQRRRELSNQSWELMNQLITEPAQ
PTLALKALMKRKNSDGRQA
>SFV_0926 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>SFV_1712 hypothetical protein
MFISCSHYTMNAYELQALRHIFAMTIDECATWIAQTGDSESWRQWENGKC
AIPDCVVEQLLAMRQQRKKHLHAIIEKINNRIGNNTMRFFPDLTAFQQVY
PDGNFIDWKIYQSVTAELYAHDLERLC
>SFV_1455 hypothetical protein
MKKVLLQNHPGSEKYSFNGWGIFNSNFERVIKENKAMLLCKWGFYLTCVV
AVMFVFAAITSNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHY
YQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQY
HVLPFDSIDIISKRRESLENQWGIEDSESYCALMEHFLSGDHGANTFKAN
MEEAPEQVIALLNKFAVFPSDYISDWVMLPTY
>SFV_1516 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLAPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALHPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>SFV_2668 putative transport protein
MSSLSQAASSVEKRTNARYWIVVMLFIVTSFNYGDRATLSIAGSEMAKDI
GLDPVGMGYVFSAFSWAYVIGQIPGGWLLDRFGSKRVYFWSIFIWSMFTL
LQGFVDIFSGFGIIVALFTLRFLVGLAEAPSFPGNSRIVAAWFPAQERGT
AVSIFNSAQYFATVIFAPIMGWLTHEVGWSHVFFFMGGLGIVISFIWLKV
IHEPNQHPGVNQKELEYIAAGGALINMDQQSTKVKVPFSVKWGQIKQLLG
SRMMIGVYIGQYCINALTYFFITWFPVYLVQARGMSILKAGFVASVPAVC
GFIGGVLGGIISDWLMRRTGSLNTARKTPIVMGMLLSMVMVFCNYVNVEW
MIIGFMALAFFGKGIGALGWAVMADTAPKEISGLSGGLFNMFGNISGIVT
PIAIGYIVGTTGSFNGALIYVGVHALIAVLSYLVLVGDIKRIELKPVAGQ
>SFV_1019 hypothetical protein
MNIVDQQTFRDAMSCMGAAVNIITTDGPAGRAGFTASAVCSVTDTPPTLL
VCLNRGASVWPVFNENRTLCVNTLSAGQEPLSNLFGGKTPMEHRFAAARW
QTGVTGCPQLEEALVSFDCRISQVVSVGTHDILFCAIEAIHRHTTPYGLV
WFDRSYHALMRPAC
>SFV_1842 putative resistance protein
MLAFTWIALRFIHFTSLMLVFGFAMYGAWLAPLMIRRLLTKRSLRLQQHA
AVWSLISATAMLAVQGGLMGTGWTDVFSPNIWQAVLQTQFGGVWLWQIVL
ALVTLIVALMQPRNMPRLLFMLTTAQFILLAGVGHATLNEGVTAKIHQTN
HAIHLICAAAWFGGLLPVLWCMQLIKGRWRHQAIQALMRFSWCGHFAVIG
VLASGVLNALLITGFPPTLTTYWGQLLLLKAILVMIMVVIALANRYVLVP
RMRQDEDRAAPWFVWMTKLEWAIGAVVLVIISLLATLEPF
>SFV_3518 hypothetical protein
MFVFLAVELSLLFIVISAGVSLIRQKVPDHKIQQMMGARKGRGYLLAALL
GAVTPFCSCSTIPMLRGLLSAKAGFGPTLTFLFVSPLLNPIIVGLMWMTF
GWKVTLLYAIIAAGVSVLASIILDSLGFERHIIASKSSSANCCAPAKTSP
GTTYTPIKVSCCSPAAKAIENPVVTCCSTKAVVSINPIKLATKDALQQFK
DVLPYLLLGVLIGSFIYGFIPSAWIAAHAGADNPFAIPLSAVVGIPLYIR
AEAVIPLASVLMTKGMGLGALMALIIGSAGASLTEVILLKSMFRMPMIAA
FLTVILGMAILMGYLTQLLF
>SFV_1038 Rtn-like protein
MLIIDQSAITRFYWQLYYNFLGCITLMRGETCMVFNKKMFVLIIIPGILG
VLLSFAMSIFQMNRDTTITAGILLKQLDNVTQIVQHTTKLTSILVMKPCK
DILEQLIANGALTPYVRTTGLIENNFQICSSVSGFKKMNVNDVYGTSFHN
KNKESRIVSISGTSFVPGKTAIVFLMPIGNDMTAFSIVESRYIYDLMDVL
DDENDDSFSLRFTEGPAIISGVNNNDRLYMLKRDFNSAISQARLTVTTPM
ISLYPYIISNVLYILPLSIILSFILYFLWQHWMSRKMSLAEEIKKGMSSG
EFSVHYQPVCDTTTKACLGVEALMRWQREDGKNISPVVFIRAAEEENLII
PLTKHLFELIIQDVQSWKVKKPFHLGINIAAEHLAHPDFVGDVLHIKNAI
SDKFNIVLEITERNLVEDTDLALQKINELRIHGCEFAVDDFGTGYCSLGL
LQKLSVDYLKIDKSFIDTLTTAGRRNASFRYYY
>SFV_0392 hypothetical protein
MKAKSTLLLMILSVFYSHQVLSSEPTQPDVYSVVEKKLNSALPLEENTTF
RSQAEWFNLQYELLVSGYPERAYNLLSKLETESKNAKYYLDLTASRVAEE
EAPETALTFLSKIGLNKPATLTSFESYINSWIKSKQPEKAMQLLSLDENA
LNYYLPTVLVAYQIVPDKAISIYNSVYGDNVVIPYQQVTMLLTVAEGYQT
KGDIKNAIHYADKALNMFDKAIAVDSSRESHYSDEYLKLMNIYAANGNKE
KAVVLSQRLHKAITESQSNYLSTLEGILLFYRNNDMQQDYQNSLSNYIVF
IDKIFAFSPSTRSELSLINLLNSLNEVELMRKRLAFLMTAPEYACYDSQY
CYEDKVSALKILYRHHDNELMDKYFTTLLKEISNQPLEDWDIIISNTGEK
FSEVGLTKYAQHLASFAEEFYIKEQKSHSESEMRGLFSSLAELYSFGNDT
ASANRVFHQHVPAFNTDHMIQYFINAKEWNNARELLIKEEMLGMHNLILL
ENICMQKNAECMAHITFTLNKLTTQPAITKIDSVGNEQLYQIGKIYHKLE
IKPEPEQQVLIQSLYDRASGSASATQ
>SFV_2496 hypothetical protein
MDAFEVLANKGAELVAQRDKTANEGERALLNKQIKAIRMAQFKLISNEVI
ETIKPQIPQIVADAIKAAGLEKRIAGLKTNAEKMEMHKQSGAPNPDEYFS
PDVEFMAQVEERLGSCLTEEQRRYFDGVDSSAGIDLNSYFGREIEHFDAA
PSMPEPEPEMATDPEAEQRKRVFNAIGIRY
>SFV_0745 putative bacteriophage protein
MLNVAIENQNGWNYSAPAPHKTGAGIATPTMTTAHNRAQAVFLCVKHSHI
QIMVGRAGQPQGWPVSVVTGCSNPVRLTTHEIATSGGESFKLTIEAAIMA
TILTLSHPDATIENGRAVTTSVAVAKFFRKMHKNVIQKIETLECSREFNR
LNFKPVTYTDAKGEKRPMYQITKNGFVFLVMGFTGKKAAAFKEAYIAEFD
RMEAELRQNNTPPADKMIPGDGRTLVVHFDKFGNVEFTETVPDGALVCTL
ETFRFYLEKQGWTLVNRGAIKNMTVEQLLSLK
>SFV_0244 putative bacteriophage protein
MKLTPVIAALRARCPYFENRVAGAAQFKNLPEVGKLKLPAAYVVPGDDSP
GENKSQTDYWQELKEGFSVVVILSNGRDERGQFASYDVVDDVRQMLFKAL
LGWNPEACGNPITYDGGTLLDLNRHELIYQFDFSVISELTEDDTRQQDDL
NSLDELQTLAIDVDYLEPGNGPDGDIEHHTEITLPS
>SFV_1430 hypothetical protein
MEDAIEQIVSYLKHAAQGLEEKKQILYLLGPVGGGKSSLAERLKSLMQLV
PIYVLSANGERSPVNDHPFCLFNPQEDAQILEKEYGIPRRYLGTIMSPWA
AKRLHEFGGDITKFRVVRCTGNSGHYHLFFF
>SFV_2934 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYRVVKHITSVNPA
>SFV_1466 hypothetical protein
MATHASDWQEIKNEAKGQTVWFNAWGGDTAINRYLDWVSGEMKTHYAINL
KIVRLADAADAVKRIQTEAAAGRKTGGSVDLLWVNGENFRTLKE
>SFV_0860 putative enzyme
MTMPTNQCPWRMQVHHITQETPDVWTISLICHDHYPYRAGQYALVSVRNS
AETLRAYTISSTPGVSEYITLTVRRIDDGVGSQWLTRDVKRGDYLWLSDA
MGEFTCDDKAEDKFLLLAAGCGVTPIMSMRRWLAKNRPQADVRVIYNVRT
PQDVIFADEWRNYPVTLVAENNVTEGFIAGRLTRELLADVPDLASRTVMT
CGPAPYMDWVEQEVKALGVTRFFKEKFFTPVAEAATSGLKFTKLQPAREF
YAPVGTTLLEALESNNVPVVAACRAGVCGCCKTKVVSGEYTVSSTMTLTD
AEIAEGYVLACSCHPQGDLVLA
>SFV_1502 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYPVPVQYNPVCFANG
>SFV_4142 acs, acetyl-CoA synthetase
MSQIHKHTIPANIADRCLINPQQYEAMYQQSINVPDTFWGEQGKILDWIK
PYQKVKNTSFAPGNVSIKWYEDGTLNLAANCLDRHLQENGDRTAIIWEGD
DASQSKHISYKELHRDVCRFANTLLELGIKKGDVVAIYMPMVPEAAVAML
ACARIGAMHSVIFGGFSPEAVAGRIIDSNSRLVITSDEGVRAGRSIPLKK
NVDDALKNPNVTSVEHVVVLKRTGGNIDWQEGRDLWWHDLVEQASDQHQA
EEMNAEDPLFILYTSGSTGKPKGVLHTTGGYLVYAALTFKYVFDYHPGDI
YWCTADVGWVTGHSYLLYGPLACGATTLMFEGVPNWPTPARMAQVVDKHQ
VNILYTAPTAIRALMAEGDKAIEGTDRSSLRILGSVGEPINPEAWEWYWK
KIGNEKCPVVDTWWQTETGGFMITPLPGATELKAGSATRPFFGVQPALVD
NEGNPLEGATEGSLVITDSWPGQVRTLFGDHERFEQTYFSTFKNMYFSGD
GARRDEDGYYWITGRVDDVLNVSGHRLGTAEIESALVAHPKIAEAAVVGI
PHNIKGQAIYAYVTLNHGEEPSPELYAEVRNWVRKEIGPLATPDVLHWTD
SLPKTRSGKIMRRILRKIAAGDTSNLGDTSTLADPGVVEKLLEEKQAIAM
PS
>SFV_1450 ansA, cytoplasmic L-asparaginase I
MQRSEHGYIPVSGHLQRQLALMPEFHRPEMPDFTIHEYTPLMDSSDMTPE
DWQRIAEDIKAHYDDYDGFVILHGTDTMAYTASALSFMLENLGKPVIVTG
SQIPLAELRSDGQINLLNALYVAANYPINEVTLFFNNRLYRGNRTTKAHA
DGFDAFASPNLPPLLEAGIHIRRLNTPPAPHGEGELIVHPITPQPIGVVT
IYPGISADVVRNFLRQPVKALILRSYGVGNAPQNKAFLQELQEASDRGIV
VVNLTQCMSGKVNMSGYATGNALAHAGVIGGADMTVEATLTKLHYLLSQE
LDTETIRKAMSQNLRGELTPDD
>SFV_0361 araJ, involved in either transport or processing of arabinose polymers
MNYLLALLVVILQAITLLATVIGSRSGGCDGGMKKVILSLALGTFGLGMA
EFGIMSVLTELAHNVGISIPAAGHMISYYALVVVVGAPIIALFSSRYSLK
HILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSK
IIKPGKVTAAVAGMVSGMTVANLLGIPLGTYLSQECWRYTFLLIAVFNIA
VMASVYFWVPDIRDEAKGNLREQFHFLRSPAPWLIFAATMFGNAGVFAWF
SYVKPYMMFISGFSETAMTFIMMLVGLGMVLGNMLSGRISGRYSPLRIAA
VTDFIIVLALLMLFFCGGMKTTSLIFAFICCAGLFALSAPLQILLLQNAK
GGELLGAAGGQIAFNLGSAVGAYCGGMMLTLGLAYNYVALPAALLSFAAM
SSLLLYGRYKRQQAADSPVLAKPLG
>SFV_3394 aroB, 3-dehydroquinate synthase
MERIVVTLGERSYPITIASGLFNEPASFLPLKSGEQVMLVTSETLAPLYL
DKVRGVLEQAGVNVDSVILPDGEQYKSLAVLDTVFTALLQKPHGRDTTLV
ALGGGVVGDLTGFAAASYQRGVRFIQVPTTLLSQVDSSVGGKTAVNHPLG
KNMIGAFYQPASVVVDLDCLKTLPPRELASGLAEVIKYGIILDGAFFNWL
EENLDALLRLDGPAMAYCIRRCCELKAEVVVADERETGLRALLNLGHTFG
HAIEAEMGYGNWLHGEAVAAGMVMAARTSERLGQFSSAETQRIITLLTRA
GLPVNGPREMSAQAYLPHMLRDKKVLAGEIRLILPLAIGKSEVRSGVSHE
LVLNAIADCQSA
>SFV_1868 aspS, aspartate tRNA synthetase
MRTEYCGQLRLSHVGQQVTLCGWVNRRRDLGSLIFIDMRDREGIVQVFFD
PDRADALKLASELRNEFCIQVTGTVRARDEKNINRDMATGEIEVLASSLT
IINRADVLPLDSNHVNTEEARLKYRYLDLRRPEMAQRLKTRAKITSLVRR
FMDDHGFLDIETPMLTKATPEGARDYLVPSRVHKGKFYALPQSPQLFKQL
LMMSGFDRYYQIVKCFRDEDLRADRQPEFTQIDVETSFMTAPQVREVMEA
LVRHLWLEVKGVDLGDFPVMTFAEAERRYGSDKPDLRNPMELTDVADLLK
SVEFAVFAGPANDPKGRVAALRVPGGASLTRKQIDEYGNFVKIYGAKGLA
YIKVNERAKGLEGINSPVAKFLSAEIIEAILDRTAAQDGDMIFFGADNKK
IVADGMGALLLKVGKDLGLTDESKWAPLWVIDFPMFEDDGEGGLTAMHHP
FTSPKDMTAAELKAAPENAVANAYDMVINGYEVGGGSVRIHNGDMQQTVF
GILGINEEEQREKFGFLLDALKYGTPPHAGLAFGLDRLTMLLTGTDNIRD
VIAFPKTTAAACLMTEAPSFANPTALAELSIQVVKKAENN
>SFV_2260 bcr, bicyclomycin resistance protein
MTTRQHSSFAIVFILGLLAMLMPLSIDMYLPALPVISAQFGVSAGSTQMT
LSTYILGFALGQLIYGPMADSFGRKPVVLGGTLVFAAAAVACALAQTIDQ
LIVMRFFHGLAAAAASVVINALMRDIYPKEEFSRMMSFVMLVTTIALLMA
PIVGGWVLVWLSWHYIFWILALAAILASAMIYFLIKETLPPERRQPFHIR
TTIGNFAALFRHKRVLSYMLASGFSFAGMFSFLSAGPFVYIEINHVAPEN
FGYYFALNIVFLFVMTIFNSRFVRRIGALNMFRSGLWIQFIMAAWMVISA
PLGLGFWSLVVGVAAFVGCVSMVSSNAMAVILDEFPHMAGTASSLAGTFR
FGIGAIVGALLSLATFNSAWPMIWSIAFCATSSILFCLYASRPKKR
>SFV_3786 bglB, phospho-beta-glucosidase B; cryptic
MKAFPETFLWGGATAANQVEGAWLEDGKGITTSDLQPHGVMGKMEPRILG
KENIKDVAIDFYYRYPEDIALFAEMGFTCLRISIAWARIFPQGDEAEPNE
AGLAFYDRLFDEMAQAGIKPLVTLSHYEMLYGLVKNYGGWANRAVIDHFE
HYARTVFTRYQHKVALWLTFNEINMSLHAPFTGVGLAEESGEAEVYQAIH
HQLVASARAVKAYHSLIPEAKIGNMLLGGLVYPLTCQPQDMLQAMEENRR
WMFFGDVQARGQYPGYMQRFFRDHNITIEMTESDAEDLKHTVDFISFSYY
MTGCVSHDESINKNAQGNILNMIPNPHLKSSEWGWQIDPVGLRVLLNTLW
DRYQKPLFIVENGLGVKDSVEADGSIQDDYRIAYLNDHLVQVNEAIADGV
DIMGYTSWGPIDLVSASHSQMSKRYGFIYVDRDDNGEGSLTRTRKKSFGW
YAEVIKTRGLSLKK
>SFV_4433 creC, Sensor protein creC
MRIGMRLLLGYFLLVAVAAWFVLAIFVKEVKPGVRRATEGTLIDTATLLA
ELARPDLLSGDPTHGQLAQAFNQLQHRPFRANIGGINKVRNEYHVYMTDA
HGKVLFDSANKAVGQDYSRWNDVWLTLRGQYGARSTLQNPADPESSVMYV
AAPIMGGSRLIGVLSVGKPNAAMAPVIKRSERRILWASAILLGIALVIGA
GMVWWINRSIARLTRYADSVTDNKPVPLPDLGSSELRKLAQALESMRVKL
EGKNYIEQYVYALTHELKSPLAAIRGAAEILREGPPPEVVAHFTDNILTQ
NARMQALVETLLRQARLENRQEVVLTAVDVAALFRRVSEARTVQLAEKNI
TLHVMPTEVNVAAEPALLDQALGNLLDNAIDFTPESGCITLSAEVDQEHV
TLKVLDTGSGIPDYALSRIFERFYSLPRANGQKSSGLGLAFVSEVARLFN
GEVTLRNVQEGGVLASLRLHRHFT
>SFV_4273 cysQ, CysQ protein
MQVYDGTKPMDVVSKADNSPVTAADIAAHTVIMDGLRTLAPDIPVLSEED
PPGWEVRQHWQRYWLVDPLDGTKEFIKRNGEFTVNIALIDHGKPILGVVY
APVMNVMYSAAEGKAWKEECGVRKLIQVRDARPPLVVISRSHADAELKEY
LQQLSEHQTTSIGSSLKFCLVAEGQAQLYPRFGPTNIWDTAAGHAVAAAA
GAHVHDWQGKPLDYTPRESFLNPGFRVSIY
>SFV_3261 degQ, serine endoprotease
MKKQTQLLSALALSAGLTLSASFQAVASIPGQVADQAPLPSLAPMLEKVL
PAVVSVRVEGTASQGQKIPEEFKKFFGDDLPDQPAQPFEGLGSGVIINAS
KGYVLTNNHVINQAQKISIQLNDGREFDAKLIGSDDQSDIALLQIQNPSK
LTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIVSALGRSGLNLEGLE
NFIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSNM
ARTLAQQLIDFGEIKRGLLGIKGTEMSADIAKAFNLDMQRGAFVSEVLPS
SGSAKAGVKAGDIITSLNGKPLNSFAELRSRIATTEPGTKVKLGLLRNGK
PLEVEVTLDTSTSSSASAEMITPALEGATLSDGQLKDGGKGIKIDEVVKG
SPAAQAGLQKDDVIIGVNRDRVNSIAEMRKVLAAKPAIIALQIVRGNESI
YLLMR
>SFV_0195 dniR, transcriptional regulator for nitrite reductase (cytochrome c552)
MKAKAILLASVLLVGCQSTGNVQQHAQSLSAAGQGEAAKFTSQARWMDDG
TSIAPDGDLWAFIGDELKMGIPENDRIREQKQKYLRNKSYLHDVTLRAEP
YMYWIAGQVKKRNMPMELVLLPIVESAFDPHATSGANAAGIWQIIPSTGR
NYGLKQTRNYDARRDVVASTTAALNMMQRLNKMFDDDWLLTVAAYNSGEG
RVMKAIKTNKARGKSTDFWSLPLPQETKQYVPKMLALSDILKNSKRYGVR
LPTTDESRALARVHLSSPVEMAKVADMAGISVSKLKTFNAGVKGSTLGAS
GPQYVMVPKKHADQLRESLASGEIAAVQSTLVADNTPLNSRVYTVRSGDT
LSSIASRLGVSTKDLQQWNKLRGSKLKPGQSLTIGAGSSAQRLANNSDSI
TYRVRKGDSLSSIAKRHGVNIKDVMRWNSDTANLQPGDKLTLFVKNNNMP
DS
>SFV_2392 fabB, 3-oxoacyl-[acyl-carrier-protein] synthase I
MRSHVWGNIKLDTTGLIDRKVVRFMSDASIYAFLSMEQAIADAGLSPEAY
QNNPRVGLIAGSGGGSPRFQVFGADAMRGPRGLKAVGPYVVTKAMASGVS
ACLATPFKIHGVNYSISSACATSAHCIGNAVEQIQLGKQDIVFAGGGEEL
CWEMACEFDAMGALSTKYNDTPEKASRTYDAHRDGFVIAGGGGMVVVEEL
EHALARGAYIYAEIVGYGATSDGADMVAPSGEGAVRCMQMAMHGVDTPID
YLNSHGTSTPVGDVKELAAIREVFGNKSPAISATKAMTGHSLGAAGVQEA
IYSLLMLEHGFIAPSINIEELDEQAAGLNIVTETTDRELTTVMSNSFGFG
GTNATLVMRKLKRLIRSRPE
>SFV_3414 feoB, ferrous iron transport protein B
MKKLTIGLIGNPNSGKTTLFNQLTGARQRVGNWAGVTVERKEGQFSTTDH
QVTLVDLPGTYSLTTISSQTSLDEQIACHYILSGDADLLINVVDASNLER
NLYLTLQLLELGIPCIVALNMLDIAEKQNIRIEIDALSARLGCPVIPLVS
TRGRGIEALKLAIDRYKANENVELVHYAQPLLNEADSLAKVMPSDIPLKQ
RRWLGLQMLEGDIYSRAYAGEASQHLDATLARLRNEMDDPALHIADARYQ
CIAAICDVVSNTLTAEPSRFTTAVDKIVLNRFLGLPIFLFVMYLMFLLAI
NIGGALQPLFDVGSVALFVHGIQWIGYTLHFPDWLTIFLAQGLGGGINTV
LPLVPQIGMMYLFLSFLEDSGYMARAAFVMDRLMQALGLPGKSFVPLIVG
FGCNVPSVMGARTLDAPRERLMTIMMAPFMSCGARLAIFAVFAAAFFGQN
GALAVFSLYMLGIVMAVLTGLMLKYTIMRGEATPFVMELPVYHVPHVKSL
IIQTWQRLKGFVLRAGKVIIIVSIFLSAFNSFSLSGKIVDNINDSALASV
SRVITPVFKPIGVHEDNWQATVGLFTGAMAKEVVVGTLNTLYTAENIQDE
EFNPAEFNLGEELFSAVDETWQSLKDTFSLSVLMNPIEASKGNGEMGTGA
MGVMDQKFGSAAAAYSYLIFVLLYVPCISVMGAIARESSRGWMGFSILWG
LNIAYSLATLFYQVASYSQHPTYSLVCILAVILFNIVVIGLLRRARSRVD
IELLATRKSVSSCCAASTTGDCH
>SFV_1436 gapA, glyceraldehyde-3-phosphate dehydrogenase A
MTIKVGINGFGRIGRIVFRAAQKRSDIEIVAINDLLDADYMAYMLKYDST
HGRFDGTVEVKDGHLIVNGKKIRVTAERDPANLKWDEVGVDVVAEATGLF
LTDETARKHITAGAKKVVMTGPSKDNTPMFVKGANFNKYAGQDIVSNASC
TTNCLAPLAKVINDNFGIIEGLMTTVHATTATQKTVDGPSHKDWRGGRGA
SQNIIPSSTGAAKAVGKVLPELNGKLTGMAFRVPTPNVSVVDLTVRLEKA
ATYEQIKAAVKAAAEGEMKGVLGYTEDDVVSTDFNGEVCTSVFDAKAGIA
LNDNFVKLVSWYDNETGYSNKVLDLIAHISK
>SFV_2953 gcvT, aminomethyltransferase
MAQQTPLYEQHTLCGARMVDFHSWMMPLHYGSQIDEHHAVRTDAGMFDVS
HMTIVDLRGSRTREFLRYLLANDVAKLTKSGKALYSGMLNASGGVIDDLI
VYYFTEDFFRLVVNSATREKDLSWITQHAEPFGIEITVRDDLSMIAVQGP
NAQAKAATLFNDAQRQVVEGMKPFFGVQVGDLFIATTGYTGEAGYEIALP
NEKAADFWRALVEAGVKPCGLGARDTLRLEAGMNLYGQEMDETISPLAAN
MGWTIAWEPTDRDFIGREALEVQREHGTEKLVGLVMTEKGVLRNELPVRF
TDAQGNQHEGIITSGTFSPTLGYSIALARVPEGIGETAIVQIRNREMPVK
VTKPVFVRNGKAVA
>SFV_3035 glcC, transcriptional activator for glc operon
MKDERRPICEVVAESIERLIIDGVLKVGQPLPSERRLCEKLGFSRSALRE
GLTVLRGRGIIETAQDRDSHVARLNREQDTSPLIHLFSTQPRTLYDLLDV
RALLEGESARLAATLGTQADFVVITRCYEKMLAASENHKEISLIEHAQLD
HAFHLAICQASHNQVLVFTLQSLTDLMFNSVFASVNNLYHRPQQKKQIDR
QHARIYNAVLQRLPHVAQRAARSARSCTDREKESPRYRAGRPPFDSLGGA
AGDEQGGNVSRVLPYPQGEGTAPALNRIWC
>SFV_0615 gltA, citrate synthase
MADTKAKLNLNGDTAVELDVLKGTLGQDVIDIRTLGSKGVFTFDPGFTST
ASCESKITFIDGDEGILLHRGFPIDQLATDSNYLEVCYILLNGEKPTQEQ
YDEFKTTVTRHTMIHEQITRLFHAFRRDSHPMAVMCGITGALAAFYHDSL
DVNNSRHREIAAFRLLSKMPTMAAMCYKYSIGQPFVYPRNDLSYAGNFLN
MMFSTPCEPYEVNPILERAMDRILILHADHEQNASTSTVRTAGSSGANPF
ACIAAGIASLWGPAHGGANEAALKMLEEISSVKHIPEFVRRAKDKNDSFR
LMGFGHRVYKNYDPRATVMRETCHEVLKELGTKDDLLEVAMELENIALND
PYFIEKKLYPNVDFYSGIILKAMGIPSSMFTVIFAMARTVGWIAHWSEMH
SYGMKIARPRQLYTGYEKRDFKSDIKR
>SFV_1316 goaG, 4-aminobutyrate aminotransferase
MSNNEFHQRRLSATPRGVGVMCNFFAQSAENATLKDVEGNEYIDFAAGIA
VLNTGHRHPDLVAAVEQQLQQFTHTAYQIVPYESYVTLAEKINTLAPVSG
QAKTAFFTTGAEAVENAVKIARAHTGRPGVIAFSGGFHGRTYMTMALTGK
VAPYKIGFGPFPGSVYHVPYPSDLHGISTQDSLDAIERLFKSDIEAKQVA
AIIFEPVQGEGGFNVAPKELVAAIRRLCDEHGIVMIADEVQSGFARTGKL
FAMDHYADKPDLMTMAKSLAGGMPLSGVVGNANIMDAPAPGGLGGTYAGN
PLAVAAAHAVLNIIDKESLCERANQLGQRLKNTLIDAKESVPAIAAVRGL
GSMIAAEFNDPQTGEPSAAIAQKIQQRALAQGLLLLTCGAYGNVIRFLYP
LTIPDAQFDAAMKILQDALSD
>SFV_2080 hisD, histidinol dehydrogenase
MSFNTIIDWNSCTAEQQRQLLMRPAISASESITRTVNDILDNVKTRGDEA
LREYSAKFDKTTVTALKVSADEIAAASERLSDELKQAMAVAVKSIETFHT
AQKLPPVDVETQLGVRCQQVTRPVASVGLYIPGGSAPLFSTVLMLATPAR
IAGCKKVVLCSPPPIADEILYAAQLCDVQDVFNVGGAQAIAALAFGTESV
PKVDKIFGPGNAFVTEAKRQVSQRLDGAAIDMPAGPSEVLVIADSGATPD
FVASDLLSQAEHGPDSQVILLTPDADMARRVAEAVERQLAELPRAETTRQ
ALNASRLIVTKDLAQCVEISNQYGPEHLIIQTRNARELVDGITSAGSVSL
GDWSPESAGDYASGTNHVLPTYGYTATCSSLGLADFQKRMTVQELSKEGF
SALASTIETLAAAERLTAHKNAVTLRVNVLKEQA
>SFV_4076 hydG, Transcriptional Regulatory protein
MTHDNIDILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVF
DLVLCDVRMAEMDGIATLKEIKALNPAIPVLIMTAYSSVETAVEALKTGA
LDYLIKPLDFDNLQATLEKALAHTHSIDAETPAVTASQFGMVGKSPAMQH
LLSEIALVAPSEATVLIHGDSGTGKELVARAIHASSVRSEKPLVTLNCAA
LNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLDEIGDISPMMQ
VRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAAEVNAGRFRQDLYY
RLNVVAIEVPSLRQRREDIPLLAGHFLQRFAERNRKAVKGFTPQAMDLLI
HYDWPGNIRELENAVERAVVLLTGEYISERELPLAIASTPIPLGQSQDIQ
PLVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR
>SFV_1387 ipaH_2, invasion plasmid antigen
MSIMLPINNNFSLSQNSFYNTIPGTYADYFSAWDKWEKQALPGENRNEAV
SLLKECLINQFSELQLNRLNLSSLPDNLPPQITVLEITQNALISLPELPA
SLKHLDVDNNQLTMLPELPALLEYINADNNQLTMLPELPTSLEVLSVRNN
QLTFLPELPESLEALDVSTNLLESLPAVPVRNHHSEETEIFFRCRENRIT
HIPENILSLDPTCTIILEDNPLSSRIRESLSQQTAQPDYHGPRIYFSMSD
GQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLS
DTVSARNTSGFREQVAAWLEKLSTSAELRQQSFAVAADATESCEDRVALT
WNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLH
FVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRS
REENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADR
LKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLPENGSQLHH
S
>SFV_1872 ipaH_3, invasion plasmid antigen
MLPTNNNHRLISNSFSTYSIDTSRAYENYLTHWTEWKNNRIQEEQRDIAF
QRLVSCLQNQETNLDLSELGLTTLPEIPPGIKSINISKNNLSLISPLPAS
LTQLNVSYNRLIELPALPQGLKLLNASHNQLITLPTLPISLKELHVSNNQ
LCSLPVLPELLETLDVSCNGLAVLPPLPFSLQEISAIGNLLSELPPLPHN
IHSIWAIDNMLTDIPYLPENLRNGYFDINQISHIPESILNLRNECSIDIS
DNPLSSHALQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAVTAWF
PENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAW
LEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLF
DNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEK
LQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHA
VLKRTEADRWAQAEEQKYEMLENEYPQRVADRLKASGLSGDADAEREAGA
QVMRETEQQIYRQLTDEVLALRLPENGSQLHHS
>SFV_0639 kdpE, regulator of kdp operon (transcriptional effector)
MRVFEAETLQRGLLEAATRKPDLIILDLGLPDGDGIEFIRDLRQWSPVPV
IVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHSATAAP
DPLVKFSDVTVDLAARVIHRGEEEVHLTPIEFRLLAVLLNNAGKVLTQRQ
LLNQVWGPNAVEHSHYLRIYMGHLRQKLEQDPARPRHFITETGIGYRFML
>SFV_4436 lasT, hypothetical protein
MRITIILVAPARAENIGAAARAMKTMGFSELRIVDSQAHLEPATRWVAHG
SGDIIDNIKVFPTLVESLHDVDFTVATTARSRAKYHYYATPVELVLLLEE
KSSWMSHAALVFGREDSGLTNEELALADVLTGVPMVADYPSLNLGQAVMV
YCYQLATLIQQPAKSDTTADQHQLQALRERVMALLTTLAVADDIKLVDWL
QQRLGLLEQRDTAMLHRLLHDIEKNITK
>SFV_1637 malI, repressor of malX and Y genes
MATAKKITIHDVALAAGVSVSTVSLVLNGKGRISTATGERVNAAIEELGF
VRNRQASALRGGQSGVIGLIVRDLSAPFYAELTAGLTEALEAQGRMVFLL
HGGKDGEQLAQRFSLLLNQGVDGVVIAGAAGSSDDLRRMAEEKAIPVIFA
SRDSYLDDVDTVRPDNMQAAQLLTEHLIRNGHQRIAWLGGQSSSLTRAER
VGGYCATLLKFGLPFHSDWVLECTSSQKQAAEAITALLRHNPTISAVVCY
NETIAMGAWFGLLKAGRQSGESGVDRYFEQQVSLAAFTDATPTTLDDIPV
TWASTPARELGITLADRMMQKITHEETHSRNLIIPARLIAAK
>SFV_1638 malX, PTS system, maltose and glucose-specific II ABC
MTAKTAPKVTLWEFFQQLGKTFMLPVALLSFCGIMLGIGSSLSSHDVITL
IPVLGNPVLQAIFTWMSKIGSFAFSFLPVMFCIAIPLGLARENKGVAAFA
GFVGYAVMNLAVNFWLTNKGILPTTDAAVLKANNIQSILGIQSIDTGILG
AVIAGIIVWMLHERFHNIRLPDALAFFGGTRFVPIISSLVMGLVGLVIPL
VWPIFAMGISGLGHMINSAGDFGPMLFGTGERLLLPFGLHHILVALIRFT
DAGGTQEVCGQTVSGALTIFQAQLSCPTTHGFSESVTRFLSQGKMPAFLG
GLPGAALAMYHCARPENRHKIKGLLISGLIACVVGGTTEPLEFLFLFVAP
VLYVIHALLTGLGFTVMSVLGVTIGNTDGNIIDFVVFGILHGLSTKWYMV
PVVAAIWFVVYYVIFRFAITRFNLKTPGRDSEVASSIEKAVAGAPGKSGY
NVPAILEALGGADNIVSLDNCITRLRLSVEDMSLVNVQTLKDNRAIGVVQ
LNQHNLQVVIGPQVQSVKDEMAGLMHTVQA
>SFV_3645 mobB, molybdopterin-guanine dinucleotide biosynthesis protein B
MAGKTMIPLLAFAAWSGTGKTTLLKKLIPALCARGIRPGLIKHTHHDMDV
DKPGKDSYELRKAGAAQTIVASQQRWALMTETPDEEELDLHFLASRMDTS
KLDLILVEGFKHEEIAKIVLFRDGAGHRPEELVIDRHVIAVASDVPLNLD
VALLDINDVEGLADFVVEWMQKQNG
>SFV_0084 murC, L-alanine adding enzyme, UDP-N-acetyl-muramate:alanine ligase
MNTQQLAKLRSIVPEMRRVRHIHFVGIGGAGMGGIAEVLANEGYQISGSD
LAPNPVTQQLLNLGATIYFNHRPENVRDASVVVVSSAISADNPEIVAAHE
ARIPVIRRAEMLAELMRFRHGIAIAGTHGKTTTTAMVSSIYAEAGLDPTF
VNGGLVKAAGVHARLGHGRYLIAEADESDASFLHLQPMVAIVTNIEADHM
DTYQGDFENLKQTFINFLHNLPFYGRAVMCVDDPVIRELLPRVGRQTTTY
GFSEDADVRVEDYQQIGPQGHFTLLRQDKEPMRVTLNAPGRHNALNAAAA
VAVATEEGIDDEAILRALESFQGTGRRFDFLGEFPLEPVNGKSGTAMLVD
DYGHHPTEVDATIKAARAGWPDKNLVMLFQPHRFTRTRDLYDDFANVLTQ
VDTLLMLEVYPAGEAPIPGADSRSLCRTIRGRGKIDPILVPDPAQVAEML
APVLTGNDLILVQGAGNIGKIARSLAEIKLKPQTPEEEQHD
>SFV_3249 nanT, sialic acid transporter
MSTTTQNIPWYRHLNRAQWRAFSAAWLGYLLDGFDFVLIALVLTEVQGEF
GLTTVQAASLISAAFISRWFGGLMLGAMGDRYGRRLAMVTSIVLFSAGTL
ACGFAPGYITMFIARLVIGMGMAGEYGSSATYVIESWPKHLRNKASGFLI
SGFSVGAVVAAQVYSLVVPVWGWRALFFIGILPIIFALWLRKNIPEAEDW
KEKHAGKAPVRTMVDILYRGEHRIANIVMTLAAATALWFCFAGNLQNAAI
VAVLGLLCAAIFISFMVQSTGKRWPTGVMLMVVVLFAFLYSWPIQALLPT
YLKTDLAYNPHTVANVLFFSGFGAAVGCCVGGFLGDWLGTRKAYVCSLLA
SQLLIIPVFAIGGANVWVLGLLLFFQQMLGQGIAGILPKLIGGYFDTDQR
AAGLGFTYNVGALGGALAPIIGALIAQRLDLGTALASLSFSLTFVVILLI
GLDMPSRVQRWLRPEALRTHDAIDGKPFSGAVPFGSAKNDLVKTKS
>SFV_2830 nrdH, glutaredoxin-like protein; hydrogen donor
MRIMRITIYTRNDCVQCHATKRAMENRGFDFEMINVDRVPEAAEALRAQG
FRQLPVVIAGDLSWSGFRPDMINRLHPAPHAASA
>SFV_3075 parE, DNA topoisomerase IV subunit B
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG
HAKRVDVILHADQSLEVIDDGRGMPVDIHPEERVPAVELILCRLHAGGKF
SNKNYQFSGGLHGVGISVVNALSKRVEVNVRRDGQVYNIAFENGEKVQDL
QVVGTCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI
TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD
WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI
LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV
KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK
LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV
SSDEVLASQEVHDISVAIGIDPDSDDLSQLRYGKICILADADSDGLHIAT
LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ
LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD
AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV
>SFV_1508 pheM, phenylalanyl-tRNA synthetase (pheST) operon leader peptide
MLLFSASFFTLAPESRRLAREKRNGKQRLKASQWRLFLYARFEK
>SFV_1399 prc, carboxy-terminal protease for penicillin-binding protein 3
MNMFFRLTALAGLLAIAGQTFAVEDITRADQIPVLKEETQHATVSERVTS
RFTRSHYRQFDLDQAFSAKIFDRYLNLLDYSHNVLLASDVEQFAKKKTEL
GDELRSGKLDVFYDLYNLAQKRRFERYQYALSVLEKPMDFTGNDTYNLDR
SKAPWPKNEAELNAQWDSKVKFDELSLKLTGKTDKEIRETLTRRYKFAIR
RLAQTNSEDVFSLAMTAFAREIDPHTNYLSPRNTEQFNTEMSLSLEGIGA
VLQMDDDYTVINSMVAGGPAAKSKAISVGDKIVGVGQTGKPMVDVIGWRL
DDVVALIKGPKGSKVRLEILPAGKGTKTRTVTLTRERIRLEDRAVKMSVK
TVGKEKVGVLDIPGFYVGLTDDVKVQLQKLEKQNVSSVIIDLRSNGGGAL
TEAVSLSGLFIPAGPIVQVRDNNGKVREDSDTDGQVFYKGPLVVLVDRFS
ASASEIFAAAMQDYGRALVVGEPTFGKGTVQQYRSLNRIYDQMLRPEWPA
LGSVQYTIQKFYRVNGGSTQRKGVTPDIIMPTGNEETETGEKFEDNALPW
DSIDAATYVKSGDLTAFEPELLKEHNARIAKDPEFQNIMKDIARFNAMKD
KRNIVSLNYAVREKENNEDDATRLARLNERFKREGKPELKKLDDLSKDYQ
EPDPYLDETVNIALDLAKLEKARPAEQPAPVK
>SFV_1084 pyrC, dihydro-orotase
MTAPSQVLKIRRPDDWHLHLRDGDMLKTVVPYTSEIYGRAIVMPNLAPPV
TTVEAAVAYRQRILDAVPAGHDFTPLMTCYLTDSLDPNELERGFNEAVFT
AAKLYPANATTNSSHGVTSIDAIMPVLERMEKIGMPLLVHGEVTHADIDI
FDREARFIESVMEPLRQRLTALKVVFEHITTKDAADYVRDGNERLAATIT
PQHLMFNRNHMLVGGVRPHLYCLPILKRNIHQQALRELVTSGFNRVFLGT
DSAPHARHRKESSCGCAGCFNAPTALGSYATVFEEMNALQHFEAFCSVNG
PQFYGLPVNDTFIELVREEQQVAESIALTDDTLVPFLAGETVRWSVKQ
>SFV_3072 qseC, QseC
MKFTQRLSLRVRLTLIFLILASVTWLLSSFVAWKQTTDNVDELFDTQLML
FAKRLSTLDLNEINAADRMAQTPNKLKHGHVDDDALTFAIFTHDGRMVLN
DGDNGEDIPYSYQREGFADGQLVGEDDPWRFVWMTSPDGKYRIVVGQEWE
YREDMALAIVAGQLIPWLVALPIMLIIMMVLLGRELAPLNKLALALRMRD
PDSEKPLNATGVPSEVRPLVESLNQLFARTHAMMVRERRFTSDAAHELRS
PLTALKVQTEVAQLSDDDPQARKKALLQLHSGIDRATRLVDQLLTLSRLD
SLDNLQDVAEIPLEDLLQSSVMDIYHTAQQAKIDVRLTLNAHGIKRTGQP
LLLSLLVRNLLDNAVRYSPQGSVIDVTLNADNFIVRDNGPGVTPEALARI
GERFYRPPGQTATGSGLGLSIVQRIAKLHDMNVEFGNAEQGGFEAKVSW
>SFV_2897 recD, ATP-dependent dsDNA/ssDNA exonuclease V subunit
MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH
VCLPLSRLENNEASHPLLATCVSEIGELQNWKECLLASQAVSRGDEPTPM
ILCGDRLYLNRMWCNERTVARFFNEVNHTIEVDEALLAQTLDKLFPVSDE
INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR
LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDACTLHRLLGAQPGS
QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ
LASVEAGAVLGDICAYANAGFTAERARQLSRLTGTHVPAGTGTEAASLRD
SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS
GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG
VAGLNERIEQFMQQKRKIHRHPHSRWYEGRPVMIARNDSALGLFNGDIGI
ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA
LILPSQRTPVVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL
AALFSSRE
>SFV_3232 rpoN, DNA-directed RNA polymerase subunit N
MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALDSNPLLEQI
DTHEEIDTRETQDSETLDTADALEQKEMPEELPLDASWDTIYTAGTPSGT
SGDYIDDELPVYQGETTQTLQDYLMWQVELTPFSDTDRAIATSIVDAVDE
TGYLTVPLEDILESMGDEEIDIDEVEAVLKRIQRFDPVGVAAKDLRDCLL
IQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTRLKEDVLKEAV
NLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQIN
QHYASMCNNARNDGDSQFIRSNLQDAKWLIKSLESRNDTLLRVSRCIVEQ
QQAFFEQGEEYMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELK
YFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTSLLSEQG
IMVARRTVAKYRESLSIPPSNQRKQLV
>SFV_1625 rstB, sensor histidine protein kinase
MKKLFIQFYLLLFVCFLVMSLLVGLVYKFTAERAGKQSLDDLMNSSLYLM
RSELREIPPDDWGKTLKDMDLNLSFDLRVEPLSKYHLDDISMHRLRGGEI
VALDDQYTFLQRIPRSHYVLAVGPVPYLYYLHQMRLLDIALIAFIAISLA
FPVFIWMRPHWQDMLKLEAAAQRFGDGHLNERIHFDEGSSFERLGIAFNQ
MADNINALIASKKQLIDGIAHELRTPLVRLRYRLEMSDNLSAAESQALNR
DISQLEALIEELLTYARLDRPQNELHLSEPDLPLWLSTHLADIQAVTPDK
TVRIKTLMQGHYAALDMRLMERVLDNLLNNALRYCHSTVETSLLLSGNRA
TLIVEDDGPGIAPENREHIFEPFVRLDPSRDRSTGGCGLGLAIVHSIALA
MGGTVNCDISELGGARFSFSWPLWHNIPQFTSA
>SFV_3942 selA, selenocysteine synthase
MTTETRSLYSQLPAIDRLLRDSSFLSLRDTYGHTRVVELLRQMLDEAREV
IRGSQTLPAWCENWAQEVDARLTKEAQSALRPVINLTGTVLHTNLGRALQ
AEAAVEAVAQAMRSPVTLEYDLDDAGRGHRDRALAQLLRRITGAEDACIV
NNNAAAVLLMLAATASGKEVVVSRGELVEIGGAFRIPDVMRQAGCTLHEV
GTTNRTHANDYRQAVNENTALLMKVHTSNYSIQGFTKAIDEAELVALGKE
LDVPVVTDLGSGSLVDLSQYGLPKEPMPQELIAAGVSLVSFSGDKLLGGP
QAGIIVGKKEMIARLQSHPLKRALRADKMTLAALEATLRLYLHPEALSEK
LPTLRLLTRSAEVIQIQAQRLQAPLVAHYGAEFAVQVMPCLSQIGSGSLP
VDRLPSAALTFTPHDGRGSHLESLAARWRELPVPVIGRIYDGRLWLDLRC
LEDEQRFLEMLLK
>SFV_1377 sitC, Iron transport protein
MLGLPFSLGAFFSGGLAAGSMLFLNQRTRLKEDAIIGLIFSSFFGLGLFM
VSLNPTSVNIQTIVLGNILAIDPADILQLTIIGILSIIVLFFKWKDLMVT
FFDENHARAIGLHPGRLKILFFTLLSVSTVAALQTVGAFLVICLVVTPGA
TAWLLTDRFPRLLIIAVTIGSVTSFLGAWVSYFLDGATGGIIVVAQTLLF
LLAFVFAPTHGLLANRRRAHKALEDRS
>SFV_0922 smtA, S-adenosylmethionine-dependentmethyltransferase
MQDRNFDDIAEKFSRNIYGTTKGQLRQAILWQDLDRVLAEMGPQKLRVLD
AGGGEGQTAIKMAERGHQVILCDLSAQMIDRAKQAAEAKGVSDNMQFIHC
AAQDVASHLETPVDLILFHAVLEWVADPRSVLQTLWSVLRPGGVSSLMFY
NAHGLLMHNMVAGNFDYVQAGMPKKKKRTLSPDYPRDPAQVYLWLEEAGW
QIMGKTGVRVFHDYLREKHQQRDCYEALLELETRYCRQEPYITLGRYIHV
TARKPQSKDKV
>SFV_1284 sohB, putative protease
MKEELAAALMDSHQQKQWHKAQKKKHKQEAKAAKAKAKLGEVATDSKPRV
WVLDFKGSMDAHEVNSLREEITAVLAAFKPQDQVVLRLESPGGMVHGYGL
AASQLQRLRDKNIPLTVTVDKVAASGGYMMACVADKIVSAPFAIVGSIGV
VAQMPNFNRFLKSKDIDIELHTAGQYKRTLTLLGENTEEGREKFREELNE
THQLFKDFVKRMRPSLDIEQVATGEHWYGQQAVEKGLVDEINTSDEVILS
LMEGREVVNVRYMQRKRLIDRFTGSAAESADRLLLRWWQRGQKPLM
>SFV_0599 tolQ, TolQ protein
MNILDLFLKASLLVKLIMLILIGFSIASWAIIIQRTRILNAAAREAEAFE
DKFWSGIELSRLYQESQGKRDNLTGSEQIFYSGFKEFVRLHRANSHAPEA
VVEGASRAMRISMNRELENLETHIPFLGTVGSISPYIGLFGTVWGIMHAF
IALGAVKQATLQMVAPGIAEALIATAIGLFAAIPAVMAYNRLNQRVNKLE
LNYDNFMEEFTAILHRQAFTVSESNKG
>SFV_1454 topB, DNA topoisomerase III
MANWCQRAAVVPWLRSVNLMRLFIAEKPSLARAIADVLPKPHRKGDGFIE
CGNGQVVTWCIGHLLEQAQPDAYDSRYARWNLADLPIVPEKWQLQPRPSV
TKQLNVIKRFLHEASEIVHAGDPDREGQLLVDEVLDYLQLAPEKRQQVQR
CLINDLNPQAVERAIDRLRSNSEFVPLCVSALARARADWLYGINMTRAYT
ILGRNAGYQGVLSVGRVQTPVLGLVVRRDEEIENFVAKDFFEVKAHIVTP
ADERFTAIWQPSEACEPYQDEEGRLLHRPLAEHVVNRISGQPAIVTSYND
KRESESAPLPFSLSALQIEAAKRFGLSAQNVLDICQKLYETHKLITYPRS
DCRYLPEEHFAGRHAVMNAISVHAPDLLPQPVVDPDIRNRCWDDKKVDAH
HAIIPTARSSAINLTENEAKVYNLIARQYLMQFCPDAVFRKCVIELDIAK
GKFVAKARFLAEAGWRTLLGSKERDEENDGTPLPVVAKGDELLCEKGEVV
ERQTQPPRHFTDATLLSAMTGIARFVQDKDLKKVLRATDGLGTEATRAGI
IELLFKRGFLTKKGRYIHSTDAGKALFHSLPEMATRPDMTAHWESVLTQI
SEKQCRYQDFMQPLVGTLYQLIDQAKRTPVRQFRGIVAPGSGGSADKKKA
APRKRSAKKSPPADEAGSGAIA
>SFV_1277 trpD, anthranilate phosphoribosyltransferase
MADILLLDNIDSFTYNLADQLRSNGHNVVIYRNHIPAQTLIERLATISNP
VLMLSPGPGVPSEAGCMPELLTRLRGKLPIIGICLGHQAIVEAYGGYVGQ
AGEILHGKASSIEHDGQAMFAGLTNPLPVARYHSLVGSNIPAGLTINAHF
NGMVMAVRHDADRVCGFQFHPESILTTQGARLLEQTLAWAQQKLEPANTL
QPILEKLYQAQTLSQQESHQLFSAVVRGELKPEQLAAALVSMKIRGEHPN
EIAGAATALLENAAPFPRPDYLFADIVGTGGDGSNSINISTASAFVAAAC
GLKVAKHGNRSVSSKSGSSDLLAAFGINLDMNANKSRQALDELGVCFLFA
PKYHTGFRHAMPVRQQLKTRTLFNVLGPLINPAHPPLALIGVYSPELVLP
IAETLRVLGYQRAAVVHSGGMDEVSLHAPTIVAELHDGKIKSYQLTAEDF
GLTPYHQEQLAGGTPEENRDILTRLLQGKGDAAHEAAVAANVAMLMRLHG
HEDLQANAQTVLEVLRSGSAYDRVTALAARG
>SFV_3456 ugpB, Glycerol-3-phosphate-binding periplasmic protein precursor
MKPLHYTASALALGLALMGNAQAVTTIPFWHSMEGELGKEVDSLAQRFNA
ENPDYKIVPTYKGNYEQNLSAGIAAFRTGNAPAILQVYEVGTATMMASKA
IKPVYDVFKEAGIQFDESQFVPTVSGYYSDSKTGHLLSQPFNSSTPVLYY
NKDAFKKAGLDPEQPPKTWQDLADYSAKLKASGIKCGYASGWQGWIQLEN
FSAWNGLPFASKNNGFDGTDAVLEFNKPEQVKHIAMLEEMNKKGDFSYVG
RKDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMPYDADAKDAP
QNAIIGGASLWVMQGKDKETYTGVAKFLDFLAKPENAAEWHQKTGYLPIT
KAAYDLTREQGFYEKNPGADTATRQMLNKPPLPFTKGLRLGNMPQIRVIV
DEELESVWTGKKTPQQALDTAVERGNQLLRRFEKSTKS
>SFV_1631 uidC, membrane-associated protein
MRKIVAMAVICLTAASGLTSAYAAQLADDEAGLRIRLKNELRRADKPSAG
AGRDIYAWVQGGLLDFNSGYYSNIIGVEGGAYYVYKLGARADMSTRWYLD
GDKSFGFALGAVKIKPSENSLLKLGRFGTDYSYGSLPYRIPLMVGSSQRT
LPTVSEGALGYWALTPNIDLWGMWRSRVFLWTDSTTGIRDEGVYNSQTGK
YDKHRARSFLAASWHDDTSRYSLGASVQKDVSNQIQSILEKSIPLDPNYT
LKGELLGFYAQLEGLSRNTSQPNETALVSGQLTWNAPWGSVFGSGGYLRH
AMNGAVVDTDIGYPFSLSLDRNREGMQSWQLGANYRVTPQFTLTFAPIVT
RGYESSKRDVRIEGAGILGGMNLSGQRRAVTRDEFLSCCR
>SFV_1471 xthA, exonuclease III
MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKL
GYNMFYHGQKGHYGVALLTKETPIAVRRGFPGDDEEAQRRIIMAEIPSLL
GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRDNPVLIMG
DMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGVVDTFR
HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGIDYEI
RSMEKPSDHAPVWATFRR
>SFV_3974 xylF, xylose binding protein transport system
MLKIGYVYRGDCYLLKLSSNYRRPYTMKIKNILLTLCTSLLLTNVAAHAK
EVKIGMAIDDLRLERWQKDRDIFVKKAESLGAKVFVQSANGNEETQMSQI
ENMINRGVDVLVIIPYNGQVLSNVVKEAKQEGIKVLAYDRMINDADIDFY
ISFDNEKVGELQAKALVDIVPQGNYFLMGGSPVDNNAKLFRAGQMKVLKP
YVDSGKIKVVGDQWVDGWLPENALKIMENALTANNNKIDAVVASNDATAG
GAIQALSAQGLSGKVAISGQDADLAGIKRIAAGTQTMTVYKPITLLANTA
AEIAVELGNGQEPKADTSLNNGLKDVPSRLLTPIDVNKNNIKDTVIKDGF
HKESEL
>SFV_3973 xylG, putative ATP-binding protein of xylose transport system
MPYLLEMKNITKTFGSVKAIDNVSLRLNAGEIVSLCGENGSGKSTLMKVL
CGIYPHGSYEGEIIFAGEEIQASHIRDTERKGIAIIHQELALVKELTVLE
NIFLGNEITHNGIMDYDLMTLRYQKLLAQVSLSISPDTRVGDLGLGQQQL
VEIAKALNKQVRLLILDEPTASLTEQETSVLLDIIRDLQQHGIACIYISH
KLNEVKAISDTICVIRDGQHIGTRDAAGMSEDDIITMMVGRELTALYPNE
PHTTGDEILRIEHLTAWHPVNRHIKRVNDISFSLKRGEILGIAGLVGAGR
TETIQCLFGVWPGQWEGKIYIDGKQVDIRNCQQAIAQGIAMVPEDRKRDG
IVPVMAVGKNITLAALNKFTGGISQLDDAAEQKCILESIQQLKVKTSSPD
LAIGRLSGGNQQKAILARCLLLNPRILILDEPTRGIDIGAKYEIYKLINQ
LVQQGIAVIVISSELPEVLGLSDRVLVMHEGKLKANLINHNLTQEQVMEA
ALRSEHHVEKQSV
>SFV_0307 yafH, putative acyl-CoA dehydrogenase
MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVP
LTIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEG
DLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELAD
LPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAIT
VGVPNSLGPGELLQHYGTDEQKDHYLPRLARGQEIPCFALTSPEAGSDAG
AIPDTGIVCMGEWQGQQVLGMRLTWSKRYITLAPIATVLGLAFKLSDPEK
LLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPID
YIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYSHIR
RQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAI
VKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGAN
ILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNK
VRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMALLGGS
LKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALY
QAEQAMDNLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQV
PNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNL
PFTRLDELAHNALTKGLIDKDEAAILVKAEESRLRSINVDDFDPEELATK
PVKLPEKVRKVEAA
>SFV_0360 yaiD, hypothetical protein
MAMQGRRQFVIMPAKFNDKAVEIIMLWFKNLMVYRLSREISLRAEEMEKQ
LASMAFTPCGSQDMAKMGWVPPMGSHSDALTHVANGQIVICARKEEKILP
SPVIKQALEAKIAKLEAEQARKLKKTEKDSLKDEVLHSLLPRAFSRFSQT
MMWLDTVNGLIMVDCASAKKAEDTLALLRKSLGSLPVVPLSMENPIELTL
TEWVRSGSAAQGFQLLDEAELKSLLEDGGVIRAKKQDLTSEEITNHIEAG
KVVTKLALDWQQRIQFVMCDDGSLKRLKFCDELRDQNEDIDREDFAQRFD
ADFILMTGELAALIQNLIEGLGGEAQR
>SFV_0415 ybaU, putative protease maturation protein
MMDSLRTAANSLVLKIIFGIIIVSFILTGVSGYLIGGGNNYAAKVNDQEI
SRGQFENAFNSERNRMQQQLGDQYSELAANEGYMKTLRQQVLNRLIDEAL
LDQYARELKLGISDEQVKQAIFATPAFQIDGKFDNSRYNGILNQMGMTAD
QYAQALRNQLTTQQLINGVAGTDFMLKGETDELAALVAQQRVVREATIDV
NALAAKQPVTEQEIASYYEQNKNNFMTPEQFRVSYIKLVAATMQQPVSDA
DIQSYYDQHQDQFTQPQRTRYSIIQTKTEDEAKAVLDELNKGGDFAALAK
EKSADIISARNGGDMGWLEDATIPDELKNAGLKEKGQLSGVIKSSVGFLI
VRLDDIQPAKVKSLDEVRDDIAAKVKHEKALDAYYALQQKVSDAASNDTE
SLAGAEQAAGVKATQTGWFSKDNLPEELNFKPVADAIFNGGLVGENGASG
INSDIITVDGDRAFVLRVSEHKPEAVKPLADVQEQVKALVQHNKAEQQAK
VDAEKLLVDLKAGKGAEAMQAAGLKFGEPKILSRSGRDPISQAAFALPLP
AKDKPSYGMATDMQGNVVLLALDEVKQGSMPEDQKKAMVQGITQNNAQIV
FEALMSNLRKEAKIKIGDALEQQ
>SFV_0467 ybbA, putative ATP-binding component of a transport system
MPAENIVEVHHLKKSVGQGEHELSILTGVELVVKRGETIALVGQPLHNMD
EEARAKLRAKHVGFAFQSFMLIPTLNALENVELPALLRGESSAESRNGAK
ALLEQLGLGKRLDHLPAQLSGGEQQRVALARAFNGRPDVLFADEPTGNLD
RQTGDKIADLLFSLNREHGTTLIMVTHDLQLAARCDRCLRLVNGQLQEEA
>SFV_0667 ybeZ, putative ATP-binding protein in pho regulon
MNIDTREITLEPADNARLLSLCGPFDDNIKQLERRLGIEINRRDNHFKLT
GRPICVTAAADILRSLYVDTAPMRGQIQDIEPEQIHLAIKEARVLEQSAE
SVPEYGKAVNIKTKRGVIKPRTPNQAQYIANILDHDITFGVGPAGTGKTY
LAVAAAVDALERQEIRRILLTRPAVEAGEKLGFLPGDLSQKVDPYLRPLY
DALFEMLGFEKVEKLIERNVIEVAPLAYMRGRTLNDAFIILDESQNTTIE
QMKMFLTRIGFNSKAVITGDVTQIDLPRNTKSGLRHAIEVLADVEEISFN
FFHSEDVVRHPVVARIVNAYEAWEEAEQKRKAALAAERKREEQEQK
>SFV_0777 ybhF, putative ATP-binding component of a transport system
MNDAVITLNGLEKRFPGMDKPAVAPLDCTIHAGYVTGLVGPDGAGKTTLM
RMLAGLLKPDSGSATVIGFDPIKNDGALHAVLGYMPQKFGLYEDLTVMEN
LNLYADLRSVTGEARKQTFARLLEFTSLGPFTGRLAGKLSGGMKQKLGLA
CTLVGEPKVLLLDEPGVGVDPISRRELWQMVHELAGEGMLILWSTSYLDE
AEQCRDVLLMNEGELLYQGEPKALTQTMAGRSFLMTSPHEGNRKLLQRAL
KLPQVSDGMIQGKSVRLILKKEATPDDIRHADGMPEININETTPRFEDAF
IDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNF
AVKRGEIFGLLGPNGAGKSTTFKMMCGLLVPTSGQALVLGMDLKESSGKA
RQHLGYMAQKFSLYGNLTVEQNLRFFSGVYGLRGRAQNEKISRMSEAFGL
KSIASHATDELPLGFKQRLALACSLMHEPDILFLDEPTSGVDPLTRREFW
LHINSMVEKGVTVMVTTHFMDEAEYCDRIGLVYRGKLIASGTTDDLKAQS
ANDEQPDPTMEQAFIQLIHDWDKEHSNE
>SFV_0866 ybjD, hypothetical protein
MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES
DLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW
TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGYPIDVEDINDQARHL
VRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNL
SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII
NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH
PIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWR
LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC
GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV
RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRVAQIPENVPMNL
RKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGR
AD
>SFV_0935 ycbE, putative ATP-binding component of a transport system
MNTARLNQGTPLLLNAVSKHYAENIVLNQLDLHIPAGQFVAVVGRSGGGK
STLLHLLAGLETPTAGDVLAGTTPLAEIQDDTRMMFQDARLLPWKSVIDN
VGLGLKGQWRDAARRALAAVGLENRAGEWPAALSGGQKQRVALARALIHR
PGLLLLDEPLGALDALTRLEMQDLIVSLWQQHGFTVLLVTHDVSEAVAMA
DRVLLIEEGKISLDLTVDIPRPRRLGSVRLAELEAEVLQRVMRRGHSEQL
IRRHG
>SFV_1279 yciV, putative enzymes
MIYDLHSHTTASDGCLTPEALVHRAVEMRVGTLAITDHDTTAAIAPAREE
ISRSGLALNLIPGVEISTVWENHEIHIVGLNIDITHPLMCEFLAQQTERR
NQRAQQIAERLEKAQIPGALEGAQRLAQGGAVTRGHFARFLVECGKASSM
ADVFKKYLARGKTGYVPPQWCTIEQAIDVIHHSGGKAVLAHPGRYNLSAK
WLKRLVAHFAEHHGDAMEVAQCQQSPNERTQLATLARQHHLWASQGSDFH
QPCPWIELGRKLWLPAGVEGVWQLWEQPQNTTEREL
>SFV_1342 ycjI, putative carboxypeptidase
MIIRKICTTLPVILPEPETIMTVTRSRAERGAFPPGTEHYGRSLLGAPLI
WFPAPAASRESGLILAGTHGDENSSVVTLSCALRTLTPSLRRHHVVLCVN
PDGCQLGLRANANGVDLNRNFPAANWKEGETVYRWNSAAEERDVVLLTGD
KPGSEPETQALCQLIHRIQPAWVVSFHDPLACIEDPRHSELGEWLAQSFE
LPLVTSVGYETPGSFGIWCPDLNLHCITAEFPPISSDEASEKYLFAMANL
LRWHPKDAIRPS
>SFV_1332 ycjT, hypothetical protein
MTRPVTLSEPHFSQHTLNKYASLMAQGNGYLGLRASHEEDYTRQTRGMYL
AGLYHRAGKGEINELVNLRGVLGVEIAINGEVFSLSREAWQRELDFASGE
LRRSVVWRTSNGAGYTIASRRFVSADQLPLIALEITIMPLDADASVLIST
GIDATQTNHGRQHLDETQVRVFGQHLMQGIYTTQDGRSDVAISCCCQVSG
DVQQCYTAKERRLLQHTSTQLHAGETVTLQKLVWIDWRDDRQAALDEWGS
VSLRQLEMCAQQSYDQHLAASTENWRQWWQKRRITVNGGDAHDQQALDYA
LYHLRIMTPAHGERSSIAAKGLTGEGYKGHVFWDTEVFLLPFHLFSDPTV
ARSLLRYRWHNLPGAQEKARRNGWQGALFPWESARSGEEETPEFAAINIR
TGLRQKVASAQAEHHLVADIAWAVIQYWQTTGDESFIAHEGMALLLETAK
FWISRAVRVNDRLEIHDVIGPDEYTEHVNNNAFTSYMVYYNVQQALSIAR
QFGCSDDAFIHRAEMFLKELRLPEIQPDGVLPQDDSFMTKPAINLAKYKA
AAGKQTILLDYSRAEVNEMQILKQADVVMLNYMLPEQFSSASCLANLQFY
EPRTIHDSSLSKAIHGIVAARCGLLTQSYQFWREGTEIDLGADPHSCDDG
IHAAATGAIWLGAIQGFAGVSVRDGELHLNPALPEQWQQLSFPLFWQGCE
LQVTLDAQRIAIRTSAPVSLRLDGQLISVAEESVFCLGDFILPFNGTATT
HQEDE
>SFV_1336 ycjW, putative LACI-type transcriptional regulator
MSPTIYDIARVAGVSKSTVSRVLNKQTNISPEAREKVLRAIEELQYQPNK
LARALTSSGFDAIMVISTRSTKTTAGNPFFSEVLHAITAKAEEEGFDVIL
QTSHNPAEDLQKCESKIKQKMIKGIIMLSSPADESFFAQLDKYDIPVVVI
GKVEGQYAHVYSVDTDNFGDSIALTDALIESGHQNIACLHAPLDVHVSVD
RVNGYKQSLSAHNIAVRDKWIVDGGYTHETALQAARQLLSQSPLPEAVFA
TDSMKLMSIYRAAAEKNIAIPQQLAVVGYSNETLSFILTPAPGGIDVPTQ
ELGQQSCELLFRLISGKPSPQNITVATHMTLK
>SFV_1630 ydgA, hypothetical protein
MNKSLVAVGVIVALGVVWTGGAWYTGKKIETHLEDMVAQANAQLKLTAPE
SNLEVSYQNYHRGVFSSQLQLLVKPIAGKENPWIKSGQSVIFNESVDHGP
FPLAQLKKLNLIPSMASIQTTLVNNEVSKPLFDMAKGETPFEINSRIGYS
GDSSSDISLNPLNYEQKDEKVAFSGGEFQLNADRDGKAISLSGEAQSGRI
DAVNEYNQKVQLTFNNLKTDGSSTLGSFGERVGNQKLSLEKMTISVEGKE
LALLEGMEISGKSDLVNDGKTINSQLDYSLNSLKVQNQDLGSGKLTLKVG
QIDGEAWHQFSQQYNAQTQALLAQPEIANNPELYQEKVTEAFFSAQPLML
KGDPVITIAPLSWKNSQGESALNLSLFLKDPATTKEAPQTLAQEVDRSVK
SLDAKLTIPVDMATELMTQVAKLEGYQEDQAKKLAKQQVEGASAMGQMFR
LTTLQDNTITTSLQYANGQITLNGQKMSLEDFVGMFAMPALNVPAVPAIP
QQ
>SFV_1523 ydiD, putative ligase/synthetase
MKVTLTFNEQRSSAYRQQGLWGDASLADYWQQTARAMPDKIAVVDNHGAS
YTYSALDHAASCLANWMLAKGIESGDRIAFQLPGWCEFTVIYLACLKIGA
VSVPLLPSWREAELVWVLNKCQAKMFFAPTLFKQTRPVDLILPLQNQLPQ
LQQIVGVDKLAPATSSLSLSQIIADNIPLTTAITTHGDELAAVLFTSGTE
GLPKGVMLTHNNILASERAYCARLNLTWLDVFMMPAPLGHATGFLHGVTA
PFLIGARSVLLDIFTPDACLALLEQQRCTCMLGATPFVYDLLNLLEKQPA
DLSALRFFLCGGTTIPKKVARECQQRSIKLLSVYGSTESSPHAVVNLDNS
LSRFMHTDGYAAAGVEIKVVDDARKTLPPGCEGEEASRGPNVFMGYFDEP
ELTARALDEEGWYYSGDLCRMDEAGYIKITGRKKDIIVRGGENISSREVE
DILLQHPKIHDACVVAMPDERLGERSCAYVVLKAPHHSLSLEEMPPEVSR
RRVAKYKYPEHIVVIDELPRTASDKIQKFSLRKDIIRRLTQDVCEEIE
>SFV_1476 ydjS, hypothetical protein
MDNFLALTLTGKKPVITEREINGVRWRWLGDGVLELTPLTPPQGALVISA
GIHGNETAPVEMLDALLGAISHGEIPLRWRLLVILGNPPALKQGKRYCHS
DMNRMFGGRWQLFAESGETCRARELEQCLEDFYDQSKESVRWHFDLHTAI
RGSLHPQFGVLPQRDIPWDEKFLTWLGAAGLEALVFHQEPGGTFTHFSVR
HFGALACTLELGKALPFGQNDLRQFAVTASAIAALLSGESVGIVRTPPLR
YRVVSQITRHSPSFEMHMASDTLNFMPFEKGTLLAQDGEERFTVTHDVEY
VLFPNPLVALGLRAGLMLEKIS
>SFV_1416 yeaB, hypothetical protein
MEYRSLTLDDFLSRFQLLRPQINREPLNHRQAAVLIPIVRRPQPGLLLTQ
RSIHLRKHAGQVAFPGGAVDDTDASVIAAALREAEEEVAIPPSAVEVIGV
LPPVDSVTGYQVTPVVGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH
PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVKP
>SFV_1864 yebB, hypothetical protein
MNINYPAEYEIGDIVFTCIGAALFGQISAASNCWSNHVGIIIGHNGEDFL
VAESRVPLSTITTLSRFIKRSSNQRYAIKRLDAGLTEQQKQRIVEQVPSR
LRKLYHTGFKYESSRQFCSKFVFDIYKEALCIPVGEIETFGQLLNSNPNA
KLTFWKFWFLGSIPWERKTVTPASLWHHPGLVLIHAEGVETPQPELTEAV
>SFV_1866 yebC, hypothetical protein
MAGHSKWANTRHRKAAQDAKRGKIFTKIIRELVTAAKLGGGDPDANPRLR
AAVDKALSNNMTRDTLNRAIARGVGGDDDANMETIIYEGYGPGGTAIMIE
CLSDNRNRTVAEVRHAFSKCGGYLGTDGSVAYLFSKKGVISLEKGDEDTI
MEAALEAGAEDVVTYDDGAIDVYTAWEEMGKVRDALEAAGLKADSAEVSM
IPSTKADMDAETAPKLMRLIDMLEDCDDVQEVYHNGEISDEVAATL
>SFV_1393 yebU, putative nucleolar proteins
MLVAQHTVYFPDAFLTQMREAMPSTLSFDDFLAACQRPLRRSIRVNTLKI
SVADFLQLTAPYGWTLTPIPWCEEGFWIERDNEDALPLGSTTEHLSGLFY
IQEASSMLPVAALFADGNAPQRVMDVAAAPGSKTTQIAARMNNEGAILAN
EFSASWVKVLHANISRCGISNVALTHFGGRVFGAAVPEMFDAILLDAPCS
GEGVVRKDPDALKNWSPESNQEIAATQRELIDSAFHALRPGGTLVYSTCT
LNREENEAVCRWLKETYPDEVEFLPLGDLFPGANKALTEEGFLHVFPQIY
DCEGFFVARLRKTQAIPALPAPKYKVGNFPFSPVKDREAGQIRQAAASVG
LNWDGNLRLWQRDKELWLFPVGIEALIGKVRFSRLGIKLAETHNKGYRWQ
HEAVIAHASPDNVNAFELTPQEAEEWYRGRDVYPQAAPVADDVLVTFQHQ
PIGLAKRIGSRLKNSYPRELVRDGKLFTSNA
>SFV_2130 yegD, putative heat shock protein
MFIGFDYGTANCSVAVMRDGKPHLLKMENDSTLLPSMLCAPTREAVSEWL
YRHHDVPADDDETQALLRRAIRYNREEDIDVTAKSVQFGLSSLAQYIDDP
EEVWFVKSPKSFLGASGLKPQQVALFEDLVCAMMLHIRQQAQAQLPEAIT
QAVIGRPINFQGLGGDEANAQAQGILERAAKRAGFKDVVFQYEPVAAGLD
YEATLQEEKRVLVVDIGGGTTDCSLLLMGPQWRSRLDREASLLGHSGCRI
GGNDLDIALAFKNLMPLLGMGGETEKGIALPILPWWNAVAINDVPAQSDF
YSSANGRLLNDLVRDAREPEKVALLQKVWRQRLSYRLVRSAEECKIALSS
VAETRASLPFISNELATLISQRGLESALSQPLARILEQVQLALDNAQEKP
DVIYLTGGSARSPLIKKALAEQLPGIPIAGGDDFGSVTAGLARWAEVVFR
>SFV_2142 yegQ, hypothetical protein
MPGVGCWRPALYQIPPTALQCPPLKWGPPLTASSGEADLTCHQNERIMFK
PELLSPAGTLKNMRYAFAYGADAVYAGQPRYSLRVRNNEFNHENLQLGIN
EAHALGKKFYVVVNIAPHNAKLKTFIRDLKPVVEMGPDALIMSDPGLIML
VREHFPEMPIHLSVQANAVNWATVKFWQQMGLTRVILSRELSLEEIEEIH
NQVPDMEIEIFVHGALCMAYSGRCLLSGYINKRDPNQGTCTNACRWEYNV
QEGKEDDVGNIVHKYEPIPVQNVEPTLGIGAPTDKVFMIEEAQRPGEYMT
AFEDEHGTYIMNSKDLRAIAHVERLTKMGVHSLKIEGRTKSFYYCARTAQ
VYRKAIDDAAAGKPFDTSLLETLEGLAHRGYTEGFLRRHTHDDYQNYEYG
YSVSDRQQFVGEFTGERKGDLAAVAVKNKFSVGDSLELMTPQGNINFTLE
HMENAKGEAMPVAPGDGYTVWLPVPQDLELNYALLMRNFSGETTRNPHGK
>SFV_2206 yehY, putative transport system permease protein
MTYLRINPVLALLLLLTAIAAALPFISYAPNRLVSGEGLHLWQLWPQTIW
MLVGVGCAWLTACFVPAKKGSIFALILAQFVFVLLVWGAGKAATQLAQNG
SALARTSLGSGFWLAAALALLACSDAIRRISTHPLWRWLLHMQIAIIPLW
LLYSGTLNDLSLMKEYANRQDVFDDALAQHLTLLFGAVLPALVIGVPLGI
WCYFSTARQGAIFSLLNVIQTVPSVALFGLLIAPLAALVTAFPWLGKLGI
AGTGMTPALIALVLYALLPLVRGVVVGLNQIPRDVLESARAMGMSGAQRF
LHVQLPLALPVFLRSLRVVMVQTVGMAVIAALIGAGGFGALVFQGLLSSA
IDLVLLGVIPVIVLAVLTDALFDLLIALLKVKRND
>SFV_2262 yejH, putative ATP-dependent helicase
MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG
RVLVLAHVKELVAQNHAKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV
ARNLDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT
ATPFRLGKGWIYQFHYHGMVRGDEKALFRDCIYELPLRYMIKHGYLTPPE
RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAE
KRKGVMIFAATVEHAKEIVGLLPAEDAALITGDTPGAERDVLIEDFKAQR
FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLAPGKTD
CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD
GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE
CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED
GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL
RHPDFVVARMKGQYWQVREKVFDYEGRFRRAHELRG
>SFV_2430 yfdE, putative enzyme
MSFHLRLFSRHKQMTNNESKGPFEGLLVIDMTHVLNGPFGTQLLYNMGAR
VIKVEPPGHGDDTRTFGPYVDGQSLYYSFINHGKESVVLDLKNDHDKSIF
INMLKQADVLAENFRLGTMEKLGFSWERLQEINPRLIYASSSGFGHTGPL
KDAPAYDTIIQAMSGIMMETGYPDAPPVRVGTSLADLCGGVYLFSGIVSA
LYGREKSQRGAHVDIAMFDATLSFLEHGLMAYIATGKSPQRLGNRHPYMA
PFDVFDTQDKPITICCGNDKLFSALCQALELTELVNDPRFSSNILRVQNQ
AILKQYIERTLKTQAAEVWLAKIHEVGVPVAPLLSVAEAINLPQTQARNM
LIEAGGIMMPGNPIKISGCADPHVMPGAATLDQHGEQIRQEFSS
>SFV_3004 yggR, putative protein transport
MNMEEIVALSVKHNVSDLHLCSAWPARWRIRGRMEAAPFDAPDVEELLRE
WLDDDQRAILLENGQLDFAVSLAENQRLRGSAFAQRQGISLALRLLPSHC
PQLEQLGAPPVLPELLKSENGLILVTGATGSGKSTTLAAMVGYLNQHADA
HILTLEDPVEYLYASQRCLIQQREIGLHCMTFASGLRAALREDPDVILLG
ELRDSETIRLALTAAETGHLVLATLHTRGAAQAVERLVDSFPAQEKDPVR
NQLAGSLRAVLSQKLEVDKQEGRVALFELLINTPAVGNLIREGKTHQLPH
VIQTGQQVGMLTFQQSYQQRVGEGRL
>SFV_3030 yghK, putative permease
MVTWTQMYMPMGGLGLSALVALIPIIFFFVALAVLRLKGHVAGAITLILS
ILIAIFAFKMPIDMAFAAAGYGFIYGLWPIAWIIVAAVFLYKLTVASGQF
DIIRSSVISITDDQRLQVLLIGFSFGALLEGAAGFGATVAFGALGVPILV
AGQVTGIDPFHIGAMAGRQLPFLSVLVPFWLVAMMDGWKGVKETWPAALV
AGGSFAVTQFFTSNYIGPELPDITSALVSIVSLALFLKVWRPKNTETAIS
MGQSAGAMVVNKPSSGGPVPSEYSLGQIIRAWSPFLILTVLVTIWTMKPF
KALFAPGGAFYSLVINFQIPHLHQQVLKAAPIVAQPTPMDAVFKFDPLSA
GGTAIFIAAIISIFILGVGIKKGIGVFAETLISLKWPILSIGMVLAFAFV
TNYSGMSTTLALVLAGTGVMFPFFSPFLGWLGVFLTGSDTSSNALFGSLQ
STTAQQINVSDTLLVAANTSGGVTGKMISPQSIAVACAATGMVGRESELF
RYTVKHSLIFASVIGVITLLQAYVFTGMLVS
>SFV_3052 yghYX, Predicted hydrolase
MPRLTAKDFPQELLDYYDYYVHGKISKREFLNLAAKYAVGGMTALALFDL
LKPNYALATQVEFTDPEIVAEYITYPSPNGHGEVRGYLVKPAKMSGKTPA
VGVVHENRGLNPYIEDVARRVAKAGYIALAPDGLSSVGGYPGNDDKGREL
QQQVDPTKLMNDFFAAIEFMQRYPQATGKVGITGFCYGGGVSNAAAVAYP
ELACAVPFYGRQAPTADVAKIEAPLLLHFAELDTRINEGWPPYEAALKDN
NKVYEAYIYPGVNHGFHNDSTPRYDKSAADLAWQRTLKWFDKYLS
>SFV_3126 ygjO, putative enzyme
MDLIVIHHFIDRRGITQKLIFQILSILLESFIGGVVVCSRIAFCQHIAQT
LFVNQCSHLRKQLLGITLKISKIAHNRPRKREKGYTHAPFSGATSKFYHS
GGPMSHLDNGFRSLTLQRFPATDDVNPLQAWEAADEYLLQQLDDTEIPGP
VLILNDAFGALSCALAEHKPYSIGDSYISELATRENLRLNGIDESSVKFL
DSTADYPQQPGVVLIKVPKTLALLEQQLRALRKVVTSDTRIIAGAKARDI
HTSTLELFEKVLGPTTTTLAWKKARLINCTFNEPPLADAPQTVSWKLEGT
DWTIHNHANVFSRTGLDIGARFFMQHLPENLEGEIVDLGCGNGVIGLTLL
DKNPQAKVVFVDESPMAVASSRLNVETNMPEALDRSEFMINNALSGVEPF
RFNAVLCNPPFHQQHALTDNVAWEMFHYARRCLKINGELYIVANRHLDYF
HKLKKIFGNCTTIATNNKFVVLKAVKLGRRR
>SFV_3165 yhaE, putative dehydrogenase
MKVGFIGLGIMGKPMSKNLLKAGYSLVVADRNPEAIADVIAAGAETASTA
KAIAEQCDVIITMLPNSPHVKEVALGENGIIEGAKPGTVLIDMSSIAPLA
SREISEALKAKGIDMLDAPVSGGEPKAIDGTLSVMVGGDKAIFDKYYDLM
KAMAGSVVHTGAIGAGNVTKLANQVIVALNIAAMSEALTLATKAGVNPDL
VYQAIRGGLAGSTVLDAKAPMVMDRNFKPGFRIDLHIKDLANALDTSHGV
GAQLPLTAAVMEMMQALRADGLGTADHSALACYYEKLAKVEVTR
>SFV_3153 yhaP, putative L-serine dehydratase
MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLY
GSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVAS
GAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGF
IVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALR
SKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSS
DNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYY
DKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMA
AAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINA
VKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIK
VVCG
>SFV_3375 yhfL, hypothetical protein
MNKFIKVALVGAVLATLTACTGHIENRDKNCSYDYLLHPAISISKIIGGC
GPTTQ
>SFV_3377 yhfN, putative transport protein
MLSEENKMLDIDKSTVDFLVTENMVQEVEKVLSHDVPLVHAIVEEMVKRD
IDRIYFVACGSPLNAAQTAKHLADRFSDLQVYAISGWEFCDNTPYRLDDR
CAVIGVSDYGKTEEVIKALELGRACGALTAAFTKRADSPITSAAEFSIDY
QADCIWEIHLLLCYSVVLEMITRLAPNAEIGKIKNDLKQLPNALGHLVRT
WEEKGRQLGELASQWPMIYTVAAGPLRPLGYKEGIVTLMEFTWTHGCVIE
SGEFRHGPLEIVEPGVPFLFLLGNDESRHTTERAINFVKQRTDNVIVIDY
AEISQGLHPWLAPFLMFVPMEWLCYYLSIYKDHNPDERRYYGGLVEY
>SFV_3381 yhfS, hypothetical protein
MKTFPLQSLTLAEAQQKQFALVDTICRHFPGSEFLAGGDLGLTPGLNQPR
ITQRVEQVLADAFHAQAAALVQGAGTGAIRAGLAALLKPGQRLLVHDAPV
YPTTRVIIEQMGLTLITADFNDLSALKQVVDEQQPDAALVQHTRQQPQDS
YVLADVLATLRAAGVPALTDDNYAVMKVARIGCECGANVSTFSCFKLFGP
EGVGAVVGDADVISRIRATLYSGGSQIQGSQALEVLRGLVFAPVMHAVQA
GVSERLLALLNGGAVAEVKSAVIANAQSKVLIVEFHQLIAARVLEEAQKR
GALPYPVGAESKYEVPPLFYRLSGTFRQANPQLEHCAIRINPNRSGEETV
LRILRESIASI
>SFV_3412 yhgF, hypothetical protein
MMNDSFCRIIAGEIQARPEQVDAAVRLLDEGNTVPFIARYRKEITGGLDD
TQLRNLETRLSYLRELEERRQAILKSISEQGKLTDDLAKAINATLSKTEL
EDLYLPYKPKRRTRGQIAIEAGLEPLADLLWSDPSHTPEVAAAQYVDADK
GVADTKAALDGARYILMERFAEDAALLAKVRDYLWKNAHLVSTVVSGKEE
EGAKFRDYFDHHEPLSTVPSHRALAMFRGRNEGVLQLSLNADPQFDEPPK
ESYCEQIIMDHLGLRLNNAPADSWRKSVVSWTWRIKVLMHLETELMGTVR
ERAEDEAINVFARNLHDLLMAAPAGLRATMGLDPGLRTGVKVAVVDATGK
LVATDTIYPHTGQAAKAAMTVAALCEKHNVELVAIGNGTASRETERFYLD
VQKQFPKVTAQKVIVSEAGASVYSASELAAQEFPDLDVSLRGAVSIARRL
QDPLAELVKIDPKSIGVGQYQHDVSQTQLARKLDAVVEDCVNAVGVDLNT
ASVPLLTRVAGLTRMMAQNIVAWRDENGQFQNRQQLLKVSRLGPKAFEQC
AGFLRINHGDTPLDASTVHPEAYPVVERILAATQQALKDLMGNSSELRNL
KASDFTDEKFGVPTVTDIIKELEKPGRDPRPEFKTAQFADGVETMNDLQP
GMILEGAVTNVTNFGAFVDIGVHQDGLVHISSLSNKFVEDPHTVVKAGDI
VKVKVLEVDLQRKRIALTMRLDEQPGETNSRRGGGNERPQNNRPAAKPRG
REAQPAGNSAMMDALAAAMGKKR
>SFV_3415 yhgG, hypothetical protein
MPLMASLIQVRDLLALRGRMEATQISQTLNTPQPMINAMLQQLESMGKAV
RIQEEPDGCLSGSCKSCPEGKACLREWWALR
>SFV_3511 yhiR, hypothetical protein
MCAPNTGQFLLTGTPLPMLSYRHSFHAGNHADVLKHTVQSLIIESLKEKD
KPFLYLDTHAGAGRYQLGSEHAERTGEYLEGIARIWQQDDLPAELEAYIN
VVKHFNRSGQLRYYPGSPLIARQLLREQDSLQLTELHPSDYPLLRSGFQK
DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEG
YKRFATGTYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRR
GMTASGMIVINPPWKLEQQMNNVLSWLHSKLVPAGTGHATVSWIVPE
>SFV_3986 yhiV, putative transport system permease protein
MANYFIDRPVFAWVLAIIMMLAGGLAIMNLPVAQYPQIAPPTITVSATYP
GADAQTVEDSVTQVIEQNMNGLDGLMYMSSTSDAAGNASITLTFETGASP
DIAQVQVQNKLQLAMPSLPEAVQQQGISVDKSSSNILMVAAFISDNGSLN
QYDIADYVASNIKDPLSRTAGVGSVQLFGSEYAMRIWLDPQKLNKYNLVP
SDVISQIKVQNNQISGGQLGGMPQAADQQLNASIIVQTRLQTPEEFGKIL
LKVQQDGSQVLLRDVARVELGAEDYSTVARYNGKPAAGIAIKLAAGANAL
DTSRAVKEELNRLSAYFPASLKTVYPYDTTPFIEISIQEVFKTLVEAIIL
VFLVIYLFLQNFRATIIPTIAVPVVILGTFAILSAVGFTINTLTMFGMVL
AIGLLVDDAIVVVENVERVIAEDKLPPKEATHKSMGQIQRALVGIAVVLS
AVFMPMAFMSGATGEIYRQFSITLISSMRLSVFVAMSLTPALCATILKAT
PEGGHKPNALFARFNTLFEKSTQHYTDSTRSLLRCTGRYMVVYLLICAGM
AVLFLRTPTSFLPEEDQGVFMTTAQLPSGATMVNTTKVLQQVTDYYLTKE
KDNVQSVFTVGGFGFSGQGQNNGLAFISLKPWSERVGEENSVTAIIQRAM
IALSSINKAVVFPFNLPAVAELGTASGFDMELLDNGNLGHEKLTQARNEL
LSLAAQSPDQVTGVRPNGLEDTPMFKVNVNAAKAEAMGVALSDINQTIST
AFGSSYVNDFLNQGRVKKVYVQAGTPFRMLPDNINQWYVRNASGTMAPLS
AYSSTEWTYGSPRLERYNGIPSMEILGEAAAGKSTGDAMKFMADLVAKLP
AGVGYSWTGLSYQEALSSNQAPALYAISLVVVFLALAALYESWSIPFSVM
LVVPLGVVGALLATDLRGLSNDVYFQVGLLTTIGLSAKNAILIVEFAVEM
MQKEGKTPIEAIIEAARMRLRPILMTSLAFILGVLPLVISHGAGSGAQNA
VGTGVMGGMFAATVLAIYFVPVFFVVVEHLFARFKKA
>SFV_3564 yhjG, hypothetical protein
MSKAGKITAAISGAFLLLIVVAIILIATFDWNRLKPTINQKVSAELNRPF
AIRGNLGVVWERQKQETGWRSWVPWPHVHAEDIILGNPPDIPEVTMVHLP
RVEATLAPLALLTKTVWLPWIKLEKPDARLIRLSEKNNNWTFNLANDDNK
DANAKPSAWSFRLDNILFDQGRIAIDDKVSKADLEIFVDPLGKPLPFSEV
TGSKGKADKEKVGDYIFGLKAQGRYNGEPLTGTGKIGGMLALRGEGTPFP
VQADFRSGNTRVAFDGVVNDPMKMGGVDLRLKFSGDSLGDLYELTGVLLP
DTPPFETDGRLVAKIDTEKSSVFDYRGFNGRIGDSDIHGSLVYTTGKPRP
KLEGDVESRQLRLADLGPLIGVDSGKGAEKSKRSEQKKGEQSVQPAGKVL
PYDRFETDKWDVMDADVRFKGRRIEHGSSLPISDLSTHIILKNADLRLQP
LKFGMAGGSIAANIHLEGDKKPMQGRADIQARRLKLKELMPDVELMQKTL
GEMNGDAELRGSGNSVAALLGNSNGNLKLLMNDGLVSRNLMEIVGLNVGN
YIVGAIFGDDEVRVNCAAANLNIANGVARPQIFAFDTENALINVTGTASF
ASEQLDLTIDPESKGIRIITLRSPLYVRGTFKNPQAGVKAGPLIARGAVA
AALATLVTPAAALLALISPSEGEANQCRTILSQMKK
>SFV_3540 yhjX, putative resistance protein
MTPSNYQRTRWLTLIGTIITQFALGSVYTWSLFNGALSAKLDAPVSQVAF
SFGLLSLGLAISSSVAGKLQEHFGVKRVTVASGILLGLGFFLTAHSNNLM
MLWLSAGVLVGLADGAGYLLTLSNCVKWFPERKGLISAFAIGSYGLGSLG
FKFIDTQLLETVGLEKTFVIWGAIALVMIVFGATLMKDAPKQEVKTSNGV
VEKDYTLAESMRKPQYWMLAVMFLTACMSGLYVIGLAKDIAQSLAHLDVV
SAANAVTVISIANLSGRLVLGILSDKIARIRVITIGQVISLVGMAALLFA
PLNAVTFFAAIACVAFNFGGTITVFPSLVSEFFGLNNLAKNYGVIYLGFG
IGSIFGSIIASLFGGFYVTFYVIFALLILSLALSTTIRQPEQKMLREAHG
SL
>SFV_3844 yicP, probable adenine deaminase
MNNSINHKFHHISRAEYQELLAVSRGDAVADYIIDNVSILDLINAGEISG
PIVIKGRYIAGVGAEYADTPALQRIDARGATAVPGFIDAHLHIESSMMTP
VTFETATLPRGLTTVICDPHEIVNVMGEAGFAWFARCAEQARQNQYLQVS
SCVPALEGCDVNGASFTLEQMLAWRDHPQVTGLAEMMDYPGVINGQNALL
DKLDAFRYLTLDGHCPGLGGKELNAYIAAGIENCHESYQLEEGRRKLQLG
MSLMIREGSAASNLNALAPLINEFNSPQCMLCTDDRNPWEIAHEGHIDAL
IRRLIEQHNVPLHVAYRVASWSTARHFGLNHLGLLAPGKQADIVLLSDAR
KVTVQQVLVKGEPIDAQTLQAEESARLALSAPPYGNTIARQPVSASDFAL
QFTPGKRYRVIDVIHNELITHSRSSVYSENGFDRDDVCFIAVLERYRQRL
APACGLLGGFGLNEGALAATVSHDSHNIVVIGRSAEEMALAVNQVIQDGG
GLCVVRNGQVQSHLPLPIAGLMSTDTAQSLAEQIDALKAAARECGPLPDE
PFIQMAFLSLPVIPALKLTSQGLFDGEKFAFTTLEVTE
>SFV_3815 yidA, hypothetical protein
MAIKLIAIDMDGTLLLPDHTISPAVKNAIAAARARGVNVVLTTGRPYAGV
HNYLKELHMEQPGDYCITYNGALVQKAADGSTVAQTALSYDDYRFLEKLS
REVGSHFHALDRTTLYTANRDISYYTVHESFVATIPLVFCEVEKMDPNTQ
FLKVMMIDEPAILDQAIARIPQEVKEKYTVLKSAPYFLEILDKRVNKGTG
VKSLADVLGIKPEEIMAIGDQENDIAMIEYAGVGVAMDNAIPSVKEVANF
VTKSNLEDGVAFAIEKYVLN
>SFV_3826 yidE, putative transport protein
MSDIALTVSILALVAVVGLFIGNVKFRGIGLGIGGVLFGGIIVGHFVSQA
GMTLSSDMLHVIQEFGLILFVYTIGIQVGPGFFASLRVSGLRLNLFAVLI
VIIGGLVTAILHKLFDIPLPVVLGIFSGAVTNTPALGAGQQILRDLGTPM
EMVDQMGMSYAMAYPFGICGILFTMWMLRVIFRVNVETEAQQHESSRTNA
GALIKTINIRVENPNRHDLAIKDVPILNGDKIICSRLKREETLKVPSPDT
IIQLGDLLHLVGQPADLHNAQLVIGQEVDTSLSTKGTDLRVERVVVTNEN
VLGKRIRDLHFKERYDVVISRLNRAGVELVASGDISLQFGDILNLVGRPS
AIDAVANVLGNAQQKLQQVQMLPVFIGIGLGVLLGSIPVFVPGFPAALKL
GLAGGPLIMALILGRIGSIGKLYWFMPPSANLALRELGIVLFLSIVGLKS
GGDFVNTLVNGEGLSWIGYGALITAVPLITVGILARMLAKMNYLTMCGML
AGSMTDPPALAFANNLHPTSGAAALSYATVYPLVMFLRIITPQLLAVLFW
SIG
>SFV_3635 yihI, hypothetical protein
MLSVRFRRKSILNKTAKRRTIMKPSSSNSRSKGHAKARRKTREELDQEAR
DRKRQKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIPLGVTEK
VTKQHKPKSEKPMLSPQAELELLETDERLDALLERLEAGETLSAEEQSWV
DAKLDRIDELMQKLGLSYDDDEEEEEDEKQEDMMRLLRGN
>SFV_3619 yihS, hypothetical protein
MKWFNTLSHNRWLEQETDRIFDFGKNSVLPTGFGWLGNKGQIKEEMGTHL
WITARMLHVYSVAAAMGRPGAYALVDHGIKAMNGALRDKKYGGWYACVND
EGVVDASKQGYQHFFALLGAASAVTTGHPEARKLLDYTIEIIEKYFWSEE
EQMCLESWDEAFSKTEEYRGGNANMHAVEAFLIVYDVTHDKKWLDRAIRV
ASVIIHDVARNNHYRVNEHFDTQWNPLPDYNKDNPAHRFRAFGGTPGHWI
EWGRLMLHIHAALEARCEQPPAWLLEDAKGLFNATVRDAWAPDGADGIVY
TVDWEGKPVVRERVRWPIVEAMGTAYALYTVTGDRQYETWYQTWWEYCIK
YLMDYENGSWWQELDADNKVITKVWDGKQDIYHLLHCLVIPRIPLAPGMA
PAVAAGLLDINAK
>SFV_3579 yiiP, putative transport system permease protein
MNQSYGRLVSRAAIAATAMASLLLLIKIFAWWYTGSVSILAALVDSLVDI
GASLTNLLVVRYSLQPADDNHSFGHGKAESLAALAQSMFISGSALFLFLT
GIQHLVSPTPMTDPGVGVIVTIVALICTIILVSFQRWVVRRTQSQAVRAD
MLHYQSDVMMNGAILLALGLSWYGWHRADALFALGIGIYILYSALRMGYE
AVQSLLDRALPDEERQEIIGIVTSWPGVSGAHDLRTRQSGPTRFIQIHLE
MEDSLPLVQAHMVADQVEQAILRRFPGSDVIIHQDPCSVVPREGKRSMLS
>SFV_4371 yjiA, hypothetical protein
MNPIAVTLLTGFLGAGKTTLLRHILNEQHGYKIAVIENEFGEVSVDDQLI
GDRATQIKTLTNGCICCSRSNELEDALLDLLDNLDKGNIQFDRLVIECTG
MADPGPIIQTFFSHEILCQLYLLDGVIALVDAVHADEQMNQFTIAQSQVG
YADRILLTKTDVAGEAEKLRERLARINARAPVYTVTHGDIDLGLLFNTNG
FMLEENVVSTKPRFHFIADKQNDISSIVVELDYPVDISEVSRVMENLLLE
SADKLLRYKGMLWIDGEPNRLLFQGVQRLYSADWDRPWGDEKPHSTMVFI
GIQLPEEEIRAAFAGLRK
>SFV_1704 ynhC, hypothetical protein
MLRTGLPTRKHENWKYTPLEGLTNSQFVSIAGDISPQQRDALALTLDAVR
LVFVDGRYVPALSDAIEGSGYEVSINDDRQGLPDAIQAEVFLHLTESLAQ
SVTHIAVKRGQRPAKPLLLMHITQGVAGEEVNTAHYRHHLDLAEGAEATV
IEHFVSLNDARHFTGARFTINVAANAHLQHIKLAFENPVSHHFAHNDLLL
ADDATAFSHSFLLGGAVLRHNTSTQLNGENSKLRINSLAMPVKNEVCDTR
TWLEHNKGFCNSRQLHKTIVSDKGRAVFNGLINVAQHAIKTDGQMTNNNL
LMGKLAEVDTKPQLEIYADDVKCSHGATVGRIDDEQMFYLRSRGINQQDA
QQMIIYAFAAELTEALRDEGLKQQVLARIGQRLPGGAR
>SFV_3145 yqjF, hypothetical protein
MILSIDSNDANSAPLHKKTISSLSGAVESMMKKLEDVGVLVAHILMPILF
ITAGWGKITAYSGTQQYMEAMGVPGFMLLTAFLFHSNFAEGVNSLMFMKN
LTISGGFLLLAITGPGAYSIDRLLNKKW
>SFV_3181 yraQ, hypothetical protein
MTGQSSSQAATPIQWWKPALFFLVVIAGLWYVKWQPYYGKAFTAAETHSI
GKSILAQADANPWQAALDYAMIYFLAVWKAAVLGVILGSLIQVLIPRDWL
LRTLGQSRFRGTLLGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFW
MGNPVLNPATLVFMGFVLGWGFAAICLVAGLVMVLLIATLVQKWVRETPQ
TQAPVEIDIPEAQSGFFSRWGRALWTLFWSTIPVYILAVLVLGAARVWLF
PHADGAVDNSLMWVVAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALA
LLMTLPAVSLPSLIMLRKAFPAKALWLTGAMVAVSGVIVGGLALLF
>SFV_4277 ytfF, putative transmembrane subunit
MPVMISGVLYALLAGLMWGLIFVGPLIVPEYPAMLQSMGRYLALGLIALP
IAWLGRVRLRQLARRDWLTALMLTMMGNLIYYFCLASAIQRTGAPVSTMI
IGTLPVVIPVFANLLYSQRDGKLAWGKLAPALICIGIGLASVNIAELNHG
LPDFDWARYTSGIVLALVSVVCWAWYALRNARWLRENPDKHPMMWATAQA
LVTLPVSLIGYLVACYWLNIQTPDFSLPFGPRPLVFISLMVAIAVLCSWV
GALCWNVASQRLPTVILGPLIVFETLAGLLYTFLLRQQMPPLMTLSGIAL
LVIGVVIAVRAKPEKPLTESVSES
>SFV_3434 yzgL, hypothetical protein
MQNRKWILTSLVMTFFGIPILTQFFAAVIAMLGAGLAAIPEVCNLLFTPT
IYLLLNIFMLTLGALMLFFSGRVGNVPEFCYVGYDGVGFAVIP


# Yersinia enterocolitica subsp. enterocolitica 8081, 8081

>YE0544 putative transferase
MHKPIYILGTALSHDGSTCLMKDGEIIFAMEKERISRIKHDGFNDNATIQ
YCLDAAGIEYNDLTLIVEQNSHNPIFSEQLEYRKNRILPPMVPVVTLSHH
LAHAYSAIGTSPFDDMGVVVIDGHGGSVDSCNDINGNVYTTDNIHYNNRY
RYWETCSYYVYQNGRMLPIFKDFSRWVNRKDRKIYPVSTWEIENSIGEFY
EGIALYIFGELDCAGKLMGLAPYGRENVINWQPFFLNSGKVILRNDWWIN
IDPLLNTDPAHFKNNFQYYADLAYWAQSKLEEALFYLFNHYYQLHPMKNF
AYAGGVALNATANEKLINGCNFDNLYIQPAAGDNGLSIGCCYYGWLEVLK
KERVKHTGSPFFGKNYNDISVITELEKHNDKIEYTYCENIEAMAADSIAS
GQVIAWFQGESEFGPRALGNRSILADPRSAQMKDHINANVKFREDFRPFA
PAILHEEVQEYFHLDHDSDYMLFIAYVKEQYRTDLPSIVHVDGSARVQTV
KKDINRKFHKLITSFFEQTKIPLLLNTSLNTKGMPIVETPKDAIDLFLSC
GLDVLFINNYKVSKKQSNQ
>YE1988 hypothetical protein
MSEARDLLAQGLWKNNSALVQLLGLCPLLAVSSTATNALGLGLATTLVLV
CTNTAVSALRRWVPNEIRIPIYVMIIASVVSTVQMLINAYAFGLYQSLGI
FIPLIVTNCIVIGRAEAYAARNPVGLSALDGLAMGLGATCALFVLGSLRE
ILGNGTLFDGADMLLGDWAKVLRIEVLHLDSPFLLAMLPPGAFIGLGLLL
AGKYVIDEKMKARKARALATAPQLQDGVAEKAL
>YE0374 putative N-acetylmuramoyl-L-alanine amidase-family protein
MVIVGLFTLTSAFAAKLTDIKVNNGPTESRVTLSFDGQPVYAFFSLNSPE
RVVLDVRQSGNLSGLPLEFSGQNLLKRIRSSTPKDAQSTRLVLELTQKVK
TRAVTQQSGNNYTVVMTMSAAASAPVRQTQPTLSQNQNNVSQNNVPLNQT
NTPSPNAGRLTSQASSTNTASRVTSVNTNSVAKNPFNNKPTVVVSSESVT
TNTSRPIKTSSAANSSRVVVAIDAGHGGQDPGAIGQNGLKEKNVTIAIAR
RLEALLNSDPMFKPVLTRNGDYFISVMGRSDVARKQGANVLVSVHADAAP
NRSATGASVWVLSNRRANSEMGNWLEQHEKQSELLGGAGDVLANTASDPY
LSQAVLDLQFGHSQRVGYDVATKVLNELQRVGDIHKRRPEHASLGVLRSP
DIPSLLVETGFISNSTEERLLGSSAYQEKIAQAIYKGLRSYFLAHPLQAD
PKVENRPLIETAAVDSSSQRSGVNQPDPIVSNAQSGRVSTTKPAAAGAVT
KNRSQIHKVQRGETLSGIASQYGVSMAVLRQNDTLKNDVVWVGQRLKVPA
SGATVAAAAPKAVAKAKPSKSQPLKHQVKRGDTLSAIAARYGVSMSEIER
ANKIKSGNVQLGQTLTIPQS
>YE3714 hypothetical protein
MIAFGSVAQANECDTKAKEIQQQIDYAKQHGNTRRAAGLETALKQVKNNC
TVESLAADRQSKIKEKQHKVTERKEELKEAQQKGDADKITKQQKKLTEAQ
AELKQAQAQK
>YE3019 putative ABC transport protein
MDRSRILKFLLPYLWPKNNPKLRYYLIAAVIFMVVSKVSTTLVPLAYRAM
IDTLSSENAKMMAIPITLIIAYGVARISASLFEELRNVMFVHVSQNATRL
LGLRVFKQLHDLSLRFHLDRQTGGLSLSIERGTQAVATVLSRILFSIFPI
LFEITLVSLIMWHLLDGWFAVAILVTVACYILFTVMAVSWRTRFRRELNQ
ANADANTKSIDSLINYETVKYFGNEQFEAERFNHSRQLYEYAAIKNQFSF
TALSLGQTAIISVGLVVMMSMAAQGITQGRMTIGDFVLVNAYLLQLYQPL
NFFGFIYSEIRQALIDMENMLDLLMVKREITDRPNAPTLQLTKGEVRFDS
VSFGYDPRRPILNKVSFTIPAGKTVAVVGASGAGKSTLSRLLFRFYDVTD
GAIYIDDQDVRNVTQASLREAIGIVPQDTVLFNDTLRYNIAYGRTSSSFA
EIERAAKLAHIHDFIISLPDGYETRVGERGLKLSGGEKQRVAIARTILKN
PAILVFDEATSALDTHTEREIQAHLREVSRDHTTLVIAHRLSTIIDADEI
IVMEAGAIVERGRHEELLLQNGRYSAMWQNQYHEEEQQVL
>YE0380 hypothetical protein
MRKSFLLIVVVVLVALYASLFVVQEGQRGIVLRFGKVLRDSDNKPLVYAP
GLHFKIPFIETVKTLDARIQTMDNQADRFVTNEKKDLIVDSYLKWRISDF
SRYYLATGGGDVSQAEVLLKRKFSDRLRSEIGRLNVRDIVTDSRGRLTSD
VRDALNTGSVGDEAVTTEADDAIASAAARVEQETRGKQPAVNPNSMAALG
IEVVDVRIKQINLPAEVSDAIFQRMRAEREAVARRHRSQGQEEAEKLRAT
ADYEVTRTLAEAERQARITRGGGDAEAARLFADAFSKDPDFYAFIRSLRA
YENSFSSGNDVMVLSPDSDFFRYMKSPDNSSKRP
>YE3675 hypothetical protein
MTVEIELKFIATPAAIAALPERITSWQSQHSAPQTLTNIYFETADNRLRQ
HDIGLRIRGYDGRYEMTVKTGGKVVGGLHQRAEYNVDIDNDKLDLARFPA
DIWPEGWAVDALQTELQPLFRTDFTREKWVITYGESEIELALDLGAISAG
ELSEPLSEIELELKKGNQTDLLALATELAQIGGLRQGNLSKAARGYHLAQ
GSHLSQGQPPRELRPLLVLQPAPKSTVEQGMVAGLEMALDHWQYHEELWL
RGEAAAKPMIIEALGMIRQTLAIFGGLVPRKASTELRAALIALEPQLEPK
NAKAELICYSADYLKCKLALTSWLVTSGWRPFMDDKALAKFNGSFKRFCD
IMLSRSAADLKEAFGHHLDDDGYLAQLPRLNRQIMAFQLLSGFYPQNEWH
PYIDGWFGLQLAIMERQGHWRDTARKEALSQPAFWLNGAVR
>YEP0011 hypothetical protein
MSEDCMAAIFRFSTKSFGIGYPGWITPNTTKLFTRYNITINLPANGNTAT
DAIPALTFLVLLLMLLSSFDWP
>YE1892 putative LysR-family transcriptional regulatory protein
MNVFISKKMRNFIILAQTNSMAKAAEKLHMTASPFGKSIAALEELVGYAL
FTRNEKSISLNKAGQELYQELFPIYQRLSAIDNTIVSLSQRQKNIVIGID
NTYPTIIFDQLFSLSDKYDGVTAQPFEFSENSVIDDLLDRRVDFIISPQQ
ASPRVTHIDSLQTTELPLLRLGFLVSRRFENCQSLELLNTLPWLQMRFQN
RANFESLLDAHFRSIGINPTIIYRPYSFMAKISAVEQGQFLTVVPQFAYR
LVNPAKLKYFDAPNQPMYMREYLYSLKNNHHIEQVMGYIHSDRDGD
>YE0643 hypothetical protein
MDIYLNRRERDNNYFLALAHSAANDLMKTAKIVSSRHIKDFFLKARFESE
VKQLSDGNLNIIRNAKTDSECRAAISNIQEECANIERQGTMLSLDRAKVY
MTINMEKYNNEIGYTINAIGVVTSGLQMSTGFGLIIKSGFILGKLAGAHL
AVSGASTAAENFLRLLGDDEYTGVMEKGYRATAKFMGFEPKVGSMAYHAV
DMASSLYGVVALSLKPDVLRLFRYIPSDYYRNITKMSQGALMLKFAGAGN
KIRIISDINHREDANHF
>YE0575 putative DNA-binding protein
MSFLEMLLRQGPMTASSLAQALATSQPTISRQLTRLKERVLKLGKGKSTR
YALLRLVAGVSEFPLYRIDLDGNASHFANLYPIYPAEMCCVHQVDDDTWQ
LFDSLPWYLADMRPQGFIGRIWGKSAAELLQLDKDIRGWNEDHILLALSR
MADDVNGNILIGQGGYQQWLDKPDAVPIQPSDKPRRYNQLSLMALAGEMV
GSSVGGEQPKFTCYAGYEGKLPSYVIVKFTAVQDNPNAQRWADLLIAESL
ALNTLRGAGYSAAFNHVVQNEDRQTFLEIERFDRHGQRGRIGMVSLEAVN
AEFVGMPVASWPKVCRELTAKGLLSIAELEQVRLIWAFGVLIANTDMHHG
NLSFLHPENRPLSLAPIYDMLPMAFAPSGMGNMRQTAPDITLSVDVSRDH
WVRAQQLAGQFWQNVSSHPNVSVEFKAIACQMREKIASLTPIIERMA
>YE2909 hypothetical protein
MKQVVYVASPDSQQIHVWQLGSTGELTLLQTVEVPGQVQPMTINPNQRHL
YVGVRPDFGIVSYNIADDGTLTAAGMAPLPGSPTHIGTDLQGRFLFSASY
SFNCVSISPIDEHGVVQTPIQQLNDLPAPHSANIDPTNQILLVPCLKEDK
VRLFDLSAEGKLTPHAQSDVTVAAGAGPRHMAFHPNQQVAYCVNELNSSV
DVYQISNNGQEYKIIQTLDAMPADFTDTCWAADIHITPNGRHLYISDRTA
SLLGIFSVSEDGREIALVGHHQTEAQPRGFNIDHNGQFLISSGQKSDHIE
VYRIDQISGELTTLKRYPVGKGPMWVSILAPRA
>YE2678 putative phosphotransferase system component
MKNVAILGSSGGNLFNLGGAYPERLLQEIYTQLHSAGLGVSAVQFIAAEE
SMDVAKPTTAAAVYSITAEEKDKPQISFQGKLSEVNDAVKSSDAAIAAQI
RAGAIDGIIIMSADPDRSNKEAILAAIEKKIPIVGTGGTSMAIITSKGAN
VIATSGTTGTNSRTRAISFTASLCKYWQIKYTPVLGSGDATMPGSDKSLL
KRINIRSIMIPALPGFIAMAIVLALSHIPGLQSLNAIFELLLKGLPVIVA
VIAAKQVSELDEVSIVAGVVAGVLSVEGGLIGGILGGIGAGIMVRYLFGL
CLKWQFPMTTINIVAGGLSGLFSGLVMYYLLSPLALAAGDYIKLAIEATL
AFSPILAGLLAGLVIWPAILGGVYHAVILPLVLLEMEKSGVSFLGAVDMV
GLVMVAAGINLANVIAPREKSEAAVAAPGLLINLGFGTFVESAYPFMFSN
KVVFAGAIFSAGVGGMLLGLLNIKGVAYVPAFASPFLSNNAFYMAIVMAV
TLVLTCVITLLANKFVAIKKAVPESPTPGISAKS
>YE1843 hypothetical protein
MVTNSSIRLNKYISESGICSRRDADRYIEQGNVFINGKRATVGEQVYAGD
VVKVNGQLIEPRDQNDLVLIALNKPVGIICTTEDGEGDNIVDFVNHSKRV
FPIGRLDKDSQGLIFLTNHGDLVNKILRAGNDHEKEYVVTVNKPLTDEFI
LGLGAGVPMLGTVTKKCKVRKEAPFVFSITLVQGLNRQIRRMCKYFGYEV
TKLERTRIMNINLKGLPIGEWRDLRDDELIELFKLIENSSSDEKPQKKPK
AKPVVAKKPAISRSKPAEKSEANSAGRKRFTQPGRKKKGR
>YE3461 putative phage integrase
MAKLNKSLVTLARQAGGSFKTVSDRMKIADRLAERLLKMNIQIRDANHLK
TNHIARYINSRLAENISKRTLQNEMAAIRALLRLAGKTFMANPTHEKLSN
EALGISGASREGTKVAIPIDVYESVLNKVRKTDKGAAAAMELSRQLGLRT
EEAIQSVKSLKTWQKTIINQQEKIRIVFGTKGGRPRDTTIINSDALTKII
NDAINIADKNNGKLIDKLGLHLAIERYRNIVRAAGLVGQYAPHSLRYAYS
RDALAHYRNKGFSQKEAQALVSMDLGHGDGRGQYVARIYNKVTIE
>YE0523 hypothetical protein
MHVTLVEINVKEDKVEQFVEVFRANHQGSLLEPGNLRFDVLQDESIPTRF
YIYEAYVDEAAVAAHKKTPHYLRCVEELEGLMTGPRKKTTFIGLMP
>YE0532 putative O-antigen biosynthesis protein
MPGTSPANPRGEMLNSIAALLLGIALPTSNPLINISLVLILISLIINRKS
LPLKPLLTNPLVYLPAAMFALLALSLLFHHNSYGPEMVSKDKKFLYVLPL
ALFFINQPQRVKLFCLGFLLANAVALLGTLVVGVLHIPIGQVDPTNPTIF
KLQITQNFFMALAAMLWLMLAVKSQGWKRWGYGLLVVAACYSILFLVLGR
TGYVALVVGLGVWLFFSLGSRQRWLLVLLGAIAFAALLLIPNKATQRITQ
GVDEIKVCMAASADNVNDACSTSMGQRSAFAIEAVRLIKEAPILGHGAGG
FYYENPEINYKINNPHNQYLLETIQSGVVGLIIFLAWIVCCYRVIWQQTP
ALRNVLLAVLTSYMACNFFNSFLLDSSEGHLFMIFVAILAGYSVTGTQPR
LELGKTT
>YE2922 putative protease
MGVNNTNYEISSEESNLNTLKNLGQSGKDIKIGIIDGLIERNHPLLNHIN
IHCSFINEEKKTTHGTAVCSIVGGKGLGIAENVEIYNIPVFHEDSQGKLQ
GCSELTLAKTINEARQQGCHIINISGASLSVNGRGSDDLRKAVNNCQQAG
ILIIAAVGNEGINQESLPASLDSVLAVGACDKEGYPAKFNNFGHKLRKKM
LLAPGISIPVAITGNNISLVSGSSFAAPIIAGFSALIIRALSLNGNNQAA
KVVNRLLFETATKLQPPALERQIYTLHRLNISSLLQRVQQELTLYQQNRN
AIMSTEDSIINPSSTDLYLAPENVITPATEENENYIFEATKLSSQPQNSS
LTLPKINDSAANNYVHAPINRVKPQADFDARHIRHQEKIFLIGTIGYDFG
TEARLDYFTQVMGNGQGHPFDPQQMAKHLVAGDNIEQSDALIWVLKVDGI
PVYAIKPDSQFAVLEYARIVNFLNDQEEEGVERVSVSGVVTGETRLFNGQ
VIPTVSPVSRPDSDCWDVVLEFFNPKERLTTARKLYRYTIDVSDIMPVTI
GTLRSWYVY
>YE1687 hypothetical 8.2 kDa protein in gpa 5'region
MPDLMDIAQERQEMLLAMQIAKARSKPITASAFICASCEAEIPEQRRITV
PGVIFCVACQQLHEEKKKHYRGIA
>YE3020 putative lyase
MFLKSVREQISLLQEQGDLRVINRPVDSYLEAAAIIRRCAEIEAPVPLMT
NIKDYPGCSIVGGLAALSSHAEYPLSRVAMSLGLPLTATAQDIVQYLVDG
LKKTPYLPREIERNSAACKQNILHGDKATLARFPIPQVHQYDGNRYVNTW
GVFIVESPDGKWCNWSIQRVQYHNDRQMIALVFPSQHIADIWEEWVKIGK
PMPYALVQGGEPIVPYVAGLPLLDRDVNEADYIGALYGKGIEVVKCETHH
LRVPASSEVVIEGYFAITREGIEGPFGEFAGYTPNENSLQPILSIEAISW
QDNPIWPVVAEGKPVDEYHTCSGIGDAAAIQNVLLEANIPVSFVWGPLYS
ACHLMIVSIKHNWRQLHPDLSSEQLTKKIGDIIHQTRNSIKLPKIVVLDD
DIDPTETNSLLWALSTRVHPEKRRYYYESAILPLLSCYSQAERHNRKGKR
VVIDALLPENHGSISSFDYAYPEAIKQRVLKNWVSDFGDNE
>YE1810 Insertion element protein
MKKRFSDQQIISILREAEAGVSARELCRKHAISDATFYTWRKKYGGMEVP
EVKRLKSLEEENARLKKLLAEAMLDKEALQVALGRKY
>YE0787 hypothetical protein
MHNYCEYKIEFSIRYFYIHEAGQLLLFIYDRDFPWCLFVYLIILNKYYFQ
VVMCHINIV
>YE1040 putative integral membrane protein
MDYIHIVVVALLTGMTALLSHRSVAVFHDGIRPILPQLVEGNMSRREAGS
IAFGLSVGFIASVGISFTLSTGLLNSWLLFLPTDIIGVLAINSYLAFALG
AAWGILALTSLPAVNTALTSLPVNFIGSLGELSSPVISAFALFPLVAIFY
QFGWKTAVVSAFIVLLTRLIITRFTGLFPESIEIFVGMILLIGIAIAQDL
RAKTHLGGGGGHSVFEERTQRIIKNLPMLAIVGGLISAVASMKIFGGSEV
SIYTLAKAYAPGITPEESLALIHQASLAEFMRGLGFVPLIATTALATGVY
AVAGFTFVFVVGYLVPNPLLAGIIGAVVISLEVLMLRSIGKWLGRFPSVR
NASDNIRNAMNLLMEYALLIGAIFASIKMAGYTGFTITTALYFLNEAIGR
PVMKIAAPVVAVIIAGIVLNLFYWLGLFVPA
>YE1100 putative protease
MSLTKVSSSSTNTPFTPELLSPAGTLKNMRYAFAYGADAVYAGQPRYSLR
VRNNEFNHENLALGIQEAHALGKRFYVVVNIAPHNSKLKTFLRDLKPVID
MGPDALIMSDPGLIMMVREAFPQMDIHLSVQANAVNWATVKFWQQMGLTR
VILSRELSLEEIAEIRAQVPDMELEIFVHGALCMAYSGRCLLSGYINKRD
PNQGTCTNACRWEYNVQEGKEDDVGNIVHIHQPIAVKNVEPTLGIGAPTD
KVFMIEEAQRPGEYMTASEDEHGTYIMNSKDLRAIQHVERLTKMGVHSLK
IEGRTKSFYYCARTAQVYRRAIDDAVAGKPFDPTLLTTLEGLAHRGYTEG
FLRRHAHEEQQTYEYGYSVSERQQFVGEFTGIRRDGLAEVIVKNKFSVGD
SVELMTPNGNIQFILESMQDKKGQPADVAPGNGHIMYLPIPEDINVEYGL
LIRNLNGTNSRNPHEKA
>YE2494 hypothetical protein
MLWRMLKQSWFRNVRRKSLAVFTVFLAAGLISSLLAVSIDIGDKMSRELK
SYGANILIEPAGQVALPSLFGEKSNPLTGQNFLDQAELPNIKDIFWRNNI
VGFAPLLSGETEVNGQDIPLLGTYFNQPVDVPDEEDYHTGQQIISPYWQV
NGNWPQEPVKPDVTEVEALLGKQLAQQSGWKIGDELNLDGASGPLKVKIS
GVLSSGGEEESRLVLPLAAVQSLLGLPGKVQAIRVSALTVPENELSRKAR
ENLEALNAEEYDLWYCTAYVSSIAHQLEEAISGSVVRPIWQVAASEGVVI
EKIQLLLAVVTLAALIAAAMGIASLMTSTIMERAKEIGLMKALGARQWQI
LMLFYLEAAISGLIGGLAGCLAGWGLAKTIGLMLFGAPLSFAWMVVPCVL
VISVLIAVIGTWFPARRIARLYPVEVLYGR
>YE2035 putative membrane transport protein
MNLTTSISSSRHPFHYAPFRNLVFARLLTVLGNGIAPIALAFAVLDIGGS
ATDLGLVVAARSIFNLAFLLIGGVLADRYSRSIVLLSSATIAALSQGGVA
WLVLDGSATIMGLAILGTINGAAAGIALPASSAMVPQTVPAHNLRAANAF
IQVGVYAGTIIGASLGGILTSTIGPGWGLAVDALGFAAAAPLYFFIRVNS
PLSLESNTNIVQDLRDGWKEFIARSWVWAIVAQFTIINAAFSGVVMILGP
IIADASFGRAGWGIIVAAESVGLIVGSFLALRWRPRRDLFIGVILVSLCA
VPIVLLSMISSTFVLMAGFFIAGIGFGQFGVVWANSLQTHIPADKLARVY
AYDAMGSFVAIPVGELAAGPLAMHYGNSTVLLAAAVAVVIATIAASFTPA
IRQLDNGPKVKPCDE
>YE1146 putative carboxypeptidase
MTANTLTPQMITGRSTEHLVVLTGNHRMQPQAVDAFLAMQRAAKVAGFDL
QPASTFRDFDRQLAIWNGKFRGERPVLDKDSQPIDISQLDAASRCEAILR
WSALPGASRHHWGSDLDIYDPSLLPLEAKLQLEPWEYQLGGYFYPLTQWL
DTHMAEFGFYRPFSADTGGVAAEPWHLSYRPLATTAEYLLTPAILLEAWQ
SQDVAGSEWLTGHLPMIFSRFITIPNRFQAATNT
>YE3832 hypothetical protein
MPHYLTVFICVLPWLLTTRCLAQTDVESTKCSNTDFSQVLSCYKNKLATE
PLIYNLLSQENLPGLEWRRYQLTSQQWSPEGLVTPAAWQHEVEVFIPSES
TSTQGLVVINNGTNHATDKVPASGPTDFSIDELQNIARQTNTVVIAVSTI
PNQYLEYQQDNEPRKEDYSVARSWTLFMDAPESRSTLPLQVPMAAAISQA
MTMAQQALPEKALDSFIVTGLSKRGWTTWLTAIVDSRVVAIAPFVIDLLD
TRAALEHMYQSYGGNWPIAFTPYYLDEVDKKLDTASFAKLMQIIDPLQYM
GTEYQSRLSIPKYIINASGDDFYVPDNTDFYYDKLPGEKALRVAPNASHS
GIKAFSEQSLISFVNRMRQSMPMPQVNASIAIKDKLQTLTVSLSETPDKV
LLWTATNTDARDFRYACNVRYSEFPLAITPANTLDIALTTPTTGWQATFV
EATFSDGFVATTPVYISPKDVYPTTAPPVAGVACKTLPGRNAIAASEVVT
PIGGAAPEGVQP
>YE1809 hypothetical protein
MDITMKINGHTTTTFGIELKMFRGKFIGLNSSADFYTYCLVELKKEGTLS
GGVFG
>YE0198 hypothetical protein
MPRHIFIATTTDTKGDELAYVSELIKATGLKTVTVDLSTKEARRAGGADI
TAETVASHHPDGRQAVFCGDRGRAISAMAVAFERFIASRDDVAALLGLGG
SGGTALITPAMQSLPIGIPKLMVSTMASGDVSGYIGASDIAMMYSVTDIA
GLNRISRRVLSNAAHQIAGAVYFAKEVFAQAELATDDKPALGLTMFGVTT
PCIQAVSAALSAEYDCLVFHATGSGGKAMEKLAESGLLAGALDLTTTEVC
DLLFDGVLACGPERFDAIAHSQIPYVGSCGALDMVNFGSPATIPVKYADR
LFYEHNAQVTLMRTTKQENIEMARWIGEKLNRCEGEVRFLIPQGGFSALD
APGQPFWDEKALQAFIHTLQETVIQTDKRRLVHYPFNINDPQFAQAAVEN
FKEIAKTPSH
>YE2134 hypothetical protein
MGIQGNDSRIDNIEFTVSDIARSKDFYGTVFDWRFTDYGPSYCEFTDGRL
TGGFTLGEVKPNGGPLVILYADNLEQMQKRLEQTGAKIVIPIFAFPGGRR
FHFTDPDGYQLAVWSNN
>YE0598 transposase for insertion sequence element IS1665
MHIAQALDLVSRYDSLRNPLTTLGDYLDPQLISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGDRPFVAPSAVIQARQRLG
SEAVRRVFSQTAQLWHGSVTHPHWCGLTLLAVDGVVWQTDNATEQADAAE
KKPELVEQELWGVLLGYNLVRYQMIKMTGALKGYWPNQLSFSESCGMVIR
MLMTLQGASPGRIPELMRDMESMAQMVKLPIRRERAFPRVVKERPYKYGK
ARNKNASQLLN
>YE3094A hypothetical protein
MSRWLLIALGWLAVVLATLGVVLPLLPTTPFLLLAAWFFARSSPRFHHWL
LYRSWFGSYLRTWQQHRALPPGAKWKAILVTVLTFAISLWLVKIWWVRGI
LLVILTLLLTFMLRMPVIDLSQQKRP
>YE2418 hypothetical protein
MALVSQARSLGKYFLLLDNLLVVLGFFIVFPLISIRFVDQLGWAAVLVGL
ALGLRQLVQQGLGIFGGAIADRFGAKPMIVTGMLMRAAGFALMAMADEPW
ILWLACALSGLGGTLFDPPRTALVIKLTRPHERGRFYSLLMMQDSAGAVI
GALIGSWLLQYDFHFVCWTGAVIFILAAGWNVWLLPAYRISTVRAPMKEG
LMRVLRDRRFVTYVLTLTGYYMLSVQVMLMLPIVVNEIAGSPAAVKWMYA
IEAALSLTLLYPIARWSEKRFRLEQRLMFGLLIMTLSLFPVGLITHLQTL
FMFICFFYMGSIIAEPARETLSASLADSRARGSYMGFSRLGLALGGALGY
TGGGWMYDTGRTLEMPELPWFLLGVIGLITLVGLYWQFNQRRIESAMLSG
S
>YE2822 putative esterase
MNTSLELLEEHRMFGGWQQRYRHAASSLNCNMTFSIYLPPPRDDNPPPVL
YWLSGLTCNDENFTLKAGAQRIAAELGLVLVMPDTSPRADDVPNDDGYDL
GQGAGFYLNATQAPWSEHFHMYDYVSQELPALIRQHFSVSDRQSISGHSM
GGHGALMLALRNPQQYRSVSAFAPIVNPCQVPWGRKAFSAYLGADESQWL
QYDSCHLLTHAPTQLPILVDQGDGDQFLADQLQPAKFAELARQRDWPLTL
RIQPGYDHSYFTIATFIEDHLRFHAGYLHR
>YE0645 putative pyruvate formate-lyase 3 activating enzyme
MIFNLQRYSTHDGPGIRSVVFLKGCSLSCRWCQNPESRSRQPEVLFDSRL
CLSGCDLCQQTCPQAIQRIDNKIVIHRQQLAENDIQVLKVRCPSGALTVC
GSPIDIDATMETLLRDLPFYRRSGGGVTLSGGEPFMQPEVAAELLQRCHQ
QGIHTAVESCLHTPWRYIAPALPWLDLMLADLKHTDEQRFQQWTGGSAKR
VMDNFRKLAAAGTNITVRVPLIPDFNADLRSLQGIVDFAADEIGVKEIHF
LPYHTLGINKYSLLGEEYLAAKTPLDSPDLLAFAENYARDKGLTAILRG
>YE2705 putative luxR-family regulatory protein
MLSVAITTQNAFYELGLHSLLQKVFNKEGVEDYLFLNPHEKKSIHRANVI
FKDFMVIVNIYQESGFIAKCTSSNHTPMMTVNIPFNSSQLDIYDIISKIK
KILTIARLSHCNMMNSDIFRRLGLKNYIQLSITESRVVELTGQGYTITDI
SNILERSEKTVLTHRRNAIRKLGVLNRLEFYNYASHMKNYSNKEAVFICI
>YE2277 hypothetical protein
MKKVQGIYRAPHQHWVGDGFPVRSLFSYQSHGKQLSPFLLLDYAGPMDFT
PTQQRRGVGQHPHRGFETVTIVYHGEVEHRDSTGKGGVIGPGDVQWMTAG
GGILHEEFHSDAFAKRGGAFEMVQLWVNLPAKNKMTAPGYQAIRSETIPQ
VALPEGAGSLRVIAGDYAGKNGPANTFSPLNVWDIRLNQGKSTEFSLPDG
WNTALIVLRGTVQVNGDALARDAEMVLLDSTGSHVTIEANNDAVLLLLSG
EPIDEPIIGYGPFVMNSQEQIAKAIADFNSGRFGSMDNHA
>YE0321 hypothetical protein
MRTISYSEARQNLSTTMVQTVEDRAPILITRQNGTSCVLMSLEEYESLEE
TAYLLRSPANAKHLMDSIEELRAGKGIQRELEA
>YE3411 hypothetical protein
MIGLLSFNTYADDCFERAGRDYRIDPDLLRAISWNESKGNIHAIGKNPDN
SLDIGLMQINTQHEPELKRYGITRHHLTADPCMNIYTGAYYLAIAFRHWG
VNWDAVGAYNAGFAKNIKQDKRRKHYARKIHATYVEIKAQKKAAH
>YE2954 hypothetical protein
MLKIIRSGIYTTVQDSGRGGFRRLGISQGGALDLPALSMANMLVGNDADA
AGLEITLGQFTAEFTQPGWIAVTGAGCEAMLGDQLLWTGWRYPVKPGQQL
KLSVPHRGMRSYLAISGGIAVPEMLGSRSTDLKAGFGGHQGRLIKEGDNL
PLGQPTRLPRESVGIKQLLFGNRVRAVPGPEYHEFDDVAQGAFWREAWRL
SPQSNRMGYRLTGRELTRTTSREMLSHGLLPGVVQVPHNGQPIVLMADAQ
TTGGYPRIACVIEADLYHLAQLRLGESVHFVPCTVEEALRAKSEQQHFLQ
QIEWGLQSGFDTVEAK
>YE3691 putative glutaminase
MTIDLARLNHVVKDSYSQYSTLAGGANASYIPYLASVPSQLAGLAIVTID
GDIISQGDTDFRFALESISKVCSLALALEDIGPQAVQDKIGADPTGLPFN
SVIALELHNGKPLSPLVNAGAMSTVSAIKASNREDRWARILDMQQQLVGA
PIALSDEVNHSEQTTNFHNRAIAWLLYSAQAMYCDPMEACDVYTRQCSTL
LNTIELATMGATFAAAGVNPVTKKQVLTASNTPFILAEMTMEGMYGSSGD
WAYTVGLPGKSGVGGGILAVVPGVMGIAAFSPPLDPVGNSVRGQKMVASV
AQQLEYNLYKGSL
>YE0356 hypothetical protein
MASIVTRNMPIREDWLQQLADVITDPDELLRILQLNEHPNLQQGTAARRL
FPLRVPRAFVARMQPGNPSDPLLLQVLTAREEFIAAPGFTNDPLDEQRSV
VPGLLHKYRNRALLLVKGGCAVNCRYCFRRHFPYQDNQGNKANWRQALDY
VRQHPELDEIIFSGGDPLMAKDSELSWLLDEIENISHIKRLRIHTRLPVV
IPARITAELCQRLSDSRLQVLMVTHINHANEIDASFRDSMAQLKRAGVTL
LNQSVLLRGVNDDDEVLAALSNALFDAGILPYYIHVLDKVQGAAHFMVDD
DEARQLMKGLLSRVSGYLVPRLAREIGGQPSKTPLDLRLMQDELQ
>YE0514 putative transcriptional regulator
MDLPDKNNETYFKGLIAMMEHLSEPWGIKDLNSRHIYMNKAAFLYTNTPL
SFEVEGKMDHEFPGNWTEFAPDLIEHDKRTEETQDRVTVIETHYWYGKNT
LTPYISEKLPVYNDRKEVIGVMWNAKPFNTLSPLKYINQQKPSILTTEIN
NSMFTRAELDVIFLMLQRLSVKEIAKIYNISTKTIENRIYTIYQKSDVHT
LQQFEEFCKFAHLDNYIPDRLVAKGIQFI
>YE1591 putative transposase for IS1664 element
MLYVADLPRSTFYWQVKSSGREETYADEKQRIKTLFHHHKGRYGYRRITL
ALRNEGGSLNHKTVRKLMRQQQLASNLRRKKYQSYQGAYGKVVPNILARK
FTAEAPNQKWVTDVTEFNVRGKKLYLSPVLDLYNSEVVAWQMDTHPGMNL
IDKMLDDALQKLNSGDEPVLHSDQGWQYQMASYQKRLGSGEVKQSMSRKG
NCLDNAVIENFFGLLKTECWHNEKYEDVEQLKKAVDEYIHYYNNERIKVK
LNGLSPVQYRNQAMSTARKSVQ
>YE0449 hypothetical protein
MKYALGAVLYYWPKTDIETFYQAAASSSADIIYLGENVCTKRREMKVGDW
LALAKDVAASGKQVVISTLALLQAPSELNELKRYVENGEFLLEANDLGAV
NMAADRGLPFVAGHALNCYNAYTLRILHRQGMMRWCMPVELSRDWLANVL
QQCEELGFRDKFEVEVLSYGHLPLAYSARCFTARSEDRAKDECETCCIKY
PQGRKVLSQEDQQVFILNGIQTQSGYCYNLGNDLISMQGLVDIVRLSPQG
METLDVIDQFRANELGLNPLTLADKADCNGYWRRLAGLELVS
>YE2223 hypothetical protein
MLYVIFATDVPDSLEKRLSVRPAHLARLQALQDQGRLLTAGPNPAIDSAD
PGAAGFTGSTVIAEFTSQQEAEAWAEQDPYVAAGVYHSVIVRPYKRVF
>YE0454 putative amino acid permease
MSKTNNKMGVVQLTILTAVNMMGSGIIMLPTKLAEVGTISIVSWLVTAVG
SMALAYAFAQCGMFSRKSGGMGGYAEYAFGKSGNFMANYTYGASLLIANI
AIAISAVGYGTELLGATLTPLGICIATIGVLWLATVANFGGARITGQISS
VTIWGVIIPVVGISIIGWFWFSGSAYAAAWNPHGVPTFEAIGSSISMTLW
AFLGLESACANTDAVENPERNVPIAVLGGTLGAAVIYIISTNVIAGIVPN
MDLANSTAPFGLAFAYMFTPAVGKIIMALMIMSCVGSLLGWQFTIAQVFK
SSADEGFFPKIFSKVSKADAPIKGMLTIVVIQSVLSLMTISPSLNKQFNV
LVNLAVVTNIIPYILSMAALVIIQKTANVPPAKARKANIIAFIGAMYSFY
ALYSSGQEAMTWGAIVTFLGWTLYGLVSPRFEFAAKTK
>YE2172 hypothetical protein
MQQESVGTFSLDDNVWQGVTITDSAAAQITRLMQQDPEIKGLQLGVKQSG
CAGFAYVMDMAKEPASDALVFEHGSARLYVPLKAMPFIDGTVVDYVREGL
NQIFKFNNPKAQHACGCGESFGV
>YE3676 hypothetical protein
MQKLRLICLIVLSLTLSLSAYAEEKRYISDELDTYVHSGPGNQYRIVGTL
KGGDEVTLISVNDGTNYGQIRDSKGKTTWIPLDQLSETPSLRVRVPDLEQ
QVKTLTDKLANIDNSWNQRTAEMQQKVAASDSVISELQKENESLKNQLVV
AQKKVNAVNLQLDDKQRTIILQWFMYGGGVAGIGLLLGLILPHLIPSRKK
NNRWMN
>YE2013 hypothetical protein
MIAKICSSQLYRAKDINSEDVTRVFFSVSSQQKKLVKKLTYVSNAQVITD
LYIKIKSPVENQTKTEAVNRVQRSFTLNKDSSFIDSTTNLIKNTIDANKS
ATKKLSPVRSKRKHKKKNH
>YE0288 transposase for insertion sequence element IS1660
MHIGQALDLVSRYDSLRNPLTTLGDYLDPQLISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGDRPFVAPSAVIQARQRLG
SEAVRRVFSQTAQLWHGSVTHPHWCGLTLLAVDGVVWRTPDTPENDTAFP
RQTYAGQPGLYPQVKMVCQMELTSHLLTAAAFGTMKESEYTLAEQLIDQT
ADNTLTLMDKGYYSLGLLNAWSQAGEHRHWMIPLKKGAQYEEIRKLGKGD
HLVKLKTSPQARKKWPELGTEMTARLLTITRKGKVYHLLTSMTDTMRYPG
GEMADLYGHRWEIELDYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAGALKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDMESMAQMVKLPIRRERAFPRVVKERPYKYGKARNKNASQLLN
>YE2635 putative metallo-beta-lactamase superfamily protein
MNVYLNDAGNGDCIIVETPNTVIMIDGGTASSYKLWNCNLERLSEIDALF
VTHIDNDHVNGLICMFEKRRHPVIKEVFFNGIEQLTNSKLIEESISHEES
STLDKMISNFSDVDEGGVDIGFSEGTSLSYILKTLNYNINERFNGEVLTT
ATVLNEFSIGDITFEIIGPTVKNIERINNSWLEVLSEYDLKMKWLHKKHS
VAFEKYIESLKNEIGYSNICEEMDDSIEVLADREFADDNSLNNMSSISFL
ATSDGKKILFLGDSDTKTILEWMDRKGYETLEVDAVKLPHHGSKHNYNKS
LLERVLCKKYLISTNGKKHSHPDIETLARIVKYSKIQPVDIYINHSVEHI
NNAFKENFSSYSSESNIFSIQKRFLYENG
>YE0035 LuxR family transcription regulatory protein
MEASLFNSLKMLIKFWECSSEPWGVKDNQSRYVYANNRLHKLFALPDKFS
MEGRTDGELPTPISEFELEFQEHDCKVKLLQDRVTSVEIHAWNGHSYYQP
YFFDKYPLIDEHGVSQGIISHSRPVEDVILTHLNKIKVPISLILTPPSDL
FSKREWEVLFYILHSFSSMEIATKLHLSSITVDNIIQKIYKKIGISGRQQ
LVDYCYENKINNYVPQSFFEYSGSFPLV
>YEP0095 putative plasmid copy number control protein
MIEGVHERKRRPKGPQKSNASYQRDFWERRKLTHSRIVMVVPNEVKEDLE
LLSSVYGKTKTEVIEQLIKMAKDGCGFKDPL
>YE3398 hypothetical protein
MSAQPVDIQIFGRSLRVNCPPEQQDALNMAAEDLNQRLQDLKVRTRVTNT
EQLVFIAALNVCHELAQERLKTRDYASNMEQRIRMLQQTIEQALLEQGRI
SERQDAQFE
>YE1025 putative RpiR-family transcriptional regulatory protein
MSSLLRIRQLYPTLAQNDRKLADFLLNNPEQARHLSSQKLAQQTGVSQSA
VVKFAQKLGYKGFPALKQALSEIVAAPEQAVTLHNQILSTDSLKTVGEKL
LAEKAAALRATLDINSEQRLTEALDMLRSARRVILTGLGASGLVAKDLAY
KLLKIGVMAVSETDMHAQLAAVQALDSRDLLLAISFSGERREINLAAEEA
QRCGAKVLALTSFTPNSLQQRADHCLYTISEEPAIRSAAISSSTAQYALT
DLLFMAMIQQDLESAQDHIKHSAALVKKLV
>YE2445 transposase (pseudogene)
MDEKSLYAHILNLSAPWQVESLSLDENAGSVTVVVGIAKNTLLICPTCGQ
QCPVHDHRHRKWRHLDTCQFMTVVEANVPVMCPDHGCQTLPVPWAGVGSR
YTLLFESFVLSWLKISTIDAVRKQLNLSWNAVDGIMTQAVKRGLARIKKP
LSARHMHVDEVAFKKGHRYITVISDREGRALALTDDRGTENLASYLRTLT
DGQLEAIKTFSMDMAGYIKAARIHLPGAVGKIAFDRFHVAKQLGEIVDKT
RQTEHPRLLVDSRRQAKGTRFLWQYSEKRMTEPRQEKLTWLREQMQLTRQ
CWTLKELAKDIWNRPWSTERRTDWQQWIALARSCDVPVMKNMAGTISKRL
YGILNAMRHNVSNGNAEALNSKIRLLRIKARDYRNRERFKLGVMFHYGKL
NMSF
>YE0160 putative fimbrial chaperone
MGISFANASVVMSGSRIIYSAGEKEHSIQLTNNDNFPNAVQVWLDSGDTQ
STPETGKAPFIVTPPFFRIEANAGQTLRLKYTGSGLPTDRESVFYLNFLQ
VPPVNKAEKNNKMLVLMRNRIKVFYRPENIAGRVDQVSSALTFNVRQQGK
DVVVTGKNPTGFYATIASGEVVGGGKKLKMKSEMIPPMSQAQWVIPNSSV
PSNAIVNFLLVNDFGGQDTGSYKIQ
>YE2660 putative binding protein-dependent transport system, inner-membrane component
MTHSHALRLPQTRDTIFYWLAFAVAAFALLPAFSLDYGLLEASRSELLAA
YGWSSLNISTLWFALPLVLLFRPLSPQGREHRTRHLFDAGFALFCALFVV
ITSAWLQRGLGFATIGLFIALGAVITLALARLEWLGGDRFVLGSLVTIIA
LITVFILFPSIAIFIPIFTDESGAFAPWQFMAILGQAHILQVIWNSFVLS
VAVGIGCTFFGMVLAIYTTRIAKRSALLGRIFSILPIVTPPFVVGLGVTL
MMGRSGYVTELMVTWFGLTNTNWLYGFTGIWLAQVLAFTPMSFMILDGAI
KTIHPSLEEASYTLRASRWQTFMQVFLPLLKPALANSFLIVIVQSLADFS
NPLVLGGNFDVLATQIYFYITGAQLDYPAASTLGVILLMFSLFIFCIQYM
WIGKRSYVTISGKSSRGDVQPLPVSLVWGTTAMLYVWIAFNVLLYGSIFY
GSFTVNWGVDYTLTLNNFNQLFGQGFSDGAWPSLLDTLLYAGIAAPITAA
FGLLIAYIVVRQQFRGKKTIEFSTMLCFAVPGTVAGVSYILAFNSAPVYL
TGTAAIVIISMVMRNVPVGIRAGIAGLGQLDKSLDEASLSLRAGSLRTVI
YILLPLLRPAILSALIYSFVRAMTTVSAIVFLVTPDTRVATSYILNRVED
GEYGIAIAYGSILIVVMLAIIFLFDYLVGEARIARSKASNAE
>YE2190 probable formyl transferase
MKAIVFAYHDIGCVGLKALVEAGYDIQAVFTHTDSPNENRFFSSVARVAA
DLDLPVFAPEDVNHPLWIERIQQLQPDIIFSFYYRNMLCDDILSSAPRGG
FNLHGSLLPKYRGRAPINWVLVNGETETGVTLHQMVKKADAGPIVGQHKV
MISGSDTALTLHTKMRDAANELLRDLLPKMKNASLPLEPQKESEASCFGR
RTADDGAIDWQKSALAINNLVRAVTEPYPGAFSYLGQRKLIIWRSQPLDV
VHSKLAGTVLSTSPLVVACGDGALEIITGQGEEGLYVQGSRLAQEMGIVT
GIRLGNKPSQKLKRRTRVLILGVNGFIGNHLTERLLRDNRYEVYGLDIGS
DAICRFLDNPNFHFVEGDISIHSEWIEYHIKKCDVILPLVAIATPIEYTR
NPLRVFELDFEENLKIVRDCVKYNKRIVFPSTSEVYGMCDDKEFDEDSSR
LIVGPINKQRWIYSVSKQLLDRVIWAYGAKENLKFTLFRPFNWMGPRLDN
LDAARIGSSRAITQLILNLVEGSPIKLVDGGEQKRCFTDIHDGIEALFRI
IENRDGACDGQIINIGNPTNEASIRELAEMLLRCFEKHELRHNFPPFAGF
KDIESSAYYGKGYQDVEYRTPSIRNARRILDWQPEIALEQTVMETLDFFL
RGVVLEQNVLEQNVLEQNVLEQNVIEQSKTSKDEHHV
>YE0095 hypothetical protein
MTIQQWCFSLKGRIGRRDFWIWIGLWLLAMLIIFTLAGQNWLSTQTAAFA
IVFLLWPTAAVMVKRLHDRNKAGWWALLVVLAWMLMAGNWQMLAPIWQWG
VGRFIPTLIMVMMLIDCGAFLGTEGENRFGPEAVPVKFLAEKSQ
>YE0202 hypothetical protein
MPVTPLTLESARNLIGEIFVYHMPFNRELGLKLTRFEQDFAEITFDNNDK
LVGNIAQRILHGGVIAAVLDVAAGLVCVGNSLVRHEPLIQEQLQMKLAKM
GTIDLRVDYLRPGRGEHFIASSCILRSGNKVSVARVELHNENQMHIASAT
ATYLVG
>YE0517 hypothetical protein
MRILFIYLNYSTVILFILYVYSIESPTELMIRIVVRLVPSNSLYPNITFI
DLLQGGALSVLY
>YE1467 putative ABC transport ATP-binding subunit
MISIERLSKTYTQGGLPMVALEEVSLEIPTGSVFGIIGRSGAGKSTLIRC
LNLLERPTSGRIQVDGRELTTLSDRELRLQRQNIGMIFQNFHLLHSRNVW
DNIAVGLEIIGMPKAQRQTRVADLLDLVGLGDKAHAFPSQLSGGQKQRVG
IARALAAKPSYLLSDEATSALDPETTASILALLSDINRQLGLTIVLITHE
LDVVKSICDTAALLERGRVVETGAIADLLSSPHSRLGRSLLPARGPASLS
GAPVAELTFFDTVSASPVLSELAQRHAVGVTLLGGGVESIGGQRVGRLQV
DFSHPDGGLNLAEVLQFLNERGVRAELI
>YE0827 hypothetical protein
MKKILNAFFPLYSATVLMLLGSGLLTTYISLRLTAIHVSGALIGAIIAAN
YIGLVIGGKVGHFLIARVGHIRAYVACAGIITAAVLSHGLTEYIPAWVVL
RLIIGLCMMCQFMVLESWLNDQAESSQRGTVFGFYMAATYAGMALGQVVL
MLQPELGLSTLMIIALFFALCLVPVALTTRSNAQQMSPAPMELKFFINSI
PKILASTLVIGMIVGSFYGLAPVYASLQSLSTQETGLFMALSIFAGLVAQ
LPLSWLSDRYNRTLLLRINALLLALTALPLALVPHISFHFLLGLGFIVSM
LQFTLYPLVVALANDLIAPERRVSLAACLLMSFGVGASIGPLAVGALIAP
LGGNILYAFFSLCGLSLIAFSRTVKQDQDEFVNDAPVPHMAIPDSLVSSP
LSPALNPSFDEQLIHDIMPPPDQAEAEKESVESSSEDEPQQQ
>YE2173 hypothetical protein
MLWKRTATLEQLNQQSQGCMVGHLGIEFTRLSDDELEATMPVDNRTTQPF
GLLHGGASVVLAESLGSMAGYLCTTEGQNIVGLEINANHLKAVKSGKVRG
CCRAIHVGRSHQVWQIEIFDEQNRLCCTSRLTTAVLSSPK
>YE0542 multifunctional PTS-system sugar transferase
MVNLVVVSHSALLAQGVAELAQQMTQGGCQLAVAAGVDDVDHPIGTDAIK
VMEAIESVYSPSGVLVLMDLGSALLSAETALELLDPEMARNVQLCAAPLV
EGTLAAVVAASSGASLAEVRAEAMGALVAKAAQLGEGIAPDANSAVVAKA
APDAQSVSWVVRNPNGLHVRPAAKLVEVLAPFTADLLLEKNGQCVNPRSL
NQLAILQVRKGDTIRLLASGEQAGEALDAFMQLAHQHFGESVSTISDSGF
TGVMVPRGAITAPVLQWLPAIPVFLPQTINVGSVANEQLRLHQALAHTVA
DLQQLAQQAEQQISAQAAAIFNAHAMLIDDEELYASMDKRIEQQLVCAES
ALQDELMSMVADYQALTDDYLRVRELDIRDILNRVLGHLTGLPPVPFSVD
REILLLAEELFPSQMIGLNHQHVKGICLSRGHILSHSAILATELDIPMLV
GAVGCLDASRNGQNALLDTVTGVLKLQ
>YE3251 hypothetical protein
MWYSPYLRPLILAPLLFASSACTHTANDSWTGKDKAQHFFASAALAAAGT
AYGEHQNWSDARSRNFGLLFSIGIGAGKELYDSRQGGTGWSWKDFAWDVA
GAVTGYSLYQAVN
>YE3577 hypothetical protein
MSSIPTSSRAAWLALVARLHFYIGLFIAPFIFFAALSGTWYVIAPNIEKN
LYAEQLYSTGEGKAHPLMQQITAAQGAAGKNAQIIAVRPAPTSGESTRVM
FAASGLNTGESRAIFIDPVTLAVLGDMPVYGSSGSLPLRTWLDKLHSGAL
FGDLGRNYSELAASWMWVAALGGIIMWAVQRRKKKPAAATKKTHRYRHIV
IGLTLLPLLLFISVTGLTWSKWAGDNISVLRTTLAWKTPSLNTALTSKIP
GTNDPHAHHCAGHEPGMAMPSYQLWRYDAVLEQARLAGIDAAKVEIRPGS
SANHAWTVAEIDRSWPTQVDARAFDIKTMQLVDQLNFDRFPLVAKLIRWG
IDAHMGILFGAANQLVLIFFGLGLCSTIIMGYCMWWRRRPKHQRFPVQGS
LMSSLGRLSLTGKALCLLPTLLLAFSLPLMGISLAAFLLIDSLCWFKARK
LRNADLKTA
>YE3237 probable sugar transporter
MIVLMSVATGLAVASNYYAQPLLETIAQAFNLSVNQAGFIVTAAQLGYAV
GLMFLVPLGDMFERRGLIVGMTLLAAGGMLITAMSQNLAMMIVGTALTGL
FSVVAQLLVPLAATLAAPEKRGKVVGIIMSGLLLGILLARTVAGALASIG
GWRTIYWVASVLMIIMALILWRYLPRYKQHSGLNYGQLLGSIFSLFIRTP
VLRTRALLGALSFANFSVLWTSMAFLLASPPFGYSEATIGLFGLVGAAGA
LMATKAGQLADKGKARITTSVGLILLLLSWIPIALGQHSITALIIGIIVL
DLAVQGVHVTNQSVIYRMMPEARNRLTAGYMTTYFIGGALGSLISAAAYQ
HAGWYGVALAGLVLCILNITTWLAGKRFDPPANQPVE
>YE2111 hypothetical protein
MSQYRPRSTRGNLVTLGQHYGTSLLGAPLIYFAAANPSAQTGLIIAGTHG
DESAAIVALSCALRSISPEQQRHHVILAVNPDGCQLGLRANANGVDLNRN
FPAKNWQAGETVYRWNSAADARDVVLSTGECAASEPETQALCALITQLSP
RWVVSFHEPLACIEDPDSSALGEQLAANFNLPLVTNVGYATPGSFGSWCA
DISLPCITAELPPISADAASECYLAALIDLLTRPD
>YE1740 hypothetical protein
MFVVSLTYHQPIDVVEALTESHKDWLKKYYAQGVFIASGRKVPRTGGIIL
AKSIMREELDKILAEDPFNAVAHYEVTEFIPSMTIESVAALKTL
>YEP0090 transposase for insertion sequence element IS1665
MHIAQALDLVSRYDSLRNPLTTLGDYLDPQLISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGDRPFVAPSAVIQARQRLG
SEAVRRVFSQTAQLWHGSVTHPHWCGLTLLAVDGVVWQTDNATEQADAAE
KKPELVEQELWGVLLGYNLVRYQMIKMTGALKGYWPNQLSFSESCGMVIR
MLMTLQGASPGRIPELMRDMESMAQMVKLPIRRERAFPRVVKERPYKYGK
ARNKNASQLLN
>YE0699 aceE, pyruvate dehydrogenase E1 component
MSERLNNDVDPIETRDWLQAIESVIREEGVERAQYLIDQVLGEARKGGVS
VAAGSASRDYINTIPVEEEPAYPGNLELERRIRSAIRWNAVMTVLRASKK
DLDLGGHMASFQSSATFYEVCFNHFFRARNQKDGGDLVYFQGHISPGVYA
RAFLEGRLTQEQMDNFRQEVDGKGLSSYPHPKLMPEFWQFPTVSMGLGPI
SAIYQAKFLKYLSHRGLKDTSAQTVYAFLGDGEMDEPESKGAITIATREK
LDNLVFVINCNLQRLDGPVTGNGKIINELEGIFSGAGWQVLKVIWGGRWD
ELLRKDTSGKLIQLMNETLDGDYQTFKSKDGAYVREHFFGRFPETAALVK
DMSDDEIWSLNRGGHDPKKVFAALKKAQETTGKPTVILAHTIKGYGMGET
AEGKNIAHQVKKMNMEGVHHFRDRFNVPVADADIEKLPYITFEKDSEEYK
YLHERRQALEGYVPTRMPKFTEKLEMPALSDFSSLLEEQNKEISTTIAFV
RALNVMLKNKSIKDRIVPIIADEARTFGMEGLFRQIGIYSPNGQQYTPQD
REQVAYYKEDEKGQILQEGINELGAASSWLAAATSYSTNDLPMIPFYIYY
SMFGFQRIGDLCWAAGDQQARGFLIGGTSGRTTLNGEGLQHEDGHSHIQS
LTIPNCISYDPAYAYEVAVIMHDGLVRMYGDAQENIYYYLTTLNENYHMP
AMPQGAEEGIRKGIYKLETLEGKNGKVQLLGSGSILRHVREAAQILANDY
GVGSDTYSVTSFTELARDGQDCERWNMLHPTETPRVPYIAQVMNEAPAVA
STDYMKLFAEQVRTFVPASDYRVLGTDGFGRSDSRENLRHHFEVDASYVV
VAALGELAKRGEIEASVVADAIKKFNINPEKVNPRLA
>YE2680 argG, argininosuccinate synthase
MTTILKHLPINQRVGIAFSGGLDTSAALLWMQKKGAIPYAYTANLGQPDE
EDYDAIPRKAMEYGAEKARLIDCRKQLVAEGIAAIQCGAFHNTTAGVTYF
NTTPLGRAVTGTMLVAAMKEDGVNIWGDGSTYKGNDIERFYRYGLLTNAE
LKIYKPWLDTDFIDELGGRHEMSEFMIQSGFDYKMSTEKAYSTDSNMLGA
THEAKDLEFLNSSVKIVNPIMGVKFWDENVVVKAEEVSVRFERGYPVALN
GVVFDDSVELMMEANRIGGRHGLGMSDQIENRIIEAKSRGIYEAPGMALL
HIAYERLLTGIHNEDTIEQYHANGRVLGRLLYQGRWFDPQALMLRDSAQR
WVASEITGEVTLELRRGNDYSILNTVSENLTYQPERLTMEKGDSVFSPDD
RIGQLTMRNLDITDTRKKLFSYATTGLLSASAEVGLPQVDNNNLTARGLQ
DKSK
>YE0617 b0027, lipoprotein signal peptidase
MSKPICSTGLRWLWLAVLVVIVDLSSKQWVMTHFALYESVPLIPFFNLTY
AQNFGAAFSFLADKSGWQRWFFAGIAIGISVLLMVLMYRSTAKQRLLNCA
YALIIGGALGNLFDRMVHGAVIDFIDFHVNNWHFPTFNIADTAICIGAAL
VIFEGFISPAEKTAMNKGE
>YE0364 b4154, fumarate reductase flavoprotein subunit
MQTFNADLAIIGAGGAGLRAAIAAAEANPQLKIALISKVYPMRSHTVAAE
GGSAAVTQDHDTFDYHFHDTVAGGDWLCEQDVVDHFVHSCPEEMAQLEIW
GCPWSRKPDGSVNVRRFGGMKIERTWFAADKTGFHMLHTLFQTSLKYPQI
QRFDEHFVLDILVDDGQARGVVAMNMMEGTRVQIRANAVVMATGGAGRVY
RYNTNGGIVTGDGMGMAFHHGVPLRDMEFVQYHPTGLPGSGILMTEGCRG
EGGILVNKDGYRYLQDYGMGPETPLGEPKNKYMELGPRDKVSQAFWHEWR
AGRTIATPRGDVVYLDLRHLGEKKLLERLPFICELAKAYVGVDPVKEPIP
VRPTAHYTMGGIETNQQCETRIKGLFAVGECSSVGLHGANRLGSNSLAEL
VVFGRLAGEQAALRAMESTPANGSALDAQARDVETRLSNLMKQEGTENWS
KIRDEMGLSMEEGCGIYRTPELMQKTVDKLAELKERFKRVKITDNSSVFN
TDLLYTIELGYGLDVAECMAHSALNRRESRGAHQRLDEGCTERDDVNFLK
HTLAFHTPGGAPRLEYSDVKITKLAPAKRVYGGEATAQDAKDAKDAKDLK
DKEQAND
>YE0376 b4171, tRNA delta(2)-isopentenylpyrophosphate transferase
MNDIENLDQPPAIFIMGPTASGKTALSIALRQRLPVELVSVDSALIYRGM
DIGTAKPSAEEQALAPHRLIDIRDPAESYSAADFRKDALKEMADITAAGR
IPLLVGGTMLYFKALLDGLSPLPSADPQVRQRIEQQAAELGWEALHQQLA
EIDPVAAARIHPNDPQRLSRALEVFFISGKTLTELTKISGETLPYQVHQF
AIAPVSRELLHQRIELRFHQMLDAGFETEARALFDRGDLHTDMPAIRCVG
YRQMWSYLSGEIDYDEMVYRGICATRQLAKRQMTWLRGWGSVQWLDSDKP
GEALDSVIQVVSA
>YE3179 brnQ, branched-chain amino acid transport system II carrier protein
MSHRLSSKDIMALGFMTFALFVGAGNIIFPPMVGLQSGEHVWWAALGFLI
TAVGLPVITVIALARVGGGIDALSTPIGRGAGLVLATVCYLAVGPLFATP
RTATVSFEVGIAPLTGDGPLPLFIYSVVYFALVIGISLYPGRLLDTVGHI
LAPLKILALAILGIAALIWPAGPLIPATDAYQNAAFSSGFVNGYLTMDTL
GALVFGIVIVNAARSRGVVSAGLLTRYTIWAGLIAGIGLTLVYLSLFKLG
SSSGGLVPDAQNGAVVLHAYVQHTFGGLGSVFLAALIFIACMVTAVGLTC
ACAEFFAQYLPLSYRALVFILGIFSMMVSNLGLSHLIQISIPVLTAIYPP
CIVLVLMSFTLRWWHHAPRIVAPVMLVSLLFGILDAVKASTFAQYLPEWT
QHLPLAEQGLAWLSPSLLVFVVVGLYDRLCCRQVVAAKQ
>YE2350 cI, repressor protein CI
MKTTLAERLNIAMQLRGNMTQGALAKASGISQPTIWRLIKGEAKGTKKLV
DIANALNVNAEWLANGVGEMEGNNPTPRVDRIDNNSYVPVWTVAGQTNDS
VVAPDGKVTPSWRAYILDRNSGCSEAPAGSIVIVDTSLKPGTNDLVVAIN
GGSASVYRFLDGGSNGYLSVDDSRIPLVDLSLSAELVGVAIFILRDLRR
>YE1405 cheD2, putative methyl-accepting chemotaxis protein
MLDSIRSRILAACVIIVAGSLAINTYFNYSVANKYNSSAIDNTLKAVTAS
HGVGIADWVAMKTQMIVSLKESALAADPIAALRQVAAAGNFINVYIGYAN
KTAVFSNPDGIPAGYDPTGRPWYLQAVKAGSPVVTPPYIDAGTNQLVVTF
ALPIIQDGSVKGVLAADVTMDSVIANVKSIHPTDDSFGMLIDADGTIIAH
PDSQLTLKPLSEIAPTLDLKTLLTAISPTAAEIGGSTKLLLAQAVPGTQW
FTVVALDKAHATAGMRSLLTTSLVTLIVIILIAVVIISLITQRALTPLTH
VHKAMDAISSGTEDLTQRLPVEGHDEVAKIALSFNNFADKLSGVMAQIRN
TSESVSIAANEIAAGNQDLSGRTESAAASLQQTSAALEQISATVAQSASA
ARQANTAVLSAANDASRGGEVIAKVITTMESIEAASGKIGDITSVIDGIA
FQTNILALNAAVEAARAGEQGRGFAVVAGEVRTLAQRSAQAAKEIKTLID
STVSSVASGSGQVRQASNTMTEIVSSVSDVTTIMSEITNAADEQMRGIHE
INSAVTQLDTMVQQNAALVQESTAASAALQAQAADLTSAVNQFRI
>YE0085 cpxA, two component sensor kinase
MINSLTARIFAIFWFTLALVLMLVLMVPKLDSRQMTTLLDSEQRQGTMLE
QHIEAELASDPANDLMWWRRLYRAIEKWAPPGQHLVLVTTEGRVIGAQRH
EMQMVRNFIGQSDNSDQPKKKKYGRVEMVGPFSIRDGEDNYQLYLLRPAS
SPQSDFINLMFDRPLLLLIATMLISAPLLLWLAWSLAKPARKLKNAADDV
ARGNLKQHPELESGPQEFLATGASFNQMISALDRMVVAQQRLISDISHEL
RTPLTRLQLATALMRRRHGEGKELERIEMEAQRLDSMINDLLVLSRSQHK
NELHREPIKANELWSEVLENAQFEADQMGKTLEVTAPPGPWTLFGNPAAL
DSALENIVRNALRYSHHHIAVAFSSDNQGITITVDDDGPGVSPEDREQIF
RPFYRTDEARDRESGGTGLGLAIVETAVNQHRGWVRAEDSPLGGLRLIIW
LPLHPLKV
>YE3060 cueR, putative regulatory protein
MNISDVAKKTGLTSKAIRFYEEKKLVTPPIRKDNGYRSYSAKHIEELTLL
RQARQVGFTLDECRELLALFHDPARHSADVKAATLQKVAEIEKHINDLNQ
MRLRLLALADECPGDDGADCPIINNLAGCCHGSEKQKVG
>YE1952 cycA, D-serine/D-alanine/glycine transporter
MGKMVDQSKIVAEPLPEPEEHLQRSLSNRHIQLIAIGGAIGTGLFMGSGK
TISLAGPSIIFVYMIIGFMLFFVMRAMGELLLSNLKYKSFSDFAADLLGP
WAGFFTGWTYWFCWVITGIADVVAITAYAQFWFPGFSQWVASLLVVLLLL
SLNLATVKMFGEMEFWFAMIKIVAIVALIFAGLTMVLMSYQSPSGTTASF
THLWNDGGMFPKGISGFFAGFQIAVFAFVGIELVGTTAAETKDPEVVLPR
AINSIPIRIIMFYVFSLIMIMSVTPWSSVVADKSPFVELFVLVGLPAAAS
VINFVVLTSAASSANSGVFSTSRMLFGLAKEGDAPKQFGKLSRRSVPASG
LTFSCICLLGGVVLIYLIPNVMTVFTLVTTVSAILFMFVWTIILCSYLVY
RKRRPALHKKSIYKMPAGIFMSWVCMAFFAFVLVLLTLESDTRQALMVTP
LWFVILTIGYIILKKRRTRLFDSNNN
>YE3683 dnaG, DNA primase
MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNYHACCPFHHEKTPSF
TVNGEKQFYHCFGCGAHGNAVDFLMNYDRLEFVESIEELATMHGLEVPYE
AGTGTTQLERHQRQSLYQLMESLSAFYQQSLKGQNANQAREYLKHRGLSE
EIIQHFAIGFAPPGWDNALKRFGRDGESRTALNDAGMLVTNDTGRVYDRF
RERVMFPIRDKRGRVIAFGGRILGDGVPKYLNSPETEIFHKGRQLYGLYE
AQVSHPNPTRLLVVEGYMDVVALAQFGIDYAVASLGTATTAEHIQLLFRA
TDNVICCYDGDRAGRDAAWRALETALPYLNDGRQLRFMFLPDGEDPDTLV
RKEGKDAFEQRMDEAQPLSTFLFETLMPQVDLSSPDGRAKLSTLALPLIS
QVPGETLRLYLRQQLGNKLGLLDDSQLDKLMPKQAENANTYQPPQLKRTT
MRILIGLLVQNPQLATLIPSLKGVEQAKLAGLPLFIELVETCLAQPGLTT
GQLLELYRDNKFSQQLETLATWNHMIVEDMVEQTFVDTLTSLYDSILEQR
QETLIARDRTHGLNAEERKELWSLNLALARKK
>YE1371 elaB, hypothetical protein
MNRDKEQQTSLDDDLTMLTDTLEEVLRASGDAADESYQEIKARAEKALKE
VQNRLSGRSECYIKRAKVLACCTDDYVREKPWCSVGISATVGLVVGLLLA
RR
>YE1790 exoX, putative exodeoxyribonuclease
MYFRVIDTETYGLEGGIVEIASIDVMDGALSNPMSDLVSPDRPISLDAMV
IHHITEEMVEGKPRIAVAVRKYQGSPYYVAHNAPFDRGVLPEMGGQWICT
LKLARMLYPDIKHSNQYLRYALRLNVSVPDNLYPHRALYDCYVTAALLQR
IMRDSGWSAEQMAEITQQPQLLSTFKFGKYRGKSIEQIARQDPDYLRWML
ASITDLTPDMRHTLTYYLAE
>YE1298 flk, putative flagellar assembly regulatory protein,Flk
MQPLNGPGVPIANDRNVAPTKLPSAGQVEERALTPAQRTTLEKLIVRIMA
LSPIKSAEIWAGLRHDLSLSSTSDLLARHFQPAEQLLQTRLSQAQENHAN
HQLRQQLTELLPQGNNRQAVSDFIRQQFGHTVLSQLSHAELQQVLVLLQS
GTLNIPQPQLTTITDRPLLPAEHQHIQSLVAKLSAATGEQPAKIWQALFD
MVGVKSNDPLPARHFQILSQFLQAKVALSQQTAPTLINLQTALKQPADAQ
EQQLLIDYSLNRFQASPTTPLTQAQLNDIINVLFTARLDRANAAQRLAED
QKTLQPLINPLIAALPQSLQPLLQKPSLAFVALIIVMAFLLAIFI
>YE0428 ftsH, cell division protein
MAKNLILWLVIAVVLMSVFQSFGPSESNGRRVDYSTFMSDVTQDQVREAR
INGREINVSKKDNSKYTTFIPVNDPKLLDTLLTKNVKVVGEPPEEPSLLA
SIFISWFPMLLLIGVWIFFMRQMQGGGGKGAMSFGKSKARMLTEDQIKTS
FADVAGCDEAKEEVSELVEYLREPSRFQKLGGKIPKGVLMVGPPGTGKTL
LAKAIAGEAKVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKAAPCIIFI
DEIDAVGRQRGAGLGGGHDEREQTLNQMLVEMDGFEGNEGIIVIAATNRP
DVLDPALLRPGRFDRQVVVGLPDVRGREQILKVHMRRVPLDIDIDASVIA
RGTPGFSGADLANLVNEAALFAARGNKRVVSMVEFEKAKDKIMMGAERRS
MVMTEAQKESTAYHEAGHAIIGRLVPEHDPVHKVTIIPRGRALGVTFFLP
EGDAISASRQKLESQISTLYGGRLAEEIIYGPEKVSTGASNDIKVATSIA
RNMVTQWGFSEKLGPLLYAEEEGEVFLGRSVAKAKHMSDETARIIDQEVK
LLIERNYQRARKLLLENMDVLHSMKDALMKYETIDAPQIDDLMNRKEVRP
PAGWDDANKTKSTDNDSTPKAPTPVDEPHTPTPGNTMSEQLGDK
>YE2845 glnH, putative glutamine-binding periplasmic protein
MKSLVKVSLAALALAFAVSSHAAEKELIVATDTAFVPFEFKQGDKYVGFD
IDLWDAIAKKLDLKYTLKPMDFSGIIPALQTKNVDLALAGITITDERKKA
VDFSDGYYNSGLLVMVKANNNDIKGEADLAGKVLAVKSGTGSVDYAKANI
KTKDLRQFPNIDNAYLELGTGNADAVLHDTPNILYFIKTAGNGQFKAVGD
SIKAQQYGVAFPQGSDLREKVNAALKSLKEDGTYAAIYKKWFGVEPK
>YE2993 gltK, putative glutamate/aspartate transport system permease
MYEFDWSSIAPSLPYLLQGLAVTAKITLIAIVFGIVWGTVLAVMRLSPVK
AISWFATAYVNVFRSIPLVMVLLWFYLVVPSLLQNVLGLSPKTDIRLISA
MVAFSLFEAAYYSEIIRAGIQSISRGQSSAALALGMTQGQSMRLVILPQA
FRAMVPLLLTQGIVLFQDTSLVYVLSLADFFRTATTIGERDGTQVEMVLF
AGLVYFVISFSASMLVNYLKKRTV
>YE1677 gpO, putative phage capsid scaffolding protein
MPKLSKFFRVAVEGATTDGRIINRQDLLDIALSYDPKVYGARVDLEHYKS
PYPDSVFHCYGDITAVKTEEIAEGALKGKLALFAQIDPTDELLTLNKGRQ
KVYSSIQFDPNFATSGRAYLKGLALTDDPASLGTELLQFCAKQVAESKPN
PLAGRKQSPDCLFTALEETFIEFEEVQPADDTSKKFTAKIKELLFGAEKK
TDGNLDDIRQAVQVIAESQKTVLETQQQFTASRQEVTDLKDQLSQLSTSF
ASLTTQLQSEDSQHTSRPPAKGGPEGSTDDTIDC
>YE3603a hybF, hydrogenase expression/formation protein
MHEISLCLSTLELIEKQARLNGATRITAVWLEIGALSCIEESALRFSFEA
ASRKTLAENCQLHLSYMPAVAWCWECSNSVAIERHDAGCPHCGSHALQVE
SGSNLQVKQIEVE
>YE2618 irp1, yersiniabactin biosynthetic protein
MDNLRFSSAPTADSIDASIAQHYPDCEPVAVIGYACHFPESPDGETFWQN
LLEGRECSRRFTREELLAVGLDAAIIDDPHYVNIGTVLDNADCFDATLFG
YSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYP
GREALNVTEVAQVKGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSS
SLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPGMIFSPDGHCR
PFDASAEGTWAGNGLGCVVLRRLRDALLSGDPIISVILSSAVNNDGNRKV
GYTAPSVAGQQAVIEEALMLAAIDDRQVGYIETHGTGTPLGDAIEIEALR
NVYAPRPQDQRCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPLLN
FHTPNPALKLEESPFTIPMSAQAWQDEMRYAGVSSFGIGGTNCHMIVASL
PDALNARLPNTDSGRKSTALLLSAASDSALRRLATDYAGALRENTDASDL
AFTALHARRLDLPFRLAAPLNRETAAALSDWAGEKSGALVYSGHGASGKQ
VWLFTGQGSHWRTMGQTMYQHSTAFADMLDRCFSACSEMLTPSLREAMFN
PDSAQLDNMAWAQPAIVAFEIAMAAHWHAEGLKPDFAIGHSVGEFAAAVV
CGHYTIEQVMPLVCRRGALMQQCASGAMVAVFADEDTLMPLARQFELDLA
ANNGTQHTVFSGPEARLAVFCTTLSQHNINYRRLSVTGAAHSALLEPILD
RFQDACAGLHAEPGQIPIISTLTADVIDESTLNQADYWRRHMRQPVRFIQ
SIQMAHQLGARVFLEMGPDAQLVASGQREYRDNAYWIASARRNKEASDVL
NQALLQLYAAGVALPWTDLLAGDGQRIAAPCYPFDTERYWKERVSPACEP
ADAALSAGLEVASRAATALDLPRLEALKQCATRLHAIYVDQLVQRCTGDA
IENGVDAITIIRRGRLLPRYQQLLQRLLNNCVVDGDYRCTDGRYVRAHPI
EHQQRESLLTELAGYCEGFQAIPDTIARAGDRLYDMMSGAEEPVAIIFPQ
SASDGVEVLYQEFSFGRYFNQIAAGVLRGIVQTRQPRQSLRILEVGGGTG
GTTAWLLPELNGVPALEYHFTDISALFTRRAQQKFADYDFVKYSELDLEK
EAQSQGFQAQSYDLIVAANVIHATRHIGRTLDNLRPLLKPGGRLLMREIT
QPMRLFDFVFGPLVLPLQDLDAREGELFLTTAQWQQQCRHAGFSKVAWLP
QDGSPTAGMSEHIILATLPGQAVSAVTFTAPSEPVLGQALTDNGDYLADW
SDCAGQPERFNARWQEAWRLLSQRHGDALPVEPPLVAAPEWLGEVRLSWQ
NEAFSRGQMHVEARHPDGEWLPLSPAAPLPAPQTHYQWRWTPLNVASVDH
PLTFSAGTLARSDELAQYGIIHDPHASSRLMIVEESEDTLALAEKVIAAL
IASAAGLIVVTRRAWRVEENEALSASHHALWALLRVAANEQPERLIAAID
LAENTPWETLHQGLSAVSLSQRWLAARGNTLWLPSLALNTGCAAELPANV
FTGDNRWHLVTGAFGGLGRLAVNWLREKGARRIALLAQRVDESWLRDVEG
GQTRVCRCDVGDAGQLATVLDDLAANGGIAGAIHAAGVLADAPLQELDDH
QLATVFAVKAQAANQLLQTLRNHDGRYLILYSSAAATLGAPGQSAHALAC
GYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLATLASRGMGALS
DAEGCWHLEQAVMRGAPWRLAMRVFTDKMPPLQQALFNISATEKAATPVI
PPADDNAFNGSLSDETAVIAWLKKRIAVQLRLSDPASLRPNQDLLQLGMD
SLLFLELSSDIQHYLGVRINAERAWQDLSPHGLTQLICSKPETTPAASQP
EVLQHDADKRYAPFPLTPIQHAYWLGRTHLIGYGGVACHVLFEWDKRHDE
FDLAILEKAWNQLIARHDMLRMVVDADGQQRVLGTTPEYHIQRDDLRALS
PEEQRIALEKRRHEMSYRVLPADQWPLFELVVSEIDDCHYRLHMNLDLLQ
FDVQSFKVMMDDLAQVWRGETLAPLAITFRDYVMAEQARRQTSAWHDAWD
YWQGKLPQLPLAPELPVVETRPETPHFTTFKSTIGKTEWQAVKQRWQQQG
VTPSAALLTLFAATLERWSRTTAFTLNLTFFNRQPIHPQINQLIGDFTSV
TLVDFNFSTLVTLQEQMQQTQQRLWQNMAHSEMNGVEVIRELGRLRGSQR
QPLMPVVFTSMLGMTLEGMTIDQAMSHLFGEPCYVFTQTPQVWLDHQVME
SDGELMFSWYCMDNVLEPGAAEAMFNDYCAILQAVIAAPESLKTLASGIA
GHIPRRRWPLNAQADYDLRDIEQATLEYPGIRQARAEITEQGALTLDIVM
ADDPSPSAATPDEHELTQLALSLPEQAQLDELEATWRWLEARALQGIAAT
LNRHGLFTTPEIAHRFSAIVQALSAQASHQRLLRQWLQCLTERAWLIREG
ESWRCRVPLSEIPEPQEACPPSQWSQALAQYLETCIARHDALFSGQCSPL
ELLFNEQHRVTDALYRDNPASACLNRYTAQIAALCGAERILEVGAGTAAT
TAPVLKATRNTRKSYHFTDVSAQFLNDARARFHDESRVSYALFDINQPLD
FTAHPEAGYDLIVAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERN
SVFQLASVGFIEGLSGYRDFRRRDEKPMLTRSAWQEVLVQAGFANELAWP
AQESSPLRQHLLVAHSPGVNRPDKEAVSRYLQQRFGTGLPVLQIRQREAL
FTPLHAPSDALIEPAKPTPVAGGNPALEKQVAELWQSLLSRPVARHHDFF
ELGGDSLMATRMVAQLNRRGIARANLQDLFSHSTLSDFCAHLQAATSGED
NPIPLCQGDGDETLFVFHASDGDISAWLPLASALNRRVFGLQAKSPQRFA
TLDQMIDEYVGCIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYAKGEQVRI
ALIDPVCRQDFCCENRAALLRLLAEGQTPLALPEHFDQQTPDSQLADFIS
LAKTAGMVSQNLTLQAAETWLDNIAHLLRLLTEHTPGENVPVPCLMVYAA
GRPARWTPAETEWQGWINNADDAVIEASHWQIMMEAPHVQVCAQHITRWL
CATSTQPENTL
>YE2617 irp2, yersiniabactin biosynthetic protein
MISGAPSKDSLLPDNRHAADYQQLRERLIQELNLTPQQLHDESNLIQAGL
DSIRLMRWLHWFRKNGYRLTLRELYAAPTLAAWNQLMLSRSPENAEEETL
PDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCL
TASQLEQAITTLLQRHPMLHIAFRPDGQQVWLPQPYWNGVTVHDLRHNDA
ESRQAYLDALRQRLSHRLLRVEIGETFDFQLTLLPDNRHRLHVNIDLLIM
DASSFTLFFDELNALLAGESLSAIDTRYDFRSYLLHQQKINQPLRDDARA
YWLAKASTLPPAPVLPLVCEPATLREVRNTRRRMIVPATRWHAFSNRAGE
YGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLHPAVGAMLADFT
NILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYP
HGAPVVFTSNLGRSLYSSRAESPLGEPEWGISQTPQVWIDHLAFEHHGEV
WLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKPFADMMPASQR
AIRERVNATGAPIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYHELTDY
ARRCAGRLVECGVQPGDNVAITMSKGAGQLVAVLAVLLAGAVYVPVSLDQ
PAARREKIYADASVRLVLICQHDASAGSDDIPVLAWQQAIEAEPIVNPVV
RAPTQPAYIIYTSGSTGTPKGVVISHRGALNTCCDINTRYQVGPHDRVLA
LSALHFDLSVYDIFGVLRAGGALVMVMENQRRDPHAWCELIQRHQVTLWN
SVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQ
FIAMGGATEASIWSNACEIHDVPAHWRSIPYGFPLTNQRYRVVDERGRDC
PDWVSGELWIGGIGVAEGYFNDSLRSEQQFLTLPDERWYRTGDLGCYWPD
GTIEFLGRRDKQVKVGGYRIELGEIESALSQLAGVKQATVLAIGEKEKTL
AAYVVPQSEAFCVTDHRNPALPKAWHTLAGTLPCCAISPEISAEQVADFL
QHRLLKLKPGHTAGADPIPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPA
AEGYQVCAGEEREDEHPHFSGHDLTLSQILRGARNELSLLNDAQWSPESL
AFNHPASAPYIQELATICQQLAQRLQRPVRLLEVGTRTGRAAESLLAQLN
AGQIEYVGLEQSQEMLLSARQRLASWPGARLSPWNADTLAAHAHSGDIIW
LNNALHRLLPEDPGLLATLQQLAVPGALLYVMEFRQLTPSALLSTLLLTN
GQPEALLHNSADWAALFSAAAFNCQHSDEVAGLQRFLVQCPDRQVRRDPR
QLQAALAGRLPGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPKAENQA
EADLPQGDIEKQVAALWQQLLSTGNVTRETDFFQQGGDSLLATRLTGQLH
QAGYEAQLSDLFNHPRLADFAATLRKIDVPVEQPFVHSPEERYQPFALTD
VQQAYLVGRQPGFTLGGVGSHFFVEFEIADLDLTRLETVWNRLIARHDML
RAVVLDGQQQVLEQTPPWVIPTHTLHTPEEALRVREKLAHQVLNPEVWPV
FDLQVGYVDGMPARLWLCLDNLLLDGLSMQILLAELEHGYRYPQQLLPPL
PVTFRDYLQQPSLQSPNPDSLAWWQAQLDDIPPAPALPLRCLPQEVETPR
FARLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPEFTL
NLTLFDRRPLHPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQRLSQNL
NHRDVSAIRVMRQLAQRQNVPAVPMPVVFTSALGFEQDNFLARRNLLKPV
WGISQTPQVWLDHQVYESEGELRFNWDFVAALFPAGQVERQFEQYCALLN
RMAEDESSWQLPLAALVPPVKHAGQCAERPPRVCPEHSQPHIAADESTVS
LICDAFREVVGESVTPAENFFEAGATSLNLVQLHVLLQRHEFSTLTLLDL
FTHPSPVALADYLAGVATVEKTKRPRPVRRRQRRI
>YE0069 kdtA, 3-deoxy-D-manno-octulosonic-acid transferase
MLLRLYQVLLYLIQPLIWLRLLLRSRKAPAYRKRWGERYGFCAGKVVAGG
IMLHSVSVGETLAAIPLVRALRHRYPSLPITVTTMTPTGSERVQSAFGKD
VHHVYLPYDLPGSVNRFLDQVNPKLVIIMETELWPNLINALHRRKIPLVI
ANARLSARSAAGYKKIGSFIRNMLQRITLIAAQNQEDGDRFIELGLRRSQ
LTVTGSLKFDISVTPELAARAVTLRRQWAPHRPVWIATSTHDGEETILLE
AHRQLLQQFPTLLLILVPRHPERFPKAIELTQKAGLSYTLRSKGEVPSSS
TQVVIGDTMGELMLLYGIADLAFVGGSLVERGGHNPLEAAAHAIPVLMGP
HTFNFKDICAKLEQAEGLITVTDTLSLVKEITQLLTDEDCRLYYGRHAVD
VLHENQGALQRLLHLLEPYLPQRSH
>YE0233 livG, high-affinity branched-chain amino acid transport, ATP-binding protein
MSTQPLLAVEGLSMRFGGLLAVNNVGLNLNQGEIVSLIGPNGAGKTTIFN
CLTGFYRPTGGTIKLRDRHIEGLPGQVIARMGVIRTFQHVRLFREMTVVE
NLLVAQHQHLKSGVFAGLLKTPGFRRAEADALERAATWLERVGLLELANR
QAGNLAYGQQRRLEIARCMVTRPELLMLDEPAAGLNPKETDELNQLIMEL
RDQHQVSVLLIEHDMKLVMGISDRIYVVNQGTPLAQGSPIEIRNNPDVIR
AYLGE
>YE1671 lysB, phage lysin
MPIFNAASLVWPIVGALLVISGVQTHRLAESRQTLIDQQAADSASKSGQL
IALALTANANNQAQAQLRQQVASADQLLAQRNSQIKRLYRENETLRRWAD
TPLPDDIIRLRRRPAITGAADYRQWLSESHILPVSSNRAAN
>YE1377 menB, naphthoate synthase
MLYPSEEQLYAAIEWQDCSAGFEDIRYHKSSDGIAKITINRPHVRNAFRP
LTVKEMIQALADARYDDNIGVIILTGEGEKAFCSGGDQKVRGDYGGYQDA
SGTHHLNVLDFQRQIRTCPKPVVAMVAGYSIGGGHVLHMMCDLTVAADNA
IFGQTGPKVGSFDGGWGAAYMARIVGQKKAREIWFLCRQYDAKQALDMGL
VNTVVPLASLEKETVRWCREMLQNSPMALRCLKAALNADCDGQAGLQELA
GNATMLFYMTDEGQEGRNAFNEKRQPDFSKFKRNP
>YE3003 mrdB, rod shape-determining protein
MTDNQQKGSLWYKMHIDLPFLLCVLALLAYSAFVMWSASGQDMGMMERKV
GQIAMGLVVMLVMAQIPPRVYESWAPYLYFVCVILLVLVDAFGQISKGAQ
RWLDLGFIRFQPSEIAKIAVPLMVARFMNRDVCPPSLKNTGIALILIFMP
TLLVAAQPDLGTSILIAASGLFVLFLSGMSWRLIAIAAILVAAFIPILWF
FLMHGYQRDRVMMLLDPESDPLGAGYHIIQSKIAIGSGGLSGKGWLHGTQ
SQLEFLPERHTDFIFAVLAEELGLIGVLVLLALYLCLIMRGLVIAAHAQT
TFGRVMVGGLMLILFVYVFVNIGMVSGILPVVGVPLPLVSYGGSALIVLM
AGFGIVMSIHTHRKMLSKNL
>YE1153 narP, nitrate/nitrite response regulator protein NarP
MTKSHTIMIVDDHPLMRRGIKQLLELDSNFDVVAEANCGSDAIIEAAKCQ
PDVILLDLNMKGMSGLDTLKALRNEGIDARIIVLTVSDARSDVYAMIDAG
ADGYLLKDSEPEILLENIRLASKGENVFSDAVTQYLSSRDEQVNPFSELT
ERELDVLQEVARGMSNKQVAFELHISEETVKVHIRNLLRKLNVRSRVAAT
IMYLENKQY
>YE2923 nuc, DNA/RNA non-specific endonuclease
MKFNLIKLLPVLLLTACTTTDHSTAPKTTPSAPTVTEQNLPAAAIDNCLV
GCPTGGSDQTIIRDVYTLNNNSHTKFANWVAYKVTKSSQASNHPRKWAQD
PDLPASDTLAPADYTGANQKLAVDRGHQAPLSLLAGNNDSQALNYLSNIT
PQKAALNQGAWVRLEDKERTLANRQDVTAVYSVTGPLFERNIGTLPAKPS
VAFPSGYWKIIFIGTSPDKGQYAAFLMDQNTAKSANFCDYQVTVDTIEAK
TNPQLTIWSNLPAEVAQIIKSQKGTLAQTIGCN
>YE3349 outD, general secretion pathway protein D
MYQNNINIICISIWFLFKFVILAAIFPIVGYAENFSASFRDADIKEFINT
VSKNINKTIIIDPKVQGLVSVRSYELLDEEKYYQFFLNVLDVYGYTVVEM
PNNILKVIPAKRAKGSVVPLQNNAAPPQGDELINRVFKLKHLLAKNLAPL
LRQLNDNSESGSIVNYDPSNVILITGRAAVVNRLYAIISTLDQPGETEVE
LYQLNHAVAADIIKLVNQVINPVNITAKQESFNAATVVADDRTNSVIISG
DRHIRKKTLQMIKRLDHQQDSYGSTKVVYMKYAQASKLLDVLNGVSQGSQ
SDKTKKNSGKSHIKNVSIKAYDQTNALVITADPKVMKELDQVIERLDVRR
AQVLVEAIIVETQNGEGLNLGIQWANKLYGGANFLQKPNSIQTNNNSGNP
IPMMIAGLTAGFYKGNWDGLFTALATNSNNNILATPSIVTLDNMEAEFNV
GQEVPVLTSTQTTATDKVYNSISRQSVGVMLKVKPQINKGDSVLLEIRQE
VSSVADSSDVNANNLGSVFNKRVVNNAVLVKSGETVVVGGLLDKKINKII
NKVPLLGDIPFIGGLFRQSKEKIEKSNLILFIRPTILRETSDYSQVTVDK
YAEYNNLHSINSGMERPIDIVSDRINYDAFNTLKSDIIKFYEMVEIKI
>YE2740 pduO, putative propanediol utilization protein: B12 related
MSIYTKTGDAGTTALFTGQRVKKSHPRVETYGTLDELNAALSLCARVAQG
EENLQLLDAIQHQLFYFSAELASEGIETPPCGRKSISEQDIQALEQAVDR
CIAQLPPVQGFILPGNTEAGSRLHFARTLARRCERRLIELAEQVPVRPVL
LQYLNRLSDCLYALARDEDQRQTLQQTAHTVVARYLAATTEKPVATTQPT
SAGLGFSDVHQLVKLAVEAAMTLQITVVVALADRHGNMIMTYRMPDTLLV
SSELAPKKAWTAVALKTATHQLSAAIQPGADLFQLEASTGGKVVSFGGGY
PLWRDGQLVGGLGISGGSVEQDMYIAETAISALHLRNE
>YE1718 phoP, response regulator protein
MRVLVVEDNALLRHHLTVQMREMGHQVDAAEDAKEADYFLQEHAPDIAII
DLGLPGEDGLSLIRRWRSHQTNLPILVLTARESWQDKVAVLEAGADDYVT
KPFHLEEVIARIQALMRRNIGLASQIIEFPPFQIDLSRRELCVNQQQIKL
TAFEYTIIETLIRNAGKVVSKDTLMLQLYPDAELRESHTIDVLMGRLRKK
LLAEHEGEVITTIRGQGYRFDAN
>YE0438 pnp, polyribonucleotide nucleotidyltransferase
MLTPIIRKFQYGQHTVTIETGMMARQATAAVMVSMDDTAVFVTVVGQKKA
KPGQSFFPLTVNYQERTYAAGRIPGSFFRREGRPSEGETLTSRLIDRPIR
PLFPDSFLNEVQVIATVVSVNPQINPDIVALIGASAALSLSGIPFNGPIG
AARVGFINDQYVLNPTTDELKESRLDLVVAGTAGAVLMVESEADILSEDQ
MLGAVVFGHEQQQVVIENINALVAEAGKPKWDWHAEPVNEALHARVAELA
AARLGDAYRITEKQERYTQVDAIKADVTEALLAQDDTLDAAEIQDILGSV
EKDVVRSRVLRGEPRIDGREKDMIRGLDVRTGVLPRTHGSALFTRGETQA
LVTATLGTARDAQNIDELMGERTDSFLLHYNFPPYSVGETGMVGSPKRRE
IGHGRLAKRGVLAVMPSPSEFPYTVRVVSEITESNGSSSMASVCGASLAL
MDAGVPIKAAVAGIAMGLVKEDENFVVLSDILGDEDHLGDMDFKVAGSRD
GITALQMDIKIEGITREIMQVALNQAKGARLHILGVMEQAISTPRGDISE
FAPRIYTMKINPEKIKDVIGKGGSVIRALTDETGTTIEIEDDGTIKIAAT
DGDKAKHAIRRIEEITAEIEVNRIYAGKVTRIVDFGAFVAIGGGKEGLVH
ISQIADKRVDKVTDYLQMGQEVPVKVIEVDRQGRIRLSMKEATTPDAEAP
APEAAE
>YE1933 poaA, bifunctional PutA protein [includes: proline dehydrogenase and delta-1-pyrroline-5-carboxylate
MGSTTMGVKLDEATRDRIKAAAQRIDRTPHWLIKQAIFNYLERLENNNEL
PELPTAQISSSSETDDIMPQVIESVHQPFLDFAEQVLPQSVSRAAITAAY
RRPETEAIPMLLEQARLPEDLAQATHKLAYSIAEKLRNQKSANGRAGIVQ
GLLQEFSLSSQEGVALMCLAEALLRIPDKPTRDALIRDKISNGNWHSHLG
RSPSMFVNAATWGLLFTGHLVSTHNEAKLSSSLNRIIGKGGEPLIRKGVD
MAMRLMGEQFVTGETIAEALANARKLEDKGFRYSYDMLGEAALTEADAQA
YLLSYQQAIHAIGKASNGRGIYEGPGISIKLSALHPRYSRAQYERVMDEL
YPRLLSLTLQARQYDIGINIDAEEADRLEISLDLLEKLCFEPKLAGWNGI
GFVIQAYQKRCPFTIDAVIDMAQRSRRRLMIRLVKGAYWDSEIKRAQVDG
LEGYPVYTRKVYTDVSYLACARKLLAVPNLIYPQFATHNAHTLSAIYHLA
GQNYYPGQYEFQCLHGMGEPLYEQVVGKVADGKLNRPCRIYAPVGTHETL
LAYLVRRLLENGANTSFVNRIADATLPLDELVADPVSAVEAIAASEGQLG
LPHPRIPLPRELYGKERANSSGLDLSNEHRLASLSSALLTSASQVWRAEP
LIDAELDSGTELGNSMEHPVINPAEPTDIIGYVREATDAEVSRALDAAAS
AGAIWFATPPVERAAILVRAAELMENQMQTLMGILVREAGKTFNNAIAEV
REAVDFLHYYAGMVRDNFSNDSHRPLGPVVCISPWNFPLAIFTGQIAAAL
AAGNSVLAKPAEQTPLVAAQAVRILLEAGIPQGVLQLLPGRGDSVGAALV
NDARVRGVMFTGSTEVAAILQRSIAGRLDPQGRPTPLIAETGGLNAMIVD
SSALTEQVVTDVVASAFDSAGQRCSALRILCIQDDVAEHTLQMLRGAMAE
CRMGNPERLSTDIGPVIDADAKAGIERHIQAMRAKGRKVYQAAQANNMDE
KEWQRGTFIKPTLIELDSFDELQKEIFGPVLHVVRFQRQNLAALVDQINA
SGYGLTLGIHTRIDETIAQVTEKAKVGNLYVNRNMVGAVVGVQPFGGEGL
SGTGPKAGGPLYLYRLLCSRPEDAVINTLTHHDSGQLQNVSGREALQTAH
HALIKWAEEQQQPVVAQLARRYGELAQGGTVRVLPGPTGERNTYALLPRQ
RILCLADTEADALTQLAAVLAIGSEVLWPENTIQTDLCRTLPAVVKSRIT
LTKDWQTANIAFDGVIYHGDADQLRILCEQIAQIDGPIISVQGFARGETN
ILLERLLIEHSLSVNTAAAGGNASLMTIG
>YE1486 potH, putrescine transport system permease protein
MIPESTHGAAEPPTRAVGPVKALIQRFQMSHGRKLVIGIPYLWLFLLFML
PFLIVFKISLAEMARAVPPYTDLVTWLDSKLDISLNLGNYLQLLDDPLYI
DAYLQSLQVAAVSTLCCLIIGYPLAWAIAHSKSSTRNILLLLVILPSWTS
FLIRVYAWMGILKNNGILNNFLIWAGIIDQPLIILHTNLAVYIGVVYSYL
PFMVLPIYTALTRLDYSLVEAALDLGAKPFKTFVSVIVPLTKGGIVAGSM
LVFIPAVGEFVIPELLGGPDSIMIGRILWQEFFNNRDWPVASAVATVMLV
LLIVPIIWFHKHQNKAAGGAV
>YE3594 proP, proline/betaine transporter
MGLRRKSVKPMQINDITIIDDAKLKKAITAAALGNAMEWFDFGVYGFVAF
ALGQVFFPGADPGIQMIAALATFSVPFLVRPLGGLFFGALGDKYGRQKVL
SVTIIIMSVSTFCIGLIPSYASIGIWAPILLLLAKLAQGFSVGGEYTGAA
IFVAEYSPDRKRGFLGSWLDFGSIAGFVLGAGVVLLISSIVGEASFLEWG
WRIPFFVAAPLGLIGIYLRHALEETPTFQQHVDKIDNDSRHNIAQPPKVS
FREILTKQWKNLTICVGMVIATNVTYYMLLTYMPSYLSHSLHYSEDHGVL
IIIAIMLGMLFVQPVMGLMSDRFGRKPFVICGSIGLFILAIPSFILINSG
IIGLIFCGLLILAILLNCFTGVMASILPAMFPTHIRYSALAIAFNISVLV
AGLTPTAAAWLVETTQNLFMPAYYLMVVAVIGLVTGVFMKETANKPLKGA
TPAASDKAEAKEILQEHHDHIEQNIEDIDQQIAELEKKRKNLIAQHPDIN
>YE1720 purB, adenylosuccinate lyase
MELSSLTAVSPIDGRYGDKVSALRPIFSEFGLLKFRVQVEVRWLQKLAAC
AEIKEVPAFDANANAYLDKIVQEFNEQDALRIKTIERTTNHDVKAVEYFL
KEKVESVPALHAVSEFIHFACTSEDINNLSHALMLQTARQEVILPEWRKI
IDSIKALAHQYRDLPLLSRTHGQPATPSTIGKELANVAYRMERQFRQLSQ
VEILGKINGAVGNYNAHMVAYPEVDWHQFSESFVTSLGINWNPYTTQIEP
HDYIAELFDCVARFNTILIDFDRDIWGYIALNHFKQKTIAGEIGSSTMPH
KVNPIDFENSEGNLGLSNAVLGHLASKLPVSRWQRDLTDSTVLRNLGVGL
GYALIAYQATMKGISKLEVNEAHLLEELDHNWEVLAEPIQTVMRRYGIEK
PYEKLKELTRGKRVDAAGMQAFIDSLALPEAEKTRLKAMTPANYIGRATT
MVDELK
>YE2385 pykA, pyruvate kinase II
MSRRLRRTKIVTTLGPATDRDNNLEKVIAAGANVVRMNFSHGSAEDHIKR
ANDVRKIAAKLGRHVAILGDLQGPKIRVSTFKEGKVFLNIGDKFLLDANM
AKGEGDKEKVGIDYKGLPADVVPGDILLLDDGRVQLKVIEVQGMKVFTEV
TVGGPLSNNKGINKLGGGLSAEALTEKDKADIITAAKIGVDFLAVSFPRT
GEDLHYARRLARDAGCNAKIVSKVERAEAVATDEAMDDIILASDVVMVAR
GDLGVEIGDPELVGIQKKLIRRARQLNRAVITATQMMESMITNPMPTRAE
VMDVANAVLDGTDAVMLSAETAAGQYPAETVAAMAKVCLGAEKIPSINVS
KHRLDTQFDNIEEAIAMSTMYAANHLKGITAIIAMTESGRTALMMSRISS
GLPIFAMSRHEHTLNLTAIYRGVTPVFCDTHTDGVLAATEAVNSLKDKGY
LYSGDLVIVTQGDVMGTVGTTNTSRILRVE
>YE0201 rarD, hypothetical protein
MDKQRTRQGIFFALAAYFIWGIAPAYFKLIQQVPADEILTHRIIWSFFFM
LILLTVSRNWPQVRSAIKNRKRLLLLAVTAVLIASNWLLFIWAVNHNHML
EASLGYFINPLVNVLFGMLFLGERFRRMQWVAVALAFGGVLIQLWQFGSL
PVIGLGLAITFALYGLIRKKLGIDAQTGMLVETMWLLPIAAVYLFFIADS
PTSHMGANAWSLNVLLAAAGVITTIPLLFFTAAATRLRLSTLGFFQYLGP
TLMFILAVTFYGETIGNDKMVTFVFIWAALLLFTLDALYTQRKLRG
>YE3310 recB, exodeoxyribonuclease V beta chain
MTSMTPQRLEPLTLPLYGERLIEASAGTGKTFTIGVLYLRLLLGLGGEAA
FRRPLMVEEILVVTFTEAATEELRGRIRDNIHELRIACVRGVSDDPMHKD
LLAEITDLNKAAADLLAAERQMDEAAIYTIHGFCQRMLANNAFESGILFE
QTLVQDEQPLRRQACADFWRRHCYPLPLAIARAVSQEWSGPEALLKDLSA
YLQGETPKFRQAPGDDETILSRHQQIVVQIEAVKAQWRAAAPELEALISG
SGVDKRSYSARYLPGWLEKVGSWAEQETGDYQLPKELEKFRQSVLFEKTK
KGEAPQHDVFHAIDRIFEQPLTLRDLILARAISEIRISVQQEKRQRAELG
FDDLLSKLDAALQQAGGELLAQSIRTRYPVAMIDEFQDTDPQQYRIFHTL
YGGQEECGLLLIGDPKQAIYAFRGADIFTYIRARSEVSAHYTLETNWRSS
FPMVQSVNRLFSLVDVPFLFQQIPFINVAPAQKNKQLSFEIKGKKQPAMS
FWLQPGEGVGVSEYQQLMARQCAAQIRDWLTAGQKGLAQLVTATSSRPVQ
ASDITILVRSRAEAALVRDALSALAIPSVYLSNRDSVFDTAEAKDLLWLL
QAVLAPEQERALRSAMATGILGLDARMLDELNHDERAWDALVDEFDGYRQ
HWQRRGVLPVLREIMAQRHLAENLLATQGGERRLTDLLHLGELLQEAASQ
LDSEHALIRWLAQQIAQPNHQSDNQQLRLESDRHLVQVITIHKSKGLEYP
LVWLPFVGNFRQQQDVLYHDRHSFEALLDLNADEESQALAEEERLAEDLR
LLYVALTRAVYHCSIGIAPLIKGGRKKQGESDMHLSALGYLVQQGQAGDA
QYLADKLAGLAALADGEISVSQAGQLDETPWQPQQDELPALAARQFSRQI
HDFWRVTSYSGLQQHGSSKSVASQALSVLLQELLPRLDTDAVGEQTLITE
PQLTPHTFPRGAAPGTFLHDLFEPLDFSQPIEQGWLTEQLQQQGFGEQWS
SMLYQWLTDIVQTPLNETGVTLAQLTPQSKQAELQFYLPIDALLQARELD
TLIKRYDPLSRQCPVLDFQQVRGMLKGFIDLVFCWRGKYYLLDYKSNWLG
EDSSAYTHDAMAQAMAEHRYDLQYQLYTLALHRYLRHRLSHYDYQRDFGG
VIYLFLRGVDKQHPGNGIFSCRPERELIEGMDCLFSGGEVGPHISDNSGM
AITGEGTSI
>YE0204 recQ, ATP-dependent DNA helicase
MSTAAVINRELLAEQVLRDTFGYQQFRPGQQEIINATLSGQDCLVVMPTG
GGKSLCYQIPALVTDGLTLVVSPLISLMKDQVDQLLAYGVGAGCLNSSQT
REQQLAVMDGCRSGQIKLLYIAPERLVMESFLDQLHQWRPALLAVDEAHC
ISQWGHDFRPEYRALGQLKQRFPNLPVIALTATADEATRGDIVRLLNLDQ
PLIQVSSFDRPNIRYTLVEKFKPLDQLWRFVQDQRGKSGIIYCNSRAKVE
DTTARLQSRGLSVAAYHAGLDNERRAQVQEAFQRDDLQVVVATVAFGMGI
NKPNVRFVVHFDIPRTIESYYQETGRAGRDGLPAEAMLLYDPADMAWLRR
CLEEKPAGAQQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGKQQSCGNC
DICLDPPKRYDGLADAQKALSCVYRVGQRFGLGYIVEVLRGANNQRIREF
DHDKLSVYGIGREQSHEHWVSVLRQLIHLGLLSQNIAMFSALQLTEAARP
VLRAELPLQLAVPRIQSLKVRSSANQKSYGGNYDRKLFAKLRKLRKSIAD
EGNIPPYVVFNDATLLEMAEQMPITASELLSVNGVGQRKLERFGAPFMAM
IRDHVDNIHVDNNVDD
>YEP0097 repA, replication initiation protein
MDKQKFSNFSKDHSWEDIDFEALERASIEYFQEQTSFDTSKTEKKRTLRK
RGEHSTECKCPNPFFSRPEHYKPLAGELGHAYRRLTQKDKKTGKVSLRIR
ISRHPYFVWARKAVGRQRDFRPVREQLLDALFVLLVSCVDRATHIVTMNE
SRLAAELSPKDDNGNVIPATAVTVTRICRLLQELNKFGLIALPEGVQWDA
YNKQFFPKHVILTEQAWKLIGVNLDKLYAEQEERLQAEIDGVLQPGEDVS
VKTARKRWYDRMRHATLVRRRGEAIKQKRANKLGKLELNDRVYEMSNFLQ
RTLPYDELYHMTPEHFEKLVWQRLHQLDIAMAHEPPEPYH
>YE2997 rlpB, rare lipoprotein B precursor
MRHRILMLLLGLAVLVTAGCGFNLRGTTQVPPELQKLLLESSDPYGPLTR
AIRQQLRLSNVTIVDDPMRKDLPALRIIGSSENQDTVSIFRNGVTAEYQL
VLHVQAQVLIPGHDIYPIRVNIFRTFFDNPLTALAKEAEAEVLRQEMRDQ
AAQQLVRKLLVVHAAEIKNAQENGDTLTSSKRATGAAKMADVEEINIGKP
AVSTPAQ
>YE0576 smp, hypothetical protein
MAKAKLKFRLHRTAIVLICLALLVLLMQGASYFSLSHQLARSEQVEELAK
TLAKQVTFSLSPIMDTGDDDIDSQKVDAIIQQLTQSSRILDASVYQIDGT
LVANAGENVKVRDRLALDGKRPGSYFNHQIVELIPGKSGPTGFLRMTLDT
HVLATESKQVDNTTNLLRLMILVALAVGIILARTLLQGRRSRWQQSPYLL
TANKPVKEDESEEETAESDKKPDDK
>YE1553 smtA, putative methyltransferase
MQDRNFDDIAEKFSRNIYGTTKGMIRQAVVWQDITGLLAQLPQRPLRILD
AGGGEGHMACQLAALGHQVLLCDLSAEMIQRAKIAAEEKGVSHNMQFVQS
AAQDITQHLAQPVDLILFHAVLEWIAEPQQVLQILFNALNPGGALSLMFF
NANGLVMRNAVLGNFQLAIPGVKRRRQRSLSPQYPLDPPTVYGWLEQMGL
SISGKTGVRVFHDYMKNKQQQVDEFAELLALEQRYCRQEPFISLGRYVHV
MAFKPNLKDPL
>YE1124 speG, spermidine acetyltransferase
MSTTSSVRLRPLERDDLPFVHQLDNNASVMRYWFEEPYEAFVELSDLYDK
HIHDQSERRFIIESQGTKVGLVELVEINHIHRRAEFQIIIDPSHQGKGFA
GSAARLAMEYGFSVLNLYKLYLIVDKENEKAIHIYTKLGFEIEGELKQEF
FINGEYRTVIRMCIFQPQYLAKYKTPSIKSA
>YE0642 tbpA, thiamine-binding periplasmic protein
MLLSAATVAADKPTLTVYTYDSFAADWGPGPAIKKAFEAECDCQLKFVAL
EDGVSLLNRLRMEGKNSQADVILGLDNNLVQAAEQTGLFVPSQVDTSKLT
LPEKWQNNTFVPYDYGYFAFVYDKNKLKNPPKSLQELVDSKEPWKVIYQD
PRTSTPGLGLMLWMQKVYGDKAPQAWQQLAKKTVTVTKGWSEAYGLFLKG
EGDLVLSYTTSPAYHLIEEKKSNYAAADFSEGHYLQVEVAGVVASSKQPE
LAQRFMQFMVTPAFQNHIPTGNWMYPVIKMDLPAGFDTLTVPQKALQFDA
KDVADNRGKWIQAWQSAVSR
>YE0338 tdcD, propionate kinase
MFYPMKHGRSNHMTLNPTVLVINCGSSSIKFSVLTADNCEAVISGIADGI
GTEKPFLRIDRVTQFQLAKWNYSDALAAIADELDKRGLSKSISLIGHRIA
HGGEIFSESVLIDDRVVEEIKKVSPLAPLHNYANLHGVGAARQLFPGIKQ
VAVFDTGFHQTLKPEAYLYALPYRYFREQGVRRYGFHGTSYRYVVGEAAS
FLGFDGSDCGLIIAHLGNGASLCAVQDGKSVDTSMGMTPLEGLIMGTRSG
DVDYGALAYLARQTGQSLEDLDHMVNKESGLLGISGLSADMRVLENAYHE
GHEGARLAINTFVHRLARHIGGHASSLRRFDALIFTGGIGENSSLIRQLT
LEHLAVFGIDIDHAKNKRLQHGTSQIITTARSRVTAAVIPTNEEKMIALD
AIRLGYAQQGTVVLEAVI
>YE0640 thiQ, thiamine transport ATP-binding protein
MRFDLRIQPGERVAILGPSGAGKSTLLSLIAGFLAPASGRMLLNNQDYTI
TPPAQRPVSMLFQENNLFAHLTVEQNIGLGLHPGLKLNHEQRLLLGLIAQ
QVGLEACLDRLPAQLSGGQRQRAALARCLVRSQPILLLDEPFSALDPALR
NEMLQLVSQVCTNRNLTLLMVSHNLDDAARIAERTLLVVDGRIYYDGPTQ
ALVNGTAPEASVLGITAL
>YE1826 tnpB, transposase B
MIKPDVLMLHAAAKHVHAEMDATYGSRRMCIELREQGFNVGRYRVRQIMK
NLLLVAKRPGRHRYPRGGKPAVVAANLLNRQFNPETLNTWWSGDITYLHT
AQGWLYLAIVMDLCSRKIVSWAFSDKPDSDLTVRALRLAVNKRRPTGSVV
FHSDQGAQYTSAQFQSCQQELDVTGSMSRKGNCLDNAVTERFFRSLKAER
VNYRRYETRSQGIADVIDYIDSFYNLKRRHYRLGNISPDEYERRLQQCA
>YE3969 trpS, tryptophanyl-tRNA synthetase
MSKPTEIPAVSDKQKPIVFSGAQPSGELTIGNYMGALRQWVQMQDDYDCI
YCIVDLHAITARQDPALLRKRTLDTLALYLACGIDPQKSTIFVQSHVPEH
SQLGWALNCYTYFGELSRMTQFKDKSARYAENINAGLFDYPVLMAADILL
YQTNQVPVGEDQKQHLELSRDIASRFNNLYGDIFKIPEPFIPKAGARVMS
LQDPSKKMSKSDDNRNNVIELLEDPKSVVKKIKRAMTDSDEPAVIRYDTE
KKAGVSNLLDILSGVTGQSIPELEAQFEGQMYGHLKGAVAEAVSDMLSEL
QARYHQYREDEAFLQEVMREGAAKARARAQDTLAKVYEAIGFVAHP
>YE0953 ureC, urease alpha subunit
MPQISRQEYAGLFGPTTGDKIRLGDTNLFIEIEKDLRGYGEESVYGGGKS
LRDGMGANNHLTRDNGVLDLVITNVTIVDARLGVIKADVGIRDGKIAGIG
KSGNPGVMDGVTPGMVVGVSTDAISGEHLILTAAGIDSHIHLISPQQAYH
ALSNGVATFFGGGIGPTDGTNGTTVTPGPWNIRQMLRSVEGLPVNVGILG
KGNSYGRGPLLEQAIAGVVGYKVHEDWGATANALRHSLRMADEMDIQVSV
HTDSLNECGYVEDTIDAFEGRTIHTFHTEGAGGGHAPDIIRVASQPNVLP
SSTNPTLPYGVNSQAELFDMIMVCHNLNPNVPADVSFAESRVRPETIAAE
NVLHDMGVISMFSSDSQAMGRVGENWLRVMQTANAMKASRGKLPEDAPGN
DNFRVLRYVAKITINPAIAQGVSHVIGSVEVGKMADLVLWDPRFFGAKPK
MVIKGGMINWAAMGDPNASLPTPQPVFYRPMFGAMGKTMQDTCVTFVSQA
ALDDGVKEKAGLDRQVIAVKNCRTISKHDLVRNDQTPNIEVDPETFAVKV
DGVHATCEPIDTAAMNQRYFFG
>YE3079 wbcF, hypothetical protein
MIPSFSDPLNTFSTGVTFLKSILFKESVVSKNICSKLYLFTRFFSKNKCE
KVVFIPIEYKCLLAYKDDIIILPDDSLRSIQSIAKVRFANRSYLKFTYQV
YRYIITYILISSLRRCVLYTVSKDDETSLKRKFPKNNIKYLPHPIQARYN
LGRISDFTLNVKTIGFINLQNHYSVDATNFLSYSDLTELKNKTIIFHGSA
SKNWFQKAKELYVGVDFIEKRYIEDFDTFFDSLDLVIMPLDAGAGVKNIL
LNSVYKNKLVFGTKEAFSGIPEHLAKPFIINSIGDINEKLKKMLSLEKDF
FRLREYILEHHTIENFKSALCE
>YE1800 xis, excisionase
MTKLLTLEEWAEETYRSKQPTPQTLQRWARGGNIYPAPEKHGREYRVQPG
AIYIQPKSYRLAKEILKTSPSTSSSLIEKINHGIKAKTI
>YE3061 ybaR, putative cation-transporting ATPase
MLQTTVLALHGLSCMNCAKRVKTALESREDVHHAEVNVHYAKVTGEAETT
TLIDTVKQAGYQAEEAQTPDIELQLSGLSCGHCTESTRKALEAVPGVIAA
DVSLDNAKVYGKVEAQTLIDAVEQAGYHATLPGAQSPKTEPLTDSAPSSP
EYLAAASSTIPAATTDIKNTQPSQPVAEPADNDSVQLLLTGMSCASCVSK
VQNALQSVDGVEVARVNLAERSALVTGHPSNEALIAAVKNAGYGAEIIED
ETERRERQQQMSQASMKRFQWQAALGLLLGIPLMGWGLFGGSMTLTPETQ
TPWLIVGIITLLVMIFAGGHFYRNAWVSLKNGSATMDTLVALGTGAAWIY
SITVNIWPDVFPMEARHLYYEASAMIIGLINLGHAMEQRARQRSSNALER
LLDLAPPTARLVTDEGEKLIPLADVQLGMTLRLTTGDRVPVDGEIVQGEV
WMDEAMLTGEPIPQQKSTGDVVHTGTQVQDGTVLFRANAIGSQTTLARII
KLVRQAQSSKPEIGKLADRISAVFVPTVVAIAVIAGLIWYFFGPQPQLIY
TLVVATTVLIIACPCALGLATPMSIISGVGRAAEFGVLVRDADALQQASN
LDTLVFDKTGTLTEGHPQVVAIHTFNGVSEQQALEWAAALETGSNHPLAR
AILQRAEGLTLATVNQFRTLRGLGVSGEVDGVALLLGNNRLLEEQQIDTS
ELQSLIQQQAESGTTPVILTAQGKPAALLSIRDPLRSDSISALQRLHQRG
YNLVMLTGDNPITANAIAKEAGIDQVIAGVLPDGKAEAIKQLQAAGHKVA
MIGDGINDAPALAQADVGIAMGGGSDIAIETAAITLMRHSLHGVADAVEL
SKATLRNMKQNLLGAFFYNALGIPIAAGILFPFTGTLLSPVVAGAAMALS
SITVVSNANRLLRFKPKQ
>YE2817 yeiB, hypothetical protein
MRQRIATLDSARGLAILGILLLNISAFGLPKAAYLNPAYLGLPSVSDSWT
WAILDIVAQAKFLSIFAILFGAGLELLLKRGKGWIRARLSLLLLLGLIHG
IFFWDGDILFAYSLIGLVCWRMIRDAKDAASLLRTGGVLYLLGVAVLLLL
GFVTSGEPGRFWQPGPADLQYEQLWKLQGGVEAWKNRLDLLSSNLIAIGA
QYGWELAGSMLFGAGLMRCGWLRGEFSLRHYRLLAACLIPLSLVIQIPGV
VLQWRVGWDFRWTGFLLQVPRELGAPLQAVGYLALLYGFWPTLSRWRVSH
WLAQVGRMALSNYLLQTLLCTLIFYHFGLYQQLDRLQLLAVVPFVWLCNI
LFSLLWLHYFVQGPVEWLWRKLTAYACGQSLQPRNTRS
>YE1600 yenI, N-acylhomoserine lactone synthase
MVFIMLKLFNVNFNNMPERKLDEIFSLRKITFKDRLDWKVTCIDGKESDQ
YDDENTNYILGTIDDTIVCSVRFIDMKYPTMITGPFAPYFSDVSLPIDGF
IESSRFFVEKALARDMVGNNSSLSTILFLAMVNYARDRGHRGILTVVSRG
MFILLKRSGWNITVLNQGESEKNEVIYLLHLGIDNDSQQQLINKILRVHQ
VEPKTLETWPMIVPGIIK
>YE1808 yenI, methyltransferase-endonuclease
MLEEVDEIRVKANANLDVNKKGELGQFFTSSSICIFMASLFNELKGDISL
LDPGCGPGSLTAAFTEEVIRRGSARSLELHAIDIERKIKPFLDVVLDKCV
SASNAAGIKCKIYPQINDYITAASVTKHDFGTEMYTHCIINPPYKKITSA
SDYRKILSAIGIEAVNLYAGFVALAIMQLKKQGEMVAIIPRSFCNGPYYL
PFRNFIFQHCAIKHVHIFDSRSHAFSEDDVLQENIIIHLVKNGIQESVKI
TSSPNSDFFFDQESNSVSASDMTVRNIPFESLVNMLDKDKFIHIAANNRD
QSIIDRLNVFYTSLNELGISVSTGPVVDFRLKSDLRENIEPGAVPLIYPV
HLNGVVDWPKKSKKPNAINVSERSRSWLWSNQGYFVIVRRFSSKEEKRRI
VATVYDGSLPGEWIGFENKLNVFHINKSGMDKDIAYGLSAFLNSMLLDKY
YRLFGGHTQINATDLRSLHYPDRKSLQRIGSYISSQGLSQENINEAINTE
IKRLSKNDDKNPLAAQEKLDQALEIITLLGMPKSQQNERSALTFLALVNL
RPEGSWQELEKPLVGVTPIMDWCRDIYGKEYAPNTRETFRRQTLHQFIDG
GLVLYNPDKPNRAVNSPKACYQIAPELFDVLNTYGTPLWNKALGEWLMQR
ETLVEQYAMKREMHMIPLTIDNGTEIHLSPGDHSQLIHDIVTEFGPRFAP
GSQVIYLGDTGAKEDFFRKDALADLGVTVNRKGKLPDVVLYWPQRDWLIL
IESVTSHGPVDGKRHSELANLFKDARPGLVYVSAFPDKKTMSKFFSEISW
ETEVWIAEAPTHMIHLNGDRFLGPHN
>YE0766 ygbE, hypothetical protein
MLNITQITIDEQREREEPSYSFLGGVTGFVFYWLAFAIPFFVYGPNTVFF
LLYTWPFFLALMPVSVLIGITLSMLSRGNVLITVGGAGIAVVCLFWMLFS
FLTGW
>YE3435 yggT, hypothetical protein
MLTLTFLAKTVIDLYVMVLLLRIWMQWVHSDFYNPFSQFVVKITQPIVGP
LRRVIPSLGPIDSASLLLAFLLMTIKYPLLVLIQSGSMSLSLYNLLFGII
SLVKAAGYLIFWVMIIRALMSWVSQGRSPMDYLLYQLTEPLMAPIRRILP
AMGGIDFSAMVVILILYLINYLGMDLLGELWFVL
>YEP0012 yopD, translocator protein
MTINIKTDSPIITTGSQIDAITTETVGQSGEVKKTEDTRHEAQAIKSSEA
SLSRSQVPELIKPSQGINVALLSKSQGDLNGTLSILLLLLELARKAREMG
LQQRDIENKAAITAQKEQVAEMVSGAKLMIAMAVVSGIMAATSTVASAFS
IAKEVKIVKQEQILNSNIAGRDQLIDTKMQQMSNTSDKAVSREDIGRIWK
PEQVADQNKLALLDKEFRMTDSKANAFNAATQPLGQMANSAIQVHQGYSQ
AEVKEKEVNASIAANEKQKAEEAMNYNDNFMKDVLRLIEQYVSSHTHAMK
AAFGVV
>YEP0005 yopT, plasmid type III secretion system effector protein
MDSIHGHYHIQLSNYSAGENLQSATPPEGVIGAHRVKVETALSHSNRQKK
LSATIKHNQSSRSMLDRKLTSDGKVNQRSSFTFSMIMYRMIHFVLSTRVP
AVRESVANYGGNINFKFAQTKGAFLHQIIKHSDTARGACEALCAHWIRSH
AQGQSLFDQLYVGGRKGKFQIDTLYSIKQLQIDGCKADVDQDEVTLDWLK
KNGISERMIERHCLLPTVDVTGTTGSEGPDQLLNAILDTNGIGYGYKKIS
LSGQMSGHTIAAYVNENSGVTFFDPNFGEFHFSDKEKFSKWFTNSFWENS
MYHYPLGVGQSFSVFTFDSKEV
>YE3712 yqjA, putative DedA-family membrane protein
MDIIKELLHALWAQDYETLANPSLVWAIYILLFVILFLENGLLPAAFLPG
DSLLILVGVLIAKGAMSFPVTLVVLTTAASLGCWVSYIQGRWLGNTKVVQ
GWLSHLPAHYHQRAHNLFHRHGLSALLVGRFLAFVRTLLPTIAGLSGLNN
ARFQFFNWMSGFLWVLILTTMGFAFGKTPVFLKYEDEVMFFLMLLPLALL
VIGLFGSLYVLWRKKNTTPVNDSNDKGKPE


# Yersinia pestis Antiqua, Antiqua

>YPA_3350 insertion element IS1661 DNA-binding protein
MIGFGCYGFTAHLASDEPTAKGAISALNPRSWIFLSMIWRFGSMKHPFST
RLAAVQHYLSGKATLRETARQFSVGKSPLTRWIRAFRRQGEAGLEHHLSR
TYTPEFRLCVVRYMMANRCSAADASAHFNIPNETIIQNWMKRYREGGKEA
LNPSKTGPTMPKDKYEHDSKPFSEMTHAELEKELEYLRAENAYLKKRKAL
REEKALREQQKKLSTTDEK
>YPA_1299 pili assembly chaperone
MILIARQIIACSMLLLTSVSLSVQASVVMTGTRIIYPEGSREKVLQLSNK
DDHPNLVQLWMDDGNNQSSPSKSDVPFALTPQIFRMEANSGQVVRLTYIA
RNLPKNRESVFYLNFLQIPALKADTLGEKISVTLDTTTGNKIKVHNPTGY
ISLRDAKIVSNGKTVSFATSEMFAPDSTTDLALPIGIKAKKGELLILNVV
NDYGATIPNNYYL
>YPA_0158 putative lipopolysaccharide biosynthesis protein
MANICWIYYMPVHASIEPLGWESEFFQRQSAKLIFSDSAPPLNPAELAAF
TLVQAKVPTHRLDLIDALSQLDFHLVEGEIDLSLVVGEKEGIGTENATSE
PNMGAYSLRVATEADIPQLRRVAASAFALSRFRAPWYDAQDSGRFYALWV
EKAVLGTFDHQCLLVLDPTDQPVGFVTLRDLQDGSARIGLLAVFPGAQSK
GIGLRLMSAAKQWCQHHGLHRLRVATQMSNIAALRLYIRSGASIESTAYW
LCRG
>YPA_1713 glutathione S-transferase
MADAYLFTVSRWANALNLQIKERSHLDQYMARVAERPAVKAALAAEDIK
>YPA_3124 hypothetical protein
MSTLLYIHGFNSSPSSAKATAFKEWLQQHYPHIEMLTPQLPPYPAPAAEM
LENIVMDKAGQSIGIVGSSLGGYFATWLSQRFGIPAVVVNPAVRPYELLS
DYLGKNENPYTGQQYVLESRHIYDLKAMQIEKLESPDLLWLLQQTGDEIL
DYRQAVAYYTPCRQTVESGGNHAFIGFDHYFSPIVTFLGLANT
>YPA_3634 carbohydrate kinase
MSVFILGSYAKALVMTTDRIPLAGETLIGYDFRQTWGGKGSDMAVQAVRL
GAEVAYAGVVGDDTFGHEFVGLMQEEGVNIDALTISGELPTGAGLIVKDK
EARNVIVVDMGANKLFTPALVDSALSQLKQSNVVLTQLEIPLETARYGLQ
RAKEFGKITILNPAPARDLRGLDLSAIDYLTPNETEARVALGLPPDDPRS
NREIANLLLETGCQYVVMTLGESGSAVFGRNDTQEIPPCIIDVVDSNGAG
DSFNAALAVALDEGLPISEAVLFANATAALCCMDWETVPSYRYREDVDAF
MRSITVKEE
>YPA_3344 Glycogen/starch/alpha-glucan phosphorylase
MGNGGLGRLAACFLDSMATVEQPATGYGLNYQYGLFRQSFRECKQQEAPD
NWQRESYPWFRHNAALAVDVGFGGNLVKQADGRQLWRPAFTLRGEAWDLP
VLGFRNGVTQPLRLWQATHQHPFDLTLFNDGKFLLAEQNGVEAEKLTKVL
YPNDNHLAGKRLRLMQQYFQCACSVADILRKHHLAGRKLAELPDYEVIQL
NDTHPTIAIPEMLRVLLDEHQLSWDAAWAITSKTFAYTNHTLMPEALECW
DEKLVRSLLPRHFVIIKQINAQFKKLVNKQWPGNDEVWAKLAVHHNKQVR
MANLCVVSGFAVNGVAQLHSDLIIKDLFPEYYQLWPNKFHNVTNGITPRR
WLKQCNPALSGLIDDTLKVEWANDLDVLQDLEPYAEDPAFRQRYQQIKYD
NKVKLAHYVKRVMGLVINPDAIFDVQIKRLHEYKRQHLNLLHILSLYRQI
RDNPALDIAPRVFLFGAKAAPGYYLAKNIIYAINQVADKINNDPIVKDRL
KVVFIPDYRVSVAELMIPAADVSEQISTAGKEASGTGNMKMALNGALTVG
TLDGANVEIAEQVGDENIFIFGHTVDQVKAILAKGYQPKKYVKADPHLKS
ILDELASGAFSQGDKQAFDMMLHSLLEGGDPYLVLADFASYCQAQKQIDA
LYRDKDEWTRRAILNTARVGMFSSDRSIRDYQQRIWQAKR
>YPA_3395 insertion element IS1661 DNA-binding protein
MRSFLASVRSDGGSSQTRCTPGHDPGALNPRSWIFLSMIWRFGSMKHPFS
TRLAAVQHYLSGKATLRETARQFSVGKSPLTRWIRAFRRQGEAGLEHHLS
RTYTPEFRLCVVRYMMANRCSAADASAHFNIPNETIIQNWMKRYREGGKE
ALNPSKTGPTMPKDKYEHDSKPFSEMTHAELEKELEYLRAENAYLKKRKA
LREEKALREQQKKPE
>YPA_3567 3-isopropylmalate dehydratase small subunit
MAKFIQHIGLVAPLDAANVDTDAIIPKQFLQKVTRTGFGQHLFNDWRFLD
DASKVPNPDFVLNLPRYQGATILLARENFGCGSSREHAPWALTDFGFKVV
IAPSFADIFYGNAFNNQLLPVTLSEADIDTLFQLVKENEGIEFVVDLEQQ
TVNAGGKSYAFEIDPFRRHCMINGLDSIGLTLQHEHNISAYEKQQPEFLR
>YPA_3800 metalloprotease
MSYSREYSNTIIKNEQVMRRGIHYKNEVKGVIAPQISSRQSWKENTIHNK
NTNLTYSFSRAYTLWDYDRTFQQNAYISLFNPAQIHQAKIAMQSWADVAN
ISFTEASADSSANILFLNFQRPSNVAGYAYYPNPGSFSPIWINYSFSDNQ
HPSRLNDGGGVLTHEIGHALGLGHSHAPHGYTQQMSVMSYLSEQGSGANY
GQHYLSTPQMYDIAAIQYLYGANLHTRTGDTVYGFNSTSYRDHFTATHAS
DALIFCVWDAGGNDTFDFSGYKQNQMINLNELCFSDVGGLKGNVSIAADV
TIENAIGGSGHDDIIGNHTNNILTGNGGSDQLWGNGGNNTFRYASARDSM
TTSPDTIHDFKSGRDKIDLSQLMPSTDRVIFVDRLSFNGQTEMGQQYNEV
ADITYLMIDFDAQVSECDMMIKFTGRHHFTANDFILSTSLTA
>YPA_2100 putative long-chain fatty acid transport protein
MNQKNLFTRSALAAAIALISSNVSAAGFQLNEYSAAALGRAFSGEGAVAD
NASVGSRNPAAMTLFDRPSFSGGVIYIDPSVDITGTSPSGKSTDASNIAP
SAWVPNLHFIMPLDEQWAIGASATSNYGLATEFNDDYVAGMLGGQTDLKT
ANLNLSAAYRLNDNFSFGLGFDAVYADAKIVRHLGEAGGGLLPANTEAAR
LEGTKWGYGWNTGILYEIDKENRYSFTYRSEVNIDFDGDYSNQLPVIFGG
LGGKTVPGSLTLNLPAVWEVSGYNKVAPQWAIHYSMAYTTWSSFKELKAT
ASNGDVLFDKHEGFRDAYRIALGTTYYYDDNWTFRTGIAFDDSPIPAGNR
SISIPDQDRFWLSAGTTYAFNKNASVDVGIAYMKGQNVSITEKTPAPSNT
TYEFNSKGSAMLYGVNFNYTF
>YPA_3203 putative MocA-family oxidoreductase
MIRFAVIGTNWNTARFVDAAHESGKMKLVAVYSRKLDQAQTFGDDYHVTD
CFDNLEALAASDLIDAVYIASPNALHYSQAKLFLSHKKHVICEKSLASNL
AEVESLIACAHQHQVVLFEAFKTAYLPNFIQLQQALPRVGKLRKAFINFC
QYSSRYQRYLNGENPNTFNPAFSNGSIMDIGYYCLASALALWGEPKSVWA
SASLLPSGVDAHGTVSLNYGDFDVIIIHSKVSQSDIPSEIQGEDGSLVIE
SISECLSVAFTPRGSHSQDLTQPQHINTMLYEAEAFARLVENNRVDHDGL
HLSLLTSRIQTEIRRQTGVIFPADTQYPAAS
>YPA_3208 altronate oxidoreductase
MFSNTTEAGIAWNEADQFSDAPPSSFPAKLTRLLFERFEHFDGAADKGWV
LLPCELIDYNGEALRELVLRYASHWQLPAAFTHWLTENNTFCSTLVDRIV
TGYPRDEVAALQTELGYQDSFLDTAEYFYLFVIQGPQGLAQELRLDQLDL
NVRIVDDIKPYKERKVAILNGAHTALVPVAYLSGLDTVGQTMDDAQISRF
VEKTITEEIVPVLDLPEDELLSFSQAVLSRFRNPFIQHQLLSIALNGMTK
FRTRILPQLLTYQQQKGQLPPRLTFALAALIAFYRGEREGQTYPLQDDAH
WLERYSTLWNGVKHGDIALAELVNRVLSDANHWGQDLTAVPQLANQVTEQ
LQTILSRGMRAAVAAYS
>YPA_1550 hypothetical protein
MECPVCNNTQLVMTERKSIEIDYCPNCRGVWLDRGKLDKLIEKSVENSPA
TSFSDEREHGNHGYSKSKHYRKKGFLSRFFD
>YPA_3714 aspartate carbamoyltransferase regulatory chain
MMTQDYKLQVEAIKCGTVIDHIPAQIGFKLLSLFKLTATDQRITIGLNLP
SKRSGRKDLIKIENTFLTEQQANQLAMYAPDATVNRIDNYEVVKKLTLSL
PERIDAVLTCPNSNCISHNEPVDSSFTVKAQRGEISLKCKYCEKEFDHLT
VLHAD
>YPA_3568 MFS family transporter, sugar efflux pump
MHRSLQLQIFNAIFIGIVAGIGMLYFQDLMPGRAGAATTLFTNSISSGVI
LAGVLQGVLTETWGHNAVYVAAMVLVILALIICAKVREA
>YPA_3694 putative insecticial toxin
MQDNIDHTASVIADTDNTIHQQAKAEERHRQAVRRATQLRNDPVLSGINK
LAFSVAPKILQPEARTDLSLAEGIPERANEYADPASIQSLFSPGRYLCEL
YHVAKELHEDGNKLHIDKRRPDLQELVLSNSNMNQEVSSLEILLNVLQTN
APLAKLAKDTEAHANDVSFTLPYDDNLTVINAILEDKAISLREIAALLAE
NNDPWANPITPALVQEQLGLNPASYALIDIKSPLDDNSAKRLAHATQLSV
EQLQWLNKNAIESSSDKDSPLRPEILTIISEYRRLHQRYGLSVDPFIAII
NAVNTTHTNENKTSFFQQIFSTLDVDAGFNFLDQGSWEVIIRKALGITAE
ELLRIAKYCFGKSSISNVKMNSKKFSQLYRMAMIPRTLGVSFSQAEYLWQ
LYSHPDENIMEKIAQGNALTIIDAIIVPSMDE
>YPA_3587 hypothetical protein
MAPPGQWLKKSAWPDSVPFKLIGFRPDNEEISGAFPAVSARAFVWDNPSA
PPSEVTLLRKTLWLLPDNDMGLMVFTGSVPLTHLFDEPIDTLLVGLDDSH
SLRELEYYQQVYKSRSVEGAASFEFLKDPELMPEGMPLNVIRDLADHPDS
LRYSASAMSEAESERFYQDVQDAIDRQEQQKSEEQETLGDLNVPAAGKEE
AGTQWLESKEDTATNVTFLGTDFSGMTLDNKQFRYCMFTGCHFDKATFKD
CTFEHCQFTQSDFENSRWNNVHLSGCLFKQAEWQKAAFTHCKWEKSTFEY
GVFKHAQFTDNALDNCLINHSDFSLGTFDHCTLNGCFFSETHCDQTQFNQ
VIITSCIFEKCDGPKACFTESTIEKTSFISSSWVGGRLSHCYLNSLTTGL
NTNLSESHFEQCSLNKMGFLKVNLQSSTFINCSMLESCCDKADFSQATLI
ACDMTAVRLKDANLVHSHWQNTSLQQSMFYNADLRDATFQRCNLAGANLA
MISQNMDTRFEHCLTEKTHWIPRRYTVPA
>YPA_3632 putative araC-family transcriptional regulatory protein
MEKNMQRTQVVANWYRESDEFSTLVNYRILRAGHIRAADNFHVRRQSVAG
HELIFCLNGSGFIRLENNLHEVKKGNLAWLPVRWPHEHFPNKQEPWEILW
LRIDGAKLNNIMQILDVAQQPVFEFTSPETITDIYHRLFDLMQSHTLVAD
AHCDVLCSQLIYTLLENRSFDATKSPVISHRGLGRLIYQIHSHYNDDWDI
DKFMQYCQVSKSQLFRLFQETFNQSPLRWLKNYRLSQARRLLVETEETIS
RIAGLVGYNDPLHFSRDFHRSVGLSPSDFRRQERQLDNDRHD
>YPA_3109 ExbD/TolR-family transport protein
MRMNDSLDESGELHEINVTPFIDVMLVLLIIFMVAAPLATVDIKVDLPAS
SAVPQPRPEKPVFLTVKADNQLYVGDQPVDRETLATALDKVTQSNKETTI
FFQADKVVDYETLMSVMDALRKSGYLKVGLVGMEAGSSGAK
>YPA_3372 putative siderophore biosysnthesis protein
MGRWWRYKWITFHPSLTIKPVTSIKSATSIKSVASIEPVASTTWLSDEEA
VGWLPISAKTEAALLATARQLAEVLSQHTALARAVVAAWRTKRSHHFPIR
ALIQYHSLSDLIAQLHQVTGEKSDPRCRFEKAQSPKQWLAGGAFDWAAVT
PSEGAVSAEVLALLPLYPFTRQRYWAEGLLAESTPLSSPSVIAHPLDFSF
VRTWVAETLAIASGALSDEDDLLSLGLDSLQMLDLVDECKKRHITLTLAR
LFEKTTLGAWEQYWDSICRSGVCGESVSAVTPLTEEQTTDEYWQGEPFVL
TPVQQAYWQGRQKGQTLGQVACQVYLELDCPSLADERLHQAVTQLFHRHE
LLRMRINEQGMGVIQEIEPVTVTQYHWDQLSARAGAKERYALRARLSHRM
ADLTDESGFQLIASHDGQNTRLHINVDMVIADAMSLQILLEELGLLLSSQ
NTPFAPLDYHFPQYLLEQKSTPSTGESEAYWQQRLIAGLPLAPQLPLAIK
PEQIDKPIFQHREWRLASEHWAKLQQIARQHRLTPSMLLAGCFAETLRGW
AKLPDFSLNLTIFNRRGAHPQLAKLVADFTSLLILACEVKENESLLDHFR
RLNQQFMVDLDHGDYSAVHVLRQWSQQRGEQVTLPVVFTSNLGRELLGEN
APGALHYLVSQTPQVWLDCQVMEYQKALLISWDTVDALFPEGMLDQMFAF
MQQLLQSIVENEQVLQSPVQAYVDDKVLAHRQQPLSGVHGVPFEAKTLHQ
GFWQQVALVPNNIAVINASGVLTYREMAIRVGDLAGYLQQRIARQGHVGI
CLPKGIEQVIAVLAVLSIGAVYVPLDITAPSERLQQLIAQADIGVLISDH
EISANCDTVNINITGGFPSPFTPSVFTSSAAAPSIFVPSIFVQPDEPAYL
IFTSGSTGVPKAVVVTHQAAMNTIDEINRRYVDDGKIILFALSALNFDLS
VYDIFGPLSVGGSLVLPNAGDEKEAKQWLSALHQHQVTHWNSVPALFEML
LIAAEQGTQALPRSLQQVLLSGDWIGLDLLPRLRALGSQARFTTLGGATE
AAIWSNALDIAVIPEEWVSIPYGYPLAHQYYRVVDQAGRDCPDWVTGELW
IGGRGVALGYYREPEKTAAQFVSVHRAGFYRTGDYGRFWPNGCLEFLGRQ
DRQIKLHGYRIELAEIEAVAEHLATIKRAVVLYLDQPQKHLYLFVEPQNQ
VSTLFSATVDDVLPETRLVTDIREQENTITAFLLCHVLSNVLKLDLAEYQ
STQVLQEKGGISEAYQPLFLAWLHWLVEQGVLLSRDDQFKTTGIQPVRPE
VTEPHLLPLLMTLEQKASWIGEVVTGRGNPLLLLDDTLLSPEILVANTHD
AQVVIDVLCQRIARLSRQLQRPVNVTELNGRTGLFASRVLSRLGTEVLSY
CLLESANSLRQHAKYRLSGYSHETDVIATPDEQQQLADILIVNNSLHRFD
ALPEGIAMLQSLCHGGTEIVLFESLALSPLSLLTVLLLQHPNGFRDQRQG
ALSPLLDLPAWQSLLTDSGIAIQQLSIIEPYSALFVCQPQITKVPVSREQ
IKQHLQQHLPAYMVPSSITVLETLPLTPNGKIDRTSLYKQAVEYGQLSHD
GEAPRVLSAQQKPPDERERSDKRELSDKKEQCVARLWQQVLGVTPQSDSN
FFLCGGDSLHATRIVALLEKEGTVGVPLSQVFLLPVLRDFAHSLAFRHAP
VQEAQLVHCAAQRYSPFELTDVQRAYLMGRQSGFPLGGVATHYYQEYEYQ
EYEYQENEIYQEHESHQQYKGTPFDRERLERALTLLVARHDALRIVFDEQ
GTQRVLPHQPSPALQVIHCGAVEWETVTATQRDRLSHYVSEPTQWPLFHA
TLIQSDEGRNRFCLGLDNLVLDGLSMQIFFKELAVLYEDHTATLPSVEIG
FRDYLLAESSSPHRQVSEDYWRESLPMLPDAPRLPLVCDPALVGTPKFKR
WQAVLSASDWQMLLHKAKQHQITPSCLLLTCYAQVLAKWSESASLTINVT
LFDRKPCHPDINHIMGDFTSLLLLGCRQEKGEGWLATAQRIQQQLWRDLE
HREVSAVWVLRELNRQRGTTHIHMPVVFTSALGAQVDGYPDSAFQAPLWG
ISQTPQLWIDHQVFERDGELHFNWDVVEALFPAGVIDAMFAAYCQLLEKL
CVTEHWHDSVEIALPAPQQQSRQRVNATHQAMIFSPMHCRVAHAMAAYAQ
QTALIWGGQPPQLSTT
>YPA_3599 putative transposase
MMQSRTVYNYRSHRDDRAITQRIREIAETRIRYGCPRIHILLRREGWLVN
HKKTHRIYCLEGLNLRSKRPRWHVTARHRHASPEVTALDQCWSMDFVADN
LFNGRRVRALTIVDNFSRECLAIEVGQGLRGDDVVAVMDRLKHSLGRIPQ
RLQTDNGSEFISKSMDRWAYENRVSMDFSGPGKPTDNAFIESFNGSLRDE
CLNVHGFLSLEDAQEKIEQWRQEYNHFRPYSSLNNLTPAEFARSHQKGPD
L
>YPA_3608 hypothetical protein
MYPSLPKAFIQDKVMLSLHNKAALDSTIKGDAVRALLLHAYVDIKGNEVE
RRTTAVGIQKLYEEVLQELLYQDPKCISVMISHTANPPTPLSIASPERVM
QMMHQNIRQDIASQKTITDRTQTLHNLLSYKNQFQYKAIYDQPRLKPEEN
YNFKMVNDVHENLHSIQTDSCCKAGALTGATYFLQDSPGEIHVFGIRITQ
ANEESGKVELFQGLKSDVQTNNKLDSLYLLLRGNSHQSPDDISSSVG
>YPA_1609 putative ABC RTX toxin transporter, fused ATP binding/permease domains
MLTPLFWRYANPRILIFDEATSALDDESQSEIQKNMARIIANRTVITIAH
RLSTVRHCHRIAVITQGRVTELASHDELLQLNGSYARLWQQQVHFNQNKS
TLTKTSPL
>YPA_3850 putative metalloenzyme
MRTPDKPMPYFIDTHCHFDFPPFTGDEAASLACAAEANVRQLIVPSVKAA
YFSRILALADRYPPLFAALGLHPLYIAEHEDADLAALASHLADKPPKLVA
IGEIGLDLYMDEPQFPHQLVILNMQLELAKQHDLPVILHSRRSHDPLAAA
LRKAALPRAGVIHGFAGSLAQAQAFIRLGYYIGVGGTITYKRAQKTRHVM
ASLPLSSLLLETDAPDMPLASFQGQANRPERAANVFAALCELRPEPADEI
AAELMCNSQRLFSLPPLRP
>YPA_3590 putative Clp ATPase
MYQRERLFNRLGTFAYKTFVEATRLCRSYRHEYVELEHWLKILMDQQRGD
IPALLTHYGLSQTVISTQLDRIIQHMPVTKASVQDLSSRLESVVEGGLVL
SQLMASPSPIRTSHILLALLQDTQHQRWLYRLCDEFKKLPVPQVAEEYEG
LLVNSIEQTSEVQRGDALVTETDSSSSKEAHAALNKWCNDLTQQARDGEI
DPVIGREAELRQVIDILLRRRQNNPILVGEAGVGKTAVVEALARKLASGD
VPPLLQGARLLSLDLGRMQAGASMRGEFESRLKSLIDGISQSDTPVILFC
DEAHTLVGAGGQAGTGDAVNLLKPMLARGALRMIAATTWSEYKQFIEPDA
ALTRRFQRVLIGEPNESTAVDMLRAIAPHFAQHHNVTIRDGAIHAAVRLS
LRNLPSRQLPDKAISLLDTACARVALSQYAQPQAIEQLSAQLDVLKTEHQ
YLVREKQLGEAVDQRLDDVQTQIQAFEHELTALQSRHLHEQQLVREVLPE
GEGSGMMTENARWGELMALQEASPLVYPWVDEQSIAAVLSDWTGIPSGKM
LQDDIECVLNLEQRLGDHIFGQRNAIKEISQAIRIARAGIQSQERPLGIF
LLAGSTGTGKTETANVLVETLYGGAHNLITFNMSEFQEAHTLSTLKGAPP
GYVGYGKGGKLTEAVRRKPYSVILLDEFDKAHADIQDAFYQVFDKGWMED
AEGRRVSFRQCFILLTCNQGAEEIEQAYLTANDIKPGALKPLVYDALLRR
FAPALLARVNIIPYIPLDQDALAQIASHHLARLQTRLWDEIGATLVTEGD
IPGWIASRVCSHPNHGRAVEELLRQTLLPAVGNEVLKRRHEAEPLREIRL
IVEETELSIEFA
>YPA_4132 outer membrane fimbrial usher porin
MMAIIRLKDGSSPPLGASVITDKTGAEVGIVGDDGLTYLAGLQDTERLTV
QWGKKQCTLILPKDKGMNSGKVLLPCQ
>YPA_1858 flagellum biosynthesis transcription activator
MVEKSIVQEAKDIHLAMELITLGARLQMLESETQLSRGRLIKLYKELRGS
PPPKGMLPFSTDWFMTWEQNIHSSMFYNAYSFLLKSGYCTGVEAVIRAYR
LYLEQCPDLPEVPPLLALTRAWTLVRFVDSGMLQLSRCNCCGGTFITHAH
QPLNSFVCSLCQPPSRAVKKRKLSPQSADITSQLLDEQVRRAV
>YPA_3396 hypothetical protein
MHRGHGREYTSFESFHHQIEHSQEKTALYYRLRVKNSLFRKGFEHYISFV
RSDESKLVLAEENVSVSLTCTNRELPLSLRVGDINQLSTDSPSFATFRNI
TRPSVPLYPVLDGGLHWSLLSNMSLNYMSLLDKDALKQILHTYDFPSLHN
RQSARASQKKLDAIQRIETQPIDRLFRGLPVRGLQTTLWLEQGAFSSEGE
LYLFSTVLARFFSLYASVNAFHLLKVINLDNQECYEWPVQTGQHALM
>YPA_MT0083 DNA-binding protein
MAIAGCDLPTFATRLNEVLTLSVPVRTPEKLFGRDKQLETIQLALHSPGR
HVFIYGDRGVGKTSLAHTAASLIQSSDNRPITVSCDHDSTLETVIESVIS
QGMMRMPVDRYKTSATFGLNIPVLKAEARVEERETSRVRSVVNMASAVEA
LNYLTERYSDNTVIVIDEFDLIRSEEQRARFGVLLKQLSDGDVPVRIIFT
GIGQSVSDLIGGHLSSQRQIEQVDLERLHWTGRQRIVESAFRYFDINIPD
DIADRICALSDGFPYYVHLMCSKLLHECYMADEVVSTVTRDLFLASLDAA
VLSAEETLRSCYEAATCRDEHMHHILWAMAEGADLNRMKDHIITSYIQVM
KYLDIEPLTQKNFDSRFARLRKENHGSILCHALVGKDGVRPGWFRFRENM
IRGFVRMQAEKCGIVLDFDRQYSAHTASTRTAAVRGVYNPLSTVERSVAR
LRRDDEKEAEENE
>YPA_1642 putative acetolactate synthase large subunit
MRYTGAQLIVRLLEQQGITLVSGIPGGAALPLYDALGQSRIIRHVLARHE
QGAGFMAQGMARATGETAVCLASSGPGATNLVTAIADAKLDSIPLVCITG
QVPSSMIGTDAFQEVDTYGISIPITKHNYLVRDISELAQVIPQAFRIAQS
GRPGPVWIDIPKDVQTAEIDLATLPPPGIADTPGPVDLSAIAQVAKMINQ
SVRPVLYLGGGIVSSGAHQQAIQLAELASLPTTMTLMALGTMPVDHPLSL
GMLGMHAARSTNLIMQQADLLIVLGARFDDRAIGKAEQFCPHANIIHIDI
DPAELGKIRCPHLAMNADIAQVLTHLLPMIDAQLRSDWRSTVADMQREFP
FNQPNSENPLCHYGLIRAAAEALDDETIITTDVGQHQMWVAQAYPLHRPR
QWLTSGGLGTMGFGLPAAIGAALAKPESKVVCFSGDGSLMMNIQEMATAA
EEQLNIKIILMNNQSLGLVHQQQDLFFGKRIFAADYAYRTNFIHIAEGFG
FSTCDLNTASDPYTALHEALNRPGPVLIHALINVDEKVYPMVPPGAANID
MIGGE
>YPA_3495 deoxyuridine 5'-triphosphate nucleotidohydrolase
MMKKIDIKILDPRVGNEFPLPTYATEGSAGLDLRACLDHAVELQPGQTTL
LPTGLAIHIGDSALAAVILPRSGLGHKHGIVLGNLVGLIDSDYQGQLMVS
VWNRGQQPFTIEPGERIAQMVFVPVVQAEFNLVEDFTDSERGTGGFGHSG
RQ
>YPA_3290 putative taurine-binding periplasmic protein precursor
MAITFSSATLAASRRSFLQLRTILLASLLPASLLLATSAQAVDVIVAYQT
SAEPAKVAQADNSFAKLSGANADWRKFDSGSSVVRALASGDVQIGNIGSS
PLAVAASQNVPIEVFLLASQLGSSEALVVKKEIKTPQDLIGKRIAVPFIS
TAHYSLLAALKHWGIKPGQVTILNLQPPAIAAWQRGDIDGAYVWAPAVSE
LAKTGTVLTDSAQVGQWGAPTLDVWVVRKDFAKAHPEVVTAFARSALAAQ
AAYLNNPEQWLQNQAHLAPLSRLSGVPTEQVPALVKGNTYLPVAEQITQL
GQPVDQAIRDTAEFLKQQGKIPQVASDYRDFVTDRFVKEIQATPQS
>YPA_0679 hypothetical protein
MQRMQCIYIATWLIAVFTSPIVIANNTNINGSTNNNGNGTINIFDASSNN
DIHTLTGLGNEQLGGFSNHLIDSHNNTIDGGQSNNLVSSDGNMISAISLG
DGLFYGAQNNTLINSNNNLLIVTQGSTIIDSDSNTVSGISNNLIESNSNI
IGNENSCYSDPASPSGAWCVDNQNTLIGSDNNTITGALNGLHNSHNNDII
ASSVNNLMDTHNNIIAGGHYNTISGGGNNDIFGSENNVTDSTDANINGSN
NYVIDGNGIGRDDLTEDNSILGGSGNMGVGDSVTAITNSVVFGGNTSGNS
TGSTLTDSVSVSGNGTSGNNVVNIGGAANGNNSASLGTGSVSSEGGIALG
SGSIATRNDELNIGDRQITSVKKGVENTDTINVSQLNDSFDDVLNLSNEY
SDNSFSTVTENINNYTDASLDTVLNTTGEYTDNSILLVTNESNNYTDNGM
ESVSNYANIYADESLLAIYNEEANYMSNLIDVTLNNANNYTDLSVNTIIY
TGKQYTDSRINEYQRTFKNEFLTYSNGKFGGFDKDINQKQKQLNAGIAAT
MAAAVIPQKSGSKVSIGVGLAGYSDQGAGSVGAIWHVNQRITMNTTMTYD
TQRGVSLLTGLSIGI
>YPA_3345 hypothetical protein
MSQPMLKKDDFLAALTRQWQRFGLTSAQQMTPYQWWEAVSAALAEQLSAQ
PAPSKPKNVQRHVNYISMEFLIGRLTANNLINLGWYDTVDALLAEQQVKL
SDLLEQETTQHWATGA
>YPA_3598 hypothetical protein
MKWRFDCEVQMNGAAFVSSMILAVIFTTATVQASPVAYQKEAITENNLPV
FYPQLKQQMNYQSSWLAGKYTDFALWRSCQRRMKSDPLISPPTAQY
>YPA_1497 hypothetical protein
MRKTSSLFLLFGTLFSAGGYGATFLISAWFHSQGGNDIDAGATLGMALFG
TLIGVPLVGWFAGKLDASRLAALAALIQSCGYFLLGSLSGGSGYLPHVAA
LLIGLGWGMFYIGGPMALSERLTDTERGPGFTRFSAFQMTGICGSPILLT
IAVVQGGIPIQAAFLLVGVMGVFASILLMIFGTREPLIRHENSLRPWVKK
ITVLAKSGVIRPIVMVCLGGAVFSGMMSFQDSLTDGSLAIASTFFAVHAH
YRRGFTPVTCTQTIKLAPYPTRSGTLVLPNDRPCVPVRNTDACEFPDRCR
HANRRWLRSVIPRDSNLGGQ
>YPA_3539 hypothetical protein
MRENRPQLLDVLFDDAVAAENGPLHNVQQRATALLKLNRAVKGLLPSQLQ
PWCRVANYRQSVLVLETANASWLMRLRYEQPALLSALRAQILPSLSSIDI
RINPSLMAKGHNVTQDAAKSPQDRVKSPPVRHLSLESAKELRGLASRSPE
KLRVILERLAALAGEGANATKRDK
>YPA_3125 Icc-like protein
MHWLRRGLPPSKQVLAGEHWQILLLDSQVFGVPYGELSDYQLEWMERCLI
AYPERYTLILLHHHPMPSGCTWLDQHSLRNAHMLAAILTRYPRVTTLLCG
HIHQDLDLDWYGKRLLASPSTCVQFKPHCTNFTLDAVAPGWRYLDLLPDG
GLETEVHRLDSDEFCPDMDSDGY
>YPA_3847 hypothetical protein
MHSRLKACCVAISESACGRLSSHIPVKQGDALFLAQEIIRKKRDGQPLSE
EEIRFFINGIRDNVVSEGQIAALAMTIYFHDMSMPERVALTMAMRDSGTV
LNWKSLNLNGPLVDKHSTGGVGDVTSLMLGPMVAACGGYVPMISGRGLGH
TGGTLDKLEAIPGFDIFPDDNAFRKIIMLVWRLSAKPARWPLPISVFTRP
AILRQQSILFH
>YPA_3859 hypothetical protein
MITSTLAAYRSWGITGLLSRGMSSGWGILLPFTLLPVLGWVDISIGQLRI
LIVVAMLATVSMLYHVRLRHFLLLPSCLALLGGLVALMLMHGGVS
>YPA_0079 inositol monophosphatase family protein
MLEQIGQLAREAGVAIMAVYQGDKPLDIAHKKDDSPVTAADLAAHQIIKA
GLARLTPDIPLLSEEDPPAWDVRQHWQRYWLVDPLDGTKEFLNRNGEFTV
NIALIDQGEPVLGVVYVPVTEVMYSAANGQAWKEECGRRMQIQVRDAQPP
LVVVSRSHSDAELEDYLSQLGEHQTISVGSSLKFCLVAEGKAQLYPRFGP
TNVWDTAAGHAVAIAAGAQVHDWQGKTLSYTPRESFLNPGFRVSLF
>YPA_3880 putative phosphoenolpyruvate-protein phosphotransferase
MTKQVTFICGLLNGVHARPASHIERVCNRFQSRFHWYNPRSGIVGDGKSV
LSLIAANILLGDECQVTIEGEDEQVAFERLSIFIEHELPHADPILPKREE
SAEWEPLPASLAHLHPTLLRARSVSPGTACGKLLSLIRADLNALGDLPVA
QGIEREQQMLADGVAQLGKAWESLLVANSSTAANSSTTENNSTTRAIREV
HRSLLRDGTFRQRLLSHIVAGESCATAIVATAAYFSQQLALAANTYLRER
ELDIRDVSFQLLQQIYGEQRFPSQQALSEDSLCIADELTPSQFLALDKRY
LKGLLLGRGGSTSHTVILARSFNIPTLVGVDAAALQPYINQSLQIDGELG
LVVCLLDEPVRRYYRQEQWLHDQLREQQSRYQNMPGRTLDGVRMVVAANI
THAVEVEGAFNQGAESIGLFRTEILYMDRAAAPSEEELYTLYAQALGAAK
GKPIIIRTIDIGGDKPVSYLNIPAESNPFLGYRAVRIYHEFLSLFHTQLR
AILRASMHGPLKIMIPMISSMEEILWVKDQLAEVKQSLRISHLQFDETVP
LGMMLEVPSVMFIIDQCCEEMDFLSIGSNDLTQYLLAVDRDNAKVSEHYH
CLPPALLRALDYAVCEVHRHGKWIGLCGELAAKDSVLPLLVAMGLDEISM
SASFIGATKARLAKLDRGECRLLLNRVMACRTSREVEYLLVQYDGEQRDE
PLIIPECITLGADWRSKEEVIKGMVDNLLLAGRCRYPRKLAADVWAREAL
FSTGLGFGFAIPHAQSEHIEQSTISVAKLAKPVNWGEEDALFVMMLTLNK
QASGDQHMRIFSKLARRIMHEGFRDALVSAGTAQQVEMLLKHELEL
>YPA_1716 pyridoxamine 5'-phosphate oxidase
MIVILWRRGQNDNHSFPARYRVYMTENNEFDVADLRREYIRGGLRRSDLT
ENPLELFERWLKQACEARLPDPTAMCVATVDTNGQPYQRIVLLKHYDDQG
LVFYTNLGSRKAQQLAENPHISLLFPWHMLDRQVIFLGKAERLSTLEVLK
YFHSRPKDSQIGAWVSQQSSRISARGVLESKFLELKQKFQQGDVPLPSFW
GGFRVKFDSVEFWQGGEHRLHDRFIYQREADAWKIDRLAP
>YPA_3481 hypothetical protein
MIKRKIFLDDSQNEAAIRQQFNRAVALARRNGSAIAIGHPHPATIKVLQQ
MLPQLPADIVLVRPSALLNEPVQSLSPDKTKPREPVKGQRLPAIKQCKAK
ASYVPEKIYADKLFILLGESLMQNPAVIFIQQHWQQYFTPAPPATPIDEQ
KAIENTEKLPPKKAAQ
>YPA_4150 hypothetical protein
MDMAKIAVASATSFLLGSVLTAVGFLGGSVIAVAFAVFVLGVAITIGLDF
LDKKYGISVKIIALLKKAMEIHPRTPEANLQYILNGLGGNNEQ
>YPA_3573 thiamine-binding periplasmic protein
MITTVSSRIPMLTTASDTFATFMFKECKVFKHIISCLLLISATSALAAEK
PTLTVYTYDSFAADWGPGPAIKQAFEAECDCQLKFVALEDGVSLLNRLRM
EGKNSQADVILGLDNNLVQAAEQTGLFTPSQVDTRNLTLPEPWQNKTFVP
YDYGYFAFVYNKEKLKNPPKSLHELISSKEPWKVIYQDPRTSTPGLGLML
WMQKVYGDQAPQAWQQLAQKTVTVTKGWSEAYGLFLKGEADLVLSYTTSP
AYHLIAEKNNNYAAADFSEGHYLQVEVAGQLAASKQPELAQRFMQFIVTP
AFQNHIPTGNWMYPVIKMDLPAGFETLAVPQTALQFDAKDVADNRSKWIQ
AWQSAVSR
>YPA_3687 putative insecticidal toxin
MWSALAWRWCAVMLTGSQRYAFCNDRFHNRKITRLRNNFMPNILPTDLCA
NTPTLAIHDNRGFAIRTLAYNRRDHNETIGELISRNRYNASGQLIASRDP
RLEVDNFRYQYSLSGVPLRTDSVDSGSTLQLADSAGRTVLTLDAHHTRRW
VEYETGEHSLGRPLSYHEQAKGGLKTVTDRFFYATNSEQDKSGNLNGQCV
RHYDSAGLQALINQSIIGVPLQQQRRLLTNPKGPVDWFGEKENWGARLSE
QPFVSHSTTDALGQLLTQTDAKGHIQRMAYNRAGQLIGSWLTIKNSAEQV
ILRSLTYSAAGQKLREESGNGVITEYRYEPQTQRLIGIKTTRPAKKDRPT
RLQDLRYDYDPVGNILAIHNDAEATRFYRNQKIVPETTYRYDALYQLIEA
TGREADTNGIQNSQLPALASLNDSNQFVNYTRSYHYDRAGNLLKIQHTGA
SQYSTHITVSDSSNHGIQQQEGITARDIRSQFDAAGNQQQLQPGQPLRWN
SRNQLQQVEPVPRNDGISDSESYLYDGGGSRVVKTSLHKTHNAIQTRSVI
YLAGLELRSQYNGNNLTEDFQVITVGAAGRAQVRVLHWERGQPVDIVNDQ
LRYSFDNHLGSALIELDSDGDIISQEEYYPFGGTAVLASRNTVEAKYKTV
RYSGKERDATGLYYYGYRYYQPWLGRWLSADPAGTIDGLNLYRMVRNNPV
GLMDGDGLMTDKLLAKHEANFAKKNISSMAELKSEIEKLGLLPADSKQLF
LHLNGGESDDEPSGSSGSSGSSEILENTSPHKIKNFHFISEINLATMPRP
YYKDFSSTEDMLESAERLKAYGSIDTLLTLDLTSEDIPEFTSILADKGIN
YIAEKQYEIIDYFSEDELSSENIDRIVNMIKTIQNNNHKVGIHCAAGNGR
SGLIATAMIINKKYTQSRINSFEEKNKLKEIIDKNKNEINVDAITYDAMK
LVRKTNPFAGERTTDIKAAREYSRYLYSKQNR
>YPA_3535 hypothetical protein
MEQDVRRLLHFLCPENPQLGWGFSKAFSFGSVIKTPFDLLKQFAVWQLQM
FNKCQRRMKSDPLISPPTAQY
>YPA_3360 putative autotransporter protein
MNTIFKVIWNASLNVWVVVSELAKGRIKTKSSRNLISEGVLPKFEQSMVS
KLFRKNLLALSLGSIVFLSTGPVFAADITVSTQAELSAALSNGTYDKIIL
GADITLIGSLTVNMTSNQVVIDGQGKFGLTVNNTTNYGLVVSSGSGTLTL
QNMSKIDSANYYSMVVLNGANTAVNVIYNNIDFLGSSQLIYMGAYGAATN
SIMTFGDILNDVVVNDRAQEIGEVNKLAFTGRFHVTHTGSSVTSFVSTGG
ANNTSTMDFASGADVKIDRTGSTGDLTSTGVNAFAYTFADGASFELIANQ
NVFSGTTTNRGLEIGSYNSIDGFGSGVKIVLQSRSDGSIISGNGIDNATT
NAAGINNNASGDANVIYNLGTGSILKATNTGILATKNANNASDIYIRSAG
DITAATGISATHNGTGTVKIKNDGTITSTTAGIAISSASIKEISVDNTDG
TITATAGTGVNVLASAILNLFGGTINTSATANGITFAGTEGGHTLTDLTI
NLLGTGIALSNVAGVNLTLSNVTLNTLNGTALNSLTGLTLVDSLNGRNTI
NIEGAGIGIAATNTELNTFDAEALDINVNGAGIGIQATGGGVNLSASNLI
INVANTLGTALQITDGIDNTTTIGNEIQLNAENATAINFLGSSSKTLNNN
GTIKGSVIFAGVADHIINNNGTLDGTLTTGAGNDTLVLDSSSQSNDVINL
GDGNNSVTIQNGATVSSIITGNGNDTFTINGMSVGSTYLGSLDAGTGLNT
LNFNASTDELAAATSLQGFTNINLVDSHITLVSDDNIGSGMVNIDSSSEL
LFGSTFDGILHATLGAGTGSAIVNNSANVSLEQASMFAGTWQVNQGGALT
ASNSNQLGSAKIGLDGTLNLDNIALFNHVLTGNGTLNVAKNLATTAFDFG
STVGGAFSGIVNLTKTTFALSADNAAALASATLKLSDDSVTTVGTTDRTL
HGLDLSGGTLIFDGAVPQSQTSGVVTVTDLALNSGTVNITGSGSWDNTDP
LATNVSILEQDRAGSTLELINATNVTGDIDALDLLVNGTAITSGTQGVQS
AIQQGGSTVANAIHNYGLASSNSNGDSGLYVNYTLSALELLADGADALLL
ATESGLTANRVLNAELFGVGGLVVDAQNGALTLANGSNRYEGTTTVTAGE
LILGANGAFGQTSLLDIASGASANINGYSQTVGAVTNVGTVTLGSGGVLT
SGLLTNGGILDLTGGALNLTAGGASTVAGGLTGAGTLNINGGNLSVSAAN
SGLSGQTHIADVASVTLTDTGTLGTSAVEVLGTLNLNGANAAMTNVLSGD
GTINTNAAVTLSGNNSFSGAHQIGTDGELTVGQASNLGASSATVNLGTLT
SHLILNGVSESIANVLSGVAGSTVDIIGGADTALTANNSGFLGQYALAGN
SKLTVASTNNLGASSSVALAGAGDTLSLSGFNGTFGNSVTGSGVLQVTDD
AEVTLTSSNGVSNAVTIDIADATLNLDDIALFNHVLTGNGLLNVAKNDAS
TAFDFGSTVGGAFSGIVNLTNTTFALSADNAAALARATLKLSDDSVTTVG
ATDRTLHGLDLNGGTLIFDGSPPQSQANGVVTVTDLALNSGTISITGAGN
WENEHPVTPPNVSLLEQDRGDILLELINAANVTGNANNLDLLVDGTAITS
GTQGVESAIQQGGSTVANAIHNYGLTSSNGNGGSGLYVNYTLSALELLAN
GANALLLATESGLTANRVLNAELFGVGGLVVDAQNGALTLANGNNRYEGT
TTVTAGELILGANGAFGQTSLLNIASGASANINGYRQTVGAVTNSGAVTL
GNGGVLTSGLLTNGGILDLTGGALNLAAGGSSTVAGGLTGAGTLNINGGD
LAVSATNSGLSGQPHIADVASVTLTGTGTLGTSAVEVLGTLNLNGANAAM
TNVLSGGGVINTNAAVTLSGNNSFSGAHQIGTDGELTVGQASNLGASSAT
VNLGTLTSHLILNGVSESIANVLSGVAGSTVDIIGGADTALTANNSGFLG
QYALAGNSKLTVASTNNLGASSSVALAGAGDTLSLSGFNGTFGNSVTGSG
VLQVTDDAEVTLTSSNGVGNTVKVDIADATLNLNDIALFDHVLTGNGTLN
VAKNLATTAFDFGSTVGGAFSGIVNLTNTTFALSADNAAALARATLKLSD
DSVTTVGTTDRILHGLDLNGGTLIFDGSPPQSQANGVVTVTDLALNSGTI
SITGAGNWENEHPVTPPNVSLLEQDRGDILLQLIDADNVTGNANDLELMI
NGTTISAGQGVQSTVQQGGYTVANATHNYGMTSNGGSGLYVNYTLSALEL
LADGANALLLATESGLTANRELNAELSGVGGLVVDAQNGALTLANGNNRY
EGTTTVTAGELILGANGAFGQTSLLNIASGASANINGYRQTVGAVTNTGT
VTLGNGGELTSTDTLINTGMINVTDGILNLENGGASSISGGLTGNGILNI
KGGDFTISIDNNGLAGQTNISDGASVTLGNGGTIIGTGNLGSSVIDVLGD
LNLVADNSLANVISGDGTINTTATVTLSGNSSFSGAHQIGTNGELTVGQA
SNLGASSATVNLGTLTSHLILNGVSESIANVLSGVAGSTVDIIGGADTAL
TANNSGFLGQYALAGNSKLTVASTNNLGASSSVALAGTGDTLSLSGFNGT
FGNSVTGSGVLQVTDDAEVTLTSSNGVSNAVTIDIADATLNLDDIALFNH
ALTGNGLLNVAKNDASTAFDFGATVGGAFTGTVNLNNSTFDLSGNNTTVL
AQATLKLSSGNLTSVGNGVQNIGTLAMNGGTLLFDNIVDNAGIITSDGTI
AANSINTTGGGEVRVNLPSNLAPSLDGLSVMELDEGEIIVTLATGAATGT
GHELTLTDENGDPISAVTYQGVHNAGSTSAAATGSFNYGMTTGEDYDGLY
VNYGLTALELLSTGSEALVLTAILANNGTQSNDLSAQITGSGDLAFASAN
DGSTASLSNSTNSYTGTTWVSSGNLRLDADSALGQTSLLAMSTATHVDIN
GTQQVVGELATEGGSTLDLNDGKLTVTGGGQIDGALTGGGELVLSGGLLN
VSYDNAGFTGSTDIANGAVAHLSQAQGLGNGTINNNGTLHLDNTIGTLFN
ALTGSDGEVLLSNNASVQLAGDNSGYSGLFTNQAGSILIANSAEHLGGSS
IANSGALILDTGSVWELTNTISGTGTLVKRGSGTVKIEGDTVSAGLTTIE
EGLLQLGSSAVTQTLSLEESLQERALLVSFASNMANLTSNVLITANGSLG
GYGQVTGNVENYGNLIMPNALTGGDFGTFTIDGNYTGDEGMITFNTILAG
DTSVTDRLVITGDTAGQSYVTVNNIGGVGARTFEGIKIIDVGGDSAGQFT
LNGRAVGGAYEYFLYQGGASTPDDGNWYLRTEADDRRPEPASYTANLAAA
NNMFVTSLADRMGETLYTDVFTGEQKTTSLWLRNEGSHNRSRDDSGELKT
QDNRYVMQLGGDVAQWSRNAQDLWRVGVMAGYANSSSSTVAQVAGYRSTG
SVDGYSVGIYGSWLADNADDTGAYVDSWVQYSWFDNRVSGQDLATEKYDS
KGFTASVEGGYAFKVGESVNQSYFIQPKAQVVWMGVKADDHTETNGTVIS
GDGNGNIQTRLGAKAFINPSDKAKVSGPAFKPFVEANWIHNTKDFGTTLD
GVTVKQAGTANIAELKLGVDGQVNSQLNLWGNIGQQVGNKGYSETSVVLG
VKYNF
>YPA_3603 hypothetical protein
MQRVLITFGLTLGLSACSSLSDPPQFSASGYIADSGVVRLWRQDNAQQQP
QVLMSVYSPYFGGNTRVTFYEYQNGILREIRRNDLGATPQSVQLRFDEQG
QVSFMQRQLASRRESLSADDIAVYQLEAKRILELSRVLRAGDVRLIQGRW
QDGMLTTCAGKTLRLNLDDNSQAWLTTRGEKNTQPLGVAWLDSSEGQQLL
LVANQDFCRWEPQAGSL
>YPA_1586 hypothetical protein
MSEISTLTIKIPLELKEKIKAAALEKQFSLSTEVCERLAQSFEADIKGSI
SSSKNAKLHHDEIDNQGVEEISEQPLSQKELKKLRQLLKDSAKTAQKKK
>YPA_3434 hypothetical protein
MQPYLIPAMPVTISEEIKKSRFITLLAHTCGVNEAKDFIQQVKQQHPTAR
HHCWAFVVPRKPPARLGVQ
>YPA_3461 hypothetical protein
MKYSNVIVSGLCMSMVSLATAAPTEVPSDEPVTLRIISSMATRQFLTEVI
AQFAQQSKYQVELESVGGVDATKRVEAGEAFDVVILSANAIDKLIDSGKI
LPNSRIDLVKSGVAIAVKEGAQIMDVSSEETVKQAVLAANTIAYSTGPSG
VYLTEVFEHWGIAEQIKDRIVKVPPGVPVGSLVAKGEVELGFQQLSELLH
LKGIIILGPLPTDIQIMTHFSAGVPLKTNQQKAIKVLLDFLASPAATEAK
IKNGMEPI
>YPA_3393 hypothetical protein
MRIIMSKWSEEKSWEWHKQQGWLRGFNYLPRTAVNWTEMWQQDTFDPACM
TQELQWAAEAGYNTLRTNLPFVVWQHDPDGLMTRVEHFLDICQRVDIKVM
LTLMDDCGFSGDHPFIGQQKQPIPGLHNSQAAASPGRNVVMQPYLWGQVE
RYLKEIIGHFAHDERIVIWDLYNEPTNRMIFTLAGEDAFDPALEEYSHQL
MEKAFMWAREVNPTQPLTVGAWHIANILDLGAPIYQHPTDIRALELSDII
SFHAYVPLDLMNKAIGLLKEYKRPMLCTEWLARHAESKMHEQLPLFNAEG
IGCYQWGLVKGATQTHIPWPEIKRTDNDYASRWFHDVFNEEGTAYALNPK
TWTSALLLFYLSPVFCRTHSACFQANALCIVIMDIFYNVTL
>YPA_3825 Cellulase
MFERLLAWTENNLAAGDLTSRLPAWLWGQNSQNNWDILDPNSASDADILI
AYNLLEAGRLWGNRRYLIMGTLLLQRIAQEEVMDIPGLGQMLLPGKIGFN
DEDTWRLNPSYLPPQLLARFSSIDGPWEAMVEVNQRMWLETAPNGFSLDW
VVWQKGKGWQPDTIKPDVGSNDAILVYLWAGMLAMDSPQKAELIARFQPM
AVITQQQGLPPFTTNSDNGKTNGDGAVGFSAALLPFLASSPEPFNQQTLN
LQQRRVQNSPPGADDYYSAILTLFGQGWLQHRYHFTHQGELQPSWHRQR
>YPA_3565 3-isopropylmalate dehydrogenase
MTKTYHIAVLPGDGIGPEVMAQASKVLDAVRQRFGLKISTSVYDVGGAAI
DRHGSPLPAATVAGCEQADAILFGSVGGPKWEHLPPAEQPERGALLPLRK
HFKLFSNLRPARLYQGLEDFCPLRSDIAARGFDILCVRELTGGIYFGQPK
GREGQGMHERAFDTEVYHRFEIERIARIAFESARKRRSKVTSIDKANVLQ
SSILWREVVNGIAADYPDVALSHMYIDNATMQLIKDPSQFDVLLCSNLFG
DILSDECAMITGSMGMLPSASLNEQGFGLYEPAGGSAPDIAGKNIANPIA
QILSLTLLLRFSLGKDDAADAIERAINQALEQGYRTADLAGDGHAIGTHE
MGDIIAKFVVEGV
>YPA_3129 ABC-transporter outer membrane component
MKKLLPLLIGLSLAGFSTMSQAENLLQVYKQARDSNPDLRKAAADRDAAY
EKINEVRSPLLPQLGLSAGYTHANGFRDASNSPDSNATSGSLKLTQTIFD
MSKWRALTLQEKAAGIQDVTFQTSEQQLILNTATAYFNVLRAIDSLSYTE
AQKQSVYRQLDQTTQRFNVGLVAITDVQNARASYDTVLAAEVAARNNLDN
ALESLRQITGVYYPELASLNVERLKTQRPDAVNNLLKEAEKRNLSLLSAR
LSQDLAREQIKSAETGYMPTVDLTASSSITNTRYSGGTPSSQQVNNDSGQ
NQIGVQFSLPLYSGGATNSAVKQAQYNFVGASELLESAHRNMVQTLRSSF
NNISASISSINAYQQVVISNQSSLDAMEAGYQVGTRTILDVLTATTNLYQ
SKQQLADARYNYLINQLNIKSALGTLNMNDLMALNAVLDKPVPTSAAALA
PENTTRQTVTTPRAQ
>YPA_3805 glutathione reductase
METTLMTKHYDYLAIGGGSGGIASINRAAMYGKKCALIEAKQLGGTCVNV
GCVPKKVMWHAAQIAEAIHLYGPDYGFDTTVNHFDWKKLIANRTAYIDRI
HQSYERGLGNNKVDVIQGFARFVDAHTVEVNGETITADHILIATGGRPSH
PDIPGAEYGIDSDGFFELDEMPKRVAVVGAGYIAVEIAGVLNGLGTETHL
FVRKHAPLRTFDPLIVETLLEVMNTEGPKLHTESVPKAVIKNADGSLTLQ
LENGTEVTVDHLIWAIGREPATDNLNLSVTGVKTNDKGYIEVDKFQNTNV
KGIYAVGDNTGVVELTPVAVAAGRRLSERLFNNKPDEHLDYSNIPTVVFS
HPPIGTIGLTEPQAREKFGDDQVKVYTSSFTAMYSAVTQHRQPCRMKLVC
VGAEEKIVGIHGIGFGMDEILQGFAVAMKMGATKKDFDNTVAIHPTAAEE
FVTMR
>YPA_3739 hypothetical protein
MQQSSPITLYQQALDAGGYQPDDVQRRAVARLETIYQALNQYQNVPAASA
SLRNRLGRLFGKPARRPPVSPVQGLYMWGGVGRGKTWLMELFFHSLPGER
KLRLHFHRFMLRVHQELTELQGHENPLEIVADGFKAQTDVLCFDEFFVSD
ITDAMLLATLLEALFARGITLVATSNIPPDNLYHNGLQRGRFLPAIALIK
QHCEVMNVDAGIDYRLRTLTQANLYLTPLNSQTEQAMAAIFVKLAGKEGG
KATVLEVNHRPLPAICVAEGVLAVDFHTLCEEARSQLDYIALSKRYHTVL
LHNVRCMAARDENTARRFLALVDEFYERRVKLIIAAEASMFEIYSGERLK
FEYQRCLSRLQEMQSEEYLSLPHLP
>YPA_3157 hypothetical protein
MAPPVAGECVHQWAGRLRNANLTKDGFQKQFLARSGELKSLARPELVSYL
AECHVEFILIHPFREGNGRLSRLLCDVLAVLAGKGLLDYSLWDEHKAFYF
KAIQAGVSGNYSPMMRLVSDILPD
>YPA_0849 ATPase
MSKTLLPKTLLSVSGLHDQYKLRDISLTLHQGEILGIAGLAGAGKTELCK
ALFGDTPSTLERGELSGKAWRPRSPDRSVAQGLALVPEERRKEGIFIDEG
IPMNLSVAADDSFSRWSLFSRRQELSWAKELIERLGIRTSSPQQKLAHLS
GGNQQKVAIGKWLRGDAQVLIFDEPTKGVDIKAKQDLFSLIDQLAQQGKG
IIYASGEFSELVGLCDRICVLWDGRIVAELNAAEVDEETLLLFSTGGTPQ
>YPA_3454 hypothetical protein
MLGKWICSSMTIQQWCFSFKGRIGRREFWIWMGLWLLAMLVIFTLAGKEW
LPIQSASFALVFLLWPTAAVVVKRLHDRNKAGWWALLAVLAWMLMAGNWQ
MLTPIWQWGVGRFIPTLIFVMMFIDCGAFLGTEGDNRFGPEAVPVKFFAD
KAK
>YPA_3082 flagellar motor switch protein
MIFFLKDYNKSGFSIFIDAPHIDRLIDTIKTKNEKAVEKNVSLSERQLEH
LVKKLPVTLTSQLSNINLTLAELMALKEGDIISASLPEYFPVFIGKEALF
SAAITENRGKLFFSEFNDQPNEMNHD


# Yersinia pestis Nepal516, Nepal516

>YPN_2909 DNA-binding prophage protein
MAKARLHDDAMVQLLMEDPEFAQVYLHQALLDIDEEGGQEAFLMALRHVV
EARGGMASVAKKAGVSRETLYRTLSPSGNPTLKTLLSVVSATGFQFSHLA
SITA
>YPN_2179 acetyl-coenzyme A carboxylase carboxyl transferase subunit beta
MSWIERILNKSNITQTRKASIPEGVWTKCDSCGQVLYRAELERNLEVCPK
CDHHMRMSARARLHMLLDAGSEVELGSELEPKDILKFRDSKKYKDRISAA
QKDTGEKDALVAMKGTLQGMPIVAASFEFAFMGGSMASVVGARFVRAVEQ
ALEDNCPLVCFSSSGGARMQEALMSLMQMAKTSAALAKMQERGLPYISVL
TDPTMGGVSASLAMLGDINIAEPKALIGFAGPRVIEQTVREKLPPGFQRS
EFLIEKGAIDIIVRRPVMRQTLASILSKLTHQPQPSVVESKADTVAQPEN
QADV
>YPN_2337 hypothetical protein
MKGFVLMLSLLMLSANAMAAGKIITVSKFEFGKQWAFTREEVMLECRAGN
ALFVINPATLVQYPLNDIATEQMKSGHVLAKPLDVLLLNDSNNPGQKMSL
EPFQQRAMALCQQ
>YPN_3762 hypothetical protein
MRKEIVSMRIILLLAALLLITFMLITTINHAHADPTNDSSPPKEGAPPIA
PYLLFNAPTFDLTLVKFRESYNRANPTLPINEFHAITVKEDSPPLTRAAS
KINENLYASTALEKGTGKIKTLQITYLPIKGNEEKTAKLLAINYMAALMR
QFEPTLSVVQSLANVQKLLTEGKGSPFYAHTIGAIRYVVADNGEKGLTFA
VEPIKLSLSEA
>YPN_1439 transposase
MDEKKLKALAAELAKGLKTEADLNAFSRMLTKLTVETALNAELTEHLGHE
KNTPKSGSNTRNGYSSKTLLCDDGEIELNTPRDRENTFEPQLIKKNQTRI
TQMDSQILSLYAKGMTTREIVATFKEMYDADVSPTLISKVTDAVKEQVAE
WQNRQLDALYPIVYMDCIVVKVRQNGSVINKAVFLALGINTEGQKELLGM
WLAENEGAKFWLSVLTELKNRGLQDILIACVDGLKGSRMR
>YPN_0524 Icc-like protein
MASGARVRILQITDTHLFAGEHETLLGVNTFHSYRAVLDAIIAEQHPFDL
VVATGDLAQDHSVAAYQNFAKGISRLPVPCVWLPGNHDFQPAMFDALAEA
GIAPSKQVLAGEHWQILLLDSQVFGVPYGELSDYQLEWMERCLIAYPERY
TLILLHHHPMPSGCTWLDQHSLRNAHMLAAILTRYPRVTTLLCGHIHQDL
DLDWYGKRLLASPSTCVQFKPHCTNFTLDAVAPGWRYLDLLPDGGLETEV
HRLDSDEFCPDMDSDGY
>YPN_1128 ATP-binding protein
MMMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHE
EKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFI
ERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVSLHNSSRSVTSVIYGT
TSGPL
>YPN_2800 D-lactate dehydrogenase
MSQFNNEKIQLLLTQLQHIVGTRYLLTGERQTERYRTGFRSGKGVALAVV
FPSTLLQQWQLLQACVAADTIIIMQAANTGLTEGSTPSGDDYDRPIVILN
TLRLNNIQLLDDGKQVIGFPGSTLNQLEKRLKPYGREPHSVIGSSCIGAS
VVGGICNNSGGSLVQRGPAYTEMALYAQIDAQGELQLINHLGISLGNTPE
EILQRLEQGQYGAEDIEQTGQQASDSEYATRVRDIDATTPSRFNADPRRL
FEASGCAGKLAIFAVRLDTFPSEKQQQVFYIGTNQTQVLTELRRIILRDF
KHLPIAGEYLHRDIFDIAETYGKDTFVMINSMGTNNMPRFFTLKGKIDAR
LNKIPHLVDHLTDRVMQGFSQILPNHLPKRLKTYRNQYEHHLLLKMSGDG
ITEAQQHLRAFFATAEGNFIACTADEGKKAFLHRFAAAGAAVRYHAVHAD
QVEDILALDIALRRNEKQWFETLPPEIEKCLIAKLYYGHFLCHVFHQDYI
VKKGVDVKALKEKMLSLLNDKGAEYPAEHNVGHLYLAKPALKDFYQQIDP
TNSFNPGIGKTSKRKRWQQD
>YPN_2393 IS100 transposase
MMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHEE
KLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFIE
RNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAADLLLQ
>YPN_1753 virulence factor
MPIYSNVIPTNSKSTCYWCVPPPISVQIRSRRNPGLIWALTPHDQRITAN
LPLSTPTDAQTHDANIKNYDEAVQRYVGNPGDSWGTLLALDARGVERMIS
YLSKEILRDIKSERLTEQLHELQRELTNNLFTGWYQPSVTDERQQKQRIV
EILLKALQTRTGVHGELLEQLLPSRDELRRLYLQQHLYLSPRLYQQEKKG
VADFFIPLASTEPFSIGIDIDLFSDLPTPTEPSLTPQARHDNDEAEYAAH
VHYYWINHLRQLPDNTALLELLGVTKPTIELLVAEFITASIRLDIARNLR
QALADNEPADLHRAAKADRQVSRALTVLGDFIAWLGFLQISEDKRPDSRI
NRGYKIFAQPPKSVSTLGVSHRLTQLALTPTNSTAFYIYDWLVGLGEMII
QNVGYSASNEISPAQRQQLAAILSVIKPAND
>YPN_0999 hypothetical protein
MLKLLTPFEERGKAQLGVRIGSVNGRLEGDRNKVRFEQTNMARLLLASQI
ERSGADFAVMSGGGVRDSIGAGDITYKDVLKVQPFGNTLVYVDMKGSEVE
KYLAVVANKQPDSGAYAQFANVSLVADGQGVSNVKIQGEPLDPNKTYRMA
TLNFNALGGDGYPRIDNSPSYVNTGFIDAEVLKQYIEKHSPLDAEQYQLK
GEIVYNQ
>YPN_3153 hypothetical protein
MRLLLLIFLLCTWAVSAKTITDITGQRIEIPDNPQRIVLGESRMLYTLAL
LEPGYPAARVVGWPLDLKKYDPQSWALYTDKFPQMTHIPALGSTGLRDIN
PEKVLELAPDLVILPVLAKTTHEEIALISLLKQAHIPVIRVDLRVDLLNH
TIPSLRLLGEALNRQDRAEAFISLYQSHMDRIHQRIAAYQGPHPSVLLQL
HLGRGDECCTTTLKGNLGQLLEFAGGNNIAAAQVKGVFGQLSEEYVLARS
PEIYIATGMATAGSQAKTLALGPQVSTAQAQESFRRVIATQSKLQHLPAI
QNGHAWGLWHNFYLSPLHVVAAEFFAKTLYPHLFADVSPQDTLQTIYQQF
LPLDFTGTFWSHLDNE
>YPN_0746 transcriptional regulatory protein
MEKQMIPQDNELLTEIAVSYYQDEMTQEEIAHKFGISRIKVGRLLKRAKT
EGIVEINVRYHPVFSTRLEQQLIEHFGLKRALIALDYQDEDEQRRQVAAL
VSAYLATNLKDNTVLAVGQGRNVAAIADHVSVVPERHCKFICGIGGTHRP
GDAINADHISRRLAKKFGGTSETLYAPAYVENRALKEAFMQNGTIKETLD
RARKADMALVGIGDMNENSYMVELGWFTPQEIVDASLHQGVTGEIAGYDF
FDAQGQPANTVMNDRVIGISTQELRQIPCVIAIAAENTKALALLGALRTG
AIDILATSASNARTILKMIQHQS
>YPN_3216 siderophore biosysnthesis protein
MNINLDSSSATQATDLLVNLEQQEVRFWLEGERIKFRAPEGVMTGAVMAS
LKALKPQLVAILAVREQQETLPIDTENRFVAFPLSHVQAAYFVGRQSVFE
YGGVGCHGYFEILAPCWDKAKLECIWEQMIIRHDMLRAIITPNGEQRILP
SVAPFVIEQVDVSSETVRQAMSHRVYQTDKWPLFDLKISHQNGKDCLHFS
IDLLIADFLSVQILLAELFKGYLGQTFPAAPLTASFRDVICYERQQQSGR
AYGDARQYWFSRLPTLPACPQLPTCVQLPTGSPSPAMQTNASEIPRFVRY
RHQLAPSCWRRLQQGCQQQGVSPSAFLLTVFSEVIGRWSENRHFTLNLTV
MNRPNIHKDIDKLVGDFTSVSLLEVDLRSPKTVYERVKQIQKQLWQDLEH
RTFSGIEVMREWGRQQGVGHARMPIVFTSALVGEGKNQVQEAISETQLDL
VFDYGISQTPQVWIDCQIMVLQGRLQLNWDVMADVFPKGVVAAMFSTFID
SVERIDSVECIDRVKCLCMEEVGTGEALCDSPLFWHSALSLPLPESQQQR
RKQVNATEKTLTPRCLHDGILARAAQQPQAIALVDPQHCLTYAALITRAQ
ALAAKLNSHQRYAVLMEKRHEQVVAVLGIFIAGSAYVPVDIHQPPARILT
ILSDAAVSGVVTASPQACLADAHFREINLSLLAQIPDAIEAKILPLPHDL
AYVIYTSGSTGQPKGVMISHDAAYNTLADMQQRMALTPDDRVLALARLSF
DLSVFDIFGVLGAGGALIFPDEGDLQNPARWAHDIAQHQITLWNSVPAQM
KMLTDYLRAEQITLPSLRYILLSGDWIPVNLPPAIAQIAPHCTQLALGGA
TEAAIWSNYWRIDPQITYPVSIPYGVPLTNQQFRVVNPWGEDCPDWGAGE
LLIGGRGVAQGYWQDEAKTQAHFFIDTQGLRWYRTGDLGRYTSEGVIEFL
GRRDHQIKVRGYRVETGEIEAQLLKLPEVAQAVVMQTTQAIQTEQAEQAT
QIAQATQATQATQATQATQATQAAQAAQTAQTTQTTQTTQTAQATQIAQT
TQAIQTTQATPTTPTTPTTQATPTTHTELQAYIEAAQVAAASDYTQSLRE
QGRTALAHLQSAAQQAGDALDRPCIDRLFSLLDKVALMQMVKALSDMQTV
NTHHAVTLDAIMEKGQIASVNRQLLRRWLRALTQHQYLQQINDTYRLISV
VAEQEIRLCWQQCEQLITQLDDNRGLLTYLERSSQCLPELLQGKEDPLNL
LFPDGRLDVATQAYQNNLISQFMNRLLLKAAEQKVKNQAQGRVLKVFEIG
AGVGGTSNVLIPLFAEGNAEYTFTDISPFFLNEARKRYQHYDFIHYQLFD
FNQSPVQQGLDCGQYDLVVAANVLHNAIIARHGLNNLRQLLAPGGWLLII
EATRDNYQLMTSMEFKQGLTAFEDERLALDSPFLPQQNWIHALQDVGAEM
LWAYPPTDDVLHKMGQSFIFAQFNSQRMSCEPDALLAYLQQQLPDYMVPA
KLVILDKLPVSANGKIDRKQLPRLPESQQRQAPALQDTDSPLEKTLLALG
RTLIGNNAMGVDDDFFTSGGDSLLITQWINQVREVLGEDKVPWEGCLRQV
LQQPTARALAAYLRNIREEEPESQRPQMAVTCLKESACLKERSEEPVVIL
HEGSGTVLPYLSLVDKLSGPIYGVNVTDSAAFLSIPAERLICQLASTYLP
EIQKVGDQCHLVGYCMGGLLAFEIARQALENGRPVSSLTIISSYQIPYHI
NDPLMIDLVMSWTLGVEPPWNLLKADLEGIYHAILADKPAVITVNAVVER
AKANGLTALANEYACLQQQSNEVRLSQLLTAIEQHSTASSHVSRQPFDVF
RHSIQGVCQYQPNYFAGDMTFVCQQGDTYLLPALQQDMTAFWQTYCLGQI
NRLTLPGDHFTCLSADNISPLIQHLEQCKGVRG
>YPN_2924 hypothetical protein
MFDLLTFTCVKAAVLKDYFLQIGILTGVKMFTALVTLQDLILFLPAETGV
DIVLRVPTQIV
>YPN_2371 PAS
MALYPASFDDHVFRLLIEAVDDYAIYIIDAKGHVLTWNNGAERNTGYCAE
EVIGQSFELFYTPEDLVNGLSVKGLKHANQFGHYESQGWRVRKNGKRFWG
NVTLSALHNSENILQGFVHVTRDLTDQWQRDNALRKSEELFRHLISEVED
CAIYMIDTAGRILTWNKGGGTA
>YPN_MT0008 phage tail protein
MQEIYLTAIKRYPNEACGFLVRTTGEKYRFMEARNVSENPENTFVMHADD
IIAAEDAGDVVAIWHSHTDESADASDADRAGCEATEVPWLILAVRKNVEG
DAPFHFSEMNVITPDGFEMPYLGRPYVFGVFDCWMLCRDYLKREFNVELN
PNPHLHIPSWYTGDTDILDQNYRNEGLVRLAPGTEPQRGDVFFIQYGKMP
DHCAVYIGDGMILHHQIDRLSCRAYYGGMYQKHTTHHLRHRDLLKGDETC
LS
>YPN_2110 chemotactic transducer
MDHYCTVRYTHAPWRAELTSIDNAVPMIIFKPDGTVVQVNKLFLAAMGYQ
KDEVIGKHHKIFCDPQYAASDAYRRHWQLLNEGRPISDNIKRIRKDGNVI
WLQGTYTPVVDRQGNVIEIIKIASDVTERILQSQEHQSLLEALNRSMGMI
TFTPQGIILAANDNLLNVIGYSLADIQHKSHQILCLPEFAHSEEYHQHWQ
RLARGEFIIGRFERLNRRGERVWLEASYNPIMDNEGNVLKVVKIAQDITA
LMLQQEQEENLIRDVHHLSLTTDRSASQGAEIVQQAVRGMQEVESIARET
SDVVSDLGRCSQEIGSIVEAIRKISSQTNLLAINASIEAAHAGEHGRGFS
VVASEVRLLADQSRKAASEIEQMTKTIQNGVTAAIKGMGTCVEQAGGGVI
LTQDAGEVINLVNVGMQDVVKLMTTFSQVKNQ
>YPN_3816 membrane permease
MPPLPIWKYHAMSTQSAELDTAPPSPAHPSELIYHLEDRPPLPQTLFAAC
QHLLAMFVAVITPGLLICQALGLPAEDTQRIISMSLFASGLASLLQIKTW
GPVGSGLLSIQGTSFNFVSPLIMGGLALKNGGADIPTMMAALFGTLMVAS
CTEILLSHVLHLARRIITPLVSGIVVMIIGLSLIQVGLTSIGGGYGAMSD
NTFGAPKNLLLAGAVLGVIILLNRQRNPYLRVASLVIAMAVGYLLAWALG
MLPESRPVVDTALITIPTPLYYGLSFDWNLLIPLMLIFMVTSLETIGDIT
ATSDVSEQPVRGPLYMKRLKGGVLANGLNSMLSAIFNTFPNSCFGQNNGV
IQLTGVASRYVGFVVAIMLIILGLFPAVAGFVQHIPEPVLGGATLVMFGT
IAASGVRIVSRETLNRRAIMIMALSLAVGMGVAQQPLILQFAPDWIKTLF
SSGIAAGGITAIVLNLLFPQEK
>YPN_1964 chemotaxis protein CheA
MDITAFYQTFFDEADELLADMEQHLLLLDPLTPDNEQLNAIFRAAHSIKG
GAATFGFTVLQETTHLLENLLDGARRDEMRLSTEIINLFLETKDIMQEQL
DAYKTSQEPHSESFEYICQALRQLALDALDQPTTEDQPTTENQSATDQHS
TAANTQATASTRVSSSLAGNTSPALLQGGMRIRLSGLKAPEISLMLEELG
NLGEVQDPHQGADSLEATLITSVSEEDISAVLCFVLEPEQISFVSAGDTE
QIYAEQTVESEQSSEPERLFEPAITEASVIAESVQPVATPAATPAVEVQS
TLKADVHAAPANAEQVRPKAKASESTSIRVAVEKVDQLINLVGELVITQS
MLAQRSSSLDPVINGDLLNSMGQLERNARDLQESVMSIRMMPMEYVFSRF
PRLVRDLASKLNKQVELTLLGSSTELDKSLIERIIDPLTHLVRNSLDHGI
EEPATRIAAGKSPVGNLTLSAEHQGGNICIEVIDDGAGLNRQRILAKAQS
QGLAVNENMSDEDVGMLIFAQGFSTAEQVTDVSGRGVGMDVVKRNIQEMG
GHVQVSFQAGLGTSIRILLPLTLAILDGMSVKVSNEVFILPLNAVMESLQ
PLAEDLHPLAGGERVLQVRGEFLPLVELYRVFDVKNAKTEATQGIVVILQ
SAGRRYALLVDQLIGQHQVVVKNLESNYRKVPGISAATILGDGSVALIVD
VSALQALNREKRVAADEVVTA
>YPN_2589 hypothetical protein
MQRMQCIYIATWLIAVFTSPIVIANNTNINGSTNNNGNGTINIFDASSNN
DIHTLTGLGNEQLGGFSNHLIDSHNNTIDGGQSNNLVSSDGNMISAISLG
DGLFYGAQNNTLINSNNNLLIVTQGSTIIDSDSNTVSGISNNLIESNSNI
IGNENSCYSDPASPSGAWCVDNQNTLIGSDNNTITGALNGLHNSHNNDII
ASSVNNLMDTHNNIIAGGHYNTISGGGNNDIFGSENNVTDSTDANINGSN
NYVIDGNGIGRDDLTEDNSILGGSGNMGVGDSVTAITNSVVFGGNTSGNS
TGSTLTDSVSVSGNGTSGNNVVNIGGAANGNNSASLGTGSVSSEGGIALG
SGSIATRNDELNIGDRQITSVKKGVENTDTINVSQLNDSFDDVLNLSNEY
SDNSFSTVTENINNYTDASLDTVLNTTGEYTDNSILLVTNESNNYTDNGM
ESVSNYANIYADESLLAIYNEEANYMSNLIDVTLNNANNYTDLSVNTIIY
TGKQYTDSRINEYQRTFKNEFLTYSNGKFGGFDKDINQKQKQLNAGIAAT
MAAAVIPQKSGSKVSIGVGLAGYSDQGAGSVGAIWHVNQRITMNTTMTYD
TQRGVSLLTGLSIGI
>YPN_3006 methyl-accepting chemotaxis protein
MFSRMNYVVKKYFGYIMSKLSLSESRLGILFGFTLVISLFSLLQIFSIGY
LSHILKSTKANVEITHRNHQQEALMDRARMELLIASDKLNRAGIYYMEDK
ETGSEGSWQSLLNEALQSLQQSQDSYLQLQAVSAQDSRQELNELKESYHQ
LYQGLAEIAQGLAQKHNIDTFFDVPIQGFQSDFTEKYYRYLQESEKGSTA
MDEQLLSSLSSAKQIIIGALAILLVLAFSAWLGVTRWMIAPLNHLISRIN
KIAAGDLSHRIEDTSFACREIRQLTYSVRHMQEGLVALVSQVRGGAEVIL
TGVNQITADNHRLSEQTHSQTQSLAVTAQSMQQLTVRVKQNSMSAEQANH
LVNEARDTASQGGEMMSNVVSSMADISTGSREISEIITLIESVAFQTNIL
ALNAAIEAAHAGEHGRGFSVVAREVGTLSHQSGHSAQNIKRLIQHSANSV
STGASLVNRSGENLQAIIDVVKKVTDLMAEITTASHYQSKGIEEMAAQVE
MINGATKQNAALVEQSAHASEVLQQQTLQLNQSVARFHLPVEKYSPLRLA
KESTIAAIDSRVS
>YPN_0047 ABC transporter protein
MDAMESCKTPGASSVPPTILSVLAGNKKILWGIGLFTAVINLLMLAPAIY
MLQVYDRVLASANTMTLLMLTVLVLGVFVFIGLLEWVRSAVVIRLGTQID
MQLNQPVFNAAFAANLKGHNTPAAQALNDLTVLRQFATGNALFAFFDAPW
FPLYLLVIFLLHPWLGMLAAAGAGILVVLAWLNQWICKKPLHDASIITSH
ATQQANANLRNADVIEAMGMLKALRERWLMQHANFLYQQNLASDKSSRVT
AVAKSSRQALQSMMLGLGALLVIYNEITAGVMIAGSILIGRVLGPIDQLI
AVWKQWSHARLAYQRLSQLLAQHPSSPTGMVLPAPQGKLNVTQLMACKPG
THIPVLHSINFELQPGDVLGILGPSGSGKSTLAKLLVASQPTFSGTVRLD
SADLSRWDKTQLGEFIGYLPQNIQLFRGTVAENIARFGAIDTAKVVAAAQ
LADVHDLILHLPQGYDTPLGDDGEGLSGGQRQRIALARAMYGIPRLIVLD
EPNASLDKEGEQALLASIIQLKQQGCTIVMITHKPELLSGSDYLLFLKNG
QMDLFDRTQAVLQNIQGKDKPAVQPEAKILNSRSGWSNGVSYGIGPARTT
SSPKP
>YPN_2839 molybdopterin (mpt) converting factor, subunit 2
MENTRIRVGAEAFSVGDEYTWLSQCDEDGAVVTFTGKVRNHNLGASVSAL
TLEHYPGMTEKALTEIIADARSRWSLQRVSVIHRVGPLFPGDEIVFVGVT
SAHRSMAFEAAEFIMDYLKTRAPFWKREATVEGERWVESRDSDHIAAKRW
>YPN_1752 virulence factor
MKSLTPQLLNGQQLHDQLQQVTQGIDQAIDWVKSTRQHAVRLDREAEHLT
IKLRRMRNKAHPLSETALTAMTIGFVGQSQAGKKHLISALAANEHGRLEN
TLGGKKLDFWQQIRPEYQTTGLVTRFSYQTEGHQTHGHESRATNEAYPVQ
LTLLSEVDIAKIIARAFLLDCPQENPAYSLDEQQITEHLRQLMMHRRPII
NAGINSDDVIALWDSLAHHDTQRQKELATHFWPTAIELAPYLSIDDRAKL
FSLLWGENDPLTDAYRHFSYILQHLSGTQKLLAPLSVLVDDTLLPANGVM
NIATLGDLNTPADNPIQVLPLINGHTAKCVTLSQAELTLLAVELKIPLDK
PARESAFESVELLNFPDARGSQTIPALMENAAYPLASLLSQAKNAYLLER
YTNQQQINLLLVCTATDQRSDTLAS
>YPN_2370 ferrichrome receptor protein
MGINTMNQSLSISTEPKRSAPRLLCVMIGVALGSLSASSWSAAVANSTKT
ASAVAETNSAESSNKADTITVVGTPDTFRAGGNDLIPTYLDGQVANGGRI
GFLGQQDARNVPFNIIGYTSKMIEDQQAATIADVVKNDASVQNVRGYGNP
SQNYRIRGFNLDGDDISFGGLFGVLPRQIVSTSMVERVEVFKGANAFING
ISPSGSGVGGMINLEPKRAGDTPLTRVTVDYGSAAQVGGGLDVGRRYGDN
DQYGVRVNVLHREGETAIKDEKNRLTAVSTGLDYRSDRARTSLDVGYQKQ
TIHHMRTTVGVGAVTVIPEPPSTTLNYGQPWVYTDLETTFGMLRSEYDVS
QNWTVYGSIGASHNEETGQYGSPILTDNSGNATISRLYVPYISESIAGLG
GIRGHFDTGFITHKVNLGYATNYRTTKSAFNMSESINTNIYNPGVIPFPP
TSTNPNFYSPNQDPTLTSQVRASGFSLSDTLSMMDDKVQLMLGVRRQDVT
IRNFNNGVPNSAGSLDAMKVTPIYGVLVRPWEKVSLYANHIEALGPGKSA
PNEFNGKPVVNKGQIPGIVHSKQNEVGVKFDNQRYGGTLALFEITRPTGT
VDPATNTYGFYGEQRNRGVELNVFGEPVLGTRLLGSAVWLDPKLTKTQNG
TNNGNYAVGVARYQLVFGGEYDIPSVEGLTATGTLTRSGSQYANDANTLK
IKPWTRLDLGVRYTMPMKETNLIWRANLENVTNERYWESIEDSGVYLYQG
DPRTLKLSVSMDF
>YPN_3330 enhancing factor
MRQVTKSKKIKILPYPEWLVKAGMSKGIDHDRQHLGIILAAGEMIKVRQV
NAEYKEKLKLYLLNDNKNTQRSISFNTDWIELSVDAISVPFINTPYSDGI
IPEIVFEYPDTSKLLPVYEKGEDESIFFESWDKQNAEFGVVESEYVIILI
PEVSKDRLKSFSTSGGIDTVLGFYQDIFSFNNSLAGLSFEPQRYSDGNTR
NRYFAKADKGGGGAAYYSNNWIASSSGSINTFWLSPNATNWGCLHEIAHG
YQGGFIDDKYFSTREVWNNIYAACYQDVMLGAEKFNKGWLYNFGKQKEVE
KSILNNINNGKEVNAWGERSKLYFIMLMLEKAGVNCFTHFNQIYRERKNT
DQSVGSVLDMLSNSFATVENRVDVTLFLQMVGGHISKNQYQRNLFSHAKA
VYPLNQLLQGDELTAVKKTLDLNSELSLVDVDSIEFTDIKSNLTLQFSID
DFAQIYGEELLILNGDNYVYNQSIMDKKQTLYNLPVGAYTLRIPTGRNKK
YAPQINYLIVKNEDSQTQVDFVHRIGSPIVSQKISLQGLNDYTFATIFFD
QENDTVSVDITKTKPHYHFPGMYARIRIKDKDNNELLNEVITGTNQSLSK
NDFPLSSCYVIDIFHKEPGRVKLTPAYKGVIDNKSSYNEFIITPYGLQNI
KLNNDPKVFLLENIKSAAEMLRSHPIMWHANFSEAKDNIYLAIDIFPSPQ
KEGLLEQYADCISSYYEKPNEDLGNAFSFIFKGINDREFLTTKLNLVTKK
LEVKIVSGTPHSGFKRTYAVLRYFNADGNELLNLDIIGSKKQAGQEWVFP
ISGYGGEKLYLQHVEPKNRLVITNIMQGIRLSSRTSIQTYEIESLGLSRC
T


# Yersinia pestis biovar Orientalis str. IP275, IP275

>YpIP275_pIP1202_0197 hypothetical protein
MKNEYREIESTLDLLLMVLSDSFSESESIEVQEFIDVGEYGIALETIIDI
INEESKNITNEAEFLIEKAGRIMNMDTTSIVDKISKHIDK
>YpIP275_pIP1202_0219 hypothetical protein
MKDHEEFSTLSAAERRELIIAELKRKSRIRTLLRGLPLDEVREIIDRMKG
VLNELEEEYKKREEEEKEKRAQAERIMSDMESCGVDIGLLNEMFTSRSEP
DNAKYSKDGVSWSGQGRRPDAFKGLGAVELERYRIPQKK
>YpIP275_pIP1202_0181 S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase
MKSRAAVAFGPGLPLEIVEIDVAPPKKGEVLVKISHTGVCHTDAYTLSGD
DPEGLFPVVLGHEGAGVVVEVGEGVTSVKPGDHVIPLYTAECGECDFCTS
GKTNLCVAVRETQGKGVMPDGTSRFSYNGQPLYHYMGCSTFSEYTVVAEV
SLAKINPQANAEQVCLLGCGVTTGIGAVHNTAKVQEGDSVAVFGLGGIGL
AVVQGARQAKAGRIFAIDTNPSKFELAKQFGATDCINPNDYDKPVQQVLV
EMTKWGVDHTFECIGNVNVMRSALESAHRGWGQSVIIGVAGAGKEISTRP
FQLVTGRVWKGTAFGGVKGRTQLPGMVEDAMSGKIELAPFVTHTMELDKI
NEAFDLMHDGKSIRTVIHY