GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Gene type: CDS
Genomic element: chromosome

Number of genes found: 353

# Escherichia coli O157:H7 EDL933, EDL933

>Z1573 unknown in IS600
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>Z0366 unknown protein encoded in ISEc8
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1159 unknown in ISEc8
MEQKILSSEPRRSFSNEFKLQMVKLASQPGAXVARIAREHDINDNLLFKW
LRLWQNERRISRRLPVTTSSGAGVELLPVEITPDEQKEPMAALTPLLSTP
SQSTVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR
>Z2254 partial H repeat-associated protein of Rhs element
MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF
GETHPDVLK
>Z4064 hypothetical protein
MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVA
CIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLL
YQAKLALTEDLRLKVVRKMYELRFREPPPARRSVDQLRGIEGSRVRQTYA
LLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGY
APAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLA
CRDIFRSTKLTGKLIPLIEKVLAAGEIEPPQPAPDMLPPAIPEPETLGDS
GHRGRGG
>Z1207 partial transposase
MATYGGQFTLTDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWY
NNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>Z3115 putative endonuclease encoded within prophage CP-933U
MLIDLVLPYPPTVNTYWRRRGSTYFVSKAGERYRRAVALIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLMDDEQFDEINIVR
GQLVPGERLGIKITELECA
>Z1123 unknown in IS1N
MALICELDEQWSFVENKARQQWHWYAYKTKADGVLAYTFGPRTDETCREL
PEFLKPFSAGMITRDNRSSYTREMPQDKHLVGKIFTRRIERNNLTLRTHI
KRPARKTICFLRSLEIHEKPLVHLSKTHVLLTGVITRASFAVFLP
>Z2101 putative endonuclease encoded within prophage CP-933O
MRIEFVLPYPPTVNTYWRRRGSTYFVSKAGERYRRDVALIVRQQRLKLNL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLIDDEQFDEINIVR
G
>Z4338 unknown protein encoded by ISEc8
MISLPSGTRIWLVAGVTDMCKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDTVKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>Z5815 putative transposase
MKKETDIRRGRHCVFLMHVHLVFVTRYRRQIFDHDATEKLRTYFSKVCAD
FEAELVEMDGEPDHVHLLINYPPKLAISSLVNSLKGVSGRLLRRDRPDIA
VRYYYKGVLWSPGYFASSCGGAPISAIRQYIEQQQTPG
>Z0067 ATP-dependent helicase HepA
MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPV
TRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREV
FLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQ
RTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAA
ERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQ
LVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIE
QLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYR
PVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR
QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV
SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTS
HRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWF
AEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRI
GQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDL
INYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQ
ALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPD
FPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST
ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGN
NLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSAR
ALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESL
DQAGWRLDALRLIVVTHQ
>Z1866 putative integrase of prophage CP-933X
MAASPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQ
VATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYNSIQEDR
LQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGH
NRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPE
WQAIFDSVSRRQPYLKCGMLLALVTGQRLSDICNLKFSDIWDDMLHITQE
KTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRG
DQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQ
KLLGHKSRKMTDRYNDDRGKDWIIVDIKTA
>Z6019 putative transposase fragment
MATFFAYPADIRKVIYTTNAIESLNSVIRHAIRKRKVFPTDESVKKVVWL
AIQAASQKWTMPLRDWRMAMSRFIIEFGDRLDGHF
>Z1122 hypothetical protein
MVHNGAGVRDSSRTLKVDINTVILTLKNAHHVK
>Z4313 putative pathogenicity island integrase
MALTDAKIRAAKPTDKAYKLTDGAGMFLLVHPNGSRYWRLRYRILGKEKT
LALVVYPEVSLSEARTKRDEARKLISEGVDPCEQKRAKKVVPDLQLSFEH
IARRWHASNKQWAQSHSDKVLKSLETHVFPFIGNRDITTLNTPDLLIPVR
AAEAKQIYEIASRLQQRISAVMRYAVQSGIIRYNPALDMAGALTTVKRQH
RPALDLSRLPELLSRIGSYKGQPVTQLAVMLNLLVFIRSSELRYARWSEI
DIDNAMWTIPAEREPLPGVKFSHRGSKMRTPHLVPLSKQVVAILAELQTW
AGENGLIFTGAHDPRKPISENTVNKALRVMGYDTTQDVCGHGFRAMACSA
LIESGLWSRDAVERQMSHQERNGVRAAYIHKAEHLEERRLMLQWWADFLD
ANRERFISPFEYAKINNPLKQ
>Z6016 unknown protein encoded in ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z0700 putative receptor
MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDL
GETHLNFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS
DDKDVIAIDGKTLRHSYDKSRRRRAIHVISAFSTMHSLVIGQIKTDEKSN
EITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG
RLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDF
TFEWKGLKKLCVAVSFRSIIAEQKKESEMTVRYYISSADLTAEKFATAIR
NHWHVENKLHWRLDVVVNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF
KAGLRRKMRKAAMDRNYLASVFAGSGLS
>Z2127 putative IS encoded protein encoded within prophage CP-933O
MARGKAAITFFREPPATSCDSRCSCRTARIARRGPGNPQYQLLGNVPARA
APLQWQCQRKAPDSADTGTEAMIPLPSGTKIWLVAGITDMRNGFNGLAAK
VQTALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWP
SARDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML
>Z1802 unknown protein encoded by prophage CP-933N
MLTTQKRKFALALMSGKNKTASAIAAGYSAKTARVKGSQLAKDPEVLAFI
ARKQCETVEVDEVPVYRQKKSEQEDKPRRREAAAIPQPDENNPEMPPSAV
MSPGIEYMEDGLPDPVKAMGRLLVENINTDPRLALDAAYKLAQFTHHKKG
DAGKKSAKGDAAKKAANRFAVPPPPRLVVNNDNEGNG
>Z5028 hypothetical protein
MKQQEEHNNKIDLLEKQQAQLKSQLETIQKQQTGIISSTKTLTHVIKSVK
DQQNTFIFTEFNPAKTKYFILNNGSVALAGRVLSIDATENGSVIHISLVN
LLSTPISNIGFNATWGGEKPVDAKEFARWQQLLFNTSMKSTLKLLPGQWQ
DINLTLKGVSPNNLGYLKLAINMENIQFDNLPSAENRQKRSKK
>Z2111 putative transposase encoded within prophage CP-933O
MARCTVARLMAVMGLAGVLRGKKVRTTXSRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLNALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z2430 putative transposase for IS629
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z2073 putative transposase within CP-933O
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z4375 hypothetical protein
MTNLTLDVNIIDFPSIPVAMLPHRCRPELLNYSVAKFIMWRKETGLSPVN
QSQTFGVAWDDPATTAPEAFRFDICGSVSEPIPDNRYGVSNGELTGGRYA
VARHVGELDDISHTIWGIIRHWLPASGEKMRKAPILFHYTNLAEGVTEQR
LETNVYVPLA
>Z4315 unknown protein encoded by ISEc8
MNSQTKKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FIASGIAWPLPDSVSLAQLDAILYANRKKELTEPQISEGTWRKERRTSYS
REFKIRLVKQALQPGAVVARIAREHGINDNLLLKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKPDECPESDPGNVPRCELHLKSGV
VKLFDPLTPEMLRALIREMKGGTR
>Z1572 partial putative transposase
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRK
RTA
>Z4318 hypothetical protein
MVLLEHEYVMAKLINISASTEERMTPGERRVASRLESFLNDDCFVWYDIP
VGRKNRHPDFVIIDPDNGLVFLEVKDWTVSTLRKANQEQVTLETDGLLKS
EINPLVQVRRYACDTVNALPADPCLRQNDGQYKGRLNLAWAYGVVFTRIT
RQQLKALTGNNENAVEKIFPSAQTICQDEMTQSVLPEVFRQKIAGMFTTG
FRTRVTPRMRDILRAHLFPEVTVKQNSQIKIMDIQQEILARNIGDGHRVI
HGVAGSGKTMILLFRCLYLAETTSGSYAAPVQLPGGRFHTGYCRSPPSHC
GSIRSERPIASADR
>Z1647 partial transposase
MATYGGQFTLTDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWY
NNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>Z2376 putative IS629 transposase within prophage CP-933R
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDXLAA
>Z1602 unknown in ISEc8
MFSGLFAMLTPDNVFLVVKPVDMRRGIDTLTQYVQNELNAAWHDGAAFVF
TNKVRSRIKVLRWDKHGVWLCTRRLHRGSFRWPRKGDATWHLTQDEFHWL
VFGVDWQQVKGHDLAKWVYQ
>Z1934 putative transposase for IS629
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAXRPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z2562 putative transposase (partial)
MWRRAAPTLRIRAPPKDKKMATIHNALDECSTEHPVFYEDEVFIHLNPKI
GADWKLLGKQKRGVTPEQNEKYSLDVALHSGTG
>Z1160 unknown in ISEc8
MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLDENPFSGHLFIFRGRR
GDTIKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>Z1571 unknown protein encoded in ISEc8
MIPLPFGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1827 putative IS encoded protein
MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDTGCLEIQGASLVIYTT
NAIESLNSVIRHAIKKRKVFPTDDSVKKVVWLAIQAASQKWTMPLRDWRM
AMSRFIIEFGDRLDGHF
>Z2851 putative enzyme
MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTG
KTYAYLAPALRAKKKVIISTGSKALQDQLYSRDLPTVSKALKYTGNVALL
KGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVS
VAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHL
FLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQSLSSRQLLDL
AKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLA
NPQIQRAFLLLDDTLELCYDVAKLSLGRSALLDAAFERATLYRTRLKRLK
EINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS
VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVPRNLPQTNQPGSAR
QLAAMLRPIIEANNGRCFMLCTSHAMMRDLAEQFRATMTLPVLLQGETSK
GQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDPLL
KARMEDCRLRGGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDNR
LVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR
>Z3161 unknown protein encoded by IS629
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1863 putative phosphohydrolase
MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAAR
ELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSD
IDCCRWVSAEEILQASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTK
GVI
>Z6022 putative integrase fragment
MAAMLNTYVAEGKAVSARVIRSTLVDVFRGAIAEGHVATNPVTTTRAAKS
EVRRSRLTANEYVAIYHAAEHLPIWLRLSMDLAVVTGQRVGDLCRMKWSD
INDGHLHIGQSKTGAKIAIPLALTIDALDISLVDTLQKCREASSSETIIA
STYHEPLSPATVSRYLTKARNASGISFDGDPPTFHELRSLSARLYRNQIG
YKFAQRLLGHKSDSMAAHYRDSRGREWDKIEIG
>Z1561 hypothetical protein
MVHNGAGVRDSSRTLKVDINTVILTLKNAHHVK
>Z1444 putative serine/threonine kinase encoded by bacteriophage BP-933W
MLTPYKRADVEFEWISDLEEQGCFSKVYLAHDRHLAHDLVIKEIEKKENT
NHDDYFNEARLLYKHAHPNIVQVQYAAQCESNIYIAMPFYHNGSLNQLMK
KNNLTSREIIRYSIQFLSGLYHIHSKGLMHFDIKPNNIMISNRNEAMLSD
FGLSQLVNEESRAAPEFGYHFHVPPEYFSLSTNDYNFTYDIYQAGLTIYR
MCVGHDNFERERSAFSTIEQLRESIINGCYPLKEYPPHIHKKLITIVNKC
IHVDPNERYQSVLDVLNDLSAISDGVLDWRLQMTKPTNGTCEWQKKSGDA
ILSIVFDAENSSTTGFRLYDDGRKRRATNLTISSGCTPTKLYRLLKDN
>Z5890 partial putative integrase
MSILMSIFADSILLVYSRDTNGRKTIMALTDTKVRSAKPEEKEYSLVDGD
GMSLLVKPGGSKYWRFRFRFGGKQHLMAFGVYPDVSLADARKKREEARKL
VAAGIDPREHKRAVKEEQAKEIITFEKVAREWLVTNQKWSEDHANRVKKS
LEDNIFPTIGTRNIAELGTRDLLIPIKAVEKSGRLEVASRLQQRTTAIMR
YAVQSGLIDYDPAQEMSGAVASSNRQHRPALELKRIPELLDKIDSYTGRP
LTHCTTELTLLIFIRSSKLHFARWSEIDFETSMWTIPLDWYCSKQRVGLV
Q
>Z4200 hypothetical protein
MMADVQEEGKPQLWNHKQNDALGLYLDLLIQAINTGTINAEDWQKGDRLK
SVALLIAYLDKANFYVMEDSGAWEEDARLNTSSVALVTSGLERLSNLLSK
KDSVFVSDLLREAKVNELDETLSTTRLNHLIDKGYERITLQLDLGGESPG
YLEKDKHYREADAALLNVIYPANLSKINTRRKEQVLKIVKKLAGPYGIKR
YEKDNYQSANFWFNDIKTDTDQNSHAKREKSFIPSTEAEWFFDSWYAKSA
AIVYKESRKEEYLNDSVQFMNRSLAQITGENMIGANGRSVPEMALPESYN
YIHKSGTLHEAPSPIIPLNWSKASMTLMLKEMSNLINDEGIK
>Z4803 putative ATP-dependent DNA helicase (together with adjacent 3 orfs)
MEKHAAELLLQRMLGNTTATFREGQWEAIDAVVNQRRKLLVVQRTGWGKS
AVYFIASKIFRDRGAGPTIIISPLLALMRNQVAAAERLGITAETLNSTNR
EEWQRISDKLLQGEVDCLLVSPERLANQDFIETVLYPIADHIGLLVVDEA
HCISDWGHDFRPDYRRILDILRNYLRIPLFWVQPRQRITVSLRISVSNWV
TL
>Z1600 unknown in ISEc8
MSRKYLIRITELERLLSEQAEALRQKDQQLSLVEETEAFLRSALARAEEK
IEEEEREIEHLRAQIEKLRRMLFGTRSEKLQREVEQAEAQLKQREQESDR
YSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGSELDY
LGEVSAEQLELVSSALKVIRTVRVKKACTKCDCIVEAPAPSRPIERGIAG
SGLLARVLTGKYCEHLPLYRQSEIFARQGIELSRALLSNWVDACCQLMTL
LNDTLYRYVMNTRKVHTDDTPVKVLAPGRKKAKTGRIWTYVRDDRNAGSS
EPPAVWFAYSPDRQGKHPVQHLRPFRGILQADAFSGYDRLFSAEREGGAL
TEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESER
LAVRQQRSKVLLTSLHEWMVEKNGTLSKKSRLGEACSYVLNQWDALCYYS
DDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRL
NGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK
>Z3155 unknown protein encoded by ISEc8
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z4799 putative DNA processing protein
MDADSSNSTVTPGRLPDGLSSCPCFYLGKKLMNLSANAQATLLLTSDFSR
AAASKYKPLSNSEWGKFALWLKHQRISPAELLVPQPQEKLTGWSDPRISQ
ERILGLLARGHSLALAVDKWQRAGLWILTRGDADYPVRLKNRLRTDAPPV
LFGCGNKALLQAEGMAIVGSRDAPTDDLRYTQQLAAKLAQQGICVISGGA
RGIDECAMASALEAGGTAVGVLADSLLKTSTLVKWREGLIAGNLVLISPF
YPEVRFTVGNAMARNKYIYCLAESAMVVRAGMTGGTITGAMEALKHQWLP
VQVKPNQDMQSANSRLVENGASWSAEQAENVTIRLPDVPGLMYDRALRNA
QPELFSLHEDDANYAVMPAYTPVDFYQLFVAELAILAKESISIERLASCT
GLTIEQISVWLNRAEEEGRVIRLGEGHYQFR
>Z2981 IS629 transposase encoded within prophage CP-933T
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFSGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTERLKEAKLLASTGSTGDSYDNVFY
>Z1020 putative ATP-dependent helicase
MALTAALKAQIAAWYKALQEQIPDFIPRAPQRQMIADVAKTLAGEEGRHL
AIEAPTGVGKTLSYLIPGIAIAREEQKTLVVSTANVALQDQIYSKDLPLL
KKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTPNNQ
EEQKRCAKLKGDLDTYKWDGLRDHTDIAIDDDLWRRLSTDKASCLNRNCY
YYRECPFFVARREIQEAEVVVANHALVMAAMESEAVLPDPKNLLLVLDEG
HHLPDVARDALEMSAEITAPWYRLQLDLFTKLVATCMEQFRPKTIPPLAI
PERLNAHCEELYELIASLNNILNXYMPAGQEAEHRFAMGELPDEVLEICQ
RLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEAQS
KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWR
SIPHIIVTSATLRSLNSFSRLQEMSGLKEKAGDRFVALDSPFNHCEQGKI
VIPRMRVEPSIDNEEQHIAEMAAFFREQVESKKHLGMLVLFASGRAMQRF
LDYVTDLRLMLLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDL
KGDMLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEVQSLPSASFNL
IQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEG
IVKKKEKTKSPRRRRR
>Z5816 putative virulence protein
MKRLQAFKFQLRPGGQQEREMRRFAGACRFVFNRALALQNENHEAGNKYI
PYGKMASWLVEWKNATETQWLKDAPSQPLQQSLKDLERAYKNFFRKRAAF
PRFKKRGQNDAFRYPQGVKLDQENSRIFLPKLGWMRYRNSRQVTGVVKNV
TASQSCGKWYISIQTENEVSTPVHPSALMVGLDAGVAKLATLSDGTVFGP
VNSFQKNQKTLARLQRQLSRKVKFSNNWQKQKRKIQRLHSCIANICRDYL
HKVTTTVSKNHAMIVIEDLKVSNMSKSAAGTVSQPGRNVRAKSGLNRSIL
DQGWYEMRRQLEYKQLWRGGQVLAVPPAYTSQRCACCGHTAKENRLSQSK
FRCQACGYTANADVNGARNILAAGHAVLACGEMVQSGRPLKQEPTEMIQA
TA
>Z5088 unknown protein encoded by IS911 within prophage CP-933L
MISSPQHKTGDLMNKKTKRTFTPEFRLECAQLIVDKGYSYRQASEAMNVG
STTLESWVRQLRRERQGIAPSATPITPDQQRIRELEKQVRRLEEHNTILK
KATTALLMSDSLNGSR
>Z6021 hypothetical protein
MLSPSSINLGCSWNSLTRNLTSPDNRVLSSVRDAAVHSDSGTQVTVGNRT
YRVVVTDNKFCVTRESHSGCFTNLLHRLGWPKGEISRKIEAMLNTSPVST
TIERGSVHSNRPDLPPVDYAQPELPPADYTQSELPRVSNNKSPVPGNVIG
KGGNAVVYEDMEDTTKVLKMFTISQSHEEVTSEVRCFNQYYGSGSAEKIY
NDNGNVIGIRMNKINGESLLDIPSLPAQAEQAIYDMFDRLEKKGILFVDT
TETNVLYDRMRNEFNPIDISSYNVSDISWSEHQVMQSYHGGKLDLISVVL
SKI
>Z1853 unknown protein encoded by prophage CP-933C
MARPPKAPAYLDDIAVKQWREKSRQLAERGDLTPADWSNLELYCVNYSIY
RKAVADLAARGFSIVNSQGGESRNPALSAKSDAERVMIKMASLLGFDPIS
RRKNPPETEEEDELDRLE
>Z2060 putative DNA adenine methyltransferase encoded by prophage CP-933O
MLNTVKISSCELINADCLEFMRSLPENSVDLIVTDPPYFKVKPEGWDNQW
AGDEDYLKWLDQCLAQFWRVLKPAGSLYLFCGHRLASDTEIMMRERFNVL
NHIIWAKPSGRWNGCNKESLRAYFPATERILFAEHYQGPYRPKDAGYEAK
GRTLKQHVMAPLIAYFRDARAVLGITAKQIADATGKKNMVSHWFSAGQWQ
LPNESDYLKLQALFARVAEEKHQRGELEKPHHQLVDTYASLNRQYAELQS
EYKHLRRYFGVTVQVPYTDVWTYKPVQYYPGKHPCEKPAEMLQQIISASS
RPGDLVADFFMGSGSTVKAAMALGRRATGVELETERFEQTVREVQDLIIR
NG
>Z1448 regulatory protein Cro of bacteriophage BP-933W
MQNLDEPIKGVGIPEVAKACGVSERAVYKWLKNGFLPKTEFFGKTKYASK
IEEISGGKYQASEMLEISKKNLLAA
>Z3297 putative transposase for IS629
MMPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQHDDW
LKREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTIHHSDKGSQY
VSLAYTERLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLGRLGHAPPAEAEKAYYASIGNDDLAA
>Z2130 putative IS encoded protein encoded within prophage CP-933O
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z1222 unknown in IS
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1199 unknown in IS
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1826 putative IS encoded protein
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQY
>Z2081 putative IS encoded protein within CP-933O
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1338 DNA replication protein DnaC
MKNIAAAGVLERIRRLAPQASVPPYRTVEEWREWQLAEGRKRSEEINRQN
HQLRVEKILNRSGIQPLHSKCSFANYQVQNDGQKYALSQAKSIADELMTG
CTNFVFSGKTGTGKNHLAAAMGNRLMAKGRSVIIVTVSDVMSVLHDSYDN
GKSGEKFLQELCSVDLLVLDEIGVQRETKNEQVVLHQIIDRRTASLCSVG
MLTNLNHAAMSTLLGERIMDRMTMNGGRWVTFNWDSWRPNVSNMRVVK
>Z3945 hypothetical protein
MGYKVKKFIMSSGERGCLILDKKSNLPTYYQNLFLTTDIRNRGATASTME
IVATNLLIFSNFLDGRGIDIIERVELKKYLSVAEIDDLVRYAKQRFDRQK
IINIKSTNYRFIAKRTFSYRIHVFSRYLKWLCGLVHSSKGIHAKYEVDVF
IESIRAHIPRNSSLNMNEISEKSLNEEEIKVLFRLLEIGGIENPFHKEVQ
VRNRLIFTLLLNLGLRAGELLNLKIDDFDLRDNTLSVVRRHDSKEDGRSY
QPLVKTGERVIPLSDELAREIFDYISNSREKMTKRKKHNFLFVAHCTGKN
AGEPLSISAYEKVISTLKRASPELYNLSGHRLRHSWNYMYSKRNRRC
>Z1132 unknown protein encoded in ISEc8
MIPLPFGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z2092 unknown protein encoded within prophage CP-933O
MKIKHEHIRMAMNAWARPDGEKVPAAGITQAYFELGMTFPELYDDSHPEA
LARNTQKIFRWIEKDTPDAVEKMQALLPAIEKAMPPLLVARMRSHSSEYY
REIVERRDRLVKDVDDFVAAAIAWGTLTNSGGQPGNAVVVH
>Z1589 unknown in IS
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKAAVDSICQCNTPFN
YLFHCPRRYRLSV
>Z4335 unknown protein in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z5200 hypothetical protein
MVDNVTVSRVCIQSPSFVPDLDGEKNKSQLFVDDIVAYLKSPSVYSLEKE
GPLNHFVNHCSEVELGFYSDGAYSILVSRSKQQPEGMILTVSDADAINIV
HISVSPVLIKFLDDIFTCLHTYPDDESFTKEQIKANSKYDIVDYNCLLHF
TGKPKSLIECRHFALQYCIDSMNEHTGKVPLKAYYSSPEDIQKHIPFELE
QQFNNLQKNPPPGTCVVASDKFGEALSVFFHRMEKEKLTHMTAIVQSQTH
AMAVRLRIKKTPAGETEYVVSFYDPNATNTAVRYKANNCDSFGSLQSFIN
IQQAKQKWVITDICSECVGITPYLPREQAHLLSGIENELQPPLSPPALFL
LMRMGIYKNIVLFFDKLKNSQEMTASKALDILAAKSPEGIYGLCVLLYHN
TIDKFNDYITNLKELTRKYNFSQEDLETLLLAKDNLGVSWIPRALKNNQN
KIVKAWLLAIDDFEKEFGVNKNEILLRIGKEIDSIDDLNSAIRTNDYNVV
NILLANIKAKMFKNELNKEDILKLMAAREKVAGASDKWTKASGLYSAIVK
GHTKIVAAWMETAEVIASHYENDKDVVRELLSLSRNNAVCSLYVASYKTM
SKQVIDVYLNAAIRLALQHGFTFDEILEQFTRDFDGKSFSLAVEKADDIY
GSLAENIQNCGW
>Z5879 orf; hypothetical protein in IS
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2806 putative transposase
MPLLDKLREQYGVGPVCSELHIAPSTDYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARXTXARLMAVMGLAGVLR
GKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFSGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z1882 putative DNA packaging protein of prophage CP-933X
MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAAIKWY
AERDAEIENEKLRREVEELRQDSETDLQPGTIEYERHRLTRAQADAQELK
NARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDF
LKRDIIKAMNKAAALDELIPGLLSEYIEQSG
>Z1192 IS1 protein InsB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>Z1825 insertion sequence 2 OrfB protein
MDSARALIARGWGVSLVSRCLRVSRAQLYVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKPTVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAENFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>Z4503 putative transposase
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z0946 putative integrase encoded by prophage CP-933K; partial
MIILLRYIHGLIATKKSSPAEESSRRHFINYMSKIKAIRRGLPDAPLEDI
TTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATR
AAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEM
KWSDIVDGYLYVEQSKTGVKIAIPTTLHVDALGISMKETLDKCKEILGGE
TIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYE
KQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK
>Z2396 DNA replication protein DnaC
MKNIATGGVLERIRRLAPPHVTAPFRTVAEWREWQLAEGQKRCEEINRQN
RQLRVEKILNRSGIQPLHRKCSFANYQVQNDGQRYALSQAKSIADELVTG
CTNFAFSGKPGTGKNHLAAAIGNRLLKDGQTVIVVTVADVMSALHASYDD
GQSGEKFLRELCEVDLLVLDEIGIQRETKNEQVVLHQIVDRRTASMRSVG
MLTNLNYEAMKTLLGERIMDRMTMNGGRWVNFNWESWRPNVGQPGIEK
>Z1202 hypothetical protein
MLAVERAFSSQISVIEGPPGTGKTQTILNIVANILIQNKTVAILSNNNSA
VSNVYEKMDKQQLGYVMARLGSTENRQQFFSTSISRSEEVLPDSPSANAI
DDVLQQVKKHLNAINQVASLKAEINELNIEYKYLQQWQSQNLRPEELFSH
KYRFSSQKTTDLMAYIHYLSDRRIGFRNRIDLLLNFMILKVKPLMIPERR
LALFTSLQLSYYEKNIREKQISLNEYEEAFKKSDFKILLGRLTSWSMLYL
KQHLRRNVSTRSSFSAETYRDEFDRFIKRFPIIGSSTHSIINSIGKGALL
DYVIIDEASQQDIVPGILGLGCARNVIVVGDRKQLPHVPVLLPNSPSPPA
EYYNCEKYSLLDSVCMLFRNMVPVTLLKEHYRCHPKIIQFCNKQFYDNAL
IPLTVDSGEASLSLVITAKGNHTRNFSNLRELESLEGHYWDEESSRGYIA
PYNAQVNLAEKVLPADFVKSTVHKFQGRECDEIVFSTVLDKKRSSQHSRN
IAFVDNPELVNVAVSRARNKFTLVTGNDVFERHAGHIAALIRYIKYYADD
GEIFESPVISAFDLLYSEYDKSLERLNSRLNSNDSHFKSEQIVACLLRDI
LSQDSYRSMMFHSQIALNQLVLLERGDFTHREQLFMRNRASCDFVVYYKV
GKTPLGVIEVDGGYHLTSVQAERDELKNSILKKCGLPLLRLRTIDSDIEG
KLGAFLSGLTG
>Z1598 unknown in ISEc8
MEQKILSSEPRRSFSNEFKLQMVKLASQPGAXVARIAREHDINDNLLFKW
LRLWQNERRISRRLPVTTSSGAGVELLPVEITPDEQKEPMAALTPLLSTP
SQSTVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR
>Z5878 putative integrase
MSQRYKLYRRTSGIYVVRISVPQRFRRYAGQCEIHTSTGTHDLHEAKQKS
ALLLAVWYQTLQEYEQLDYRTLSDCAPLLAGEGMISLSNFAQSIELPISQ
LIREVVNRNLPVFWLATGQFGFYVDEFNAVEREPGAKREKQSDDEKDQPK
EVIILNSAFELGIESFANGYLRPFNPRHTLDCLLSAGVSEGEAAFRTSGD
NQSGGWFFDLPGVDITADSLLISKVHAEGLRLTWLVKTTPPAVSIHPAVP
LVAPVIANEYVHRKHYNENLSWLREEYLKHRRKGKVSEAALRDIRYYFDL
MIEVMGDIQLEDFDRDFLRAYESKLRTIPANRNLMKGKHGVKTLDELIAK
AAECGDKLMTEESVKKYINGLYGAMEWAVDDGKFLKSPCDNFFPPDDKGE
REQDHTDIFEPHEIKAIFSQPWFVAGTVERNAQGRFHQYCPFHYWAPLLG
LMTGARVNEIAQLMLDDVLADDGVYYLNLESDSENGKKLKNANSRRKIPV
HSTLIELGFIEYVDALKAAGYDRLFPELKPHKTKGYGRPVSAWFNESLLA
GRLKLERDRSKSFHSFRHSVSTLLKEKGVSSELRGQLLGHVRGKTETEVR
YSKDLKPVHMVEVVEKIDFSLPEIARFNIPDGLDAVRDALXXKRGKQTG
>Z3154 unknown protein encoded by ISEc8
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
>Z5880 putative transposase
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z0275 hypothetical protein
MSIQSLLDYISVIPDIRQQGKVKHKLSDILFLTVCAVIAGADEWQEIEDF
GHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKIFIEWMQECHEI
TDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSN
EITAIPELLNLLDLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQG
KLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTPEFCDF
EFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA
HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK
GGVKRKRKKVALNTCYIEEVLASCSELGFRTDKMKNLTQI
>Z1648 unknown in putative ISEc8
MELQDWRKEXRKNYSNEFKLRMVELASQPGASVARIAREHDINDNLLFKW
LRLWQNEGRISRRLQVTTSSDTGVELLPVEITPDEQKEPVAAIAPSLSTS
TQTSVSAGSCKVEFRHGNMTLENPSPELLTLLIRELTGRGR
>Z6015 unknown protein encoded in ISEc8
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z5899 putative ATP-dependent helicase
MNKQNYAPGMRVVIRDAEWRIRRADDSGDGGYLLTCDGISELVRGKEGLF
LTKLEQKVEILDPAKTHLVEDESANYQAAQLYIESQLRQRVPTDSKVHFG
HLAAMDSMPFQLDPTRMALAQPRQRILIADAVGLGKTLEAGILVSELIRR
GRGKRILVLAVKSMLTQFQKEFWSRFAIPLTRLDSAGLQKVRNRIPTNHN
PFHYFDKTIISIDTLKQDIEYRHHLENAWWDIIVIDEAHNVAERGTSSLR
SKLAKLLAGRSDTLIMLSATPHDGKAESFASLMNMLDPTAIANPKEYEYA
DFADKNLVVRRFKKDVKDQMSGEFPERNIVKLTRLASGAEEEAYRRLVES
QFRDDDDEQAQSNKGRLFKITLEKALFSSPMACASVVANRLKRLESRKDH
NSQSQINELESLLLALNNIDASQFSKYQLLLDTIRKDLAWKANNTEDRLV
IFTESIKTLEFLEQQLRADLKLKDDQIATLRGDQGDTVLMETVEAFGKTQ
SPLRLLVCSDVASEGINLHHLSHKMIHFDIPWSLMVFQQRNGRIDRYGQK
HQPQIRYLLTEASEPQINGDMRVLEVLINKDEQAQKNIGDSSEFTGKFTQ
EEEEEQVAEFMMQDDGASLFDQLLNSNVSESAEHDLFGEICSAVSSDASM
VTETDTSLFASEQAYCERALGYLKASGQTIQYETLPDNTLSLVAPEELRR
RFNQLPPEIAPENWQLYLSQDKTVITDAIARARGEQHAWPDVQYLWQINP
VVQWLDDKISSAFGRHQAPVIRLPYLLEPDEDHFILSGLFPNRKSHPMVN
PWIVVSFNRESLIGSQPFAEFLQRHPQLSNKLTNSGGKDRNHQRQQDLLE
AAIAHAREVFIHDRNAFETHINQQLNEHLQKLDVLRGRQLSQLELDFADN
KQQLSVKQSRKEQRQREIEHNFDSYIEWIEDTMTTEKEPYIQVIAVITGA
EG
>Z6069 DNA replication protein DnaC
MKNIAAVGVLERIRRLAPQGAVPPYRTVEEWREWQLAEGRKRSEEINRLN
HQVRVEKILNRAGIQPLHRKCSFGNYRVQNDGQRHALSQAKSIADELMTG
CTNFVFSGKPGTGKNHLAAAIGNRLMAKGRSVIIVTVSDVMSVLHDGYDN
GQSGEKFLQELCGVDLLVLDEIGMQRDTRNEQVTLNQIVDRRTASMRSVG
MLTNLNHAAMSTLLGDRVMDRMTMNGGRWVNFNWESWRSNVGRQGM
>Z4334 IS629 transposase
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z1642 hypothetical protein
MLAVERAFSSQISVIEGPPGTGKTQTILNIVANILIQNKTVAILSNNNSA
VSNVYEKMDKQQLGYVMARLGSTENRQQFFSTSISRSEEVLPDSPSANAI
DDVLQQVKKHLNAINQVASLKAEINELNIEYKYLQQWQSQNLRPEELFSH
KYRFSSQKTTDLMAYIHYLSDRRIGFRNRIDLLLNFMILKVKPLMIPERR
LALFTSLQLSYYEKNIREKQISLNEYEEAFKKSDFKILLGRLTSWSMLYL
KQHLRRNVSTRSSFSAETYRDEFDRFIKRFPIIGSSTHSIINSIGKGALL
DYVIIDEASQQDIVPGILGLGCARNVIVVGDRKQLPHVPVLLPNSPSPPA
EYYNCEKYSLLDSVCMLFRNMVPVTLLKEHYRCHPKIIQFCNKQFYDNAL
IPLTVDSGEASLSLVITAKGNHTRNFSNLRELESLEGHYWDEESSRGYIA
PYNAQVNLAEKVLPADFVKSTVHKFQGRECDEIVFSTVLDKKRSSQHSRN
IAFVDNPELVNVAVSRARNKFTLVTGNDVFERHAGHIAALIRYIKYYADD
GEIFESPVISAFDLLYSEYDKSLERLNSRLNSNDSHFKSEQIVACLLRDI
LSQDSYRSMMFHSQIALNQLVLLERGDFTHREQLFMRNRASCDFVVYYKV
GKTPLGVIEVDGGYHLTSVQAERDELKNSILKKCGLPLLRLRTIDSDIEG
KLGAFLSGLTG
>Z1785 putative endodeoxyribonuclease of prophage CP-933N
MRIEFVLXYPPTVNTYWRRRGSTYFVSKAGERYRRAVALIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAEVLIDDEQFDEINIVR
GQPVPGGRLGVKIYEIRGGNDGA
>Z6061 putative endonuclease encoded by cryptic prophage CP-933P
MRIEFVLPYPPTVNTYWRRRGSTYFVSKAGERYRRDVALIVRQQRLKLNL
SGRLVIKIIAEPPDKRRRDLDNILKAPLDVLTHAGLLIDDEQFDEINIVR
GQLVPGERLGIKITELECA
>Z1120 putative P4-family integrase
MAVLTDTKARHIKPDDKPLPHGGITGLTLHPSSVKGRGKWVFRYVSPVTQ
KRRNAGLGTYPEVSIAEAARTARIMREQLAAGDDPLEIKKAESEKVAIPT
FADAARRVHAELSPGWENPKHVRQWLSTLENYAFPQLGAKTLDSITAADV
AETLRPVWLTLSETASRVKQRIHVVMQWGWAHGFCVANPVDVVDHLLPQQ
TRGRDEHQPAMPWRQLPLFVATSVYTDEPYNVTRALLLMVILTATRSGEA
RGMRWAEIDFHKRVWTIPAERMKARIQHRVPLSRQAIYILENIRGLHDEL
VFPSPRKQQILSDMVLTSFLRKKKAVSDIPGRVATAHGFRSTFRDWCSEQ
GYSRDLAERALAHTLKNKVEAAYHRTDLLEQRVPMMQAWADYVMSQIVNK
>Z2563 putative transposase (partial)
MGANSKSSALFIRLLKRLKATYCRTKTITLLEDNYIIHKGRETQRWLKDK
PKCRVIYQPV
>Z0394 hypothetical protein
MLRHACGFALADNGVDTRLLQDYLGHRNIQHTVRYTASNAARFKGVWKKK
PR
>Z2791 hypothetical protein
MKMIEVVAAIIERDGKILLAQRPAQSDQAGLWEFAGGKVEPDESQQQALV
RELNEELGIEATVGEYVASHQREVSGRIIHLHAWHVPDFHGTLQAHEHQA
LVWCSPEEALQYPLAPADIPLLEAFMALRAARPAD
>Z3162 IS629 transposase
MMPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSAHAQHDDW
LKKEIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVA
FIIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTIHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
TEVELATLTWVDRYNNRRLLERLGHIPPAEAEKAYYASIGNNDLAA
>Z2561 putative transposase (partial)
MSIIPPISRDERRLIQKAIHKRLTAMLMLHRGDRVSDVARTLCCARSSVG
RWIN
>Z6046 putative terminase subunit encoded by cryptic prophage CP-933P
MLTTQKRKFALALMSGKNKTASALAAGYSAKTARVKGSQLAKDPEVLAFI
ARKQCETVEVDEVPVYRQKKSEPEDKPRRREAAAIPQPDETNPEMPPPVV
ISPGIEYMEDGLPDPVKAMGRLLVENINTDPRLALDAAYKLAQFTHHKKG
DAGKKSAKGDAAKKAANRFAVPPPPRLVVNNDNEGNG
>Z1845 putative single stranded DNA-binding protein of prophage CP-933C
MTAQIAAYGRLVDDPQVKQTSKGTPMTLARMAVSLPCSQAQDGQATLWLS
VIAFGKQADFLAKHQKGDVASVSGTMQVSQWTGQNGETRQGYQVIADSVI
SARAARPGGNRRKTTGTQGNQPPAGGDDPYGDGIPF
>Z0367 unknown protein encoded in ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z3923 hypothetical protein
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQHDBWL
KREIQRVYDENHQVXXCA
>Z5089 putative transposase
MLTQNGVPMSRYRAGRLMKYLNLSSCQPGKHQYKNARQEHTCLPNLLERQ
FAVPEPDRVWCGDITYIWAGNRWCYLAVVMDLFARRVIGWSLSANADTAL
ISSALRMAYEVRGQPRDVMFHSDQGSQYTGLKYQQLLWRYRIKQSVSRRG
NCWDNSPMERFFRSLKTEWVPTDGYTGKDVARQQISSYILNYYNSVRPHH
YNGGLTPEESENRYHFYCKTVASIT
>Z1927 unknown protein encoded by ISEc8 in prophage CP-933X
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVKRKAPDSADTGTEAMIPLPXGTKNLAGLPVS
PI
>Z2982 unknown protein of IS629 encoded within prophage CP-933T
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z5491 orf; hypothetical protein in IS (partial)
MALICKLSQQWSFVGSKARQHWLWYVYNTKTGGVLAYTFGPRTDETCREL
LALLTLLPSAC
>Z1843 unknown protein encoded by prophage CP-933C
MILGVATSLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPDA
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSG
KLQGAKDGGNREIKHWRTVAISTGEMDVETFLKSEGIKVKAGQLVRLLNV
PMEKATKFHEYSNGKEHADALKDAWTANHGAAGREWVKWLAGHQQEAKDT
VRECRERWRNLIPESYGEQVHRVGERFAILEAALVLSGHVTGWVVQECRD
AIQHNFNAWVKEFGTGNREFKQMVEQAEAFLSSFGFSRYLPHPNTDERDL
PIKELAGYRKGSIRNEDDEMRYYTFPHVFESEIAKGFNPAHFARALDAAG
MLEKGSDRRYKKKALGKIGGKQHVFYVLMFQPDDED
>Z1958 unknown in IS629
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2074 putative IS encoded protein within CP-933O
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQAXAYFAKA
EFDRLWKK
>Z2804 unknown protein encoded within IS629
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1562 unknown in IS1N
MALICELDEQWSFVENKARQQWHWYAYKTKADGVLAYTFGPRTDETCREL
PEFLKPFSAGMITRDNRSSYTREMPQDKHLVGKIFTRRIERNNLTLRTHI
KRPARKTICFLRSLEIHEKPLVHLSKTHVLLTGVITRASFAVFLP
>Z0324 integrase protein for prophage CP-933I
MCIGLCICSCSVWIPIHMPLNDMQIRRAKPEAKAYTFGDGLGLSLLIEPN
GSKSWRFRYRYAGKPKMISLGVYPTITLADARSRRDEARKLVAEGKNPSE
VRKEQKLAMQTESENAFEKIAREWHQLKSAKWSAGYASDIMEAFKNDIFP
YVGTRPVGEIKPLELLNVLRKIEKRGALEKMRKVRQRCSEVFRYAIATGR
AEYNPAADLSSALEVHQSNHFPFLKADEIPDFLRALEGYSGSKLVQIATK
LLMITGVRTIELRAALWQEFDLDNAIWEIPAERMKMRRPHLVPLSSQAVD
LLNELKIMTGNYRYVFPGRNDPNRPMSEASINQAIKRIGYGGKVTGHGFR
HTLSTILHEQGFESAWIEIQLAHVDKNSIRGTYNHAQYFSGRKSMMDWYS
NLIFERLKRS
>Z2079 putative IS encoded protein within CP-933O
MHVRIPSALTEEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSL
ESWLREKMKTLSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENA
LRAVSLGRKNFLFFGSDHGGERGALLYSLIGTCKLK
>Z3561 putative regulator
MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGK
ILVQRRTETKDFLPGMLDATAGGVVQADEQLLESARREAEEELGIAGVPF
AEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSEVCWLTPEEITARC
DEFTPDSLKALALWMKRNAKNEAVETETAE
>Z2389 putative DNA modification methyltransferase encoded within prophage CP-933R
MNVIDLFSGVGGLSLGAARAGFDVKMAVEIDQHAINTHAINFPRSLHVQE
DVSLLNAEIIKGFFKNDMPIDGIIGGPPCQGFSSIGKGNPDDSRNQLYMH
FYRLVSELQPLFFLAENVPGIMQEKYSGIRNKAFNLVSGDYDILDPIKVK
ASDYGAPTIRTRYFFIGVKKSLKLDISDEVFMPKMIDPVTVKDALYGLPD
IIDANWQSDSESWRTIKKDRKGGFYEKLWGQIPRNVGDTESIAKLKNNII
SGCTGTLHSKIVQERYASLSFGETDKISRSTRLDPNGFCPTLRAGTARDK
GSFQAVRPIHPYHPRVITPREAARLQGFPDWFRFHVTKWHSFRQIGNSVS
PIVAEYILKGLYNLLNKRVQPEYLNHNSLEVRV
>Z1638 putative transposase
MPLLDKLHEQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKXRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYI
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z3024 hypothetical protein
MVVTTSDVVMCQMRHSDVQGVYRVYGSWMAENFQDQVSISNQIMSKFAPS
MPHAVRSDVINNRLHNLYLHAHYFLICRHQLITHLNPHLHRN
>Z2110 putative transposase encoded within prophage CP-933O
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2365 putative DNA packaging protein of prophage CP-933R; terminase small subunit
MATQTEVARHLSLTDRQLRRLQKLPGAPISNKRGQLDLDAWRDFYISYLR
RSKNDVPDGDSEDDYEEKLLIARWELTAEQAVTQQLKNEVSKGKLIDTGF
CIFALSKLAMALSSTLDSIPLSMQRQFPDLTPRHLDHLKTLIAKGANQCA
RAGDKLPDLLDEYIRATTE
>Z5428 hypothetical protein
MGWHHCQYHSFTALDVINNYVACLLLNRGKEAVMVANINLIKQESYSVVN
LEKQLMSNESPKQSLLNDFSHRLKMEGEEQLFHCVNELKYGLSSNNEYDL
FKNRETSLWKRKVPESAALAMTGTVYKLFGSRLLNARNQLDKNLLEMAMQ
MVYGKDMVTKSGDILIDIQLNKDGSLQSKHYTFDVGEVINSYNIDLFVNA
SSINQTYLHDKNPGDVISLYGTLDNIPLVVKECTGKNHQINVKAKLPQER
SEKVEDMCERMRGGISMFNHTTKTAGNIEHNLQLSFLVDSRPSITNKYSA
KDVAPDILVLPVNVYHADFKIKINNAVEKT
>Z3348 unknown protein encoded within prophage CP-933V
MAKTSCEEMTMLLIQPGFGLSIKKGHMFGEKESQRKMVSIRLPFISIYWL
NREATNYWYTCARAAFNDPDWFVKNHHAVRQAKRKANMTYMKAYQKAWKE
HRDRYQQDMEKLESENMELRRKLGEAKRDIDAYKRLFNGESHA
>Z1323 putative integrase for cryptic prophage CP-933M
MGRPRKNKKDNVLPPRVRSNGYSYVWKPEGSTRSIGLGRVRKTSVAKVWQ
NYELEKAKLHNIMTVAKLWHMFMDSPAFTELAPRTQKDYRQHQKALLMVF
GKVLADNVKTEQVRIFMDKRGLESKTQANHELASLSRVYGWGYERGYVKN
NPCKGVRKFSLKARTVYITDEQYAAIYAEAIPQLRIAMEISYLCAARLGD
VLELKWQDIMDKGIYIEQNKTGTKQIKEWSPRLRTAIQLARNVSSCTCEY
VINTTKGGKVIAKTLNNWWNQAKRAAEQKVGVPFGCNFHDIKAKGISDYE
GSSRDKQIFSGHKTENQVLIYDRKTKITPTLDLPLVVSK
>Z3664 putative virulence protein
MKRLQAFKFQLRPGGQQEREMRRFAGACRFVFNRALALQNENHEAGNKYI
PYGKMASWLVEWKNATETQWLKDAPSQPLQQSLKELERAYKNFFRKRAAF
PRFKKRGQNDAFRYPQGVKLDQENSRIFLPKLGWMRYLNSRQVTGVVKNV
TVSQSCGKWYISIQTESEVSTPVHPSASMVGLDAGVAKLATLSDGTVFEP
VNSFQKNQKKLARLQRQLSRKVKFSNNWQKQKRKIQRLHSCTANIRRDYL
HKVTTTVSKNHAMIVIEDLKVSNMSKSAAGTVSQPGRNVRAKSGLNRSIL
DQGWYEMRRQLEYKQLWRGGQVLAVPPAYTSQRCACCGHTAKENRLSQSK
FRCQVCGYTANADVNGARNILAAGHAVLACGEMVQSGRSLKQEPTEMIQA
TA
>Z1134 unknown in IS600
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>Z2987 putative tail fiber component of prophage CP-933T
MKTLDIRVAFSAVDRLTRPAENARRLMGQFGDSIQRTQGAIKNLERQARS
FERARDAVSKADAGIVKARRQLNALNQLQRTGTVLSEKQQKLMQQLSTRL
ERLNESRTREIQKMRELGGELKRHGISLTGSDNTIQQAIRRTEQYNNQLE
RERQALARVTQARERYSHAQETAGKLKTGGALAIGAAAAGGYAAGRFLQP
AIGFGKEMSRVQALTRIDKNSPQFKALREQALKLAETQFTASDAASGQSF
LAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLTADQM
DRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLA
NNGLRGSDAGTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMED
VLLDLYKATQKYGQVDQVSFFKDIAGEXAFVGLQTLVAAAGSGELQKLTR
ELQGARGEADRVAKVMADNLDGDLKNLDSAWEGLRIRISDLVDGPLRSVT
QWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIGSLSLVIGVLYG
KLATLRLGFDILTRSMNVVRVLPALWGMVTGSVSLLGGVIGALFSPVGLI
VAALAGAAVLIWKYWDPIRAFFAGVFSGIMERLTPLRDTFERFGPVFDVI
GSGSARCLTGLNRCCHRWSPARKRWINVPVLARYSVTFLVVRYSLF
>Z4340 unknown protein encoded by ISEc8
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPAHLPREIQRLESEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYR
QSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYRYVMNTRKLHTDDT
PVKVLAPGLKKTKTGRIWTYVRDDRNAGSSSPPAVWFAYSPNRQGKHPEQ
HLRPFRGILQADAFTGYDRLFSAEREGGALTEVACRAHARRKIHDVYISS
KSATAEEALKRISELYAIEDEIRGLPESERLAVRQQRSKALLTSLHEWMM
EKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAETDNNTAERALRAVC
LGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWP
SNRVDELLPWNVVLTNK
>Z6014 unknown protein encoded in ISEc8
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
>Z0328 unknown protein encoded in prophage CP-933I
MNDNFFTFRKIKVTGFNKLDAIIEFGSKLTILYGGSDSGKTYIYYLIRYL
LGSEKLKNKDIDHAQGYDLAYLEFNFQGRVMTIERSLQDSAHYRLYDSSI
ENVSEANLLMVFSKSASSKKSFSSYFYGRLNFKEAKVRTNLSNTLHKFNL
NNVFEFFCIDELRVLTEKSLILSDIPSEETKRKSEFKFLLTQRDDTNSLA
EKPNKKARYF
>Z2253 H repeat-associated protein of Rhs element
MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI
KTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF
AVKGNQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDV
PDELIDFTFEWKGLKKLCVAVSFRSIIAEQQNSRTAEQKKEPKNDGQILY
QFC
>Z1344 putative endonuclease of cryptic prophage CP-933M
MRIEFVLLYPPTVNTYWRRRGSTYFVSKAGERYRRAVALIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLMDDEQFDEINIVR
AQPVSGGRLGVKIYPIMLEGQVKK
>Z0857 putative receptor
MEHKLSDILLLIICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVH
DTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR
RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAPPELLNILDIKGKIITT
DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD
SYAISEKSHGREEIRLHIVCEVPDELIDFTFEWKGLKKLCVAVSFRSIIA
EQKKEPKMTVRYYISSADLTAGKFATAIRNHWHVENKLHWRLDVVMNEDD
CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASV
LAGSGLS
>Z5097 unknown protein encoded by ISEc8 within prophage CP-933L
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1161 unknown in ISEc8
MSRKYLIRITELERLLSEQAEALRQKDQQLSLVEETEAFLRSALARAEEK
IEEEEREIEHLRAQIEKLRRMLFGTRSEKLQREVEQAEAQLKQREQESDR
YSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGSELDY
LGEVSAEQLELVSSALKVIRTVRVKKACTKCDCIVEAPAPSRPIERGIAG
SGLLARVLTGKYCEHLPLYRQSEIFARQGIELSRALLSNWVDACCQLMTL
LNDTLYRYVMNTRKVHTDDTPVKVLAPGRKKAKTGRIWTYVRDDRNAGSS
EPPAVWFAYSPDRQGKHPVQHLRPFRGILQADAFSGYDRLFSAEREGGAL
TEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESER
LAVRQQRSKVLLTSLHEWMVEKNGTLSKKSRLGEACSYVLNQWDALCYYS
DDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRL
NGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK
>Z1162 unknown in ISEc8
MDISLLSTTSDPEQLRALAIAMVQKVMAENAELQNRIRILEEQMKLARQQ
RFGKKCESLAGMQRSLFEEDVDADIAEISAHLDKLLPQTGDEEKTTTRPV
RKPLPSPLPRAEKVIPPAEERCPDCDAPLHFIRDEVSEKLEYIPAQVVVN
RYIRPQYSCPCCEKVFSGKMPAHILPKSAVEPSVIAQVVISKYTDHLPLY
RQQHIFSRMGVELPVSTMADMVGVAGAALAPLAKLLRHELLTRDVIHADE
TSLRLLDTRKGGKSCSGWLWAYVSGERVSVNGAPY
>Z0322 unknown protein encoded by IS2
MQDVMLGAIEKRFGDKVPEQSIQWLTDNGSAYRAHETRQFARELNLEPCT
TAISSPQSNGIAERFVKTMKEDYIAFMPKPNVRTALHNLAVAIEHYNENH
PHSALGYRSPREYRRQRVTLT
>Z1933 unknown in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z3333 unknown protein encoded within prophage CP-933V
MLTTQKRKFALALMSGKNKTASAXAAGYSAKTARVKGSQLAKDPEVLAFI
ARKQCETVEVDEVPVYRQKKSEPEDKPRRREAAAIPQPDETNPEMPPPVV
ISPGIEYMEDGLPDPVKAMGRLLVENINTDPRLALDAAYKLAQFTHHKKG
DAGKKSAKGDAAKKAANRFAVPPPPRLVVNNDNEGNG
>Z1660 transposase for IS629
MPLLDKLXEQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYI
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z5114 hypothetical protein
MNEKFRTDLAHTFGIALEEQTDVLSFHDNDGHEWILECASQSEILFFYCY
LLNSESIQINSILEMNSNRELLGMFFLSLKDDNILLNIAFPADKIDITEF
ANLMENGYLLKNEIIRSLSSRPTDFLP
>Z5098 unknown protein encoded by ISEc8 within prophage CP-933L
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z3925 partial putative transposase
MGWRVSSSMETTFVLDALEQALWARRPSGTIHHSDKGSQYVSLAYTERLK
EAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTW
VDWYNNRRLLGRLGHTPPAEAEKAYYASIGNDDLAA
>Z1150 unknown in IS
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKAAVDSICQCNTPFN
YLFHCPRRYRLSV
>Z4801 putative ATP-dependent DNA helicase (together with adjacent 3 orfs)
MGRAGRGIDSAVGILLCGSEDRAIHKFFRESAFPAEAQIHEILNVLSVND
GLTLRGIEQRTNLRYGQIEKALKLLVAENPSPVVYTEKLWRRTIVSFSPD
HERINHLMNQRKSELADVESYITTKECKMQFLRRALDEPGAEHCGKCSSC
LQHPLLSPDIDSGLLHAANLFIKHADLSLNLNKQVAAGAFTQYGFKGNLP
ASLQGSTGRILSRWGYSGWGKQVAQEKKTGRFSDELVEACAEMVRQRWNP
HPEPTWVCCVPSLKHLDLVPDFESTSTWFLILPGDWRRNLAYLLLMPLKK
SWTIHRRKCSKTVSTSVKISTGRL
>Z3156 unknown protein encoded by ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z2082 putative transposase within CP-933O; partial
MDTTLSNSSDSDQTVQAVRLPDVSPALVSKVTDAVMEQVVEWQNRPLDAV
YPIVYLDCIVLKVRQDSRVINKSVFLALGINIEGQKELLGMWLAENEGAK
FWLNVLTELKNRGLNDILIACVDGLKGFPDAINTVYPEARIQLCIVHMVR
NSLRFVSWKDYKAVTRDLKAIYQAPTEEAGQQALEAFASAWDSRYPQISR
SWQANWTNLAMFFAYPADIRKVIYTTNAIESLNSVIRHAIKKRKVFPTDD
SVKKVVWLAIQAASQKWTMPLRDWRMAMSRFIIEFGDRLDGHF
>Z1500 unknown protein encoded by bacteriophage BP-933W
MENEGDNIITLVQPKRDEEKLLNITVTGRKNYTQQSCKHRAIEVHEQDHV
ILCLQCGCVVDPFQYVLRCANDGEAVVREIRQLHNRHDQLRESVASLERE
EKNTKARLRAARTAILYAENDLKNIEQKVNQ
>Z3922 putative transposase
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1632 IS1 protein InsB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>Z2078 putative transposase within CP-933O
MDEKQLQALANELAKNXKTPEDLSQFDRLLKKISVEAALNAEMSHHLGYD
KNQPKPGANSRNGYSTKTVITGDGHLELRTPRDRDGSFEPZLVKKNQTRI
TGMDNQILSLYAKGLTTREIAAAFKELYDADVVSVQRGPY
>Z1163 unknown in ISEc8
MFSGLFAMLTPDNVFLVVKPVDMRRGIDTLTQYVQNELNAAWHDGAAFVF
TNKVRSRIKVLRWDKHGVWLCTRRLHRGSFRWPRKGDATWHLTQDEFHWL
VFGVDWQQVKGHDLAKWVYQ
>Z1657 putative DNA repair protein, RADC family
MQQLSFLPGEMTPGERSLILRALQTLDRHXHEPGVAFTSTRAAREWLILN
MAGLEREEFRVLYLNNQNQLIAGETXFTGTINRTEVHPREVIKRALYHNA
AAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGNQVF
SFAEHGLL
>Z1217 putative DNA repair protein, RADC family
MQQLSFLPGEMTPGERSLILRALQTLDRHLHEPGVAFTSTRAAREWLILN
MAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRALYHNA
AAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGNQVF
SFAEHGLL
>Z2429 unknown protein in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2894 exodeoxyribonuclease X
MLRIIDTETCGLQGGIVEIASVDVIDGKIVNPMSHLVHPDRPISPQAMAI
HRITEAMVADKPWIEDVIPHYYGSEWYVAHNASFDRRVLPEMPGEWICTM
KLARRLWPGIKYSNMALYKTRKLNVQTPPGLHHHRALYDCYITAALLIDI
MNTSGWTAEQMADITGRPSLMTTFTFGKYRGKAVSDVAERDPGYLRWLFN
NLDSMSPELRLTLKHYLENT
>Z1198 putative transposase
MPLLDKLHEQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKXRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYI
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z0854 hypothetical protein
MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI
KTDEKSNEITAPPELLNILDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF
AVKGNQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCEV
PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPKMTVRYYISSADLTAG
KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI
LTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGLS
>Z4316 unknown protein encoded by ISEc8
MITLPTGTRIWIIAGITDMRCGFNGLASKVQNTLKDAPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLERGRFVWPVTREGKVHLTPAQLSMLLEGI
AWQHPKRTERPGIRI
>Z1601 unknown in ISEc8
MDISLLSTTSDPEQLRALAIAMVQKVMAENAELQNRIRILEEQMKLARQQ
RFGKKCESLAGMQRSLFEEDVDADIAEISAHLDKLLPQTGDEEKTTTRPV
RKPLPSPLPRAEKVIPPAEERCPDCDAPLHFIRDEVSEKLEYIPAQVVVN
RYIRPQYSCPCCEKVFSGKMPAHILPKSAVEPSVIAQVVISKYTDHLPLY
RQQHIFSRMGVELPVSTMADMVGVAGAALAPLAKLLRHELLTRDVIHADE
TSLRLLDTRKGGKSCSGWLWAYVSGERVSVNGAPY
>Z1158 unknown in ISEc8
MALRKIAGLYRIEKFIRERPVEKIRQWRQRYSRPIVNDLFAWPEEQEPCC
PPDGPLNKAINYILNRRDELSCFLSDGAVPLDNNICERAIRPVVMGRKAW
LFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTDVLTRLPEWPEDRLEE
LLPLEGFTFSG
>Z1475 putative terminase small subunit of bacteriophage BP-933W
MAKLDWKKLEQAFRREHAETGITLLDWCRKKKINYNTARTRIKMGKIDHE
IDHKTDHEIDHDISDEEPCNDAGSGDEKCAKNSEKNCANSAETKRIRGSR
LLPPSNAFSQRNTHAVRHRGYAKYLEADNLMDDASDMVLFDELVFTRARA
LSVTKALKGMFADLEEATDVETRVALYDKILKAEQALDRNIARIESIERS
LLTLDVLAETAPKLRADRERINAARDKLRAETDILTNQRRGVVTPVSDIV
SSLHEMSNSGRLDDIPEE
>Z4314 unknown protein encoded by ISEc8
MARGKAAITFFREPPATSCDSRCSCRTARIARRGPGNPQYQLLSTTDDYC
TVRDSVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALTE
EALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKTL
SRHLELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKNF
LFFGSDHGGERGALLYSLIGMCKLNDVDPESYXRHVLGVIADWPVNRVSE
LLPWRIALPAE
>Z1928 unknown protein encoded by ISEc8 in prophage CP-933X
MFLTQXQLTMLLEGIDWRQPKRLLTSLTML
>Z0395 hypothetical protein
MTRKYLTQDEVYRLMDAAQSMSFPERNRCLIMMAFIHGFRASELLDLRLS
DIDASGKQLNIRRIKNGFSTTHPLLPDEYNLIKLWLKQRKLIENGVEGDW
LFLSRKRRPISRQHFFLSFVRLEDVQD
>Z0271 hypothetical protein
MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF
GETHLDFLRQYGDFENAIPVHDTIARVVSCISPAKFHECFINWMRDCHSS
DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSN
EITAPPELLNILDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG
RLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCEVPDELIDF
TFEWKGLKKLCVAVSFRSIIAEQKKEPKMTVRYYISSADLTAGKFATAIR
NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF
KAGLRRKMRKAAMDRNYLASVLAGSGLS
>Z5902 putative helicase
MTVQTPKVALSDGFLGAFARIPKAQQKKVQEFISKFRQDPTSNGLNYEKI
HDARSKNVHSVRIDQTYRGIVLKPEQGALYMLMWVDKHDEAYDWARRHDC
SIHPVTGAIQVIDISYIKPAAETVVDKPKLFAAYSAEQILALGVPPVFID
QVMALTDEAGLNQLESIMPAEAWEPLHWLAEGLDYQEVLEEFNDHRDEPV
DTNDFMEAIERSKRRFHVVENEQELLQMLNAPLEKWRVFLHPSQRKLVES
PANGPVRVLGGAGTGKTVVAMHRARWLSQRLADKPGKKVLFTTFTRNLAA
DIRANLQRLCTREEMARIDVVNIDAWMSDQLKRHGYDFRVVYDSDEGRRK
CWNYALQQAPAELGLPDNFYAEEWQRVVQPNAVYTREEYLKVSRVGRGTA
LSRIQRAKIWPVFEEFRAQMARAKLREMGDAMHEAIVLFKEKQVQLPYSS
IVVDEAQDIGAPAFTLIRSLVPESPXDLFIVGDGHQRIYRNKVVLGQCGI
NVRGRRSKKLKINYRTTEETRQFAVGLLTGVKVDNXDGEADTSNDYLSLL
HGEKPMITHAADFKEEAATXVKQIQALLANQVRSQDICITARTKHXCDRY
ASALNDAGIETFNLGNDSGDSDARPGVRVATMHRIKGLEFQYVFLVGINE
GVVPEIKALASDDPVEQRDALFNERALLHVAATRAVKGLFVSSSGVPSPL
LVAD
>Z2771 putative excinuclease subunit
MVRRLTSPRLEFEAAAIYEYPEHLRSFLNDLPTRPGVYLFHGESDTMPLY
IGKSVNIRSRVLSHLRTPDEAAMLRQSRRISWICTAGEIGALLLEARLIK
EQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRR
AALQALQSIADEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHE
EHALRLRQSLERLRVVCWPWQGAVALKEQHPEMTQYHIIQNWLWLGAVNS
LEEATTLIRTPAGFDHDGYKILCKPLLSGNYEITELDPANDQRAS
>Z4193 type III secretion apparatus protein
MQLKNLQSLLDMKELLGEVVFRQDIFYSLRKVTVIQQQIAEINLEKQKIA
ERRKILNKEIVQQQAQRKHWWLKGEKYDRLKKRIKKQLLNQMLYQDELEQ
EEKYNGRSQEN
>Z4802 putative ATP-dependent DNA helicase (together with adjacent 3 orfs)
MIQRGTLARESLALDALVLGEHSSRLAWLATVIPQFSKSGIVYTLTTRDA
ELVAEWLRKNGISAFAYYSGVTCEGAEDSNTAREYLEQALLANKIKVLVA
TTALGMGFDKPDLGFVIHYQMPGSIVGYY
>Z3093 unknown in IS629 encoded within prophage CP-933U
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z3095 putative transposase encoded within prophage CP-933U
MSSTGSDRYAXNCILPRQRITIVSNSDIIPDKRSARAQHDDWLKREIQRV
YDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLRGKKVRTT
ISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAG
YIVGWRVSSSMETTFVLXALEQALWARRPSGTIHHSDKGSQYVSLAYTER
LKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAEVELATL
TWVDWYNNRRLLGRLGHTPPAEAEKAYYASIGNDDLAA
>Z0365 unknown protein encoded in ISEc8
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
>Z1559 putative P4-family integrase
MAVLTDTKARHIKPDDKPLPHGGITGLTLHPSSVKGRGKWVFRYVSPVTQ
KRRNAGLGTYPEVSIAEAARTARIMREQLAAGDDPLEIKKAESEKVAIPT
FADAARRVHAELSPGWENPKHVRQWLSTLENYAFPQLGAKTLDSITAADV
AETLRPVWLTLSETASRVKQRIHVVMQWGWAHGFCVANPVDVVDHLLPQQ
TRGRDEHQPAMPWRQLPLFVATSVYTDEPYNVTRALLLMVILTATRSGEA
RGMRWAEIDFHKRVWTIPAERMKARIQHRVPLSRQAIYILENIRGLHDEL
VFPSPRKQQILSDMVLTSFLRKKKAVSDIPGRVATAHGFRSTFRDWCSEQ
GYSRDLAERALAHTLKNKVEAAYHRTDLLEQRVPMMQAWADYVMSQIVNK
>Z1929 unknown protein encoded by ISEc8 in prophage CP-933X
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z3622 putative resolvase
MNVRIYCRASTEGQHADRALTSLREFSKSKGWQIAGEYIENASGAKLERV
ELMRLLSEAQSGDLLLVEAIDRLSRLEHSAWVELKDTLNRKGLIIVSMDL
PTSWQMVEMAGNDLTSGILRAVNAMLIDILATMARQDYETRRKRQQQGIE
RAKSEGIYIGRAKNQEAREIVREMLEQGVKPELIMKAAGISRATYYRIKN
ELLIVKSE
>Z1131 unknown protein encoded in ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPQYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z5096 unknown protein encoded by ISEc8 within prophage CP-933L
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
>Z1957 transposase for IS629
MMPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQHDDW
LKKEIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTVSRKAVAACDRVNRQFVAERPDQLWVADFTWVSTWQGFVYVA
FIIDVFAGYIVGWQVSSSMETTFVLDALEQALWARRPSGTIHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
TEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNNDLAA
>Z1597 unknown in ISEc8
MALRKIAGLYRIEKFIRERPVEKIRQWRQRYSRPIVNDLFAWPEEQEPCC
PPDGPLNKAINYILNRRDELSCFLSDGAVPLDNNICERAIRPVVMGRKAW
LFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTDVLTRLPEWPEDRLEE
LLPLEGFTFSG
>Z1208 unknown in putative ISEc8
MELQDWRKEPRKNYSNEFKLRMVELASQPGASVARIAREHDINDNLLFKW
LRLWQNEGRISRRLQVTTSSDTGVELLPVEITPDEQKEPVAAIAPSLSTS
TQTSVSAGSCKVEFRHGNMTLENPSPELLTLLIRELTGRGR
>Z3943 hypothetical protein
MEFSMENKCIESEQIFFAKMNRYSFKLSDKKWQLDKENCVYPHKVVDRMP
TKMKLSYLKTLAYYASEYSSFYIQSINNLFYEWFGAMTIDTIDDKAIYQL
NVYLGSERNYKLNLIKAFIIKWKNLNYPGVEATAIRMLEKIKIIPNQTGD
AVKRRDPNKGPLTEAEFNNIINAVGKFYHEKKIQCFLYCYILLLAITGRR
PLQLISLKAKDLIKNERGCFLNVPKVKQRKCFRKEFNMVMIEPFLYDSLS
MLINQNQAFVEDKFSVGISNYRGELPIFMNLDKITETKRIEDFLYDLTTD
FFHMKNSVMSKLLKHFPSKFDVRSERTNSYIELNARRFRYTLGSRLANEG
ASIEVIAKALDHKSVNSSIIYIKNNPDNVYDIDKRLSAFFNPLSNILMGI
EIEENKNFFIKFVSDAFFLLEDTKEDLKCLTCKKFNPWRAL
>Z1133 partial putative transposase
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRK
RTA
>Z3299 unknown protein in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z6017 putative transposase fragment
MFLALGINIAGQKELLGMRLAENEGANFWFNVLTELKNRGLNDILIACVY
GLKEFPEARIQLCIVHMVRNSMRFVSWKEYKAVTRDLKAISLPQKRQASR
HWKRLLRPGTAAIRR
>Z4317 unknown protein encoded by ISEc8
MNNTLPDNIEQLKALLIAQQAVIVRLSGEITGYAREISSLRALVAKLQRM
LFGRSSEKSREKIEKKIARAETRITELQNRLGEAQLQLTSMAGETAPKTS
DSPVRKALPATLPRDRQVISPAETECPVCSGKLKPLGESISEQLDIINTA
FRVIETVRPKRACSRCDCIVQAPQPPKPIERSYASPALLARIIMAKFAEH
LPLYRQSEIYARQGVELHRNTMGRWVDIMGEQLRPLYDELKHYVLMPGKV
HADDTPVNVLEPGQGKTRTGRLWVYVRDDRNAGSTMPAAVWFSYSPDRKG
IHPQQHLADYRGILQADAYAGYNALYESGQATEAACMAHARRKIHDVHVR
HPTTVTGEALRRIGELYAIEAEIRGSPAEERLAVRKARTVPLMQSLYEWL
QGQMNTLSRHSDTAKAFTYLLKQWDALNEYCRNGWVEIDNNLCENALRVV
ALGRRNYRTWFLPGKGNGTKESWFFRNRPHHE
>Z4330 putative transposase
MQKAIYKTHDKNYARRLTAMLMLHRSARVSDVARTLCCARSSVRRWINWF
TLSSVEGLKSLPAGRSRRWPFEHICTLFRELIKHSPGYFGYQRSRWSTEL
LAIKINEITECQLHAGTVRRWLPLVGLVWRRAAPTLRTRNPHKAAIHKAL
DECSAEHPVFYEDEVYIYLNPKTGADWQLRRQQKHVVTPGQNEKYYLAGA
LHSGTGKVSYVGGNSKSSALFISLLKHLKLTYRRDKPITLIVDNYIIHKS
RETQRWLKENPKFRVIYQPVYSPWVNHVERL
>Z4502 orf; hypothetical protein in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2057 putative endonuclease of prophage CP-933O
MLIDLVLPYPPTVNTYWRRRGSTYFVSKAGERYRRAVVLIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVR
GQPVSGGRLGVKIYKIESE
>Z1639 unknown in IS
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1871 unknown protein encoded by prophage CP-933X
MKKAIAYMRFSSPGQMSGDSLNRQRRLIAEWLKVNSDYYLDTITYEDLGL
SAFKGKHAQSGAFSEFLDAIEHGYILPGTTLLVESLDRLSREKVGEAIER
LKLILNHGIDVITLCDNTVYNIDSLNEPYSLIKAILIAQRANEESEIKSS
RVKLSWKKKRQDALESGTIMTASCPRWLSLDDKRTAFVPDPDRVKTIELI
FKLRMERRSLNAIAKYLNDHAVKNFSGKESAWGPSVIEKLLANKALIGIC
VPSYRARGKGISEIAGYYPRVISDDLFYAVQEIRLAPFGISNSSKNPMLI
NLLRTVMKCEACGNTMIVHAVSGSLHGYYVCPMRRLHRCDRPSIKRDLVD
YNIINELLFNCSKIQPVENKKDANETLELKIIELQMKINNLIVALSVAPE
VTAIAEKIRLLDKELRRALVSLKTLKSKGVNSFSDFYAIDLTSKNGRELC
RTLAYKIFEKIIINTDNKTCDIYFMNGIVFKHYPLMKVISAQQAISALKY
MVDGEVYF
>Z2084 putative integrase within CP-933O; partial
MIHLIKPAIDALRSQMALTRLSKEHIIDVHLREFGRTEKQKCTFVFQPEV
SAKVKNYGDHFTVDSIRQMWDAAVKRAGIRHRKSYQSRHTYACWSLTAGA
NPAFIANQMGHADAQMVFQVYGKWMSENNNAQVTLLNTQLSEFAPTMPHN
EAMKS
>Z1599 unknown in ISEc8
MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLDENPFSGHLFIFRGRR
GDTIKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>Z4337 unknown protein encoded by ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z1661 unknown protein encoded by IS629
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1772 unknown protein encoded by prophage CP-933N
MKIKHEHIRMAMNAWAYPDGEKVPAAEIARTYFELGMTFPELYDDSHPEA
LARNTQKIFRWLDKDTPDAVEKMQALLPAIEKAMPPLLVARMRSHSSEYY
REIVERRDRLVKDVDDFVAAAIAWGTLTNSGGQPGNAVVVH
>Z5117 hypothetical protein
MASLWKRLFYSSGRRRRYFEEGEHSFSILCGRLRGIVLTIKCSNGIIYLS
IKVSPNNRNHVFLYHKKDYVFDKLKEIFPDEAIEFTIEYEN
>Z3924 partial putative transposase
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVXXRPDQ
LWVADFTYVSTWQGFVYVAFIN
>Z1221 putative transposase
MPLLDKLHEQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYI
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z5885 putative resolvase
MTMAIFGYGRVSTSQQDTENQRMELEQAGWTFDFWFSDVVSGKVPAVQRK
AFFEMLSKIRDGETLVVAKLDRLGRDAIDVLQTVRTLADRNIKVIVHQLG
TTDLTSAAGKLLLSMLAAVAEMERDLLIERTNAGLLRAKAEGRKLGRPAK
IAPEARGAILDKRAAGVSVSALAREYGVSRATLAALLNKGY
>Z4336 unknown protein encoded by ISEc8
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1570 unknown protein encoded in ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPQYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z1357 unknown protein encoded by cryptic prophage CP-933M
MLTTQKRKFALALMSGKNKTASALAAGYSAKTARVKGSQLAKDPEVLAFI
ARKQCZTVEVDEVPVYRQKKSEPEDKPRRREAAAIPQPDETNPEMPPPVV
ISPGIEYMEDGLPDPVKAMGRLLVENINTDPRLALDAAYKLAQFTHHKKG
DAGKKSAKGDAAKKAANRFAVPPPPRLVVNNDNEGNG
>Z2072 putative IS encoded protein encoded by prophage CP-933O
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1835 putative integrase of prophage CP-933C
MMTGIKIMSRALNKLSDTQLRKINGTPAQKTAFLNDGGNLSVRHSTSGLL
TWYFTYRAGTGRGAPPERIKLGNYPDLSLKSAREKAAQCRAWLAEGKNPR
HELNYTVQEALKPVTVGDALTYWLESYAKENRVDYAALKKRLNNHVIQHI
GAMPLDKCELRHWLACFDQVAKRTPVTAGFLLQTCKQALKFCRRRRYAIS
NVLDDMSVADVGKKPDISERVLSTKELGELLQALDKKIFSPYYIALIRLL
IVFGCRTVELRLSEISEWDFTEMLWTVPKEHSKTKVAIFRPIPEAILPFV
TQLVEQNRHTGLLLGEVKQETSVSQYGRLAHRRLNHPHWSLHDIRRTFTT
MLNDLGVDPHVVEQLTGHQMPGMQRVYNHSRYLDAKRNALDMWTERLGIL
AGTHENVTTLPVARRK
>Z2375 orf; hypothetical protein in IS629 within prophage CP-933R
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z3759 DNA replication initiation factor
MVNFSRFCEILVEVSLNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQ
NVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLDKRTWF
VPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT
GDRPPRQLNLGLPDLASRLDWGQIYKLQPLSDEDKLQALQLRARLRGFEL
PEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL
>Z3471 ada, O6-methylguanine-DNA methyltransferase; transcription activator/repressor
MKNATCLTDDQRWQSVLARDPNADGEFVFAVRTTGIFCRPSCRARHALRE
NVSFYANASEALAAGFRPCKRCQPDKANAQQHRLDKITHACRLLEQETPV
TLEALADQVAMSPFHLHRLFKATTGMTPKAWQQAWRARRLRESLAKGESV
TTSILNAGFPDSSSYYHKADETLGMTAKQFRHGGENLAVRYALADCELGR
CLVAESERGICAILLGDDDATLISELQQMFPAADNAPADLTFQQHVREVI
ASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKP
KAVRAVASACAANKLAIVIPCHRVVRGDGTLSGYRWGVSRKAQLLRREAE
NEER
>Z3237 alkA, 3-methyl-adenine DNA glycosylase II, inducible
MYTLNWQPPYDWSWMLGFLAARAVSGVETVADDYYARSLAVGEYRGVVTA
IPDIARHTLHINLSADLEPVAAECLAKMSRLLDLQCNPQIVNGALGKLGA
ARPGLRLPGSVDAFEQGVRAILGQLVSVAMAAKLTAKVVQLYGERLDDFP
EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG
DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP
AQIRRYAERWKPWRSYALLHIWYTEGWQPDEA
>Z3470 alkB, DNA repair system specific for alkylated DNA
MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMV
TPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLC
QRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGL
PAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT
TDCRYNLTFRQAGKKE
>Z4740 dam, DNA adenine methylase
MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRY
ILADINSDLISLYNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNK
SQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKPYFPEAELYH
FAEKAQNAFFYCESYADSMARADDASVVYCDPPYAPLSATANFTAYHTNS
FTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKVRRS
ISSNGGTRKKVDELLALYKPGVVSPAKK
>Z2417 dbpA, ATP-dependent RNA helicase
MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTG
SGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPN
TKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTL
VMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQR
DPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNT
KKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDV
AARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEA
QRANIISDMLQIKLNWQTPPANSSIVTLEAEMATLCIDGGKKAKMRPGDV
LGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKT
CRVRLLK
>Z3054 dcm, DNA cytosine methylase
MQENISVTDSYSTGNAAQAMLEKLLQIYDVKTLVAQLNGVGENHWSAAIL
KRALANDSAWHRLSEKEFAHLQTLLPKPPAHHPHYAFRFIDLFAGIGGIR
RGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHKE
GVSDEAAAEHIRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACD
TQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDQGKTFRIIMQTLDELGY
DVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDIS
ECFPAQRVTLAQLLDPMVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGM
VYPNNPQSVTRTLSARYYKDGAEILIDRGWDMATGEKDFDDPLNQQHRPR
RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFAAVAKLL
EPKIKQAVALRQQEAQHGRRSR
>Z4523 deaD, inducible ATP-independent RNA helicase
MMSYVDWPPLILRHTYYMAEFETTFADLGLKAPILEALNDLGYEKPSPIQ
AECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQILVLAPT
RELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVGTP
GRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQT
ALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKN
EALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQAL
REQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRI
GRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRR
LEKFAAKVQQQLESSDLDQYRALLSKIQPTAEGEELDLETLAAALLKMAQ
GERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRERRDV
GDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASHSTIEL
PKGMPGEVLQHFTRTRILNKPMNMQLLGDAQPHTGGERRGGGRGFSGERR
EGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGDA
>Z0285 dinJ, damage-inducible protein J
MAANAFVRARIDEDLKNQAADVLAGMGLTISDLVRITLTKVAREKALPFD
LREPNQLTIQSIKNSEAGVDVHKAKDADDLFDKLGV
>Z0292 dinP, DNA polymerase IV
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARK
FGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPL
SLDEAYLDVTDSVHCHGSATLIAQEIRQTIFSELQLTASAGVTPVKFLAK
IASDMNKPNGQFVITPAEVSAFLQTLPLAKIPGVGKVSAAKLEAMGLRTC
GDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMA
EDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQ
EHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLG
L
>Z5193 dnaA, chromosomal replication initiation protein
MSLSLWQQCLARLQDELPATEFSMWIRPLQAELSDNTLALYAPNRFVLDW
VRDKYLNNINGLLTSFCGADAPQLRFEVGTKSVTQTPQAAVTSNVAAPAQ
VAQTQPQRAAPSTRSGWDNVPAPAEPTYRSNVNVKHTFDNFVEGKSNQLA
RAAARQVADNPGGAYNPLFLYGGTGLGKTHLLHAVGNGIMARKPNAKVVY
MHSERFVQDMVKALQNNAIEEFKRYYRSVDALLIDDIQFFANKERSQEEF
FHTFNALLEGNQQIILTSDRYPKEINGVEDRLKSRFGWGLTVAIEPPELE
TRVAILMKKADENDIRLPGEVAFFIAKRLRSNVRELEGALNRVIANANFT
GRAITIDFVREALRDLLALQEKLVTIDNIQKTVAEYYKIKVADLLSKRRS
RSVARPRQMAMALAKELTNHSLPEIGDAFGGRDHTTVLHACRKIEQLREE
SHDIKEDFSNLIRTLSS
>Z5650 dnaB, replicative DNA helicase
MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDD
VAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSV
GGFAYLAELSKNTPSAANISSYADIVRERAVVREMISVANEIAEAGFDPQ
GRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQP
HDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAM
LQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGT
MGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRV
PALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNS
DLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVR
LTFNGQWSRFDNYAGPQYDDE
>Z5961 dnaC, DNA replication protein DnaC
MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRA
MKMQRTFNRSGIRPLHQNCSFENYRVECEGQMNALSKARQYVEEFDGNIA
SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSG
TSEEQLLNDLSNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGM
LTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRVTGKEY
>Z0196 dnaE, DNA polymerase III subunit alpha
MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGL
VKFYGAGHGAGIKPIVGADFNVQCDLLGDELTHLTVLAANNTGYQNLTLL
ISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSAL
VDECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATN
DVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYMRSEEEMCELF
ADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL
EERLAFLFPDEEERVKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQW
SKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP
DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLG
HPYGFVDRISKLIPPDPGMTLAKAFEAEPQLPEIYEADEEVKALIDMARK
LEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAG
LVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML
QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDN
FIDRKHGREEISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGY
TLGGADMLRRAMGKKKPEEMAKQRSVFAEGAEKNGINAELAMKIFDLVEK
FAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL
VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAII
EARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN
SLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVL
DGERETLGLYLTGHPINQYLKEIERYVGGVRLKDMHPTERGKVITAAGLV
VAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQHLLEKDRILIV
SGQVSFDDFSGGLKMTAREVMDIDEAREKYARGLAISLTDRQIDDQLLNR
LRQSLEPHRSGTIPVHLYYQRADARARLRFGATWRVSPSDRLLNDLRGLI
GSEQVELEFD
>Z4419 dnaG, DNA primase
MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSF
TVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE
AGSGPSQIERHQRQTLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSH
EVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSYDRF
RERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYE
AQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRA
TNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLV
RKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS
QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTT
MRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNXCLSQPGLTT
GQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSXNHMFDSLLELR
QEELIARERTHGLSNEERLELWTLNQELAKK
>Z5192 dnaN, DNA polymerase III subunit beta
MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLE
MEMVARVALVQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVR
SGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQD
VRYYLNGMLFETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGV
IELMRMLDGGDNPLRVQIGSNNIRAHVGDFIFTSKLVDGRFPDYRRVLPK
NPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE
EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIE
DAASQSAAYVVMPMRL
>Z0241 dnaQ, DNA polymerase III subunit epsilon
MSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHV
YLKPDRLVDPEAFGVHGIADEFLLDKPTFAEVADEFMDYIRGAELVIHNA
AFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDALCA
RYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQGEA
TIQRLVRQASKLRVVFATDEELAAHEARLDLVQKKGGSCLWRA
>Z0587 dnaX, DNA polymerase III subunits gamma and tau
MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG
KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK
VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV
KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP
RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL
SLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA
ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE
MTLLRALAFHPRMPLPEPEVPRQSFAPVAPTAVMTPTQVPPQPQSAPQQA
PTVPLPETTSQVLAARQQLQRVQGATKAKKSEPAAATRARPVNNAALERL
ASVTDRVQARPVPSALEKAPAKKEAYRWKATTPVMQQKEVVATPKALKKA
LEHEKTPELAAKLAAEAIERDAWAAQVSQLSLPKLVEQVALNAWKEESDN
AVCLHLRSSQRHLNNRGAQQKLAEALSMLKGSTVELTIVEDDNPAVRTPL
EWRQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEDSIRPI
>Z4290 endA, DNA-specific endonuclease I
MYRYLSIAAVVLSAAFSGPALAEGINSFSQAKAAAVKVHADAPGTFYCGC
KINWQGKKGVVDLQSCGYQVRKNENRASRVEWEHVVPAWQFGHQRQCWQD
GGRKNCXKDPVYRKMESDMHNLQPSVGEVNGDRGNFMYSQWNGGEGQYGQ
CAMKVDFKEKAAEPPARARGAIARTYFYMRDHYNLTLSRQQTQLFNAWDK
MYPVTDWECERDERIAKVQGNHNPYVQRACQARKS
>Z5910 fimB, recombinase involved in phase variation; regulator for fimA
MKNKADNKKRNFLTHSEIESLLKAANTGPHAARNYCLTLLCFIHGFRASE
ICRLRISDIDLKAKCIYIHRLKKGFSTTHPLLNKEVQALKNWLSIRTSYP
HAESEWVFLSRKGNPLSRQQFYHIISTSGGNAGLSLEIHPHMLRHSCGFA
LANMGIDTRLIQDYLGHRNIRHTVWYTASNAGRFYGIWDRARGRQRHAVL
>Z5911 fimE, recombinase involved in phase variation; regulator for fimA
MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHY
QDLDLNEGRINIRRLKNGFSTVHPLRFDEREAVERWTLERANWKGADRTD
AIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERG
ADTRLIQDYLGHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV
>Z4621 fis, DNA-binding protein Fis
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDL
YELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
>Z3484 gyrA, DNA gyrase subunit A
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY
AMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYD
GTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDD
EDISIEGXMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEV
DAKTGRETXIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDG
MRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDI
IAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA
PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYL
TEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLME
VIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVK
YQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVY
SMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMAT
ANGTVKKTVLTEFNRLRTAGKVAIKLVEGDELIGVDLTSGEDEVMLFSAE
GKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQN
GYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDT
IDGSAAEGDDEIAPEVDVDDEPEEE
>Z5190 gyrB, DNA gyrase subunit B
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDE
ALAGHCKEIIVTIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLHA
GGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGVP
QAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSG
VSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIFYFSTEKDGI
GVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKE
GYSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAVE
QQMNELLAEYLLENPTDAKIVVGKIXDAARAREAARRAREMTRRKGALDL
AGLPGKLADCQERDPALSELYLVEGXSAGGSAKQGRNRKNQAILPLKGKI
LNVEKARFDKMLSSQEVATLITALGCGIGRDEYNPDKLRYHSIIIMTDAD
VDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDE
AMDQYQISIALDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMERR
YPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDKEQHGSQWKFD
VHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRGL
LEEDAFIERGERRQPVASFEQALDWLVKESRRGLSIQRYKGLGEMNPEQL
WETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA
NIDI
>Z1313 helD, DNA helicase IV
MELKATTLGKRLAQHPYDRAVILNAGIKVSGDRHEYLIPFNQLLAIHCKR
GLVWGELEFVLPDEKVVRLHGTEWGETQRFYHHLDAHWRRWSGEMSEIAS
GVLRQQLDLIATRTGENKWLTREQTSGVQQQIRQALSALPLPVNRLEEFD
NCREAWRKCQAWLKDIESARLQHNQAYTEAMLTEYADFFRQVESSPLNPA
QARAVVNGEHSLLVLAGAGSGKTSVLVARAGWLLARGEASPEQILLLAFG
RKAAEEMDERIRERLHTEDITARTFHALALHIIQQGSKKVPIVSKLENDT
AARHELFIAEWRKQCSEKKAQAKGWRQWLTEEMQWSVPEGNFWDDEKLQR
RLASRLDRWVSLMRMHGGAQAEMIASAPEEIRDLFSKRIKLMAPLLKAWK
GALKAENAVDFSGLIHQAIVILEKGRFISPWKHILVDEFQDISPQRAALL
AALRKQNSQTTLFAVGDDWQAIYRFSGAQMSLTTAFHENFGEGERCDLDT
TYRFNSRIGEVANRFIQQNPGQLKKPLNSLTNGDKKAVTLLDESQLDALL
DKLSGYAKPEERILILARYHHMRPASLEKAATRWPKLQIDFMTIHASKGQ
QADYVLIVGLQEGSDGFPAAARESIMEEALLPPVEDFPDAEERRLMYVAL
TRARHRVWALFNKENPSPFVEILKNLDVPVARKP
>Z2741 himA, integration host factor alpha subunit
MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFG
NFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPKDE
>Z1258 himD, integration host factor beta subunit
MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGS
FSLHYRAPRTGRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG
>Z0787 holA, DNA polymerase III subunit delta
MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEH
HTFSIDPNTDWNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTL
TGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRW
VAARAKQLNLELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLP
RVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEGSEPVILLRTL
QRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMGEALNRLSQPQLRQ
AVQLLTRTELTLKQDYSQSVWAELEGLSLLLCHKPLADVFIDG
>Z1738 holB, DNA polymerase III subunit delta'
MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLC
QQPQGHKSCGHCRGCQLMQAGTHPDYYTLAPEKGKNTLGIDAVREVTEKL
NEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPER
LLATLRSRCRLHYLAPPPEQYAVTWLSREVTMSQDALLAALRLSAGSPGA
ALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQAPARLHWLAT
LLMDALKRHHGAAQVTNVDVPGLVAELANHLSPSRLQAILGDVCHIREQL
MSVTGINRELLITDLLLRIEHYLQPGVVLPVPHL
>Z5871 holC, DNA polymerase III subunit chi
MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAY
RLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSPRDILISLR
TSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK
>Z5973 holD, DNA polymerase III subunit psi
MTSRRDWQLQQLGITQWSLRRPGALQGEIAIAIPAHVRLVMVANDLPALT
DPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSRCNSWRLGTDEPLSLEG
AQVASPALTELRANPTARAALWQQICTYEHDFFPRND
>Z2313 hrpA, helicase, ATP-dependent
MLRDRLRFSRRLHGVKKVKNPDAQQAIFQEMAKEIDQAAGKVLLREAARP
EITYPDNLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICMELGR
GIKGLIGHTQPRRLAARTVANRIAEELKTEPGGCIGYKVRFSDHVSDNTM
VKLMTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFLLGYLKELLPR
RPDLKIIITSATIDPERFSRHFNNAPIIEVSGRTYPVEVRYRPIVEEADD
TERDQLQAIFDAVDELSQESPGDILIFMSGEREIRDTADALNKLNLRHTE
ILPLYARLSNSEQNRVFQSHSGRRIVLATNVAETSLTVPGIKYVIDPGTA
RISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYSEDDFLS
RPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLL
EELGAITTDEQASAYKLTPLGRQLSQLPVDPRLARMVLEAQKHGCVREAM
IITSALSIQDPRERPMDKQQASDEKHRRFHDKESDFLAFVNLWNYLGEQQ
KALSSNAFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVNSEPAEY
REIHIALLTGLLSHIGMKDADKQEYTGARNARFSIFPGSGLFKKPPKWVM
VAELVETSRLWGRIAARIDPEWVEPVAQHLIKRTYSEPHWERAQGAVMAT
EKVTVYGLPIVAARKVNYSQIDPALCRELFIRHALVEGDWQTRHAFFREN
LKLRAEVEELEHKSRRRDILVDDETLFEFYDQRIGHDVISARHFDSWWKK
VSRETPDLLNFEKSMLIKEGAEKISKLDYPNFWHQGNLKLRLSYQFEPGA
DADGVTVHIPLPLLNQVEESGFEWQIPGLRRELVIALIKSLPKPVRRNFV
PAPNYAEAFLGRVTPLELPLLDSLERELRRMTGVTVDREDWHWDQVPDHL
KITFRVVDDKNKKLKEGRSLQDLKDALKGKVQETLSAVADDGIEQSGLHI
WSFGQLPESYEQKRGNYKVKAWPALVDERDSVAIKLFDNPLEQKQAMWNG
LRRLLLLNIPSPIKYLHEKLPNKAKLGLYFNPYGKVLELIDDCISCGVDQ
LIDANGGPVWTEEGFAALHEKVRAELNDTVVDIAKQVEQILTAVFNINKR
LKGRVDMTMALGLSDIKAQMGGLVYRGFVTGNGFKRLGDTLRYLQAIEKR
LEKLAVDPHRDRAQMLKVENVQQAWQQWFNKLPPARREDEDVKEIRWMIE
ELRVSYFAQQLGTPYPISDKRILQAMEQISG
>Z0159 hrpB, helicase, ATP-dependent
MLQCGAKNVNPLERFVSSLPVAAVLPELLTALDYAPQVLLSAPTGAGKST
WLPLQLLAHPGINGKIILLEPRRLAARNVAQRLAELLNEKPGDTVGYRMR
AQNCVGPNTRLEVVTEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLA
LALLLDVQQGLRDDLKLLIMSATLDNDRLQQMLPEAPVVISEGRSFPVER
RYLPLPTHQRFDDAVAVATAEMLRQESGSLLLFLPGVGEIQRVQEQLASR
IGSDVLLCPLYGALSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIR
LVVDCAQERVARFDPRTGLTRLITQRISQASMTQRAGRAGRLEPGICLHL
IAKEQAERATAQSEPEILQSDLSGLLMELLQWGCSDPAQMSWLDQPPTVN
LLAAKRLLQMLGALDGERLSAQGQKMAALGNDPRLAAMLVSAKSDDEAAT
AAKIAAILEEPPRMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRGGEADS
SLIAPLLAGAFADRIARRRGQDGRYQLANGMGAMLDADDALSRHEWLIAP
LLLQGSASPDARILLALPVDIDELVQRCPQLVQQSDTVEWDDAQGTLKAW
RRLQIGQLTVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRL
RLLCAGKWLPEYDWPAVDDESLLATLETWLLPHMAGVHSLRGLKSLDIYQ
ALRGLLDWGMQQRLDSELPAHYTVPTGSRIAIRYHEDNPPALAVRMQEMF
GEATNPTIAQGRVPLVLELLSPAQRPLQITRDLGAFWKGAYREVQKEMKG
RYPKHVWPDDPANTAPTRRTKKYS
>Z5576 hupA, DNA-binding protein HU-alpha (HU-2)
MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTF
KVNHRAERTGRNPQTGKEIKIAAANVPAFVSGKALKDAVK
>Z0547 hupB, DNA-binding protein HU-beta, NS1 (HU-1)
MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTF
AVKERAARTGRNPQTGKEITIAAAKVPSFRAGKALKDAVN
>Z3613 intC, putative prophage integrase
MDKIILPTGFLPMLTVKQIEAAKPKEKPYRLLDGNGLYLYVPVSGKKVWQ
LRYKIDGKEKILTVGKYPLMTLQEARDKAWTARKDISVGIDPVKAKKASS
NNNSFSAIYKEWYEHKRQVWSAAYATELAKMFDDDILPIIGGLEIQDIEP
MQLLEVIRRFEDRGAMERANKARRRCGEVFRYAIVTGRAKYNPAPDLAEA
MKGYRKKNFPFLPADQIPAFNKALATFSGSIVSLIATKVLRYTALRTKEL
RSMQWKNVDFENRIITIEASVMKGRKIHVVPMSDQVVELLTTLSSITKPV
SEFVFAGRNDKKKSICENAVLLVIKQIGYEGLESGHGFRHEFSTIMNEHE
WPADAIEVQLAHANGGSVRGIYNHAQYLDKRREMMQWWADWLDGKVE
>Z0307 intH, putative integrase for prophage CP-933H
MHKHAAANVAQRNRLNGKQIGFWLQHFAGMQLRDITESKIYSAMQKMTNR
RHEENWKLRAEACRKKGKPVPEYTPKPASVATKATHLSFIKALLRAAERE
WKMLDKAPIIKVPQPKNKRLRWLEPHEAQRLIDECPEPLKSVVEFALATG
LRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVALNDTACRVLKKQI
GNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWKAALRRAGIDDFRF
HDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARQ
IDSILNPSVPNLSQSKNKEGTNDV
>Z5087 intL, putative integrase for prophage 933L and the LEE pathogenicity island
MALTDVKVKTAKPKERPYKLADGGGMYLLINANGSKYWRMKYRFAGKEKM
LSIGVYPDVTLADAREKRSEARKILAAGGDPGEAKKEEKIALQMSLKNTF
EAVAREWHQTKADRWSLRYRDEIIDTFEKDIFPYIGKRPIAEIKPMELLE
ALRKMEKRGALEKMRKVRQRCGEVFRYAIVTGRADYNPAPDLASALATPK
KVHFPFLTANELPHFLTDLAGYTGSIITKTATQIIMLTGVRTQELRFAHW
EDIDFEAKLWEIPAEVMKMKRPHIVPPSEQVIALFKQLEPISKHHPLVFI
GRNDPRKPISKESINQVIELLGYKGRLTGHGFRHTMSTILHEQGFNSAWI
EMQLAHVDKNSIRGTYNHAQYLDGRREMMQWYADYIDSLSELA
>Z1764 intN, partial integrase for prophage CP-933N
MSPRPRKNSTDVAGLYEKFDRRTGRVYYQYKNPVTGKFHGLGTDKGKAEK
IASTANQRIAAAEAEYFMRKIDESPSATKRRGIRLKAWVDRYLKIQDTRL
KNGDIAATTHKEKTRMAAYLVSRLGNHPLKELEVRDFALILDEWLDKDMV
STARVNRGLWVDIYKEAQHAGEVPPGWNPPEATRKPIPKVTRARLTMEDW
QKIYNATPEKHFIRNAMLLAIVTGQRRDDICHMRFSDVWNEHLHITQGKT
GMRLALPLTLRCDAIGITLKEVIDGCRDRILSPYLIHSRHQKQPKPMSKD
NLSDYFAKARDLAGVIPPAGKTPPTFHEQRSLSERLYRAQGIDTKTLLGH
KVQATTDRYNDTRGQEWVKLVI
>Z2036 intO, putative integrase for prophage CP-933O
MARPRKYKTDVPGLSPYFDKRNNKVYWRYRHPITGKNHGLGSIDQKLAET
IAAEANSRLARQQMEQMLSLQEKIISDTGGSSTVTIFLNNYRKIQQERYE
NGEIKLNTLKQKAAPLRVFDERFGTRPLDAITVKDVVSVLEEYKARGHNR
MGQIFRKVLIDVFREAQQTGDVPPGFNPAESAKKPQVRISRQRLTFDEWM
MIYNAAEKDGYFLQRGMLLALMTGQRLSDICKMQFSDIRDGYLHVEQQKT
GTRIAIPLALRCDKLNLTLDDVVSSCRDCVLSPWLLHHHHAKGTAKRGGM
VKPATLTVAFKKARDSVDYNWRANGTPPSFHEQRSLSERLFREQGVDTKI
LLGHSNQKMTDIYNDARGKEWKKLVI
>Z2566 intP_1, integrase fragment, cryptic prophage CP-933P
MREVEMKYPTGVENHGGKLRIWFVYKGVRVRENLGFLTQQKTGALQVSYA
PLFVTQ
>Z2568 intP_2, integrase fragment, cryptic prophage CP-933P
MEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEK
NLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNA
VFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAK
NLWCNTVLSGDF
>Z2415 intR, putative integrase for prophage CP-933R
MSKLPTGVEIRGKYIRIWFMFRGKRCRETLKGWEVTNSNIKKAGNLRALI
VHEINSGEFEYLRRFPQSSTGAKMVTTRVIKTFGELCDIWTKIKETELTT
NTMKKTKSQLKTLRIIICESTPISHIRYSDILNYRNELLHGETLYLDNPR
SNKKGRTVRTVDNYIALLCSLLRFAYQSGFISTKPFEGVKKLQRNRIKPD
PLSKTEFNALMESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDFEKGVV
NVRRNLTILDMFGPPKTNAGIRTVTLLQPALEALKEQYKLTGHHRKSEIT
FYHREYGRTEKQKLHFVFMPRVCNGKQKPYYSVSSLGARWNAAVKRAGIR
RRNPYHTRHTFACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMND
EQVAMLNARLS
>Z2966 intT, integrase for prophage CP-933T
MSVRKIPSGKWLCECYPYGASGKRIRKQFATKSEALSYERRLMNSRVGDE
FQDGSGPRLSELIARWFEMYGKTLSSGAERKVKLEAICSRLGDPFASQFD
KNMFATYRERRLSGEWNPKGKKKLSEATVNREQSYLHAVFAELKRLGEWS
GENPLTGIRKFREEEKELAFLYVDEIERLLIACDESRNKDLGVVVRIGLA
TGARWSEAEGLKQSQVLPGRITFVKTKGKKNRTVPISPQLQAMLPKKRGA
LFSPCYEAFDAAIKRAKIELPDGQLTHVLRHTFASHFMMRGGNILVLQKI
LGHSDIKMTMRYAHFAPGHLEAAVELNPFDNRG
>Z3130 intU, putative integrase for prophage CP-933U
MGRRRKNPEHEKLPPKVYPNKYSVWKPTSRESVTLTAIEDGLAALWKKYE
ETVNHRDRAMTFGRLWEKFLASAYYSELSPRTQKDYLQHQKKLLAVFGKV
LADSVKPEHIRRYMDKRGEQSKTQANHEKSSMSRVYSWGYERGYVKANPC
AGVSKFKAKNRERYVTDKEYQAVLSVAPLPVFIAMEIAYLCAARVSDVLS
LKWEQIGNDGIFIQQGKTGKKQIKAWSPRLQAAIEKAKQLPTSAYVISNQ
YGNRYMYKGFNEMWVEARNHAGKISGILTDFTFHDLKAKGISDYEGSSRD
KQLFSGHKTEGQVLIYDRKVKVSPTLDVPLPENIPRKYSK
>Z3375 intV, putative integrase for prophage CP-933V
MSNASYPTGVENHGGSLRIWFHYNGKRVRENLGVPDTAKNRKIAGELRTS
VCFAIRMGSFDYAAQFPNSPNLKHFGLGKREITVKALSEKWLDLKKIEIC
ANALNRYQSVIKNMLPMLGEKKLVSSITKEDLLFVRRDLLTGYQKLSNGK
TSSIKGRSVVTVNYYMTTIAGMFQFATDNGYTSGNPFNGLAPLKKSKVKP
DPLTRDEFIRFIEACRHQQTKNLWILAVYTGIRHGELVSLAWEDIDLKAR
TITIRRNYTKLGEFTPPKTDAGTGRTIHLVQPAIDALKSQAEMTMLGKQH
SVEVKQREYGRTAVHKCTFVFSPQVTKQQQLSGPHYKVDSIRESWTSILK
RAGLRHRKSYQSRHTYACWSLAAGANPSFIASQMGHTNAQMVFNVYGAWM
KDNNHEQIELLNKRLSESVPCMPHKKAG
>Z1424 intW, integrase for bacteriophage BP-933W
MLLDAGGTMANSAYPAGVENHGGKLRITFKYRGKRVRENLRVPDTPKNRK
IAGELRASVCFAIRTGTFDYADRFPDSPNLKLFGLVKKDITVGELAQKWL
TLKAMEIGSNALNRYQSVMKNMLPRLGPGRLASSITKEDLLFIRKDLLTG
EKGSRKTSTSRKGRTVPTVNYYMTTTAGMFSFAAENGYLEKNPFNSITPL
RKSKPVPDPLTRDEFSRLIDACHHQQTKNLWTVAVFTGMRHGEIAALAWE
DIDLKAGTITVRRNFTKIGDFTLPKTDAGTNRVIHLLAPAIEALKNQAML
TRLSRQHQITVQLREYGRTILHECTFVFCPQIVRKNHKAGINYAVSSIGA
TWDSAIKRAGIRSRKAYQSRHTYACWALSSGANPTFIASQMGHSSASMVY
NVYGAWMPECSVTQVAMLNNVLNARAPDVPQSDQEDEIKLYFSK
>Z3677 lig, DNA ligase
MESIEQQLTELRTTLRHHEYLYHVMDAPEIPDAEYDRLMRELRELETKHP
ELITPDSPTQRVGAAPLAAFSQIRHEVPMLSLDNVFDEESFLAFNKRVQD
RLKSNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVR
TIRAIPLKLHGENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPR
NAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHLGRLLQFKKWG
LPVSDRVTLCESAEEVLAFYXXXEEDRPTLGFDIDGVVIKVNSLEQQEQL
GFVARAPRWAVAFKFPAQEQMTFVRDVEFQVGRTGAITPVARLEPVHVAG
VLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR
EVVFPTYCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMD
VDGMGDKIIDQLVEKEYVHTPADLFKLTAGKLTGLERMGPKSAQNVVNAL
EKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEALEAASIEELQKVP
DVGIVVASHVHNFFAEESNRNVISELLAEGVHWXAPIVINAEEIDSPFAG
KTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVIAGEAAGSKLA
KAQELGIEVIDEAEMLRLLGS
>Z1754 mfd, transcription-repair coupling factor; mutation frequency decline
MPEQYRYTLPVKAGEQRLLGELTGAACATLVAEIAERHAGPVVLIAPDMQ
NALRLHDEISQFTDQMVMNLADWETLPXDSFSPHQDIISSRLSTLYQLPT
MQRGVLIVPVNTLMQRVCPHSFLHGHALVMEKGQRLSRDALRTQLDSAGY
RHVDQVMEHGEYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQ
RTLEEVEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRDPEHIYQQVSK
GTLPAGIEYWQPLFFSEPLPPLFSYFPANTLLVNTGDLENSAERFQADTL
ARFENRGVDPMRPLLPPQSLWLRVDELFSELKNWPRVQLKTEHLPTKAAN
ANLGFQKLPDLAVQAQQKAPLDALRKFLESFDGPVVFSVESEGRREALGE
LLARIKIAPQRIMRLDEASDRGRYLMIGAAEHGFVDTVRNLALICESDLL
GERVARRRQDSRRTINPDTLIRNLAELHIGQPVVHLEHGVGRYAGMTTLE
AGGITGEYLMLTYANDAKLYVPVSSLHLISRYAGGAEENAPLHKLGGDAW
SRARQKAAEKVRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFE
TTPDQAQAINAVLSDMXQPLAMDRLVCGDVGFGKTEVAMRAAFLAVDNHK
QVAVLVPTTLLAQQHYDNFRDRFANWPVRIEMLSRFRSAKEQTQILAEVA
EGKIDILIGTHKLLQSDVKFKDLGLLIVDEEHRFGVRHKERIKAMRANVD
ILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDSLVVRE
AILREILRGGQVYYLYNDVENIQKAAERLAELVPEARIAIGHGQMREREL
ERVMNDFHHQRFNVLVCTTIIETGIDIPTANTIIIERADHFGLAQLHQLR
GRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDL
EIRGAGELLGEEQSGSMETIGFSLYMELLENAVDALKAGREPSLEDLTSQ
QTEVELRMPSLLPDDFIPDVNTRLSFYKRIASAKTENELEEIKVELIDRF
GLLPDPARTLLDIARLRQQAQKLGIRKLEGNEKGGVIEFAEKNHVNPAWL
IGLLQKQPQHYRLDGPTRLKFIQDLSERKTRIEWVRQFMRELEENAIA
>Z4149 mutH, DNA mismatch repair protein
MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALAGLVTPENLKRDKGW
IGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPL
TGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEE
DRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGAR
GERILTLPRGFYLKKNFTSALLARHFLIQ
>Z5777 mutL, DNA mismatch repair protein
MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGG
AKLIRIRDNGCGIKKDELALALARHATSKIASLDDLEAIISLGFRGEALA
SISSVSRLTLTSRTAEQQEAWQAYAEGRDMDVTVKPAAHPVGTTLEVLDL
FYNTPARRKFLRTEKTEFNHIDEIIRRIALARFDVTINLSHNGKIVRQYR
AVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWVADPNHTTPAL
AEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVD
VNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQQQLETPLPLDDEPQPAPRP
IPENRVAAGRNHFAEPAVREPVAPRYTPAPASGSRPAAPWPNAQPGYQKQ
QGEVYRQLLQTPAPMQKPKAPEPQEPALAANSQSFGRVLTIVHSDCALLE
RDGNISLLALPVAERWLRQVQLTPGEAPVCAQPLLIPLRLKVSGEEKSAL
EKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQS
VFEPGNIAQWIARNLMSENAQWSMAQAITLLADVERLCPQLVKTPPGGLL
QSVDLHPAIKALKDE
>Z5059 mutM, formamidopyrimidine-DNA glycosylase
MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVL
SVQRRAKYLLLELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNG
KVLRYTDPRRFGAWLWTKELEGHNVLAHLGPEPLSDDFNGEYLHQKCAKK
KTAIKPWLMDNKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLAR
VIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRKGEPCRVCGTP
IVATKHAQRATFYCRQCQK
>Z4043 mutS, DNA mismatch repair protein
MSAIENFDAHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQL
LDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPA
TSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDI
SSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRP
LWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRT
TLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVT
PMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLER
ILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL
EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAER
YIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALA
ELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANP
LNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVXIGPI
DRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTS
TYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDAL
EHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLK
SLV
>Z0109 mutT, 7,8-dihydro-8-oxoguanine-triphosphatase, prefers dGTP, causes AT-GC transversions
MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAV
VRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVESWEGEPWGKEGQ
PGKWMSLVGLNADDFPPANEPVIAKLKRVYVG
>Z4306 mutY, adenine glycosylase; G.C --> T.A transversions
MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP
YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLH
GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA
VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCS
LCPLQNGCIAATNNSWSLYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRP
PSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD
IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV
>Z0865 nei, endonuclease VIII and DNA N-glycosylase with an AP lyase activity
MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKTYQSQLIGQHVTHVET
RGKALLTHFSNDLTLYSHNQLYGVWRVVDTGEESQTTRVLRVKLQTADKT
ILLYSASDIEMLTPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFR
NRQFAGLLLDQAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAH
ALLDIPRFSYATRGQVDENKHHGALFRFKVFHRDGELCERCGGIIEKTTL
SSRPFYWCPGCQH
>Z5574 nfi, endonuclease V (deoxyinosine 3' endonuclease)
MDLASLRAQQIELASSVIREDRLDKDPPDLIAGADVGFEQGGEVTRAAMV
LLKYPSLELVEYKVARIATTMPYIPGFLSFREYPALLAAWEMLSQKPDLV
FVDGHGISHPRRLGVASHFGLLVDVPTIGVAKKRLCGKFEPLSSEPGALA
PLMDKGEQLAWVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRLPE
PTRWADAVASERPAFVRYTANQP
>Z3416 nfo, endonuclease IV
MKYIGAHVSAAGGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTID
EFKAACEKYHYTSAQILPHDSYLINLGHPVTEALEKSRDAFIDEMQRCEQ
LGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQ
GSNLGFKFEHLAAIIDGVEDKSRVGVCIDTCHAFAAGYDLRTPAECEKTF
ADFARTVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGBIGHDAFRWIMQD
DRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA
>Z2644 nth, endonuclease III; specific for apurinic and/or apyrimidinic sites
MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKA
TAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKXCRILLEQH
NSEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQF
APGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIE
DLCEYKEKVDI
>Z2917 ntpA, dATP pyrophosphohydrolase
MKDKVYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVTGSVEEGETAP
QAAMREVKEEVTIDVVAEQLTLIDCQRTVEFEIFSHLRHRYAPGVTRNTE
SWFCLALPHERQIVFTEHLAYKWLDAPAAAALTKSWSNRQAIEQFVINAA
>Z5571 nudC, NADH pyrophosphatase
MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQ
GEPVWLIQQQRRYDMGSVRQVIDLDVGLFQLAGRGVQLAEFYRSHKYCGY
CGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSILLAQHTRHR
NGVHTVLAGFVEVGETLEQAVAREVMEESGIKVKHLRYVTSQPWPFPQSL
MTAFMAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIEDTVA
MCRAEYE
>Z2432 ogt, O-6-alkylguanine-DNA/cysteine-protein methyltransferase
MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKEG
YERISATNPGGLSDKLREYFAGNLSIIDTLPTATGGTPFQREVWKTLRTI
PCGQVMHYGELAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMT
GYAGGVQRKEWLLRHEGYLLL
>Z4373 parC, DNA topoisomerase IV subunit A
MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMS
ELGLNASAKFKKSARTVGDVLGKYHPHGDSACYEAMVLMAQPFSYRYPLV
DGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGT
LQEPKMLPARLPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPK
TTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVRMRAVWKKEDG
AVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIV
PRSNRVDMDQVMNHLFATTDLEKSYRINLNMIGLDGRPAVKNLLEILSEW
LVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK
PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGI
LASERKMNNLLKKELQADAQAYGDDRRSPLQEREEAKAMSEHDMLPSEPV
TIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFVDSTG
RSYAIDPITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDA
GYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDASDMLLAITQAG
RMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTI
HVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRVEIDSPRRASSGDS
EE
>Z4387 parE, DNA topoisomerase IV subunit B
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG
HAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKF
SNKNYQFSGGLHGVGISVVNALSKRVEXNVRRDGQVYNIAFENGEKVQDL
QVVGNCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI
TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD
WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI
LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV
KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK
LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV
SSDEVLASQEVHDISVAIGIDPDSDDLSQLRYGKICILADADSDGLHIAT
LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ
LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD
AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV
>Z0859 phrB, deoxyribodipyrimidine photolyase (photoreactivation)
MTTHLVWFRQDLRLHDNLALAAACRNSSARLLALYIATPRQWAAHNMSPR
QAELINAQLNGLQIALAEKGIPLLFREVDDFAASVEIVKQVCAENSVTHL
FYNYQYEVNERARDVQVERTLRNVVCEGFDDSVILPPGAVMTGNHEMYKV
FTPFKNAWLKRLREGMPECVAAPKVRSSGSIKPAPSITLNYPRQSFDTAH
FPVEEKAAIAQLRQFCQNGAGEYEQQRDFPAVEGTSRLSASLATGGLSPR
QCLHRLLAEQPQALDGGAGSVWLNELIWREFYRHLMTYYPSLCKHCPFIA
WTDRVQWQXNPAHLQAWQEGKTGYPIVDAAMRQLNSTGWMHNRLRMITAS
FLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF
NPTTQGEKFDREGEFIRQWLPELRNVPGKSVHEPWKWAEKAGVKLDYPQP
IVEHKEARVQTLAAYEAARKGK
>Z0318 pinH, DNA invertase from prophage CP-933H
MASFLLLSGRSTMLIGYVRVSTNDQNTDLQRNALNCAGCELIFEDKISGT
KSERPGLKKLLRTLSAGDTLVVWKLDRLGRSMRHLVILVEELRERGVNFR
SLTDAIDTSTPMGRFFFHVMGALAEMERELIVERTKAGLEAARAQGRIGG
RRPKLTPEQWAQAGRLIAAGIPRQKVAIIYDVGVSTLYKKFPAGDK
>Z5398 polA, DNA polymerase I
MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLI
MQYKPTHAAVVFDAKGKTFRDELFEHYKSHRPPMPDDLRAQIEPLHAMVK
AMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNIT
LINTMTNTILGPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKT
AQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLEQNKEVAYLSYQLA
TIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEAGKWLQAK
GAKPAAKPQETSVADEAPEVTATVISYDNYVTILDEETLKAWIAKLEKAP
VFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRER
ALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTMLESYILN
SVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAE
DADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPK
VLHNHSEELTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLK
KTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTYTDKLPLMINP
KTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYV
IVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLETVT
SEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLE
YMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQG
TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHRDDVDAVAKQI
HQLMENCTRLDVPLLVEVGSGENWDQAH
>Z0068 polB, DNA polymerase II
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQV
PRAQHILRGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGG
VTVYEADVRPPERYLMERFITSPVWVEGDMHNGAIVNARLKPHPDYRPPL
KWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASALDFELEYVAS
RPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRIPLRLGRDN
SELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQEL
LGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMP
FLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASP
GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHS
TEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFY
GVLGTTACRFFDPRLVSSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTF
VWLKGAHSEEEATKIGRALVQHVNVWWAETLQKQQLTSALELEYETHFCR
FLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQ
ELYLRIFRNEPYQEYFRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVP
PHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYE
HYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF
>Z5482 priA, primosome assembly protein PriA
MPVAHVALPVPLPRTFDYLLPEGMAVKAGCRVRVPFGKQQERIGVVVSVS
DVSELPLNELKAVVEVLDSEPVFTHSVWRLLLWAADYYHHPIGDVLFHAL
PILLRQGRPAANAPMWYWFATEQGHAVDLNSLKRSPKQQQALAALRQGKI
WRDQVATLEFNDAALQALRKKGLCDLASETPEFSDWRTNYAVSGERLRLN
TEQATAVGAIHSAADTFSAWLLAGVTGSGKTEVYLSVLENVLAQGKQALV
MVPEIGLTPQTIARFRERFNAPVEVLHSGLNDSERLSAWLKAKNGEAAIV
IGTRSALFTPFKNLGVIVIDEEHDSSYKQQEGWRYHARDLAVYRAHSEQI
PIILGSATPALETLCNVQQKKYRLLRLTRRAGNARPAIQHVLDLKGQKVQ
AGLAPALITRMRQHLQADNQVILFLNRRGFAPALLCHDCGWIAECPRCDH
YYTLHQAQQHLRCHHCDSQRPVPRQCPSCGSTHLVPVGLGTEQLEQTLAP
LFPGVPISRIDRDTTSRKGALEQQLAEVHRGGARILIGTQMLAKGHHFPD
VTLVALLDVDGALFSADFRSAERFAQLYTQVAGRAGRAGKQGEVVLQTHH
PEHPLLQTLLYKGYDAFAEQALAERRMMQLPPWTSHVIVRAEDHNNQHAP
LFLQQLRNLILSSPLADDKLWVLGPVPALAPKRGGRWRWQILLQHPSRVR
LQHIINGTLALINTIPDSRKVKWVLDVDPIEG
>Z5810 priB, primosomal replication protein N
MTNRLVLSGTVCRTPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQM
PVIVSGHENQAITHSITVGSRITVQGFISCHKAKNGLSKMVLHAEQIELI
DSGD
>Z0584 priC, primosomal replication protein N''
MKTALLLEKLEXQLATLRQRCAPVSQFATLSARFNRHLFQTRATTLQACL
DKAGDNLAALRHAVEQQQLPQVAWLAEHLAAQLEAIAREATAWSLREWDS
APPQIARWQRKRIQHQDFERRLREMVAERRARLARVTDLVEQQTLHREVE
AYEARLARCRHALEKIENRLARLTR
>Z5062 radC, DNA repair protein RadC
MKVKNNAQLLMPREKMLKFGISALTDVELLALFLRTGTRGKDVLTLAKEM
LENFGSLYGLLTSEYEQFSGVHGIGVAKFAQLKGIAELARRYYNVRMREE
SPLLSPEMTREFLQSQLTGEEREIFMVIFLDSQHRVITHSRLFSGTLNHV
EVHPREIIREAIKINASALILAHNHPSGCAEPSKADKLITERIIKSCQFM
DLRVLDHIVIGRGEYVSFAERGWI
>Z0492 rdgC, recombination associated protein
MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMG
SHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLK
KTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDT
LALLRKSLGSLPVVPLSMANPIELTLTEWVRSGSAAQGFQLLDEAELKSL
LEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSL
KRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGE
AQR
>Z4002 recA, recombinase A
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDI
ALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHAL
DPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVNVIVVDSVAAL
TPKAEIEXEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKI
GVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKN
KIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIG
QGKANATAWLKDNPETAKEIEKKVRELLLSNPNSTPDFSVDDSEGVAETN
EDF
>Z4137 recB, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MSDVAETLDPLRLPFQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAF
PRPLTVEELLVVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLYERL
LEEIDDKAQAAKWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGMLFEQ
QLIEDESLLRYQACADFWRRHCYPLPREIAQVVFETWKGPQALLRDINRY
LQGEAPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAVGELDALIESS
GIDRRKFNRSNQAKWIDKISAWAEEERNSYQLPESLEKFSQRFLEDRTKA
GGETPRHPLFEAIDQLLAEPLSIRDLVITRALAEIRETVAREKRRRGELG
FDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIFRRI
WHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNWRSA
PGMVNSVNKLFSQTDDAFMFREIPFIPVKSAGKNQALRFVFKGETQPAMK
MWLMEGESCGVGDYQSTMAQVSAAQIRDWLQAGQRGEALLMNGDDARPVR
ASDISVLVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLL
QAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDAVVEEFDGYRQ
IWRKRGVMPMMRALMSARNIAENLLATAGGERRLTDILHISELLQEAGTQ
LESEHALVRWLSQHILEPDSNASSQQMRLESDKHLVQIVTIHKSKGLEYP
LVWLPFITNFRVQDQAFYHDRHSFEAVLDLNAAPESVDLAEVERLAEDLR
LLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDA
AGLRTCIEALCDDDIAWQTAQTGDNQPWQVNDALTAELNARTLQRLPGDN
WRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVVSVVEEPTLTPHQFPRGAS
PGTFLHSLFEDLNFTQPVDPNWVQEKLELGGFESQWEPVLTEWITAVLQA
PLNETGVSLSQLSDRDKQVEMEFYLPISEPLIASQLDALIRQFDPLSAGC
PPLEFMQARGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQAMAA
AMQAHRYDLQYQLYTLALHRYLRHRIADYDYERHFGGVIYLFLRGVDKEH
PQQGIYTTRPNAGLIDLMDEMFAGMTLEEA
>Z4139 recC, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTL
SQKFGIAANIDFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLP
QLLEREDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPDWLAQWE
TGHLVEGLGEAQAWQAPLWKALVEYTHELGQPRWHRANLYQRFIETLESA
TTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLFTNPCRYYWGD
IKDPAYLAKLLTRQRRHSFEDHELPLFRDSENAGQLFNSDGEQDVGNPLL
ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAV
AGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEED
PTLTPRDIIVMVADIDSYSPFIQAVFGSAPADRYLPYAISDRRARQSHPV
LEAFISLLSLPDSRFVSEDVLALLDVPVLAARFDITEEGLRYLRQWVNES
GIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPY
DESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLNALFL
PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELAQRLDQERI
SQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQK
PKRGDRSRRDDDRYLFLEALISAQQKLYISYIGRSIQDNSERFPSVLVQE
LIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ
SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQ
MRLQVNFRTEDSEIPDTEPFILEGLSRYQINQQLLNALVEQDDAERLFRR
FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGV
QITGWLPQVQPDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFL
RKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPESGGAWLKTCY
DAQNDAMLDDDSTLQKARTKFLQAYEGNMMVSGEGDDIWYQRLWRQLTPE
TMEAIVEQSQRFLLPLFRFNQS
>Z4136 recD, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH
VCLPLSRLENNEESHPLLATCVSEIGELQNWEECLLASQAVSRGDEPTPM
ILCGDRLYLNRMWCNERTVARFFNEVNHAIEVDEALLAQTLDKLFPTGDE
INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR
LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLHRLLGAQPGS
QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ
LASVEAGAVLGDICAYANAGFTAERAGQLSRLTGTHVPAGTGTEAASLRD
SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS
GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG
VAGLNERIEQFMQQKRKIHRNPHSRWYEGRPVMIARNDSALGLFNGDIGI
ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA
LILPSQRTPVVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL
AALFXSRE
>Z5191 recF, recombination protein F
MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGR
AFRSLQIGRVIRHEQEAFVLHGRLQGEERETAIGLTKDKQGDSKVRIDGT
DGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWGCFHNEPGFFTA
WSNLKRLLKQRNAALRQVTRYEQLRPWDKELIPLAEQISTWRAEYSAGIA
ADMADTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFERDRQLTYTAHG
PHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYL
IDDFASELDDERRGLLASRLKATQSQVFVSAISAEHVIDMSDENSKMFTV
EKGKITD
>Z5078 recG, DNA helicase, resolution of Holliday junctions, branch migration
MGYYAGCRVSAMTGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLH
LPLRYEDRTHLYPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGS
GILTMRFFNFSAAMKNSLATGRRVLAYGEAKRGKYGAEMIHPEYRVQGDL
STPELQETLTPVYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQ
GMMTLPEALRTLHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLAL
RAGAQRFHAQPLSANDALKNKLLAALPFKPTGAQARVVAEIERDMALDVP
MMRLVQGDVGSGKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRN
WFEPLGIEVGWLAGKQKGKARLSQQEAIASGQVQMIVGTHAIFQEQVQFN
GLALVIIDEQHRFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTA
YADLDTSVIDELPPGRTPVTTVAIPDTRRTDIIDRVRHACITEGRQAYWV
CTLIEESELLEAQAAEATWEELKLALPELNVGLVHGRMKPAEKQAVMASF
KQGELHLLVATTVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGA
VASHCVLLYKTPLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTR
QTGNAEFKVADLLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETER
YSNA
>Z4230 recJ, ssDNA exonuclease, 5' --> 3' specific
MKQQIQLRRREVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLP
WQQLSGVEKAVEILYNAFREGTRIIVVGDFDADGATSTALSVLAMRSLGC
SNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHAR
SLGIPVIVTDHHLPGDTLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLML
ALRTFLRDQGWFDERGIAIPNLAELLDLVALGTVADVVPLDANNRILTWQ
GMSRIRAGKCRPGIKALLEVANRDAQKLAASDLGFALGPRLNAAGRLDDM
SVGVALLLCDNIGEARVLANELDALNQTRKEIEQGMQIEALTLCEKLERS
RDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG
RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEEDKFELFQQRFG
ELVTEWLDPSLLQGEVVSDGPLSPAEMTMEVAQLLRDAGPWGQMFPEPLF
DGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREV
QLAYKLDINEFRGNRSLQIIIDNIWPI
>Z3909 recN, protein used in recombination and DNA repair
MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRA
EADMVRTGAARADLCARFSLKDTPAALRWLEENQLEDGHECLLRRVISSD
GRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYA
NETSLLQEMTARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEF
NPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDANLQSQLYTAK
QLVSELIGMDSKLSGVLDMLEEATIQIVEASDELRHYCDRLDLDPNRLFE
LEQRISKQISLARKHHVSPETLPQYYQSLLEEQQQLDDQADSQETLALAV
TKHHQQALETARALHQQRQHYAEELAQLITDSMHALSMPHGQFTIDVKFD
EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARK
METPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGH
QHYFVSKETDGAMTETHMQSLNKKTRLQELARLLGGSEVTRNTLANAKEL
LAA
>Z3846 recO, DNA repair protein RecO
MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGAL
QPFTPLLLRFGGRGEVKTLRSAEAVSLALPLSGITLYSGLYINELLSRVL
EYETRFSELFFDYLHCIQSLAGVTGTPEPALRRFELALLGHLGYGVNFTH
CAGSGEPVDDTMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDAD
TLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTHYE
>Z5343 recQ, ATP-dependent DNA helicase
MNVAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMP
TGGGKSLCYQIPALLLNGLTVVVSPLISLMKDQVDQLQANGVAAACLNST
QTREQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEA
HCISQWGHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGL
NDPLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGIIYCNSRAK
VEDTAARLQSKGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGM
GINKPNVRFVVHFDIPRNIESYYQETGRAGRDGLPAEAMLFYDPADMAWL
RRCLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCG
NCDICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIR
DYGHDKLKVYGMGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLTEAA
RPVLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIAD
ESNVPPYVVFNDATLIEMAEQMPITASEMLSVNGVGMRKLERFGKPFMAL
IRAHVDGDDEE
>Z0589 recR, recombination protein RecR
MQTSPLLTQLMEALRCLPGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRA
MSEIGHCADCRTFTEQEVCNICSNPRRQENGQICVVESPADIYAIEQTGQ
FSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEG
EATANYIAELCAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIR
F
>Z2410 recT, recombinase, DNA renaturation protein encoded by prophage CP-933R
MTKQPPIAKADLQKTQGNRAPAAIKNNDVISFINQPSMKEQLAAALPRHM
TAERMIRIATTEIRKVPALGNCDTMSFVSAIVQCSQLGLEPGSALGHAYL
LPFGNKNEKSGKKNVQLIIGYRGMIDLARRSGQIASLSARVVREGDEFNF
EFGLDEKLIHRPGENEDAPVTHVYAVARLKDGGTQFEVMTRKQIELVRSQ
SKAGNNGPWVTHWEEMAKKTAIRRLFKYLPVSIEIQRAVSMDEKEPLTID
PADSSVLTGEYSVIDNSEE
>Z5288 rep, rep helicase, a single-stranded DNA dependent ATPase
MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHI
AAVTFTNKAAREMKERVXQTLGRKEARGLMISTFHTLGLDIIKREYAALG
MKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPA
QAAAEAKGERDRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEV
RERWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVGDDDQSIYSW
RGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEK
RLFSELGYGTELKVLSANNEEHEAERVTGELIAHHFVNKTQYKDYAILYR
GNHQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS
AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGY
EALTRFTHWLAEIQRLAEREPIAAVRDLIHGMDYESWLYETSPSPKAAEM
RMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELD
QVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGI
TRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIWEQERKVVSAE
ERMQKGQSHLANLKAMMAAKRGK
>Z5290 rhlB, ATP-dependent RNA helicase
MSKTHLTEQKFSDFALHPKVVEALEKKGFHNCTPIQALALPLTLAGRDVA
GQAQTGTGKTMAFLTSTFHCLLSHPAIADRKVNQPRALIMAPTRELAVQI
HADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAK
QNHINLGAIQVVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSAT
LSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSNEEKMRLLQTL
IEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILD
EFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRA
GANGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT
RPRTGNGPRRTGAPRNRRRSG
>Z1017 rhlE, putative ATP-dependent RNA helicase
MSFDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTG
KTAGFTLPLLQHLITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSK
YLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQV
EILVLDEADRMLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEK
LLHNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMIGKGNWQQVLV
FTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVLV
ATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC
VDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGGGRG
QGGGGRGQQQPRRGEGGAKSASAKPAEKPSRRLGDAKPAGEQQRRRRPRK
PAAAQ
>Z0239 rnhA, ribonuclease H
MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELM
AAIVALEALKEHCEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKN
VDLWQRLDAALGQHQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTG
YQVEV
>Z0195 rnhB, ribonuclease HII
MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLS
EKRRLALCEEIKEKALSWSLGRAEPHEIDELNILHATMLAMQRAVAGLHI
APEYVLIDGNRCPKLPMPAMAVVKGDSRVPEISAASILAKVTRDAEMAAL
DIVFPQYGFAQHKGYPTAFHLEKLAEHGATEHHRRSFGPVKRALGLAS
>Z2671 rnt, ribonuclease T
MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQG
WLMPDTTLHFHVEPFVGANLQPEALAFNGIDPNDPDRGAVSEYEALHEIF
KVVRKGIKASGCNRAIMVAHNANFDHSFMMAAAERASLKRNPFHPFATFD
TAALAGLALGQTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNR
WKRLGGWPLPAAEEV
>Z1873 rus, endodeoxyribonuclease RUS (Holliday junction resolvase) of prophage CP-933X
MNTYSITLPWPPSNNRYYRHNRGRTHISAEGQAYRDNVARIIKGSMLDIG
LAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVV
KMPVTKGGKLELTITELGNE
>Z2913 ruvA, Holliday junction DNA helicase motor protein
MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTH
FVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVN
AVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTS
PASPATDDAEQEAVAALVALGYKPQEASRMVSKIARPDASSETLIREALR
AAL
>Z2912 ruvB, Holliday junction DNA helicase RuvB
MIEADRLISAGTTLPEDVADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKL
RGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAM
LTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKI
DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFM
GLEMSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDGTISADIAAQA
LDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDV
LEPYLIQQGFLQRTPRGRMATTRAWNHFGITPPEMP
>Z2915 ruvC, Holliday junction resolvase
MAIILGIDPGSRVTGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYA
GVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVF
EYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITH
CHVSQNAMQMSESRLNLARGRLR
>Z3173 sbcB, exonuclease I, 3' --> 5' specific; deoxyribophosphodiesterase
MMNDGKQQSTFLFHDYETFGTHPALDRPAQFAAIRTDDEFNVIGEPEVFY
CKPADDYLPQPGAVLITGITPQEARAKGENEAAFAARIHSLFTVPKTCIL
GYNNVRFDDEVTRNVFYRNFYDPYAWSWQHDNSRWDLLDVMRACYALRPE
GINWPENDDGLPSFRLEHLTKANGIEHSNAHDAMADVYATIAMAKLVKTR
QPRLFDYLFTHRNKHKLMALIDVPQMKPLVHVSGMFGAWRGNTSWVAPLA
WHPENRNAVIMVDLAGDISPLLELDSDTLRERLYTAKTDLGDNAAVPVKL
VHINKCPVLAQANTLRPEDADRLGINRQHCLDNLKILRENPQVREKVVAI
FAEAEPFTPSDNVDAQLYNGFFSDADRAAMKIVLETEPRNLPALDITFVD
KRIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQVFTPEFLQGYAEEIQML
AQQYAXDKEKVALLKALWQYAEXXV
>Z0495 sbcC, ATP-dependent dsDNA exonuclease
MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAI
CLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNR
ARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRS
MLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTEL
EKLQAQASGVALLTPEQVQSLTASLQVLTDEEKQLLTAQQQEQQSLNWLT
RLDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIA
EHSAALAHTRQQIEEVNTRLQNTMALRASIRHHAAKQSAELQQQQQSLNT
WLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA
AITLMLTADEVATALAQHAEQRPLRQRLVALHGQIVPQQKRLAQLMVTIQ
NVTLEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQL
QAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGAALR
GQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPQDDIQP
WLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTAL
AGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILET
LPQSDDLPHSEETVALDNWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKA
QAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV
TQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQ
QLKQDADNRQQQQTLLQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQG
LTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTL
SGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDAL
DALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK
>Z0496 sbcD, ATP-dependent dsDNA exonuclease
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVF
DTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFL
NTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQ
QHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIY
IGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECG
KSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVS
QEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLA
SQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA
>Z3170 sbmC, SbmC protein
MNYEIKQEDKRTVAGFHLVGPWEQTVKKGFEQLMMWVDSKNIVPKEWVAV
YYDNPDETPAEKLRCDTVVTVPNNFTLPENSEGVILTEISGGQYAVAVAR
VVGDDFAKPWYQFFNSLLQDSAYEMLPKPCFEVYLNNGAEDGYWDIEMYV
AVQPKHH
>Z0836 seqA, negative modulator of initiation of replication
MKTIEVDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVR
VASPAIVEAKPIKTIKDKVRAMRELLLSDEYAEQKRAVNRFMLLLSTLYS
LDAQAFAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITN
TNTGRKCSMIEHIMQSMQFPAELIEKVCGTI
>Z4656 smf, hypothetical protein
MVDTEIWLRLMSISSLYGDDMVRIAHWLARQSHIDAVVLQQTGLTLRQAQ
RFLSFPRKSIESSLCWLEQPNHHLIPADSEFYPPQLLATTDYPGALFVEG
ELHALHSFQLAVVGSRAHSWYGERWGRLFCETLAKHGVTITSGLARGIDG
VAHKAALQVNGVSIAVLGNGLNTIHPRRHARLAASLLEQGGALVSEFPLD
VPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCALEQGREVFAL
PGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPBAPENSFYS
PDQEDVALPFPELLANVGDEVTPVDVVAERAGQPVPEVVTQLLELELAGW
IAAVPGGYVRLRRACHVRRTNVFV
>Z3859 srmB, ATP-dependent RNA helicase
MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPT
GTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVADHARELA
KHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRA
VETLIXDEADRMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFA
ERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLVHLLKQPEATR
SIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVN
VLVATDVAARGIDIPDVSHVFNFDMPRSGDTYLHRIGRTARAGRKGTAIS
LVEAHDHLLLGKVGRYIEEPIKARVIDELRPKTRAPSEKQTGKPSKKVLA
KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE
>Z5658 ssb, single-strand DNA-binding protein
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMK
EQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTT
EVVVNVGGTMXMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG
GAQSRPQQSAPAAPSNEPPMDFDDDIPF
>Z4974 tag, 3-methyladenine DNA glycosylase I
MERCGWVSQDPLYIAYHDNEWGVPETDRKKLFEMICLEGQQAGLSWITVL
KKRENYRACFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNAR
AYLQMEQNGEPFADFVWSFVNHQPQVTQATTLSEIPTSTPASDALSKALK
KRGFKFVGTTICYSFMQACGLVNDHVVGCCCYPGNKP
>Z5361 tatD, hypothetical protein
MEYRMFDIGVNLTSSQFAKDRDDVVARAFDAGVNGLLITGTNLRESQQAQ
KLARQYSSCWSTAGVHPHDSSQWQAATEEAIIELAAQPEVVAIGECGLDF
NRNFSTPEEQERAFVAQLRIAADLNMPVFMHCRDAHERFMTLLEPWLDKL
PGAVLHCFTGTREEMQACVAHGIYIGITGWVCDERRGLELRELLPLIPAE
KLLIETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGEDAAWLAA
TTDANVKTLFGIAF
>Z2536 topA, DNA topoisomerase I
MGKALVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTSGSAAKKSADS
TSTKTAKKPKKDERGALVNRMGVDPWHNWEAHYEVLPGKEKVVSELKQLA
EKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITKNAIRQA
FNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVA
VRLVVEREREIKAFVPEEFWEVDASTTTPSGEALALQVTHQNDKPFRPVN
KEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFG
VKKTMMMAQRLYEAGYITYMRTDSTNLSQDAVNMVRGYISDNFGKKYLPE
SPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEADAQKLYQLIWRQFVA
CQMTPAKYDSTTLTVGAGDFRLKARGRILRFDGWTKVMPALRKGDEDRIL
PAVDKGDALTLVELTPAQHFTKPPARFSEASLVKELEKRGIGRPSTYASI
ISTIQDRGYVRVENRRFYAEKMGEIVTDRLEENFRELMNYDFTAQMENSL
DQVANHEAEWKAVLDHFFSDFTQQLDKAEKDPEEGGMRPNQMVLTSIDCP
TCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINLVPENEVLNVLEGED
AETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNNPTCDGYEIEEGEFR
IKGYDGPIVECEKCGSEMHLKMGRFGKYMACTNEECKNTRKILRNGEVAP
PKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTFPKSRETRAPLVEEL
YRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTKQQYVSSEKDGKATG
WSAFYVDGKWVEGKK
>Z2796 topB, DNA topoisomerase III
MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQ
PDAYDSRYARWNLADLPIVPEKWQLQPRPSVTKQLNVIKRYLHEASEIVH
AGDPDREGQLLVDEVLDYLQLAPEKRQQVQRCLINDLNPQAVERAIDRLR
SNSEFVPLCVSALARARADWLYGINMTRAYTILGRNAGYQGVLSVGRVQT
PVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIWQPSEACEPYQ
DEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIE
AAKRFGLSAQNVLDICQKLYETHKLITYPRSDCRYLPEEHFAGRHAVMNA
ISVHAPDLLPQPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEA
KVYNLISRQYLMQFCPDAVFRKCVIELDIAKGKFVAKARFLAEAGWRTLL
GSKERDEENDGTPLPVVAKGDELLCEKGEVVERQTQPPRHFTDATLLSAM
TGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFKRGFLIKKGRYIHS
TDAGKALFHSLPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLY
QLIDQAKRTPVRQFRGIVAPGSGGSADKKKAAPRKRSAKKSPPADEAGSG
AIA
>Z2993 tra8_2, IS30 transposase encoded within prophage CP-933T
MSDFINNVSVDSIGQRNSYVKTWGCGGLELWKNGTGFSEIANILGSKPGT
IFTMLRDTGGIKPHERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNR
SPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVL
EKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNI
QHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEG
DLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSEL
RKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQY
FPKKTCLAQYTQHEDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD
>Z3936 tra8_3, IS30 transposase
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRG
RRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISG
WLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRR
HTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIDSF
F
>Z1947 umuC, DNA polymerase IV
MFALCDVNAFYASCETVFRPELWGKPVVVLSNNDGCVIARNAEAKALGVK
MGDPWFKQKDLFRRCGVVCFSSNYELYADMSNRVMSTLEELSPRVEIYSI
DEAFCDLTGVRNCRDLTDFGREIRATVLQRTHLTVGVGIAQTKTLAKLAN
HAAKKWQRQTGGVVDLSNLXXQRKLMSALPVDDVWGIGRRISKKLDAMGI
KTVLDLADTDIRFIRKHFNVVLERTVRELRGEPCLQLEEFAPTKQEIICS
RSFGERITDYTSMRQAICSYAARAAEKLRSEHQYCRFISTFIKTSPFALN
EPYYGNSASVKLLTPTQDSRDIINAATRSLDAIWQAGHRYQKAGVMLGDF
FSQGVAQLNLFDDNAPRPGSEQLMAVMDTLNAKEGRGTLYFAGQGIQQQW
QMKRAMLSPRYTTRSSDLLRVK
>Z3864 ung, uracil-DNA glycosylase
MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF
TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIATPPSLLNMYKELENTI
PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS
LINQHREGVVFLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGC
NHFVLANQWLEQHGETPIDWMPVLPAESE
>Z5657 uvrA, excinuclease ABC subunit A
MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT
ITEIHDYLRLLYARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML
LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI
EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN
FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL
SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYG
SGKENIEFKYMNDRGDTSIRRHPFEXVLHNMERRYKETESSAVREELAKF
ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG
QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ
IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA
IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV
PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI
NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN
PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK
VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF
DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG
QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW
IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
>Z0998 uvrB, excinuclease ABC subunit B
MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIA
NVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEA
YVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGD
PDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVI
DIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYV
TPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNEL
GYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGM
YRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE
LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVL
VTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKA
ILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALG
QNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNL
EFEEAAQIRDQLHQLRELFIAAS
>Z3001 uvrC, excinuclease ABC subunit C
MYDAGGTVIYVGKAKDLKKRLSSYFRSNLASRKTEALVAQIQQIDVTVTH
TETEALLLEHNYIKLYQPRYNVLLRDDKSYPFIFLSGDTHPRLAMHRGAK
HAKGEYFGPFPNGYAVRETLALLQKIFPIRQCENSVYRNRSRPCLQYQIG
RCLGPCVEGLVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQNLE
FEEAARIRDQIQAVRRVTEKQFVSNTGDDLDVIGVAFDAGMACVHVLFIR
QGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILLDFN
LSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSKLSQ
QSTVHQRLTALASVLKLPEVKRMECFDISHTMGEQTVASCVVFDANGPLR
AEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGKGQL
AQAKNVFAELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFSLPP
DSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLETIEGVGPKRRQML
LKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH
>Z5330 uvrD, DNA-dependent ATPase I and helicase II
MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVE
NCSPYSIMAVTFTNKAAAEMRHRIGQLMGTSQGGMWVGTFHGLAHRLLRA
HHMDANLPQDFQILDSEDQLRLLKRLIKAMNLDEKQWPPRQAMWYINSQK
DEGLRPHHIQSYGNPVEQTWQKVYQAYQEACDRAGLVDFAELLLRAHELW
LNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDTGKVMIVGDDD
QSIYGWRGAQVENIQRFLNDFPGAETIRLEQNYRSTSNILSAANALIENN
NGRLGKKLWTDGADGEPISLYCAFNELDEARFVVNRIKTWQDNGGALAEC
AILYRSNAQSRVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIAN
RNDDAAFERVVNTPTRGIGDRTLDVVRQTSRDRQLTLWQACRELLQEKAL
AGRAASALQRFMELIDALAQETADMPLHVQTDRVIKDSGLRTMYEQEKGE
KGQTRIENLEELVTATRQFSYNEEDEDLMPLQAFLSHAALEAGEGQADTW
QDAVQLMTLHSAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAY
VGVTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEECVEEVRLRATV
SRPVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQV
AFQGQGIKWLVAAYARLETV
>Z3053 vsr, DNA mismatch endonuclease, patch repair protein
MADVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGRP
DFVVDEYRCVIFTHGCFWHHHHCYLFKVPATRTEFWLEKIGKNVERDRRD
ISRLQELGWRVLIVWECALRGREKLKDEALTERLEEWICGEGASAQIDTQ
GIHLLA
>Z5054 waaP, putative LPS biosynthesis enzyme
MVWMVELKEPFATLWRGKDPFEEVKTLQGEVFRELETRRTLRFEMAGKSY
FLKWHRGTTLKEIIKNLLSLRMPVLGADREWNAIHRLRDVGVDTMYGVAF
GEKGMNPLTRTSFIITEDLTPTISLEDYCADWATNPPDVRVKRMLIKRVA
TMVHDMHAAGINHRDCYICHFLLHLPFSGKEEELKISVIDLHRAQLRTRV
PRRWRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFAAPLKDILKQEQGLLS
QAEAKATKIRERTIRKSL
>Z5052 waaY, putative LPS biosynthesis protein
MIYNKTINGVKVFIKDNDPFYEQVLNDFLTCRVKTLKVFRSIDDTKVILI
DTARGPLVLKVYAPKHKMTERFLKSCIKKDYYENLIYQTDRVRGEGIQSI
NDYFLLAERKTLNFAHYYIMLIEYIEGVGLNEYLEISEDLKDQLSESIKE
LHQHGMVSGDPHKGNFIVSEKGLRLIDLSGKKTTAVLKAKDRIDLERHYN
IKNELKDFGYTYLIFKKKIKKVIRDVKVKLGLKSK
>Z3196 wbdQ, GDP-mannose mannosylhydrolase
MFLHSQDFATIVRSTPLISIDLIVENEFGEILLGKRINRPAQGYWFVPGG
RVLKDEKLQTAFERLTEIELGIRLPLSVGKFYGIWQHFYEDNSMGGDFST
HYIVIAFLLKLQPNILKLPKSQHNAYCWLSRAKLINDDDVHYNCRAYFNN
KTNDAIGLDNKDIICLMRQ
>Z3215 wcaH, GDP-mannose mannosyl hydrolase
MMFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPG
GRVQKDETLEAAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFT
THYVVLGFRFRVAEEELLLPDEQHDDYRWLTPDALLASNDVHANSRAYFL
AEKRAGVPGL
>Z5328 xerC, tyrosine recombinase
MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQNWQQ
CDAAMVRNFAVRSRRKGLGAASLALRLSALRSFFDWLVSQNELKANPAKG
VSAPKTPRHLPKNIDVDDINRLLDIDINDPLAVRDRAMLEVMYGAGLRLS
ELVGLDIKHLDLESGEVWVMGKGSKERRLPIGRNALSWIEHWLDLRDLFG
SEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPHKLRHSFATHM
LESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK
>Z4232 xerD, tyrosine recombinase
MKQDLARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATA
QSDDLQALLAERLEGGYKATSSARLLSAVRRLFQYLYREKFREDDPSAHL
ASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVS
ELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWL
LNGVSIDVLFPSQRAQQMTRQTLWHRIKHYAVLAGIDSEKLSPHVLRHAF
ATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA
>Z4115 xni, exonuclease IX
MRGLFPISHPAIACSSIECYPYRLIFKGVIVAVHLLIVDALNLIRRIHAV
QGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKAGR
PPMPEELHDEMPALRAAFEQRGVPCWSTSGNEADDLAATLAVKVTQAGHQ
ATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGVQPQQLPDYWG
LAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLET
HKEMAFLCRDIARLQTDLHIDGNLQQLRLVR
>Z3773 xseA, exodeoxyribonuclease VII large subunit
MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT
LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYXPRGDYXIIVE
SMQPAGEGLLQXKYEQLKAKLQAEGLFDLQYKKPLPSPAHCVGVITSKTG
AALHDILHVLKRRDPSLPVIIYPTAVQGDDAPGQIVRAIELANQCNECDV
LIVGRGGGSLEDLWSFNDERVARAIFASRIPVVSAVGHETDVTIADFVAD
LRAPTPSAAAEVVSRNQQELLRQVQSTRQRLEMAMDYYLANRTRRFTQIH
HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRAGQQQQRLTQRLNQ
QNPQPKIHRAQTRIQQLEYRLAETLRAQLSATRERFGNAVTHLEAVSPLS
TLARGYSVTTATDGKVLKKVKQVKAGEMLTTRLEDGWVESEVKNIQPVKK
SRKKVH
>Z0525 xseB, exodeoxyribonuclease VII small subunit
MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQ
AKLQQAEQRVQILLSDNEDASLTPFTPDNE
>Z2781 xthA, exonuclease III
MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKL
GYNVFYHGQKGHYGVALLTKETPIAVRRGFPGDEEEAQRRIIMAEIPSPL
GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRENPVLIMG
DMNISPGDLDIGIGEENRKRWLRTGKCSFLPEEREWMERLMSWGLVDTFR
HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGIDYEI
RSMEKPSDHAPVWATFRR
>Z0127 yacH, hypothetical protein
MKMTLPFKPHVLALICSAGLCAASAGLYIKSRTVEAPVETQSTQLAVSDA
AAVTLPATVSAPPVTPAVVKSAFSTAQIDQWVAPVALYPDALLSQVLMAS
TYPTNVAQAVQWSHDNPLKQGDAAIQAVSDQPWDASVKSLVAFPQLMALM
GENPQWVQNLGDAFLAQPQDVMDSVQRLRQLAQQTGSLKSSTEQKVITTT
KKTVPVTQTVTAPVIPSNTVSTANPVITEPATTVISIEPGNPDVVYIPNY
NPTVVYGNWANTAYPPVYLPPPAGEPFVDSFVRGFGYSMGVATTYALFSS
IDWDDDDHDHHHHDDDNYHHHDGGHRDGNGWQHNGDNINIDVNNFNRITG
EHLTDKNMAWRHNPNYRNGVPYHDQDMAKRFHQTDVNGGMSATQLPAPTR
DSQRQAAASQFQQRTHAAPVITRDTQRQAAAQRFNEAEHYGSYDDFRDFS
RRQPLTQQQKDAARQRYQSASPEQRQAVHEKMQTNPQNQQRREAARERIQ
PASPEQRQAVREKMQTNPQIQQRRDAARERIQSASPEQRQVFKEKVQQRP
LNQQQRDNARQRVQSASPEQRQVFREKVQESRPQRLNDSNHTARLNNEQR
SAVRERLSERGARRLER
>Z0288 yafM, hypothetical protein
MSEYRRYYIKGGTWFFTVNLRNRRSHLLTAQFQMLRNAIINVKRDRPFEI
NAWVVLPEHMHCIWTLPESDDDFSSRWREIKKQFTHACGLKNIWQPRF
>Z0549 ybaV, hypothetical protein
MKHGIKALLITLSLACAGMSHSALAAASVAKPTAVETKAEAPAAQNKAAV
PAKASDEEGSRVSINNASAEELARAMNGVGLKKAQAIVSYREEYGPFKTV
EDLKQVPGMGNSLVERNLAVLTL
>Z0566 ybaZ, hypothetical protein
MRLHSGVFPDYAEKLSQEEKMEKEDSFPQRVWQIVAAIPEGYVTTYGDVA
KLAGSPRAARQVGGVLKRLPEGSTLPWHRVVNRHGTISLTGPDLQRQRQA
LLAEGVMVSGSGQIDLQRYRWNY
>Z1110 ybjD, hypothetical protein
MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES
DLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW
TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQAHHL
VRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNL
SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII
NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH
PIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWR
LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC
GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV
RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRVAQIPENVPMNL
RKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGR
AD
>Z1238 ycaJ, putative polynucleotide enzyme
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHL
HSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQ
NRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELN
SALLSRARVYLLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAI
AELVNGDARRALNTLEMMADMAEVDDSGKRVLXPELLTEIAGERSARFDN
KGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS
EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVY
TAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR
>Z1739 ycfH, hypothetical protein
MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPGYLHM
RDLVGERDNVVFSCGVHPLNQNDPYDVEDLRRLAAEEGVVALGETGLDYY
YTPETKVRQQESFIHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDC
GGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNAEQLRDAARYVPLDRL
LVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVEELAQVTTDNF
ARLFHIDASRLQSIR
>Z2856 yeaB, hypothetical protein
MEYRSLTLDDFLSRFQLLRPQINRETLNHRQAAVLIPIVRRPQPGLLLTQ
RSIHLRKHAGQVAFPGGAVDDTDASVIAAALREAEEEVAIPPSAVEVIGV
LPPVDSVTGYQVTPVVGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH
PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVKP
>Z3443 yejH, putative ATP-dependent helicase
MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG
RVLVLAHVKELVAQNHAKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV
ARNLDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT
ATPFRLGKGWIYQFHYHGMVRGDEKALFRDCIYELPLRYMIKHGYLTPPE
RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAA
TRKGVMIFAATVEHAKEIVGLLPAEDAALITGDTPGAERDVLIEDFKAQR
FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLAPGKTD
CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD
GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE
CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED
GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL
RHPDFVVARMKGQYWQVREKVFDYEGRFRRAHELRG
>Z3509 yfaO, hypothetical protein
MRQRTIVCPLIQNDGAYLLCKMADDRGVFPGQWALSGGGVEPGERIEEAL
RREIREELGEQLLLTEITPWTFSDDIRTKTYADGRKEEIYMIYLIFDCVS
ANREVKINEEFQDYAWVKPEDLAHYDLNVATRKTLRLKGLL
>Z3723 yffH, hypothetical protein
MTQQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATI
LLYNAKKKTVVLIRQFRVATWVNGNESGQLIETCAGLLDNDEPEVCIRKE
AIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDNQRANAGGGVED
EDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQMSHLID
>Z3895 yfiL, hypothetical protein
MKTIRYALKKEKEMMKKFIAPLLALLVSGCQIDPYTHAPTLTSTDWYDVG
MEDAISGSAIKDDDAFSDSQADRGLYLKGYAEGQKKTCQTDFTYARGLSG
KSFPASCNNVENASQLHEVWQKGADENASAIRLN
>Z4147 ygdP, dinucleoside polyphosphate hydrolase
MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAM
YRELFEEVGLSRKDVRILASTRNWLRYKLPKRLVRWDTKPVCIGQKQKWF
LLQLVSGDAEINMQTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKE
FASVVMSLQENTPKPQNASAYRRKRG
>Z4421 ygjF, hypothetical protein
MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDHQ
LKPQEAQHLLDYRCGVTKLVDRPTVQANEISKQELHAGGRKLIEKIEDYQ
PQALAILGKQAYEQGFSQRGAQWGKQTLSIGSTQIWVLPNPSGLSRVSLE
KLVEAYRELDQALVVRGR
>Z4516 yhbQ, hypothetical protein
MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAF
SAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAKLLSSLQTPEIKSD
>Z4622 yhdJ, putative methyltransferase
MTMRTGCEPTRFGNEAKTIIHGDALAELKKLPTESVDLIFADPPYNIGKN
FDGLIEAWKEDLFIDWLLEVIAECHRVLKKQGSMYIMNSTENMPFIDLQC
RKLFTIKSRIVWSYDSSGVQAKKHYGSMYEPILMMVKDAKNYTFNGDAIL
VEAKTGSQRALIDYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQ
KPEALLKRIILASSNPGDIVLDPFAGSFTTGAVAIASGRKFIGIEINSEY
IKMGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLITK
>Z4839 yhhF, hypothetical protein
MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPV
IVDAQCLDCFAGSGALGLEALSRYAAGATLIEMDRAVSQQLIKNLATLKA
GNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLAD
ETLIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQREAQGESDAD
>Z5073 yicF, DNA ligase
MKMKVWMAILISILCWQSSVWAVCPAWSPARAQEEIFRLQQQIKQWDDDY
WKEGKSEVEDGVYDQLSARLTQWQRCFVSEPRDVMMPPLNGAVMHPVAHT
GVRKMADKNALSLWMRERSDLWVQPKVDGVAVTLVYRDGKLNKAISRGNG
LKGEDWTQKVSLISAVPQTVSGPLANSTLQGEIFLQREGHIQQQMGGINA
RAKVAGLMMRQDDSDTLNSLGVFVWAWPDGPQLMTDRLKELATAGFTLTQ
RYTRAVKNADEVARVRNEWWKAKLPFVTDGVVVRXAKEPESRHWLPGQAE
WLVAWKYQPVAQVAKVKAIQFAVGKSGKISVVASLAPVMLDDKKVQRVNI
GSVRRWQEWDIAPGDQILVSLAGQGIPRIDDVVWRGAERTKPTPPENRFN
SLTCYFASDVCQEQFISRLVWLGSKQVLGLDGIGEAGWRALHQTHRFEHI
FSWLLLTPEQLQNTPGIAKSKSAQLWHRFNLARKQPFTRWVMAMGIPLTR
AALNASDERSWSQLLFSTEQFWQQLPGTGSGRARQVIEWKENAQIKKLGS
WLAAQQITGFEP
>Z5980 yjjV, hypothetical protein
MQALAENYQPLYAALGLHPGMLEKHSDVSLDQLQQALERRPAKVVAVGES
GLDLFGDDPQFERQQWFLDEQLKLAKRYDLPVILHSRRTHDKLAMHLKRH
DLSRTGVVHGFSGSLQQAERFVQLGYKIGVGGTITYPRASKTRVVIAKLP
LASLLLETDAPDMPLNGFQGQPNRPEQAARVFDVLCELRPEPEDEIAEVL
LNNTYRCLTFVGSLPXVGSIRQSAHNA
>Z4294 yqgF, Holliday junction resolvase-like protein
MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNLIERLLK
EWQPDEIIVGLPLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLST
VEARSGLFEQGGYRALNKGKIDSASAVIILESYFEQGY
>Z4391 yqiE, hypothetical protein
MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVR
REIFERGHAAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIE
EGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDA
TTASGIHGLADENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLH
HQALKNEWA
>Z4507 yraN, hypothetical protein
MATVPTRSGSPRQLTTKQTGDAWEVQARRWLEGKGLRFVAANVNERGGEI
DLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHN
GSFDTVDCRFDVVAFTGNEVEWIKDTFNDHS
>Z4654 yrdD, putative DNA topoisomerase
MRNNESCPKCGAELVIRSGKHGPFLGCSQYPACDYVRPLKSSADGHIVKV
LEGQVCPACGANLVLRQGRFGMFIGCINYPECEHTELIDKPDETAITCPQ
CRTGHLVQRRSRYGKTFHSCDRYPECQFAINFKPIAGECPECHYPLLIEK
KTAQGVKHFCASKQCGKPVSAE
>Z4751 yrfE, hypothetical protein
MSKSLQKPTILNVETVARSRLFTVESVDLEFSNGVRRVYERMRPTNREAV
MIVPIVDDHLILIREYAVGTESYELGFSKGLIDPGESVFEAANRELKEEV
GFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRW
PLAHMMDLLEDPDFNEARNVSALFLVREWLKGQGRV