Gene list
Applied filters:
Gene type: CDS
Gene type: CDS
Gene type: CDS
Genomic element: pCP301
Number of genes found: 261
Hide UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Shigella flexneri 2a str. 301, 301 >CP0113 ISSfl4 ORF2 MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >CP0068 putative protein encoded within IS MLPSETMIWQPEFTDKTLSRKPGAVHAVRQQRSKALLTSLNEWMVEKNGT LSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRTVCLGKKN YMFFGSDHGGDRGALLYGLIGSCRLNGIDPEAYLRHILSVLPEWPSNRVD ELLPWNVVLTDK >CP0200 hypothetical protein MRGESMYGTCETLCRALAAKYSGDTPLMLVIWSPEEIQALADGMDISLSD HEIRTVLAHLEDIPED >CP0057 IS600 ORF2 MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELADNGIIVG >CP0014 hypothetical protein MKVSFKSLGYIFHDIYNKKHTIDEFNDVVKKAVLSGKINELNACHKVAIF LAEKDNEITKKDKAKIIDTLTENYSIEFQQLMNISERTLNSSLYITPGES GFVSFVNREGKICHTAYVKSSDNSMTYYHANGSSIDKYITDMCGLICMRH IESTGIIFYMLDEKVLSAIAEFMNEKGWRAAFCSAKNLYKCV >CP0177 putative reverse transcriptase MPQGGVISPLLSNIILNEFDQYLNKRYLSGKARKDRWYWNHSIQRGRSTA VKENWQWKPAVAYCCYADDCVPRRRVLGT >CP0162 hypothetical protein MSGRVKTRCTQSQSGRRFSCVAIHRSVAFFPQDGQARLPHELVTYLTWGH SGLSQMYMLHAQYPRAAGQHFCDSLNLDIAQTARIQEGCPALVGREQTFQ RAGSKSGQHEDGLTPGIL >CP0239 IS630 ORF MVVSAIASTPQLHRGDRVSDVARTLCCARSSVGRWINWFTQSGVEGLKSL PAGRARRWPFEHICTLLRELVKHSPGDFGYQRSRWSTELLAIKINEITGC QLNAGTVRRWLPSVGIVWRRAAPTLRIRDPHKDEKMAAIHKALDECSAEH PVFYEDEVDIHLHPKIGADWQLRGQQKRVVTPGQNEKYYLAGALHSGTGK VSYVGGNSKSSALFISLLKRLKATYRRAKTITLIVDNYIIHKSRETQSWL KENPKFRGIYQPVYSPWVNHVERLWQALHDTITRNHQCRSMWQLLKKVRH FMETVSPFPGGKHGLAKV >CP0019 ISSfl4 ORF2 MISFPAGSRIWLVAGITDMRNGFNGLVSKVQNVLKDDPFSGHLFIFRGRR GDQIKVLWADSDGLCLFTRRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI DWKHPKRTERAGIRI >CP0272 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAIFSHENVINSVNQFKKYT LYLRK >CP0267 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFARNPRPIQQ CFKFTLLAFSHLFLGNTGECHDGALSE >CP0213 ISSfl4 ORF3 MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >CP0268 IS911 ORF1 MICSPQNNTGAPMKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLST MTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKAT ALLMSDSLNSSR >CP0188 putative reverse transcriptase MKDRNGSGAKGLPHCADGAAATTGDNADGRTAVKSAKPFPVSKRQVWEAY KRVKANRGAAGIDGQTLAGFDENVTDNLYKLWNRMASGSYMPQAVRRVDI PKADGGVRPLGIPAVSDRIAQMVVKQILEPVLEPLFHADSYGYRPGKSAH QAIAQARKRCWKFDWVVEVDIKGFFDDIDHDLLLKTVQHHTQARWVVMYI ERRLKAPVQMPDGAMLARGRGTPQGGVISPLLSNLFLHYAFDMWMQRQFP GVPFERYADDVVCHSRI >CP0169 hypothetical protein MAFILSSLILLFSASAFPFDTPWRMAFRIPYNLFPIVLATFFIMGTSLLA AAISASRCVFPRSMVCSRYPLPVLSAPSTGSSASSVHAAISAFPAWIRSS IYRCAATGSEFPLPDDDFQQEDADVHSEPPRESWRLNFLRKR >CP0002 putative resolvase MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR SERRLVKTLLADFQRVHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI STVL >CP0074 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAMVKLGCIRLFYQRLRN >CP0206 ISSfl4 ORF1 MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINNNLLVKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >CP0027 IS100 ORF1 MISLNCWHKSVDHIMLCLSRFLGIPQPFRAQTKGKVERMVQYTRNSFYIP LMTRLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQ SMLALPPEKKEYDVHPGENLVSFDNPPQHHPLSIYDSFCRGVA >CP0050 putative IS1 encoded protein MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT KSISWQHYRIEDRWSYSSLK >CP0067 IS629 ORF2 MLREGIRVARCTVARLMVVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HIPPAEAEKAYYASIGNDDLAA >CP0075 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSVGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLHPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSYVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRGIYQPVYSPWVNHVERLW QALHDTITRNHQCRSMWQLLKKVRHFMETVSPFPGGKHGLAKV >CP0262 IS1294 transposase MLSAFTPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGSQMRFTGLKRGYRLAEQVLMHEPLARMRWCG >CP0186 hypothetical protein MNQKVKSVGSDNVIDDHHVFFADSRCDFVKVVSAYVCDMGMQLLYFVFLL LPVVAEFNLAA >CP0209 hypothetical protein MINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQ KMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLE QIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGF NDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHY ANSSSWKSKRLC >CP0158 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0018 putative protein encoded within IS MEQSLQPGACVAQIARENGINDNLLFNWRHQYRKGGLLPSGKNMPALLPV TLTPEPDNKIPAPAQEPEQINTPSDSLCCELVLPAGTLRLKGKLTPALLQ ILIREIKGSSH >CP0217 hypothetical protein MQMKNNTAQATKVITAHVPLPMADKVDQMAARLERSRGWVIKQALSAWLA QEEERNRLTLEALDDVTSGQVIDHQAVQSWADSLSTDHLLPVPR >CP0077 iso-IS1 ORF2 MLAYTCGPRNDETCRELLALLTPFCIGMVTSDDWGSYAREVPEEKHLTGK IFTQRMNVTT >CP0116 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0203 putative transposase MRFVQPRTETQQAIRALHRVRESLIRDKVKTTNQIHGFLLEFGISLPTGD AVIKRLSLVLAEHEIPEYLSRLLVRLHTHYLYLVEQIAELESELSQSINA DDTAQRIMTIPGVGPITASLLSSQLGDGKQFSCSRDFAASTGLVPRQYST GGKSTLLGISKRGDKNLRRLLVQCARSFMMQLERQHGKLAEWVREQLNKK HSNVVACALANKLARIA >CP0165 IS3 ORF2 MRSGWYTWCQRRTGISPRQQFRQHCDSVVLAAAFTRSKQRYGAPRLTDEL RAQGYHFNVKTVAASLRRQGLRAKASRKFTYRKLKNQTIPLSTPYAT >CP0219 hypothetical protein MLSGQIFCIPLNNLVGDKINYDEITKITARDWRQYRAPGWQITHQKRYCQ TLRVMTPTY >CP0055 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >CP0099 ISSfl1 ORF1 MQSRFFTILRSNRHNLCGDLQQGMVHKSDSDELSALRAENARIIKPLLPP EPATPRAGRPWAEHRKIINGMFWVLCSSAPWRDLPERYGAWKTVYNRFNR WSKSGVINIIFNRLLSLLDANGFIDWSATALDGSNIRALKCAAGAQKNIP ISTEIMGRVALAAVLAPKSIWQQTEVASR >CP0091 iso-IS10R ORF MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT KARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSD IREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL PSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI SNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG KL >CP0095 ISSfl4 ORF3 MDTSLAHENARLRALLQTQQDTIHQMAEYNRLLSQRMAAYASEINRLKAL VAKLQRMQFGKSSEKLRAKTERQIPFSRAIYATQALGCSDCSIKAILNS >CP0047 IS2 ORF2 MADNGSAYTAHETRQFARELNLEPCTTAVSSPQSNGIAERFMKTMKEDCI AFMPKPRTALHNLAVAIEHYNENHPHSALGYLSPREYRRQRVMST >CP0233 IS3 ORF2 MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG DITYLRTDEVRLHPVSTEPHAF >CP0176 putative IS91 ORF2 MACDYRYKNRQYHCLSGSYMARSAKPRKRKPAPQRSKLLRYVVKLHEDDF FDEEEAEVLRFDNFDDAVECCADLNIPFFVDAGNKKLVFWFVRVDDEGYP EIARCTEREFATIPAGINADGMYCPECGTVHWPDGVIPPF >CP0069 IS21 ORF2 MTLTELLWRESEKLRRYKKEARLPVAKTLSEYDFIQLPELNGAQFQQLCE TTDWVDAGENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQE LRKARAQLKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYE RGSLVITSNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKT AKAVTSVT >CP0012 hypothetical protein MLNWLSKLRAARIHLPNAVEKIAFDRFHVAKQPGEVVDKTRQNEHPHLPV ESRRQAKGTRFLWQHSDKWMTESRQEKLIWLRAQMKLTSLCWALKELAKD IWSRPWSEERRNDWQRWLRPTVTSP >CP0223 ISSfl4 ORF3 MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTD VLTRLPEWPEERLAELLPLEGFTFTG >CP0017 IS4 ORF MPDSFMHIGQALDLGSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLR KRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQA RQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPEN DAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQ LIGQTGDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRK LGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDA MRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVKQELWG VLLAYNLVRYQMIKMAEHLKGYCPNQLSFSESCGMVMRMLMTLQGASPGR IPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >CP0101 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRILGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQVLIALARRRCDVLFAMMRDGTFYTPQGS >CP0108 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0189 IS600 ORF2 MVLRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNA PMESFWGTLKNGTGTE >CP0191 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAIVKLGCIRLFYQRLRN >CP0179 IS629 ORF2 MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHHDKRSARAQRDDWL KKEILRVYDENHQVYAVRKVWHQLLREGIRVARCTVARLMAVMGLAGVLR GKKVHTTVSRKAVAAGDRVNRHQGNVPRTPGGPQRLVYVVSAADKDKHTS AVPSALRQRCPQGFYPVQRYGAPRLTDELCALVTTLT >CP0231 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GLRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAK AEFDRLWKK >CP0007 IS600 ORF2 MVVSAIASTPHLVYIRTRETYGTRRLQTELADNGIIVGRDRLAGLRKELR LHCKQKRKFRATTNSDHNLPVTPNLLNQNFTPTAPNQVWVADSVVQAFRN QPTEGAGRETAAYAVR >CP0201 hypothetical protein MEIISNVRENRQVTVPAELLETLTQIAEQALWKREWAARDHGFPLPEYVT RRQAMVDQARSLLKNNTHEND >CP0204 ISSfl4 ORF3 MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPESPGRESSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA GCLAHARRKTHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >CP0100 hypothetical protein MSWLISQCAHQCTDNKKTETDAIYDKVRSSYLLSCILKKNKNVGLILHAP SFVSVSEKIARIVMANYSRNWSNSELASAVLMSESSLKRRMYKEVGSIST FVHKIKLTEAIRKLRRTNTPISVISSELGYSSPSYFSKVFFKYLKTYPQN IRKKNGR >CP0034 hypothetical protein MHQPVKQLIARKFGLSCGFAMSIDTEKIKVRFSQINAGDFLFRHGIPLTF VVKVYPSWRN >CP0184 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLVYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HIPPAEAEKAYYASIGNDDLAA >CP0120 ISSfl2 ORF MTESSDYESVLVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARNAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >CP0072 IS1294 transposase MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR LNQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >CP0193 hypothetical protein MTMATINARIDDDIKNQADEVLKLMNISQTQAIAAFYQYITEQKKLPFVI TSIVKTPHDLLRESTDMLAEALAVISNLQVWTEQQDGIGKAKLMEYYRRL DALYCCAKEKIGLLSDNRDAELGCVP >CP0106 insertion sequence 2 OrfA protein MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >CP0073 ISSfl1 ORF1 MFWVLCSSAPWRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDA NGFIDWSATALDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSI WQQTEVASR >CP0045 IS100 ORF2 MQYISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQAMYTRMAAF PAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHL AIAMGYEAFKIFYDISKISLELYHNIH >CP0030 iso-IS1 ORF2 MGRWWRYKWITFHPSLTQHWLWYAYNTKTGGVLAYTFGPRNDETCRELLA LLTPFCIGMVTSDDWGSYAREVPKEKHLTGKIFTQRIARNNRTLRTRIKR LARKTICFSRSVEIHEKVIGSFIEKHMFY >CP0207 IS2 ORF2 MDGPRSSHTDDTDVLLGIHHVIGELPTYGYRRVWALLRRQAELDGMPAIN AKRVYRLMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCC DNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDL PSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVK TVSAPSATSCENCPVWQQSRPSILMDTNDEFPDNKRYSLLPFLFA >CP0087 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HTPPAEAEKLIMLPSETMIWQPEFTDKTLSRKPGAVQSASPVPPDRARHS GPRITCVRRQKKP >CP0107 IS600 ORF2 MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR RHSRLGNISPAAFREKYHQMAA >CP0197 hypothetical protein MKYDGDGRATARFFSDKGCRRAPLFTAPADAARHKRCLWSVSRVRRARDG RFYRSRLVSVTVYASPSPFSDERPSSRFRGITLLSKRRRLRYSTVGLTRY RKR >CP0084 iso-IS10R ORF MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT KARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSD IREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL PSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI SSLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG KL >CP0172 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLALHTAA >CP0006 putative protein encoded within IS MAETLGEQYDPVLPSSLRQSSARKPLPASLPRAPRVIRPEEECCPACGGE LSPLGCDVSEQLELISSAFKVIEKQRPKLACRRCDHIVQAPVPSKPIARS YAGAGLLAHVVTGKYADHLPLYRQSDLLFHTAI >CP0254 IS600 ORF2 MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELAENGIIVGRDRLACLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR RHSRLGNISPAAFREKYHQMAA >CP0020 ISSfl4 ORF3 MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL VAKLQRMQFGKSSEKLRAKTERQILEAQERISALQEEMAETLGEQYDPVL PSPLRQSSARKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVSEQLE LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS PDRKGIHPQNHLAGYSGVLQADAYGGYRALYESGRITEAACMAHVRRKIH DVHARVPTDITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAAPLMQS LYDWIQQQMKIHSLKMECLHGEHYYPSGNSAGNSV >CP0040 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >CP0270 IS3 ORF2 MECLHGEHFIYREIVRATVFNYIECNYNRWRRHRWCGGLSPEQFENQNLA >CP0056 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGCIVGWRVSSSMGLTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLQERLG HIPPAEAEKAYYASIGNDDLAA >CP0071 IS1294 transposase MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD TAALTRLQDTGG >CP0255 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0121 IS100 ORF1 MVTFETVMEIKILHKQGMSSRTIARELGLSRNTVKRYLQAKSEPPKYTPR PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC >CP0013 putative IS1 ORF MPEPVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSD DMHYKIIGWYLTINHHH >CP0214 IS100 ORF2 MLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRS LSFIERNENIVLQGTSDITNPRVGICV >CP0011 hypothetical protein MDDRIQAGKADMAACADEADEPVLGAERTGKGYLEQTMERGKTQRLAEMA AANSDVPMMKNVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKA KGYRNRERFKLGVMFHYGKLNMAF >CP0269 IS911 ORF2 MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >CP0076 iso-IS1 ORF1 MRFSMTTVTVHCPRCNSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEA RKLGVKEQIVEMAHNGAGGRDTARTLKIGINTVIRTLKSSRPGG >CP0109 hypothetical protein MGCVTAPEPLSSFHQVAEFVSSEAVLDDWLKQKELKNQAIGATRTFVVCR KGTQQIVGFYSLATGSVNHTEATGNLRRNMPDPIPVIILARLAVDVSFRG KGLGADLLHDAVRRCYRVAENIGVRAIMVHALTENAKQFYIHHGFKPSKT QVQTLFLKLPQ >CP0247 oriT nicking and unwinding protein MSKGYTFMMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGQGAEQLGL QGSVDKDVFTRLLEGRLPDGADLSRMQDGSNRHRPGYDLTFSAPKSISMM AMLGGDKRLIEAHNQAVDFAVRQVEASAST >CP0170 hypothetical protein MDDRIQAGKADMAACTDEADEPVLGAERTGKGYLEQTMERGKTQRLAEMA AANSDVPMMKNVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKA KGYRNRERFKLGVMFHYGKLNIAF >CP0248 oriT nicking and unwinding protein MLTGNLVMALFNHDTSRDQEPQLHTHAVVTNVTQYNGEWKTLSSDKVGKT GFIENVYANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEA FSGRSQTIREAVGEDASLKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKE TGFDIRAYRDAAEQRAYTRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQ FMYTDLLARTVGILPPENGVIERARAGIDEAISREQLIPLDREKGLFTFG IHMLDELSVRALSRDIMKQNRVTVHLEKSVPRTAGYSDAVSVLAQDRPSL AIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDERLS GELITGRRQLLEGMAFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVL ITDSGQRTGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVGWPEI LRPA >CP0026 IS100 ORF1 MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM LYIEFTDNMRYDTLETCHRNAFRFFGGVSREVLYDNMKTVVLQRDAYQTG QHRFHPSLWQFGKEMGFSPRLCRPFRLRDPHKITANKPAPYFGRFWTDGL ELCSVLFVLK >CP0264 putative transposase MDEKKLKALAAELAKGLKTEADLNQFSRMQTKLTVETVLNAELTDHLGHE KSYIR >CP0042 IS1294 transposase MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD TAALTRLQDTGG >CP0021 putative transposase MDAENETVLNANMTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRT LRDRNGTFEPQQLKKNQP >CP0039 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNHQF VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HTPPAEAEKAYYASIGNDDLAA >CP0092 IS1294 transposase MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI LGVKEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC DWVHLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ RLLKAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRN TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQPDE SPNDFYQNH >CP0060a hypothetical protein MSHNLEHQKVHTRMVKEVLKAVARANNHPYKSVFADFITGHPSCTVCFWE TFHKMYPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW >CP0171 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >CP0168 hypothetical protein MNDNSLLRNSSLFIAYMGCVGWVSAYSYGWGTSFYYGFPWWVVGAGLDDV ARSLLYAIIVMGILFTGWGIGILFFLLIKKRSKIQDLSFFRLFFAITLLF FPVIFELLILKQYFILPLSLSFIISSLVISIIIRIYGRIFSVSCFSDIPF VREHRIKLIMAGFLVYFWFFSFLVGWYKPQLKKEYQMLCYNNSWYYVLAR YDSRLVLSSSFKDDSNRFLIFNTEQSGFYEINDVYVRK >CP0157 IS600 ORF2 MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELADNGIIVGRDRLARLRKELRLHCKQKRKFRATTSSDHNLPVTPNLL NQNFTPTAPNQVWVADITYVATREGWLYLAGVKDVYTCEIVGYAMGERMT KELTGKALFMALRSQHPPAGLIHHSARGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISSAYGKTD >CP0242 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEDASRLFLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0037 hypothetical protein MGSPECVPCHSTSGSVFPALVAETMFPNIAGSRSCIDCHHVAGRTFSKSA RRIRGAPLLRRTSRYDAQTSAFGISNDFRCISYVIPFWVVSSAMNVIPIP SLQTHYRSFITTTNGSAPVSRYVPETGSHVP >CP0098 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAMVKLGCIRLFYQRLRN >CP0064 putative IS1 encoded protein MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSYASTLKSPSYT KSVSWQHYPLFVERP >CP0215 hypothetical protein MNAHWSSKKSNFFRKNIKLLTKYLFFESQGIPDKVDIVSRLKTYGYSISG VETDDGYKALVRAFQLHFRQKNYDGIMDAETAAILYALLEKYFPGK >CP0038 hypothetical protein MHQLRHPLLGCVFCNECDPHPFAPDPLQIFHHYYEWVRPCFPIRAGNRFS RSLRWSGPGSCCLHTGCRAVSKQVSSALIRGRLYGPILASSKISMLHRTV YFRSASRTHVTVDLCLFPESLTTSFLRMKQHRVVWSVLL >CP0205 ISSfl4 ORF2 MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >CP0174 putative transposase MLNKRAFFGAFLIFWGFKFLSMDMNAGYIRAARIHLPNAVEKIAFDRFHV AKQPGEVVDKTRQNEPPRFSWRVFYL >CP0028 ISSfl3 ORF MCQQFNEITAMPVHKVCQNFFRDALAPFHQYRQNALMDATMALINGASLT QTSIGRFLPGNAQVKNKIKRIDRLMGNEALHRDIPMIFRNITSMLTRQLS LCVIAVDWSGYPSQEHHVLRASLLCDGRSIPLLSKVVPSEKQNNPLIQHD FLDSLAQSLPPDARVIIVTDAGFQSAWFHHITSLGWDFIGRIRNNVQYCL DNAPERWLKVSDSPECKTPEYMGAGRLVKERKKSIRGHFYTYKKSAKGRK KKRSKGQSGLNKTDKEQSKSAKEAWLIFSSTNDFRAREIIKLYSRRMQIE QNFRDEKNGRFGFGLRASKSRSTGRILVLSLLATLSTIVMWLLGYHAENK GLHLKYQANSIKSRRVISYLTLAKNVLRHSPLILRRTVLSTVLNHLSRTY RNMVLVY >CP0015 IS91 ORF MLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQDEIGLRYNSHRTKRE ENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEAAITGRCGVRHNGDSE KNGEANHKERDVSAVTEG >CP0090 IS1294 transposase MSGRHIPQQTDIPRVFLQPLHIQSRDGPAVYHPAATQHAFERVTTQELFH HLCIAHFRHWFRFIHPQSTVHLRQLLSTHPVGKEPEVPHHLKKLLRDVLF QPRDQLTLCPERSPHNFPKT >CP0043 IS1294 transposase MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >CP0088 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >CP0102 hypothetical protein MKKQIFINNKPPVVPYSGTHAKIFKYIEIPLPFFYFIYTSGEPFHISVQN TVIYVSKYNGIFINKLVPFSLLFDRDISVLQRRDICVVRFTSEEISEHNV LFDHDIERLKKISKAQLISPDYVLIDFSSGGPIKLSSML >CP0221 putative IS1 encoded protein MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT KSVSWQHYPAPVSPAAHFLPNGCGAPRQYNGGYGRCCRSCTGPAGETATP >CP0234 hypothetical protein MFSKAFLRKISMFFARRKPAAMKICLYHTLNPDTIPGYKKFAQAIATDNF VQADVRKIDTNLYRARLSIRDRLLFSLYRYHGETICLVLEYIRNHAYNTS RFLRRNVVIDEGRLQQQPVPDPVDIATEALTYINPSHGRFHRLDKMLSFD DDQQALYEHPLPLVIVGSAGSGKTALVLEKMKQAAGDILYLSLSSFLVEK ARTLYDASGEGSEVQNIDFLSLTEFLETLRIPEGREVTFSAFSDWLPRNR AIAALGAAHTLYEEFRGVIGAVASGNGPLSREAYLSLGIRQSLYGMEDRP TVYVLFERYIAWLKQSHQYDSNLLSHQYLSLATPRYDVIFVDEVQDMTPV QLQLVLKTLRHPGQFLLCGDANQIVHPSFFSWSSLKSLFFRQQQGNDTTV NILQANYRNGHHVTALANRLLRLKQVRFSAIDRESHHFVRSCGQAEGTIR LLDDREETKQELNAKTSLSNRVAVIVMHPEQKAQARCWFSTPLVFSVQEV KGLEYETVILYNIVSAARQAFDDICEGLTPADLEGEARYSRPRDRQDRSA EIYKFFTNALYVALTRAKHNVYLVEQQVEHPLWSLLALTHQEEPLNLQEE ISSRDEWQKTAHLLEKQGKQEQADTIRSRILQTSEMPWQIITAEDARQWK QHILAGTADKTIQLQALEYSLIYSLFPLYNALYREDFKPTRQPRTKTLQL LELKYFRPYSMNNPVAVLRDIERYGVDHRSPFNLTPLMSAARAGNIALVQ LLLERGADPLLTGNDGLAAYHQVLSAAVSTPRYAQQKSAQLYTLLKPESL SLQVEGRLIKLDNRQMAMFLVILMQALFHTHLGSALFFSEAFSAARLAEC VVHLPEALLPERRKRRSYISSQLSQHEVNSKNPYGKKLFLRLNHGQYILN PGLKIRKGDVWRAVYELQSPEDLGHDLQTYLQDMSPELVDMLGGKKGFYE RSEKSVGYWVGGIRRAAQKA >CP0175 IS91 ORF MLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQDEIGLRYNSHRTKRE ENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEAAITGRCGVRHNGDSE KNGEANHKERDVSAVTEG >CP0044 IS150 ORF1(ORF A) MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ VRFLETRLVYLKKLKALAHPTKK >CP0178 IS3 ORF2 MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF YASGPNQKWAGDITYYYSSPTAGKHGAPGY >CP0025 IS3 ORF2 MSKLILPSNTVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEG WLYLAVVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHS DRGSQYCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHG EHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA >CP0218 hypothetical protein MELKWTSKALSDLARLYDFLVLASKPAAARTVQSLTQAPVILLTHPRMGE QLFQFEPREVRRIFTGEYEIRYELTGQTIYVLRLWHTRENR >CP0249 oriT nicking and unwinding protein MKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLD SRSRYLRDMYRPGMVMEQWNPETRSHDRYVTERVTAQSHSLTLRNAQGET QVVRISSLDSSWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASV SEDAMTVVVPGRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFA SVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQI KARAGETLLETAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTE AKSFAAEGTGFADLGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAE KSILRHILEGKEAVTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVV QGYAGVGKTTQFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDA QTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAAG GGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAV YSLINRDVERALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEA QQKAMLKGEAFPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVL NSMIHDAREKAGELGKVQVMVPVLNTANIRDGELRRLSTWENNPDALALV DNVYHRIAGISKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVG TGDRIRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERA EQHIDLAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAYVALSRMK QHVQVYTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAERLFSTARE LRDVAAGRAVLRQAGLAGGDSPARFIAPGRKYPQPYVALPAFDRNGKSAG IWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSRNGESLLADNMQD GVRIARDNPDSGVVVRIAGEGRPWNPGAITGGRVWGDIPDNSVQPGAGNG EPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVRE IAGQERDRAAITEREAALPESVLREPQRVREAVREVARENLLQERLQQME RDMVRDLQKEKTPGGD >CP0160 IS1294 transposase MVVSAIASTPQAMCAVTDRAEPRQPDGCKLKTAPVRPGVAARSAEYVCSE DLFY >CP0080 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLITAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >CP0117 IS600 ORF2 MKRCVGYLVYIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCK QKRKFRATTNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLY LAGVKDVYTCEIVGYAMGERMMKELTGKALFMALRSQRLPAGLIHHTDRG SQYCAYDYRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRF KSRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >CP0212 ISSfl4 ORF2 MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFAKRLERGRFVWPVTREGKVHLTPAQLSMLLEGI AWPHPKRTERPGIRI >CP0097 hypothetical protein MTLPVFITVIADHDKPQPSGCLLESQGSLCPICRQRITHETGWNVHHKVK KVMGAVKNYLTLSCYIQIAIDSYTVVKPALSKRAYKGLSGVPGNRYAPFL GEGSPAMNCPYPTNIQNERNVLESAYNPL >CP0023 hypothetical protein MLQRQRGKVGFAQLPVDFVAIEPDSVQGVGKRANLTNRCFIIRINDSFKK RQGFIEFISNSGSGHTVTVYTKRRFQRGVFMNSLNTNVVKPVMYRLRSSA DARP >CP0008 IS600 ORF2 MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR RHSRLGNISPAAFREKYHQMAA >CP0111 hypothetical protein MTLTTTALNGSSSRRFEGCVWTPPLISHTALHPTSRHWMHSWHTYAGNAE IPGL >CP0085 IS100 ORF2 MLHEEKLARHQRKQAMYTRMAAFPAVKMFEEYDFTFATGAPQKQLQSLRS LSSERSPHNFPKT >CP0210 iso-IS1 ORF1 MATVTVHCPRCHSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEARKPG VKEQIVEMAHNGAGGRDTARTLKIGINTVIRTLKSSRPGG >CP0096 putative IS91 ORF2 MVCNYRYKNRQCHCLSGGYMARSAKSRKRKPASQRSKLPRYVVKLHEDDF FDEEDAEVLRFD >CP0105 insertion sequence 2 OrfB protein MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNA LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL DCCDREALHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE I >CP0060 hypothetical protein MEVFMSTAASVRKTPREHQINIRATDEERAVIDYAASLVNKNRTDFIMEL AYQEAKNIILDQRLFVLDNERYDSFITQLEAPVQNAEGRERLMAVKPEWK >CP0180 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW VRQHERDTGGGEVGSPPLNVSV >CP0232 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRMTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HIPPAEAEKAYYASIGNDDLAA >CP0089 transposase/IS protein MDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQ LQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRVGIKVRFTTAA DLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQV IAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKG ESYRLRQKRKAGVIAEANPE >CP0263 IS1294 transposase MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD TAALTRLQDTGG >CP0110 hypothetical protein MIYGGFMKSGVQLNLRARESQRILIDAAAEILHKSRTDFILEMACKAAED VILDRRVFNFNDRQYEEFIEMLDAPVADDPAIEKLLARKPQWDV >CP0041 IS150 ORF B MRQQQDEQGRFSICSRQAAVVQRLMGILSLKAAIKVKRYRSYRGEVGQTA PYVLQRDFKATRPNEKWVTDFTEFAVNGRKLYLSPVIDLFNNEVISYSLS ERPVMNMVENMLDQAFKKLNEPPRESWRLNFLRKR >CP0033 putative transposase MQQPKMTVAMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYN DAQAIAEACQHGTIRPVPIKTLEQQDVQTFLNMRRLVSMERTQLINHIRG LLAEYGIVFSKGAAELRQK >CP0164 IS1294 transposase MVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQMVKQFLSRDPF ECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >CP0208 insertion sequence 2 OrfA protein MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >CP0161 IS1294 transposase MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD TAALTRLQDTGG >CP0066 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >CP0119 putative transposase MAEWLGEIQKRVITVCDREADIWHYLYYKVSHGQRGACCTESPAGRGTRQ ALRTAGSPGNRRKPHAECDAKRRAGSPSGPDVHQLQRSQHKKSRQQRPGA PAHVCLLPGAGRGRCLLASADVRKSGECRRCTTYCQPLRATLADRGIPQG VEKWWYMESLRMQTRDNLERMVVILAFIAVRVLGLRQGGVSEETQNDSCE KILTPTEWKLLWVKLEGKPLPVQAPTLKWAWGDGMTANAQVVPVGASCGM AGQTSGYG >CP0016 putative IS91 ORF2 MDAGNKKLVFWFVRVDDEGYPEIARCMEREFATIPAGINADGMYCPECGT VHWPDGVIPPF >CP0001 IS2 ORF2 MVHATELMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKTIKCDYISI MPKPDGLTAAKNLAEAFEHYNEWHPHSALDYRSPREYLRQRANDNRCLEI >CP0187 hypothetical protein MPGASNAAMCKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKF WSRNFSYRLWSAMQSRLLKWMQSKYRLSNRKAQRKLALVRKQYPKLFAHW YLLRASNE >CP0266 hypothetical protein MSVKLRLPQLSSGEYLPGSLQDKILSDDCLEKEQMVVSAIASTPQASYHI >CP0216 hypothetical protein MEPVYVILNALLDSGRFTRKLILLGLSGTFSYIFGSIVATLGMGLVVDYL GWGATFIVLILSAVFAIIFTLMSRERSLEFEKE >CP0081 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HTPPAEAEKLIMLPSETMIWQPEFTDKTLSRKPGAVQ >CP0058 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0083 hypothetical protein MNAHWSSKKSNFLRKNIKLLTKYLFFESQGIPDKVDIVSRLKTYGYSISG VETDDGYKALVRAFQLHFRQKNYDGIMDAETAAILYALLEKYFPGK >CP0183 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >CP0261 hypothetical protein MAENGYGLAGLGMGKVKSVNQYRLTPGFGGFTPVSHVTTACRLPCRWRGI RIIQAAFNAFAKV >CP0062 ISSfl1 ORF2 MQREKTPEWREKQKSSRGIRRGQGYRLVFQFPIRERCFGRLKEYRRIATR YDKTARNYLAMVKLGCIRLFYQRLRN >CP0167 hypothetical protein MNTALIVALMCMWYAVPAAAKETLLAMPRNSTEHCYAEINVHGPYGVYFR VVPHPPGGKSWVECNSDYYYSDKPPGVQILGTRAGCRVYGICGTTSTLHV AGRGVVCIKNICSPRGMIIHRIRKRPVVAVSDEM >CP0173 IS629 ORF2 MYRWPCTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKN RAEVELATLTWVDWYNNRRLQERLGHIPPAEAEKAYYASIGNDDLAA >CP0059 hypothetical protein MEINVTAPALLTDEHILQPFDCGNEVLSNWLRGRAMKNQMLNASRTFVIC LEDTLRIVGYYSLATGSVTHAELGRSLRHNMPNPVPVVLLGRLAVDVCTQ GHGFGKWLLSDAIHRVVNLADQVGIKAVMVHAIDDDARAFYERFGFVQSV VAPNTLFYKV >CP0086 putative protein encoded within IS MRATAEEALKRISELYAIEDEIRGLPESECLAVRQQRSKALLTSLHEWMV EKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRAVC LGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWP SNRVDDLLPWKVVLPSG >CP0114 ISSfl4 ORF1 MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINNNLLVKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >CP0061 IS1294 transposase MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI LGVKEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC DWVHLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ RLLKAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRN TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC YAQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM PA >CP0256 hypothetical protein MNINQFMVRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWI WRHRK >CP0166 IS3 ORF1 MTHMTKTVSTSKKTRKQNSPEFCSEALKLAERIGVAAAARELSLYESQLY AWRSKLQQQMTSSERESELPA >CP0112 ISSfl4 ORF3 MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLPADAYAGYNVLYETGRVKEA GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >CP0118 IS150 ORF B MVDCFDGKVVSWSLSTRPDAELVNTMLDSAVETLNAGERPVIHSDRGGHY RWPGWLERVNAAGLIRSMSRKGCSPDNAACEGFFGRLKTEMYYGRKWSGI TPEKFMQQVDAYIRWYNERRIKLSLGAVSPKMYRQQCGLE >CP0211 ISSfl4 ORF1 MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >CP0029 transposase/IS protein MMMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHE EKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFI ERNENIVLLGPSGVGKTHLAIAMGYEAVRVGIKVRFTTAADLLLQLSTAQ RQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAM ILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKGESYRLRQKRK AGVIAEANPE >CP0192 ISSfl1 ORF1 MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSSAP WRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR >CP0159 IS600 ORF2 MESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQRRHSRLGNISP AAFRIKYYQMTA >CP0222 putative protein encoded within IS MSDGYSVYKSLADNHPGITSACCWSHAGRGFANLYKASREPRAGVELRKI AGLYRIEKLIRERPVEKIRQWR >CP0124 acp, Acp, putative acyl carrier protein MIKEKILSIVAFCYGIAYSKLSEETKFIEDLSADSLSLIEMLDMISFEFN LRIDESTLEHIITIGDLISVVKNSTKSI >CP0224 ccdA, post-segregation antitoxin MKQRITVTIDSDSYQLLKSANVNISGLVNTAMQKEARRLRAERWQAENQQ GMAEIARFIEMNGSFADENRDW >CP0225 ccdB, post-segregation toxin MRTGTGEMQFKVYAYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLS DKVSRELYPVVHVGDESWRMMTTDMASVPIFVIGEEVADLSHRENDIKNA INLMFWGI >CP0251 finO, FinO, putative fertility inhibition protein MTEQKRPVLTLKRKTEGTAPVRSRKTIINVTTPPKWKVKKQKLAEKAARE AELAAKKAQARQALSIYLNLPTLDEAVNTLKPWWPGLFDGDTPRLLACGI RDVLLEDVAQRNIPLSHKKLRRALKAITRSESYLCAMKAGACRYDTEGYV TEHISQEEEAYAGARLAKIRRQNRIKAELQAVLDEK >CP0182 icsA/virG, IcsA (VirG), outermembrane protein exposed to the bacterial surface by a C-terminal autotransporter domain and involved in the movement of intracellul MNQIHKFFCNMTQCSQGGAGELPTVKEKTCKLSFSPFVVGASLLLGGPIA FATPLSGTQELHFSEDNYEKLLTPVDGLSPLGAGEDGMDAWYITSSNPSH ASRTKLRINSDIMISAGHGGAGDNNDGNSCGGNGGDSITGSDLSIINQGM ILGGSGGSGADHNGDGGEAVTGDNLFIINGEIISGGHGGDSYSDSDGGNG GDAVTGVNLPIINKGTISGGNGGNNYGEGDGGNGGDAITGSSLSVINKGT FAGGNGGAAYGYGYDGYGGNAITGDNLSVINNGAILGGNGGHWGDAINGS NMTIANSGYIISGKEDDGTQNVAGNAIHITGGNNSLILHEGSVITGDVQV NNSSILKIINNDYTGTTPTIEGDLCAGDCTTVSLSGNKFTVSGDVSFGEN SSLNLAGISSLEASGNMSFGNNVKVEAIINNWAQKDYKLLSADKGITGFS VSNISIINPLLTTGAIDYTKSYISDQNKLIYGLSWNDTDGDSHGEFNLKE NAELTVSTILADNLSHHNINSWDGKSLTKSGEGTLILAEKNTYSGFTNIN AGILKMGTVEAMTRTAGVIVNKGATLNFSGMNQTVNTLLNSGTVLINNIN APFLPDPVIVTGNMTLEKNGHVILNNSSSNVGQTYVQKGNWHGKGGILSL GAVLGNDNSKTDRLEIAGHASGITYVAVTNEGGSGDKTLEGVQIISTDSS DKNAFIQKGRIVAGSYDYRLKQGTVSGLNTNKWYLTSQMDNQESKQMSNQ ESTQMSSRRASSQLVSSLNLGEGSIHTWRPEAGSYIANLIAMNTMFSPSL YDRHGSTIVDPTTGQLSETTMWIRTVGGHNEHNLADRQLKTTANRMVYQI GGDILKTNFTDHDGLHVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGL YSSWFQDEKERTGLYMDAWLQYSWFNNTVKGDGLTGEKYSSKGITGALEA GYIYPTIRWTAHNNIDNALYLNPQVQITRHGVKANDYIEHNGTMVTSSGG NNIQAKLGLRTSLISQSCIDKETLRKFEPFLEVNWKWSSKQYGVIMNGMS NHQIGNRNVIELKTGVGGRLADNLSIWGNVSQQLGNNSYRDTQGILGVKY TF >CP0132 icsB, IcsB, invasion protein MILKISNFIDASNTKGPIRVEDTEHGPILIAQKFNLKDLFFRTLSTINAK INSQILNEQLKNYRLENQKSLLLFLNTLASEKSAESAFAAYEAAKNSIQH SFTGRDIKLMLNTAERFHGIGTAKNLERHLVFRCWGNRGITHLGHTSISI KNNLLQEPTHTYLSWYPGGNVTKDTEINYLFEKRSGYSVDTYKQDKLNMI SDQTAERLDAGQEVRNLLNSKQDQNNNKKIFFPRANQKKDPYGYWGVSAD KVYIPLSGDNKTKDGKISHNLFGLDETNMSKFICKKKADAFRQLANYKLI SKSENCAGMALNVLKAGNSEIYFPLPDVKLVATPNDVYAYANKVRQRIES LNQSYNEIMKYIESDFDLSRLTQLRRSYLKSFNKINLIHTPKTFKPLSIS LYKHPTENVSSEDFDAVINACHSYLVKSAPSNMTRVLNELKTEATDKKEE IIEKSIKIIDYYNSLKSPDLGTKLYIHDLLQINKLLLNNSHSNI >CP0271 icsP/sopA, IcsP (SopA), outermembrane protease of the OmpP family, involved in cleavage of surface exposed IcsA MKLKFFVLALCVPAIFTTHATTNYPLFIPDNISTDISLGSLSGKTKERVY HPKEGGRKISQLDWKYSNATIVRGGIDWKLIPKVSFGVSGWTTLGNQKAS MVDKDWNNSNTPQVWTDQSWHPNTHLRDANEFELNLKGWLLNNLDYRLGL IAGYQESRYSFNAMGGSYIYSENGGSRNKKGAHPSGERTIGYKQLFKIPY IGLTANYRHENFEFGAELKYSGWVLSSDTDKHYQTETIFKDEIKNQNYCS VAANIGYYVTPSAKFYIEGSRNYISNKKGDTSLYEQSTNISGTIKNSASI EYIGFLTSAGIKYIF >CP0049 insA, IS1 ORF1 MVRNGKSTAGHQRNLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC RASARIMGIGLNTVLRHLKNSGRSR >CP0220 insB, IS1 ORF2 MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >CP0051 insB, IS1 ORF2 MIVCAEMDEHWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERL LSLLSAFEVVVWMTDGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >CP0065 insB, IS1 ORF2 MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLNRPGNP GD >CP0125 ipaA, IpaA, secreted by the Mxi-Spa machinery, modulates entry of bacteria into epithelial cells MHNVNNTQAPTFLYKATSPSSTEYSELKSKISDIHSSQTSLKTPASVSEK ENFATSFNQKCLDFLFSSSGKEDVLRSIYSNSMNAYAKSEILEFSNVLYS LVHQNGLNFENEKGLQKIVAQYSELIIKDKLSQDSAFGPWSAKNKKLHQL RQNIEHRLALLAQQHTSGEALSLGQKLLNTEVSSFIKNNILAELKLSNET VSSLKLDDLVDAQAKLAFDSLRNQRKNTIDSKGFGIGKLSRDLNTVAVFP ELLRKVLNDILEDIKDSHPIQDGLPTPPEDMPDGGPTPGANEKTSQPVIH YHINNDNRTYDNRVFDNRVYDNSYHENPENDAQSPTSQTNDLLSRNGNSL LNPQRALVQKVTSVLPHSISDTVQTFANNSALEKAFNHTPDNSDGIGSDL LTTSSQERSANNSLSRGHRPLNIQNSSTTPPLHPEGVTSSNDNSSDTTKS SASLSHRVASQINKFNSNTDSKVLQTDFLSRNGDTYLTRETIFEASKKVT NSLSNLISLIGTKSGTQERELQEKSKDITKSTTEHRINNKLKVTDANIRN YVTETNADTIDKNHAIYEKAKEVSSALSKVLSKIDDTSAELLTDDISDLK NNNDITAENNNIYKAAKDVTTSLSKVLKNINKD >CP0128 ipaB, IpaB, secreted by the Mxi-Spa secretion machinery, required for entry into epithelial cells MHNVSTTTTGFPLAKILASTELGDNTIQAANDAANKLFSLTIADLTANQN INTTNAHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK IKDLENKINQIQTRLSELDPESPEKKKLSREEIQLTIKKDAAVKDRTLIE QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA EELNRVMGCVGKILGALLTIVSVVAAAFSGGASLALAAVGLALMVTDAIV QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN SATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ EVIADLLASMSNSQANRTDVAKAILQQTTA >CP0127 ipaC, IpaC, secreted by the Mxi-Spa secretion machinery, required for entry into epithelial cells MEIQNTKPTQILYTDISTKQTQSSSETQKSQNYQQIAAHIPLNVGKNPVL TTTLNDDQLLKLSEQVQHDSEIIARLTDKKMKDRSEMSHTLTPENTLDIS SLSSNAVSLIISVAVLLSALRTAETKLGSQLSLIAFDATKSAAENIVRQG LAALSSSITGAVTQVGITGIGAKKTHSGISDQKGALRKNLATAQSLEKEL AGSKLGLNKQIDTNITSPQTNSSTKFLGKNKLAPDNISLSTEHKTSLSSP DISLQDKIDTQRRTYELNTLSAQQKQNIGRATMETSAVAGNISTSGGRYA SALEEEEQLISQASSKQAEEASQVSKEASQATNQLIQKLLNIIDSINQSK NSTASQIAGNIRA >CP0126 ipaD, IpaD, secreted by the Mxi-Spa machinery, required for entry of bacteria into epithelial cells MNITTLTNSISTSSFSPNNTNGSSTETVNSDIKTTTSSHPVSSLTMLNDT LHNIRTTNQALKKELSQKTLTKTSLEEIALHSSQISMDVNKSAQLLDILS RHEYPINKDARELLHSAPKEAELDGDQMISHRELWAKIANSINDINEQYL KVYEHAVSSYTQMYQDFSAVLSSLAGWISPGGNDGNSVKLQVNSLKKALE ELKEKYKDKPLYPANNTVSQEQANKWLTELGGTIGKVSQKNGGYVVSINM TPIDNMLKSLDNLGGNGEVVLDNAKYQAWNAGFSAEDETMKNNLQTLVQK YSNANSIFDNLVKVLSSTISSCTDTDKLFLHF >CP0265 ipaH1.4, invasion plasmid antigen, secreted by the Mxi-Spa secretion machinery MIKSTNIQAIGSGIMHQINNVYSLTPLSLPMELTPSCNEFYLKTWSEWEK NGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQITTLEIRK NLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLP ENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALAN NFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAGNPLSGHT MRTLQQITTGPDYSGPQIFFSMGNSATISAPEHSLADAVTAWFPENKQSD VSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSAS AELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAV KEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEA DRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETE QQIYRQLTDEVLALRLSENGSNHIA >CP0054 ipaH2.5, invasion plasmid antigen, probably secreted by the Mxi-Spa machinery MIKSTNIQVIGSGIMHQINNIHSLTLFSLPVSLSPSCNEYYLKVWSEWER NGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQITTLEIRK NLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLP ENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALAN NFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAGNPLSGHT MRTLQQITTGPDYSGPQIFFSMGNSATISAPEHSLADAVTAWFPENKQSD VSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSAS AELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAV KEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEA DRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETE QQIYRQLTDEVLA >CP0079 ipaH4.5, invasion plasmid antigen, probably secreted by the Mxi-Spa machinery MKPINNHSFFRSLCGLSCISRLSVEEQCTRDYHRIWDDWAREGTTTENRI QAVRLLKICLDTREPVLNLSLLKLRSLPPLPLHIRELNISNNELISLPEN SPLLTELHVNGNNLNILPTLPSQLIKLNISFNRNLSCLPSLPPYLQSLSA RFNSLETLPELPSTLTILRIEGNRLTVLPELPHRLQELFVSGNRLQELPE FPQSLKYLKVGENQLRRLSRLPQELLALDVSNNLLTSLPENIITLPICTN VNISGNPLSTHVLQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAV TAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQ VAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQAS EGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTM LAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWG PWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAER EAGAQVMRETEQQIYRQLTDEVLA >CP0078 ipaH7.8, invasion plasmid antigen, probably secreted by the Mxi-Spa machinery MFSVNNTHSSVSCSPSINSNSTSNEHYLRILTEWEKNSSPGEERGIAFNR LSQCFQNQEAVLNLSDLNLTSLPELPKHISALIVENNKLTSLPKLPAFLK ELNADNNRLSVIPELPESLTTLSVRSNQLENLPVLPNHLTSLFVENNRLY NLPALPEKLKFLHVYYNRLTTLPDLPDKLEILCAQRNNLVTFPQFSDRNN IRQKEYYFHFNQITTLPESFSQLDSSYRINISGNPLSTRVLQSLQRLTSS PDYHGPQIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHE EHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVA ADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLE ILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGV TANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKY EMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEV LALRLSENGSRLHHS >CP0226 ipaH9.8, invasion plasmid antigen, secreted by the Mxi-Spa secretion machinery MLPINNNFSLPQNSFYNTISGTYADYFSAWDKWEKQALPGEERDEAVSRL KECLINNSDELRLDRLNLSSLPDNLPAQITLLNVSYNQLTNLPELPVTLK KLYSASNKLSELPVLPPALESLQVQHNELENLPALPDSLLTMNISYNEIV SLPSLPQALKNLRATRNFLTELPAFSEGNNPVVREYFFDRNQISHIPESI LNLRNECSIHISDNPLSSHALPALQRLTSSPDYHGPRIYFSMSDGQQNTL HRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSAR NTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRK TLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIE VYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEF TDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYPQRVADRLKASGL SGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLPENGSQLHHS >CP0122 ipaJ, IpaJ, invasion plasmid antigen MSEQRKPCKRGCIHTGVMLYGVLLQGAIPREYMISHQTDVRVNENRVNEQ GCFLARKQMYDNSCGAASLLCAAKELGVDKIPQYKGSMSEMTRKSSLDLD NRCERDLYLITSGNYNPRIHKDNIADAGYSMPDKIVMATRLLGLNAYVVE ESNIFSQVISFIYPDARDLLIGMGCNIVHQRDVLSSNQRVLEAVAVSFIG VPVGLHWVLCRPDGSYMDPAVGENYSCFSTMELGARRSNSNFIGYTKIGI SIVITNEAL >CP0131 ipgA, IpgA, similarities to IpgE, putative chaperone MCRKLYDKLYEITGAKLDFNDKNQAFILLEEQIPVCITDNDEYIFLTGLL NEHELFTENIINPEHILILNYSLSRDYGSSICLLPDTHQCVLTKKHYKKY LSPDELIESLYEFLFCIKLTIANITSEVN >CP0130 ipgB1, IpgB1, secreted by the Mxi-Spa machinery, function unknown MQILNKILPQVEFAIPRPSFDSLSRNKLVKKILSVFNLKQRFPQKNFGCP VNINKIRDSVIDKIKDSNSGNQLFCWMSQERTTYVSSMINRSIDEMAIHN GVVLTSDNKRNIFAAIEKKFPDIKLDEKSAQTSISHTALNEIASSGLRAK ILKRYSSDMDLFNTQMKDLTNLVSSSVYDKIFNESTKVLQIEISAEVLKA VYRQSNTN >CP0024 ipgB2, IpgB2, probably secreted by the Mxi-Spa secretion machinery MLGTSFNNFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSILSS VSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHSRKIGDNLRKQIFK QVEKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKNDTTS NVANLISDQFFEKNVQYIDLKKLRGNMSDYITNLESPF >CP0129 ipgC, IpgC, cytoplasmic chaperone for IpaB and IpaC MSLNITENESISTAVIDAINSGATLKDINAIPDDMMDDIYSYAYDFYNKG RIEEAEVFFRFLCIYDFYNVDYIMGLAAIYQIKEQFQQAADLYAVAFALG KNDYTPVFHTGQCQLRLKAPLKAKECFELVIQHSNDEKLKIKAQSYLDAI QDIKE >CP0133 ipgD, IpgD, secreted by the Mxi-Spa machinery, modulates entry of bacteria into epithelial cells MHITNLGLHQVSFQSGDSYKGAEETGKHKGVSVISYQRVKNGERNKGIEA LNRLYLQNQTSLTGKSLLFARDRAEVFCEAIKLAGGDTSKIKAMMERLDT YKLGEVNKRHINELNKVISEEIRAQLGIKNKKELQTKIKQIFTDYLNNKN WGPVDKNISHHGKNYGFQLTPASHMKIGNKNIFVKEYNGKGICCASTRES DHIANMWLSKVVDDEGKEIFSGIRHGVISAYGLKKNSSERAVAARNKAEE LVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVN ALKGLNSKRGEPTKLLIRNSDGLLKEVSVNLKVVTFNFGVNELALKMGLG WRNVDKLNDESICSLLGDNFLKNGVIGGWAAEAIEKNPPCKNDVIYLANQ IKEIVTKKLQKNDNGEPYKLSQRMTLLAYTIGAVPCWNCKSGKDRTGMQD AEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGV PGNKVMKKLPLSSLELSYSERIGDPKIWNMVKGYSSFV >CP0134 ipgE, IpgE, cytoplasmic chaperone for IpgD MEDLADVICRALGIPLIDIDDQAIMLDDDVLIYIEKEGDSINLLCPFCAL PENINDLIYALSLNYSEKICLATDDEGGNLIARLDLTGINEFEDVYVNTE YYISRVRWLKDEFARRMKGY >CP0135 ipgF, IpgF, periplasmic protein, similarities to the catalytic site of lyzozymes MSRFVFILLCFIPYLGRADCWDKAGERYNIPSSLLKAIAEKESGFNKSAV NVNNNGSKDYGIMQINDFHSKRLREMGYSEEMLISHPCLSVHYAAKLLNE FMMMYGRGWEAVGAYNAGTSPKKKKERLKYAEDIYRRYLRIAAESKQNNR RI >CP0082 ipgH, invasion plasmid gene product MRQLVISITEGLNMSLFTEPKEIERLPSEEIERLYPVLRYRVFISIFLGY MGYYFVRNTTSVLSGVLHMSATEIGIISCAGFLSYGISKFVSGLISDRSN SKVFLSLGLFLSGLVNFLIGYIPGIITSVTLFSTMYLLNGWIQGMGYPPG AKTLVFWYEHRERITWATLWNLSHNVGGALAPVLIGFSFGFFGDSALDHA RAAFIFPGVLCMAMSVLIYFIQVDRPVSVGLPPIEEWKGNVVSHPAKGRE QGPRLSIPDIIRKHIIRNNKLIYCCIYGSFVYILRYGIVSWAPKFLSDSL DVGGKDMGKLASMGGGSVFEIGGVAGMLLAGYLSVRLFRNSKPLTNTLFL ALTIILLIAYWYVPSGNEYLWLNYTILILLGLAVYGPVMFIGLYSMELVP KEAAGAASGLSGTFSYIFGSIVATLGMGLVVDYLGWGATFIVLILSAVFA IIFTLMSRERSLEFEKE >CP0010 mkaD, mouse killing factor MPIKKPCLKLNLDSLNVVRSEIPQMLSANERLKNNFNILYNQIRQYPAYY FKVASNVPTYSDICQSFSVMYQGFQIVNHSGDVFIHACRENPQSKGDFVG DKFHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIG AQFTLYVKSDQECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDV RPEDWKYVSYRNELRSDRDGSERQEQMLREEPFYRLMIE >CP0228 mob9, plasmid mobilization protein MSLAGNPCVIRLVAQVCMWLKFIIRDRGGFSGGLLLFLPVCCRDRTERIL AVHTIKILR >CP0238 msbB2, lipid A biosynthesis lauroyl acyltransferase MKKYKSEFIPEFKKNYLSPVYWFTWFVLGMIAGISMFPPSFRDPVLAKIG RWVGRLSRKARRRATINLSLCFPEKSDTEREIIVDNMFATALQSIVMMAE LAIRGPEKFQKRVFWKGLEILEEIRHNNRNVIFLVPHGWSVDIPAMLLAA QGEKMAAMFHQQRNPVIDYVWNSVRRKFGGRLHSREDGIKPFIQSVRQGY WGYYLPDQDHGPEYSEFADFFATYKATLPIIGRLMNISQAMIIPLFPVYD EKKHFLTIEVRPPMDACIASADNKMIARQMNKTVEILVGSHPEQYIWVLK LLKTRKSNEADPYP >CP0245 mvpA, plasmid maintenance protein MLKFMLDTNICIFTIKNKPASVRERFNLNQGKMCISSVTLMELIYGAEKS QMPERNLAVIEGFVSRIDVLDYDAAAATHTGQIRAELARQGRPVGPFDQM IAGHARSRGLIIVTNNTREFERVGGLRTEDWS >CP0246 mvpT, plasmid maintenance protein METTVFLSNRSQAVRLPKAVALPENVKRVEVIAVGRTRIITPAGETWDEW FDGHSVSTDFMDNREQPGMQERESF >CP0147 mxiA, MxiA, innermembrane protein, component of the Mxi-Spa secretion machinery MIQSFLKQVSTKPELIILVLMVMIIAMLIIPLPTYLVDFLIGLNIVLAIL VFMGSFYIERILSFSTFPSVLLITTLFRLALSISTSRLILVDADAGKIIT TFGQFVIGDSLAVGFVIFSIVTVVQFIVITKGSERVAEVAARFSLDGMPG KQMSIDADLKAGIIDAAGAKERRSILERESQLYGSFDGAMKFIKGDAIAG IIIIFVNLIGGISVGMSQHGMSLSGALSTYTILTIGDGLVSQIPALLISI SAGFIVTRVNGDSDNMGRNIMSQIFGNPFVLIVTSALALAIGMLPGFPFF VFFLIAVTLTALFYYKKVVEKEKSLSESDSSGYTGTFDIDNSHDSSLAMI ENLDAISSETVPLILLFAENKINANDMEGLIERIRSQFFIDYGVRLPTIL YRTSNELKVDDIVLLINEVRADSFNIYFDKVCITDENGDIDALGIPVVST SYNERVISWVDVSYTENLTNIDAKIKSAQDEFYHQLSQALLNNINEIFGI QETKNMLDQFENRYPDLLKEVFRHVTIQRISEVLQRLLGENISVRNLKLI MESLALWAPREKDVITLVEHVRASLSRYICSKIAVSGEIKVVMLSGYIED AIRKGIRQTSGGSFLNMDIEVSDEVMETLAHALRELRNAKKNFVLLVSVD IRRFVKRLIDNRFKSILVISYAEIDEAYTINVLKTI >CP0146 mxiC, MxiC, secreted by and putative component of the Mxi-Spa secretion machinery, similarities to YopN (secreted by the type III secretion machinery of Yer MLDVKNTGVFSSAFIDKLNAMTNSDDGDETADAELDSGLANSKYIDSSDE MASALSSFINRRDLEKLKGTNSDSQERILDGEEDEINHKIFDLKRTLKDN LPLDRDFIDRLKRYFKDPSDQVLALRELLNEKDLTAEQVELLTKIINEII SGSEKSVNAGINSAIQAKLFGNKMKLEPQLLRACYRGFIMGNISTTDQYI EWLGNFGFNHRHTIVNFVEQSLIVDMDSEKPSCNAYEFGFVLSKLIAIKM IRTSDVIFMKKLESSSLLKDGSLSAEQLLLTLLYIFQYPSESEQILTSVI EVSRASHEDSVVYQTYLSSVNESPHDIFKSESEREIAINILRELVTSAYK KELSR >CP0145 mxiD, MxiD, outermembrane protein of the secretin family, component of the Mxi-Spa secretion machinery MKKFNIKSLTLLIVLLPLIVNANNIDSHLLEQNDIAKYVAQSDTVGSFFE RFSALLNYPIVVSKQAAKKRISGEFDLSNPEEMLEKLTLLVGLIWYKDGN ALYIYDSGELISKVILLENISLNYLIQYLKDANLYDHRYPIRGNISDKTF YISGPPALVELVANTATLLDKQVSSIGTDKVNFGVIKLKNTFVSDRTYNM RGEDIVIPGVATVVERLLNNGKALSNRQAQNDPMPPFNITQKVSEDSNDF SFSSVTNSSILEDVSLIAYPETNSILVKGNDQQIQIIRDIITQLDIAKRH IELSLWIIDIDKSELNNLGVNWQGTASFGDSFGASFNMSSSASISTLDGN KFIASVMALNQKKKANVVSRPVILTQENIPAIFDNNRTFYVSLVGERNSS LEHVTYGTLINVIPRFSSRGQIEMSLTIEDGTGNSQSNYNYNNENTSVLP EVGRTKISTIARVPQGKSLLIGGYTHETNSNEIISIPFLSSIPVIGNVFK YKTSNISNIVRVFLIQPREIKESSYYNTAEYKSLISEREIQKTTQIIPSE TTLLEDEKSLVSYLNY >CP0144 mxiE, MxiE, similarities to transcriptional activators of the AraC family, function unknown MEGFFFVRNQNIKFSDNVNYHYRFNINSCAKFLAFWDYFSGALVEHSHAE KCIHFYHENDLRDSCNTESMLDKLMLRFIFSSDQNVSNALAMIRMTESYH LVLYLLRTIEKEKEVRIKSLTEHYGVSEAYFRSLCRKALGAKVKEQLNTW RLVNGLLDVFLHNQTITSAAMNNGYASTSHFSNEIKTRLGFSARELSNIT FLVKKINEKI >CP0136 mxiG, MxiG, component of the Mxi-Spa secretion machinery, contains one transmembrane segment MSEAKNSNLAPFRLLVKLTNGVGDEFPLYYGNNLIVLGRTIETLEFGNDN FPENIIPVTDSKSDGIIYLTISKDNICQFSDEKGEQIDINSQFNSFEYDG ISFHLKNMREDKSRGHILNGMYKNHSVFFFFAVIVVLIIIFSLSLKKDEV KEIAEIIDDKRYGIVNTGQCNYILAETQNDAVWASVALNKTGFTKCRYIL VSNKEINRIQQYINQRFPFINLYVLNLVSDKAELLVFLSKERNSSKDTEL DKLKNALIVEFPYIKNIKFNYLSDHNARGDAKGIFTKVNVQYKEICENNK VTYSVREELTDEKLELINRLISEHKNIYGDQYIEFSVLLIDDDFKGKSYL NSKDSYVMLNDKHWFFLDKNK >CP0137 mxiH, MxiH, component of the Mxi-Spa secretion machinery MSVTVPNDDWTLSSLSETFDDGTQTLQGELTLALDKLAKNPSNPQLLAEY QSKLSEYTLYRNAQSNTVKVIKDVDAAIIQNFR >CP0138 mxiI, MxiI, component of the Mxi-Spa secretion machinery MNYIYPVNQVDIIKASDFQSQEISSLEDVVSAKYSDIKMDTDIQVSQIME MVSNPESLNPESLAKLQTTLSNYSIGVSLAGTLARKTVSAVETLLKS >CP0139 mxiJ, MxiJ, lipoprotein, component of the Mxi-Spa secretion machinery MIRYKGFILFLLLMLIGCEQREELISNLSQRQANEIISVLERHNITARKV DGGKQGISVQVEKGTFASAVDLMRMYDLPNPERVDISQMFPTDSLVSSPR AEKARLYSAIEQRLEQSLVSIGGVISAKIHVSYDLEEKNISSKPMHISVI AIYDSPKESELLVSNIKRFLKNTFSDVKYENISVILTPKEEYVYTNVQPV KEVKSEFLTNEVIYLFLGMAVLVVILLVWAFKTGWFKRNKI >CP0140 mxiK, MxiK, putative component of the Mxi-Spa secretion machinery MIRMDGIYKKYLSIIFDPAFYINRNRLNLPSELLENGVIRSEINNLIINK YDLNCDIEPLSGVTAMFVANWNLLPAVAYFIGSQESRLINHSEMVISYYG GKISKQGEAAIRSGFWHLIAWKENISVGIYERINLLFNPIALEGNYTPVE RNLSRLNEGMQYAKRHFTGIQTSCL >CP0142 mxiL, MxiL, secreted by and putative component of the Mxi-Spa secretion machinery MINQINASNALQQRLNSEEVVNLNERLSSSQSFDEDIIYEIMQYFSQSEL NSIDNDELHNKIEQLFNSRFPYLTAAQKSSLLNKLIDANQYVDLHEGFYA SLSIYNNIDFYIKTTTFDSLISVFEAGREADDSTW >CP0143 mxiM, MxiM, lipoprotein, component of the Mxi-Spa secretion machinery MIRHGSNKLKIFILSILLLTLSGCALKSSSNSEKEWHIVPVSKDYFSIPN DLLWSFNTTNKSINVYSKCISGKAVYSFNAGKFMGNFNVKEVDGCFMDAQ KIAIDKLFSMLKDGVVLKGNKINDTILIEKDGEVKLKLIRGI >CP0141 mxiN, MxiN, putative component of the Mxi-Spa secretion machinery MKVCNMQKGTLPVSRHHAYDGVVIKRIEKELCKTIKDRDTESKKKAICVI KEATKKAESLRIDAVCDGYQIGIQTAFEHIIDYICEWKLKQNENRRNIED YITSLLSENLHDERIISTLLEQWLSSLRNTVTELKVVLPKCNLALRKKLE LDLHKYRSDVKIILKYSEGNNYIFCSGNQVVEFSPQDVISGVKIELAEKL TKNDKKYFKELAHKKLRQIAEDLLKENPVND >CP0003 ospB, OspB, protein secreted by the Mxi-Spa secretion machinery, function unknown MNLDGVRPYCRIVNKKNESISDIAFAHIIKRVKNSSCTHPKAALVFLGEK GFCDSNDVLSIMGQQIPRVFKNKMLYDYVFKNEKSKNDFLKMAESWLPQS EPIVINNDDDALNAAAYFSVKKAKIKTVNDTDFKEYNKVYILGHGSPGSH QLGLGSELIDVQTIISRMKDCGILNVKDIRFTSCGSADKVAPKNFNNAPA ESLSCILNSLPFFKEKESLLEQIKKHLENDESLSDGLKISGYHGYGVHYG QELFPYSHYRSTSIPADPEHTVKRSSQKKTFIINKELD >CP0094 ospC1, OspC1, secreted by the Mxi-Spa secretion machinery, function unknown MNISETLNSANTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDEKNLKDS ANIIKSFFRKTIAAQSYSRMFSQGSNFKSLNIAIDAPSDAKASFKAIEHL DRLSKHYISEIREKLHPLSAEELNLLSLIINSDLIFRHQSNSDLSDKILN IKSFNKIQSEGICTKRNTYADDIKKIANHDFVFFGVEISNHQKKHPLNTK HHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFLDKFSEV NKEVSRYVHGSKGIIDVPIFNTKDMKLGLGLYLIDFIRKSEDQSFKEFCY GKNLAPVDLDRIINFVFQPEYHIPRMVSTENFKKVKIREISLEEAVTASN YEEINKQVTNKKIALQALFLSITNQKEDVALYILSNFEITRQDVISIKHE LYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYE NAEMIKLLLKYGATSDNKYI >CP0063 ospC2, OspC2, probably secreted by the Mxi-Spa secretion machinery, function unknown MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR SLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLS NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIRLRDISLED AIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSKFNFTKQDI AEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFISNGLVDVNKRFQKAN SGDTMLDNAMKSKDSKTIDFLLKNGAVSGKRFGR >CP0005 ospC3, OspC3, probably secreted by the Mxi-Spa secretion machinery, function unknown MKNFLRKSIAAQSYSKMFSQGTSFKSLNLSLEAPSGARSSFRSLEHLDKV SRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLSNIILNIKS FDKIQSENIQTHKNTYSEDIKEISNHDFVFFGVEISNHQEKLPLNKTHHT VDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFLDNFKEVVD EVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQGFREFCYNK NIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLEDAIKASNYE EINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDVAEMEKMKN NRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKANSGDTMLDN AMKSKDSKMIDFFIKKWSGIRQTI >CP0115 ospC4, OspC4, probably secreted by the Mxi-Spa secretion machinery, function unknown MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR SLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLS NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLED AIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV AEMEKMKNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKVN SGDTMLDNAMKSKDSKMIDFLLKNGAILGKRFEI >CP0022 ospD1, OspD1, secreted by the Mxi-Spa secretion machinery, function unknown MSINNYGLHPANNKNMHLIIGSNTANENKGMKNNIINVTNTAISHAINEE KSGGGYSGVSFRKLAKIQNISIPTKNNKEYNRHNLFSLIWHGNADAARKY GESLLAAEIPKEEKLEVLAARNNAGESALFIALQEGHSAAIQAYGDFIKT FDLSPKETIKLLDVRDNEGLPGLFLAAGKGNIEAMMAYINICHHSGIKLT EIADRLNNNEQDMFNIISDKIQELF >CP0009 ospD2, OspD2, probably secreted by the Mxi-Spa secretion machinery, function unknown MPLNKTFSSSIFSTKNSLSTDMSVNRDNRTITSSIMRVSNSSELIQFKNK TAPYFSEKRNVEVNINGVAKDIYGRQIVCRHLASYWEMNFMETNGKVNYQ LLSTPDAIAKNVCLEKTEDFSKSPAYIYFVENKKWGTVITNFFYNMKKNG DFVRTLSACTLNHQMALGLKIKRVQESEKWVVQFFDPNRTVTHKRTVFTC DSHFELSQLSAKDFFDDFYWKIYGLEQPGQVIFEDRHNSPLTNTVKLLPD ELINSRVIYHAITKNLTEVLFILMEKYKNGEISQSKLVNLLATRSSDGTP AFYIALQNGYSDIIQVYGKILNMCNLSQETILTLLAAVGANNVPGLCMSF MNGHVDTIKAYGEIVFKTPLTSDKRLYLLAAKDSHDLPGLFFALQNGHAD SIRMFGSLLNKKMLSSEQIKELLKVKHGLFMALQNGHTKAIMAYGDILKI LPPHQEYIDELLWIKNPNGTSGLFMAFYNGHTETIRAFCNILKNYSFTTR RLVEMLSATNKDGIPGVFVSVVNRDKETILEYCRIIKENNLEPDTIAEQF SKKMKKTFIEIINRFNHFL >CP0093 ospD3, OspD3 (SenA), probably secreted by the Mxi-Spa secretion machinery, function unknown MPSVNLIPSRKICLQNMINKDNVSVETIQSLLHSKQLPYFSDKRSFLLNL NCQVTDHSGRLIVCRHLASYWIAQFNKSSGHVDYHHFAFPDEIKNYVSVS EEEKAINVPAIIYFVENGSWGDIIFYIFNEMIFHSEKSRALEISTSNHNM ALGLKIKETKNGGDFVIQLYDPNHTATHLRAEFNKFNLAKIKKLTVDNFL DEKHQKCYGLISDGMSIFVDRHTPTSMSSIIRWPDNLLHPKVIYHAMRMG LTELIQKVTRVVQLSDLSDNTLELLLAAKNDDGLSGLLLALQNGHSDTIL AYGELLETSGLNLDKTVELLTAEGMGGRISGLSQALQNGHAETIKTYGRL LKKRAINIEYNKLKNLLTAYYYDEVHRQIPGLMFALQNGHADAIRAYGEL ILSPPLLNSEDIVNLLASRRYDNVPGLLLALNNGQADAILAYGDILNEAK LNLDKKAELLEAKDSNGLSGLFVALHNGCVETIIAYGKILHTADLTPHQA SKLLAAEGPNGVSGLIIAFQNRNFEAIKTYMGIIKNENITPEEIAEHLDK KNGSDFLEIMKNIKS >CP0227 ospG, OspG, secreted by the Mxi-Spa secretion machinery, function unknown MKITSTIIQTPFPFENNNSHAGIVTEPILGKLIGQGSTAEIFEDVNDSSA LYKKYDLIGNQYNEILEMAWQESELFNAFYGDEASVVIQYGGDVYLRMLR VPGTPLSDIDTADIPDNIESLYLQLICKLNELSIIHYDLNTGNMLYDKES ESLFPIDFRNIYAEYYAATKKDKEIIDRRLQMRTNDFYSLLNRKYL >CP0031 parA, plasmid segregation protein MTSFEQLSKVAQRADKMLLALTKQIQEQKQEFQADVFYQVYSKSAVAKLP KLTRASVDGAVGEMEAQGYQFEKRPAGTATKYALTIQNIIDIYAHRGIPK YRDRYSEAYSIFIGSLKGGVSKTVSSVSVAHALRAHPHLLSEDLRILLLD LDPQSSATMFLNYLHAVGLVDTTAPQAMLQNVSREELLEDFIVPSVIPGV YVMPASIDDAFIASNWDTLCEEHLLGQNKHAILRENIIDKLKHDFDFILI DTGPHLDAFLKNAIAAADIMFTPVPPAQVDFHSTLKYLARLPELVQIIEQ DGCSCRLQANIGFMSKLANKSDHKYCHSLTKEIFGGDMLDVSMPRLDGFE RSGESFDTVISANPVTYVGSGEALKNARMAAEDFAKAVFDRIEFIRANY >CP0032 parB, plasmid segregation protein MENRKHRPTIGRTLNTNILNNTEEISAPVHVFTLNTGRKAKFTEIKVDHD KVDTQTFVVEEVNGREQTALTPDSLKDITRTIRLQQFYPCIGIRTGDLIE ILDGSRRRAAALLCKVGLRVLVTDDELTVSEAQHLAKDLQTSLEHNIREI GLRLVRLKEAGMNQKQIAEREGLSAAKVTRALQAASVPKDFVSLFPVQSE LTYADYRQLAELSERLRLGDISIDEVVKNISPSIELITADDNLSEDEVKN SIMRLITKEMSSLLDSGVKDKAVVTLLWKFDSKDKFARKRVKGRTFSYEF GRLPLEVQDKLDRMIALVLKDNLNSL >CP0190 phoN1, PhoN1, periplasmic non specific acid ohosphatase MKRQLFTLSIVGVFSLNTFASFPPGNDVTTKPDLYYLTNDNAIDSLALLP PPPQIGSIAFLNDQAMYEKGRLLRNTERGKLAAEDANLSSGGVANVFSAA FGSPITAKDSPELHKLLTNMIEDAGDLATRSAKEYYMRIRPFAFYGVSTC NTKEQDTLSRNGSYPSGHTSIGWATALVLSEINPARQDTILKRGYELGDS RVICGYHWQSDVDAARIVGSAIVATLHSNPVFQAQLQKAKDEFANNQKK >CP0004 phoN2/apy, PhoN2 (Apy), periplasmic phosphatase, apyrase, ATP diphosphohydrolase MKTKNFLLFCIATNMIFIPSANALKAEGFLTQQTSPDSLSILPPPPAENS VVFQADKAHYEFGRSLRDANRVRLASEDAYYENFGLAFSDAYGMDISREN TPILYQLLTQVLQDSHDYAVRNAKEYYKRVRPFVIYKDATCTPDKDEKMA ITGSYPSGHASFGWAVALILAEINPQRKAEILRRGYEFGESRVICGAHWQ SDVEAGRLMGASVVAVLHNTPEFTKSLSEAKKEFEELNTPTNELTP >CP0260 repA, RepA, replication protein MTDLQQTYYRQVKNPNPVFTPREGARTLPFCGKLMEKAVGFTSRFDFAIH VAHARSLGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIE CGLATESAAGKLSITRATRALTFLAELGLITYQTEYDPLIGCYIPTDITF TSALFAALDVSEEAVAAARRSRVEWENRQRKKQGLDTLGMDELMAKAWRF VRERFRSYQTELKSRGMKRARARRDADRQRQDIVTLVKRQLTREISEGRF TASREAVKREVERRVKERMILSRNRNYSRLATASP >CP0258 repB, RepB, replication protein MSQTENAVTSSLSQKRFVRRGKPMTDSEKQMAAVARKRLTHKEIKVFVKN PLKDLMVEYCEREGITQAQFVEKIIKDELQRLDILK >CP0236 rfbU, UDP-sugar hydrolase MNILFTESSPNIGGQELQAVAQMKALKKMGHSVLLVCRENSKIAFEASKL GIDITFALFRNSLHIPTAWRLLGIVHGFQPNAIVCHSGHDSNIVGLVRLF TRKHPFRIIRQKTYLTRKTKVFSINHFCDEVIVPGTSMKTHLEQEGCRTR VTVVPPGFDFQKLYVDSRNSLPPNVLSWLASRRGCPVIAQVGMLRPEKGH EFMLNLLFHLKMNGRQFCWLIVGSGSPELREHLQYQIDSMGMHDDVFIAD NVFPAAPVYRVASLVVLPSENESFGMVLAEASAFSVPVLASQIGGIPDVI QNNQTGTLLPAGNKHAWMCALNDFFNDPGRFYQMARQAKQDIEERFDINK TALKILTLAKHK >CP0070 sepA, SepA, extracellular serine protease of the IgA1 protease family, secreted by a C-terminal autotransporter domain MNKIYYLKYCHITKSLIAVSELARRVTCKSHRRLSRRVILTSVAALSLSS AWPALSATVSAEIPYQIFRDFAENKGQFTPGTTNISIYDKQGNLVGKLDK APMADFSSATITTGSLPPGDHTLYSPQYVVTAKHVSGSDTMSFGYAKNTY TAVGTNNNSGLDIKTRRLSKLVTEVAPAEVSDIGAVSGAYQAGGRFTEFY RLGGGMQYVKDKNGNRTQVYTNGGFLVGGTVSALNSYNNGQMITAQTGDI FNPANGPLANYLNMGDSGSPLFAYDSLQKKWVLIGVLSSGTNYGNNWVVT TQDFLGQQPQNDFDKTIAYTSGEGVLQWKYDAANGTGTLTQGNTTWDMHG KKGNDLNAGKNLLFTGNNGEVVLQNSVNQGAGYLQFAGDYRVSALNGQTW MGGGIITDKGTHVLWQVNGVAGDNLHKTGEGTLTVNGTGVNAGGLKVGDG TVILNQQADADGKVQAFSSVGIASGRPTVVLSDSQQVNPDNISWGYRGGR LELNGNNLTFTRLQAADYGAIITNNSEKKSTVTLDLQTLKASDINVPVNT VSIFGGRGAPGDLYYDSSTKQYFILKASSYSPFFSDLNNSSVWQNVGKDR NKAIDTVKQQKIEASSQPYMYHGQLNGNMDVNIPQLSGKDVLALDGSVNL PEGSITKKSGTLIFQGHPVIHAGTTTSSSQSDWETRQFTLEKLKLDAATF HLSRNGKMQGDINATNGSTVILGSSRVFTDRSDGTGNAVFSVEGSATATT VGDQSDYSGNVTLENKSSLQIMERFTGGIEAYDSTVSVTSQNAVFDRVGS FVNSSLTLGKGAKLTAQSGIFSTGAVDVKENASLTLTGMPSAQKQGYYSP VISTTEGINLEDNASFSVKNMGYLSSDIHAGTTAATINLGDSDADAGKTD SPLFSSLMKGYNAVLRGSITGAQSTVNMINALWYSDGKSEAGALKAKGSR IELGDGKHFATLQVKELSADNTTFLMHTNNSRADQLNVTDKLSGSNNSVL VDFLNKPASEMSVTLITAPKGSDEKTFTAGTQQIGFSNVTPVISTEKTDD ATKWVLTGYQTTADAGASKAAKDFMASGYKSFLTEVNNLNKRMGDLRDTQ GDAGVWARIMNGTGSADGDYSDNYTHVQIGVDRKHELDGVDLFTGALLTY TDSNASSHAFSGKNKSVGGGLYASALFNSGAYFDLIGKYLHHDNQHTANF ASLGTKDYSSHSWYAGAEVGYRYHLTKESWVEPQIELVYGSVSGKAFSWE DRGMALSMKDKDYNPLIGRTGVDVGRAFSGDDWKITARAGLGYQFDLLAN GETVLQDASGEKRFEGEKDSRMLMTVGMNAEIKDNMRLGLELEKSAFGKY NVDNAINANFRYVF >CP0235 shf, putative carbohydrate transport protein MLNEGGILFKANHVPVLMYHHVSHCPGLVTLSPVTFRKQIKWLAENNWKT LSSDELEFFYRGGKLPRKSVMLTFDDGYLDNWFQVYPLLKEFNLKAHIFL ITGFIGNGPVRHSPGKEYSHRDCEHQIATGNADNVMLRWSEVNEMLQSGL VEFHVHTHTHTRWDKKFSSREEQCKHLRQDLLSGREYLKEMTGKCSKHLC WPEGYYNKDYIQVAEELGFYYLYTTERRMNAPAKGTTRIGRISTKERESC AWLKRRLFYYTTPFFSSLLAFHKGPRLPDD >CP0150 spa13, Spa13, component of the Mxi-Spa secretion machinery MEALDKRIIYFLQLENDLEPVGAQSVSQLFNTRRKIAIVKKHIIQYQSER ILLKGRIEEIQKDIDEANASKRKLLHKESKICKRIGLIKRNNFAKQLILD ELSQEDMKYGIR >CP0148 spa15, Spa15, putative component of the Mxi-Spa secretion machinery MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNE QVMLWANFDAPSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQL RVVIKDDYVHDGIVFAEILHEFYQRMEILNGVL >CP0153 spa24, Spa24, component of the Mxi-Spa secretion machinery MLSDMSLIATLSFFTLLPFLVAAGTCYIKFSIVFVMVRNALGLQQVPSNM TLNGIALIMALFVMKPIIEAGYENYLNGPQKFDTISDIVRFSDSGLMEYK QYLKKHTDLELARFFQRSEEENADLKSAENNDYSLFSLLPAYALSEIKDA FKIGFYLYLPFVVVDLVISSILLALGMMMMSPITISVPIKLVLFVALDGW GILSKALIEQYINIPA >CP0155 spa29, Spa29, component of the Mxi-Spa secretion machinery MDISSWFESIHVFLILLNGVFFRLAPLFFFLPFLNNGIISPSIRIPVIFL VASGLITSGKVDIGSSVFEHVYFLMFKEIIVGLLLSFCLSLPFWIFHAVG SIIDNQRGATLSSSIDPANGVDTSELAKFFNLFSAVVFLYSGGMVFILES IQLSYNICPLFSQCSFRVSNILTFLTLLASQAVILASPVMIVLLLSEVLL GVLSRFAPQMNAFSVSLTIKSLLAIFIIFICSSTIYFSKVQFFLGEHKFF TNLFVR >CP0151 spa32, Spa32, secreted by and component of the Mxi-Spa machinery MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDIPESMAIKENILIP DQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNR NGQIGLKNHSLSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSL QEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC >CP0152 spa33, Spa33, component of the Mxi-Spa secretion machinery MCGDWVIRIDTLSFLKKKYEVFSGFSTQESLLHLSKCVFIESSSVFSIPE LSDKITFRITNEIQYATTGSHLCCFSSSLGIIYFDKMPVLRNQVSLDSLH HLLEFCLGSSNVRLATLKRIRTGDIIIVQKLYNLLLCNQVIIGDYIVNDN NEAKINLSESNGESEHTEVSLALFNYDDINVKVDFILLEKNMTINELKMY VENELFKFPDDIVKHVNIKVNGSLVGHGELVSIEDGYGIEISSWMVKE >CP0156 spa40, type III secretion protein MANKTEKPTPKKLKDAAKKGQSFKFKDLTTVVIILVGTFTIISFFSLSDV MLLYRYVIINDFEINEGKYFFAVVIVFFKIIGFPLFFCVLSAVLPTLVQT KFVLATKAIKIDFSVLNPVKGLKKIFSIKTIKEFFKSILLLIILALTTYF FWINDRKIIFSQVFSSVDGLYLIWGRLFKDIILFFLAFSILVIILDFVIE FILYMKDMMMDKQEIKREYIEQEGHFETKSRRRELHIEILSEQTKSDIRN SKLVVMNPTHIAIGIYFNPEIAPAPFISLIETNQCALAVRKYANEVGIPT VRDVKLARKLYKTHTKYSFVDFEHLDEVLRLIVWLEQVENTH >CP0149 spa47, type III secretion system ATPase MSYTKLLTQLSFPNRISGPILETSLSDVSIGEICNIQAGIESNEIVARAQ VVGFHDEKTILSLIGNSRGLSRQTLIKPTAQFLHTQVGRGLLGAVVNPLG EVTDKFAVTDNSEILYRPVDNAPPLYSERAAIEKPFLTGIKVIDSLLTCG EGQRMGIFASAGCGKTFLMNMLIEHSGADIYVIGLIGERGREVTETVDYL KNSEKKSRCVLVYATSDYSSVDRCNAAYIATAIAEFFRTEGHKVALFIDS LTRYARALRDVALAAGESPARRGYPVSVFDSLPRLLERPGKLKAGGSITA FYTVLLEDDDFADPLAEEVRSILDGHIYLSRNLAQKGQFPAIDSLKSISR VFTQVVDEKHRIMAAAFRELLSEIEELRTIIDFGEYKPGENASQDKIYNK ISVVESFLKQDYRLGFTYEQTMELIGETIR >CP0154 spa9, Spa9, component of the Mxi-Spa secretion machinery MSDIVYMGNKALYLILIFSLWPVGIATVIGLSIGLLQTVTQLQEQTLPFG IKLIGVSISLLLLSGWYGEVLLSFCHEIMFLIKSGV >CP0195 stbA, plasmid stable inheritance protein MLKVSCDDGSTNVKLAWLEDGEVRTSLSGNSFKEGWNPGLFNAGKVYNYV VDEKKYTYDLGSTAVIGTTHVSYQYSTTNLLAIHHALLTSGLQPQDVELT VTLPVTEFFDNDNQPNEERIERKKANVLREISLNKGETFKIKKVNVMPES LPAAFESLKKDKVNKLERSLIIDLGGTTLDCGLILGAFEGISEIRGYSEI GTSRITHTVMNALTKASTPCNYFIADELIKNRHDNEYLQTLINDVAEIKN ISHVIDREVKSLAESIRQEISTFSGMNRIYLTGGGAELIYPHIKQYFPNL KVNKVDEPQFALVKAMVHA >CP0194 stbB, plasmid stable inheritance protein MESSDPKKRKKVVAYLHPALYPQDNLTQQTIDSLPVQMRGDFYRQSLICG AALYSVAPRLLTLISVFFSEKITAENLVKLIEQTTGYTSTSIDISVLKNI IEASSENKSESITSKDDFEEQTRRNLSMLKK >CP0259 tap, TapA MPGKVQDFFLCSLLLRIVSAGWCD >CP0243 traD, DNA transport protein MSVKLRLPQISESGEVVDMAAYEAWQQENHPDTWQQMQRREEVNINVHRE RGEDVEPGDDF >CP0250 traX, F pilin acetylation protein MTTDNTNTTRNDSLAARTDTWLQSFLVWSPGQRDIIKTVALVLMVLDHIN LIFQLKQEWMFLAGRGAFPLFALVWGLNLSRHAHIRQPAINRLWGWGIIA QFAYYLAGFPWYEGNILFAFAVAAQVLTWCTTRSGWRTAAAILLMALWGP LSGTSYGIAGLLMLAVSNRLYRAEDRAERLALVACLLAVIPALNLATSDA AAVAGLVMTVLTVGLVLCAGKSLPRFWPGDFFPTFYACHLAVLGVLAL >CP0244 trbH, hypothetical protein MNRSAPVFSSQAAHTFKFPGVISHNNQPPTAGMTCDHLIKWPDRASLTGK FCSYLAGVCGCSSVVIQNINAGNKSLDHSEITFRHLAFFCTIYQLHQGDR TDTHFPLVQVKTLPDAGGFVLYRKNADVGIEHKLQHQNDSLSCMPGCSLL SIKSVLTLCPSNHSSHVSPAGVMILVRPTAITSTRFTFSGNATAFGSLTA WLRLLRNTVVSIICLLMWICLVYIYCGIDTGICQRDIRL >CP0185 ushA, UshA, probable periplasmic UDP-sugar hydrolase MIPLKKNITLIMFTLSLLTGNPAIAYETDKVYKITVLHTNDHHGHFWRNN HGEYGLSSQKTLVDNIRQKVINNGGSVLLLSGGDINTGVPESDLQKAEPD IRGMNLIGYDAMAVGNHEFDNPLNILRQQEKWATFPFLSANIYQKSTGRR LFSPWKIFIRQNLKIAVIGLTTDDTAKTGNSEYFTDIEFRQPAAEARSVI DELNQQEKPDIIIAATHMGHYDNGESGSNAPGDVEMARSLPTGSLAMIVG GHSQAPVCMASDNKKQWNYIPGTTCVPDKQNGIWIVQAHEWGKYVGQADF EFCNGTMKLVNYQLHPVNLKMRITREDGKTEFSFYTPEITEDPQMLSLLT PFQNKGKAQLDVKVGVVNGRLEGDRSKVRFVQTSMGHLILSALTERIDAD FAVVSGGEIRDSIESGNITYKDILKVQPFGNTVVSIDLTGKEVADYLATV AQMKPDSGAYPQFLNTSFVVKKGKIEMLKIKGKSVDLNKKYRMTTFSFNA TGGDGYPRIDNRPGYINTGFIDAEVLIEYIRKHSPLDAASYEPKGEVSWQ >CP0181 virA, VirA, secreted by the Mxi-Spa secretion machinery, function unknown MQTSNITNHERNDSSWMSTVKSTTEVSWNKLSFCDILLKIITFGIYSPHE TLAEKHSEKKLMDSFSPSLSQDKMDGEFAHANIDGISIRLCLNKGICSVF YLDGDKIQSTQLSSKEYNNLLSSLPPKQFNLGKVHTITAPVSGNFKTHKP APEVIETAINCCTSIIPNDDYFHVKDTDFNSVWHDIYRDIRASDSNSTKI YFNNIEIPLKLIADLINELGINEFIDSKKELQMLSYNQVNKIINSNFPQQ DLCFQTEKLLFTSLFQDPAFISALTSAFWQSLHITSSSVEHIYAQIMSEN IENRLNFMPEQRVINNCGHIIKINAVVPKNDTAISASGGRAYEVSSSILP SHITCNGVGINKIETSYLVHAGTLPSSEGLRNAIPPESRQVSFAIISPDV >CP0123 virB, VirB, transcriptional activator required for tanscription of the ipa, mxi, and spa operons MVDLCNDLLSIKEGQKKEFTLHSGNKVSFIKAKIPHKRIQDLTFVNQKTN VRDQESLTEESLADIIKTIKLQQFFPVIGREIDGRIEILDGTRRRASAIY AGADLEVLYSKEYISTLDARKLANDIQTAKEHSIRELGIGLNFLKVSGMS YKDIAKKENLSRAKVTRAFQAASVPQEIISLFPIASELNFNDYKILFNYY KGLEKANESLSSTLPILKEEIKDLDTNLPPDIYKKEILNIIKKSKNRKQN PSLKVDSLFISKDKRTYIKRKENKTNRTLIFTLSKINKTVQREIDEAIRD IISRHLSSS >CP0046 virF, VirF, member of the AraC family of transcriptional activators, required for transcription of virB and icsA MMDMGHKNKIDIKVRLHNYIILYAKRCSMTVSSGNETLTIDEGQIAFIER NIQINVSIKKSDSINPFEIISLDRNLLLSIIRIMEPIYSFQHSYSEEKRG LNKKIFLLSEEEVSIDLFKSIKEMPFGKRKIYSLACLLSAVSDEEALYTS ISIASSLSFSDQIRKIVEKNIEKRWRLSDISNNLNLSEIAVRKRLESEKL TFQQILLDIRMHHAAKLLLNSQSYINDVSRLIGISSPSYFIRKFNEYYGI TPKKFYLYHKKF >CP0237 virK, VirK, required for proper localization of IcsA (VirG) at the surface of bacteria MFSVSNLSFIGFLKRIVFSSDSLPGKWEHRKFRFMYILRCAINPVASIRY YYELRSLQCIEDILAIQPTLPARIHRPYLHKGGRAWSRGQYILEHYRFVQ NLPEKYSEFLFPQKSVSLVQFIGKDGEDFDIQCSPSGFDREGELMLSLFF NKIVIARLTFSVILTQNGHTAFIGGLQGAPKNTGPDIIRCATRACYGLFP KRIIFEAFCALMKACNVSECLAVSEHSHVFRQLRYWYQKRKTFVAVYSDF WESVAGKTCGDWYKLPTQVVRKPLSNIASKKRSEYRKRYALLDYIHETAI RSLDAYPVNSEHYDLN >CP0196 yccB, hypothetical protein MRHGLMEAACERRIPMPNWCSNRMYFPGEPAQIAEIKRLASGAVTPLYRR ATNEGIQLFLAGSAGLLQITENIRSEQCPGVTAAGRGAVSTENIAFTRWL THLQNGVLLDEQNCLMLHELWLQSGTGQRRWEGLPDDARETITVHFTAKR GDWCDIWGNEDVSVWWNRLCDNVVPEKTMPFDLLTVLPTRLDVEVNGFNG GVLNGVPSAYHWYTERYGVKWPCGYDLNISSQGENFIQVDFDTPWCQPES DVIAELSRRFSCTLEHWYAEQGCDFCGWQLYERGELVNVLWGELEWSSPT DDDEQPEVTGPAWIVDNVAHYGG >CP0198 ycdA, hypothetical protein MSRFVLGNCIDVMARIPDNAIDFILTDPPYLVGFRDRQGRTIAGDKTDEW LQPACNEMYRVLKKDALMVSFYGWNRVDRFMSAWKNAGFSVVGHLVFTKN YTSKAAYVGYRHECAYILAKGRPRLPQNPLPDVLGWKYSGNRHHPTEKPV TSLQPLIESFTHPNAIVLDPFAGSGSTCVAALQSGRRYIGIELLEQYHRA GQQRLAAVQRAMQQGAANDDWFMPEAA >CP0199 yceA, hypothetical protein MNYAGHEKLRAEVAEVANAMCDLRTTMNEMERRYSFNADTLPERLVRQTL FRANRLLMEAYTEILELDSCFKD >CP0202 ycfA, hypothetical protein MNETLNALICRHARNLLLAQGWPEETDVDQCNPNYPGWISIYVRLDAPRL ATLLVNRHDGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQV SFPYAGEWLTEDEIRAVLDAVRDAVCSVSCRGAEDARRIRAALTTSGQTL LTRQTRRFRLVVKESDHPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV SDCIEHILSSGLACDVLRIPDEPPRRWFDRGVLREVVREARAEIRSMADA LAKIRK >CP0252 yigA, hypothetical protein MNGFRNSSRNGQVWRYQRAGGRAVILEVSGRWMEAAEAWRRAACVAPRTD WQQFARKRAEHCHRRCRGRV >CP0253 yigB, hypothetical protein MLHYSGGLKYRWHLSDMENNMRKYIPLALFIFSWPVLSADIHGRVVRVLD GDTIEVMDSLKAVRIRLVNIDAPEKKQDYGRWSTDMMKSLVAGKTVTVTY >CP0257 yihA, hypothetical protein MKLIIFILIVLIIAALLIRIILRSVNQHSPLLMQLHAAGIRTGDAERILS SGEYWQRQKTLLTEREVSFMKGLFRIVDMKRWYLCPQVRVADIVQLNGNI RPRSRQWWQLFRMVSQWHVDVVIVELRSFSIVAAVELDDASHLRPERRRR DILLEEVLRQAGIPLLRSHDARKLLQMTGEWLNTTGADQQSPEHRS