Gene list
Applied filters:
Gene type: CDS
Gene type: CDS
Gene type: CDS
Genomic element: pSS_046
Number of genes found: 238
Hide UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Shigella sonnei Ss046, Ss046 >SSO_P217 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P155 putative IS orf, fragment MYLKIRDRLGYMSNTSSNFEMTGTLLGLELRKRKTPQEKIAIIQQTMEPG MTVSHVARLHGIQPSLLLKWKK >SSO_P014 IS91 ORF MLPRFADIFQQGNRWLNWLEKQSVQMSRLEHYAGQDEIGLRYNSHRTKRE ENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEAAITGRCGVRHNGDSE KNGEANHKERDVSAVTEG >SSO_P133 IS629 ORF2 MFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYVSLA YTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAEVE LATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA >SSO_P123 IS600 ORF2 MESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQRRHSRLGNISP AAFRIKYYQMTA >SSO_P175 IS3 ORF2 MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG DITYLRTDEVRLHPVSTEPHAF >SSO_P006 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWMNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_P138 putative reverse transcriptase, fragment MARTRSGRETSRTITAHRLRGNTGRRVIEGDLSSYFDTVHHRLLMKAVCR RISDARFMRLLWKTPC >SSO_P073 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P150 hypothetical protein MFNAKIRGWIKYYGAFYKSALYLTLRQIDRKLVLWLPRKHKRLRGHRRRA SHWLARVARSETRLFAHWPLLWGQASMRRAG >SSO_P132 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_P045 ISSfl2 ORF MALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRAFASAAHLAAY SGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALRDPLSRAYYTR KMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_P048 IS1294 ORF MTRSGGDFQPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRI LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC AIHTYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQ RLLKAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC YVQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM PA >SSO_P081 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_P028 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_P035 hypothetical protein MFDQQRDGNLYKIWNRLCSGTWFPPPVLEKRIPKSNGKERILGIPTVSDR IAQGAIKLFMEEKLDPIFHADSYGYRPGKSAHDALKQCAIRCWRYSWILE VDISAFFDHVRHDLVLKALEHHGMPKWVILYCRRWMEAPMQSCENGELIT RTRGTPQGGVISPLLANLFHHYAFDLWMEREYRGYRLRGTLTIL >SSO_P040 IS1294 ORF MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >SSO_P079 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_P236 putative transposase MWCFFNLFGVLIPIDERNLTRERTQVGLQAARARGRKGGRPKTLSKDKQA LAVQLYNEKKHTVAQICVLMGISRPTLYKYIESARLFKK >SSO_P119 IS600 ORF2 MAHIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRA TTSSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDV YTCEIVGYAMGERMTKELTGKALFMALRSQHPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISS AYGKTD >SSO_P210 ISSfl1 ORF2 MRQQQDEQGRFSICSRQAAVVHRDAYCNRNVVERCFGRLKEYRRIATRYD KTARNYLAMVKLGCIRLFYQRLRN >SSO_P121 putative transposase METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP KLLIPDNLRSAVKKADRYEPVINDSYQALVEHYGTVIIPARPRKPKDKPK AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRLGEQHPAYASE HENLRGPGYYH >SSO_P165 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P174 putative IS orf MLTELLTRAYPCPPLTPRSTVCGLFARFRKSGLSWPLPAGMSEQELDALL YGSASTVPVVLTESTVMPKLPVVKKRPRRP >SSO_P053 hypothetical protein MSVKLRLPQCTDNKKTETDAIYDKVRSSYLLSCILKKNKNVGLILHAPSF VSVSEKIARIVMANYSRNWSNSELASAVLMSESSLKRRMYKEVGSISTFV HKIKLTEAIRKLRRTNTPISVISSELGYSSPSYFSKVFFKYLKTYPQNIR KKNGR >SSO_P141 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW VRQHERDTGGGEVGSPPLNVSV >SSO_P005 hypothetical protein MWCCRGTVCTIPYVDQYNRNDNFRFRAQPKYILGHLSNRLPDTAPFFNKK SIIFESLLFIALSSIVSPLFAF >SSO_P012 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDAARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_P076 IS1353 putative transposase-like protein MVDCFDGKVVSWSLSTRPDAELVNTMLDSAVETLNAGERPVIHSDRGGHY RWPGWLERVNAAGLIRSMSRKGCSPDNAACEGFFGRLKTEMYYGRKWSGI TPEKFMQQVDAYIRWYNERRIKLSLGAVSPKMYRQQCGLE >SSO_P033 putative transposase, fragment MSEQKITGIDLAKTNFYLFSINAHGKPAGKTKLSRNQLLNWLVQQPKMTV AMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYNDAQAIAEA CQHGTIRPVPIKTLEQQDVQTFLNMRRLVSMERTQLINHIRGLLAEYGIV FSKGAADLRQK >SSO_P030 iso-IS1 ORF2 MVTSDDWGSYAREVPKEKHLTGKIFTQRIERNNRTLRTRIKRLARKTICF SRSVEIHEKVIGSFIEKHMFY >SSO_P144 adhesion protein, fragment MLPPNIRGYAPQITGIAETNARVVVSQQGRVIYDSTVPAGTFSIQDLSSS VRGILDVEIFEQNGKRKHFQVEMCRCAFLIQTWSE >SSO_P151 putative reverse transcriptase, fragment MKDRNGSGAKGLPHCADGAAATTGDNADGRTAVKSAKPFPVSKRQVWEAY KRVKANRGAAGIDGQTLAGFDENVTDNLYKLWNRMASGSYMPQAVRRVDI PKADGGVRPLGIPAVSDRIAQMVVKQILEPVLEPLFHADSYGYRPGKSAH QAIAQARKRCWKFDWVVEVDIKGFFDDIDHDLLLKTVQHHTQARWVVMYI ERRLKAPVQMPDGAMLARGRGTPQGGVISPLLSNLFLHYAFDMWMQRQFP GVPFERYADDVVCHCHSQWQADALISGLRQRLAQCGLQLHPQKTRIVYCK DADRRGDYPETSFDFLGYTFRPRLSMNRWGKTFVNFSPAMSARAGKAIRQ EVRRIAVTSPCTSWRICSMRKSEAG >SSO_P021 putative transposase, fragment MTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRTLRDRNGTFEPQQ LKKNQP >SSO_P074 IS600 ORF2 MAHIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRA TTNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHTDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFRIKYYQMTA >SSO_P208 hypothetical protein MTYTVKFRDDALKEWLKLDKTIQQQFVKKLKKCSENPHIPSAKLRGLKDC YKIKLRASGFRLVYQVIDDMLIIAVVAVGKRERSNVYNLASERMR >SSO_P154 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_P193 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P122 putative transposase MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGYYIIHRK NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT S >SSO_P149 hypothetical protein MSDATRLKNRLSVRFSEVGLVLNAGKTNIAYIDTFKRRNVATSFSFLGYD FKVRTLKNFKGELYRKCMPGASNAAMCKITETIKKWRIHRSTAESLLDFA RRYNAIVRGWIEYYGKFWSRNFSYRLWSAMQSRLLKWMQSKYRLSNRKAQ RKLALVRKQYPKLFAHWYLLRASNE >SSO_P027 IS100 ORF1 MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC >SSO_P239 IS911 ORF2 MQTMTSRSRQAAYSGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISH GSAGARSIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEH VAIPNYLERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGW AMSFSPDSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRY QIRQSMSRRGNCWDNSPMERFFRSLKNEWMPMVGYVSFREAAHAITDYIV GYYSALRPHEYNGGLPPNESENRYWKNSNSVASFC >SSO_P238 putative IS91 ORF2 MARSAKPRKRKPASQRSKLPRYVVKLHEDDFFDEEDAEVLRFDSFDDAVE CCADLNIPFFVDAGNKKLVFWFVRVDDEGYPEIARCTEREFATILSGISA GGMYCPECGTVHWPDGVAPPF >SSO_P129 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >SSO_P172 putative IS orf MAGRRLGVPKSTVCGMFVRFRNAGLSWPLPAGMSEQELDALLYGSASTVP VVLTESTVMPKLPVVKKRPRRPNADQLRIS >SSO_P173 IS4 ORF MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_P211 putative transposase MDEKKLKALAAELAKGLKTEADLNQFSRMQTKLTVETVLNAELTDHLGHE KSYIR >SSO_P207 hypothetical protein MVVSGISLRLVKQSQRCVMPNIILSETSASVSELKKNPMATVSAGDGYPV AILNRNQPAFYCVPAELYERMLDALDDQELVKLVTERSNQPLHDVDLDSY L >SSO_P209 IS1294 ORF MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >SSO_P237 IS91 ORF MACGTTLMGYTQWCCSSPDCCHTKKVCFRCKSRSCPHCGVKAGAQWIQYL LSLVPDCPWQHIVFTLPCQYWSLVFHNRWLLTEMSRIAADVILEICHQAD VEPGIFTVIHTWGRDQQWHPHIHLSTTAGGVTSGHTWKNLHFYARKVMSM WRYRITRLLSRKYPDLVMPDALAAEGSSKREWNRFLDTHYRRGWNVNVSR VMDNATHVAVYFGSYLKKPPVPMSRLEHYAGQDEIGLRYNSHRTKREEYL LMSGDEFMERFSWHVADKGFRMVRYYGFLSPAKRRLLEEVVYIITETVRK TAMQITWRGMYQRLLKVDPLKCVLCGSQMRFTGLKRGYRLAEQVLMHELL ARMRWCG >SSO_P218 IS630 ORF MQTMTSRSRQAAYSISLLKRLKATYRRAKTITLIVDNYIIHKSRETQSWL KENPKFTYRKLKN >SSO_P178 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P071 ISSfl4 ORF1 MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRKNHGSLAAANRGVAEYELSE >SSO_P184 putative transposase MCRLAVEYLLYAARKRGLEIGIFCTIHTLRLHFEEHLPLVVAGRRLGVPK STVCSMFVRFRKAGLSWPLPAGMSERELDARLYGSASTVPVVLTESTVMP EVPGVKKRPRRPNFPYEFKIALVEQSLQPGACVAQIARENGINDNLLFNW RHQYRKGGLLPSGKNMPALLPVTLTPEPDNHGFDIIYMLSTHHQRFTFVR LFDPYLIGSRPTFSHLAHHHIS >SSO_P194 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P034 hypothetical protein MDGKRISGVPFERYADDIVVHCSRMSDATRLKNRLSERFSEVGLVLNAGK TNIAYIDTFKRRNVATSFTFLGYDFKVRTLKNFKGERYRKCMPGASNAAM RKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKFWSRNFNYRL WSAMQSRLLKWMQSKYRLSNRKAQRKLTLVRKEYPKLFVHWYLLRASNE >SSO_P080 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_P042 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA RELAGFIWDMGRIAMSVAQQPQCHK >SSO_P187 putative antirestriction protein MQYAKPVTLNVEECDRLSFLPYLFGNDFLYAEAYVYALAQKMMPEYQGGF WHFIRLPDGGGYMMPDGDRFHMVNGANWFDRTVSADAAGIILTSLVINRQ LWLYHDSGDAGLTLR >SSO_P168 hypothetical protein MAQVNMSVRIDAELKDAFMAAAKSMDRNGSQLIRDFMRQTVERQHNSWFR DQVAAGRQQLECGDVLPHDMVESSAAAWRDEMSRKIADK >SSO_P019 putative IS orf, fragment MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL VAKLQRMQFGKSSEKLRAKTERQIQEAQERISALQEEMAETLGEQYDPVL PSPLRQSSAHKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVLEQLE LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS PDRKGIHPQNHLAGYSGVLQADAYGSYRALYESGRITEAQQRIGELYAIE AEVRGCSAEQRLAARKARAAPLMQSLYDWIQQQMKIHSLKMECLHGEHYY PSGNSAGNSV >SSO_P043 IS2 ORF2 MADNGSAYTAHETRQFARELNLEPCTTAVSSPQSNGIAERFVKTMKEDCI AFMPKPRTALHNLAVAIEHYNENHPHSALGYLSPREYRRQRVMST >SSO_P156 hypothetical protein MINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQ KMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLE QIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGF NDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHY ANSSSWKSKRLC >SSO_P130 IS629 ORF2 MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR GKKVRTTISRKAFAAGDRVNRQFVAERPDQLWVADFTYVSTCVSASDIRR >SSO_P195 oriT nicking and unwinding protein, fragment MNAERLFSTARELRDVAAGRAVLRQAGLAGGDSPARFIAPGRKYPQPYVA LPAFDRNGKSAGIWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSR NGESLLADNMQDGVRIARDNPDSGVVVSIAGEGRPWNPGAITDGRVWGDI PDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPD LPDGKTELAVREIAGQERDRAAITEREAALPESVLRESQREREAVREVAR ENLLQERLQQMERDMVRDLQKEKTPGGD >SSO_P026 IS3 ORF2 MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCD SVVLAAAFTRSKQRYGAPRLTDELRAQGYHFNVKTVAASLRRQGLRAKAS RKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLA VVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHSDRGSQ YCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHGEHFIS REIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA >SSO_P146 IS629 ORF2 MGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTW QGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHH SDKGSQYVSYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRK SWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLA A >SSO_P125 IS3 ORF1 MTKTVSTSKKTRKQHSPEFCSEALKLAERIGVAAAARELSLYESQLYAWR SKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKRLK >SSO_P166 putative IS orf, fragment MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLL >SSO_P189 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P162 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIMLTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_P061 IS629 ORF2 MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAKVIHRKSWKNRA EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA >SSO_P124 IS3 ORF2 MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCD SVVLAAAFTRSKQRYGAPRLTDELRAQGYHFNVKTVAASLRRQGLRAKAS RKFSPVSYRAHGLRCTGNSGHHHLFFF >SSO_P215 putative IS91 ORF2 MDAGNKKLVFWFVRVDDEGYPEIARCTEREFATILSGISAGGMYCPECGT VHWPDGVAPPF >SSO_P066 ISSfl1 ORF1 MACYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP WRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDAGDAANLLI >SSO_P127 putative transposase, fragment MCWGRTALYMAALEAPRFNLVIKAFYMRLLAAGNAKKVALVACMRKLLTI MNAMLRKNEEWNESYL >SSO_P228 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWMNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_P140 IS629 ORF2 MPLLDKLREQYGVGSVCSELHIAPSTYYHCQQQRHHHDKRSARAQRDDWL KKEILRVYDENHQVYAVRKVWHQLLREGIRVARCTVARLMAVMGLAGVLR GKKVHTTVSRKAVAAGDRVNRHQGNMPRTPGGPQRLVYVVSAADKDKHTS AVPSALRQRCPQGFYPVQRYGAPRLTDELCALVTTLT >SSO_P139 IS3 ORF2 MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF YASGPNQKWAGDITYYYSSPTAGKHGAPGY >SSO_P135 IS91 ORF MLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQDEIGLRYNSHRTKRE ENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEAAITGRCGVRHNGDSE KNGEANHKERDVSAVTEG >SSO_P036 IS21 ORF1 MIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRHKMVKLKPF MDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKRKMRPSKRT VRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQ DAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVFNSGFLLLA DHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSFTHVNQQLE QWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSYFDIRHVSW DSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVASHRLCSAS SGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_P018 ISSfl4 ORF2 MISFPAGSRIWLVAGITDMRNGFNGLASKVQNVLKDDPFSGHLFIFRGRR GDQIKVLWADSDGLCLFTRRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI DWKHPKRTERAGIRI >SSO_P161 hypothetical protein MVGVAGAALAPLVKLLRHELLTRDVIHADETSLRLLDTRKGGKSCSGWLC AYVSGERSGPPVVCFDSQTGRALRYPETWLQCWCGGTLVSDGYSVYKSLA DNHPGITSACCWSHAGRGFANLYKASREPRAGVELRKIAGLYRIEKLIRE RCQRHDV >SSO_P136 putative IS91 ORF2 MACDYRYKNRQYHCLSGSYMARSAKPRKRKPAPQRSKLLRYVVKLHEDDF FDEEEAEVLRFDNFDDAVECCADLNIPFFVDAGNKKLVFWFVRVDDEGYP EIARCTEREFATIPAGISADGMYCPECGTVHWPDGVIPPF >SSO_P157 IS100 ORF2 MLHEEKLARHQRKQAMYTRMVAFPAVKMFEEYDFTFATGAPQKQLQSLRS LSFIERNENIVLQGTSDITNPRVGICV >SSO_P051 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNLFFEMKA >SSO_P023 hypothetical protein MPGATVADEFDKTLAFLEAIVNADNETTIGEIRSFADALDAVRFNRNKIN RQLSKPNLASLALEHEVIWLGRSR >SSO_P153 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_P219 putative IS91 ORF2 MVCNYRYKNRQCHCLSGGYMARSAKPRKRKPASQRSKLPRYVVKLHEDDF FDEEDAEVLRFDSFDDAVECCADLNIP >SSO_P234 putative aquaporin MFKPFSAEFFGTFWLVLGGCGSALISAAFPQLGIGFLGVALAFGLTVVTM AYAVGHISGAHFNPAVTLGLWAGGRFPAARVLPYIIAQVIGGIAAAAVLY GIASGKAGFDATTSGFAANGYGLHSPGGYALSACMLSEFVLSAFFVRSDR KTRSCGLCATGDWSGNHPVN >SSO_P055 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAMVKLGCIRLFYQRLRN >SSO_P054 ISSfl1 ORF1 MVHKSDSDELSALRAENARIIKPLLLPEPATPRAGRPWAEHRKIINGMFW VLCSGAPWRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGF IDWSATALDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQ TEVASR >SSO_P148 hypothetical protein MNQKVKSVGSDNVIDDHHVFFADSRCDFVKVVSADVCDMGMQLLYFVFLL LPVVAEFNLAA >SSO_P016 putative transposase MEYRTWITEALRLHFEEHLPRVVAGRRLGVPKSTVCGMFVRFRNAGLSWP LPAGMSEQELDALLYGSASTVPVVLTESTVMPKLPVVKKRPRRP >SSO_P179 hypothetical protein MYHHVSHCPGLVTLSPVTFRKQMKWLAENNWKTLSSDELEFFYRGGKLPR KSVMLTFDDGYLDNWFQVYPLLKEFNLKAHIFLITGFIGNGPVRHSPGKE YSHRDCEHQIATGNADNVMLRWSEVNEMLQSGLVEFHVHTHTHTRWDKKF SSREEQCKHLRQDLLSGREYLKEMTGKCSKHLCWPEGYYNKDYIQVAEEL GFYYLYTTERRMNAPAKGTTRIGRISTKERESCAWLKRRLFYYTTPFFSS LLAFHKGPRLPDD >SSO_P082 IS100 ORF1 MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC >SSO_P134 putative transposase, fragment MNKRAFFGAFLIFWGFKFLSMDMNAGYIRAARIHLPNAVEKIAFDRFHVA KQPGEVVDKTRQNEPPRVSWRVFYL >SSO_P077 IS21 ORF1 MADVADRRELRQFRQTPEQRFTQEQEHLQPLLGTDFDIRHVSWDGYIEVG GNRYSVPESLCGQLVSIRISLDDELRIYSNEQQVASHRLCSAAYGWQTVR PGCSSVTAKSA >SSO_P240 hypothetical protein MISPIKNIKNVFPINTANTEYIVRNIYPRVEHGYFNESPNIYGKKYISGI TRSMAQLKIEEFINEKSRRLNYMKTMYSPCPEDFQPISRDEASTPEGSWL TVISGKRPMGQFSVDSLYHPDLHALCELPEISCKIFPKENSDFLYIIVVF RNDSPQGELRANRFIELYDIKREIMQVLRDESPELKSIKSEIIIAREMGE LFSYASEEIDSYIKQMNDRLSQIKARMPVT >SSO_P220 IS1294 ORF MTRSGGDFQPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRI LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC AIHTYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQ RLLKAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC YVQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM PA >SSO_P176 hypothetical protein MFSKAFLRKISMFFARRKPAAMKICLYHTLNPDTIPGYKKFAQAIATDNF VQADVRKIDTNLYRARLSIRDRLLFSLYRYHGETICLVLEYIRNHAYNTS RFLRRNVVIDEGRLQQQPVPDPVDIATEALTYINPSHGRFHRLDKMLSFD DDQQALYEHPLPLVIVGSAGSGKTALVLEKMKQAAGDILYLSLSSFLVEK ARTLYDASGEGSEVQNIDFLSLTEFLETLRIPKGREVTFSAFSDWLPRNR AIAALGAAHTLYEEFRGVIGAVASGNGPLSREAYLSLGIRQSLYGMEDRP TVYVLFERYIAWLKQSHQYDSNLLSHQYLSLATPRYDVIFVDEVQDMTPV QLQLVLKTLRHPGQFLLCGDANQIVHPSFFSWSSLKSLFFRQQQGNDTTV NILQANYRNGHHVTALANRLLRLKQVRFSAIDRESHHFVRSCGQAEGTIR LLDDREETKQELNAKTSLSNRVAVIVMHPEQKAQARCWFSTPLVFSVQEV KGLEYETVILYNIVSAARQAFDDICEGLTPADLEGEARYSRPRDRQDRSA EIYKFFTNALYVALTRATHNVYLVEQQVEHPLWSLLALTHQEEPLNLQEE ISSRDEWQKTAHLLEKQGKQEQADTIRSRILQTSEMPWQIITAEDARQWK QHILAGTADKTIQLQALEYSLIYSLFPLYNALYREDFKPTRQPRTKTLQL LELKYFRPYSMNNPVAVLRDIERYGVDHRSPFNLTPLMSAARAGNIALVQ LLLERGADPLLTGNDGLAAYHQVLSAAVSTPRYAQQKSAQLYTLLKPESL SLQVEGRLIKLDNRQMAMFLVILMQALFHTHLGSALFFSEAFSAARLAEC VVHLPEALLPERRKRRSYISSQLSQHEVNSKNPYGKKLFLRLNHGQYILN PGLKIRQGDVWRAVYELQSPEDLGHDLQTYLQDMSPELVDMLGGKKGFYE RSEKSVGYWVGGIRRAAQKA >SSO_P128 hypothetical protein MKLTSLCWALKELAKDIWSRPWSEERRNDWQRWLSLAANSDVPMMKNVAK TIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKAKGYRNRERFKLGVM FHYGKLNMAF >SSO_P011 hypothetical protein MKVSFKSLGYIFHDIYNKKHTIDEFNDVVRKAVLSGKINELNACHKVAIF LAEKDNEITKKDKAKIIDTLTENYSIEFQQLMNISERTLNSSLYITPGES GFVSFVNREGKICHTAYVKSSDNSMAYYHANYSSIDKYITDMCGLICMRH IESTGIIFYMLDEKVLSAIAEFMNEKGWRAAFCSAKNLYKCV >SSO_P188 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P057 hypothetical protein MPGTTTAMSINFIGMTARTMNSNGSHGKPQIPVDYQKLLSIEDITFCRNR WGNIGENALRRVAVGKKLSFFGSDRGGENAAII >SSO_P183 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHQVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_P177 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P017 IS4 ORF MKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGE MADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNL VRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRD LASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_P185 hypothetical protein MIYPVEELLQIEVDHPVIPGPDIFLGLYHCLVGRTTGTEPVAVVAERAIS QCLQYLHHSLLDEAIHHHLDAQQTFAAAGLQYGYSSHRGWAVSAGQQLRF QLWPVVPQVVRQFTYAHAIDSRRTLIASCRWNSNQRARGRQR >SSO_P186 IS186 ORF1 MGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS LAFGEADYIVRVYWRGLRWLTAEGMRFDMMDFLRGLDCGKNGETTVMIGN SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA GHVLLLTSLSEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEL ELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKKN >SSO_P060 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >SSO_P046 ISSfl2 ORF MQPPGCRGSGKRLFDKALPNDENKLRSLISDLKQHGQILLVVDQPATIGA LPVAVARSEGVLVGYLPGLAMRRIADLHAGEAKTDARDAAIIAEAARTLP HALRTLKLADEQIAELSMLCGFDDDLAAQTTQASNRIRGLLTQIHPALER VLGPRLEHPAVLDLLQRYPSPEKLASLGFAG >SSO_P069 IS1294 ORF MTRSGGDFQPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRI LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC AIHTYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQ RLLKAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETCTNGET VPES >SSO_P064 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAMVKLGCIRLFYQRLRN >SSO_P221 IS630 ORF MTWELILDGYSESSYSATPRFAAARLPWFRVIYQPVYSPWVNHVERLWQA LHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_P001 putative resolvase, fragment MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR SERRLVKTLLADFQRVHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI STVL >SSO_P159 sugar phosphate transport protein-like protein MEPVYVILNALLDSGRFTRKLILLGLSGSFSYIFGSIVATLGMGLVVDYL GWGATFIVLILSAVFAIIFTLMSRERSLEFEKE >SSO_P180 hypothetical protein MNILFTESSPNIGGQELQAVAQMKALKKMGHSVLLVCRENSKIAFEASKL GIDITFALFRNSLHIPTAWRLLGIVHGFQPNAIVCHSGHDSNIVGLVRLF TWKHPFRIIRQKTYLTRKTKVFSINHFCDEVIVPGTSMKTHLEQEGCRTR VTVVPPGFDFQKLYVDSRNSLPPNVLSWLASRRGCPVIAQVGMLRPEKGH EFMLNLLFHLKMNGRQFCWLIVGSGSPELREHLQYQIDSMGMHDDVFIAD NVFPAAPVYRVASLVVLPSENESFGMVLAEASAFSVPVLASQIGGIPDVI QNNQTGTLLPAGNKHAWMCALNDFFNDPGRFYQMARQAKQDIEERFDINK TALKILTLAKHK >SSO_P078 hypothetical protein MFISYSEVSIKNPDNSGQALPLTYVCCREQAEDGACWHLLTSGKAASAAD ARRIVSHYERRWLTEEYHKAWKSGGTWNRCECRPGITLSAWWLSRRL >SSO_P039 IS629 ORF1 MVLESQGEYDSQWATICSIAPKIGCTPETLRVRVRQHERDTGGGDGGLTT AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK >SSO_P015 putative IS91 ORF2 MDAGNKKLVFWFVRVDDEGYPEIARCMEREFATIPAGISADGMYCPECGT VHWPDGVIPPF >SSO_P013 IS21 ORF2 MNKRAFFGAFLIFWGFKFLSMNCRYEKASIILTSNKGVADWGEMFGDHVL ATAILNSCA >SSO_P164 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P075 putative IS orf MLTDIFNSNYQCYGYRRLHAMLRHEGGRLSEKVVRRLMVEEQLVVSRNRR RRYSSYCGEIGPAPDNLIARDFKAEQPNQK >SSO_P068 ISSfl1 ORF2 MCRRCSKKHPDIDGDNGPGRSRGGFGTKIHLATDGSGLPLNIVLSPGQAH ESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNELKNNGIKA VIPRKSNEKMASDGRAQLDV >SSO_P010 hypothetical protein MLNWLSKLRAARIHLPNAVEKIAFDRFHVAKQPGEVVDKTRQNEHPHLPV ESRRQAKGTRFLWQHSNKWMTESRQEKLIWLRAQMKLTSLCWALKELAKD IWSRPWSEERRNDWQRWLSLAANSDVPMMKNVAKTIGKRLYGILNAMRHG VSNGNAEALNSKIRLLRIKAKGYRNRERFKLGVMFHYGKLNMAF >SSO_P002 IS2 ORF2 MVHATGLMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKTIKCDYISI MPKPDGLTAAKNLAEAFEHYNE >SSO_P120 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P206 hypothetical protein MAENGYGLAGLGMGKVKSVNQYRLTPGFGGFTPVSHVTAACRLTCRWRGI RIIQAAFNAFAKV >SSO_P037 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_P200 hypothetical protein MRKYIPLVLFIFSWPVLSADIHGRVVRVLDGDTIEVMDSLKAVRIRLVNI DAPEKKQDYGRWSTDMMKSLVAGKTVTVTYFQRDRYGRILGQVYAPDGMN INQFMVRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWIWM HRK >SSO_P214 IS91 ORF MACGTTLMGYTQWCCSSPDCCHTKKVCFRCKSRSCPHCGVKAGAQWIQYL LSLVPDCPWQHIVFTLPCQYWSLVFHNRWLLTEMSRIAADVILEICHQAD VESGIFTVIHTWGRDQQWHPHIHLSTTAGGVTSGHTWKNLHFYARKVMSM WRYRITRLLSRKYPDLVMPDALAAEGSSKREWNRFLDTHYRRGWNVNVSR VMDNATHVAVYFGSYLKKPPVPMSRLEHYAGQDEIGLRYNSHRTKREEYL LMSGDEFMERFSWHVADKGFRMVRYYGFLSPAKRRLLEEVVYIITETVRK TAMQITWRGMYQRLLKVDPLKCVLCGSQMRFTGLKRGYRLAEQVLMHELL ARMRWCG >SSO_P158 hypothetical protein MNAHWSSKKSNFFRKKIKLLTKYLFFESQGIPDKVDIVSRLKTYGYSISG VETDDGYKALVRAFQLHFRQKNYDGIMDAETAAILYALLEKYFPGK >SSO_P163 IS21 ORF1 MGYTGGRSMLRYYIQPKRKMRPSKRTVRFETQPGYQLQHDWGEVEVEVAG QRCKVNFAVNTLGFSRSFHVFAAPKQDAEHTYESLVRAFRYFGGCVKTVL VDNQKAAVLKNNNGKVVFNSGFLLLADHYNFLPRACRPRRARTKGKVERM VKYLKENFFVRYRRFDSFTHVNQQLEQWIADVADKRELRQFKETPEQRFA LEQEHLQPLPDTDFDTSYFDIRHVSWDSYIEVGGNRYSVPEALCGQPVSI RISLDDELRIYSNEKLVASHRLCSASSGWQTVPEHHAPLWQQVSQVEHRP LSAYEELL >SSO_P216 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P199 hypothetical protein MDSETVHGTARSGVTSVPAGGPLFWKSVDAGWKRQKHGDGLPVLRPGQTG SSLPEKGLNTATGAAGEGCNEKSSLHYSRSQKAERSL >SSO_P056 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_P137 putative reverse transcriptase, fragment MPQGGVISPLLSNIILNEFDQYLNKRYLSGKARKDRWYWNHSIQRGRSTA VKENWQWKPAVAYCCYADDCVPRRRVLGT >SSO_P222 hypothetical protein MSHNLEHQKVHTRMVKEVLKAVARANNHPYQSVFADFITGHPSCTVCFWE TFHKMYPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW >SSO_P047 hypothetical protein MSHNLEHQKVHTRMVKEVLKAVARANNHPYQSVFADFITGHPSCTVCFWE TFHKMYPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW >SSO_P235 IS629 ORF1 MVLESQGEYDSQWATICSIAPKIGCTPETLRVRVRQHERDTGGGDGGLTT AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK >SSO_P131 IS21 ORF1 MIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRHKMVKLKPF MDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKRKMRPSKRT VRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQ DAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVFNSGFLLLA DHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSFTHVNQQLE QWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSYFDIRHVSW DSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVASHRLCSAS SGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_P083 hypothetical protein MVINKGKCRRCRVSQISDIIEPNIRCITLQIVESLQVKYTLVGVYVAKHV IQVHLTNKDMSEVEDK >SSO_P145 IS629 ORF1 MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >SSO_P052 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P070 hypothetical protein MSHNLEHQKVHTRMVKEVLKAVARANNHPYQSVFTDFIAGHPSCTVCFWE TFHKMYPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW >SSO_P192 oriT nicking and unwinding protein, fragment MMSIAQVRSAGSADNYYTDKDNYYVLGSMGERWAGQGAEQLGLQGSVDKD VFTRLLEGRLPDGADLSRMQDGSNRHRPGYDLTFSAPKSVSMMAMLGGDK RLIEAHNQAVDFAVRQVEALASTRVMTDGQSETVLTGNLVMALFNHDTSR DQEPQLHTHAVVTNVTQHNGEWKTLSSDKVGKTGFIENVYANQIAFGRLY REKRKEQVEALGYETEVVGKHGMWEMPGVPVEAFSGRSQTIREAVGEDAS LKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKETGFDIRAYRDAAEQRAY TRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQFMYTDLLARTVGILPPE NGVIERARAGIDEAISREQLIPLDREKGLFTSGIHMLDELSVRALSRDIM KQNRVTVHPEKSVPRTAGYSDAVSVLAQDRPSLAIVSGQGGAAGQRERVA ELVMMAREQGREVQIIAADRRSQMNLKQDERLSGELITGRRQLLEGMAFT PGSTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQRTGTGSALMAM KDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFAASVKAGEESVAQ VSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRYLRDMY RPGMVMEQWNPETRSHDRYVTERVTAQSHSLTLRNAQGETQVVRISSLDS SWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASVSEDAMTVVVP GRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFASVTQMAMDNA TLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKARAGETLLE TAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTEAKSFAAEGTG FADLGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEG KEAVTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVVQGYAGVGKTT QFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDAQTLASFLHDT QLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAVGGGRAVASGDT DQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAVYSLINRDVER ALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEA FPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDAREK AGELGQVQVMVPVLNTANIRDGELRRLSTWENNPDALALVDNVYHRIAGI SKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVGTGDRIRFTKS DRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERAEQHIDLAYAI TAHGAQGASETFAIALEGTEGNRKLMR >SSO_P063 ISSfl1 ORF1 MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP WRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR >SSO_P038 IS629 ORF2 MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAKVIHRKSWKNRA EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA >SSO_P126 hypothetical protein MNDNSLLRNSSLFIAYMGCVGWVSAYSYGWGTSFYYGFPWWVVGAGLDDV ARSLLYAIIVMGILFTGWGIGILFFLLIKKRSKIQDLSFFRLFFAITLLF FPVIFELLILKQYFILPLSLSCIISSLVISIIIRIYGRIFSVSCFSDIPF VREHRIKLIMAGFLVYFWLFSFLVGWYKPQLKKEYQMLCYNNSWYYILAR YDSRLVLSSSFKDDSNRFLIFNTEQSGFYEINDVYVRK >SSO_P152 IS600 ORF2 MVLRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNA PMESFWGTLKNGTGTE >SSO_P020 putative transposase, fragment MISNEGEFMNEKQLTSNKLRALANELAKSLKNPEDLSQFDWMLKMKPYSM LI >SSO_P118 hypothetical protein MGVNFCNKIGIDQSEFEIESSIINSIANEVLNPISFLSNKDIINVLLRKI SSECDLVRKDIYRCALELVVEKTPDDL >SSO_P029 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA RELAGFIWDMGRIAMSVAQQPQCHK >SSO_P025 IS3 ORF1 MTKTVSTSKKTRKQHSPEFRSEALKLAERIGVAAAARELSLYESQLYAWR SKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKRLK >SSO_P062 IS1294 ORF MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETCTNGETVPE S >SSO_P086 acp, Acp MIKEKILSIVAFCYGIAYSKLSEETKFIEDLSADSLSLIEMLDMISFEFN LRIDESALEHIITIGDLISVVKNSTKSI >SSO_P197 finO, FinO MTEQKRPVLTLKRKTEGETPVRSRKTIINVTTPPKWKVKKQKLAEKAARE AELAAKKAQARQALSIYLNLPTLDEAVNTLKPWWPGLFDGDTPRLLACGI RDVLLEDVAQRNIPLSHKKLRRALKAITRSESYLCAMKAGACRYDTEGYV TEHISQEEEAYAAARLDKIRRQNRIKAELQAVLDEK >SSO_P201 hmo, putative regulator MAKTKQEWLYQLRRCSSVNTLEKIIHKNRDSLSNSERESFNSAADHRLAE LITGKLYDRIPKEIWKYVR >SSO_P143 icsA/virG, IcsA/VirG MNQIHKFFCNMTQCSQGGAGELPTVKEKTCKLSFSPFVVGASLLLGGPIA FAIPLSGTQELHFSEDNYEKLLTPVDGLSPLGAGEDGMDAWYITSSNPSH ASRTKLRINSDIMISAGHGGAGDNNDGNSCGGNGGDSITGSDLSIINQGM ILGGNGGSGADHNGDGGEAVTGDNLFIINGEIISGGHGGDSYSDSDGGNG GDAVTGVNLPIINKGTISGGNGGNNYGEGDGGNGGDAITGSSLSVINKGT FAGGNGGAAYGYGYDGYGGNAITGDNLSIINNGAILGGNGGHWGDAINGS NMTIANSGYIISGKEDDGTQNVVGNAIHITGGNNSLILHEGSVITGDVQV NNSSILKIINNDYTGTTPTIEGDLCAGDCTTVSLSGNKFTVSGDVSFGEN SSLNLAGISSLEASGNMSFGNNVKVEAIINNWAQKDYKLLSADKGITGFS VSNISIINPLLTTGAIDYTKSYISDQNKLIYGLSWNDTDGDSHGEFNLKE NAELTVSTILADNLSHHNINSWDGKSLTKSGEGTLILAEKNTYSGFTNIN AGILKMGTVEAMTRTAGVIVNKGATLNFSGMNQTVNTLLNSGTVLINNIN APFLPDPVIVTGNMTLEKNGHVILNNSSSNVGQTYVQKGNWHGKGGILSL GAVLGNDNSKTDRLEIAGHASGITYVAVTNEGGSGDKTLEGVQIISTDSS DKNAFIQKGRIVAGSYDYRLKQGTVSGLNTNKWYLTSQMDNQESKQMSNQ ESTQMSSRRASSQLVSSLNLGEGSIHTWRPEAGSYIANLIAMNTMFSPSL YDRHGSTIVDPTTGQLSETTMWIRTVGGHNEHNLADRQLKTTANRMVYQI GGDILKTNFTDHDGLHVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGL YSSWFQNEKERTGLYMDAWLQYGWFNNTVKGDGLTGEKYSSKGITGALEA GYIYPTIRWTAHNNIDNALYLNPQVQITRHGVKANDYIEHNGTMVTSSGV NNIQAKLGLRTSLISQSCIDKETLRKFEPFLEVNWKWSSKQYGVIMNGMS NHQIGNRNVIELKTGVGGRLADNLSIWGNVSQQLGNNSYRDTQGILGVKY TF >SSO_P094 icsB, IcsB MILKISNFIDASNTKGPIRVEDTEHGPILVAQKFNLKDLFFRTLSTINAK INSQILNEQLKNYRLANQKSLLLFLKTLASEKSAESAFAAYEAVKNSIQH SFTGKDIKLMLNTAERFHGIGTAKNLERHLVFRCWGNRGITHLGHTSISI KNNLLQEPTHTYLSWYPGGNVTKDTEINYLFEKRSGYSVDTYKQDKLNMI SDQTAERLDAGQEVRNLLNSKQDQNNNKKIFFPRANQKKDPYGYWGVSAD KVYIPLSGDNKTKDGKISYNLFGLDETNMSKFICQKKADAFRQLANYKLI SKSENCAGMALNVLKAGNSEIYFPLPDVKLVATPNNVYAYANKVRQRIES LNQSYNEIMKYIESDFDLSRLTQLRRSYLKSFNKINLIHTPKTFKPLSIS LYKHPTENVSSEDFDAVINACHSYLVKSAPSNMSRVLNELKTGATDKKEE IIEKSIKTIDYYNSLKSPDLGTKLYIHDLLQVNKLLLNNSHSNI >SSO_P241 icsP/sopA, IcsP/SopA MKLKFLVLALCVPAIFTTHATTNYPLFIPDNISTDISLGSLSGKTKERVY HPKEGGRKISQLDWKYSNATIVRGGIDWKLIPKVSFGVSGWTTLGNQKAS MVDKDWNNSNTPQVWTDQSWHPNTHLRDANEFELNLKGWLLNNLDYRLGL IAGYQESRYSFNAMGGSYIYSENGGNRNKKGAHPSGERTIDYKQLFKIPY IGLTANYRHENFEFGAELKYSGWVRSSDTDKHYQTETIFKDEIKNQNYCS VAANIGYYVTPSAKFYIEGSRNYISNKKGDTSLYEQSTNISGTIKNSASI EYIGFLTSAGIKYIF >SSO_P160 insB, IS1 ORF2 MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P067 insB, IS1 ORF2 MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P203 insB, IS1 ORF2 MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P087 ipaA, IpaA MHNVNNTQAPTFLYKATSPSSTEYSELKSKISDIHSSQTSLKTPASVSEK ESFATSFNQKCLDFLFSSSGKEDVLRSIYSNSMNAYAKSEILEFSNVLYS LVHQNGLNFENEKGLQKIVAQYSELIIKDKLSQDSAFGPWSAKNKKLHQL RQNIEHRLALLAQQHTSGEALSLGQKLLNTEVSSFIKNNILAELKLSNET VSSLKLDDLVDAQAKLAFDSLRNQRKNTIDSKGFGIGKLSRDLNTVAVFP ELLRKVLNDILEDIKDSHPIQDGLPTPPEDMPDGGPTPGANEKTSQPVIH YHINNDNRTYDNRVFDNRVYDNSYHENPENDAQSPTSQTNDLLSRNGNSL LNPQRALVQKVTSVLPHSISDTVQTFANNSALEKVFNHTPDNSDGIGSDL LTTSSQERSANNSLSRGHRPLNIQNSSTTPPLHPEGVTSSNDNSSDTTKS SASLSHRVASQINKFNSNTDSKVLQTDFFSRNGDTYLTRETIFEASKKVT NSLSNLISLIGTKSGTQERELQEKSKDITKSTTEHRINNKLKVTDANTIN YVTETNADTIDKNHAIYEKAKEVSSALSKVLSKIDDTSAELLTDDISDLK NNNDITAENNNIYKAAKDVTTSLSKVLKNINKD >SSO_P090 ipaB, IpaB MHNVSTTTTGLSLAKILASTELGDNTIQAANDAANKLFSLTIADLTANKN INTTNSHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK IKDLENKINQIQTRLSELDPDSPEKKKLSREEIQLTIKKDAAVKDRTLIE QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA EELNRVMGCVGKILGALLTIVSVVAAAFSGGASLALAAVGLALMVTDAIV QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN SATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ EVIADLLASMSNSQANRTDVAKAILQQTTA >SSO_P089 ipaC, IpaC MEIQNTKSTQILYTDISTKQTQSSSETQKSQNYQQIAAHIPLNVGKNPVL TTTLNDDQLLKLSEQVQHDSEIIARLTDKKMKDLSEMSHTLTPENTLDIS SLSSNAVSLIISVAVLLSALRTAETKLGSQLSLIAFDATKSAAENIVRQG LAALSSSITGAVTQVGITGIGAKKTHSGISDQKGALRKNLATAQSLEKEL AGSKLGLNKQIDTNITSPQTNSSTKFLGKNKLAPDNISLSTEHKTSLSSP DISLQDKIDTQRRTYELNTLSAQQKQNIGRATMDTSAVAGNISTSGGRYA SALEEEEQLISQASSKQAEEASQVSKEASQATNQLIQKLLNIIDSITQSR NSTASQIAGNIRA >SSO_P088 ipaD, IpaD MNITTLTNSISTSSFSPNNTNGSSTETVNSDIKTTTSSHPVSSLTMLNDT LHNIRTTNQALKKELSQKTLTKTSLEEIALHSSQISMDVNKSAQLLDILS RNEYPINKDARELLHSAPKEAELDGDQMISHRELWAKIANSINDINEQYL KVYEHAVSSYTQMYQDFSAVLSSLAGWISPGGNDGNSVKLQVNSLKKALE ELKKKYEDKPLYPATNTVSQKEADKWLTELGGTIGKVSKKNGGYVVNINM TPIDNMLKSLNNLGGNGEVVLDNAKYQAWNAGFSAEDETMKNNLQTLVQK YSNANSIFDNLVKVLSSTISSCTDTDKLFLHF >SSO_P212 ipaH1.4, invasion plasmid antigen MIKSTNIQAIGSSIMHQINNIYSLTPFSLPMELTPSCNEFYLKAWSEWEK NGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQITTLEIRK NLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLP ENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALAN NFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAGNPLSGHT MRTLQQITTGPDYSGPRIFFSMGNSATISAPEHSLADAVTAWFPENKQSD VSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSAS AELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAV KEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEA DRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETE QQIYRQLTDEVLALRLSENGSNHIA >SSO_P059 ipaH4.5, invasion plasmid antigen MKPINNHSFFRSLCGLSCISRLSVEEQCTRDYHRIWDDWAREGTTTENRI QAVRLLKICLDTREPVLNLSLLKLRSLPPLPLHIRELNISNNELISLPEN SPLLTELHVNGNNLNILPTLPSQLIKLNISFNRNLSCLPSLPPYLQSLSA RFNSLETLPELPSTLTILRIEGNRLTVLPELPHRLQELFVSGNRLQELPE FPQRLKYLKVGENQLRRLSRLPQELLALDVSNNLLTSLPENIITLPICTN VNISGNPLSTRVLQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAV TAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQ VAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQAS EGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTM LAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWG PWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAER EAGAQVMRETEQQIYRQLTDEVLA >SSO_P058 ipaH7.8, invasion plasmid antigen MFSVNNTHSSVSCSPSINSNSTSNEYYLRILTEWEKNSSPGEERGIAFNR LSQCFQNQEAVLNLSDLNLTSLPELPKHISALIVENNKLTSLPKLPAFLK ELNADNNRLSVIPELPESLTTLSVRSNQLENLPVLPNHLTSLFVENNRLY NLPALPEKLKFLHVYYNRLTTLPDLPDKLEILCAQRNNLVTFPQFSDRNN IRQKEYYFHFNQITTLPESFSQLDSSYRINISGNPLSTRVLQSLQRLTSS PDYHGPQIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHE EHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVA ADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLE ILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGV TANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKY EMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEV LALRLSENGSRLHHS >SSO_P167 ipaH9.8, invasion plasmid antigen MLPINNNFSLPQNSFYNTISGTYADYFSAWDKWEKQALPGEERDEAVSRL KECLINNSDELRLDRLNLSSLPDNLPAQITLLNVSYNQLTNLPELPVTLK KLYSASNKLSELPVLPPALESLQVQHNELENLPALPDSLLTMNISYNEIV SLPSLPQALKNLRATRNFLTELPAFSEGNNPVVREYFFDRNQISHIPESI LNLRNECSIHISDNPLSSHALQALQRLTSSPDYHGPRIYFSMSDGQQNTL HRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSAR NTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRK TLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIE VYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEF TDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGL SGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLSENGSQLHHS >SSO_P084 ipaJ, IpaJ MSEQRKPCKRGCIHTGVMLYGVLLQGAIPREYMISHQTDVRVNENRVNEQ GCFLARKQMYDNSCGAASLLCAAKELGVDKIPQYKGSMSEMTRKSSLDLD NRCERDLYLITSGNYNPRIHKDNIADAGYSMPDKIVMATRLLGLNAYVVE ESNIFSQVISFIYPDARDLLIGMGCNIVHQRDVLSSNQRVLEAVAVSFIG VPVGLHWVLCRPDGSYMDPAVGENYSCFSTMELGARRSNSNFIGYTKIGI SIVITNEAL >SSO_P093 ipgA, IpgA MCRKLYDKLYEITGAKLDFNDKNQAFILLEEQIPVCITDNDEYIFLTGLL NEHELFTENIINPEHILILNYSLSRDYGSSICLLPDTHQCVLTKKHYKKY LSPDELIESLYEFLFCIKLTIANITSEVN >SSO_P092 ipgB1, IpgB1 MQILNKILPQVEFAIPRPSFNSLSYNKLVKKILSVFNLKQRFPQKNFGCP VNINKIRDNVIDKIKDSNSGNQLFCWMSQERTSYVSSMINRSIDEMAIHN GVVLTSDNKKNIFAAIEKKFPDIKLDEKSAQTSISHTALNEIASSGLRAK ILKRYSSNMDLFNTQMKDLTNLVSSSVYDKIFNESTKVLQIEISAEVLKA VYRQSNTN >SSO_P024 ipgB2, IpgB2 MLGTSFNNFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSILSS VSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHSREIGDNLRKQIFK QVEKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKNDTTS NVANLISDQFFEKNVQYIDLKKLRGNMSDYITNLESPF >SSO_P091 ipgC, IpgC MSLNITENESISTAVIDAINSGATLKDINAIPDDMMDDIYSYAYDFYNKG RIEEAEVFFRFLCIYDFYNVDYIMGLAAIYQIKEQFQQAADLYAVAFALG KNDYTPVFHTGQCQLRLKAPLKAKECFELVIQHSNDEKLKIKAQSYLDAI QDIKE >SSO_P095 ipgD, IpgD MHITNLGLHQVSFQSGDSYKGAEETGKHKGVSVISYQRVKNGERNKGIEA LNRLYLQNQTSLTGKSLLFARDRAEVFYEAIKLAGGDTSKIKAMMERLDT YKLGEVNKRHINELNKVISEEIRAQLGIKNKKELQTKIKQIFTDYLNNKN WGPVNKNISHHGKNYGFQLTPASHMKIGNKNIFVKEYNGKGICCASTRES DHIANMWLSKVVDDEGKEIFSGIRHGVISAYGLKKNSSERAVAARNKAEE LVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVN ALKGLNSKRGEPTKLLIRNSDGLLKEVSVNLKVVTFNFGVNELALKMGLG WRNVDKLNDESICSLLGDNFLKNGVIGGWAAEAIEKNPPCKNDVIYLANQ IKEIINKKLQKNDNGEPYKLSQRMTLLAYTIGAVPCWNCKSGKDRTGMQD AEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGV PGNKVMKKLPLSSLELSYSERIGDSKIWNMVKGYSSFV >SSO_P096 ipgE, IpgE MEDLADVICRALGIPLIDIDDQAIMLDDDVLIYIEKEGDSINLLCPFCAL PENINDLIYALSLNYSEKICLATDDEGGNLIARLDLTGINEFEDVYVNTE YYISRVRWLKDEFARRMKGY >SSO_P097 ipgF, IpgF MSRFVFILLCFIPYLGRADCWDKAGERYNIPSSLLKAIAEKESGFNKFAV NVNNNGSKDYGIMQINDFHSKRLREMGYSEEMLISHPCLSVHYAAKLLNE FMMMYGRGWEAVGAYNAGTSPKKKKERLKYAEDIYRRYLRIAAESKQNNR RI >SSO_P009 mkaD, mouse killing factor MPIKKPCLKLNLDSLNVVRSEIPQMLSANERLKNNFNILYNQIRQYPAYY FKVASNVPNYSDICQFFSVMYQGFQIVNHSGDVFIHACRENPQSKGDFVG DKFHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIG AQFTLYVKSDQECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDV RPEDWKYVSYRNELRSDRDGSERQEQMLREEPFYRLMIE >SSO_P171 mob9, plasmid mobilization protein MSLAGNPCVIRLAAQVCMWLKFIIRDRGGFSGGLLLFLPVCCRDRTERIL AVHTIKILR >SSO_P182 msbB2, MsbB2 MKKYKSEFIPEFKKNYLSPVYWFTWFVLGMIAGISMFPPSFRDPVLAKIG RWVGRLSRKARRRATINLSLCFPEKSDTEREIIVDNMFATALQSIVMMAE LAIRGPEKFQKRVFWKGLEILEEIRHNNRNVIFLVPHGWSVDIPAMLLAA QGEKMAAMFHQQRNPVIDYVWNSVRRKFGGRLHSREDGIKPFIQSVRQGY WGYYLPDQDHGPEYSEFADFFATYKATLPIIGRLMNISQAMIIPLFPVYD EKKHFLTIEVRPPMDACIASADNKMIARQMNKTVEILVGSHPEQYIWVLK LLKTRKSNEADPYP >SSO_P109 mxiA, MxiA MKVIQSFLKQVSTKPELIILVLMVMIIAMLIIPLPTYLVDFLIGLNIVLA ILVFMGSFYIERILSFSTFPSVLLITTLFRLALSISTSRLILVDADAGKI ITTFGQFVIGDSLAVGFVIFSIVTVVQFIVITKGSERVAEVAARFSLDGM PGKQMSIDADLKAGIIDAAGAKERRSILERESQLYGSFDGAMKFIKGDAI AGIIIIFVNLIGGISVGMSQHGMSLSGALSTYTILTIGDGLVSQIPALLI SISAGFIVTRVNGDSDNMGRNIMSQIFGNPFVLIVTSALALAIGMLPGFP FFVFFLIAVTLTALFYYKKVVEKEKSLSESDSSGYTGTFDIDNSHDSSLA MIENLDAISSETVPLILLFAENKINANDMEGLIERIRSQFFIDYGVRLPT ILYRTSNELKVDDIVLLINEVRADSFNIYFDKVCITDENGDIDALGIPVV STSYNERVISWVDVSYTENLTNIDAKIKSAQDEFYHQLSQALLNNINEIF GIQETKNMLDQFENRYPDLLKEVFRHVTIQRISEVLQRLLGENISVRNLK LIMESLALWAPREKDVITLVEHVRASLSRYICSKIAVSGEIKVVMLSGYI EDAIRKGIRQTSGGSFLNMDIEVSDEVMETLAHALRELRNAKKNFVLLVS VDIRRFVKRLIDNRFKSILVISYAEIDEAYTINVLKTI >SSO_P108 mxiC, MxiC MLDVKNTGVFSSAFIDKLNAMTNSDDGDETADAELDSGLANSKYIDSSDE MASALSSFINRRDLEKLKGTNSDSQERILDGEEDEINHKIFDLKRTLKDN LPLDRDFIDRLKRYFKDPSDQVLALRELLNEKDLTAEQVESLTKIINEII SGSEKSVNAGINSAIQAKLFGNKMKLEPQLLRACYRGFIMGNISTTDQYI EWLGNFGFNHRHTIVNFVEQSLIVDMDSEKPSCNAYEFGFVLSKLIAIKM IRTSDVIFMKKLESSSLLKDGSLSAEQLLLTLLYIFQYPSESEQILTSVI EVSRASHEDSVVYQTYLSSVNESPHDIFKSESEREIAINILRELVTSAYK KELSR >SSO_P107 mxiD, MxiD MKKFNIKSLTLLIVLLPLIVNANNIDSHLLEQNDIAKYVAQSDTVGSFFE RFSALLNYPIVVSKQAAKKRISGEFDLSNPEEMLEKLTLLVGLIWYKDGN ALYIYDSGELISKVILLENISLNYLIQYLKDANLYDHRYPIRGNISDKTF YISGPPALVELVANTATLLDKQVSSIGTDKVNFGVIKLKNTFVSDRTYNM RGEDIVIPGVATVVERLLNNGKALSNRQAQNDPMPPFNITQKVSEDSNDF SFSSVTNSSILEDVSLIAYPETNSILVKGNDQQIQIIRDIITQLDVAKRH IELSLWIIDIDKSELNNLGVNWQGTASFGDSFGASFNMSSSASISTLDGN KFIASVMALNQKKKANVVSRPVILTQENIPAIFDNNRTFYVSLVGERNSS LEHVTYGTLINVIPRFSSRGQIEMSLTIEDGTGNSQSNYNYNNENTSVLP EVGRTKISTIARVPQGKSLLIGGYTHETNSNEIVSIPFLSSIPVIGNVFK YKTSNISNIVRVFLIQPREIKESSYYNTAEYKSLISEREIQKTTQIIPSE TTLLEDEKSLVSYLNY >SSO_P106 mxiE, MxiE MEGFFFVRNQNIKFSDNVNYHYRFNINSCAKFLAFWDYFSGALVEHSHAE KCIHFYHENDLRDSCNTESMLDKLMLRFIFSSDQNVSNALAMIRMTESYH LVLYLLRTIEKEKEVRIKSLTEHYGVSEAYFRSLCRKALGAKVKEQLNTW RLVNGLLDVFLHNQTITSAAMNNGYASTSHFSNEIKTRLGFSARELSNIT FLVKKINEKI >SSO_P098 mxiG, MxiG MSEAKNSNLAPFRLLVKLTNGVGDEFPLYYGNNLIVLGRTIETLEFGNDN FPENIIPVTDSKSDGIIYLTISKDNICQFSDEKGEQIDINSQFNSFEYDG ISFHLKNMREDKSRGHILNGMYKNHSVFFFFAVIVVLIIIFSLSLKKDEV KEIAEIIDDKRYGIVNTGQCNYILAETQNDAVWASVALNKTGFTKCRYIL VSNKEINRIQQYINQRFPFINLYVLNLVSDKAELLVFLSKERNSSKDTEL DKLKNALIVEFPYIKNIKFNYLSDHNARGDAKGIFTKVNVQYKEICENNK VTYSVREELTDEKLELINRLISEHKNIYGDQYIEFSVLLIDDDFKGKSYL NSKDSYVMLNDKHWFFLDKNK >SSO_P099 mxiH, MxiH MSVTVPNDDWTLSSLSETFDDGTQTLQGELTLALDKLAKNPSNPQLLAEY QSKLSEYTLYRNAQSNTVKVIKDVDAAIIQNFR >SSO_P100 mxiI, MxiI MNYIYPVNQVDIIKASDFQSQEISSLEDVVSAKYSDIKMDTDIQVSQIME MVSNPESLNPESLAKLQTTLSNYSIGVSLAGTLARKTVSAVETLLKS >SSO_P101 mxiJ, MxiJ MIRYKGFILFLLLMLIGCEQREELISNLSQRQANEIISVLERHNITARKV DGGKQGISVQVEKGTFASAVDLMRMYDLPNPERVDISQMFPTDSLVSSPR AEKARLYSAIEQRLEQSLVSIGGVISAKIHVSYDLEEKNISSKPMHISVI AIYDSPKESELLVSNIKRFLKNTFSDVKYENISVILTPKEEYVYTNVQPV KEIKSEFLTNEVIYLFLGMAVLVVILLVWAFKTGWFKRNKI >SSO_P102 mxiK, MxiK MIRMDGIYKKYLSIIFDPAFYINRNRLNLPSELLENGVIRSEINNLIINK YDLNCDIEPLSGVTAMFVANWNLLPAVAYFIGSQESRLINHSEMVISYYG GKISKQGEAAIRSGFWHLIAWKENISVGIYERINLLFNPIALEGNYTPVE RNLSRLNEGMQYAKRHFTGIQTSCL >SSO_P104 mxiL, MxiL MINQINASNALQQRLNSEEFVNLNERLSSSQSFDEDIIYEIMQYFSQSEL NSIDNDELHNKIEQLFNSRFPYLTAAQKSSLLNKLIDANQYVDLHEGFYA SLSIYNNIDFYIKTTTFDSLISVFEAGREADDSTW >SSO_P105 mxiM, MxiM MIRHGSNKLKIFILSILLLTLSGCALKSSSNSEKEWHIVPVSKDYFSIPN DLLWSFNTTNKSINVYSKCISGKAVYSFNAGKFMGNFNVKEVDGCFMDAQ KIAIDKLFSMLKDGVVLKGNKINDTILIEKDGEVKLKLIRGI >SSO_P103 mxiN, MxiN MKVCNMQKGTLPVSRHHAYDGVVIKRIEKELCKTIKDRDTESKKKAICVI KDATKKAESLRIDAVCDGYQIGIQTAFEHIIDYICEWKLKQNENRRNIED YITSLLSENLHDERIISTLLEQWLSSLRNTVTELKVVLPKCNLALRKKLE LDLHKYRSDVKIILKYSEGNNYIFCSGNQVVEFSPQDVISGVKIELAEKL TKNDKKYFKELAHKKLRQIAEDLLKENPVND >SSO_P003 ospB, OspB MNLDGVRPYCRIVNKKNESISDIAFAHIIKRVKNSSCTHPKAALVFLGEK GFCDSNDVLSIMGQQIPRVFKNKMLYDYVFKNEKSKNDFLKMAESWLPQS EPIVINNDDDALNAAAYFSVKKAKIKTVNDTDFKEYNKVYILGHGSPGSH QLGLGSELIDVQTIISRMKDCGILNVKDIRFTSCGSADKVAPKNFNNAPA ESLSCILNSLPFFKEKESLLEQIKKHLENDESLSDGLKISGYHGYGVHYG QELFPYSHYRSTSIPADPEHTVKRSSQKKTFIINKELD >SSO_P049 ospC1, OspC1 MNISETLNSANTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDEKNLKDS ANIIKDFFRKTIAAQSYSRMFSQGSNFKSLNIAIDAPSDAKASFKAIEHL DRLSKHYISEIREKLHPLSAEELNLLSLIINSDLIFRHQSNSDLSDKILN IKSFNKIQSEGICTKRNTYADDIKKIANHDFVFFGVEISSHQKKHPLNTK HHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFLDKFSEV NKEVSRYVHGSKGIIDVPIFNTKDMKLGLGLYLIDFIRKSEDQSFKEFCY GKNLAPVDLDRIINFVFQPEYHIPRMVSTENFKKVKIREISLEEAVTASN YEEINKQVTNKKIALQALFLSIANQKEDVALYILSNFEITRQDVISIKHE LYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYE NAEMIKLLLKYGATSDNKYI >SSO_P065 ospC2, OspC2 MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR SLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLS NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIRLRDISLED AIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSKFNFTKQDV AEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFINNGLVDVNKRFQKAN SGDTMLDNAMKSKDSKMIDFLLKNGAVSGKRFGR >SSO_P072 ospC3, OspC3 MKIPEAVNHINVQNNIDLVDGKTNPNKATKALQKNILRVTNSSSSGISEK HLDHCANTVKNFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR SLEHLDKVSRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLS NNILNIKSFDKIQSENIQTHKNTYSEDIKEISNHDFVFFGVEISNHQEKL PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQG FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLED AIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV AEMEKMNNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKAN SGDTMLDNAMKSKDSKMIDFLLKNGAILGKRFEI >SSO_P022 ospD1, OspD1 MSINNYGLHPANNKNMHLIIGSNTANENKGMKSNIINVTNSAISHAINEE KSGGGYSGVSFRKLAKIQSISIPTKNNKEYNRHNLFSLIWHGNADAARKY GESLLAAEIPKEEKLEVLAARNNAGESALFIALQEGHSAAIQAYGDFIKT FDLSPKETIKLLDVRDNEGLPGLFLAAGKGNIEAMMAYINICHHSGIKLT EIADRLNNNEQDMFNIISDKIQELF >SSO_P008 ospD2, OspD2 MPLNKTFSSSIFSTKNSLSTDMSVNRDNRTITSSIMRVSNSSELIQFKNK TAPYFSEKRNVKVNINGVAKDIYGRQIVCRHLASYWEMNFMETNGKVNYQ LLSTPDAIAKNVCLEKTEDFSKSPAYIYFVENKKWGTVITNFFYNMKKNG DFVRTLSACTLNHQMALGLKIKRVQESEKWVVQFFDPNRTVTHKRTVFTC DSHFELSQLSAKDFFDDFYWKIYGLEQPGQVIFEDRHNSPLTNTVKLLPD ELINSRVIYHAITKNLTEVLFILMEKYKNGEISQSKLVNLLATRSSDGTP AFYIALQNGCSDIIQVYGKILNMCNLSQETILTLLAAVGANNVPGLCMSF MNGHVDTIKAYGEIVFKTPLTSDKRLYLLAAKDSHDLPGLFFALQNGHAD SIRMFGSLLNKKMLSSEQIKELLKVKHGLFMALQNGHTKAIMAYGDILKI LPPHQEYIDELLWIKNPNGTSGLFMAFYNGHTETIRAFCNILKNYSFTTR RLVEMLSATNKDGIPGVFVSVVNRDKETILEYCRIIKENNLEPDTIAEQF SKKMKKTFIEIINRFNHFL >SSO_P050 ospD3, OspD3 MPSVNLIPSRKICLQNMINKDNVSVETIQSLLHSKQLPYFSDKRSFLLNL NCQVTDHSGRLIVCRHLASYWIAQFNKSSGHVDYHHFAFPDEIKNYVSVS EEEKAINVPAIIYFVENGSWGDIIFYIFNEMIFHSEKSRALEISTSNHNM ALGLKIKETKNGGDFVIQLYDPNHTATHLRAEFNKFNLAKIKKLTVDNFL DEKHQKCYGLISDGMSIFVDRHTPTSMSSIIRWPNNLLHPKVIYHAMRMG LTELIQKVTRVVQLSDLSDNTLELLLAAKNDDGLSGLLLALQNGHSDTIL AYGELLETSGLNLDKTVELLTAEGMGGRISGLSQALQNGHAETIKTYGRL LKKRAINIEYNKLKNLLTAYYYDEVHRQIPGLMFALQNGHADAIRAYGEL ILSPPLLNSEDIVNLLASRRYDNVPGLLLALNNGQADAILAYGDILNEAK LNLDKKAELLEAKDSNGLSGLFVALHNGCVETIIAYGKILHTADLTPHQA SKLLAAEGPNGVSGLIIAFQNRNFEAIKTYMEIIKNENITPEEIAEHLDK KNGSDFLEIMKNIKS >SSO_P213 ospE2, truncated OspE2 MDILFLIKKTAIYELQIPATNRTKRLKFTATEIQWLTKINEAGIDKKQSQ RYSDF >SSO_P044 ospE2, OspE2 MLTQTIFPCLPQKQENIILEVSNPVLLSSTVTTDGYTVFNKKAAIYELQI PAANRTKTLKFTATEMQWLTKINEVGIVEKQSQRHSNI >SSO_P170 ospG, OspG MKITSTIIQTPFPFENNNSHAGIVTEPILGKLIGQGSTAEIFEDVNDSSA LYKKYDLIGNQYNEILEMAWQESELFNAFYGDEASVVIQYGGDVYLRMLR VPGTPLSDIDTADIPDNIESLYLQLICKLNELSIIHYDLNTGNMLYDKES ESLFPIDFRNIYAEYYAATKKDKEIIDRRLQMRTNDFYSLLNRKYL >SSO_P031 parA, plasmid segregation protein MTSFEQLSKVAQRADKMLLALTKQIQEQKQEFQADVFYQVYSKSAVAKLP KLTRASVDGAVGEMEAQGYQFEKRPAGTATKYALTIQNIIDIYAHRGIPK YRDRYSEAYSIFIGSLKGGVSKTVSSVSVAHALRAHPHLLSEDLRILLLD LDPQSSATMFLNYLHAVGLVDTTAPQAMLQNVSREELLEDFIVPSVIPGV YVMPASIDDAFIASNWDTLCEEHLLGQNKHAILRENIIDKLKHDFDFILI DTGPHLDAFLKNAIAAADIMFTPVPPAQVDFHSTLKYLARLPELVQIIEQ DGCSCRLQANIGFMSKLANKSDHKYCHSLTKEIFGGDMLDVSMPRLDGFE RSGESFDTVISANPVTYVGSGEALKNARMAAEDFAKAVFDRIEFIRANY >SSO_P032 parB, plasmid segregation protein MENRKHRPTIGRTLNTNILNNTEEISAPVHVFTLNTGRKAKFTEIKVDHD KVDTQTFVVEEVNGREQTALTPDSLKDITRTIHLQQFYPCIGIRTGDLIE ILDGSRRRAAALLCKVGLRVLVTDDELTVSEAQHLAKDLQTSLEHNIREI GLRLVRLKEAGMNQKQIAEREGLSAAKVTRALQAASVPKDFVSLFPVQSE LTYADYRQLAELSERLRLGDISIDEVVKNISPSIELITADDNLSEDEVKN SIMRLITKEMSSLLDSGVKDKAVVTLLWKFDSKDKFARKRVKGRTFSYEF GRLPLEVQDKLDRMIALVLKDNLNSL >SSO_P004 phoN2/apy, PhoN2 (Apy) MKTKNFLLFCIATNMIFIPSANALKAEGFLTQQTSPDSLSILPPPPAEDS VVFQADKAHYEFGRSLRDANRVRLASEDAYYENFGLAFSDAYGMDISREN TPILYQLLTQVLQDSHDYAVRNAKEYYKRVRPFVIYKDATCTPDKDENMA ITGSYPSGHASFGWAVALILAEINPQRKAEILRRGYEFGESRVICGAHWQ SDVEAGRLMGASVVAVLHNTPEFTKSLSEAKKEFEELNTPTNELTP >SSO_P205 repA, RepA MTDLQQTYYRQVKNPNPVFTPREGARTLPFCGKLMEKAVGFTSRFDFAIH VAHARSLGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIE CGLATESDAGTLSITRATRALTFLAELGLITYQTEYDPLIGCNIPTDITF TPALFAALDVSEEAVASARRSRVEWENRQRKKQGLDTLGMDELMAKAWRF VRERFRSYQTELKSRGMKRARARRDADRQRQDIVTLVKRQLTREISEGRF TASREAVKREVERRVKERMILSRNRNYSRLATASP >SSO_P204 repB, RepB MSQTENAVTSSLSQKRFVRRGKPMTDSEKQMAAVARKRLTHKEIKVFVKN PLKDLMVEYCEREGITQAQFVEKIIKDELQRLDILK >SSO_P112 spa13, Spa13 MEALDKRIIYFLQLENDLEPVGAQSVSQLFNTRRKIAIVKKHIIQYQSER ILLKGRIEEIQKDIDEANASKRKLLHKESKICKRIGLIKRNNFAKQLILD ELSQEDMKYGIR >SSO_P110 spa15, Spa15 MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNE QVMLWANFDAPSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQL RVVIKDDYVHDGIVFAEILHEFYQRMEILNEVL >SSO_P115 spa24, Spa24 MLSDMSLIATLSFFTLLPFLVAAGTCYIKFSIVFVMVRNALGLQQVPSNM TLNGIALIMALFVMKPIIEAGYENYLNGPQKFDTISDIVRFSDSGLMEYK QYLKKHTDLELARFFQRSEEENADLKSAENNDYSLFSLLPAYALSEIKDA FKIGFYLYLPFVVVDLVISSILLALGMMMMSPITISVPIKLVLFVALDGW GILSKALIEQYINVPA >SSO_P116 spa29, Spa29 MDISSWFESIHVFLILLNGVFFRLAPLFFFLPFLNNGIISPSIRIPVIFL VASGLITSGKVDIGSSVFEHVYFLMFKEIIVGLLLSFCLSLPFWIFHAVG SIIDNQRGATLSSSIDPANGVDTSELAKFFNLFSAVVFLYSGGMVFILES IQLSYNICPLFSQCSFRVSNILTFLTLLASQAVILASPVMIVLLLSEVLL GVLSRFAPQMNAFSVSLTIKSLLAIFIIFICSSTIYFSKVQFFLGEHKFF TNLFVR >SSO_P113 spa32, Spa32 MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDIPESMAIKENILIP DQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNR NGQIGLKNHSLSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSL QEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC >SSO_P114 spa33, Spa33 MLRIKHFDANEKLQILYAKQLCERFSIQTFKNKFTGSESLVTLTSVCGDW VIRIDTLSFLKKKYEVFSGFSTQESLLHLSKCVFIESSSVFSIPELSDKI TFRITNEIQYATTGSHLCCFSSSLGIIYFDKMPVLRNQVSLDLLHHLLEF CLGSSNVRLATLKRIRTGDIIIVQKLYNLLLCNQVIIGDYIVNDNNEAKI NLSESNGESEHTEVSLALFNYDDINVKVDFILLEKNMTINELKMYVENEL FKFPDDIVKHVNIKVNGSLVGHGELVSIEDGYGIEISSWMVKE >SSO_P117 spa40, Spa40 MANKTEKPTPKKLKDAAKKGQSFKFKDLTTVVIILVGTFTIISFFSLSDV MLLYRYVIINDFEINEGKYFFAVVIVFFKIIGFPLFFCVLSAVLPTLVQT KFVLATKAIKIDFSVLNPVKGLKKIFSIKTIKEFFKSILLLIILALTTYF FWINDRKIIFSQVFSSVDGLYLIWGGLFKDIILFFLAFSIFVIILDFVIE FILYMKDMMMDKQEIKREYIEQEGHFETKSRRRELHIEILSEQTKSDIRN SKLVVMNPTHIAIGIYFNPEIAPAPFISLIETNQCALAVRKYANEVGIPT VRDVKLARKLYKTHTKYSFVDFEHLDEVLRLIVWLEQVENTH >SSO_P111 spa47, Spa47 MSYTKLLTQLSFPNRISGPILETSLSDVSIGEICNIQAGIESNEIVARAQ VVGFHDEKTILSLIGNSRGLSRQTLIKPTAQFLHTQVGRGLLGAVVNPLG EVTDKFAVTDNSEILYRPVDNAPPLYSERAAIEKPFLTGIKVIDSLLTCG EGQRMGIFASAGCGKTFLMNMLIEHSGADIYVIGLIGERGREVTETVDYL KNSEKKNRCVLVYATSDYSSVDRCNAAYIATAIAEFFRTEGHKVALFIDS LTRYARALRDVALAAGESPARRGYPVSVFDSLPRLLERPGKLKAGGSITA FYTVLLEDDDFADPLAEEVRSILDGHIYLSRNLAQKGQFPAIDSLKSISR VFTQVVDEKHRIMAAAFRELLSEIEELRTIIDFGEYKPGENASQDKIYNK ISVVESFLKQDYRLGFTYEQTMELIGETIR >SSO_P190 traD, TraD protein MFSQIANIMLYCLFIFFWILVGLVLWVKISWQTFVNGCIYWWCTTLEGMR DLIKSQPVYEIQYYGKTFRMNAAQVLHDKYMIWCGEQLWSAFVLATVVAL VICLITFFVVSWILGRQGKQQSENEVTGGRQLTDNPKDVARMLKKDGRDS DIRIGDLPIIRDSEIQNFCLHGTVGAGKSEVIRRLANYARQRGDMVVIYD RSGEFVKSYYDPSIDKILNPLDARCAAWDLWKECLTQPDFDNTANTLIPM GTKEDPFWQGSGRTIFAEAAYLMRNDPNRSYSKLVDTLLSIKIEKLRTYL RNSPAANLVEEKIEKTAISIRAVLTNYVKAIRYLQGIEHNGEPFTIRDWM RGVREDQKNGWLFISSNADTHASLKPVISMWLSIAIRGLLAMGENRNRRV WFFCDELPTLHKLPDLVEILPEARKFGGCYVFGIQSYAQLEDIYGEKAAA TLFDVMNTRAFFRSPSHKIAEFAVGEIGEKEHLKASEQYSYGADPVRDGV STGKDMERQTLVSYSDIQSLPDLTCYVTLPGPYPAVKLSLKYQERPKVAP EFIPRDINPEMENRLSAVLAAREAEGRQMASLFEPDVPEVVSGEDVTQAE QPQQPQQPQQPQQPQQPQQPQQPQQPQQPQQPQQPQQPQQPVSPVINDKK SDSGVNVPAGGIEQELKMKPEEEMEQQLPPGISESGEVVDMAAYEAWQQE NHPDTWQQMQRREEVNINVHRERGEDVEPGDDF >SSO_P191 trbH, hypothetical protein MNRSAPVFSSQAAHTFKFPGVISHNNQPPTAGMTCDHLIKWPDRASLTGK FCSYLAGVCGCSSVVIQNINAGNKSLDHSEITFRHLAFFCTIYQLHQGDR TDTHSPLVQVKTLPDAGGFVLYRKNADVGIEHKLQHQNDSLSCMPGCSLL SIKSALTLCPSNHSSHVSPAGVMILVRPTAITSTRFTFSGNATAFGSLTA WLRLLRNTVVSIICLLMWICLVYIYCGIDAGICQRDIRL >SSO_P147 ushA, UshA MIPLKKNITLIMFTLSLLTGNPAIAYETDKVYKITVLHTNDHHGHFWRNN HGEYGLSSQKTLVDNIRQKVINNGGSVLLLSGGDINTGVPESDLQKAEPD IRGMNLIGYDAMAVGNHEFDNPLNILRQQEKWATFPFLSANIYQKSTGRR LFSPWKIFIRQNLKIAVIGLTTDDTAKTGNSEYFTDIEFRQPAAEARSVI DELNQQEKPDIIIAATHMGHYDNGESGSNAPGDVEMARSLPTGSLAMIVG GHSQAPVCMASDNKKQWNYIPGTTCVPDKQNGIWIVQAHEWGKYVGQADF EFCNGTMKLVNYQLHPVNLKMRITREDGKTEFSFYTPEITEDPQMLSLLT PFQNKGKAQLDVKVGVVNGRLEGDRSKVRFVQTSMGHLILSALTERIDAD FAVVSGGEIRDSIESGNITYKDILKVQPFGNTVVSIDLTGKEVADYLATV AQMKPDSGAYPQFLNTSFVVKKGKIEMLKIKGKSVDLNKKYRMTTFSFNA TGGDGYPRIDNRPGYINTGFIDAEVLIEYIREHSPLDAASYEPKGEVSWQ >SSO_P142 virA, VirA MQTSNITNHERNDSSWMSTVKSTTEVSWNKLSFCDVLLKIITFGIYSPHE TLAEKYSEKKLMDSFSPSLSQDKMDGEFAHANIDGISIRLCLNKGICSVF YLDGDKIQSTQLSSKEYNNLLSSLPPKQFNLGKVHTITAPVSGNFKTHKP APEVIETAINCCTSIIPNDDYFPVKDTDFNSVWHDIYRDIRASNSNSTKI YFNNIEIPLKLIADLINELGINEFIDSKKELQMLSYNQVNKIINSNFPQQ DLCFQTEKLLFTSLFQDPAFISALTSAFWQSLHITSSSVEHIYAQIMSEN IENRLNFMPEQRVINNCGHIIKINAVVPKNDTAISASGGRAYEVSSSILP SHITCNGVGINKIETSYLVHAGTLPSSEGLRNAIPPESRQVSFAIISPDV >SSO_P085 virB, VirB MVDLCNDLLSIKEGQKKEFTLHSGNKVSFIKAKIPHKRIQDLTFVNQKTN VRDQESLTEESLADIIKTIKLQQFFPVIGREIDGRIEILDGTRRRASAIY AGADLEVLYSKEYISTLDARKLANDIQTAKEHSIRELGIGLNFLKVSGMS YKDIAKKENLSRAKVTRAFQAASVPQEIISLFPIASELNFNDYKILFNYY KGLEKANESLSSTLPILKEEIKDLDTNLPPDIYKKEILNIIKKSKNRKQN PSLKVDSLFISKDKRTYIKRKENKTNRTLIFTLSKINKTVQREIDEAIRD IISRHLSSS >SSO_P041 virF, VirF MMDMGHKNKIDIKVRLHNYIILYAKRCSMTVSSGNETLTIDEGQIAFIER NIQINVSIKKSDSINPFEIISLDRNLLLSIIRIMEPIYSFQHSYSEEKRG LNKKIFLLSEEEVSIDLFKSIKEMPFGKRKIYSLACLLSAVSDEEALYTS ISIASSLSFSDQIRKIVEKNIEKRWRLSDISNNLNLSEIAVRKRLESEKL TFQQILLDIRMHHAAKLLLNSQSYINDVSRLIGISSPSYFIRKFNEYYGI TPKKFYLYHKKF >SSO_P181 virK, VirK MFSVSNLSFIGFLKRIVFSSDSLPGKWEHRKFRFMYILRCAINPVASIRY YYELRSLQCIEDILAIQPTLPARIHRPYLHKGGRAWSRGQYILEHYRFVQ NLPEKYSEFLFPQKSVSLVQFIGKDGEDFDIQCSPSGFDREGELMLSLFF NKIVIARLTFSVILTQNGHTAFIGGLQGAPKNTGPDVIRCATRACYGLFP KRIIFEAFCALMKACNVSECLAVSEHSHVFRQLRYWYQKRKTFVAVYSDF WESVAGKTCGDWYKLPTQVVRKPLSNIASKKRSEYRKRYALLDYIHETAI RSLDAYPVNSEHQDLN >SSO_P224 wbgT, putative UDP-glucose 6-dehydrogenase MFYVHFMMDKKMKFDTLNAKIGIIGLGYVGLPLAVEFGKKVTTIGFDINK SRIDELRNGHDSTLECSNLELLEATKLTYACSLDALKECNVFIVTVPTPI DKHKQPDLTPLIKASETLGKIIKKGDVIIYESTVYPGATEEDCIPVVEKV SGLKFNIDFFAGYSPERINPGDKEHRVTNILKVTSGSTPDVAEYVDQLYK LIITVGTHKASSIKVAEAAKVIENTQRDVNIALINELSIIFNKLGIDTLE VLEAAGTKWNFLPFRPGLVGGHCIGVDPYYLTHKAQSVGYHPEMILAGRR LNDSMGQYVVSQLVKKMLKQRIQVEGANVLVMGLTFKENCPDLRNTKVID IISELKEYNINIDIIDPWCSTDEAQHEYGLTLCEDPKVNHYDAIIIAVAH NEFREMGESAIRALGKDEHVLFDLKYVLDKKSIDMRL >SSO_P225 wbgU, putative UDP-glucose 4-epimerase MDIYMSRYEEITQQLIFSPKTWLITGVAGFIGSNLLEKLLKLNQVVIGLD NFSTGHQYNLDEVKTLVSTEQWSRFCFIEGDIRDLTTCEQVMKGVDHVLH QAALGSVPRSIVDPITTNATNITGFLNILHAAKNAQVQSFTYAASSSTYG DHPALPKVEENIGNPLSPYAVTKYVNEIYAQVYARTYGFKTIGLRYFNVF GRRQDPNGAYAAVIPKWTAAMLKGDDVYINGDGETSRDFCYIDNVIQMNI LSALAKDSAKDNIYNVAVGDRTTLNELSGYIYDELNLIHHIDKLSIKYRE FRSGDVRHSQADVTKAIDLLKYRPNIKIREGLRLSMPWYVRFLKG >SSO_P229 wbgV, putative glycosyltransferase MLLEYVERKISLALSKYPKVRDVIKFFYLYIASLFGIILNKNKTVIQSKI YEISIDDSEESFFGYYDHSPMSSNGRYVLFHSSAFSTKRHPKKVKYISIC VKDLLNNKVYKLYDTRAFNWQQGSRLMWIDDDNIIFNDYENNGYISVVYS LSLMKVIKKINYPIYDVNNYKAVTLDFSWLAKYDSDYGYYNKKSFSTDIS IINLNTGGIELFLSLDEMLKRTNFKCNIDVEHVVNHFMFAPDGRSVMFIH RYYTPKGKRERLIHWNLINDNVRVLINESIISHCCWNGNDEIIGFFGAEI DSLNYYRLSIESCNTEKLFFDARKYSDGHPTIVHNRYIISDTYPDKNRIK KLFVYDLVKNDYRELGLFYESMSFFSYSRCDLHPRISVDNRFLFVDSVHS GKRKLYFMRSGICE >SSO_P230 wbgW, putative glycosyltransferase MSDVLVSLIIVCFNAEKYIEKSLLAFINQDVGLDKFELIIVDGDSSDNTI SIVQDVFSKHSNIKHKIINNKKRTLATGWNIGVLEANGKFVCRVDAHSDI PNNYISKLLDDYFNIMQFDDSVVGVGGVLTNSYKTKFGSIVADFYASKFG VGNSPFRCVDKNNRLKKTDTAVFALYNKDVFFDVGLFNEVLDRNQDIDFH KRVLSNNLSLYTDNSLFVEYYVRDNFKDFIKKGFLDGFWVVMSGAYYFRH IVPLFFVLYLIVSFSLFFATGDYIYLSFLFFYFLISILFSIRDGRSFIGR VFLPFIFLSYHISYGCGSLLSFLKRYFK >SSO_P231 wbgX, putative glycosyltransferase MKNFIPFALPEIGEEEIAEVIDSLRSGWITTGPKAKQFEQEFSNYLGANV QSLAVNSATSGLHLALEAVGVKPGDQVIVPSYTFTATAEIVRYLGADPVI VDVDRKTFNISVDAIEKAITNETKAIIPVHFAGLACDMDSILSIAKKYDL KVVEDAAHAFPTTYKGSKIGTLDSDATVFSFYANKTMTTGEGGMVVSKNK DIIERCKVMRLHGISRDAFDRYQSKTPSWFYEVVAPGFKYNMPDICAAIG IHQLRKIDDFQKKRQRMAKIYDDALKELPLELPEWPTNASDIHAWHLYPI RLKTDSAINRDDFIKKLSDLGIGCSVHFIPLHKQPVWRDTYNLNASDFPV SEECYLNEISIPLYTKMTDQDQLFVIKSIRQLFM >SSO_P232 wbgY, putative glycosyltransferase MKRIFDVIVAGLGLLFLFPVFIIVSMLIVADSKGGVFFRQYRVGRFGKDF RIHKFRTMFIDSEKKGRITVGQDARVTRVGWYLRKYKIDELPQLIDVLSG TMSLVGPRPEVREFIDEYPDDIREKVLSVRPGITDLASIEMVDENEILSS YDDPRRAYIDIILPIKQRYYLDYVANNSVKYDCVIIWKTIIKILSR >SSO_P233 wbgZ, putative epimerase/dehydratase MIDRILELPRIVKRGIIICIDVVMVIFSFWLSYWLRLDEQTAFLSAPMWF AAAILTIFTVFIFIRIGLYRAVLRYVSAKIMLLIPVGILASTLSLVVISY SLSIMLPRTVVGIYFLVLLLLTSGSRLLFRMILNYGVKGSAPVLIYGAGE SGRQLLPALMQAKEYFPVAFVDDNPRLHKAVIHGVTVYPSDKLSYLVDRY GIKKILLAMPSVSKSQRQKVITRLEHLPCEVLSIPGMVDLVEGRAQISNL KKVSIDDLLGRDPVAPDAKLMAENITGKAVMVTGAGGSIGSELCRQIVRY KPAKLVLFELSEYALYAIEKELSALCDKEVLNVPVIPLLGSVQRQNRLQM VMKSFGIQTVYHAAAYKHVPLVEHNVVEGVRNNVFGTLYCAESAIESGVE TFVLISTDKAVRPTNTMGTTKRLAELVLQALSARQSQTRFCMVRFGNVLG SSGSVVPLFEKQIAQGGPVTLTHRDIIRYFMTIPEASQLVIQAGAMGHGG DVFVLDMGDPVKIYDLAKRMIRLSGLSVRDDKNPDGDIAIEVTGLRPGEK LYEELLIGDSVQGTSHPRIMTANEVMLPWQDLSLLLKELDQACHDFDHER IRSLLLQAPAAFNPTDDICDLVWQQKKSLLSQASNVIRL >SSO_P226 wzx, putative repeat unit transporter MIDAGGTFLLKAIFQIGVFVYFTHVSDITTFGIISYVFTVYWFVLNFSDY GFRTKLVKDISDNSYSASELLSRSDGVKTYVFFFIFIIFMFYSYVSDSIS LTLLVYISSAYFVCISSGRFSLLQAVGRFRCELYINIYSTIIYIGCNLFL SLFIEPLYYSAISIFIYSISLLVFSSHKCNVPCFHIKRPSILVYKDFLDA TPFAILVLLNVVLSSIDLFILKEYFSYNSVAIYQVVTRVNTGLIIVFNVI YTVLLPSFSYYLKNSEWGNIRKLQRYISLLVLLLCLCYYFFGIYFVGILF GDEYKVISSATFLIMFMALIKYNFWLINELYLVCSGNQSERVKSYCIGVV ISMAVFFYFIPRYGWSGAVFGSAIATLVIGIFYIISVKKDCGKILHDKYS LMMIFVPIFFYFIINGQQRLLY >SSO_P227 wzy, putative polysaccharide polymerase MLIYLYPVLLLFNILPVFFYGQMNSDLERFFGVPIGYIPDLIFYFFVVLT SIITLRFHVSLWTKKLLFLGIIFLIYISIQMLLLSADISGVVILLSFFSN FIALVLLVSFCIGKDELYLTHSVRNINVVMCFGIICGVVKLFIGYSEDSN FIVYLNRNATAIIVVCFYCVYSYFYRGRKSWYVSSVLYSLFFLFLDSRAG IISFAISLFFVFLQLTKKEKLLISLFFVPLLTLGISFTDIGTRLERMLSS SQVIFSGGNTLTKSQNDYRRVELVFIGVDVLKENYLIGTGLGVANYVKAI DKKFLGSTNFGLAHNFYLSYSAQLGIIGFILLISVFYIMLSPIFKCGGYI GKGCVFALAFYVFFNEYILTPAIYIYISIFLSVVFIRNSK >SSO_P223 wzz, O antigen chain length determinant protein MPKAEDEIDLFELLGTLWKKKWVILCVTLLTTGLAAVYAFTAKEQWTAKT YIQAPRIAELGSYLKFHQAYARILNQPLDTNALANGLFSDLILIAESPDT KVKFLESTEYYKKETNNLSTEQDKKIWLAEQANKGLVITPPKEKGNTSYY IIQASADSAQEAYKLLQGYLKNVNNQAVTLSLDEFGQNVNTLLVNLNKEI IDIDFQRKSEKLDQIAHIQRDLTTAEQAGIIDYRSSKGGFDNAQSSYKFL LGEKLLSAELKATKDAPIIYPFRYYEVKRQIDELEGMLRDNIQAQAYRYQ MKPSEPVIKDKPNKALILILGALPGAMFAIVGTLVYATLKDKTKLD >SSO_P169 yacB, hypothetical protein MAAIEPDERIGYSASSLAGQPYKGRNGRVEGTSGPHKVACNVILCENLL >SSO_P198 yigA, hypothetical protein MNGFRNSSRNGQVWRYQRAGGRAVILEVSGRWMEAAEAWRRAACVAPRTD WQQFARKRAEHCHRRCRGRV