TitleGenColors Logo

Gene list

Applied filters:

Gene type: CDS
Genomic element: pSD1_197

Number of genes found: 223

Free access
Sort by:

 



# Shigella dysenteriae Sd197, Sd197

>SDY_P007 ISEc8 orf, fragment
MAETLGEQYDPVLPSSLRQSSARKPLPASLPRAPRVIRPEEECCPACGGE
LSPLGCDVSEQLELISSAFKVIEKQRPKLACRRCDHIVQAPVPSKPIARS
YAGAGLLAHVVTGKYADHLPLYRQSDLLFHAAI
>SDY_P212 iso-IS1 ORF2
MALICELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCREL
LALLTPFNIGMITSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRI
KRLARKTICFSRSVEIHEKVIGTFIEKHMFY
>SDY_P134 iso-IS1 ORF1
MASVNIHCPRCQSAQVYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPG
IKELITEMAFNGAGVRDTARTLKIGINTVIRTLKNSRQSK
>SDY_P121 oriT nicking and unwinding protein, fragment
MMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGQGAEQLGLQGSVDKD
VFTRLLEGRLPDGADLSRMQDGSNKHRPGYDLTFSAPKSVSMMAMLGGDK
RLIDAHNQAVDFAVRHHPCGGTGLHTGDDGRTVRNGADR
>SDY_P206 putative reverse transcriptase, fragment
MARTRSGRETSRTITAHRLRGNTGRRVIEGDLSSYVDTVHHRLLMKALCR
RISDARFMRLLWKTPC
>SDY_P047 IS1 ORF, fragment
MCGMRSLHFNSVTPSATKDKQVTRKGIFIQHMLYLERNNLPLRTRIKRLA
RKTICFSRSVEIHEKSSAPSLKNTYSTDWKRHPKKYRFFTVNFI
>SDY_P153 IS600 ORF2
MAHIRTRETYGTRRLQTELEDNGIIVGRDRLARLRKELRLHCKQKRKFRA
TTNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHTDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFRIKYYQMTA
>SDY_P030 IS100 ORF2
MDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQ
LQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAA
DLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQV
IAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLDRTYTTHMSFKSKE
KAIDSDRNERPGL
>SDY_P085 hypothetical protein
MNAHWISKKSNILRKNIKLLTKYLFFESQGIPDKVDIVSRLKTYGYSISG
VETDDGYKALVRAFQLHFRQKNYDGIMDAETAAILYALLEKYFPGK
>SDY_P216 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SDY_P024 hypothetical protein
MPGATVADEFDKTLAFLEAIVNADNETTIGEIRSFADALDAVRFNRNKIN
RQLSKPNLASLALEHEVIWLGRSR
>SDY_P092 putative plasmid stable inheritance protein
MPESIPAGYEVLQELDELDSLLIIDLGGTTLDISQVMGKLSGISKIYGDS
SLGVSLVTSAVKDTLSLARTKGSSYLADDIIIHKKDNNYLKQRINDENKI
SIVTEAMNEALRKLEQRVLNTLNEFSSYTHVMVIGGGAELICDTVKKTHR
FVMNVFSKPITLNMI
>SDY_P050 iso-IS1 ORF1
MASVNIHCPRCQSAQVYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPG
IKELITEMAFNGAGVRDTARTLKIGINTVIRTLKNSRQSE
>SDY_P057 iso-IS1 ORF2
MALICELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCREL
LALLTPFNIGMITSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRI
KRLARKTICFSRSVEIHEKVIGTFIEKHMFY
>SDY_P053 putative transposase
MGNKNDVMDARAIWMAVQQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTA
QINALHGTLLEFGETIHKGRAAMEREFPEALERMKERLPPYLITVLENQY
MNRPGNPGD
>SDY_P077 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNLNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SDY_P094 iso-IS1 ORF2
MITSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRIKRLARKTICF
SRSVEIHEKVIGTFIEKHMFY
>SDY_P027 IS3 ORF2
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFCQHCD
SVVLAAAFTRSKQRYGAPRLTDELRAQGYHFNVKTVAASLRRQGLRAKGS
RKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLA
VVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHSDRGSQ
YCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHGEHFIS
REIMWATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA
>SDY_P205 putative IS91 ORF2
MARSAKPRKRKPAPQRSKLLRYVVKLHEDDFFDEEEAEVLRFDNFDDAVE
CCADLNIPFFVDAGNKKLVFWFVRVDDEGYPEIAR
>SDY_P207 IS3 ORF2
MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF
YASGPNQKWAGDITYYYSSPTAGKHGAPGY
>SDY_P116 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDISKKTTAYFAQESLKNTR
>SDY_P130 hypothetical protein
MMKSLVAGKTVTVTYFQRDRYGRILGQVYAPDGMNINQFMVRAGAAWVYE
QYNTDPVLPVLQNEARQQKRGLWSRC
>SDY_P035 IS629 ORF2
MGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTW
QGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHH
SDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIH
RKSWKNRAEVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDD
LAAWVHR
>SDY_P147 IS1294 transposase
MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI
LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC
DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC
AIHTYGRRLNCHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQ
LLLKAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN
TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL
VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPAPVAKVC
YAQMVKQFLSRDPFECVLCGGRMVYRRAIAGRFTRLSVPEGTLGQWVTAA
RKGLNTPGSRTCHERCNSDPHPTPEIRSRG
>SDY_P049 iso-IS1 ORF2
MALICELDEQWSFVGSKARQHWLWYAYNTKTGGGLAYTFGPRTDETCREL
LALLTPFNIRMITSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRI
KRLARKTICFSRSVEIHEKVIGTFIEKHMFY
>SDY_P044 hypothetical protein
MDEKKLKALAAELAKGLKTEADLNQFSRMQTKLTVETVLNAELTDHLGHE
KSYIR
>SDY_P036 hypothetical protein
MTGQYAVRCIAVRPRTAHAAYPAVASFFPEAVELRLRKISGSLAKDIVTA
AQFTIFTLQLFQTLTFSGGEPSITAPGIPLMLANPDTQSLRRTANLWSNG
TNCRSL
>SDY_P154 IS1353 transposase
MLTDIFNSNYQCYGYRRLHAMLRHEGGRLSEKVVRRLMVEEQLVVSRNRR
RRYSSYCGEIGPAPDNLIARDFKAEQPNQK
>SDY_P051 IS629 ORF2
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LRVADFTYASTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELAILTWVDWYNNRRLLERLGYTPPAEA
EKAYYASIGNDDLAA
>SDY_P152 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGPSRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>SDY_P043 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYRNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>SDY_P114 IS186 ORF1
MGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS
LAFGEADYIVRVYWRGLRWLTAEGMRFDMMDFLRGLDCGKNGETTVMIGN
SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA
GHVLLLTSLSEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP
ELAKAWIFANLLAAFLIDDIIQPSLDFPPR
>SDY_P026 IS3 ORF1
MTKTVSTSKKTRKQHSPEFCSEALKLAERIGVAAAARELSLYESQLYAWR
SKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKRLK
>SDY_P115 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRQRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISSAAFREKYHQMAA
>SDY_P008 IS600 ORF2
MCRVPGVSRSGYYDRVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELADNGIIVGRDRLARLRKELRLHCKQKRKFRATTSSDHNLPVTPNLL
NQNFTPTAPNQVWVADSVVQAFRNQPTEGAGRETAAYAVR
>SDY_P200 putative transposase
MTAAALLAEMPESSSLSRREISALVGVAQVNRDSGTLRGRRTIFGDCAGE
EQLCTWRRLRHPGLIW
>SDY_P089 hypothetical protein
MLSGQMFCIPLNNLVGDKINYDKITKITARDWRQYRAPGWQIIHQKRYCQ
TLRTHQF
>SDY_P018 IS91 ORF
MRYGSLAGWRYSAFLMLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQ
DEIGLRYNSHRTKREENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEA
AITGRCGVRHNGDSEKNGEANHKERDVSAVTEG
>SDY_P210 IS629 ORF2
MMPLLDKLREQYGVGPLCSELHIAPSTYYYCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSG
VKRSVRPSAGKPLPQATA
>SDY_P208 IS629 ORF2
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRHQGNVPRTP
GGPQRLVYVVSAADKDKHTSAVPSALRQRCPQGFYPVQRYGSPRLTDELC
ALVTTLT
>SDY_P040 IS600 ORF2
MCQVFGVSRSGYYNWIQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNSCAKSAEPDV
RSYSTKSGLGGGPDVCCHTGGMVVPRWHQRCLYVRNCRLRHGRAHDKRAD
R
>SDY_P196 IS3 ORF2
MNWPLKMPPEAATGGAGRGTGYPPKGRDILCEAPEMKYIFIEKHQAEFSI
KAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCDSVVLSAAFTRSKQRY
GAPRLTDELRAPGYHFNVKTVAASLRRQGLSWNLKLCNCVRL
>SDY_P063 hypothetical protein
MASFFPEAVELRLRKISGSLAKDIVTAAQFTIFTLQLFQTLTFSGGEPSI
TAPGIPLMLANPDTQSLRRTANLWSNGTNCRSL
>SDY_P122 oriT nicking and unwinding protein, fragment
MALFNHDTSRDQEPQLHTHAVVANVTQHNGGWKTLSSDKVGKTGFIENVY
ANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEAFSGRSQT
IREAVGEDASLKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKETGFDIRA
YRDAADQRAEIRTQAPGPASQDGPDVQQAVAQAIAGLSERKVQFTYTDVL
ARTVGILPPENGVIERARAGIDEAISREQLIPLDREKGLFTSGIHVLDEL
SVRALSRDIMKQNRVTVHPEKSVPRTAGYSDAVSVLAQDRPSLAIVSGQG
GAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDERLSGELITGR
RQLLEGMAFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQR
TGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFAAS
VKAGEESVAQVSGVREQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRY
LRDMYRPGMVMEQWNPETRSHDRYVTERVTAQSHSLTLRNAQGETQVVRI
SSLDSSWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASVSEDAM
TVVVPGRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFASVTQM
AMDNATPNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKARAG
ETLLETAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTEAKSFA
AEGTGFADLGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILR
HILEGKEAVTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVVQGYAG
VGKTTQFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDAQTLAS
FLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAAGGGRAV
ASGDTDQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAVYSLIN
RDVERALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAM
LKGEAFPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIH
DAREKAGELGQVQVMVPVLNTANIRDGELRRLSTWENNPDALALVDNVYH
RIAGISKDDGLITQQDAEGNTRLISPREAVAEGVTLYTPDTIRVGTGDRI
RFTKSDRERGYVANSVWTVIAVSGDSVTLSDGQQTCVIRPGQERAEQHID
LAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAYVALSRMKQHVQV
YTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAERLFSTARELRDVA
AGRAVLRQAGWPGETVLHGLLLRDVNIRNRMWHCRRLTVTASPPVSG
>SDY_P146 IS100 ORF2
MDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQ
LQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAA
DLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQV
IAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKG
ESYRLRQKRKAGVIAEANPE
>SDY_P197 IS3 ORF1
MTKTVSTSKKTRKQHSPEFCSEALKLAERIGVAAAARELSLYESQLYAWR
SKLQQQMTSSERKSELAAENAA
>SDY_P020 ISSfl4 ORF2
MISFPAGSRIWLVAGITDMRNGFNGLASKVQNVLKDDPFSGHLFIFRGRR
SDQIKVLWADSDGLCLFTRRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI
DWKHPKRTERAGIRI
>SDY_P081 IS2 orf1, fragment
MYLKIRDRLGYMSNTSSNFEMTGTLLGLELRKRKTPQEKIAIIQQTMEPG
MTVSHVARLHGIQPSLLLKWKK
>SDY_P113 hypothetical protein
MRVSELPDYLRHHWPELKAQLLSGRYRPSPVRRVSILKPGGGERLLGIQM
VVDRFIQQAMMQVLQAL
>SDY_P072 ISSfl1 ORF1
MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP
WRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA
LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR
>SDY_P005 iso-IS1 ORF2
MALICELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCREL
LALLTPFNIGMITSDDWGSYGREVTKDKHLTGKIFTQRIERNNLTLRTRI
KRLARKTICFSRSVEIHEKVIGTFIEKHMFY
>SDY_P127 hypothetical protein
MLHYSGGLKYRCHLSDMENNMRKYIPLVLFIFSWPVLSADIHGRVVRVLD
GDTIEVMDSLKAVRIRLVNIDAPEGNDSNLLIVFYVQIMPDDFVMQLHRF
>SDY_P098 iso-IS1 ORF2
MALICELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCREL
LALLTPFNIGMITSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRI
KRLARKTICFLRSVEIHEKVIGTFIEKHMFY
>SDY_P009 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGPSRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>SDY_P021 putative IS orf, fragment
MAAYASEINRLKALVAKLQRMQFGKSSEKLRAKTERQIQEAQERISALQE
EMAETLGEQYDPVLPSPLRQSSARKPLPASLPRETRVIRPEEECCPACGG
ELSSLGCDVSEQLELISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIAR
SYAGAGLLAHVVAGKYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAE
LLELLYDILRQYVLMPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRN
AGSEMPPAVWFAYSPDRKGIHPQNHLAGYSGVLQADAYGGYRALYESGRI
TEAACMAHVRRKIHDVHARVPTDITTEALQRIGELYAIEAEVRGCSAEQR
LAARKARAAPLMQSLYDWIQQQMKIHSLKMECLHGEHYYPSGNSAGNSV
>SDY_P106 hypothetical protein
MKICLYHTLNPDTIPGYKKFAQAIATDNFVQADVRKIDTNLYRARLSIRD
RLLFSLYRYHGETICLVLEYIRNHAYNTSRFLRRNVVIDEGRLQQQPVPD
PVDIATEALTYINPSHGRFHRLDKMLSFDDDQQALYEHPLPLVIVGSAGS
GKTALVLEKMKQAAGDILYLSLSSFLVEKARTLYDASGEGSEVQNIDFLS
LTEFLETLRIPEGREVTFSAFSDWLPRNRAIAALGAAHTLYEEFRGVIGA
VASGNGPLSREAYLSLGIRQSLYGMEDRPTVYVLFERYIAWLKQSHQYDS
NLLSHQYLSLATPRYDVIFVDEVQDMTPVQLQLVLKTLRHPGQFLLCGDA
NQIVHPSFFSWSSLKSLFFRQQQGNDTTVNILQANYRNGHHVTALANRLL
RLKQVRFSAIDRESHHFVRSCGQAEGTIRLLDDREETKQELNAKTSLSNR
VAVIVMHPEQKAQARCWFSTPLVFSVQEVKGLEYETVILYNIVSAARQAF
DDICEGLTPADLEGEARYSRPRDKQDRSAEIYKFFTNALYVALTRATHNV
YLVEQQVEHPLWSLLALTHQEEPLNLQEEISSRDEWQKTAHLLEKQGKQE
QADTIRSRILQTSEMPWQIITAEDARQWKQHILAGTADKTIQLQALEYSL
IYSLFPLYNALYREDFKPTRQPRTKTLQLLELKYFRPYSMNNPVAVLRDI
ERYGVDHRSPFNLTPLMSAARAGNIALVQLLLERGADPLLTGNDGLAAYH
QVLSAAVSTPRYAQQKSAQLYTLLKPESLSLQVEGRLIKLDNRLMAMFLV
ILMQALFHTHLGSALFFSEAFSAARLAECVVHLPEALLPERRKRRSYISS
QLSQHEVNSKNPYGKKLFLRLNHGQYILNPGLKIRQGDVWRAVYELQSPE
DLGHDLQTYLQDMSPELVDMLGGKKGFYERSEKSVGYWVGGIRRAAQKA
>SDY_P076 hypothetical protein
MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAG
FLQITENIRSEQCPGVTAAGRGAVSTENIAFTRWLTHLQNGVLLDEQNCL
MLHELWLQSGTGQRRWVRCTGNSGHYHLFFF
>SDY_P087 hypothetical protein
MQMKNNTAQATKVITAHVPLPMADKVDQMAARLERSRGWVIKQALSAWLA
QEEERNRLTLEALDDVTSGQVIDHQAVQAWADSLSTDHPLPVPR
>SDY_P105 IS3 ORF2
MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG
DITYLRTDEVRLHPVSTEPHAF
>SDY_P203 IS629 ORF2
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LRVADFTYASTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHTPPAEA
EKAYYASIGNDDLAA
>SDY_P001 putative resolvase, fragment
MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR
SERRLVKTLLADFQRGHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP
DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI
STVL
>SDY_P014 putative transposase
MTESRQEKLIWLRAQMKLTSLCWALKELAKDIWSRPWSEERRNDWQRWLS
LAANSDVPMMKNVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIK
AKGYRNRERFKLGVMFHYGKLNMAF
>SDY_P223 hypothetical protein
MTTTELFWDLNAIKWLVEGRGDDVSPFENIPVPDLTYNKEAWQFG
>SDY_P118 hypothetical protein
MENRLSAVLAAREAEGRQMASLFEPAVPEVVSGEDVTQAEQPQQPVSPAI
NDKKSDAGVSVPAGGIEQELKMKPEEEMEQQLPPGISESGEVVDMTAYEA
WQQENHPDIQQQMQRCEEVNINVHRERGEDVEPGDDF
>SDY_P052 hypothetical protein
MTGQYAVRCIAVRPRTAHAAYPAVASFFPEAVELRLRKISGSLAKDIVTA
AQFTIFTLQLFQTLTFSGGEPSITAPGIPLMLANPDTQSLRRTANLWSNG
TNCRSL
>SDY_P144 hypothetical protein
MEINVTAPALLTDEHILQPFDCGNEVLSNWLRGRAMKNQMLNASRTFVIC
LEDTLRVVGYYSLATGSVTHAELGRSLRHNMPNPVPVVLLGRLAVDVCTQ
GHGFGKWLLSDAIHRVVNLADQVGIKAVMVHAIDDDARAFYERFGFVQSV
VAPNTLFYKV
>SDY_P209 hypothetical protein
MASFFPEAVELRLRKISGSLAKDIVTAAQFTIFTLQLFQTLTFSGGEPSI
TAPGIPLMLANPDTQSLRRTANLWSNGTNCRSL
>SDY_P148 hypothetical protein
MSHNLEHQKVHTRMVKEVLKAVARANNHPYQSVFTDFIAGHPSCTVCFWE
TFHKMSPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW
>SDY_P066 iso-IS1 ORF2
MALICELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCREL
LALLTPFNIGMITSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRI
KRLARKTICFSRSVEIHEKVIGTFIEKHMFY
>SDY_P012 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SDY_P138 hypothetical protein
MKSVNQYRLTPGFGGFTPVSHVTTACRLPCR
>SDY_P123 putative DNA helicase I, fragment
MALPAFDRNGKSAGIWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQG
SRNGESLLADNMQDGVRIARDNPDSGVVVRIAGEGRPWNPGAITGGRVWG
DIPDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENK
PDLPDGKTEQAVREIAGQERDRAAITEREAALPESVLREPQRVREAVREV
ARENLLQERLQQMERDMVRDLQKEKTPGGD
>SDY_P016 IS3 orf2, fragment
MPSHHLYLANETQMVGLAPKHTAVQNGMAENFVKTMKYD
>SDY_P198 hypothetical protein
MKIKRSTFISNIFYIISWFLMNDNSLLRNSSLFIAYMGCVGWVSAYSYGW
GTSFYYGFPWWVVGAGLDDVARSLLYAIIVMGILFTGWGIGILFFLLIKK
RSKIQDLSFFRLFFAITLLFFPVIFELLILKQYFILPLSLSCIISSLVIS
IIIRIYGRIFSVSCFSDIPFVREHRIKLIMAGFLVYFWLFSFLVGWYKPQ
LKKEYQMLCYNNSWYYVLARYDSRLVLSSSFKDDSNRFLIFNTEQSGFYE
INDVYVRK
>SDY_P090 plasmid stable inheritance protein
MDKRRTIAFKPNPDVNQTDKIVCNTLDSIPQGERSRLNRAALTADLALYR
QDPRPPFLLCELLTKETTFSDIVNILRSLFPKEMVDFNSSTITQSSSQKE
QKSDEETQKNVMKLINQFNYY
>SDY_P084 IS100 ORF2
MLHEEKLARHQRKQAMYTRMVAFPAVKMFEEYDFTFATGAPQKQLQSLRS
LSFIERNENIVLQGTSDITNPWVGICV
>SDY_P017 hypothetical protein
MNRTAFRAIHAVGADTGRNCRKFLPVSAGNFRIPFVINTYKPEDQFFVSG
IHKERNIQVCATLNGIVKIIKAQNLSFFVTGSFKPSGLFPPAPQK
>SDY_P112 hypothetical protein
MIQAQEYIGAGYHWVVDLDLEKFFDRINHDVLMSRIEKRVSDKLVLSLIR
RFLNAGVMDAGLVRPVTEGTPQGGVISPLLSNLFLHYAFDMWMQRQCPDV
PF
>SDY_P068 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLVDKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYRNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>SDY_P201 putative transposase
MTESRQEKLIWLRAQMKLTSLCWALKELAKDIWSRPWSEERRNDWQRWLS
LAANSDVPMMKNVAKTIGKRLYGILNTMRHGVSNGNAEALNSKIRLLRIK
AKGYRNRERFKLGVMFHYGKLNMAF
>SDY_P061 hypothetical protein
MSLALTAGLSIVKSSQSTAQVEGACRLMRNPSVSPQAIAEAGFTATARAC
EAHPLLLALEDTTTINFSHSTAFDDLGNTTGDADDFLDSQEMKHVY
>SDY_P054 hypothetical protein
MKYTPVGVDIAKHLIQVHFIDENTGEVVDKQLRGRDFLEYFSNREPCLIG
MEACGGS
>SDY_P093 putative IS orf, fragment
MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI
CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTD
VLTRLPEWPEERLAELLPLEGFTFTG
>SDY_P213 iso-IS1 ORF1
MASVNIHCPRCQSAQVYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPG
IKELITEMAFNGAGVRDTARTLKIGINTVIRTLKNSRQSE
>SDY_P086 hypothetical protein
MEPVYVILNALLDSGRFTRKLILLGLSGSFSYIFGSIVATLGMGLVVDYL
GCGATFIVLILSLFLPSFSH
>SDY_P141 hypothetical protein
MSHNLEHQKVHTRMVKEVLKAVARANNHPYQSVFTDFIAGHPSCTVCFWE
TFHKMSPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW
>SDY_P217 IS629 ORF2
MTLLYQLELFLFRTWVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTY
VSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSG
TVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKA
EVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGYTPPAEAEKAYYASI
GNDDLAA
>SDY_P002 IS2 ORF2
MVHATGLMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKPREYLRQRA
NDNRCLEI
>SDY_P215 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SDY_P039 IS600 ORF2
MTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTS
MSRKGNCYDNAPMESFWGTPKNESLSHYRFNNRDEAISVIREYIEIFYNR
QRRHSRLGNISPAAFREKYHQMTA
>SDY_P048 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGPSRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>SDY_P058 iso-IS1 ORF1
MASVNIHCPRCQSAQVYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPG
IKELITEMAFNGAGVRDTARTLKIGINTVIRTLKNSRQSE
>SDY_P159 IS100 ORF1
MPDSRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIADAHPYKI
PATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVRFETEPGRQMQV
DWGTMRNGRSPLHVFVAVPGYSRMLYIEFTDNMRYDTLETCHRNAFRFFG
GVPREVLYDNMKTVVLQRDAYQTGQHRFHPSLWQFGKEMGLSPRLCRPFR
AQTKGKVERMVQYTRNSFYIPLMTRLRPMGSTVDVETANRHGLRWLHDVA
NQRKHETIQARPCDRWLEEQQSMLALPPEKKEYDVHPGENLVSFDNPVTL
FVPLIMGC
>SDY_P104 putative IS orf
MLTELRTRAYPCPPLTPRSTVCGLFARFRKSGLSWPLPAGMSEQKLDALL
YGSASTVPVVLTESTVMPKLPVVKKRPRRP
>SDY_P149 ISSfl4 ORF1
MNSQTTKDIPCFRSYLPDALRLRLEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLNAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRKNHGSLAAANRGVAEYELSE
>SDY_P133 iso-IS1 ORF2
MALICELDEQWSFVGSKARQHWLWYAYNTKTGGGLAYTFGPRTDETCREL
LALLTPFNIRMITSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRI
KRLARKTICFSRSVEIHEKVIGTFIEKHMFY
>SDY_P145 hypothetical protein
MSTAASVRKTPREHQINIRATDEERAVIDYAASLVNKNRTDFIMELAYQE
AKNIILDQRLFVLDNERYDSFITQLEAPVQNAEGRERLMAVKPEWK
>SDY_P222 IS3 ORF2
MECLHGEHFIYREIVRATVFNYIECDYNRWRRHRWCGGLSPEQFENQNLA
>SDY_P221 hypothetical protein
MSIEIKMISPIKNIKNVFPINTANTEYIVRNIYPRVEHGYFNESPNIYDK
KYISGITRSMAQLKIEEFINEKSRRLNYMKTMYSPCPEDFQPISRDEAST
PEGSWLTVISGKRPMGQFSVDSLYHPDLHALCELPEISCKIFPKENSDFL
YIIVVFRNDSPQGELRANRFIELYDIKREIMQVLRDESPELKSIKSEIII
AREMGELFSYASEEIDSYIKQMNDRLSQIKARMPVT
>SDY_P028 iso-IS1 ORF1
MVTVNLHCPRCQSVQVYRHGQNPKGHDRFRCRDCHRVFQLTYCYEARKPG
VKNQITEMAFNGAGVRDTARTLKIGINTVIRTLKSSRPGG
>SDY_P073 hypothetical protein
MATINARIDDDIKNQADEVLKLMNISQTQAIAAFYQYITEQKKLPFVITS
IVKTPHDLLRESTDMLAEALAVISNLQVWTEQQDGIGKAKLMEYYRRLDA
LYCCAKEKIGLLSDNRDAELGCVP
>SDY_P059 putative transposase, fragment
MDDKQLQALANELAKNLKTPEDLSQFDRLLKKSVLRLLSTLN
>SDY_P060 putative transposase
MELRTPRDRDGSFEPQLVKKNQTRITGMDNQLLALYARGMTTREITSVFK
EMYDADVSPALISKVTDAVIDQVVEWQNRPLDAIYPIVYLDCIVLKVRQD
SRVINKAVFLALGINLDGQKELLGMWLAENEGAKFWLNVLTELKNRGLND
ILIACVDGLKGYFLIVIHELTSSLDMPVSVIRVVHDALCRWR
>SDY_P139 IS1294 transposase
MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI
LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC
DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC
AIHTYGRRLNCHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQ
LLLKAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN
TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL
VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPAPVAKVC
YAQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM
PA
>SDY_P069 ISSfl1 ORF1
MINIIFNRLLSLLDANGFIDWSATALDGSNIRALKCAAGAQKNIPISTEI
MGRVALAAVLAPKSIWQQTEVASR
>SDY_P033 putative transposase, fragment
MSEQKITGIDLAKTNFYLFSINAHGKPAGKTKLSRNQLLNWLVQQPKMTV
AMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYNDAQAIAEA
CQHGTIRPVPIKTLEQQDVQTFLNMRRLVSMERTQLINHIRGLLAEYGIV
FSKGAAELRQK
>SDY_P157 hypothetical protein
MTGKRISGITCIIKSATDSVVRAAQNRRLEEAPGKLFELPEVLATAGSHT
LNVMQKGGRAARQARMFISYSEVSIKNPDNSGQALPLTYVCCREQAEDGA
CWHLLTSGKAASAADARRIVSHYERRWLTEEYHKAWKSGGTWNRCECRPG
ITLSAWWLSRRL
>SDY_P195 IS600 ORF2
MAHIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRA
TTSSDHNLPVTPNLLNQNFTPTAPNQVWVADITYIATREGWLYLAGVKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQHPPAGLIHHSDRGSQYCAYD
YRVMQEQSGLKTSMSRKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISS
AYGKTD
>SDY_P071 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRTLRNK
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>SDY_P034 putative reverse transcriptase
MFDQQRDGNLYKIWNRLCSGTWFPPPVLEKRIPKSNGKERILGIPTVSDR
IAQGAIKLFMEEKLDPIFHADSYGYRPGKSAHDALKQCAIRCWRYSWILE
VDISAFFDHVRHDLVLKALEHHGMPKWVILYCRRWMEAPMQSCENGEVIT
RTRGTPQGGVISPLLANLFLHYAFDLWMEREYRGVPFERYADDIVVHCSR
MSDATRLKNRLSERFSEVGLVLNAGKTNTAYIDTFKRRNVATSFTFLGYD
FKVRTLKNFKGERYRKCMPGASNAAMRKITETIKKWRIHRSTAESLLDFA
RRYNAIVRGWIEYYGKFWSRNFNYRLWSAMQSRLLKWMQSKYRLSNRKAQ
RKLTLVRKEDPKLFVHWYLLRASNE
>SDY_P199 putative transposase
MCWGRTALYMAALEAPRFNLVIKAFYMRLLAAGNAKKVALVACMRKLLTI
MNAMLRKNEKWNESYL
>SDY_P158 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFNDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGAGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYPPQGS
>SDY_P156 putative transposase
MADVADRRELRQFRQTPEQRFTQEQEHLQPLLGTDFDIRHVSWDGYIEVG
GNRYSVPESLCGQLVSIRISLDDELRIYSNEQQVASHRLCSAAYGWQTVP
EHHAPLWQQASERMAERLGEIQKRVITVCDREADIWHYLYYKVSHGQRGA
CCTESPAGRGTRQALRTAGSPGNRRKPHAECDAKRRAGSPSGPDVHQLQR
SQHKKSRQQRPGAPAHVCLLPGAGRGRCLLASADVRKSGECRRCTTYCQP
LRATLADRGIPQGVEKWWYMESLRMQTRDNLERMVVIKAFIAVRVLGLRQ
GGVSEETQNDSCEKILTPTEWKLLWVKLEGKPLPVQAPTLKWAWGDGMTA
NAQVVPVGSSCGMAGQTSGYG
>SDY_P202 hypothetical protein
MTGQYAVRCIAVRPRTAHAAYPAVASFFPEAVELRLRKISGSLAKDIVTA
AQFTIFTLQLFQTLTFSGGEPSITAPGIPLMLANPDTQSLRRTANLWSNG
TNCRSL
>SDY_P015 hypothetical protein
MKVSFKPLGYIFHDIYNKKHTIDEFNDVVRKAVLSGKINELNACHKVAIF
LAEKDNEITKKDKAKIIDTLTENYSIEFQQLMNISERTLNSSLYITPGES
GFVSFVNREGKICHTAYVKSSDNSMAYYHANGSSIDKYITDMCGLICMRH
IDSTGIIFYMLDEKVLSAIAEFMNEKGWRAAFCSAKNLYKCV
>SDY_P111 putative transposase
MCRLAVENLLYAARKRGLEIGIFCTIHTLRLHFEEHLPLVVAGRRLGVPK
STACSMFVRFRKAGLSWPLPAGMSERELDARLYGSASTVPVVLTESTVMP
EVPGGKKRPRRPNFPYEFKIALVEQSLQPGACVAQIARENGINDNLLFNW
RHQYRKGGLLPSGKNMPALLPVTLTPEPDNHGFDIIYTLSTHHQRFTFVR
LFDPYLIGSRSTFSHLAHHHIS
>SDY_P080 hypothetical protein
MLRAQLAQRMKTSGNKPNKRRRRCDEAADAEVLARILDIISDMPTYGYRR
V
>SDY_P218 hypothetical protein
MNTTFPLLFFSTAAFWLSLKSLARRDLELHDEIADLDVMIAAIVDELAPE
LIKRNAIGYESASQLLISQVADFKKL
>SDY_P079 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGPLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SDY_P062 IS629 ORF2
MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SDY_P143 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGPSRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>SDY_P088 hypothetical protein
MELKWTSKALSDLARLYDFLVLASKPAAARTVQSLTQAPVILLTHPRMGE
QLFQFEPREVRRIFTGEYEIRYELTGQTIYVLRLWYTRENR
>SDY_P132 hypothetical protein
MKLIIFILIVLIIAALLIRIILRSVNQHSPLLGDASN
>SDY_P129 putative iso-IS1 orf
MYDGVFEVLQWLLFLSAVPPVQLLTGWCVTAKALPDISAISALTAVKHGN
CSSLTPLLNPVRTRKSLIWP
>SDY_P065 iso-IS1 ORF1
MASVNIHCPRCQSAQVYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPG
IKELITEMAFNGAGVCDTARTLKIGINTVIRTLKNSRQSE
>SDY_P142 IS600 ORF2
MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELADNGIIVG
>SDY_P102 putative IS orf
MAGCRLGVPKSTVCGMFVRFRNAGLSWPLPVGMSEQELDALLYGSASTVP
VVLTESTVMPKLPVVKKRPRRPNADQLRIS
>SDY_P019 putative IS orf, fragment
MPKLPVVKKRPRRPNFPYEFKIALVEQSLQPGACVAQIARENGINDNLLF
NWRHQYRKGGLLPSGKNMPALLPVTLTPEPDNKIPAPAQEPEQINTPSDS
LCCELVLPAGTLRLKGKLTPALLQILIREIKGSSH
>SDY_P042 ISSfl1 ORF1
MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP
WRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA
LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR
>SDY_P022 putative transposase, fragment
MDAENETVLNANMTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRT
LRDRNGTFEPQQLKKNQP
>SDY_P083 iso-IS1 ORF1
MATVTVHCPRCHSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEARKPG
VKEQIVEMAHNGAGSCYTARTLKTGINTVIRTLKSSRPGG
>SDY_P091 putative IS orf, fragment
MTFTSFIVNVSPPFNAIRYIRPQYSCPCCEKVFSGKMPAHILSESAVESS
VIAQVVISKYTDHLPLYRQQHIFSRMGVELPVSTMADMVGVAGAALAPLA
KLLRHEFLTRDVIHADETSLRHLDTRKGGKSCSGWLCAYVSGERSGPPVV
CFDSQTGRALRYPETWLQCWRGGPLVSDGYSVYKSLADNHPGITSACCWS
HAGRGFANLYKASREPRAGVELRKIAGLYRIEKLIRERPVEKYGSGDNGT
PGR
>SDY_P103 IS4 ORF
MFPDSFMHIGQALDLVSRYDSLRNPLTSLGDYLAPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEIR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SDY_P155 IS1353 transposase
MVDCFDGKVVSGSLSTRPDAELVNTMLDNAVGTLNAGERPVIHSDRGGHY
RWPGWLERVNAAGLIRSMSRKGCSPDNAACEGFFGRLKTEMYYGRKWSGI
TPEKFMQQVDAYIRWYN
>SDY_P204 IS91 ORF
MRYGSLAGWRYSAFLMLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQ
DEIGLRYNSHRTKREENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEA
AITGRCGVRHNGDSEKNGEANHKERDVSAVTEG
>SDY_P150 hypothetical protein
MDSINVRFSSPPQDSCLLLYDSMEFKLDLIEKSYQLGACVAQLAREYGIN
DNLLFTW
>SDY_P095 iso-IS1 ORF1
MASVNIHCPRCQSAQVYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPG
IKELITEMAFNGAGVRDTARTLKIGINTVIRTLKNSRQSE
>SDY_P082 hypothetical protein
MINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQ
KMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLE
QIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGF
NDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHY
ANSSSWKSKRLC
>SDY_P011 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELNGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPRGYLRQRACNGLSDNRCLEI
>SDY_P029 IS100 ORF1
MPDSRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIADAHPYKI
PATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVRFETEPGRQMQV
DWGTMRNGRSPLHVFVAVPGYSRMLYIEFTDNMRYDTLETCHRNAFRFFG
GVPREVLYDNMKTVVLQRDAYQTGQHRFHPSLWQFGKEMGFSPRLCRPFR
AQTKGKVERMVQYTRNSFYIPLMTRLRPMGSTVDVETANRHGLRWLHDVA
NQRKHETIQARPCDRWLEEQQSMLALPPEKKEYDVHPGENLVSFDNPPNI
IHSPSTTHSAEEWRDDGTATSTTDGARRAVATGKPYKRSACAVTTGSRPG
MELYGLPGASAS
>SDY_P162 acp, Acp
MIKERILSIVAFCYGIAYSKLSEETKFIEDLSADSLSLIEMLDMISFEFN
LRIDESALEHIITIGDLISVVKNSTKSI
>SDY_P096 ccdA, post-segregation antitoxin
MKQRITVTIDSDSYQLLKSANVNISGLVNTAMQKEARRLRAERWQAENQQ
GMAEIARFIEMNGSFADENRDW
>SDY_P097 ccdB, post-segregation toxin
MQFKVYAYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLSDKVSREL
YPVVHVGDESWRMMTTDMASVPIFVIGEEVADLSHRENDIKNAINLMFWG
I
>SDY_P125 finO, FinO
MTEQKRPVLTLKRKTEGTAPVRSRKTIINVTTPPKWKVKKQKLAEKAARE
AELAAKKAQARQALSIYLNLPTLDEAVNILKPWWPGLFDGDTPRLLACGI
RDVLLEDVAQRDIPLSHKKLRRALKAITRSESYLCAMKAGACRYDTEGYV
TEHISQEEEAYAGARLAKIRRQNRIKAELQAVLDEK
>SDY_P131 hmo, putative regulator
MAVSVTAMFISQYAGKIIHKNRDSLSNSERESFNSAADHRLAELITGKLY
DRIPKEIWKYVR
>SDY_P214 icsA, IcsA
MNQIHKFFCNMTQCSQGGAGELPTVKEKTCKLSFSPFVVGAFLLLGGPIA
FAIPLSGTQELHFSEDNYEKLLTPVDGLSPLGAGEDGMDAWYITSSNPSH
ASRTKLRINSDIMISAGHGGAGDNNDGNSCGGNGGDSITGSDLSIINQGM
ILGGSGGSGADHNGDGGEAVTGDNLFIINGEIISGGHGGDSYSDSDGGNG
GDAVTGVNLPIINKGTISGGNGGNNYGEGDGGNGGDAITGSSLSVINKGT
FAGGNGGAAYGYGYDGYGGNAITGDNLSIINNGAILGGNGGHWGDAINGS
NMTIANSGYIISGKEDDGTQNVAGNAIHITGGNNSLILHEGSVITGDVQV
NNSSILKIINNDYTGTTPTIEGDLCAGDCTTVSLSGNKFTVSGDVSFGEN
SSLNLAGISSLEASGNMSFGNNVKVEAIINNWAQKDYKLLSADKGITGFS
VSNISIINPLLTTGAIDYTKSYISDQNKLIYGLSWNDTDGDSHGEFNLKE
NAELTVSTILADNLSHHNINSWDGKSLTKSGEGTLILAEKNTYSGFTNIN
AGILKMGTVEAMTRTAGVIVNKGATLNLSGMNQTVNTLLNSGTVLINNIN
APFLPDPVIVTGNMTLEKNGHVILNNSSSNVGQTYVQKGNWHGKGGILSL
GAVLGNDNSKTDRLEIAGHASGITYVAVTNEGGSGDKTLEGVQIISTDSS
DKNAFIQKGRIVAGSYDYRLKQGTVSGLNTNKWYLTSQMDNQESKQMSNQ
ESTQMSSRRASSQLVSSLNLGEGSIHTWRPEAGSYIANLIAMNTMFSPSL
YDRHGSTIVDPTTGQLSETTMWIRTVGGHNEHNLADRQLKTTANRMVYQI
GGDILKTNFTDHDGLHVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGL
YSSWFQDEKERTGLYMDAWLQYGWFNNTVKGNGLTGEKYSSKGITGALEV
GYIYPTIRWTAHNNIDNALYLNPQVQITRHGVKANDYIEHNGTMVTSSGV
NNIQAKLGLRTSLISQSCIDKETLRKFEPFLEVNWKWSSKQYGVIMNGMS
NHQIGNRNVIELKTGVGGRLADNLSIWGNVSQQLGNNSYRDTQGILGVKY
TF
>SDY_P170 icsB, IcsB
MILKISNFIDASNTKGPIRVEDTEHGPILVAQKFNLKDLFFRTLSTINAK
INSQILNEQLKNYRLANQKSLLLFLKTLASEKSAESAFAAYEAVKNSIQH
SFTGKDIKLMLNTAERFHGIGTAKNLERHLVFRCWENRGITHLGHTSISI
KNNLLQEPTHTYLSWYPGGNVTKDTEINYLFEKRSGYSVDTYKQDKLNMI
SDQTAERLDAGQEVRNLLNSKQDQNNNKKIFFPRANQKKDPYGYWGVSAD
KVYIPLSGDNKTKDGKISYNLFGLDETNMSKFICQKKADAFRQLANYKLI
SKSENCAGMALNVLKAGNSEIYFPLPDVKLVATPNNVYAYANKVRQRIES
LNQSYNEIMKYIESDFDLSRLTQLRRSYLKSFNKINLIHTPKTFKPLSIS
LYKHPTENVSSEDFDAVINACHSYLVKSAPSNMSRVLNELKTGATDKKEE
IIEKSIKTIDYYNSLKSPDLGTKLYIHDLLQVNKLLLNNSHSNI
>SDY_P224 icsP, IcsP
MDISTKKVEFSMKLKFLVLALCVPAIFTTHATTNYPLFIPDNISTDISLG
SLSGKTKERVYHPKEGGRKISQLDWKYSNATIVRGGIDWKLIPKVSFGVS
GWTTLGNQKASMVDKDWNNSNTPQIWTDQSWHPNTHLRDANEFELNLKGW
LLNNLDYRLGLIAGYQESRYSFNAMGGSYIYSENGGNRNKKGAHPSGERT
IGYKQLFKIPYIGLTANYRHENFEFGAELKYSGWVRSSDTDKHYQTETIF
KDEIKNQNYRSVAANIGYYVTPSAKFYIEGSRNYISNKKGDTSLYEQSTN
ISGTIKNSASIEYIGFLTSAGIKYIF
>SDY_P128 insB, IS1 ORF2
MSLLLPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL
ARLGRKALSFSKSVELHDKVIGHYLNIKHYQ
>SDY_P078 insB, IS1 ORF2
MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSP
CDVVIWMTDGWPLYESRLKGKLHIISKRYTQRIERHNLNLRQHLARLGRK
SLSFSKSVELHDKVIGHYLNIKHYQ
>SDY_P041 insB, IS1 ORF2
MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLLP
FDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRK
ALSFSKSVELHDKVIGHYLNIKHYQ
>SDY_P163 ipaA, IpaA
MHNVNNTQATTFLYKATSPSSTEYSELKSKISDIHSSQTSLKTPSSVSEK
ESFATSFNQKCLDFLFSSSGKEDVLRSIYSNSMNAYAKSEIIEFSNVLYS
LVHQNGLNFENEKGLQKIVAQYSELIIKDKLSQDSAFGPWSAKNKKLHQL
RQNIEHRLALLAQQHTSGEALSLGQKLLNTEVSSFIKNNILAELKLSNET
VSSLKLDDLVDAQAKLAFDSLRNQRKNTIDSNGFGIGKLSRDLNTVAVFP
ELLRKVLNNILEDIKDSHPIQDGLPTPPKDMPDGGPTPGANEKTSQPVIH
YHINNDNRTYDNRVFDNRVYDNSYHENPENDAQSPTSQTNDLLPRNANSL
LNPQRALVQKVTSVLPHSISDTVQTFANNSALEKVFNHTPDNSDGIDSDL
LTTSSQERSANNSLSRGHRPLNIQNSSTTTPLRPEGVTNSNDNSSDTTKS
SASFSHRVTSQISKFNSNTDSKVLQTDFLSRNGDPYLTRETIFEASKKVT
NSLSNLISLIGTKSGTQERELQEKSKDITKSTTEHRINNKLKITDANTLN
YVTETNADTIDKNHAIYEKAKEVSSALSKVLSKIDDTSAELLTEDISNLK
NNNDITAENNNIYKAAKDVTTSLSKVLKNINKD
>SDY_P166 ipaB, IpaB
MHNVSTTNTGLPLAKILASTELGDNTIQAANDAANKLFSLTIADLTANNN
INTTNAHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN
KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK
IKDLENKINQIQTRLSELAPDSPEKKKLRREEIQLTIKKDAAVKDRTLIE
QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT
QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA
EELNRVMGCVGKILGALLTIVSVVAAAFSGGSSLALAAVGLALMVTDAIV
QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG
SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL
KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN
SATQAGGSVASAVFQNNASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ
EVIADLLASMSNSQANRTDVAKAILQQTTA
>SDY_P165 ipaC, IpaC
MEIQNTKPTQILYTDISTKQTQSSSETQKSQNYQQIAAHIPLNAGKNPVL
TTTLNDDQLVKLSEQVQHDSEIIARLTDKKMKDLSEMSHTITPENTLDIS
SLSSNAVSLIISVAVLLSALRTAETKLGSQLSLIAFDATKSAAENIVRQG
LAALSSSITGAVTQVGITGIGAKKTHSGISEQKGALRKNLATAQSLEKEL
AGSKLGLNKQIDTNITSPQTNSSTKILGKNKLAPDNISLSTEHKTSLSSP
DISLQDKIDTQRRAYELNTLSAQQKQNIGRATMETSAVAGNISTSGGRYA
SALEEEEQLISQASSKQAEEASQVSKEASQATNQLIQKLLNIIDNINQSR
SSTASQIAGNIRA
>SDY_P164 ipaD, IpaD
MNITTLTNSISTSSFSPNNTNGSSTETVNSDIKTTTSSHPVSSLTMLNDT
LHNIRTTNQALKKDLSQKTLTKTSLEEIALHSSQISMDVNKSAQLLDILS
KKEYPINKDARELLHSAPEEAELDGYQMISHRELWDKIAKSINNINEQYL
KVYEHAVSSYTQMYQDFSAVLSSLAGWISPGGNDGNSVKLQVNSLKAELT
KLKEKYEDKPLYPANNTVSKEQADKWLTELGGTIGTVSRKNGGYVVNINM
SPIDNMLKSLNNLGGNGEVVLDNAKYQAWNAGFSAEDETMKNNLQTLVQK
YSNANSIFDNLVKVLSSTISSCTDTDKLFLHF
>SDY_P140 ipaH, invasion plasmid antigen, fragment
MTWELILDGYSESSYSATPRFAAARLPWFPENKQSNVSQIWHAFEHEEHA
NTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVAADA
TESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLEILE
DIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTAN
DLRTAEATVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEML
ENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLLREGIRV
ARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQL
RVADFTYASTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQAL
WARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAES
INGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHTPPAEAE
KAYYASIGNDDLAA
>SDY_P045 ipaH1.4, invasion plasmid antigen
MIRILVIMIKSTNIQAIGSGIMHQINNIYSLTPFSSPMELTPSCNEFYLK
AWSEWEKNGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQI
TTLEIRKNLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKI
KELPFLPENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKL
EGLALANNFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAG
NPLSGHTMRTLQQITTGPDYSGPRIFFSMGNSATISAPEHSLADAVTAWF
PENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAW
LEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLF
DNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEK
LQLSTAVKEMRFYGVSGVTANDLRTAEATVRSREENEFTDWFSLWGPWHA
VLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGA
QVMRETEQQIYRQLTDEVLALRLSENGSNHIA
>SDY_P037 ipaH4.5, invasion plasmid antigen
MKPINNHSFFRSLCGLSCISRLSVEEQCTRDYHRIWDDWAREGTTTENRI
QAVRLLKICLDTREPVLNLSLLKLRSLPPLPLHIRELNISNNELISLPEN
SPLLTELHVNGNNLNILPTLPSQLIKLNISFNRNLSCLPSLPPYLQSLSA
RFNSLETLPELPSTLTILRIEGNRLTVLPELPHRLQELFVSGNRLQELPE
FPQRLKYLKVGENQLRRLSRLPQELLALDVSNNLLTSLPENIITLPICTN
VNISGNPLSTRVLQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAV
TAWFPENKQSDISQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQ
VAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQAS
EGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTM
LAEKLQLSTAVKEMRFYGVSGVTANDLRTAEATVRSREENEFTDWFSLWG
PWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAER
EAGAQVMRETEQQIYRQLTDEVLA
>SDY_P038 ipaH7.8, invasion plasmid antigen
MFSVNNTHSSVSCSPSINSNLTSNEHYLRILTEWEKNSSPGEERGIAFNR
LSQCFQNQEAVLNLSDLNLTSLPELPKHISALIVENNKLTSLPKLPAFLK
ELNADNNRLSVIPELPESLTTLSVRSNQLENLPVLPNHLTSLFVENNRLY
NLPALPEKLKFLHVYYNRLTTLPDLPDKLEILCAQRNNLVTFPQFSDRNN
IRQKEYYFHFNQITTLPESFSQLDSSYRINISGNPLSTRVLQSLQRLTSS
PDYHGPQIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDISQIWHAFEHE
EHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVA
ADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLE
ILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGV
TANDLRTAEATVRSREENEFADWFSLWGPWHAVLKRTEADRWAQAEEQKY
EMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEV
LALRLSENGSRLHHS
>SDY_P099 ipaH9.8, invasion plasmid antigen
MSTGFNWMPIMLPINNNFSLPQNSFYNTISGTYADYFSAWDKWEKQALPG
EERDEAVSRLKECLINNSDELRLDRLNLSSLPDNLPAQITLLNVSYNQLT
NLPELPVTLKKLYSASNKLSELPVLPPALESLQVQHNELENLPALPDSLL
TMNISYNEIVSLPSLPQALKNLRATRNFLTELPAFSEGNNPVVREYFFDR
NQISHIPESILNLRNECSIHISDNPLSSHALQALQRLTSSPDYHGPRIYF
SMSDGQQNTLHRPLADAVTAWFPENKQSDVSQTWHAFEHEEHANTFSAFL
DRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDR
VALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKV
RTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEA
MVRSREENEFTDWFSLWGPWHAVLKRTEADRWALAEEQKYEMLENEYPQR
VADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLSENGS
QLHHS
>SDY_P160 ipaJ, IpaJ
MISHQTDVRVNENRVNEQGCFLARKQMYDNSCGAASLLCAAKELGVDKIP
QYKGSMSEMTRKSSLDLDNRCERDLYLITSGNYNPRTHKDNIADAGYSMP
DKIVMATRLLGLNAYVVEESNIFSQVISFIYPDARDLLIGMGCNIVHQRD
VLSSNQRVLEAVAVSFIGVPVGLHWVLCRPDGSYMDPAVGENYSCFSTME
LGARRSNSNFIGYTKIGISIVITNEAL
>SDY_P169 ipgA, IpgA
MCRKLYDKLYEITGAKLDFNDKNQAFILLEEQIPVCITDNDEYIFLTGLL
NEHELFTENIINPEHILILNYSLSRDYGSSICLLPDTHQCVLTKKHYKKY
LSPDELIESLYEFLFCIKLTIANITSEVN
>SDY_P168 ipgB1, IpgB1
MQILNKILPQVEFAIPRPSFNSLSHNKLVKKILSVFNLKQRFPHKNFGCP
VNINKIRDSVIDKIKDSNSGDQLFCWMSQERTSYVSSMINRSIDEMAIHN
GVVLTSDNKKNIFAAIEKKFPDIKLDEKSAQTSISHTALNEIASSGLRAK
ILKRYSSDMDLFNTQMKDLTNLVSSSVYDKIFNESTKVLQIEISAEVLKA
VYRQSNTN
>SDY_P025 ipgB2, IpgB2
MIIMLGTSFNNFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSI
LSSVSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHSREIGDNLRKQ
IFKQVEKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKND
TTSNVVNLISDQFFEKNVQYIDLKKLRGNMSDYITNLESPF
>SDY_P167 ipgC, IpgC
MSLNITENESISTAVIDAINSGATLKDINAIPDDMMDDIYSYAYDFYNKG
RIEEAEVFFRFLCIYDFYNVDYIMGLAAIYQIKEQFQQAADLYAVAFALG
KNDYTPVFHTGQCQLRLKAPLKAKECFELVIQHSNDEKLKIKAQSYLDAI
QDIKE
>SDY_P171 ipgD, IpgD
MHITNLGLHQVSFQSGDSYKGAEETGNQKSVSVISYQRVKNGERNKGVEA
LNRLYLQNQTSLTGKGLLFARDRAAVFYEAIKLAGGDTSKIKAMMEQLDT
YKLGEVDKRNINELNKVISEEIRAQLGIKNKKELQSEIKQIFTNHLNNKN
WEPINKNINYHGKNYGFQLTPASHMKIGNKNIFVKEYNGKGICCASTRES
QHIANMWLSKVVDDEGKEIFSGIRHGVISAYGLKKNSSERAVAARNKAEE
LASAALYSRRELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVK
ALKGLNSKRGEPTKLLIRNSDGLLQEVNVNLKVVTFNFGVNELALKMGLG
WRNVDKLNGESICSLLGDNFLKNGVIGGWAAEAIEKNPSCKNDVIYLANQ
IKEILNKKLQKNDNGEPYKLSQRMALLAYTIGAVPCWNCKSGKDRTGMQD
AEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGV
PGNKVMKKLPLSSLELSYAERIGDPKIWNMVKGYSSFV
>SDY_P172 ipgE, IpgE
MEDLADVICRALGIPSIDTDDQAIMLDDDVLIYIEKEGDSINLLCPFCAL
PENINDLIYALSLNYSEKICLATDDEGGSLIARLDLTGINEFEDIYVNTE
YYISRVRWLKDEFARRMKGY
>SDY_P173 ipgF, IpgF
MSRFVFILLCFIPHLGRADCWDKAGERYNIPSSLLKAIAEKESGFNKSAV
NVNNNGSKDYGIMQINDFHSKRLREMGYTEEMLISHPCLSVHYAAKLLNE
FMLMYGRGWEAVGAYNAGTSPKKKKERLKYAEDIYKRYLRIAAESKQNNR
RI
>SDY_P013 mkaD, mouse killing factor
MPIKKPCLKLNLDSLNVVRSEIPQMLSANERLKNNFNILYNQIRQYPAYY
FKVASNVPNYSDICQFFSVMYQGFQIVNHSGDVFIHACRENPQSKGDFVG
DKFHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIG
AQFTLYVKSDQECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDV
RPEDWKYVSYRNELRSDRNGSERQEQMLREEPFYRLMIE
>SDY_P110 msbB2, MsbB2
MKKYKSEFIPEFKKNYLSPVYWFTWFVLGMIAGISMFPPSFRDPVLAKIG
RWVGRLSRKARRRATINLSLCFPEKSDTEREIIVDNMFATALQSIVMMAE
LAIRGPEKFQKRVFWKGLEILEEIRHNNRNVIFLVPHGWSVDIPAMLLAA
QGEKMAAMFHQQRNPVIDYVWNSVRRKFGGRLHSREDGIKPFIQSVRQGY
WGYYLPDQDHGPEYSEFADFFATYKATLPIIGRLMNISQAMIIPLFPVYD
EKKHFLTIEVRPPMDACIASADNKMIARQMNKTVEILVGSHPEQYIWVLK
LLKTRKSNEADPYP
>SDY_P120 mvpT, plasmid maintenance protein
METTVFLSNRSQAVRLPKAVALPENVKRVEVIAVGRTRIITPAGETWDEW
FDGHSVSADFMDNREQPDMQERESF
>SDY_P185 mxiA, MxiA
MKVIQSFLKQVSTKPELIILVLMVMIIAMLIIPLPTYLVDFLIGLNIVLA
ILVFMGSFYIERILSFSTFPSVLLITTLFRLALSISTSRLILVDADAGKI
ITTFGQFVIGDSLAVGFVIFSIVTVVQFIVITKGSERVAEVAARFSLDGM
PGKQMSIDADLKAGIIDAAGAKERRSILERESQLYGSFDGAMKFIKGDAI
AGIIIIFVNLIGGISVGMSQHGMSLSGALSTYTILTIGDGLVSQIPALLI
SISAGFIVTRVNGDSDNMGRNIMSQIFGNPFVLIVTSALALAIGMLPGFP
FFVFFLIAVTLTALFYYKKVVEREKSLSESDPSGYTGTFDIDNSHDSSLA
MIENLDAISSETVPLILLFAENKINANDMEGLIERIRSQFFIDYGVRLPT
ILYRTSNELKVDDIVLLINEVRADSFNIYFDKVCITDENGDIDALGIPVV
PTSYNERVISWVDVSYTENLTNIDAKIKSAQDEFYYQLSQALLNNINEIF
GIQETKNMLDQFENRYPDLLKEVFRHVTIQRISEVLQRLLGENISVRNLK
LIMESLALWAPREKDVITLVEHVRASLSRYICSKIAVSGEIKVVMLSGYI
EDAIRKGIRQTSGGSFLNMDIEVSDEVMETLAHALRELRNAKKDFVLLVS
VDIRRFVKRLIDNRFKSILVISYAEIDEAYTINVLKTI
>SDY_P184 mxiC, MxiC
MTNSDDGDETADAELDSGLANSKYIDSSDEMASALSSFINRRDLEKLKGT
NSDSQERILDGEEDEINHKIFDLKRTLKDNLPLDRDFIDRLKRYFKDPSD
QVLALRELLNEKDLTAEQVELLTKIINEIISGSEKSVNAGINSAIQAKLF
GNKMKLEPQLLRACYRGFIMGNTSTTDQYIEWLGNFGFNHRHTIVNFVEQ
SLIVDMDSEKPSCNTYEFGFVLSKLIAIKMIRTSDVIFMKKLESSSLLKD
GSLSAEQLLLTLLYIFQYPSESEQILTSVIEVSRASHEDSVVYQTYLSSV
NESPHDIFKSESEREIAINILRELVTSAYKKELSR
>SDY_P183 mxiD, MxiD
MKKFNVKSLTLLIVLLPLIVNANNIDSHLLEQNDIAKYVAQSDTVGSFFE
RFSALLNYPIVVSKQAAKKRISGEFDLSNPEKMLEKLTLLVGLVWYKDGN
ALYIYDSGELISKVILLENISLNYLIQYLKDANLYDDRYPIRGNISDKTF
YISGPPALVELVANTATLLDKQVSSIGTDKVNFGVIKLKNTFVSDRTYNM
RGEDIVIPGVATVVERLLNNGKALSNRQAKNDLMPPFNITQKVSEDGNDF
SFSSVTNNSILEDVSLIAYPETNSILVKGNDQQIQIIRDIITQLDVAKRH
IELSLWIIDIDKSELNNLGVNWQGTASFGDSFGASFNMSSSASISTLDGN
KFIASVMALNQKKKANVVSRPVILTQENIPAIFDNNRTFYVSLVGERNSS
LEHVTYGTLINVIPRFSSRGQIEMSLTIEDGTGNSLSNYNYNNENTSVLP
EVGRTKISTIARVPQGKSLLIGGYTHETNSNEIVSIPFLSSIPVIGNVFK
YKTSNISNIVRVFLIQPREIKESSYYNTAEYKSLISEREIQKTTQIIPSE
TTLLEDEKSLVSYLNY
>SDY_P182 mxiE, MxiE
MEGFFFVRNQNIKFSDNVNYHYRFNINSCAKFLAFWDYFSGALVEHSHAE
KCIHFYHENDLRDSCNTESMLDKLMLKFIFSSDQNVSNALAMIRMTESYH
LVLYLLRTIEKEKEVRIKSLTEHYGVSEAYFRTLCRKALGAKVKEQLNTW
RLVNGLLDVFLHNQTITSAAMNNGYASTSHFSNEIKTRLGFSARELSNIT
FLVKKINEKI
>SDY_P174 mxiG, MxiG
MSEAKNSNLAPFRLLVKLTNGVGDEFPLYYGNNLIVLGRTIETLEFGNDN
FPENIIPVTDSKSDGIIYLTISKDNICQFSDEKGEQIDINSQFNSFEYDG
ISFHLKNMREDKSRGHILNGMYKNPSVFFFFAVIVVLIIIFSLSLKKDEV
KEIAEIIDDNRYGIVNTGQCNYILAETQNDAVWASVALNKTGFTKCRYIL
VSNKEINRIQQYINQRFPYINLYVLNLVSDKAELLVFLSKERNSSKDTEL
DKLKNALIVEFPYIKNIKFNYLSDHNARGDAKGIFTKVNVQYKEICENNK
VTYSVREELTDEKLELINRLISEHKNIYGDQYIEFSVLLIDDDFKGKSYL
NSKDSYVMLNDKHWFFLDKNK
>SDY_P175 mxiH, MxiH
MSVTVPNKDWTLSSLSETFDDGTQTLQGELTLALEALATNPSNPQLLAEY
QSKLSEYTLYRNAQSNTVKVIKDVDAAIIQNYR
>SDY_P176 mxiI, MxiI
MNYIYPVNQVDIIKASDFQSQEISSLEEVVSAKYSDIKMDTDIQVSQIME
MVSNQESLNPESLAKLQMMLSNYSIGISLAGTLARKTVSAVETLLKS
>SDY_P177 mxiJ, MxiJ
MIRYKFFILFLLLMLIGCEQRDELISNLSQRQANEIISVLERHNISARKV
DGGKQGISVQVEKGTFASAVDLMRMYDLPNPERVDISQMFPTDSLVSSPR
AEKARLYSAIEQRLEQSLVSIGGVISAKIHVSYDLDDKNISSKPMHISVI
AIYDSPKESELLVSNIKRFLKNTFSDVKYENISVILTPKEEYVYTNVQPV
KETKSEFLINEVIYVFLGMAVLVVILLVWAFKTGWFKRNKI
>SDY_P178 mxiK, MxiK
MDGIYKKYLSIIFDPAFYINRNRLNLPSELLENGVIRSEINNLIINKYDL
NCDIDPLSGVTAMFVANWNLLPAVAYFIGSQESRLINHSEMVISYYGGKI
SKQGEAAIRSGFWHLIAWKENISVGVYERINLLFNPIVLEGNYTPVERNL
SRLNEGMQYAKRHFTGIQTSCL
>SDY_P180 mxiL, MxiL
MINQINASNALQQRLNSEELVNLNDRLSSSQSLDEDIIYEIMQYFSQSEL
NPIDNDELHNKIEQLFNSRFPYLTAAQKSSLLNKLIDANQYVDLHEGFYA
SLSIYNNINFYIKTTTFDSLISVFEAGRDADDSTW
>SDY_P181 mxiM, MxiM
MIRHGSNKLKIFILSILLLSLSGCALKASSNSEKEWHIVPVSKDYFSIPN
DLFWSFNTTNKSINVYSKCISGKAVYSFNAGKFMSNFNVKEVDGCFMDAQ
KIAIDKLFSMLKDGVVLKGNKINDTILIEKDGEVKLKLIRGL
>SDY_P179 mxiN, MxiN
MKVCNMQKGTLPVSRHHAYDGVVIKRMEKELCKTIKDRDTESKKKAICVI
KDATKKAESLRIDAVCDGYQIGIQTAFEHIVDYICEWKLKQNENRRNIED
YITSLLSENLHDERIISTLLEQWLSSLRNTVTELKVVLPKCNSALRKKLE
LDLYKYRSDVKIIFKYSEGNNYIFCSGNQVVEFSPQDVISGVKIELAEKL
TKNDKKYFEELAHKKLRQIAEDLLKENPVND
>SDY_P003 ospB, OspB
MGQQIPRVFKNKMLYDYVFKNEKSKNDFLKMAESWLPQSEPVVINNDDDA
LNAAAYFSVKKAKIKTVNDTDFKEYNKVYILGHGSPGSHQLGLGSELIDV
QTIISRMKDCGILNVKDIRFTSCGSADKVAPKDFNNAPAESLSCILNSLP
FFKEKESLLEQIKKHLENDESLSDGLKISGYHGYGVHYGQELFPYSHYRS
TSIPADPEHTVKRSSQKKTFIINKELD
>SDY_P055 ospC1, OspC1
MNISEILNSSNTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDEKCLKDS
ANIIKDFFRKTIAAQSYSRMFSQGSNFKSLNIAIDAPSDAKASFKAIEHL
DRLSKHYISEIRKKLHPLSAKELNLLSLIINSDLIFRHQSNSDLSDKILN
IKSFNKIQSEGICTNRNTYADDIKKIANHDFVFFGVEISSHQKKHPLNTK
HHTVDFGANAYMIEHSSPYGYMTLTDHFDNVIPFAQYHEHQSFLDKFSEV
NEEVSRYVHGSKGIIDVPIFNTKDMKLGLGLYLIDFIRKSEDQSFKEFCY
EKNLSPVDLDRIINFVFQPEYHIPRMVSTENFKKVKIREISLEEAVTASN
YEEINKQVTNKKIALQALFFSITNKKEDVALYILSNFEITKQDVISIKHE
LYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYE
NAEMIKLLLKYGATSDNKYI
>SDY_P070 ospC2, OspC2
MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK
HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR
SLEHLDKVSRHYLSEIIQKTHPLSSDERHFLSIIINSDFNFRHQSNANLS
NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL
PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL
DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR
FREFCYNKNIDPVSLDRIINFVFQPEYHIPRMLSTDNFKKIRLRDISLED
AIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSKFNFTKQDV
AEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFINNGLVDVNKRFQKAN
SGDTMLDNAMKSKDSKTIDFLLKNGAVSGKRFGR
>SDY_P151 ospC3, OspC3
MKIPEAVNHINVQNNVDLVDGKTNPNKATKALQKNILRVTNSSSSGISEK
HLDHCANTVKNFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR
SLEHLDKVSRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLS
NNILNIKSFDKIQSENIQTHKNTYSEDIKEISNHDFVFFGVEISNHQEKL
PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL
DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQG
FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLED
AIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV
AEMEKMNNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKVN
SGDTMLDNAMKSKDSKMIDFLLKNGAILGKRFEI
>SDY_P023 ospD1, OspD1
MSINNYGLHPANNKNMHLIIGSNTANENKGIRSNIINVTNSAISHAINEE
KSGGGYSGVFFRKLAKIQSISIPTKNNKEYNRHNLFSLIWHGNADAARKY
GESLLAAEIPKEEKLEVLAARNNAGESALFIALQEGHSAAIQAYGDFIKT
FDLSPKETIKLLDVRDNEGLPGLFLAAGRGNIEAMMAYINICHHSGIKLT
EIADRLNNNEQDMFNIISDKIQELF
>SDY_P010 ospD2, OspD2
MSSMPLNKTFSSSIFSTKNSLSTDMSVNRDNRTITSSIMRVSNSSELIQF
KNKTAPYFSEKRNVKVNINGVAKDIYGRQIVCRHLASYWEMNFMETNGKV
NYQLLSTPDAIAKNVCLEKTEDFSKSPAYIYFVENKKWGTVITNFFYNMK
KNGDFVRTLSACTLNHQMALGLKIKRVQESEKWVVQFFDPNRTVTHKRTV
FTCDSHFELSQLSAKDFFDDFYWKIYGLEQPGQVIFEDRHNSPLTNTVKL
LPDELINSRVIYHAITKNLTEVLFILMEKYKNGEISQSKLVNLLATRSSD
GTPAFYIALQNGCSDIIQVYGKILNMCNLSQETILSLLAAVGANNVPGLC
MSFMNGHVDTIKAYGEIVFKTPLTSDKRLYLLAAKDSHDLPGLFFALQNG
HADSIRMFGSLLNKKMLSSEQIKELLKVKHGLFMALQNGHTKAIMAYGDI
LKILPPHQEYIDELLWIKNPNGTSGLFMAFYNGHTETIRAFCNILKNYSF
TTRRLVEMLSATNKDGIPGVFVSVVNRDKETILEYCRIIKENNLEPDTIA
EQFSKKIKKTFIEIINRFNHFL
>SDY_P056 ospD3, OspD3
MINKDNVSVETIQSLLHSKQLPYFSDKRSFLLNLNCQVTDLSGRLIVCRH
LASYWIAQFNKSSGHVDYHHFAFPDEIKNYVSVSEEEKAINVPGIIYFVE
NGSWGDIIYHIFNEMIFHAEKNRALEISTSNHNMALGLKIKETKNGGRFV
IQLYDPNHTATHLRAEFNNFNLDKIKKLTVDNFLDEKHQECYGLISDGMS
IFVDRHTPTSMSSIIRWPNNLLHPKVIYHAMRMGLTELIQKVTRVVQLSD
LSDNTLELLLAAKNDDGLSGLLLALQNGHSDTILAYGELLETSGLNLDKT
VELLTAEGMGGRISGLSQALQNGHAETIKTYGGLLKKRAINIEYNKLKNL
LTAYYYDEVHRQTPGLMFALQNGHADAIRAYGELILSLPFLNSEDIVNLL
ASRRYDNVPGLLLALNNGQADAILAYGDILNEAKLNLDKKAELLAAKDSN
GLSGLFVALHNGRVETIIAYGKILHTADLTPHQASKLLAAEGPNGVSGLI
IAFQNRNFEAIKTYMEIIKDENITPEEIAEHLDKKNGSDFLEIMSNIKS
>SDY_P046 ospE2, OspE2
MLTQTIFPCLPQKQENIILEVSNPVLLSSTVTTDGYTVFNKKAAIYELQI
PAANRTKTLKFTATEMQWLTKINEAGIDEKQSQRYSDF
>SDY_P101 ospG, OspG
MKITSTIIQTPFPFENNNSHAGIVTEPILGKLIGQGSTAEIFEDVNDSSA
LYKKYDLIGNQYNEILEMAWQESELFNAFYGDEASVVIQYGGDVYLRMLR
VPGTPLSDIDTADIPDNIESLYLQLICKLNELSIIHYDLNTGNMLYDKES
ESLFPIDFRNIYAEYYAATKKDKEIIDRRLQMRTNDFYSLLNRKYL
>SDY_P031 parA, plasmid segregation protein
MTSFEQLSKVAQRADKMLLALTKQIQEQKQEFQADVFYQVYSKSAVAKLP
KLTRASVDGAVGEMEAQGYQFEKRPAGTATKYALTIQNIIDIYAHRGIPK
YRDRYSEAYSIFIGSLKGGVSKTVSSVSVAHALRAHPHLLSEDLRILLLD
LDPQSSATMFLNYLHAVGLVDTTAPQAMLQNVSREELLEDFIVPSVIPGV
YVMPASIDDAFIASNWDTLCEEHLLGQNKHAILRENIIDKLKHDFDFILI
DTGPHLDAFLKNAIAAADIMFTPVPPAQVDFHSTLKYLARLPELVQIIEQ
DGCSCRLQANIGFMSKLANKSDHKYCHSLTKEIFGGDMLDVSMPRLDGFE
RSGESFDTVISANPVTYVGSGEALKNARMAAEAFAKAVFDRIEFIRANY
>SDY_P032 parB, plasmid segregation protein
MENRKHRPTIGRTLNTNILNNTEEISAPVHVFTLNTGRKAKFTEIKVDHD
KVDTQTFVVEEVNGREQTALTPDSLKDITRTIRLQQFYPCIGIRTGDLIE
ILDGSRRRAAALLCKVGLRVLVTDDELTVSEAQHLAKDLQTSLEHNIREI
GLRLVRLKEAGMNQKQIAEREGLSAAKVTRALQAASVPKDFVSLFPVQSE
LTYADYRQLAELSERLRLGDISIDEVVKNISPSIELITADDNLSEDEVKN
SIMRLITKEMSSLLDSGVKDKAVVTLLWKFDSKDKFARKRVKGRTFSYEF
GRLPLEVQDKLDRMIALVLKDNLNSL
>SDY_P067 phoN1, PhoN1
MKRQLFTLSIVGVFSLNTFASFPPGNDVTTKPDLYYLTNDNAIDSLALLP
PPPQIGSIAFLNDQAMYEKGLLLRNTERGKLAAEDANLSSGGVANVFSAA
FGSPITAKDSPELHKLLTNMIEDAGDLATRSAKEYYMRIRPFAFYGVSTC
NTKEQDTLSRNGSYPSGHTSIGWATALVLSEINPARQDTILKRGYELGDS
RVICGYHWQSDVDAARIVGSAIVATLHSNPVFQAQLQKAKDEFANNQKK
>SDY_P004 phoN2/apy, PhoN2/Apy
MKTKNFLLFCIATNMIFIPSANALKAEGFLTQQTSPDSLSILPPPPAEDS
VVFQADKAHYEFGRSLRDANRVRLASEDAYYENFGLAFSDAYGMDISREN
TPILYQLLTQVLQDSHDYAVRNAKEYYKRVRPFVIYKDATCTPDKDEKMA
ITGSYPSGHASFGWAVALILAEINPQRKAEILRRGYEFGESRVICGAHWQ
SDVEAGRLMGASVVAVLHNTPEFTKSLSEAKKEFEELNTPTNELTP
>SDY_P137 repA, RepA
MTDLQQTYYRQVKNPNPVFTPREGARTLPFCGKLMEKAVGFTSRFDFAIH
VAHARSLGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIE
CGLATESDAGTLSITRATRALTFLAELGLITYQTEYDPLIGCNIPTDITF
TPALFAALDVSEEAVASARRSRVEWENRQRKKQGLDTLGMDELMAKAWRF
VRERFRSYQTELKSRGMKRARARRDADRQRQDIVTLVKRQLTREISEGRF
TASREAVKREVERRVKERMILSRNRNYSRLATASP
>SDY_P220 repA, RepA
MTDLQQTYYRQVKNPNPVSTPREGARTLPFCGKLMEKAVGFTSRFDFAIH
VAHARSLGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIE
CGLATESDAGTLSITRATRALTFLAELGLITYQTEYDPLIGCNIPTDITF
TPALFAALDVSEEAVASARRSRVEWENRQRKKQGLDTLGMDELMAKAWRF
VRERFRSYQTELKSRGMKRARARRDADRQRQDIVTLVKRQLTREISEGRF
TASREAVKREVERRVKERMILSRNRNYSRLATASP
>SDY_P219 repB, RepB
MSQIENAVTSSPKRIYRKGNPLTGAEKQRISVSRKKGTHKAINVFIQSEL
KDDLTQLCKDSGLTQKEMIEHWILKEKAAVDDANRR
>SDY_P136 repB, RepB
MSQTENAVTSSLSQKRFVRRGKPMTDSEKQMAAVARKRLTHKEIKVFVKN
PLKDLMVEYCEREGITQAQFVEKIIKDELQRLDILK
>SDY_P108 rfbU, UDP-sugar hydrolase
MGIDITFALFRNSLHIPTAWRLLGIVHGFQPNAIVCHSGHDSNIVGLVRL
FTWKHPFRIIRQKTYLTRKTKVFSINHFCDEVIVPGTSMKTHLEQEGCRT
RVTVVPPGFDFQKLYVDSRNSLPPNVLSWLASRRGCPVIAQVGMLRPEKG
HEFMLNLLFHLKMNGRQFCWLIVGSGTPELREHLQYQIDSMGMHDDVFIA
DNVFPAAPVYRVASLVVLPSENESFGMVLAEASAFSVPVLASQIGGIPDV
IQNNQTGTLLPAGNKHAWMCALNDFFNDPGRFYQMARQAKQDIEERFDIN
KTALKILTLAKHK
>SDY_P107 shf, putative carbohydrate transport protein
MYHHVSHCPGLVTLSPVTFRKQMKWLAENNWKTLSSDELEFFYRGGKLPR
KSVMLTFDDGYLDNWFQVYPLLKEFNLKAHIFLITGFIGNGPVRHSPGKE
YSHRDCEHQIATGNADNVMLRWSEVNEMLQSGLVEFHVHTHTHTRWDKKF
SSREEQCKHLRQDLLSGREYLKEMTGKCSKHLCWPEGYYNKDYIQVAEEL
GFYYLYTTERRMNAPAKGTTRIGRISTKERESCAWLKRRLFYYTTPFFSS
LLAFHKGPRLPDD
>SDY_P194 spa-orf10, hypothetical protein
MGVNFCNKIGIDQSEFEIESSIINSIANEVLNPISFLSNKDIINVLLRKI
SSECDLVRKDIYRCALELVVEKTPDDL
>SDY_P188 spa13, Spa13
MEPVGAQSVSQLFNTRRKIAIVKKHIIQYQSERIILKGRIEEIQKDIDEA
NASKRKLLHKESKICKRISLIKRNNFAKQLILDELSQEDMKHGIR
>SDY_P186 spa15, Spa15
MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDNMPAINIALVNE
QVMLWANFDAPSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQL
RVVIKDDYVHDGIVFAEILHEFYQRMEILNEVL
>SDY_P191 spa24, Spa24
MLSDMSLIATLSFFTLLPFLVAAGTCYIKFSIVFVMVRNALGLQQVPSNM
TLNGIALIMALFVMKPIIEAGYENYLNGPQKFDTISDIVRFSDSGLIEYK
QYLKKHTDLELARFFQRSEEENADLKSAENNDYSLFSLLPAYALSEIKDA
FKIGFYLYLPFIVVDLVISSILLALGMMMMSPITISVPIKLVLFVALDGW
GILSKALIEQYINVPA
>SDY_P192 spa29, Spa29
MDISSWFESIHVFLILLNGVFFRLAPLFFFLPFLNNGIISPSIRIPVIFL
VASGLITSGKVDIGSSVFEHVYFLMFKEIIVGLLLSFCLSLPFWIFHAVG
SIIDNQRGATLSSSIDPANGVDTSELAKFFNLFSAVVFLYSGGMVFILES
IQLSYNICPLFSQCSFRVSNILTFLTLLASQAVILASPVMIVLLLSEVLL
GVLSRFAPQMNAFSVSLTIKSLLAIFIIFICSSTIYFSKVQFFLGEHKFF
TNLFVR
>SDY_P189 spa32, Spa32
MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN
YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDILESMVIKENILIP
VQDIKAREKINIGDMRGIFSYNESGNADKNFERSHTSSANPDNLLEPDNR
NSQIGLKNHSLSIDKNIADIISLLNGSVAKSFDLPVMKKNTSDITPSMSL
QEKSIVENDKNVYQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV
GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC
>SDY_P190 spa33, Spa33
MLRLKHFDANEKLQILYAKQLCERFAIQTFKNKFTGSESLVTLTSVCGDW
VIRIDTLSFLKKKYEVFSGFSTQESLLHLSKCVFIESSSVFSIPELSDKI
TFRITNEIQFATTGSHLCCFSSSLGIIYLDKMPVLRNQVSLDLLHHLLEF
CLGSSNVRLATLKRIRTGDIIIVQKLYNLLLCNQVIIGDYIVNDNNEAKI
NLSESNGESDHTEVSLALFNYDDINVKVDFILLEKKMTINELKMYVENEL
FKFPDDIVKHVNIKVNGSLVGHGELVSIEDGYGIEISSWMVKE
>SDY_P193 spa40, Spa40
MANKTEKPTPKKLKDAAKKGQSFKFKDLTTVVIILVGTFTIISFFSLSDV
MLLYRYVIINDFEINEGKYFFAVVIVFFKIIGFPLFFCVLSAVLPTLVQT
KFVLATKAIKIDFSVLNPVKGLKKIFSIKTIKEFFKSILLLIILALTTYF
FWINDRKIIFSQVFSSVDGLYLIWGGLFKDIILFFLAFSIFVIILDFVIE
FILYMKDMMMDKQEIKREYIEQEGHFETKSRRRELHIEILSEQTKSDIRN
SKLVVMNPTHIAIGIYFNPEIAPAPFISLIETNQCALAVRKYANEVGIPT
VRDVKLARKLYKTHTKYSFVDFEHLDEVLRLIVWLEQVENTH
>SDY_P187 spa47, Spa47
MSYTKLLTQLSFPNKISGPILETSLSDVSIGEICNIQAGIESNEIVARAQ
VVGFHDEKTILSLIGNSRGLSRQTLIKPTAQFLHTQVGRGLLGAVVNPLG
EVTDKFAVTDNSEILYRPVDNAPPLYSERAAIEKPFLTGVKVIDSLLTCG
EGQRMGIFASAGCGKTFLMNMLIEHSGADIYVIGLIGERGREVTETVDYL
KNSEKKSRCVLVYATSDYSSVDRCNAAYIATAIAEFFRTEGHKVALFIDS
LTRYARALRDVALAAGESPARRGYPVSVFDSLPRLLERPGKLKAGGSITA
FYTVLLEDDDFADPLAEEVRSILDGHIYLSRNLAQKGQFPAIDSLKSISR
VFTQVVDEKHRIMAAAFRELLSEIEELRTIIDFGEYKPGENASQDKIYNK
ISVVESFLKQDYRLGFTYEQTMELIGETIR
>SDY_P075 stbA, plasmid stable inheritance protein
MLKVSCDDGSTNVKLAWLEDGEVRTSLSGNSFKEGWNPGLFNAGKVYNYV
VDEKKYTYDLGSTAVIGTTHVSYQYSTTNLLAIHHALLTSGLQPQDVELT
VTLPVTEFFDNDNQPNEERIERKKANVLREISLNKGETFKIKKVNVMPES
LPAAFELLKKDKVNKLERSLIIDLGGTTLDCGLILGAFEGISEIRGYSEI
GTSRITHTVMNALTKASTPCNYFIADELIKNRHDNEYLQTLINDVAEIKN
ISHVIDREVKSLAESIRQEISTFSGMNRIYLTGGGAELIYPHIKQYFPNL
KVNKVDEPQFALVKAMVHA
>SDY_P074 stbB, plasmid stable inheritance protein
MESSDPKKRKKVVAYLHPALYPQDNLTQQTIDSLPVQMRGDFYRQSLICG
AALYSVDPRLLTLISGFFSEKITAENLVKLIEQTTGYTSTSIDISVLKNI
IEASSENKSESITSKDDFEEQTRRNLSMLKK
>SDY_P117 traD, hypothetical protein
MSVKLRLPQITFFVVSWILGRQGKQQSENEVTGGRQLTDNPKEVARMLKK
GGEDSDIRIGDLPIIRDSEIKNFCLHGTVGAGKSEVIRRLANYARKRGDM
VVIYDRSCEFVKSYYDPSIDNILNPLDVRCAAWDLWKECLTQPDFDNVAN
TLIPMGTKEAPFWQGSGRTIFAEAAYLMRNDPNRSYSKLVDTLLSIKIEK
LRTFLRNSPAANLVEEKIEKTAISIRAVLTNYVKAIRYLQRIEHNGESFT
IRDWMRGVREDKKNGWLFISSNADTHASLKPVISMWLSIAIRGLMAMGGN
RNRRVWFFCDELPTLHKLPDLVEILPEARKFGGCYVFGIQSYAQLEDIYG
EKAAATLFDVMNTRAFFRSPSHKIAEFAAGEIGEKEHLKASEQYSYGADP
VRDGVSTGKDMERQTLVSYSDIQSLPDLTCYVTLPGPYPAVKLSLKYQAR
PKVAPEFIPREINPENGEPSECRACCKGSRRSSDGQPLRTGCPGGCIRRG
RDSG
>SDY_P124 traX, F pilin acetylation protein
MTTDNTNTTRNDSLAARTDTWLQSFLVWSPGQRDIIKTVALVLMVLDHIN
LIFQLKQEWMFLAGRGAFPLFALVWGLNLSRHAHIRQPAINRLWGWGIIA
QFAYYLAGFPWYEGNILFAFAVAAQVLTWCETRSGWRTAAAILLMALWGP
LSGTSYGIAGLLMLAVSNRLYRAEDRAERLALVACLLAVIPALNLATSDA
AAVAGLVMTVLTVGLVLCAGKSLPRFWPGDFFPTFYACHLAVLGVLVL
>SDY_P119 trbH, TrbH
MNRSAPVFNSQAAHTFKFPGVVSHNNQPPTAGMTCDHLIKWPDRASLTGK
FCSYLAGVYGCSGVVIQNINAGNKSLDHSEITFRHLAFFCTIYQLHQGDR
TDTHSPLVKVKTLPDAGGFVLYRKNTDVGIEHKLQHQNDSLSCMSGCSLL
SIKSALTLCPSNHSSHVSPAGVMIRVRPTAITSTRFTFSGNATAFGSLTA
WLRLLRNTVVSIICLLMWICLVYIHCGIDTGICQRDIRL
>SDY_P064 ushA, UshA
MSEQRKPCKRGCVHTGTMIPLKKNITLIMFTLSLLTGNPAIAYETDKVYK
ITVLHTNDHHGHFWRNNHGEYGLSSQKTLVDNIRQKVINNGGSVLLLSGG
DINTGVPESDLQKAEPDIRGMNLIGYDAMAVGNHEFDNPLNILRQQEKWA
TFPFLSANIYQKSTGRRLFSPWKIFIRQNLKIAVIGLTTDDTAKTGNSEY
FTDIEFRQPAAEARSVIDELNQQEKPDIIIAATHMGHYDNGESGSNAPGD
VEMARSLPTGSLAMIVGGHSQAPVCMASDNKKQWNYIPGTTCVPDKQNGI
WIVQAHEWGKYVGQADFEFCNGTMKLVNYQLHPVNLKMRITREDGKTEFS
FYTPEITEDPQMLSLLTPFQNKGKAQLDVKVGVVNGRLEGDRSKVRFVQT
SMGHLILSALTERIDADFAVVSGGEIRDSIESGNITYKDILKVQPFGNTV
VSIDLTGKEVADYLATVAQMKPDSGAYPQFLNTSFVVKKGKIEMLKIKGK
SVDLNKKYRMTTFSFNATGGDGYPRIDNRPGYINTGFIDAEVLIEYIREH
SPLDAASYEPKGEVSWQ
>SDY_P211 virA, VirA
MQTSNITNYERNDSSWMSTVKSTTEVSWNKLSFCDVLLKIITFGIYSPHE
TLAEKYSEKKLMDSFSPSLSQDKMDGEFAHANIDGISIRLCLNKGICSVF
YLDGDKIQSTQLSSKEYNNLLSSLPPKQFNLGKVHTITAPVSGNFKTHKP
APEVIETAINCCTSIIPNDDYFSVKDTDFNSVWHDIYRDIRASDSNSTKI
YFNNIEIPLKLIADLINELGINEFIDSKKELQMLSYNQVNKIINSNFPQQ
DLCFQTEKLLFTSLFQDPAFISALTSAFWQSLHITSSSVEHIYAQIMSEN
IENRLNFMPEQRVINNCGHIIKINAVVPKNDTAISASGGRAYEVSSSILP
SHITCNGVGINKIETSYLVHAGTLPSSEGLRNAIPPESRQVSFAIISPDV
>SDY_P161 virB, VirB
MVDLCNDLLSIKEGQKKEFTLHSGNKVSFIKAKIPHKRIQDLTFVNQKTN
VRDQESLTEESLADIIKTIKLQQFFPVIGREIDGRIEILDGTRRRASAIY
AGADLEVLYSKEYISTLDARKLANDIQTAKEHSIRELGIGLNFLKVSGMS
YKDIAKKENLSRAKVTRAFQAASVPQEIISLFPIASELNFNDYKILFNYY
KGLEKANESLSSTLPILKEEIKDLDTNLPPDIYKKEILNIIKKSKNRKQN
PSLKVDSLFISKDKRTYIKRKENKTNRTLIFTLSKINKTVQREIDEAIRD
IISRHLSSS
>SDY_P109 virK, VirK
MFSVSNLSFIGFLKRIVFSSDSLPGKWEHRKFRFMYILRCAINPVASIRY
YYELRSLQCIEDILAIQPTLPARIHRPYLHKGGRAWSRGQYILEHYRFVQ
NLPEKYSEFLFPQKSVSLVQFIGKDGEDFDIQCSPSGFDREGELMLSLFF
NKIVIARLTFSVILTQNGHTAFIGGLQGAPKNTGPDVIRCATRACYGLFP
KRIIFEAFCALMKACNVSECLAVSEHSHVFRQLRYWYQKRKTFVAVYSDF
WESVAGKTCGDWYKLPTQVVRKPLSNIASKKRSEYRKRYALLDYIHETAI
RSLDAYPVNSEHQDLN
>SDY_P100 yacA, hypothetical protein
MAQVNMSVRIDAELKDAFMAAAKSMDRNGSQLIRDFMRQTVERQHNSWFR
DQVAAGRQQLECGDVLPHDMVESSAAAWRDEMSRKIAGK
>SDY_P126 yigA, hypothetical protein
MNGFRNSSRNGQVWRYQRAGGRAVILEVSGRWMEAAEAWRRAACVAPRTD
WQQFARKRAEHCHRRCRGRV
>SDY_P135 yihA, hypothetical protein
MLMQLHAAGIRTGDAERILSSGEYWQRQKTLLTEREVSFMKGLFRIVDMK
RWYLCPQVRGADIVQLNGNIRPRSRQWWQLFRMVSQWHVDVVIVELRSFS
IVAAVELDDASHLRPERRRRDILLEEVLRQAGIPLLRSHDARKLQQMTGE
WLNTTGADQQSPEHRS