TitleGenColors Logo

Gene list

Applied filters:

Organism: Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67, B67
Gene type: CDS

Number of genes found: 221

Free access
Sort by:

 



# Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67, B67

>SCV37 TraG protein
MFPFSSCCHPGRLDYFTSRKTSQSASHTDNNAESRVDQLSASLNNAKQSY
DQYTTNLTRSHEYAEMASRTESMSGQMSEDLSQQFVNYVQKHAPNDAESI
LTNTNSPDVAERHRAMAWAFVQEQVQPGVDSAYSGANENIGMGMPTNSNG
GTRSDVVNDYTTNSGVIDHHTRTSGIRSDISGEVDNMIDQNQQRHDADKT
QITNQSNDIKNQNSELEKQYNLEKNKHEKMYNEKMESQGHIPGGYSKEEL
TKMAEDMQKKHRGGKGDS
>SCV11 hypothetical protein
MLIIVSDCQFTRLALTRLLAHLDPVNMSVARWLQTAPPAGSHVLLAASPG
MLASLVPACHHARTSLSLKLALLGSGGQAVFLNTLGLRPDCLLPRTASAT
QLKAAVSSWLRRARGWRQATTESLSLRERQALCATLAGLSARTAPGSASA
PKPFTAIAAAR
>SCV19 hypothetical protein
MAIPPTYKEIFMNTALLVPFISTALSGVGVYTRYYFKKYQASRDLCQKVH
RMMVNTLTDTARLFSRSAPAQALKSELRIMQQKLSQETQKLDKMTTRLPL
KRLQREQRHILRHAKWLLGYLDSHLADNEGKFFLFLHDLATDSDDKVLDF
LTSLHDTKRFRNRRLQTGSTTCP
>SC031 hypothetical protein
MRVGLTEMLEPEAKTDNLNLATGANREKTLRHRGGTALQKR
>SC113 hypothetical protein
MDLSDGASVTTSRGSCPGADAKFFAGFMAKNFVDGISSPGLLKRDVVTPA
PGKSTAGNRPQGWRRKGWLRHQAQLPYTPSGVQGAPFPARQGRRDRNGRC
KPGFSDIGP
>SC128 hypothetical protein
MYGTCETLCRELAAQYPGNTPLMLVVWSPEEIQALADGMDIALTDHEIRT
VLARLEDIPEDQRIESGISSAAVMEIIRNESENRLVTVPAELLASLIQTA
EQALWKREWAARDNGLAVPECVTRRQAVVNQARALLKNNTREND
>SC147 hypothetical protein
MRNVVSDDKISDFRDLVNSNSSFVYQIYKDKGGKNLFNLVCSAMDWISVS
VRHLENAPEFDKNIDSRCMQVYSLISSIDLIFESIKQLHRVFITDKKDPF
YGEKKCFKDRLFANEDDNNYFKTIRACFGAHPVNLNQENSKRFASWPFQS
HFNTGDLSVHLYSRDVGKEDLTLNLNINELLEFLRIRYEYLDVIADRIET
LFVEYQHKLSKEKIETKLDPLEQLYVLRTESEKRLDNDYYNGEIDDLIMI
FEAEVTDADLVPLADKYKESLLPLIEEIKTNLQEMNIVDLANDSELRIRS
ELDKELRYELGKFYTWVHGGRYDPLLEYYFERFNASTDGKFKFTKTDDIK
LTFLKAKLMLTE
>SC136 hypothetical protein
MRHGLMEAACERRIPMPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRR
ATNEGIQLFLAGSAGLLQITENIRSEQCPGVTAAGRGAVSPENIAFTRWL
THLQNGVLLDEQNCLMLHELWLQSGTGQRRWEELPDDVRETITVHFTAKR
GDWCDIWGNEDVSVWWNRLCDNVLPEKTMPFDLLTVLPTRLDIEVNGFNG
GVLNGVPSAYHWYTERYGVKWPCGYDLNISSQGDNFIQVDFDTPWCQPES
DVIAELSRRFSCTLEHWYAEQGCDFCGWQLYERGELVDVLWGELEWSSPT
DDDELPEVTGPAWIVDKVAHYGG
>SCV16 hypothetical protein
MRVSEQVLLSSLRQGGCVRSFWRRSARLAGTPSPIVPDGLVLETPGERGD
TPLCHVDFAVVQKWLVCEETWTQTLGGTEFGGTVWRLRTDRENTTS
>SC083 hypothetical protein
MSALRHVLSDILFSGKNRLQVLAKTSGGAGFGALVGEELSITDTIETVVT
GSVRANNGVLSPVMEKRMINLLIIVLRAVVAVANALIAVLELIRELID
>SC142 hypothetical protein
MPKHNTRKRKYLLPGKNLIKGKAEVKTLHLADMVICVNGSILRFERFAFK
SCPVLFRGFRKVETSQFTDMKRSSFVRQIYSLLSENVTSTTASRYETLIK
YVRWVDDSNDTELIDKDMFHWELIDGFMTWCGRQNSKGLLSRPVWGRHRT
NISWLLKQLNRTQDTKRLPKISNVSGHTTPHKSLDIERELKPITKRLFNS
YFKLLEHYNAGTMPEKHPLYDKELLELIAQKKGIKGSDRVAHFAAFSRAV
QPSSGHKNNPITKIAMMLCYLFTGMNTTPLAKMKISDVNFKEVQGGKYIL
DSVKGRAQHQEQDNALGFSKHAKDFIESWLNVAKSMANGDSNSPLFPYYS
KDGQIKLYSEIGSSPHASINKLLTRLDFPKITPSILRKTKMDTLFQATES
VYLVAMSANNSIKVVSSQYLHGTSGEHEKKLNAAMDAKFSNAKGVDINTA
VEEAKYKHSNILDHYEYEALREGSNRSHEARTPTGIRCNDNKQGAVKTIN
KMLERIGVEMNSDEEACTSFLDCFECEHHSLVSDVTDIWLMMSFKETLQE
LEVTPAINSMPPKKMNTLLNTIETILGGFRDKSPENYKQAYELQKQTSHP
LYSNIYSLNDLHEVFS
>SC040 hypothetical protein
MVKIISFLSLMLLSISAYSNIGDIVIDGSIANANSDKGYVITEKQFDSFP
RDTIVTTTPWTPTGKKVKFKGVDLSYVLKLAGANGAKLKFHALNDYEIIV
DMNDVEKYNILLATEMDGKRLQIRDFGPYFLVYPLDEHYIELNTPHYLAR
FIWQIDKITVLK
>SCV35 TraD protein
MSFNAKDMTQGGQIASMRIRMFSQIANIIFYCLFIFFWILVGLVLWVKLS
WQTFVNGCIYWWCTTLEGMRDLIKSQPIYEIRYYGQTFRMNAAQVLQDKY
TVWCGEQLWSAFVLAACVALVICLITFFMATWILGRQGKQQSEDDVTGGR
QLTDNPKDVARMLKKDGKASDILIGDLPIIKDSEIQNFLLHGTVSTGKSE
IIRRLANYARQRGDMVVMYDRSCEFVKSYYDPSIDKILNPCDARCAAWDL
WRECLTLPDFDNAANTLIPMGTKEDPFWQGSGRTIFAEAAYLMRNDSYRS
YSKLVDTLLSIRIEKLRTYLRDSPAANLVEEKIEKTAISIRAVLTNYVKA
VRYLQGIERNGEPFTIRDWMRGVREDQKNGWLFISSNADTHSSLKPVISM
WLSIAIRGLLAMGENRNRRVWFFCDELPTLHKLPDLVEIVPEARKFGGCY
VFGIQSSTQMEDIYGEKAAATLLDVMNTRAFFRSPSYKIADYVAHEIGEK
EILKASEQYSYGADPVRDGVSTGKEMERVTLVSYSDIQSLPDLTCYVTLP
GPYPAVKLALKYQERPKVAPEFIPREMNPEAEARLSAVLAAREAEGRQMS
SLFEPDTEAVAPTAAEAQAEQPQQPQQPQQPQQPQQPQPTVVSDKKSAAA
GTAVNTPAGGVGQELKMKPEDEEQPLPPGINASGEVVDMAAYERWQQEQE
VNTQQQMQRREEVNINVHRGHGKDEPEPGDDF
>SCV15 hypothetical protein
MTRSRICPDTESTGLSPASDALPVRETTEQFAQVFFGKSVSLLTPEERFT
TVYAGSRDIPADLHPASWFPADTWFRNELRACAAYVGRRQGWPLYHASEA
ERLRALYPLRLATPATDPGKQLLTRTALLKAGYSRATIAAMTPVAERQNR
HSGDWYPLYRVQTETRDDSGEKT
>SC059 hypothetical protein
MAAPWISSPEMAMRDPGQGGRHRQTWRWSTPARIVGCGDASSRSVATAGD
LVPSSAMAMLQLDVRGLVVSASSDAAMLERGEKKPPRGVRGGKSACGSGR
KKPRRRSGRKTARPDVQLGHFSRIFRLIRCERPAVGAARSAVAFSRRASA
PGAAARRAALLQS
>SC143 hypothetical protein
MISVLKHNLASVPDYDENIPVSSYSDGSVESYIFDKKWKFSGAIQVGVGG
LTDISFSRIDMKYRKYIQSTLAYIMDEYKQKNKTHPTIGQLQHWKNGLKY
IVQSLGECNWNLLSDDRIFKQFKKSFSELAIKRSLSQTLINNVVTTFGLL
NSLNLCHRIVTIKSLNIQSIKPTKQYIAIPTNIYQQILNDAISTVESLHP
HRHEISALMNRIQTIYDEEKNRKDRSSSLGQINDRTRNRIRKIGHNIPDF
KLTRDGTNISNILTPCAIVLLAFSGVRVGELNSFTKDSYEEREGEHGDNI
CFLKGETTKTVNGIPKKEVWQTHRISKDALELAYDCTAYLRTTYIDDVNK
QYTEQKINLDEYKKTLRVLESVFLPVTLGGSKKTNYSSTAARSKNLKSYL
EQLNIKATKEDVDSFNTLNPSWHGELKEGQFLPKISAHDFRRTFAVFFKR
YGFGTTSTIKFQYKHRNINMSEYYGKNAQLQYFEDVLLDSDLIQLLHEEG
VNIAVDMFDEIYNDSETLSGLGGERIAKDKFEKLRSGDKVYMTRTEIESL
VRNGTLSIVKLPTGGYCTNSSCSRVCGIGEFSAEIKPCEHQIITDKEAKR
TQRQNKRLINTFRDLNNGDPMMKSILIGVKQKIKINESLIINHGIKVTAF
EDNIKGIIATSRG
>SC012 hypothetical protein
MINYGKMRLEFLQKALAQDTSGDFCFRVLHPEVSGPPDMKKASAGYRDFI
IGNRALLDLVNSAGEGAPVAHYSADEIQSLFSAQIQGSVDKYGDSFLTDD
PYVLAEDKLQTCQMEIDLMADVLRAPPRESAELIRYVFADEWPE
>SC167 hypothetical protein
MVLANHMQVLSAFQSRSERYTVILIRCDPSDNLMTSLRSACCCITNGALW
LQLSKSEGVNAKQKCHPRCETEMSCFGSENAFDRLAIYFRAFSFPNRNHE
NDNFLVYNLVYKAIA
>SC001 hypothetical protein
MASAPSFKWVKWRLKPVQSSTSISSSVILMRGSNWLVWSISACARSGTAA
SSGVIFSPDSVRMASGSSLSLARRVTVANCSSSSCKRSSRYWSLFGLIAR
GRACSRLSAANFSAGSR
>SCV51 hypothetical protein
MRVEVMEHYGLTQSIEQAGYYETAHHKQLMKDIKGAIREGRLIAVCGVVG
SGKTVTLRRLQQQLLDENKIIVARSLSVDKQSVRLATLINALFYDLAQDK
QVQIHKQGERRERELQELVKKGKRPVALFVDEAHDLNGNTLTGLKRLMEV
VEDGGGRLSVVLAGHPKLRNDLRRPTVEEIGYRTDIFTLDGITGSQREYI
QWLMKTSTGKGKPEDILTTEAVDLLAMKLRTPLQVQLHLTLAMEAGYQTG
EKPITATLVESVLSRQLDDLEPTLTRHGYRLKDMVEQFDAKPAEIRALFN
NQLDPARTAELRDRMLAVGLPI
>SCV43 hypothetical protein
MTLTEKSGHLAWCALVALALARQDGGVLSPAQENLFLTRWLATALKQRRF
SRDVTPDIEWLLKQGRQMGVSAKLASKLNYLWRSCTGELSEQNDLFRLTY
ALETAKDMHWNYRLLSDREWSGRNAVALSAGVNGIYLSRAKLDVGFNDSG
RQINSLTARLTGNVAGVMKLFDRCGWQSLMPPCPTSIR
>SC151 hypothetical protein
MTRYVVIKGASLPSPKNKTVCLCAVMIPPVFFHGSAAFTKSINQITCCTF
NMLPTWLSPVTVGNQYIVSLAAAFPTICCPCPFYFVIGQCEKRHHSGDAA
DGMMFIQGRKIGRYWLPAGRRAGQRAASVAGFKNAGGNLKMKPHTLRNFA
NFTQHASFRSQQSFHHRL
>SC061 hypothetical protein
MKKPNQDDEPFFITEEIAAEMIAGGYEFELPPIPCTIRLRDVLERMTDAE
LALQPGEIADQERERCRRKPCSTS
>SC081 hypothetical protein
MSMPPAIANTFLFEMMKSKSKDVTLAAIYALGEGRCQAENITRELHRLSQ
SDDMEIKIAAIKALGRIYR
>SC144 hypothetical protein
MRKSGKALARLRAALERLISGKPQNVSPSGKLTLNKINNEAGLGNSYIHK
FKDFIENEANPAIESFNANYDPVKAKLLQNKQNLTEKEKHKARMKKEVKL
KEQYRQERDDLKTINKELETQISSLMFRLYELQEQLNVQNVVKISQ
>SCV33 TraI protein
MSKGYTFMMSIAQVRSAGSAAGYYSDRDNYYVLGSLEERWAGKGAEQLGL
QGAVDKEVFTRVLEGRLPDGADLSRQQDGGNKHRPGYDLTFSAPKSVSLM
AMLAGDKRLTEAHNQAVDIAVRQVEALASTRVMTDGQSETVLTGNLVMAL
FNHDTSRDQEPQLHTHAVVANVTQHDGEWKTLSSDKVGKTGFIENVYANQ
IAFGKIYRAVLKEKVEALGYETEVVGKHGMWEMPGVPVEAFSSRSQTIRE
AVGDDASLKSRDVAALDTRKSKQHVDPEVKMAEWMQTLKDTGFDISAYRE
AADRRAEIQAAQPVPSQEQPDIQQAVTQAIAGLSDRKVQFTYTDVLARTV
GMLPPEAGVIEKARAGIDEAINREQLIPLDREKGLFTSGIHVLDELSVRA
LSSDIMKQNRVTVHPEKSVPRTGSYSDAVSVLARDRPSLAIISGQGGAAG
QRERVAELTMMAREQTLIVTHLNEDRRVLNSMIQDALAKPSEQQVTVPVL
TTANIRDGELRRLSTWENHQGALALVDNVYHRIAGISKEDGLITLQDADG
NTRLISPREAAAEGVTLYNPETIRGGGGGGPVRRLPLRWREPKGAGNRWP
VLSRRMWRCRV
>SC019 hypothetical protein
MERKLTLRDFGHEIIKKDLHLDPFKLKMKF
>SCV12 hypothetical protein
MRTGVISMNQPLLFRTVAGRQSNHEGFYMPPGIRHLGTLSLYRAVAWWGL
FLGREFTRDDVSEAFSIEPRRASGILNYICNRHNDDDICFDSRLHPVRGG
RAQLVVRIRAVESRPDTIRRQRTDRPGGKVSDRQYDRQMAHWLLSRPAGG
DTAKLAAWQAACPVREASC
>SC118 ParB-like nuclease
MIMPVTKCEPETTRKASRKYAKTQETVLSALLAQTEEVSVPLASLIKSPL
NVRTVPYSAESVSELAESIKGVGLLQNLVVHTLPGDRYGVAAGGRRLAAL
NMLAERGIIPADWPVRVKVIPQELATAASMTENGHRRDMHPAEQIAGFRA
MAQEGKTPAQIGDLLGYSPRHVQRMLKLADLAPVILDALAEDRITTEHCQ
ALALENDTARQVQVFEAACQSGWGGKPDVRVIRNLITESEVAVAGNSKFR
FVGADAFSPDELRTDLFSDDEGGYVDCVALDAALLEKLQAVAEFLREAEG
WEWCAGRMEPVGECREDAGTYRCLPEPEAVLTEAEEERLNELMARYDALE
NQCEESDLLEAEMKLMRCMAKVRAWTPEMRAGGGVVVSWRYGNVCVQRGV
QLRSEDDATDDADRTEQVQEKASVEAISLPLLTKMSSERTLAVQAALMQQ
PDKSLALLAWTLCLNVFDSGAYSKPAQISLECKHYSLTSDAPSGKEGAAF
LALMAEKARLAALLPEGWSRDMTTFLSLSQEVLLSLLSFCTACSLNGVQT
RECGRTSRSPLDSLDSAIGFHMRDWWQPTKANFFGHLKKPQIIAALNDAG
LSGAARDAEKMKKGDAAEHAEFHMKDNRWVPGWMCSPRPQTDATERADNL
ADAA
>SC155 hypothetical protein
MFAGKKSAQIREILISESAWEEITCLFAPSLANTFLPFGSRFLIAFNSVK
LAPGYTDSCSLILPG
>SCV31 TraX protein
MTMDNATRNDSRAVRVDTRLQRLLAWSPGQRDLIKTVALLLMVADHINRI
LHLNQEWLFLAGRGAFPLFALVWGLNLSRHTHIRQSAINRLWGWAVIAQS
GYFLAGFPWYEGNILFAFAVTAQALKWCEQRCLFHSAAALLLLTAWIPLS
GTSYGVAGVLVLVICYRLYRITDTEEHLALAACLVVAVPALNLVTSDAAA
VAGLLVTGLTVWLVSLTGKSRPRFWPADFFPVFYTCHLAALGVLAM
>SC104 hypothetical protein
MSPHRHDCRVSQVISRRSEAPSAVRVFGDFPAVDLSDGGRSPRHGAVVRG
QTQKFLTPAAKSFVAGTSLPWTTETGCGRVCTPGKSVCGEQAAVRRQTGR
DSCAIRRSGLTHRKVCRERRIRRD
>SCV50 hypothetical protein
MSSRRSAIPSDSLLQLRQRLDRLPPKSPERANQIAATAQLYGISVTTVYR
ALHLVLKPRTAHRSDHGQPRILPPSELEHYCELIAALKLRTTNKSGRHLS
TGRAIQLLEEHGVETVQGLIKSPKGLLRKQTVNRWLSRWRLDQPRLLREP
PAVRFQAENSNDCWQFDMSPSDLKHIERPDWVDPARGEPTLMLFSVVDDR
SGVAYQEYRCVYGEDAESALRFLFNAMAPKTRSDFPFQGRPKLLYLDNGP
VAKNHVFQNVMQSLKVDWLTHTPAGKDGTRTTARSKGKVERPFRTVKEAH
ETLYHFHKPETELQANEWLWNYLSRYNAQRHRSEKHSRLEDWLANIGQEG
VRDMCSWEQYCRFAREPESRKVGVDARITIDGTAWEVEPDMAGETVILLW
GLFDEEMYVEFTGETWGPYYPVSGPVPLHRYRTFRRGKAAERADRIHALA
RQLNIPISALSGSDLRVVSDDTQQRIDALPHQPFDTRKFEYHFPTVIAAK
LAIADDLAIPLARMSDEDRAFIDSILTETLNRSEVLARIRDYFRSRQSGE
DHAG
>SCV13 hypothetical protein YahA
MHFSAFRLQQAIRNREFTPFYQPIVCATGGEVVGCEMLARWLHPQKGLLS
AGNFIPAIEATGLGGALLRGLADEVCGDGQDLARSAGRRLMMTLNLSLSL
VMTPLFRPHLLALSIRLEQAGMTPVFEITEREDIRAFPQAAVFRQLAAGG
LRFAVDDFGTGHAGPASTVADRMIARTVSLARC
>SC140 hypothetical protein
MAIVYEIKTTEIKPFTYRTPLITPDENGELSIKYSRQQPKHIKKVVLLNL
VGRNTNGDIVSYEPMEQVNRFLLAHHLNDNKQESEQYSKGLVHYFSFLLE
LQRLWDSEYDEDLYEEFIDLPRPSWDKFPFRKSDKATYQYREALIKAVLE
PDTPNHAIARTTAIAYMGAVVKFYSFHIRNGYKFNNPPFEHEVVSIQYQG
GSTSIGAYLSKDIHTTDLRLNLGKSKRNDGGALSSARRDLKPLTNKEWLA
VEDILTNTRRVIKKVAGETTTSNLSIEYCLFFLVARYTGLRKEEVASLHK
GQVAKPGEDKKAMKLGVGGQYGSLTKTAGSGNKSRQTIIPTRIMHLLYEY
TRSDRYKKRISKFKEHCELKRQKGELGYFDGEDGVDDTKEYLFISQTGVP
LFTKLPEANARWNEIRATVNILSGFNLTATIHNLRATFAVSLFRLLLRRV
TPDKALALVSECLGHGEESVTLIYLKIAQDEPTGDEIYEDILEFIGAFED
ADISGMESQ
>SC162 hypothetical protein
MIAFNHVVPVLNLSVFNVRRAPAFAFEQSKRATIGGRFIRVDESRDLPLL
HVVEDFTQKPVCSFAVTTGGEIKIDSAAPAVDGPVQIRPAAIDLHVGFIH
VPRAKIGRVTPVPAQPFFHFRRITLNPAVNRGVIDIHSAFSQHLLQLTVT
DAVFAVPAYGPQNDVTLKMPAFEWVHVQLHQQKGMISLSPPTICNSAIDS
YVAFLACIFGWFVTVFNGDTSETRSGRLSGTRNEALW
>SC094 hypothetical protein
MIDKKISRGFLLYTVHKTLSAYSLKRPRLF
>SCV21 probable site-specific recombinase
MRALSEKTQARRECPPKDEVRLVPLTDISYVRQIESWVITPVPAAQILRI
LPS
>SC149 orf44
MTNIILCDVTASVSELKNDPVATASARGGYPVEIIDRNRPVFYCVPAALY
EQMLDELDEKDLVQTITERQNQPLREVDLNQYL
>SC098 hypothetical protein
MEALTGLYLLFLLSLKSDFPLLFQVVLVCSA
>SCV32 TraI protein
MAGFESAYVALSRMKQHVQVYTDDRQGWVKAINSAEQKGTAHDVLEPKSE
REMMNAERLFSTARELRDVAAGRAVLRNAGLAQGDSRARLIAPGRKYPQP
YVALPAFDRNGKSAGIWLNPLTTDDGAGLRGFTGEGRVKGSEEAQFVALQ
GSRNGESLLAGSMQDGVRIARENPDSGVVVRIAGDGRPWNPVAITGGRVW
GDIPDSNVQPGAGKPGLGAAAADGAGDGA
>SC131 hypothetical protein
MEYRSRRLFDGKAMPRKRLEGRSSEKGDGLAYTATDASRER
>SC064 hypothetical protein
MTEIIKDLYQFTEVMEPIKLSMHQYLLMTNEPVLIQTGAVSQAQTTIPKL
QELLGERKIKYILISHFESDECGGLALVLKEHPEAVAVCSETTARQLMGF
GITNNVLIKKPNEIFAGDDFEFQTISYPSEMHMWEGLLFFEKKRGIFFSS
DLMFGMGENHGQVIESSWDAAVKSSGADTLPNQESGQKLSSDLSEIEPKF
VASGHGFCITIVG
>SCV36 TraT complement resistance protein precursor
MKMKKIMMVALVSSTLALSGCGAMSTAIKKRNLEVKTQMSQTIWLDPSSE
RTVYLQVKNTSDKDMSDLQSLIAKDIQAKGYTVVTSPDKAYYWIQANVLK
ADKMDLRESQGWLNRGYEGALSGAALGAGITAYNSNSAGATLGVGLAAGL
AGMAADALVEDVNYTMITDVQIAERTKATVTTDNVAALRQGTSGAKIQTS
TETGNQHKYQTRVVSNANKVNLKFEEAKPVLEEQLAKSIANIL
>SCV07 transposase
MSNHHESLEGFLRKRHSVSKLVVHLIFTTKYRCKLFDGQIIAQLRDAFGS
AAAKLECEIIEMDGEQDHVHLLIAYPLKLGVSVMVNNLKSVSSRLLRQQN
THLRMQSKTGLLWSRSYFACSAGGATIETLKAYVLRQNTPE
>SC170 hypothetical protein
MALEQVIYNLEEVKGLSDEELYQRLYDSWGLENHKIDVWGELSINVAGNW
AGIRNAKTLSGNQSIEYPLADSHELQSGVFLTPSVAKIVLGNDTKGMVAC
ELMLASIPQRAKKNNPFLLAVNDKTVERLTYLPDALPNLDRQALIQNEDE
TLLRKVIYDAEVKRVKHKIASETEALEAALQEKQQDMTDKLAELSDAFKS
TQDKITASTTELESSIKRNEELNNDNHRLNLKIQESKAELDVIIEKSRKV
EESMARKVEKLTDFIKEKAIFLKSFEFLDEEDFDFFVGSPLSNKERGEHI
SFSQALYSNYSDAVSYIQAYLKERDILYPRHIIENFLTLIRTNDLIILAG
DSGSGKTNLVQSFAKAIGGVSKIIPVKPNWTSSEDLLGYYNPLEKKYLAT
PFLEALIEAKQNPDIPYFICLDEMNLARVEYYFADFLSKLEERNEQPTIQ
LYSDDEAAHVLAELKGVVSVISNAQEKFSKNGIVDFVALMQDEEINAEMK
RAFGFSDKDSLIKYHGDIRRMLAGVLGTPSSITIPANVRIIGAINIDETT
HYLSPKILDRAHVMKFKSPLLTDWDAIFDEIDSYGLDDVTLPLVFDIEEL
GERTPYPKFERLNEFCELFTTLNRDVFDPLGVEFGMRTIRQGLNYVSLFS
DVNDNKSLAINNFIVHKVLPKFTFDGDKQVGDYSKAELVSRVFLPRLESL
LDNQVEIAAEFSCTKSIERLVKTAESNDGVVNYWA
>SC085 hypothetical protein
MKKKGRNEVSLRNKAAAENLPPCCPLTFNFVG
>SC039 hypothetical protein
MKKNFFIILVALLLLIAFTVSSVVKYTSLLSEKNLYTVAGTQENYGFSVA
KFIIQLNELYTLINSDNTIDDVRLKFDILYSRLNVIYVKSEATAPLYKQQ
GYEETIDSINKKLEDIDGLLSHNHPDYKKISTIIADIKPLTKTVTNLADM
AEIAQRNDALKDFRSKRQQLWTLLFITGGLISLLLFVLFIYISKINRLLL
SERAAFASKNAFLGMVGHELRTSLQAIVSIIDVVTNNLSGGIKSGQIERL
ETAVSKMERQLNDLAEFAKIDNGSVEIKNTYNSLQAIVTNAVQDCIAIYE
KKDVTVKIKNNNDAVIFTDALRLNQVIENLTSNAIKYTERGEVNVDYFIE
KGKVLNIVISDTGKGIPKDKLKFIFKPFTRVVDSKSTVPGFGMGLAIVAG
IIRLLKGTIHISSEVNVGTTVTVRIPIKLGDSSLATAEVSASAPGDVFGV
LSLLVIDDNEMACSSLSSLLTNAGYIVEATTSPERALEKLLRKPYGHCCK
VSDEAAFCLIQRPYISKTLLTRRISPRGSP
>SC041 hypothetical protein
MISNDSLIVAPYDYQKLSYENILPLNNGLIKNWNIQKFSSDYSEIEHLNV
INGMGVTLGDSIVGISVLSAIKKKNPKIKIKLIRPETAPSYVEELYELAS
SVIDELHYMPFHIQKLPCTALNIDVGNQLYSPDFHIMEMHDYFIQGLGVK
LDDVNHNELSNKWLGNIDLGAPVFENYTLFAPIASTKIRSIPSKFYYDIV
DMLSRREGKTVLGFVDVNHKNYINISKYSPHTKDFIALIKYASNVYTCDS
SALHIAAGFGVPTECVFNTIPPELRTKYYSKCKSIYVGNKRLEKIQSSED
FQLVKMVENNYREYLYG
>SCV01 transposase
MPIIAAIPDEERQLMRKEAQQTHDKNHARRLIAMLMLHQGMTVTDVARLL
CAARSSVGRWINWFTLHGVEGLKSLRPGRAPRWPVADILQLLPLLVQRSP
KDFGWLRSRWSTELLALVINRLFDVTLHRSTLHRYLRQADMVWRRAAPTL
KIKDPHYEEKRLVIDQALAQEQTAHPVFYQDEVDIDLNPKIGADWMPKGQ
QKRIATPGQNQKHYLAGALHSGTGRVHYVSGSSKSSDLFISLLETLRRTY
RRAKTITLVADNYIIHKSRKVERWLEENPKFRLLFLPMYSPWLNPIERLW
LSLHETITRNHQCRYMWQLLKQVAQFMNAASLFPGNQQGLAKVER
>SC087 hypothetical protein
MVLYFTTTHVMLILVNKRIPLGDGNVAVKTTYFPEKDKKSHQHNPWA
>SCV09 hypothetical protein YeeJ
MLGANIFLDYDLSRDHARAGFGGEYWRDLLKLSANAYVGLTGWKTSPDVE
DYEERPASGWDLRAEGYLPSYPQLGAKMVYEQYYGNEVGLFGKDERQKNP
HALTAGVSWTPVPLLKLSAEQRAGKAGEHDTRFGAEASYRIGDSLRSQLD
PDAVGPCAAWRAAATT
>SCV34 TrbH protein
MAVSAPVLSSQSTHTFKLPGVVRDNNQSPAACVTGNHLIEWPDRASLTGK
LCPDLPRMGGGRRVIIQNIKSGNKPLYHSEISFGHLAFFGTVNQLHQGDR
ADTHSPLVQVKTLPDAGRFVFYRENADVSIQHKLQHQNDSRSCTEGCSRL
SIKSALTLFPSNHSSHDSPAGVMIRVRPTAITSTRFTCSGNATAFGSLTA
>SC145 hypothetical protein
MKLYVIGNGFDVHHGLDTRYTSFGLYLKNNYWETYELLLDYYGFADLDPD
FPTTMSDPLWSEFETSMSLLDKDSVLEANMDAMPNYSSDDFRDRDRYTLE
IEMERILGLLTTELYKAFKEFILAVQFPQFDHSRSVNIDRDAVYLTFNYT
DTLSQYYAIPDKNVLFIHGKADEHIDELILGHGVDPENFKEKPAEPPSGL
SDEDLERWMEYQSDQYDYSFERGKDAVNQYFSATFKGTDQIIKNNEDFFA
KLGNIDEVYVLGHSLADVDLPYFKKLEQSVRPDAKWVVTFYDPDDEKVHG
ETLTGLGIANVAVVRMEQI
>SC129 hypothetical protein
MNYAGHEKLRADVAEVANTMCDLRARLNDMEHRCRFDSDVLVERLTRQTL
YRANRLFMEAYTEILELDSCFKD
>SC121 hypothetical protein
MDLSDGGPFTASRDSCPGADAKFLAGFMAKNFVAGASHPGQMEREGVTSA
PRKVHSGKQAAGVAAGGKTAAPSGAVSSHTVRCAGSAGHGATGAARPERS
LQAGFSRIGP
>SC124 hypothetical protein
MNISTETREILRNYKAVINARRREMGQKPLTTAQIVDEICDFVANQQAVF
LGGHYILQGSRNR
>SC105 hypothetical protein
MQGAPHPARLKKNDGTTMACGGHWHTTPAQVKTEKTARHLQPKKVKFLHF
FSLIYRADLAIVMIK
>SCV10 hypothetical protein YeeJ
MQATTPLVLTLTPAMTARADIASRIGQSPFSADRDAALAGIRTVPYTLKK
GETVAQAHGLTVPQLKKLNGLRTFARGFDHLQAGDELDVPAVPLTGGKGD
NNRHDARGPFAADRENEDAQAQQMVGMASQAGSFLASHPDGQAAAGMVTV
TSGRLTCSGRTSSSTMTSPATTPAPASAENTGATSLSFPPTPMSA
>SCV29 hypothetical protein
MLAHGLRFKIVSWDVSTSVSSPFPAPSGMSSGSGSCGLGLSHPRRQHVLR
IRTVCCTGIVKRSHFGVATVARFSRIAGAGQRRERRRRQDKLLLPSLVVA
RSLRSIPARISTAAFSRVSCPAPPGITVTFFSVAFLTGIIIITGEPCSAS
VCASRLLASSMLFVFTFPPLILCPFRTAAAALLISSLVSAPS
>SCV22 protein YgiW precursor
MKKTLITLIITTLSFSSLARQTDIVSSVEQPGYVQGGFTGLAPTQTSVSQ
AKKQWDDAWVVLEGNIIRQVGHELYEFRDSSGTVYVDIDNKYWMGQTASP
ADKVHIEGEVDRDWDGIKIDVKNIRVMK
>SCV44 hypothetical protein
MENMLKTMREHFPVRTVVIEQSLDMFGVQRNEDGTLLFPRHSDKRFPVAR
IRKRLAAYGWRGERLNKEVARIVAGPDILPERRDYSIIFPYGWEKKQLTR
EMRRMFLRKTG
>SC150 hypothetical protein
MGSKIAGVVFLYFAVWGMASVIELQYWRNADTYPVWLTKHLRISPENNDK
SLIRIEMETFDKVCGDKRGLLAVTQKQNGMFMRCDDSPGFFSWFSGVYKV
N
>SC042 hypothetical protein
MFEKIEKKYWQDKIKKISKFRVIKMQDLDDAKVIEVLAKVLTKKGISKVK
HNSYISKVLQITTSAGYKKLNGATKWEVQQLIKVVQSVGMDMIEFFRIYY
EDNKEIQDAIWNDGRSEHSCKVCLYPEGSEVNTEYSALKVADQWNIFPTN
DLKDELLYESKRSIESIVINPKVISSTKYRIALLDDDANITESLKEILSN
DYYQVDTYDNLKKLTQAVATNPYDAYVLDWIIGDETSFKLVKAIRSSKNK
NAMIIVLTGQLSGIKDDEIVKSIHDYDIVGPYEKPIKSGIIKSNIEKYFA
R
>SCV47 hypothetical protein
MHLIPCAFLSVLIFLVVLLHSPDILQNCLPVVLCLLPVAFGNGSNNVRIT
VMAAVHASLLAKDINNNILAALPVEKMSAQALYNQRSIVQPGLISDLEEE
VQHLIRFVALDGRTESGCPLWCFPGVIPLVTQFCLVVGFQVFLCIPVVAE
KVFPGPFDD
>SC082 hypothetical protein
MRKISYGRKSQLSQPAHSLFSLCSGINYFQHKKEKTMNLVDAFVKKVISE
PYEEYGKWWIDVEYISWGVPGKTRLMFESKEQALEVKEGYKFLT
>SC054 hypothetical protein
MSLALFDFDGTITTRETMPDFVRRSVSRRRLLVGQLLLAPLVLGYKIGIL
SGTLIRRVIVRFAYSGIPASVLEAQGRDFAQSYLPNTLRGEAMQRIAWHK
AQGHKVVVVSGGLEAYLEPWCDSHGVELICSSLQQSDGILTGRYEGRQCV
LTEKARLVRERYDLATYPTIYAYGDTPEDLDMLAIATKQYYQWQEVGVAG
GR
>SC002 hypothetical protein
MLFPNFSLGEEYEHAPPATNRQISPYLPSGRFRTGLPVEGLAIERGDLFY
ACPRASVFYGTALDADLRTRGVSTLVMAGISTTGVVLSSVAWASDADYDV
RLVQDCCYDPDRDAHEALLRSGFGGRVQVV
>SC141 hypothetical protein
MLNLVGRNTNGDIVSYEPMEQVNRFLLAHHLNDNKQESEQYSKGLVHYFS
FLLELQRLWDSEYDEDLYEEFIDLPRPSWDKFPFRKSDKATYQYREALIK
AVLEPDTPNHAIARTTAIAYMGAVVKFYSFHIRNGYKFNNPPFEHEVVSI
QYQGGSTSIGAYLSKDIHTTDLRLNLGKSKRNDGGALSSARRDLKPLTNK
EWLAVEDILTNTRRVIKKVAGETTTSNLSIEYCLFFLVARYTGLRKEEVA
SLHKGQVAKPGEDKKAMKLGVGGQYGSLTKTAGSGNKSRQTIIPTRIMHL
LYEYTRSDRYKKRISKFKEHCELKRQKGELGYFDGEDGVDDTKEYLFISQ
TGVPLFTKLPEANARWNEIRATVNILSGFNLTATIHNLRATFAVSLFRLL
LRRVTPDKALALVSECLGHGEESVTLIYLKIAQDEPTGDEIYEDILEFIG
AFEDADISGMESQ
>SC171 hypothetical protein
METPFVIKFIETKWHDKQTLVSVSESEYSLKLEQTGNNAFSAHTTIYPKV
DELRFAQLAIKTKQGDQSPPYIVMPNGDRKQLESITDPASNAVWWVEPAH
WDAKQRVWRSEARRTAGQITFVIGNSTLKLDIDISEQTKSDLSRYLSDFK
ADLWELILDENSHITGDAKNSQVAAIDQEALSLVASILSNAQTILKKPKV
ELKEIQALKPAKEVRPVPRTFMEICTKGSRKHLTSRASEPSYNVPENQYV
LYVVLSTLSIVKQLVKVAESKKSRFSGAIEKLNERLDSLKDYRIINRDLV
VKDLERLKKRFDTEVINAELASQLGEINANKYFSQNHAAKGYLRLEKTTG
SENEWWAKIKPSQHDDWQQFELDGYTIFSSGEYYASLFQPYSDYDMVAIM
PPPSRRGTASILYPEYISKLTILADSRSLLRDKEKFSKLREQGIALNENG
WKTKLTPEELSEQEKERETIRKRLSYFASEHEKVGIVHQVLAPKIKPFQQ
VEKEWRQCKVKSKSTFPNSMTFVQNPAYQAVHSGFKKLKEQIGLADEDIL
LSLEKIEAIGLVNMPLIYERWCLLQIIKVLTQAFRYLPEDNWKRKLIANI
QGNEEQISIQFFNPNVSRKVTLQYEPFLANGKRPDFVLDVEAITKSGNQI
SKRLVVDAKYYSAAYLKLRGGIGGVIHELYNGKDYSECQENSVFVLHPVL
DAVEKVVSPQEWAKDSYLGELSMFDWEPAYHQRQATNYGAVCANPMKSQR
YLDEIQRMLGMFLQYGIEDNTSFRGASDDTHAVNFCVSCGSEKVVDVTKS
MSSNNQKRWYRCNECTHFTVYTHCGTCNTRLIKNGEYWTYLSLMPMSSIN
IKCPNCESPV
>SC146 hypothetical protein
MLLVGYPLNCFINLFDDNLHHVFKVLIMELQTRLVFIDTSAFEKKNYQFG
QHALGRLQELIEDEKIHLLITDVTRKEIDSHLKHKSEEAASMIKKMQRDA
MFLRNTPDLDCHGIFTKIKGDDIYAVISEKFDDFVENGYVETIDVSTVNP
QVVFDAYFNNLPPFGKESKKHEFPDAFALEAIKQVSLARGYSAYIVSDDG
DMKSFCEKEDNFIHLENVDDLIDLIVRSDKAYEEPAKFADEIFEQLLEQI
KADALQALEDGEFNYENADPFDETINSIEINSVNVGKKNLQNVDAEWAEY
EVEFEVVVTADYRFSDYDRSPWDPEDKQYVFVLSNESTVKHKETYTAHIT
LAYTDGLKANAYIEEFYFQDTYFELTDNDSEVISFKELDINGE
>SC163 hypothetical protein
MHTRTKLPAPLAADVAALIKNGMDFLDKAREEFEAKQYKHSVVSFWIAVE
ILLKVPLASEHWTLVCSGKKVSRKSYLAGDFQSVSFDDVCTRLRDILEKP
LPKETEAVFNTIRNHRNRVVHFFHTAFSDSEVETILAEQARAWFALNRLM
REDWQQHFASPHNWALALGETQLLRGNEFYAEARLKHIQPELEQLATEGA
EFHPCTICHKPAAIMEILAVGKNGPTVYEQTCRVCFHSERHVKFTCPECD
TDQVLPVEEEDDDTFICRTCNAELSRYNLLDEENFRHVDEMMYPDGLANC
AHCEGHETVCVFGENFLCTRCLEIHTGYDTCEICGTPCEAMGETMRYTGC
PHCADED
>SC152 hypothetical protein
MTTVPPASPVSLSAWDHGPTCIAGQVICLGPRSHLHRLSAYLPGTTVPPV
SPVRLSAWDHGPTCIVCQLICLGPRSHLLRRSAYLPGTTVPPVSPVRLSV
WDHGATCIAGQVICLGPRSHLHRRHIIS
>SC164 hypothetical protein
MLTLGRISLVMHLLYGLVHPKILQWLMVHEKFECNEVNTKVFALDVVASQ
QLRFILYTH
>SC107 hypothetical protein
MTIAERLIQKGFDEGFDEGFKEGFKKGALEVAREAACRLRDMGWTPERIQ
EAAGLSGEELKKLFPDEQ
>SC135 hypothetical protein
MRKRMRTITTREQLLVNGKVRERIATHIVTGAHGYETLCTSGYNLQYNKE
RVLIENCEKVADGELPVTCHTCFSIWQDVHRFKPGDFDTESGKGNFTDTE
LTKITIGQEKTPNAC
>SC053 aadA2, streptomycin 3''-adenyl transferase
MLDIMRVAVTIEISNQLSEVLSVIERHLESTLLAVHLYGSAVDGGLKPYS
DIDLLVTVAVKLDETTRRALLNDLMEASAFPGESETLRAIEVTLVVHDDI
IPWRYPAKRELQFGEWQRNDILAGIFEPAMIDIDLAILLTKAREHSVALV
GPAAEEFFDPVPEQDLFEALRETLKLWNSQPDWAGDERNVVLTLSRIWYS
AITGKIAPKDVAADWAIKRLPAQYQPVLLEAKQAYLGQKEDHLASRADHL
EEFIRFVKGEIIKSVGK
>SC032 aadA2, streptomycin/spectinomycin 3' adenyl transferase
MREAVTIEISNQLSEVLSVIERHLESTLLAVHLYGSAVDGGLKPYSDIDL
LVTVAVKLDETTRRALLNDLMEASAFPGESETLRAIEVTLVVHDDIIPWR
YPAKRELQFGEWQRNDILAGIFEPAMIDIDLAILLTKAREHSVALVGPAA
EEFFDPVPEQDLFEALRETLKLWNSQPDWAGDERNVVLTLSRIWYSAITG
KIAPKDVAADWAIKRLPAQYQPVLLEAKQAYLGQKEDHLASRADHLEEFI
RFVKGEIIKSVGK
>SC051 aadA2, aminoglycoside adenyltransferase
MREAVIAEVSTQLSEVVGVIERHLEPTLLAVHLYGSAVDGGLKPHSDIDL
LVTVTVRLDETTRRALINDLLETSASPGESEILRAVEVTIVVHDDIIPWR
YPAKRELQFGEWQRNDILAGIFEPATIDIDLAILLTKAREHSVALVGPAA
EELFDPVPEQDLFEALNETLTLWNSPPDWAGDERNVVLTLSRIWYSAVTG
KIAPKDVAADWAMERLPAQYQPVILEARQAYLGQEEDRLASRADQLEEFV
HYVKGEITKVVGK
>SC090 ampC, extended spectrum beta-lactamase
MSTRCKSNTLIASDGPGHLFAFNYGTDFMMKKSLCCALLLTASFSTFAAA
KTEQQIADIVNRTITPLMQEQAIPGMAVAVIYQGKPYYFTWGKADIANNH
PVTQQTLFELGSVSKTFNGVLGGDAIARGEIKLSDPVTKYWPELTGKQWQ
GIRLLHLATYTAGGLPLQIPDDVRDKAALLHFYQNWQPQWTPGAKRLYAN
SSIGLFGALAVKPSGMSYEEAMTRRVLQPLKLAHTWITVPQNEQKDYAWG
YREGKPVHVSPGQLDAEAYGVKSSVIDMARWVQANMDASHVQEKTLQQGI
ALAQSRYWRIGDMYQGLGWEMLNWPLKADSIINGSDSKVALAALPAVEVN
PPAPAVKASWVHKTGSTGGFGSYVAFVPEKNLGIVMLANKSYPNPVRVEA
AWRILEKLQ
>SC026 aph, aminoglycoside 3'-phosphotransferase
MSHIQRETSCSRPRLNSNMDADLYGYKWARDNVGQSGATIYRLYGKPDAP
ELFLKHGKGSVANDVTDEMVRLNWLTEFMPLPTIKHFIRTPDDAWLLTTA
IPGKTAFQVLEEYPDSGENIVDALAVFLRRLHSIPVCNCPFNSDRVFRLA
QAQSRMNNGLVDASDFDDERNGWPVEQVWKEMHKLLPFSPDSVVTHGDFS
LDNLIFDEGKLIGCIDVGRVGIADRYQDLAILWNCLGEFSPSLQKRLFQK
YGIDNPDMNKLQFHLMLDEFF
>SC114 ard, anti-restriction protein
MRRVLCTRRNKEVAGGSAFTRPCGAWPCGFSALWPQRRVRAVPCLHLSRA
GRDARVRFAAAVTRSLLPVCRDFPVVRPLRFRGLTLQLPSAVCVRLRLPL
RPCIPALSPGFYGGTAPPGVAEHVTMEDSGMSVVAPAVYVGTWHKYNCGS
IAGRWFDLTTFDDERDFFAACRALHQDEADPELMFQDYEGFPGNMASECH
INWAWVEGFRRARDEGCEEAYRLWVDDTGETDFDTFRDAWWGEADSEEAF
AVEFASDTGLLADVPETVALYFDYEAYARDLFLDSFTFIDGHVFRR
>SC091 blc, outer membrane lipoprotein BLC precursor
MRILPVVAAVTAAFLVVACSSPTPPKGVTVVNNFDAKRYLGTWYEIARFD
HRFERGLDKVTATYSLRDDGGINVINKGYNPDREMWQKTEGKAYFTGDPS
RAALKVSFFGPFYGGYNVIALDREYRHALVCGPDRDYLWILSRTPTISDE
MKQQMLAIATREGFEVNKLIWVKQPGA
>SC156 blc, putative lipoprotein
MKLWPVVTGVAIALTLVACKSPTPPKGVQPISGFDASRYLGKWYEVARLE
NRFERGLEQVTATYGKRSDGGISVLNRGYDPVKNKWNESEGKAYFTGEPT
TAALKVSFLGSVWI
>SC011 cat2, chloramphenicol acetyl transferase II
MNFTRIDLNTWNRREHFALYRQQIKCGFSLTTKLDITALRTALAKTGYKF
YPLMIYLISRAVNQFPEFRMAMKDNELIYWEQSDPVFTVFHKETETFSAL
SCRYFPDLSEFMAGYNAVTAEYQHDTRLFPQGNLPENHLNISSLPWVSFD
GFNLNITGNDDYFSPVFTMAKFQQEGDRVLLPVSVQVHHAVCDGFHAARF
INTLQLMCDNILK
>SCV18 ccdA, CcdA protein
MKQRITVTVDSDSYQLLKAYDVNISGLVSTTMQNEARRLRAERWQEENRE
GMAEVASFIEANGSFADDNRNW
>SCV17 ccdB, cytotoxic protein CcdB
MQFKVYTCKRESRYRLFVDVQSDIIDTPERRMAVPLVSARLLSEKVPRDL
YPVMHIGDEPYRLLTTDMTSVPATVIGEEVADLSLRENDIKNAINLMFRG
I
>SC109 ccgA2, CcgAII protein
MVSPQKTEKGNRFSFIQGPGGLEITMKSCFDTDKAFHMSFDEVTFNADRI
RSKLELLSEELMPRCGLVYELYQRRLSAAIDNFVATLPTEQHLLVLELAR
EEFDYLSAEEIAEEIRRDSERGYCCHGFERNCCPLGCGDLGDDCCDAADL
MEEP
>SC052 cmlA, chloramphenicol resistance protein
MSSKNFSWRYSLAATVLLLSPFDLLASLGMDMYLPAVPFMPNALGTTAST
IQLTLTTYLVMIGAGQLLFGPLSDRLGRRPVLLGGGLAYVVASMGLALTS
SAEVFLGLRILQACGASACLVSTFATVRDIYAGREESNVIYGILGSMLAM
VPAVGPLLGALVDMWLGWRAIFAFLGLGMIAASAAAWRFWPETRVQRVAG
LQWSQLLLPVKCLNFWLYTLCYAAGMGSFFVFFSIAPGLMMGRQGVSQLG
FSLLFATVAIAMVFTARFMGRVIPKWGSPSVLRMGMGCLIAGAVLLAITE
IWALQSVLGFIAPMWLVGIGVATAVSVAPNGALRGFDHVAGTVTAVYFCL
GGVLLGSIGTLIISLLPRNTAWPVVVYCLTLATVVLGLSCVSRVKGSRGQ
GEHDVVALQSAESTSNPNR
>SC033 ebr, ORF3-QacEdelta1 fusion protein
MKGWLFLVIAIVGEVIATSALKSSEGFTKLAPSAVVIIGYGIAFYFLSLV
LKSIPVGVAYAVWSGLGVVIITAIAWLLHGQKLDAWGFVGMGLIIAAFLL
ARSPSWKSLRRPTPW
>SC080 exeAB, surface exclusion protein
MKAKKETTDRFPTWWLFYYVLRKAYFFLGIPFFLFCALGFTEMLCSDRYF
GNKVEDYVVTFGSWFLLLAPGIWMYSRAKTRREKIRKVVQTIKESGFYSP
EKGYEGLSLTQGAYFGIDLKNGTMLYVRIYPGNIMDVIGFDIHNFTRTVT
DDKTLEIHTKYINLPMVPIPSWCTHPETASNTMHAMASRGYDYPVDFPRL
IQEKRKEWEQIAGIPVAEVF
>SCV30 fio2, fertility inhibition protein
MTEQKRPVLTLKRKTDGEAPARSRKTIINVTTPPKWKVKKQQLADRAVRE
ASLAEKKARARKDLSIYLRFQSVEEAVSTLKPWWPGLFDGDTPRLFACGV
REALFEDASRRGIPLSHKKIIRALKAIARSEAYLSAMKAGACRYDTEGYV
TEHITVEEEQYALARLAKVRAQNARKAELRAVLAQTV
>SC115 gp43, hypothetical protein
MMKSDEKYQVPAWMRPLLPLLCNTGGNDPEELLNDTETTASANVVRYVLI
VAVRSQVDLLQLLYRKGLLRTEIPGGFSPEEAQALLDNLVRSHISKALSG
ERMAARDRNADLAWIGQQLVDAAWFVRATLEAHGMSVGNESPPAPPETMP
DIQTRELVMLIKRLASSLKAVKPDSCVVREAQDWLRDRKLVDITDILR
>SC138 impA, ImpA
MSTVYHRPADPSGDDSYVRPLFADRCQAGFPSPATDYAEQELDLNSYCIS
RPAATFFLRASGESMNQAGVQNGDLLVVDRAEKPQHGDIVIAEIDGEFTV
KRLLLRPRPALEPVSDSPEFRTLYPENICIFGVVTHVIHRTRELR
>SC137 impC, ImpC
MIRIEILFDRQSTKNLKSGTLQALQNEIEQRLKPHYPEIWLRIDQGSAPS
VSVTGARNDKDKERILSLLEEIWQDDSWLPAA
>SC062 insA, transposition protein
MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR
>SC063 insB, transposition protein
MPGNRPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SC177 insB, IS1 protein InsB
MLPWLPSPSVVPPAQLLKAWCVTVKVLPDISAISALTAVKHGSYSSLTPP
LRPQSVNSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRIRRTVV
AHVFGERTMATLEHLLSLPSVFQGA
>SC176 int2, integrase/recombinase
MILIPDDEPELSLPAASEEFLPALSGENAPVSPARAYLLSLNSHRSRQTM
ASFLNIVAVMLGAASLESCSWGSLRRHHVMAVTELLRDTGRATATVNTYL
SALKGVAKEAWMLRLMDVESFQHIRAVRNLRGSRLPSGRALPQGEIRALF
AVCEADRSCLGARDAAMLAVILGCGLRRSEVVSLDLRDVVTQDRALRVLG
KGNKERLAYVPAGAWQRLQIWIDEIRGETPGPLFTRIRRFGDVTLNRLTD
QAVYHILQVRQGQAGITKCSPHDLRRTFATAMLDNGEDLITVKDAMGHAS
VTTTQQYDRRGEQRLQDARDRLNLI
>SC029 int2, Int
MKTATAPLPPLRSVKVLDQLRERIRYLHYSLRTEQAYVNWVRAFIRFHGV
RHPATLGSSEVEAFLSWLANERKVSVSTHRQALAALLFFYGKVLCTDLPW
LQEIGRPRPSRRLPVVLTPDEVVRILGFLEGEHRLFAQLLYGTGMRISEG
LQLRVKDLDFDHGTIIVREGKGSKDRALMLPESLAPSLREQLSRARAWWL
KDQAEGRSGVALPDALERKYPRAGHSWPWFWVFAQHTHSTDPRSGVVRRH
HMYDQTFQRAFKRAVEQAGITKPATPHTLRHSFATALLRSGYDIRTVQDL
LGHSDVSTTMIYTHVLKVGGAGVRSPLDALPPLTSER
>SC056 int2, integrase
MKTATAPLPPLRSVKVLDQLRERIRYLHYSLRTEQAYVHWVRAFIRFHGV
RHPATLGSSEVEAFLSWLANERKVSVSTHRQALAALLFFYGKVLCTDLPW
LQEIGRPRPSRRLPVVLTPDEVVRILGFLEGEHRLFAQLLYGTGMRISEG
LQLRVKDLDFDHGTIIVREGKGSKDRALMLPESLAPSLREQLSRARAWWL
KDQAEGRSGVALPDALERKYPRAGHSWPWFWVFAQHTHSTDPRSGVVRRH
HMYDQTFQRAFKRAVEQAGITKPATPHTLRHSFATALLRSGYDIRTVQDL
LGHSDVSTTMIYTHVLKVGGAGVRSPLDALPPLTSER
>SC168 istB, transposase/IS protein
MVELQHQRLMVLAEQLQLDSLIGAAPALSQQAVDQEWSYMDFLEHLLHEE
KLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQIQSLRSLSFIE
RNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAADLLLQLSTSQR
QGRYKTTLNRGVMAPKLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAMI
LTSNLPFGQWDQTFAGDAALTSAMLDRILHHSHVVQIKGESYRLKQKRKA
GVIAEANPE
>SC126 klcA, probable antirestriction protein KlcA
MQYAKPVTLNVEECDRLSFLPYLFGNDFLYAEAYVYALAQKMMPEYQGGF
WHFIRLPDGGGYMMPDGDRFHMVNGANWFDRTVSADAAGIILTSLVINRQ
LWLYHDSGDAGLTQLYRMRDAQLWRHIEFHPECNAIYAALD
>SC130 mboIB, putative site-specific DNA methyl transferase
MSRFVLGNCIDVMARIPDNAIDFILTDPPYLVGFRDRQGRTIAGDKTDEW
LQPACNEMYRVLKKDALMVSFYGWNRVDRFMSAWKNAGFSVVGHLVFTKN
YTSKAAYVGYRHECAYILAKGRPALPQKPLPDVQGWKYSGNRHHPTEKPV
TSLQPLIESFTHPNAIVLDPFAGSGSTCVAALQSGRRYIGIELLEQYHRA
GQQRLAAVQRAMQQGAANDNWFEPEAA
>SC045 mefE, macrolide-efflux determinant
MAYVQESIAPEMMGKVFSLLMTAMTLSMPIGLLVAGPVVEVIGVNTWFFW
SGVALIVNAVLCRILTRRYDKVTMKPQVD
>SC020 merA, mercuric reductase
MSTLKITGMTCDSCAVHVKDALEKVPGVQSADVSYAKGSAKLAIEVGTSP
DALTAAVAGLGYRATLADAPSVSTPGGLLDKMRDLLGRNDKTGSSGALHI
AVIGSGGAAMAAALKAVEQGARVTLIERGTIGGTCVNVGCVPSKIMIRAA
HIAHLRRESPFDGGIAATTPTIQRTALLAQQQARVDELRHAKYEGILEGN
PAITVLHGSARFKDNRNLIVQLNDGGERVVAFDRCLIATGASPAVPPIPG
LKDTPYWTSTEALVSETIPKRLAVIGSSVVALELAQAFARLGAKVTILAR
STLFFREDPAIGEAVTAAFRMEGIEVREHTQASQVAYINGEGDGEFVLTT
AHGELRADKLLVATGRAPNTRKLALDATGVTLTPQGAIVIDPGMRTSVEH
IYAAGDCTDQPQFVYVAAAAGTRAAINMTGGDAALNLTAMPAVVFTDPQV
ATVGYSEAEAHHDGIKTDSRTLTLDNVPRALANFDTRGFIKLVVEEGSGR
LIGVQAVAPEAGELIQTAALAIRNRMTVQELADQLFPYLTMVEGLKLAAQ
TFNKDVKQLSCCAG
>SC021 merD, MerD
MSAYTVSQLAHNAGVSVHIVRDYLVRGLLRPVACTTGGYGVFDDAALQRL
CFVRAAFEAGIGLDALARLCRALDAADGAQAAAQLAVLRQLVERRRAALA
HLDAQLASMPAERAHEEALP
>SC022 merE, MerE
MNAPDKLPPETRQPVSGYLWGALAVLTCPCHLPILAAVLAGTTAGAFLGE
HWGVAALALTGLFVLAVTRLLRAFRGGS
>SCV48 mig5, carbonic anhydrase chloroplast precursor
MEQNQPAQPSRRAILKQTLAVSALSVTGLAALSVPTISFAASLSKEERDG
MTPDAVIEHFKQGNLRFRENRPAKHDYLAQKRNSIAGQYPAAVILSCIDS
RAPAEIVLDAGIGETFNSRVAGNISNRDMLGSMEFACAVAGAKVVLVIGH
TRCGAVRCAIDNAELGNLTGLLDEIKPAIAKTEYSGERKGSNYDFVDAVA
RKNVELTIENIRKNSPVLKQLEDEKKIKIVGSMYHLTGGKVEFFEV
>SC157 mlrA, putative transcriptional regulator
MAYSIGEFARLSGITATTLRAWQRRYGLLKPQRTEGGHRQYSDEDVQQAL
KILDWVKKGVPIGQVKPLLERPTSRLTNNWLTLQQSMLQRLQDGKIESLR
QMIYDAGREYPRPELVANVLRPLRSQVSANVAAAMTLREILDGIIIAYTS
FCLEGDKKAPGDNILLSGWNLNDPCEVWLEALTRTGQGHRIDILPTPPAT
LAPEIFPDRHWLLVTRGKLTAARKKQIEQWQQQVSLEVIIL
>SC101 mobC, MobC-like protein
MASKKFYSDDDIQLAKAALSELPDLTAQRKTLRDFLDAIRDDIIILVRTK
GYTLADVRDTLQNAGYEVGEKALRDIIREAESKKTSRRSSS
>SC175 mpi, putative mannose-6-phosphate isomerase
MWLMKTLLTSGCTNQRGAGQIHCTIFSPTASASLRMLPEQHAAEARYPIP
VEDFHFSIFTAVNNQTVMTSSAEIVLVIEGTATLSHASGESLLLHVGLSA
FIPASIGRWLLTTTGKVCRVSC
>SC100 nikA, relaxosome component
MSDSAVRKKSEVRQKTVVRTLRFSPVEDETIRKKAEDSGLTVSAYIRNAA
LNKRINSRTDDAFLKELMRLGRMQKHLFVQGKRTGDKEYAEVLVAITELT
NTLRKQLMEG
>SC099 nikB, NikB-like protein
MNAVIPKKRRDGKSSFEDLVSYVSVRDDMTDEELNLSSSSQAEQPHRSRF
SRLVDYATRLRNESFVALVDVMKDGCEWVNFYGVTCFHNCTSLETAAADM
EYIAQQAHYAKDNTDPVFHYILSWQAHESPRPEQIYDSVRHTLKSLGLGE
HQYVSAVHTDTDNLHVHVAVNRVHPVTGYLNCLSWSQEKLSRACRELELK
HGFAPDNGCWVHAPGNRIVRKTAVERDRQNAWTRGKKQTFREYVAQTAVA
GLRSEPVNDWLSLHRRLAEDGLYLSQMDGKFLVMDGWDRNREGVQLDSFG
PSWCAEKLMKKMGDYTPVPKDIFSQVEAPGRYNPDFIAADVRPEKIAETE
SLQQYACRHLGERLPEMAREGRLENCQAIHRTLAEAGLWMRVQHGHLVIC
DGYDHNQTPVRADSVWSLLTLDNVNQLDGGWQPVPTDIFRQVTPTERFRG
RRMESCPATDKEWHRMRTGTGPQGAIKRELFSDKESLWGYSISHCSPQIE
EMITQGEFTWQRCHELFAQQGLMLQKQHHGLVVVDAFNHEQTPVKASSIH
PDLTLGRAEPQAGPFVSAPADLFDRVQPESRYNPELAVSDRYGVSSKRDP
MLRRQRREARAEARADLRARYLAWREQWRKPDLRYGERCREIHQACRLRK
SHIRAQYDDPALRKLHYHIAEVQRMQALIRLKEDIRDERQKLIADGKWYP
PSYRQWVEIQAAQGDRAAVSQLRGWDYRDRRKDRSRTTTTDRCVVLCEPG
GTPVYGNTGDLEARLQKNGSVRFRDRRTGEFVCTDYGDRVVFRNHHDRNA
LADKLDLIAPVLFGRDPRMGFEPEGNDKQFNQVFAEMVAWHNVTGRTGHE
DYRITRPDVDHHREGSERYYRDYIAANSNDDASLPPPEQDKRWEPPSPG
>SC060 paa4, resolvase
MRRTKPVAAPMVARVYLRVSTDAQDLERQEAITTAAKAAGYYVAGIYREK
ASGARADRPELLRMIGDLQPGEVVIAEKIDRISRLPLPEAERLVASIQAK
GASLAVPGVVDLSDLAAEAQGVAKIVLEAVQIMLFRLALQMARDDYEDRR
ERQRQGIELARQAGRYKGRRADPKRRAQVVALRKSGYSINKTAELAGYSA
AQVKRIWAEVSQAEAKQHGAFVEDALTEADALAAVGQDERQEERA
>SCV41 parA, plasmid partition protein A
MENIEQLRKVATRAGKLLTSLSESIRQQKEELKLTEFYQEYSKAALYKLP
KLSKGSVEYAVAEMEASGYIFKKKPSGNTMKYAMTIQNVIDLYFHRKVPK
YRDRFDKAFTIFVCNLKGGGSKTVSTASLSHAFRAHPQLLFEDLRILAID
FDPQASLTMFLSHENSVGLVENTAAQAMLQNVSREELLSDFIVSSIIPGV
DVIPASIDDAFLAEGWKGLCEEHLPGQNIHAVLKENIIDKLRYDYDFIFL
DSGPHLDAFLKNCIGAADLMLTPLPPATVDFHSSLKFVASLPALIDSIEQ
DGHTCNLIGNVGFMSKILNKSDHKICHSQAKEVFGADMLDMVLPRLDGFE
RCGETFDTVISANPATYDGSTEALKSAKSAAEDFAKAVFDRIEFIRTNGG
M
>SC132 parB, ParB
MADNRKCSFYIYPERNAADRVADRFLEKLPQKERGRAMRAMMLCGAALMK
QDERLPFLIAEFLTDSTSMQDIQRIISSTLPQQENGEVVRLLEAFLQSAG
NNAKAILPAVDSATQEISAPVDQNLLETRNNIKNLFPDDE
>SCV40 parB, plasmid partition protein B
MMSNERRKTIGRQLNTQASMVEMTDTQRSQVFTLKTGRKITFRFVRVPAS
DVESKTFVNQETNGRDQLALTRESLKSIIQTIKFQQFFPCIGIQQGERIE
ILDGSRRRASAIYIRTGLDVMVTNELLSADEARHLAKDIQTAKEHNLREI
GLRLMALKESGFNQKEIAELEGLSQAKVTRALQAAAVPQELISLFPVQSE
LSFSDYKILLEVNEKLSEKGLTSEGLIQSVSDQHDAILSDYERPDDEQKS
SILKLISQASQALIAPPPKEKSVISALWTFEEKDKFARKRVKGRTLTYEF
SRMSKVVQDELDKAINEVLERNLSQ
>SC003 pecM, hypothetical protein
MSLRTPDLLFTAIAPAIWGSTYIVTTQYLPNFSPMTVAMLRALPAGLLLV
MIVRQIPTGIWWMRIFILGALNISLFWSLLFISVYRLPGGVAATVGAVQP
LMVVFISAALLGSPIRLMAVLGAICGTAGVALLVLTPNAALDPVGVAAGL
AGAVSMAFGTVLTRKWQPPVPLLTFTAWQLAAGGLLLVPVALVFDPPIPM
PTGTNVLGLAWLGLIGAGLTYFLWFRGISRLEPTVVSLLGFLSPGTAVLL
GWLFLDQTLSALQIIGVLLVIGSIWLGQRSNRTPRARIACRKSP
>SCV24 pefA, F107 fimbrial protein precursor
MKKSIIASIIALGVLGGTAHAANEVTFLGSVSATTCDLTTSVNGAAQPNQ
VVQLGTVQANQTGNAVEFAMKPADPTAQACGNLAQKTATITWASAALDGE
GFGATSGTAADAKVLVDSVNSKAPGAVNANASSVDFDGANLTTDGLKFTA
KLKGGQTEGDFKSVASFAVTYK
>SCV23 pefB, major pilu subunit operon regulatory protein PapB
MMLNRKDADYYLGKEIMLARIRRGALIPAKVNEEHFWLLIGISSIHSEKI
IQALRDYLVFGVSRKDVCERYEVNNGYFSTSLNRLSRISQAAAQMVVYYS
>SCV25 pefC, outer membrane usher protein PefC precursor
MSFHHRVFKLSALSLALFSHLSFASTDSELNLDFLQGMGAIPSVLKSGSD
FPAGQYYVDVIVNQENVGKARLSITPQEESANALCLSPEWLKAAGVPVRL
EGYASTLNAAGQCYVLSRNPYTRVDFSYGSQSLVFSIPQSFLVGKTDPSR
WDYGVPAARLKYSANASQTSGQSTSAYANADLMVNLGRWVLASNMSASRY
ADGSGEFTARDITLSTAISQVQGDLLLGKSQTRSALFSDFGFYGAALRSN
SNMLPWEARGYAPLITGVANSTSRVTISQNGYTVYSKVVPPGPYQLDDVR
SVGNGDLVVTVEDASGHKTTTVYPVTTLPTLLRPGEIEYNVAAGRKSSNY
QLKKPFRDGESGTFWMGSVGYGFDSTTLNAASILHGKYQAGGVSVTQALG
GFGAVSAGMNLSQAKYDNRDNKRGHSVSAKYAKSFSDSSDLQLLAYRYQS
KGYVEFADFYSTDRYTRYNTKSRYEMRFSQRLGNSNLNLAGWQEDYWWMK
GKATGGDVSLSTTILDGVSVFLNGSYSKRPYLDKPDYSTSLSFSIPFTLG
GVRHYSSTGLSYSSSGRMGMNSGVSASPTDRLSYGLNTNLSDKGDRSLSG
NLSYGFDAIQTNMMLSQGRDNTTVSGSVSGTILGTADSGLMMTKETGNTL
GVARIPGVKGVRINGSAPTNSKGYTVVNLSDYSLNRVSVDMENVPDDLEL
QTTSFNVVPTEKAVVYREFGAEHVLRYILRVKERDGRILNGGSAQTEQGL
DAGFIAGNGVLLMNMLSAPSRVSVERGDGSVCHFSVKGIVPNTGKVQEVY
CE
>SCV26 pefD, chaperone protein FanE precursor
MMKWGLVSLLSLAVSGQAMAAFVLNGTRFIYEEGRKNTSFEVTNQADETF
GGQVWIDNTTQGSSTVYMVPAPPFFKVRPKEKQIIRIMKTDSALPSDRES
LFWLNVQEIPPKPKASEGNVLAVAVNTKVKLIYRPKALVEGRRNAEKNLQ
ITHRGGEAYLKNPTPYYFAVTGVKLNGQPVRLNDRVMNEIAQLAPKSEVA
LGKLSLNGTVTVQAVNDWGGTQDYTLK
>SC086 pnd, hypothetical protein
MIDKKISRGFLLYTVHKTLSAYSLKRPQLF
>SC084 pndA, hypothetical protein
MSEKWSKEFDSCFQVGSIEEIADALEKRSGSEENALAMLAMTAFTLMTRR
GICEMQNVAPDGKGIRLELIGFRENETIPDTLH
>SC116 psiA, hypothetical protein
MSARSRALIPLSAEQQAAMQAVAVTEQRRRQGRTLSAWPYATAFFRCLNG
SRRISLTDLRFFAPALTKEEFHGNRLLWLAAVDKLIESFGEVCVLPLPSD
AGHRLFPSVPFREGERRRQKTTLTEQKYSRQREREAERRELEYQTCFAQA
QIDLAFHTPATVGSWLSRWSGVVEEHDLETIFWGWCGRFPSLSSFDRFFW
QEEPLWRLIFEAGEAGRGAPVQVRALEQWMIPNKLENVI
>SC117 psiB, plasmid SOS inhibition protein
MMKTELTLNALQSMNAQEYEEIRAAGSDMRRNLTHEVMREVDAPANWMMN
GEYGSEFGGFFPVQVRFTPAHERFHLALCSPGDVSQLWMLVLVNGGGQPF
AVVQVQHIFTPVAISHTLALAATLDAQGYSVNDIIHILMVEGGQA
>SC050 qacF, quaternary ammonium compound resistance protein
MKNWLFLAIAIFGEVVATSALKSSHGFTKLVPSVVVVAGYGLAFYFLSLA
LKSIPVGIAYAVWAGLGIVLVAAIAWIFHGQKLDLWAFVGMGLIVSGVAV
LNLLSKVSAH
>SC160 recF, putative RecF protein
MITTLHIQNYRSIREMSLELEQLNIVFGPNGTGKSNIYKAIHLMHSAAQG
QFSQALANEGGILKVFWAGKTRSDQLRRMNLAVETETYEYELQVGFVEKL
PYPSQFQLDPVIKEESIWLSGQYRRPSSQLMKRKNQAVFLNNVHHEKVTH
SGTLYENESVFGQLGEPHLYPEVSQMRESLRNWRFYQEFSVSIGSAMRAP
QVGFRSPVLASDGANLAAAFQTIVEIGDELLLMRILDQAFPGCVFYSDNT
GGRFRMMMQREGLSRPLEPAEFSDGTLRFLCLAVALLSPRPPAFIALNEP
ENSLHPQMLPALASLIAEASRYSQIWLTSHSPELANLIEKQRSFSLYQLS
MVEGETKVERLG
>SC165 relB, translation negative regulator
MGSINLRIDDELKARSYAALEKMGVTPSEALRLMLEYIADNERLPFKQTL
LSDEDAELVEIVKERLRKPKPVRVTLDEL
>SC166 relE, hypothetical protein
MAYFLDFDERALKEWRKLGSTVREQLKKKLVEVLESPRIEANKLRGMPDC
YKIKLRSSGYRLVYQVIDEKVVVFVISVGKRERSEVYSEAVKRIL
>SC154 repA, DNA replication protein
MTSENNSLLLNLQEVDKTTGEVVKLDVNSTSTVQPVALMRLGLFVPTLKS
TGKSKANRKNVTDATEELVQLAIAKSEGYTDVKITGSRLDMDTDFKVWLG
IIRSMSEYGVKSDTLELSFVEFVKMCGFDSRRSNKKMRDRISNSLFKLAS
VTLKFQSETKGWTTHLVQSAYYDINEDIVEIKAEPKLFELYHMDRRVLLR
LKAIDALQRKESAQALYTYIESLPQNPAPISMKRMRDRLNLTSNVYTQNH
TVRKAMEQLRDIGYLDYTEFKRGRATYFSVHYRNPKLISSPVKVPRKEEE
EKAPEQNYDEVIKALKAAGIDPLKLAEALSAMKPEN
>SCV27 repA/repB, replication initiation protein
MQAEVTDTRTLFTHYRQVKNPNPEFTPREGKKTLPFCRKLMAKAEGFTSR
FDFSVHVAFVRSLGKRHRMPPLLRRRAIDALLQGLCFHYDPLANRVQRSI
TNLAIECGLATESKSGNLSITRATRALKFMAELGLITYQTEYDPQIGCNI
PTDITFTPALFSALDVSDVAVMAARCSRVEWENQQRKKQNLEPLEMDELI
AKAWRFVRERFRSYQSERKLHGLKRARARRDADRTRKDIETLVKQQLTRE
YASGRFTGGLDAMKRELQRRVKERMMMSRGKNYTRLTMATVPI
>SCV20 repA/repC, RepFIB replication protein A
MDKDNLDIKKLFEEVDKSSGEIVNLTPNASNTVQPVALMRLGVFVPTLKS
LKNRKKNTLSRTDASEELTRLSLARAEGFDKVEITGPRLDMDNDFKTWVG
VIHSFARHKVIGDKVELPFVEFAKLCGIPSSQSSRKLRERISPSLKRIAG
TVISFSRTTEKHTKEYITHLVQSAYYDTEKDIVQLQADPRLFELYQFDRK
VLLQLKAINALKRRESAQALYTFIESLPRDPAPISLARLRARLNLKSPVF
SQNQTVRRAMEQLREIGYLDYTEIQRGRTKLFCIHYRRPKLKPPHDESVE
NPQLPATPGDVSPEMAEKLALLDKLGITLDDLEKLFKSR
>SCV49 rlgA, integrase-like protein Y4LS
MGCFYPIPVLRFHHLELNETVFTIMALYGYARVSTSDQDLTLQTQILRAA
GCEIIRAEKASGSGRTGRSELQLLLEFLRPGDTLMVTRVDRLARSIKDLQ
DIVYALNQQGVTLRATEQPVDTRSAAGKAFLDMLGVFAEFETNLRRERQM
EGIAAAKARGVYRGRKPSIDPAEVYRLYTIEKMGATAIARQLGIGRASVY
WALENYEQPA
>SCV14 rsd, resolvase
MSQPPLPAVCTQAASALLPVAIDYPAALALRQMAMQHDDYPKYLLAPEVS
ALLHYVPDLHRRMLLATLWNTGARINEALALTRGDFSLAPPYPFVQLATL
KQRAEKAARTAGRMPSGSQPHRLVPLSDNQYVSELQMMVATLKIPLERRN
RRTGRTEKARLWEITDRTVRTWIGEAVEAAAADDVTFSVPVTPHTFRHSY
AMHMLYAGIPLKVLQALMGHKSVSSTEVYTKVFALDVAARHRVQFQMPGA
DAVAMLKGGS
>SCV38 samA, SamA protein
MLLLVAPEQEPVQSTAPLFTERCPAGFPSPAADYTEEELDLNAYCIRRPA
ATFFVRAIGDSMKEMGLHSGDLMVVDKAEKPMQGDIVIAETDGEFTVKRL
QLKPRIALLPINPAYPTLYPEELQIFGVVTAFIHKTRSTD
>SCV39 samB, DNA polymerase IV
MFALADVNSFYASCEKVFRPDLRDRSVVVLSNNDGCVIARSAEAKKLGIK
MGVPWFQLRSAKFPEPVIAFSSNYALYASMSNRVMVHQEELAPRVEQYSI
DEMFLDIRGIDSCIDFEDFGRQLREHVRSGTGLTIGVGMGPTKTLAKSAQ
WASKEWSQFGGMLALTLHNQKRTEKLLSLQPVEEIWGVGRRISKKLNTMG
ITTALQLARANPTFIRKNFNVVLERTVRELNGESCISLEEAPPPKQQIVC
SRSFGERVTTYEAMRQAVCQHAERAAEKLRGERQFCRHIAVFVKTSPFAV
TEPYYGNLASEKLLIPTQDTRDIIAAAVRALDRIWVDGHRYAKAGCMLND
FTPTRVSQLNLFDEVQPRERSEQLMQVLDGINHPGKGKIWFAGRGIAPEW
QMKRELLSPAYTTRWADIPAAKLT
>SC055 sat, streptothricin acetyl-transferase
MRSRNWSRTLTERSGGNGAVAVFMACYDCFFVQSMPRASKQQARYAVGRC
LMLWSSNDVTQQGSRPKTKLYVELEGNLSMKEKVVVDKAISLYTESFGDP
AHEPIILIMGAMSSAVWWPDEFCSQLAKMGRYVIRYDHRDTGKSTSYEPG
QAPYSVEELADDVVRVIDGYGLEAAHLVGMSLGGFLSQLVALKYPKRVKS
LTLIASERLADADPDMPAFDPAIIEYHQRAESLDWSDRDAVVAYQVGAWR
INSGTAHAFDAEKIQNIAELNFDRTPNILTTFNHTTLGGGERWLGRLNEI
AVPTLIIHGTEDPVLPYVHGLALKDAIRGSKMLTLEGTGHELHHEDWPRI
IQAIKGQTS
>SC073 sogS, DNA primase
MQQCHKGLRFAGYGEAVNHDADSTNRPAPELMQFHLKTREEPLFAAVYTP
EKQPDALYRNLGFEQSWQQWSNSQKPEDRQEKTLHQDLSHSPGR
>SC153 sopB, SopB
MKRAPVIPRHTTHSQSTEDTSSPAPAAPMVDSLIARVGAMARGNAISLPV
CGREVKFTLEVLRGDSVESASRVWSGNERDQELLTEDALDDLIPSFLLTG
QQTPAFGRRVSDVIEIADGSRRRKAAILTESDYRVLVGELDDEQMAALSR
LGNDYRPTSAYERGLRYTSRLQNEFAGNISALADAENISRKIITRCINTA
KLPKSVVALFAHPGELSARSGEALQKAFADKEELLKQQAETLHDQKKAGL
IFEAEEVISLLTSVLKQSPASRVNLSSRHQFAPGATALYKGDKMVLNLDR
SRIPAECIEKIEAILKELEKPGV
>SCV03 spvA, 28.1 kDa virulence protein
MNMNQTTSPALSQVETAIRVPAGNFAKYNYYSVFDIVRQTRKQFINANMS
WPGSRGGKAWDLAMGQAQYIRCMFRENQLTRRVRGTLQQTPDNGTNLSSS
AVGGIQGQAERRPDLATLMVVNDAINQQIPTLLPYHFPHDQVELSLLNTD
VSLEDIISESSIDWPWFLSNSLTGDNSNYAMELASRLSPEQQTLPTEPDN
STATDLTSFYQTNLGLKTADYTPFEALNTFARQLAITVPPGGTVDCGYSA
CQPAV
>SCV04 spvB, 65 kDa virulence protein
MLILNGFSSATLALITPPSLPKGGKALSQSGPDGLASITLPLPISAERGF
APALALHYSSGGGNGPFGVGWSCATMSIARSTSHGVPQYNDSDEFLGPDG
EVLVQTLSTGDAPNPVTCFAYGDVSFPQSYTVTRYQPRTESSFYRLEYWV
GNSNGDDFWLLHDSNGILHLLGKTAAARLSDPQAASHTAQWLVEESVTPA
GEHIYYSYLAENGDNVDLNGNEAGRDRSAMRYLSKVQYGNATPAADLYLW
TSATPAVQWLFTLVFDYGERGVDPQVPPAFTAQNSWLARQDPFSLYNYGF
EIRLHRLCRQVLMFHHFPDELGEADTLVSRLLLEYDENPILTQLCAARTL
AYEGDGYRRAPVNNIMPPPPPPPMMGGNSSRPKSKWAIVEESKQIQALRY
YSAQGYSVINKYLRGDDYPETQAKETLLSRDYLSTNEPSDEEFKNAMSVY
INDIAEGLSSLPETDHRVVYRGLKLDKPALSDVLKEYTTIGNIIIDKAFM
STSPDKAWINDTILNIYLEKGHKGRILGDVAHFKGEAEMLFPPNTKLKIE
SIVNCGSQDFASQLSKLRLSDDATADTNRIKRIINMRVLNS
>SCV05 spvC, 27.5 kDa virulence protein
MPINRPNLNLHIPPLNIVAAYDGAEIPSTNKHLKNNFNSLHNQMRKMPLS
HFKEALDVPDYSGMRQSGFFAMSQGFQLNNHGYDVFIHARRESPQSLGKF
AGDKFHISVLRDMVPQAFQALSGLLFSEDSPVDKWKVTDMEKVVQQARVS
LGAQFTLYIKPDQENSQYSASFLHKTRQFIECLESRLSENGVISGQCPES
DVHPENWKYLSYRNELRSGRDGGEMQRQALREEPFYRLMTE
>SCV06 spvD, virulence protein VsdE
MRVSGSASSQDIISRINSKNINNNDSNDVKRIKDALCIESKERILYPQNL
SRDNLKQMARYVNNTYVHYSGNCVLLSACLHYNIHHRQDILSSKNTASPT
VGLDSAIVDKIIFGHELNQSYCLNSIEEVEKEILNRYDIKRESSFIISAE
NYIAPIIGECRHDFNAVVICEYDKKPYVQFIDSWKTSNILPSLQEIKKHF
SSSGEFYVRAYDEKHD
>SCV02 spvR, transcriptional activator
MDFLINKKLKIFITLMETGSFSIATSVLYITRTPLSRVISDLERELKQRL
FIRKNGTLIPTEFAQTIYRKVKSHYIFLHALEQEIGPTGKTKQLEIIFDE
IYPESLKNLIISALTISGQKTNIMRRAVNSQIIEELCQTNNCIVISARNY
FHRESLVCRTSVEGGVMLFIPKKFFLCGKPDINRLAGTPVLFHEGAKNFN
LDTIYHFFEQTLGITNPAFSFDNVDLFSSLYRLQQGLAMLLIPVRVCRAL
GLSTDHALHIKGVALCTSLYYPTKKRETPDYRKAIKLIQQELKQSTF
>SC120 ssb, single-stranded DNA binding protein
MSARGINKVILVGRLGNDPEVRYIPNGGAVANLQVATSESWRDKQTGEMR
EQTEWHRVVLFGKLAEVAGEYLRKGAQVYIEGQLRTRSWEDNGITRYVTE
ILVKTTGTVQMLGRAPQQNAQAQPKPQQNGQPQSADATKKGGAKTKGRGR
KAAQPEPQPQPPEGEDYGFSDDIPF
>SC065 stra, StrA
MAFIIHAEDIQHSAGNRRWHLRKKPLKMRDRPEHNRCKSIATRFFSQVEP
PPRFIRKLKEPPLNRTNIFFGESHSDWLPVRGGESGDFVFRRGDGHAFAK
IAPASRRGELAGERDRLIWLKGRGVACPEVINWQEEQEGACLVITAIPGV
PAADLSGADLLKAWPSMGQQLGAVHSLSVDQCPFERRLSRMFGRAVDVVS
RNAVNPDFLPDEDKSTPQLDLLARVERELPVRLDQERTDMVVCHGDPCMP
NFMVDPKTLQCTGLIDLGRLGTADRYADLALMIANAEENGTVANSRW
>SC092 sugE, SugE protein
MNKKVEIHFLPGKNAISHISFAGRPANASFTGDGPIVLEPDMSWIVLLIA
GLLEVVWAIGLKYTHGFTRLTPSIITIAAMIVSIAMLSWAMRTLPVGTAY
AVWTGIGAVGAAITGILLLGESASPARLLSLGLIVAGIIGLKLSTH
>SC034 sulI, dihydropteroate synthetase typeI
MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASH
PDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYL
NDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIV
RFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSA
LGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAP
GDLRSAITFSETLAKFRSRDARDRGLDHA
>SC035 sulI3', SulI3'
MDSEEPPNVRVACSGDIDEVVRLMHDAAAWMSAKGTPAWDVARIDRTFAE
TFVLRSELLVASCSDGIVGCCTLSAEDPEFWPDALKGEAAYLHKLAVRRT
HAGRGVSSALIEACRHAARTQGCAKLRLDCHPNLRGLYERLGFTHVDTFN
PGWDPTFIAERLELEI
>SC048 sulII, dihydropteroate synthase
MSKIFGIVNITTDSFSDGGLYLDTDKAIEHALHLVEDGADVIDLGAASSN
PDTTEVGVVEEIKRLKPVIKALKEKGISISVDTFKPEVQSFCIEQKVDFI
NDIQGFPYPEIYSGLAKSDCKLVLMHSVQRIGAATKVETNPEEVFTSMME
FFKERIAALVEAGVKRERIILDPGMGFFLGSNPETSILVLKRFPEIQEAF
NLQVMIAVSRKSFLGKITGTDVKSRLAPTLAAEMYAYKKGADYLRTHDVK
SLSDALKISKALG
>SCV28 tap/repC, hypothetical protein
MLRKFQYLFLWHLLLPCIVSAGRSD
>SC018 tem-1, beta-lactamase
MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLN
SGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDL
VEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFL
HNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLA
SRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPD
GKPSRIVVIYTDGESGNYG
>SC043 tem67, hypothetical protein
MHRTRVAYSYPGETAMSMQTVRRKALRQNPYSTLGKPPLRSSRDRITRAM
ALLQIVGGDKLIIPFC
>SC004 tetA, tetracycline efflux protein
MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLA
LYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLY
IGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVL
GGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPL
ASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGI
SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG
WMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIV
GPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQRADR
>SC005 tetR, RK2 tetracycline repressor protein
MFISDKVSSMTKLQPNTVIRAALDLLNEVGVDGLTTRKLAERLGVQQPAL
YWHFRNKRALLDALAEAMLAENHTHSVPRADDDWRSFLIGNARSFRQALL
AYRDGARIHAGTRPGAPQMETADAQLRFLCEAGFSAGDAVNALMTISYFT
VGAVLEEQAGDSDAGERGGTVEQAPLSPLLRAAIDAFDEAGPDAAFEQGL
AVIVDGLAKRRLVVRNVEGPRKGDD
>SCV46 tlpA, myosin heavy chain gizzard smooth muscle
MRPATYEPEQIIEAGLALQAEGRNITGFALRNQVGGGNPTRLRQIWDEYQ
ASQSTVVTEPVAELPVEVAEEVKAVSAALSERITQLATELNDKAVRAAER
RVAEVTRAAGEQTAQAERELADAAQTVDDLEEKLDELQDRYDSLTLALES
ERSLRQQHDVEMAQLKERLAAAEENTRQREERYQEQKTVLQDALNAEQAQ
HKNTREDLQKRLEQISAEANARTEELKSERDKVNTLLTRLESQENALASE
RQQHLATRETLQQRLEQAIADTQARAGEIALERDRVSSLTARLESQEKAS
SEQLVRMGSEIASLTERCTQLENQRDDARLETMGEKETVAALRGEAEALK
RQNQSLMAALSGNKQTGGQNA
>SC024 tniAdelta1, TniAdelta1
MLNTRVHQSEVSMATDTPRIPEQGVATLPDEAWERARRRAEIISPLAQSE
TVGHEAADMAAQALGLSRRQVYVLIRRARQGSGLVTDLVPGQSGGGKGKG
RLPEPVERVIHELLQKRFLTKQKRSLAAFHREVTQVCKAQKLRVPARNTV
ALRIASLDPRKVIRRREGQDAARDLQGVGGEPPAVTAPLEQVQIDHTVID
LIVVDDRDRQPIGRPYLTLAIDVFTRCVLGMVVTLEAPSAVSVGLCLVHV
ACDKRPWLEGLNVEMDWQMSGKPLLLYLDNAAEFKSEALRRGCEQHGIRL
DYRPLGQPHYGGIVERIIGTAMQMIHDELPGTTFSNPDQRGDYDSENKAA
LTLRELERWLTLAVGTYHGSVHNGLLQPPAARWAEAVARVGVPAVVTRAT
SFLVDFLPILRRTLTRTGFVIDHIHYYADGHCCK
>SC036 tniBdelta1, NTP-binding protein
MDEYPIIDLSHLLPAAQGLARLPADERIQRLRADRWIGYPRAVEALNRLE
ALYAWPNKQRMPNLLLVGPTNNGKSMIVEKFRRTHPASSDADQEHIPVLV
VQMPSEPSVIRFYVALLAAMGAPLRPRPRLPEMEQLALALLRKVGVRMLV
IDELHNVLAGNSVNRREFLNLLRFLGNELRIPLVGVGTRDAYLAIRSDDQ
LENRFEPMMLPVWEANDDCCSLLASFAASLPLRRPSPIATLDMARYLLTR
SEGTIGELAHLLMAAAIVAVESGEEAINHRTLSMACRQPLAQPRHRGRTA
SDLEATGPSHPDMDLDARTDVRFRVLGVLR
>SC016 tnpA, transposase
MLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVK
VNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPR
FINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRII
GATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFE
M
>SC013 tnpA, transposase
MLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVK
VNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPR
FINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRII
GATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFE
M
>SC174 tnpA, hypothetical protein A
MKKRFSDEQIISILREAEAGVPARELCRKHAISDATFYTWRKKYGGMEVP
EVKRLKSLEEENTRLKKLLAEAMLDKEALQVALGRKY
>SC006 tnpA, relaxase/helicase
MKAAALDLARERQAHEAGARTRATAHERTPQQERQKAAREAERGREAWTL
GQGMKKPVAGCYGRLTRWKGGGDVVYMALL
>SC010 tnpA, putative transposase
MPRAKIGRVTPVPAQPFFHFRRITLNPAVNRGVIDIHSAFSQHLLQLTVT
DAVFAVPAYGPQNDVTLKMPAFEWVHVQLHQQKGMISLSPPTICNSATGR
AHGVRQRRAEVHRRGLTDGQSDAVGHGSLR
>SC161 tnpA, TnpA
MLMELHMNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDH
STIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRA
VDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYG
RALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKT
AYATIKGIEVMRALRKGQASAFYYGDPLGEIRLVSRVFEM
>SC049 tnpA, transposase
MLVNPNNPIFNLATGKVLLKVPQVRGMEFYPSCIEKGMRSERALKLAIAE
MYVKGVSTRRVSDIVEILCGTEVSSSQVSRLAKELDEEITSWKAQPVGQI
QYLVLDATYESVRVGSHVVKQALLVAIGVDYSGNRHILDAEVANSEAEVN
WRSFLEGLVRRGMHGLRMITSDDHSGLRAAIDAVFPGILWQRCQFHLQQN
AHSYVTKKDEIPLIAADIRKVFNRNMSR
>SC008 tnpA, transposase
MPRRSILSATERESLLALPDAKDELIRHYTFNETDLSVIRQRRGAANRLG
FAVQLCYLRFPGTFLGVDEPPFPPLLRMVAAQLKMPVESWSEYGQREQTR
REHLVELQTVFGFKPFTMSHYRQAVHTLTELALQTDKGIVLASALVENLR
RQSIILPAMNAIERASAEAITRANRRIYAALTDSLLSPHRQRLDELLKRK
DGSKVTWLAWLRQSPAKPNSRHMLEHIERLKSWQALDLPAGIERQVHQNR
LLKIAREGGQMTPADLAKFEVQRRYATLVALAIEGMATVTDEIIDLHDRI
IGKLFNAAKNKHQQQFQASGKAINDKVRMYGRIGQALIEAKQSGSDPFAA
IEAVMPWDTFAASVTEAQTLARPADFDFLHHIGESYATLRRYAPQFLGVL
KLRAAPAAKGVLDAIDMLRGMNSDSARKVPADAPTAFIKPRWAKLVLTDD
GIDRRYYELCALSELKNALRSGDVWVQGSRQFKDFDEYLVPVEKFATLKL
ASELPLAVATDCDQYLHDRLELLEAQLATVNRMAAANDLPDAIITTASGL
KITPLDAAVPDAAQAMIDQTAMLLPHLKITELLMEVDEWTGFTRHFTHLK
TSDTAKDKTLLLTTILADAINLGLTKMAESCPGTTYAKLSWLQAWHIRDE
TYSTALAELVNAQFRQPFAGNWGDGTTSSSDGQNFRTGSKAESTGHINPK
YGSSPGRTFYTHISDQYAPFSAKVVNVGIRDSTYVLDGLLYHESDLRIEE
HYTDTAGFTDHVFGLMHLLGFRFAPRIRDLGETKLFIPKGDAAYDALKPM
ISSDRLNIKQIRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAV
ALRELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFYRLG
EIRDRSFEQQRYRASGLNLVTAAIVLWNTVYLERATSALRGNGTALDDTL
LQYLSPLGWEHINLTGDYLWRSSAKVGAGKFRPLRPLPPA
>SC066 tnpA, TnpA
MLMELHMNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDH
STIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRA
VDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYG
RALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKT
AYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM
>SC089 tnpA, transposase
MINKIDFKAKNLTSNAGLFLLLENAKSNGIFDFIENDLVFDNDSTNKIKM
NHIKTMLCGHFIGIDKLERLKLLQNDPLVNEFDISVKEPETVSRFLGNFN
FKTTQMFRDINFKVFKKLLTKSKLTSITIDIDSSVINVEGHQEGASKGYN
PKKLGNRCYNIQFAFCDELKAYVTGFVRSGNTYTANGAAEMIKEIVANIK
SDDLEILFRMDSGYFDEKIIETIESLGCKYLIKAKSYSTLTSQATNSSIV
FVKGEEGRETTELYTKLVKWEKDRRFVVSRVLKPEKERAQLSLLEGSEYD
YFFFVTNTTLLSEKVVIYYEKRGNAENYIKEAKYDMAVGHLLLKSFWANE
AVFQMMMLSYNLFLLFKFDSLDSSEYRQQIKTFRLKYVFLAAKIIKTARY
VIMKLSENYPYKGVYEKCLV
>SC074 tnpA, transposase
MLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVK
VNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPR
FINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRII
NATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFE
M
>SC173 tnpA, hypothetical protein B
MMLMCDATGLSQRRACRLTGLSLSTCRYEAHRPAADAHLSGRITELALER
RRFGYRRIWQLLRREGLHVNHKRVYRLYHLSGLGVKRRRRRKGLATERLP
LLRPAAPNLTWSMDFVMDALSTGRRIKCLTCVDDFTKECLTVTVAFGISG
VQVTRILDSIALFRGYPATIRTDQGPEFTCRALDQWAFEHGVELRLIQPG
KPTQNGFIESFNGRFRDECLNEHWFSDIVHARKIINDWRQDYNECRPHST
LNYQTPSEFAAGWRKGHSENEDSDVTN
>SC058 tnpA, TnpA
MLMELHMNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDH
STIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRA
VDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYG
RALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKT
AYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM
>SC169 tnpA, putative transposase
MITFEIRMEIKVLHKRGMSIRAIARELGISRNTVRSHLKAKSEKPQYSPR
PAPSSLLDEYRDYISKRISDAHPYKIPATVIAREIMELGYRGGLTILREF
IRKQTLPAQAEPVVRFETEPGRQMQVDWGTMRNGKSPLHVFVAVLGYSRM
LYIEFTDNMRYDTLEACHRNAFSFFGGVPQEVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYARNSFYIPLMT
RLRPMGITVDVETANRYGLRWLYDVANQRKHETIQTRPCDRWVEEQQSML
ALPPEKKQYDVQVDESLMTFDRQPLHHPLSIYDTFCRGAA
>SC007 tnpA, transposase
MHLLGFRFAPRIRDLGETKLYVPQGVQAYPTLRPLIGGTLNIKHVRAHWD
DILRLASSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILD
WLQSVELRRRVHAGLNKGEARNSLARAVFFNRLGEIRDRSFEQQRYRASG
LNLVTAAIVLWNTVYLERATQGLVEAGKPVDGELLQFLSPLGWEHINLTG
DYVWRQSRRLEDGKFRPLRMPGKP
>SC025 tnpA, TnpA
MLMELHMNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDH
STIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRA
VDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYG
RALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKT
AYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM
>SC044 tnpA, transposase
MLMELHMNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDH
STIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRA
VDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYG
RALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIINATLGFKSMKT
AYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM
>SC038 tnpA, TnpA
MLMELHMNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDH
STIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRA
VDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYG
RALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKT
AYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM
>SC027 tnpA, TnpA
MLMELHMNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDH
STIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRA
VDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYG
RALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKT
AYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSKSFLKCKAFE
>SC037 tnpA, putative transposase
MIAFNHVVPVLNLSVFNVRRAPAFAFEQSKRATIGGRFIRVDESRDLPLL
HVVEDFTQKPVCSFAVTTGGEIKIDSAAPAVDGPVQIRPAAIDLHVGFIH
VPRAKIGRVTPVPAQPFFHFRRITLNPAVNRGVIDIHSAFSQHLLQLTVT
DAVFAVPAYGPQNDVTLKMPAFEWVHVQLHQQKGMISLSPPTICNSAQDI
SSAGQARSAGYGYCRPAGRQLATRQTVRPD
>SC014 tnpA, transposase
MPRRSILSAAERESLLALPDSKDDLIRHYTFNDTDLSIIRQRRGPANRLG
FAVQLCYLRFPGVILGVDELPFPPLLKLVADQLKVGVESWNEYGQREQTR
REHLSELQTVFGFRPFTMSHYRQAVQMLTELAMQTDKGIVLASALIGHLR
RQSVILPALNAVERASAEAITRANRRIYDALAEPLADAHRRRLDDLLKRR
DNGKTTWLAWLRQSPAKPNSRHMLEHIERLKAWQALDLPTGIERLVHQIR
LLKIAREGGQMTPADLAKFERHCCKVSDEAAFCLIQRPYISKTLLTRRIS
PRGSP
>SC172 tnpA, TnpA
MLWNTMYMQAALDHLRAQGETLNDEDIARLSPLCHGHINMLGHYSFTLAE
LVTKGHLRPLKEASEVENVA
>SC178 tnpA, transposase
MHLLGFRFAPRIRDLGETKLYVPQGVQAYPTLRPLIGGTLNIKHVRAHWD
DILRLASSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILD
WLQSVELRRRVHAGLNKGEARNSLARAVFFNRLGEIRDRSFEQQRYRASG
LNLVTAAIVLWNTVYLERATQGLVEAGKPVDGELLQFLSPLGWEHINLTG
DYVWRQSRRLEDGKFRPLRMPGKP
>SC057 tnpM, TnpM
MPECAHRLMRCRPSLVRGRAAQVNPGGFTTPARRPSVPHRTAGCGKSSLR
PLMAGSSPSLPDGSNPSAAIVQSASDVQCSRLLKTTMEVVAEGVETPDCL
AWLRQAGCDTVQGFLFARPMPAAAFVGFVNQWRNTTMNANEPSTSCCVCC
KEIPLDAAFTPEGAEYVEHFCGLECYQRFQARASTATETSVKPDACDSPP
SG
>SC009 tnpR, resolvase family recombinase
MEFVKEGLKFTGEDSPMANLMLSVMGAFAEFERALIRERQREGIVLAKQR
GAYRGRKKSLNSEQIAELKRRVAAGDQKTLVARDFGISRETLYQYLRED
>SC017 tnpR, hypothetical protein
MIAFNHVVPVLNLSVFNVRRAPAFAFEQSKRATIGGRFIRVDESRDLPLL
HVVEDFTQKPVCSFAVTTGGEIKIDSAAPAVDGPVQIRPAAIDLHVGFIH
VPRAKIGRVTPVPAQPFFHFRRITLNPAVNRGVIDIHSAFSQHLLQLTVT
DAVFAVPAYGPQNDVTLKMPAFEWVHVQLHQQKGMISLSPPTICNSAGDV
ILVKKLDRLGRDTADMIQLIKEFDAQGVAVRFIDDGISTDGDMGQMVVTI
LSAVAQAERRRILERTNEGRQEAKLKGIKFGRRRTVDRNVVLTLHQKGTG
ATEIAHQLSIARSTVYKILEDERAS
>SC028 tnpR, hypothetical protein
MNKTKGCLIANFATVPAPLCQPLDDAAQVIKVACQPVHAMHHHGVALADE
GQQPFQLGTLGVLARSLVGEHPRHLNTLQLPFRVLVEAADADIADALTLQ
DASKGKSVRMKSITFDGICQSIGNLTLF
>SC015 tnpR, resolvase
MQQCPFSDKASGKDVKRPQLEALISFARTGDTVVVHSMDRLARNLDDLRR
IVQTLTQRGVHIEFVKEHLSFTGEDSPMANLMLSVMGAFAEFERALIRER
QREGIALAKQRGAYRGRKKSLSSERIAELRQRVEAGEQKTKLAREFGISR
ETLYQYLRTDQ
>SC072 traL, TraL-like protein
MNMHKSLVAVLGLILISSTASAATCRAGSAAQAGSNTGYERARAAADAWS
QRENDVSSSLQSCLSRIKKISINLPQFPSLDDILSQLENQVCDAVVDKVN
EKLPGNIDPWKDYNL
>SC071 traM, TraM-like protein
MTENTEPSSPSHPAPDPEILKSTIRAMKQSEQRANNVPALIKALLCTGTC
LLISITGNAIQYWHSTNVEREYFATDNGRLVRLAPTSQPAWSQNDAMAFG
SQALATAFNLDFVHYRSQISSLSPRFSDEGFVGYVNALQASNILETIKKE
KMNLTATTGAGVLVRQGQMSDGVWFWTFQYPVRMRLVGQTTSKPEQSFVF
EITIQRVDPRLKPSGMEIRQMISRNAGPNS
>SC070 traN, TraN-like protein
MQSRYLLSTLLLVCSAATSADNAGWQNARTPQTTNSASDHVQNASQNTGG
VPATLVKGELPAPGQASPLVQDAARLDSELSADEIRSLRSLMADNERAIN
APITSVVPRISSLTVNLSPGASLPLVRTAMNNLSVVTFTDINGSPWPQSD
PPYNAAPKLFDVQYNGNMVTITPLRPWASGNISVYLKGLSVPVILNVTSG
ETDTPSSSQEMDSRLDLRIPRQGPTSPVVSIPTDKIALHDATLQAFLDGI
PPRDPSVKRLKFTGNVPDTTIWQHGDDLLVRSRAMLRDEFEQTLSSADGT
HLWKLPVTPLLTFSVNGQSVHVTPELE
>SC069 traO, TraO-like protein
MSAEQDAGKSGKKLAALLGLGSIILFGGGYIAFSKLSGSNSDMQSAVNIN
SAASGGTRSVTETPHYRELLRADNERGAAAAARNNQTFIASLPQGLDIPD
TPEKQQQPATKPENYAHRQASGTPQEDRAASEKRMERLQKLIVRIKDQHP
AGSTPTIATTMWNKSPAETTGQNGTQQFALQNASLSTPVAEKGIQLIPAL
TRIPAYIDTAVDSDNPSSKVIATIPAGPWAGATLFSPGVKLVGNGVEIHF
DRMSWNGMDLKVNAYAQREDNLMSSVASNVNTRWFKHIILPSVLGGVGSI
GTLYKDANTQVIQGNYGTVTGRVGMPSGEAVAGVIAGGMAERGSQILTRQ
AESEPYKQVEVYQHEVVSILFVDPVMTNDARSSSLSSGISPSVNRTSQAE
QRSQARMQTAMEQRKAVMQRRYDEQPETP
>SC068 traP, TraP-like protein
MKPETEIDETGSFGEPEEKPAPFWKRSVWGISVATWGLCAVVLLAAIWYL
FLRAPSETGMPSFNDADAGVQTWQTTQESSPSVQSTETMTVRTGEDMSQL
ARDVKTELDNRDEKIQATLNMLHDSINKLGEAIKKDEEYAQETRRQLDDI
RSRLNGIMTQKSVTESSSTPHPAKKKTSSVLNGMKIMSMETGMAWIRWQG
STWAVREGQTLGNVVIQRIDPTTRTIITSAGTLR
>SC067 traQ, TraQ-like protein
MNMDALTAIENFASSIFSAGMDFLFTWGEFIGVISMITLFARARSAGPVK
MSPGKFIAGMLTSCMLVSLPAMINAGGVQMGFRADSFGPIAYVQPQTFGA
AAGAANAVLSLAKLAGVGFVMNGISIWRKAGLDGHTALSASESVSKGNVK
FIAGVLLVFIDRVLNALLASIGIVF
>SC075 traU, TraU-like protein
MGSCAAPSAKGDDKFITTDYLQQCPRNQAILKSAIEGWGVCGTTTTFGDP
RRAWVNTILAASGGSGPVPLYPPLSHAISLFPLNRAGSVWRGKGNLMLHT
EDGSAFEVGLASSQQNKHTELAPGDPGLGKSVLINTLSEIQISSAQKNLP
FIAYIDKGYSAQGLVQLIRDSLPPERKDEAVGIILSNDPEYTRNLFDVMY
GAKKPITPEKNFMSSVLCALCVDTGTGQPCNPGDTRQIINQLIELAFKEY
GENNPRLYRASTEDLVDSALQDSGLYEKHDAAWWARSTWFEVRDMLHNAG
YIMAAQRAHYQAMPQLPEVSSMLGHTSLRDVFGTVQRDGSNELLLDYIRR
ALEQGHNDYPMISGYTRFMINPETRVIAVDLNNVAGDKTPAGRLKTGIMY
LLAGQIAGGDFTLPQYRDEVLKQLPREYHEIALKRINQLDQEVKTKVYDE
LHNARGIDFIWENLDTQEREQRKFAIRTVLSTQYLRDYPESVLKSANTLW
LLRYKPEDIPVLRDNFNVPEFMLKRFLKMPEGPAPDGSGVPVLGVFRVKS
GTLARILKFTVGPLELWALNSSPKDSALRKTLTNKLGSVRARKILAENFP
RGSATSLIEHRAGQHNSDNVIEELASELIRKQGYNL
>SC076 traV, TraV-like protein
MCTIHITPRCRRPARGQVFLKKLEVKGDSFSFTLPSRVFRLHPAPNPRVI
YASDARTQGKVTGMRLLLPSSTDELITTAIWQVSVPDEAPFDVQHVIQWI
LPSPLASLMSDDTWLWPSVPGTPALSAGDWPVMDTDPSTLRRVRRRSFSI
IDNRNRKGYITRRYQCFPLAPLPEDRKYDSLPDLVLHPEQETIKCKEKHY
SQPL
>SC077 traW, lipoprotein
MQRKTLLAALIATLSGTACQAHAYSVTVVASRPVEEQVIPRMEAIKDVLG
NILSTQTATGTAINQNSEKLASVIAQNGQATRQQMIFSNETQRLEEARKS
FTVPDSICSESASGIATESKSASASAASKLSKGGGVSNRSIRDRLASAAN
SPVREAYDGAAIHASYCTEAEYARFGGTAVCPSVGEIPGGDSQVRSIYHG
AGTADTPAALTWDQKQIDAATAYMKNTSRPSAGRALGKGEVNTQSGRTYV
GLQNEYNGIIDSASNPQLTLIADSTPNESTRKALAETLQSDSAAAYFDQV
ASPEAKARGYMSTREFEAFEAGRRYANTAYLVDLQEMQGDNLLRELVRIT
AQMNWQLNDLKEQIRQGNVISGQQLALTARQYYEKQLGSLEKTINQANAR
>SC078 traX, TraX-like protein
MNKLPENSDQTKNVKKSPSITRIATRRIINIFVPLMETRLITNSVQHMLK
QQKSRLEKFRQVNNKKAQLCLSWEEALQASHMSVDDLDRRFRRRRTVWRF
CCWSLLAIALFLSGMLFAASSLPLTTLVRAISTLVLILSGVALCASRALI
VTYRLWQLHERKVSEPEQGTFRDFLNDRNGWRNATLIAVTSKQY
>SC079 traY, integral membrane protein
MSPSPLIKSCVAFAHTQFYSGVTVKILLRALCAGLAISSLPAMASVTYQD
IVSAATNPDDLSRQALVTIFGDVVTNPLSTSAPTLIGSMFGAFNSIIAVL
AVVWFMFIGIRHVVRSGHQGQVFSTGRDIVGTLSVVAGFLMIVPTGNGWS
IAQLIMLWGASIMGVGSANVMVQLAADNIANGYSMTVQPVQASTRTAARG
IFEMELCKYAVNAGLNDFNQTAKSSTSLMTESAKTASGNYTVTVSNGSGI
CGSASLSVEGNGTTDQSTIGKFFNPFSKNEYSGVISAQRAAMDNMISDMD
NAASEFVTTFLEKRNSGNGTLPDIETRIQRAADEYERAVQKSLPTDNGEQ
SRKEALKSYLTTYGWVTLGAWYQTFATANQRLAELADRAPAVTSMSSLGE
VGDTDLFSAVMSAYRTQLQNSSYTPPLGTVVSSSEQRMANAQDPKSALSE
IMGPLVSLTNRIATETSGTGTTSAQINPLIKMKNIGDITMVSAEGIWTIY
TTARVVVAGGKDSILVKFFNSLTGAASMLTALFEALAPPVYFLLFLMFCA
GFSLSIYLPFIPFIFWMTGIGNWIVSVLIGCTAGPLWAATHLGTSEDRGS
RAAYGYIYLIDSMIRPPIMVFGFFFASVAIIAVGTILNALFGAALVNVQF
NSLTGIFSLAGFLLIYARICTTTVAAIFALQAYLPDHVINFLGGRDGANT
LGSMANSVKEIFIGGSRNIRHTPGMKTDRLKDNTKGSDDKDGIKG
>SC095 trbA, TrbA-like protein
MSYNRQPVAEDPMQIWGAVGVLLILLLFVIWLFLPEVVYASCLILHTLWG
LVDWGPFHNYAAPRYNLLAMTGNNAANISYSQWVNVMEQTIGILWMYLLP
VTLWCLWEWYQHPGQSRFTRRPVDITRLPHIFASLSPAIAPVLEDGDPEK
LFHGGKRPERRVALTPEAFVEQHTLITNMQLDVAAARRCFMAQLGKPLTS
WKDMAPHEKALFAIFGLQYFLDDRKAALKLMDTLNLSCRIKSKRDSGKFC
TPVYSLAKSAFKRVIKSDGAQQWLKQHRYVRSGLVWLYAHDLRLTPPNWI
WLKGVDRTLFYALHRANTTKGFIEGAGVVAVARAEAEAMRFGLPCPEPCV
DEAVEGLRRDMLSLGLIWDEPQPDRDRKRRILTNWSLTDDILPRTPATDN
EF
>SC096 trbB, TrbB
MTIEYFATRIAKHISVQAIWPGGRTEIIAVLPQDALSGLVTRNNQLIHIA
FDAMGTRNRETVLIDKQEHNADQILTAIDSCLRLEYLMERRFSRTPLFRA
IIASVVLFVMAMIAVSLFRYVDRVFWDDTTPEAVQTAGEPRLLPPHLNHT
VPLNEGIQLPVPPKKDVQVPEKTLTSKNPEAAAARHNLAAVLKRNADRGM
FTVNLSSGHERTLYAFLDPACPNCRLLEPALKRLASDFNVVIYPVSVIGG
EESTDRVAPLLCEKDAQKRAAGWHRLYSADNGMMTPSEETTPADETCLKA
ARAAIDVNNVAFRKFGFAGTPWVLSDTGWHLPTGILQETGTLNLFLKTTD
SESGHE
>SC097 trbC, TrbC-like protein
MSEHRVNPELLHRTAWGNPVWNALQSLNIYGFCLVASLVASFIWPLALPA
CLLFTLITMLVFSLQRWRCPLRMPMTLECADPSQDRMIKRSLFSFWPTLF
QYEVILESPASGIFYVGYQRVRDIGRELWLSMDDLTRHIMFFATTGGGKT
ETIFAWAINPLCWARGFTLVDGKAQNDTARTIWYLARRFGREDDVEVINF
MNGGKSRSEIILSGEKTRPQSNTWNPFCYSTEAFTAETMQSMLPQNVQGG
EWQSRAIAMNKALVFGTKFWCVREGKTMSLQMLREHMTLEGMAKLYCRGL
DDQWPEEAIAPLRNYLQDVPGFDLSLVRTPSAWTEEPRKQHAYLSGQFSE
TFSTFTEAFGDIFAEDSGDIDIRDSIHSDRILMVMIPALDTSAHTTSALG
RMFITQKSMILARDLGYRLEGTDSDALEVKKYKGRFPYLCFLDEVGAYYT
DRIAVEATQVRSLDFALILMAQDQERIEGQTTATNTATLMQNTGTKFAGR
IVSEGSTARTLKSAAGEEARARMNNLQRQDGIFGESWIDSPQISILMESK
INVQELIELHPGEFFSIFRGETVPSASFFIPDDEKSCSSDPVVINRYISV
DAPRLDRLRRLVPRTTQRRIPSPENVSAIIGVLTAKPSRKRRKIRTEPHT
IVDTFQQRIAGRQAAMAMLEEYDTDINARESALWETAVNTLKTTTREERR
IRYITLNRPELPETKEENQISVRAERAGINLLTLPQDNNHPTGRPVNGFH
HKKNNRPDWDGMY
>SC030 tri, dihydrofolate reductase
MNSESVRIYLVAAMGANRVIGNGPNIPWKIPGEQKIFRRLTEGKVVVMGR
KTFESIGKPLPNRHTLVISRQANYRATGCVVVSTLSHAIALASELGNELY
VAGGAEIYTLALPHAHGVFLSEVHQTFEGDAFFPMLNETEFELVSTETIQ
AVIPYTHSVYARRNG
>SC023 urf2, Urf2
MGCAGRVDLPLPSADSRRRAGRDDRRCLPWRALGCCRARADRLVRSGRNA
AAARLPGRIMTSSQPAGWTAAELAQAAARGQLDLHYQPLVDLRDHRIAGA
EALMRWRHPRLGLLPPGQFLPLAESFGLMPEIGAWVLGEACRQMHKWQGP
AWQPFRLAINVSASQVGPTFDDEVKRVLADMALPAELLEIELTESVAFGN
PALFASFDALRAIGVRFAADDFGTGYSCLQHLKCCPITTLKIDQSFVARL
PDDARDQTIVRAVIQLAHGLGMDVIFRRRLHQLIGRNGCCAASS
>SCV08 vsdF, virulence protein VsdF
MLRATKVCIYPTPEQAEHLNAQFGAVRFVYSKSLHIKKHAYQRHGVSLTP
RKDIKPLLAVAKKFRKFRKYAWLKEYDSIALQQAVINLDVAFSNCFNPKL
KARFPMFKRKHGKLLG
>SC159 ybgA, putative pathogenicity island protein
MTTKPVLGISGCLTGSAVRFDGGHKRMGFVMDELAQWVSFRPVCPEMAIG
LPTPRPAIRLTLTDSGETQLRFSKPPHDDITQKMADFTADYLPKIGDLSG
FIVCAKSPSCGMERVRLYDENGNRGRKEGVGLFTAALLETYPWLPVEEDG
RLHAPVLRENFIERVFALHELNTLRAKGLTRRALLDFHSRYKLQLLAHHQ
AGYREIGPFVASLHEWEDLDAFFVAYREKLMTILKKPASRKNHTNVLMHI
QGYFRNQLNSRQRVELRDVILHYRDGLLPILAPLTLLKHYMAEYPDRYLM
TQNYFDPYPDDLGLRLAVT
>SC127 ycfA, YcfA
MNETLNALICRHARNLLLAQGWPEETDVDQRNPNHPGWISIYVRLDAPRL
ATLLINRHGGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQV
SFPYAGEWLAEDEIRAVLAAVRDAVRCVSYQVADDARRIRAALTTTGQTL
LTRQTRRFRLVVKESDHPCWLDEDDENLPVVLDAIVNRGARFSAVEMYLV
SECVEHILSSGLACDVLRIPDEPPRRWFDRDVLREVVREARAEIRSMADA
LAKIRK
>SC123 ycgB, YcjB
MRGVVKWGRNRSPLRRLLMKSAILWRISRRFSWAVTTSYRAAETGDTGQA
VEAACQPARKPLAG
>SC125 ycgC, YcgC
MYCTVKEIIREVLNTDVPDSECVFAVVLTRGDVRHIAQDWSLTDDELETV
MQRLDDAFAHGADVSIVHDVVRELMEEKRASRQVTVPAVMLEKVMALAGS
EMKRLYAVGSENGGDGDAFVREEREAMDVVLQALDGEHMS
>SC122 ychA, YchA
MRGAAPAALPARLSGQKVVYGCMQGTGVIFRDTVTVSGCLLLRPLRLPAA
WRHPVVRPDSDRQLLAHPPVSAEPAGAVSGCRYGPMRENPACSDRSGRAA
PVAP
>SC111 ydfA, YdfA
MTVSSTISVFCRDGVFRTVYCHLHGEPTWNGRILHTHYATGQQAEALVEH
GDIRCLGPRCDKPAGHTLQNPVDGVTAYYGRDSGFRMDSEAREYRSFMEA
IATESTEEVRFHYVFIDGYWKVMYRTPEGWKMKALALALRRCPK
>SC110 ydfB, YdfB
MKPSIIFATAEYVKRLREECLRENKPLHRHTRFRRQELAQDEINPDVLAM
SGHIARRCSEQKRVRIPAMKVSEWGHLLRALEIERVCH
>SC108 ydgA, putative YhgA-like transposase
MSKKKNTTTPTPHDAVFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKL
EPATFVEPDLHQYASDILWSVKTTGGKDGYVYTLIEHQSTENLYMPFRML
RYSVAAMQRHLEQHKTLPLVIPVLFYHGERSPYPYSMNWLDCFENPALAA
KIYTKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQ
VMVEISDEQVRVLIHYIVNAGDSVSPEFMRALAERLPQHEDKLMTIAERL
EQKGRQEGRMEGALEKALAIACQLQKMGMTPEQIKQATGLSDDELKKIIH
>SC102 ydiA, YdiA
MMNQTLPTADLNTAGTTDVIPSVAIDRIIAQRNEGIALFMQAMECLATAR
KILLDASGDIFLYGFEDCVTDSVRRMDKPEEAKRNITRLADRKIWDRLMT
DTGMYTFMSSCQRDEWNSQLMSDTCPEITLDNVLATFRHLNASKMQTFEQ
GLIDVYRKLSWDYRTNNPCRLGKKIIIENLLYRWSNGRVTLDCSGREALD
DLVRPFYLLEGRNVPDFRNSIGAQYGEFLGNGDNVGKLLEGEYFTVRGYQ
KGTVHIVFKRSDLVEKLNDIIARHYPGALPPRV
>SC158 yedX, hypothetical protein
MKKTASLLMLTTLAFAPAAFSAPAGTLSVHILDQQTGMPPSYVTVTLEKQ
QQDKWTPIASGKTDHDGRIKSLYPEDQDMQPGVYKVAFKTADYFHGKKLD
SFFPEIPVLFTVTRTNEKLHIPLLLSQYGYSTYKGS
>SC106 ygeA, YgeA
MTGWELRIWRKSMLWSREKAAREFGVTQRTWHAWENAEQVDVTVWRTTQA
LSVRDLLPHMQGMRKADIIRRLENELGETAEDV
>SC119 ykfF, YkfF
MSEYFRILQGLPDGSFTREQAEAVAAQYRNVFIEDDQGTHFRLVVRQDGT
LIWRSWNFEDCAGYWMNQYIRDFGILK
>SC047 yqkA, hypothetical protein
MKIEIMEYNPDWTKNFEEEKIKLLHFFGSHAVAIEHIGSTAIPNQRAKPV
IDIFIGVSPFAELPFISAFLMQRSITTLRQI
>SCV45 ytl1, hypothetical protein pSLT049
MKPAPGAEPVRMYKSPYGGKYGVWRLADCVPMRAKRPQTEKQRLASTRLG
LQARMKSERGRFAMLAHTWLALGPVFLDTETTGLDAGAQALEIGLVNARG
ERIFETRLKPTVGIDPAAAAVQIVTEPYRFPVLSVRLA
>SCV42 ytl2, hypothetical protein pSLT051
MKKKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLES
GSFVEDDLRQYFSDVLYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYA
VAAMQRHLEAGHKKLPLVIPVLFYTGKRSPYPYSTRWLDEFDDTALADKL
YSSAFPLVDVTVIPDDEIAGHRSMAALTLLQKHIHQRDLAELVDRLAPIL
LAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGDALMTIAQQ
LEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKM
TGLTEDDLAQIRH
>SC046 yusZ, short chain dehydrogenase
MIPNSENKRVWFITGASKGLGYAFTCAALKAGDKVVAVARTIDNLAKLEE
TYQESLLPLNLDVTDREAVFSTVETAVKHFGRLDIVVNNAGIMTMGMIEE
LNESDARKLMDTNFFGALWVCQAVMPYLRSQRSGHIIQITSIGAIISGPM
SGIYSASKFALEGMSEALAKEAEHFGVKLTMVEPGGYWTDLYTSMSYSNP
LDSYGTLRDELAKQYSEDSVDSDPSLAAEALMKLVASNNPPLRLILGSMV
YDLAMDTLKARMATWEEWEAVSRASEKAIPAPERYGV