GenColors

Gene list

Applied filters:

Gene type: CDS
Genomic element: pO157

Number of genes found: 85

# Escherichia coli O157:H7 str. Sakai, Sakai

>pO157p70 hypothetical protein
MELKWSSKALSDLARLYDFLVLASKPAAARTVQSLTQAPVILLTHPRMGE
QLFQFEPREVRRIFAGEYEIRYELNGQTIYVLRLWHTRENR
>pO157p22 recombinase
MNNVIPLQNSPERVFLLPIAPGVDFATALSLRRMATSTGATPAYLLAPEV
SALLFYMPDQRHHMLFATLWNTGMRIGEARMLTPESFDLDGARPFVRILS
EKVRARRGRPPKDEVRLVPLTDISYVRQMESWMITTRPRRREPLWAVTDE
TMRNWLKQAVRRAEADGVHFSIPVTPHTFRHSYIMHMLYHRQPRKVIQAL
AGHRDPRSMEVYTRVFALDMAATLAVPFTGDGRDAAEILRTLPPLR
>pO157p71 hypothetical protein
MNYDEITKITAERISDYMTEAVNTDSIAVAEMFHNAAWGVRTLWFELVTK
IDIDIHKKNRYASYDLDR
>ECp088.2c hypothetical protein
MVTVNLHCPRCQSVQVYRHGKNPKGHDRFRCRDCHRVFQLTYRYEARKPG
VQDQITEMPFNGAGVRDTARTLKIGINTVIRTLKNARHEE
>pO157p59 transposase Tra5
MDELRAQGYHFKVKTVTESLRRHGLRAKASWNFSPVCYRAHSQPVSENLL
EQDFYASGPNQKWAGDITYCVPGVQGEHGGSNEPRVCLEY
>pO157p46 hypothetical protein
MTSAAITMTAPEAASPVQMYRATYSPDDNKLRLYAVSRLDPETYKKVHDA
GFRWAPKQALFVAPAWTPGREDVLLSLAGEIEDEDSTLAERQEARAERFT
GYSGKRASESAQALDEVERLAAMIPPGQPILVGHHSERRARRDAQRIENG
MKRAVMLFERAEYWEERGRSALLHAKYKERPDVRWRRIKKIEADLRKAEK
TIAQSQKYLTMWRAESLDLNMAKLISSHDHISACFPLDTYPRPAEKSQYE
GSRSLWSALDDDIITTEQAREIAIRCHERQIQHQQRWVNHYQNRLNYERA
MLDESGGVVTRTQDFEPGGQVFSRGEWLTIIRVNKSNGAVSSVTTPNYSF
LGYSGTMKVTPDRITDYKAPSAEEAAVASQAAKRPPVVNYPGEGFREMTK
AQWAALPRDCKAVRSVAEAEDHGAYRYRRTMDNNFRLVNVYITDMKITEI
PQK
>pO157p35 reverse transcriptase
MTEQATTCKGASLLNGDSWHSINWRQCYREVRRLQARIVKATREGKHGKV
KSLQWILTHSFSGRAVAVRRVTENSGKRTPGVDGQTWSSPEVKFLAINLL
KRRGYKPQPLKRVYIPKSNGKSRPLGIPTMKDRAMQALYLLALEPVAEVT
ADQRSFGFRTGRSTADAIAQCFCVLAQKTSAEWVLEGDIRGCFDNISHQW
LIDNTSTDRQILTKWLKAGYREKGQLFPVNSGTPQGGIISPVLANIALDG
LEALLASEFKKRTVKGRLVNPKVNYVRYADDFIITGESKELLESQVLPVV
RRFMAERGLMLSPEKTKITHIEEGFDFLGQNIRKYGGKMLIKPSKANVSS
FLKKIRAVIKGNKAMDQLTLIRMLNPMIKGWAAYHQHIVAKVAFNKVDNE
IWLALWRWAVRRHPNKGKKWIRKRYFHQQGARNWSFSTATGELLANGKPQ
YANLRKAIDTPINRFKPIKIAANPFDPQWEMYFEERCADKMRHKLQGRKK
LIQIWFEQRGRCPICDERITSDSQWQVHHIIRRVDGGSNCLSNLIMLHPM
CHTLVHAKGIHVVKPAHESGLRKA
>pO157p40 hypothetical protein
MNENTTLNALICRHARNLLLAQGWPEETDVDQRNPKYPGWISIYVLLDAP
RLATLLVNRHGGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGT
QVSFPYAGEWLTEDEIRAVLAAVRDAVRSVSCRVAEDTRRIRAALTTTGQ
TLLTRQTRRFRLVVKESDHPCWLDEDDENLPVVLDAILNRGARFSAVEMY
LVSDCIEHILSSGLACDVLRIPDEPPRRWFDRGVLREVVREARNEIRSMA
DALAKIRK
>pO157p47 hypothetical protein
MHSQLKERIRLMRARLDNAAPVAEIRAESQLFVTPAPVCDRLVTLAEISN
RDHILEPSAGTGAILRAIRDTAPEAMCDAVEINSGLVRYLRENFNGVRVQ
CGDFMEWQPVQYYSRVIINPPFSHGQDIRHILRAFSLLRPGGVLVAVCLN
GPRQQEKLLPFSDVREELPRGTFAYTDVPTMIIRLRA
>pO157p48 hypothetical protein
MAPDTTTGRHPPQRREARFAKRRCPLHPGGLLPGCRERRPRRDRGGATGT
VAASPGFASGMEARQGGDSSAGSVHDSPPRQGDARKRHKQENNKTDSNQS
SRKKKNSSSSSSSSSSSSSSMNNNIISNRQMTGLKRQKPTAECPRNVRGG
A
>ECp023 hypothetical protein
MWILNVFQVHKSVRIYIAHFSVFNCHLHPSIDRSLASWHYPNSPKNGKPA
LSPRHDWLNTGSLLQNVPLRIWFYPQ
>pO157p37 hypothetical protein
MTETGGQPPVSFPVKDVAGLLFLLRRLTRRGRNAACGQCPASGRARDGRF
YRSRLASVTVYASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRT
R
>pO157p63 hypothetical protein
MKITDHKLSEGIALTFRVPEGNIKHPLIILCHGFCGIRNVLLPCFANAFT
EAGFATITFDYRGFGESDGERGRLVPAMQTEDIISVINWAEKQECIDNQR
IGLWGTSLGGGHVFSAAAQDQRVKCIVSQLAFADGDVLVTGEMNESERAS
FLSTLNKMAEKKKNTGKEMFVGVTRVLSDDESKVFFEKIKARHPEMDIKI
PFLTVMETLQYKPAESAASVQCPVLVVIAGQDSVNPPEQGRALYDAVASG
TKELYEEADACHYDIYEGAFFERVVAVQTQWFKQYL
>ECp025.2n hypothetical protein
MKIYHYTDLNGLKGIIESGSLWATHFSFLNDSNELTHGMNCLENALQYLR
DDFNPKTLKFIEQAPL
>pO157p81 hypothetical protein
MHLNTGQNRPTFSWSALGWAIFYFGFFSTLLQVIIFSSGYSGTNGIRDSL
LFSCLWLIPVFLYPDRIKIIAAVVGFILWGTSLAALCYYFLYGHEFSQSV
LFVMFETNAREAGEYFSQYFSLKLLLISLVYTAVSVFLWTRLRPVYIPLP
WRRIVSFLLLYALLLHPVVLKSLIRQEPLNDTLGKLASRMEPAAPWQFVS
SYYQYHQQLNALTTFLNENSALPPLGNLRDESGERPRTLVLVIGESTQRE
RMSLYGYLRETTPELDALRKTDPGLTVFNNVVASRPYTIEALQQALTFAN
EKNPDLYLTQPSLMNMMKQAGYKTFWITNQQTITARNTMLTVFSRQTDRQ
YYMNQQRTQSAREYDTNVLKPFREVLNDPAPKKLIIVHLLGTHIKYKYRY
PEGQGRFDGITGHIPTGLNAKELEVYNDYDNANLFNDHVVASLIKDFRAT
APDGFLLYFSDHGEEVYDTPPYKTQGRNEDNPTRPMYTVPFLLWTSEKWH
AAHPRDFSQYVDRKYSLAELIHTWSDLAGLTYDGYDPTRSLVNPQFRETT
RWIGNPYKKNGLTDFDTLPYGEP
>pO157p31 transposase
MMPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQHDDW
LKREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTIHHSDKGSQY
VSLAYTERLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLGRLGHTPPAEAEKAYYASIGNDDLAA
>pO157p83 hypothetical protein
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>pO157p15 hypothetical protein
MICSPQNNTGAPMKKRNFSAEFKRESAQLVVDQNYTVADAAKAMDIGLST
MTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKAT
TALLMSDSLNSSR
>ECp042 hypothetical protein
MNYAGHEKLRADVAEVANTMCDLRARLNDMEHRCRFDSDVLVERLARQTL
YRANRLFMEAYTEILELDACFKD
>ECp054 hypothetical protein
MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIENDHGEQFRLVVRNNGA
MVWRTWNFEDGAGYWMNHVIRDFGIIK
>pO157p43 hypothetical protein
MYCTVKEIIRDVLDTDVPDSECVFAVVLTRGDVRHIAQDWNLSDDELETV
MQRLDDAFEYGADVSIVHDVVRELMEEKRASRQVTVPAVMLEKVMALAGS
EMKRLYAVGSENGGDGDAFVREEREAMDVVLQALDGEHMS
>pO157p45 hypothetical protein
MTVSIVSPSAAAVKPRRHPRFRREDIPAPEIDPVLKAFGRHIARSFHRGR
GVHIPAMKNTAFGQVLRTLELKRAFN
>pO157p50 hypothetical protein
MSVTESKAKTERKSSRKPAKTQETVLSALLAQTEEVSVPLASLIKSPLNV
RTVPYSAESVSELADSIKGVGLLQNLVVHALPGDRYGVAAGGRRLAALNM
LAERDIIPADWPVRVKVIPQELATAASMTENGHRRDMHPAEQIAGFRAMA
QEGKTPAQIGDLLGYSPRHVQRMLKLADLAPVILDALAEDRITTEHCQAL
ALENDTARQVQVFEAACQSGWGGKPEVQTIRRLVTESEVAVAGNSKFRFV
GADTFSPDELRTDLFSDDEGGYVDCVALDAALLEKLQAVAEHLREAEGWE
WCAGRMEPVGFCREDAGTYRSLPEPEAVLTEAEEERLNELMARYDALENQ
CEESDLLEAEMKLMRCMAKVRAWTPEMRAGSGVVVSWRYGNVCVQRGVQL
RSEDDVADNDYRTEQVQEKASVEEISLPLLTKMSSERTLAVQAALMQQSD
KSLALLAWTLCLNVFGSGAYSNPARIRLECEHYSLTSDAPSGKEGAAFMA
MMAEKARLAALLPDGWARDMTTFLSLSQEVLLSLLSFCTACSIHGVQTRE
YGHTSRSPLDTLESAIGFHMRDWWQPTKANFFGHLKKPQIIAALNEAGLS
GAARDAEKMKKGDAAEHAEHHMKDNRWVPGWMCAPRPQTDATERTDNLAD
AA
>pO157p16 hypothetical protein
MFGVHRSSYRYWKNRPEKPDGRRVVLRSQVLELHNISHGSAGARSIATMA
TLRGFRMGRWLAGRLMKELGLVSCQQPAHRYKRGGREHVTIPNHLGRQFA
VTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTI
KALKMAWEIRSKPAGVMFHSDQGSHYTSRQFRQLLWRYQIKQSLSRRGNC
WDNSPMERFFRSLKNEWIPVTGYMNFSDAAHEITDYIVGYYNALRPHEYN
GGLPPNESENRYWKNSKAVASFC
>pO157p65 hypothetical protein
MRKYIPLVLFIFSWPVLCADIHGRVVRVLDGDTIEVMDSRKAVRIRLVNI
DAPEKKQDYGRWSTDMMKSLVAGKTVTVTYFQRDRYGRMLGQVYAPDGMN
VNQFMVRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWIWR
HRK
>pO157p36 hypothetical protein
MYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDV
RFEPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQ
SGTGRRRWEELPDDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNV
LPEKTMPFDLLTVLSTRLDVEVNGFNGGVLNGVPSAYHWYTEQYGVKWPC
GYEVNISSQGDNFIQVDFDTPWCQPESDVIAVLSRRFSCMLEHWYAGQGC
NFCGWQRYERGELVDVLWGELEWSSPTDDDELPEVTAPEWIVDKVAHYGG
>pO157p41 hypothetical protein
MCCVYRMNRPAGGLTVVFCGRWSGKPGTKSAAWRMPWQKSGNDDVGGVNP
RLFPAGPRTEHKGSRRWLRFTRPCRAWPCGFSVMWPQRRVRAVPCLHLSR
AGVDARVRFAAAVTRSLLPVCRDFPVVRPLRFRGLTLQLPSAVCVRLRLP
LRPVHPRLIARLFWRHGTARCRGICEQRRKTSCNMRNLSL
>pO157p55 hypothetical protein
MVLRSIWVEEQYMGVKKIVILAILVSFVAGCAPLHPSNCHKTTALGSCSS
GRWDDQDEWGEQARGIRDAINAKLDDPQKWKRKKCRLHMEFSRDGTALKI
STSNGDKAYCEAIKSAAHKAKFPAFNNPEVYRDFQKSGFDMRG
>pO157p27 KfrAs
MYELYCMYCVFFIPGIRWRHSLRNRSQTMTSLSPDTVRRIEDAAAALIAA
GTPNPTNEQVRQHLGGGSLSHISPVMRAFRARQREQATPLPPELAQLLTG
QLGLLWQAAVKQAEAGALAAREQADDDIARADKERDEALANVAALESELA
VLREVVAERDRLLQEVRELRAEALPLREQVARLTATGEHLAAQLQDTKAE
LKEAREDGRQLQTELLALARQDGKVKK
>pO157p39 hypothetical protein
MCLLNAWHARRCIAPIACSWRRIPKFWNWMRALKIEKETRMYGTCETLCR
ELAAKYPGYTPLMLVIWSPEEIQALADGMDISLSDPEIRTVLARLEDIPE
DQRIESGISSAAAMEIISNVSENRQVTVPAELLASLIQTAEQALWKREWA
ARDYGLAVPECVTRRQAVVNQARILLKNNTHENG
>ECp072 hypothetical protein
MEKTKQEWLYQLRRCSSVNTLEKIIHKNRDSLSTSERESFNSAADHRLAE
LITGKLYDRIPKEIWKYVR
>pO157p23 hypothetical protein
MPSPQGRQCTQDLRSIPPVTCERHCQRGSHIQRKHSGINLHRTWIPVTCQ
GLDDFPGLAVIEHMHDIAVPEGVWCDRNRKVYSVSFGPSDSLLQPVAHGF
VGHGP
>ECp070 hypothetical protein
MNGFRNSSRNGQVWRYQRAGSRAVILEVSGRWMEAAEAWRRAAGVAPRTD
WQQFARKRAEHCHRRCRGRG
>pO157p38 hemagglutinin-associated protein
MSRFIQGDCVRVMATIPGNGVDFILTDPPYLVGFRDRQGRTIAGDKTDEW
LQPACNEMYRVLKKDALMVSFYGWNRVDRFMAAWKNAGFSVVGHLVFTKT
YTSKAAYVGYRHECAYILAKGRPALPQKPLPDVLGWKYSGNRHHPTEKPV
TSLQPLIESFTHPNAIVLDPFAGSGSTCVAALQSGRRYIGIELLEQYHRA
GQQRLAAVQRAMQQGAANDDWFMPEAA
>ECp021.1n hypothetical protein
MLFKQTMIQFLSLKIVNGMDSIKLLDKVIIYQQQNEQLYPAQEHLLLQLC
MRVTKKLTDILMPP
>pO157p30 resolvase
MSGSVIHSQSAVMVPAVYSAGQPASLPVAIDYPAALALRQMSMVHDELPK
YLLAPEVSALLHYVPNLHRKMLLATLWNTGARINEALALTRGDFSLAPPY
PFVQLATLKQRTEKAARTAGRMPAGQQTHRLVPLSDSWYVSQLQTMVATL
KIPMERRNRRTGRTEKARIWEVTDRTVRTWIGEAVAAAAADGVTFSVPVT
PHTFRHSYAMHMLYAGIPLKVLQSLMGHKSISSKEVYTKVFALDVAARHR
VQFAMPESDAVAMLKQLS
>pO157p82 lipid A biosynthesis lauroyl acyltransferase
MVPPSAVLCYHNEISRQIPVNMKNIRTEFIPRFNLTLCFPRYWMTWTGIG
IICVFAMVPPALRDPLLGKLGMLVGRLGKSARQRALINLSLCFPEYSDKE
KENIVDAMFATASMAVVLMAELALSGPDKISHRIRWNGLEIVEKMAQNNE
KVIFLVPHAWGVDIPAMLMAASGRKMAAMFHNQRNPVVDYVWNSVRRRFG
GKLHARNDGIASFVRSVRQGYWGYYLPDQDHGPEFSEFADFFATYKATLP
VIGRLSRISGARIIPLFPVYDGKTHHLTIHVSPPLAIRQKSDAHIARQIN
EVVENFVRPHPEQYTWILKLLKTRKEGEEDPY
>pO157p66 hypothetical protein
MKLIIFILIVLIIAALLIRIILRSVNQHSPLLMQLHAAGIRTGDAERILS
GGEYDASHLRPERRRRDILLEEVLRQAGIPLLRSHDARKLLQMTGEWLNT
TGAAQQSPEHRS
>pO157p57 transposase
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMAEMDYIDMRLAENVWNGEVILAEIKAMGYTGGRSMLLRQFRKTPEQRF
TQEHSLFFRLLNRRYEKASIVLTSNKGFADWGRDVR
>pO157p26 hypothetical protein
MRQERVLRIKAFRRALELGGGERADGCRHFPRWKTELRNFPHSSCDLASN
TLAYYLKERERCHPCIIFMEGNAGFHEAENSTVHGHVIVLLDGEYIDLTL
DQFDEYPDYIPYEPVESDGQIGKLLRNIMKHEDPVKTRQIDLDGGEELYA
WLRDTADEVLAADPEWQARERSIEEAREAAITVFPFLSDVQKTECEQGPD
SQEAAR
>pO157p69 hypothetical protein
MQMKNNTAQATKVITAHVPLPMADKVDQMAARLERSRGWVIKQALSAWLA
QEEERNRLTLEALDDVTSGQVIDHQAVQAWADSLSTDNPLPVPR
>pO157p44 hypothetical protein
MCPECFFLMLFFCGYRACYCSSSFSSSSSSSSFRSSPAYGFSGRPPGGAG
CRERSQRSCLRPGGLPSLTRNPGLQRPFRSRSLCRAVACAPGIPAKGRRD
VRGNAVSQTALHVVAAGPCSLPAGCHTPV
>pO157p25 hypothetical protein
MIPTQVLKSLSISRRGPVISTLSNPSARARERRVSSSVASVRDSVFFLLF
FSDLRVGTKTPRRISATGCTVLLLFGVSVTSSPDDLSTSPNSFLMSEFSF
SRKPPACGQ
>pO157p56 transposase Tra5
MDELRAQGYHFKVKTVTESLRRHGLRAKASWNFSPVCYRAHSQPVSENLL
EQDFYASGPNQKWAGDITYLRTDEGWPYLAVVTCHYWLINVVTHDGTGAP
GTLCYRMSFAKMTDRYTSI
>pO157p74 transposase
MLPRFADIFQQGNRWLNWLEKQPEGSVRPVVIESVTKIMACGTTLMGYTQ
WCCSSPDCSHIKKVCFRCKSRSCPHCGVKAGAQWIQYLLSLVPDCPWQHI
VFTLPCQYWSLVFHNRWLLAEMSRIAADVIQEICRQADVRLRTRTTICQL
RQIC
>pO157p77 transposase
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKIRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>pO157p79 hypothetical protein
MLRWSEVREMRDSGLVEFHVHTHSHKRWDRLSVSRAEQCRLMKEDILVGK
QCLTEKLGFCSSHLCWPEGYYNRDYINLAGKLGFSYLYTTERRMNCPENG
SLRIGRISTKEREHSGWLKRRLFYYTTPLFSSVLALHKGPRLPDN
>pO157p80 RfbU-like protein
MMKILFTESSSDIGGQELQALAQMTALQKQGHSVLLACREKSKIAPEARK
RGHDVTFIPFRNSLHLPSILRLRRIIGEFKPDLVICHSGHDSNIAGLSRL
ICCHRFSIVRQKTYITRKTRTFSLNYLCDFIVVPSSAMMAHLMAEGVRTP
VTVIPPGFDWPALHNEAMRPLPLHIHAWAASADNVPLIVQVGMLRPEKGH
EFMLRVLYQLKMEGKSFRWLVVGAGREEYEARLRQQTEHLGMSGDVLMAG
ALFPALPVYRIASVVVMPSENEAFGMVLAEASVSGVPVIASETGGIPDVI
QKNVTGTLLPVGDVSAWTGALRDFLSRPERFRMMAASAREDIEYRFDINR
TAQIIVSLASQAKGKCNR
>pO157p60 IS629, hypothetical protein
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRVQFFRLLNRRYEKASI
ILTSNKGFADWGEMFGEHVLATAILDRLLHHSTTLNIKGESYRFNIPDRL
VVHLSLHARPELRAHNT
>pO157p78 espP, EspP
MNKIYSLKYSHITGGLIAVSELSGRVSSRATGKKKHKRILALCFLGLLQS
SYSFASQMDISNFYIRDYMDFAQNKGIFQAGATNIEIVKKDGSTLKLPEV
PFPDFSPVANKGSTTSIGGAYSITATHNTKNHHSVATQNWGNSTYKQTDW
NTSHPDFAVSRLDKFVVETRGATEGADISLSKQQALERYGVNYKGEKKLI
AFRAGSGVVSVKKNGRITPFNEVSYKPEMLNGSFVHIDDWSGWLILTNNQ
FDEFNNIASQGDSGSALFVYDNQKKKWVVAGTVWGIYNYANGKNHAAYSK
WNQTTIDNLKNKYSYNVDMSGAQVATIENGKLTGTGSDTTDIKNKDLIFT
GGGDILLKSSFDNGAGGLVFNDKKTYRVNGDDFTFKGAGVDTRNGSTVEW
NIRYDNKDNLHKIGDGTLDVRKTQNTNLKTGEGLVILGAEKTFNNIYITS
GDGTVRLNAENALSGGEYNGIFFAKNGGTLDLNGYNQSFNKIAATDSGAV
ITNTSTKKSILSLNNTADYIYHGNINGNLDVLQHHETKKENRRLILDGGV
DTTNDISLRNTQLSMQGHATEHAIYRDGAFSCSLPAPMRFLCGSDYVAGM
QNTEADAVKQNGNAYKTNNAVSDLSQPDWETGTFRFGTLHLENSDFSVGR
NANVIGDIQASKSNITIGDTTAYIDLHAGKNITGDGFGFRQNIVRGNSQG
ETLFTGGITAEDSTIVIKDKAKALFSNYVYLLNTKATIENGADVTTQSGM
FSTSDISISGNLSMTGNPDKDNKFEPSIYLNDASYLLTDDSARLVAKNKA
SVVGDIHSTKSASIMFGHDESDLSQLSDRTSKGLALGLLGGFDVSYRGSV
NAPSASATMNNTWWQLTGDSALKTLKSTNSMVYFTDSANNKKFHTLTVDE
LATSNSAYAMRTNLSESDKLEVKKHLSGENNILLVDFLQKPTPEKQLNIE
LVSAPKDTNENVFKASKQTIGFSDVTPVITTRETDDKITWSLTGYNTVAN
KEATRNAAALFSVDYKAFLNEVNNLNKRMGDLRDINGEAGAWARIMSGTG
SASGGFSDNYTHVQVGVDKKHELDGLDLFTGFTVTHTDSSASADVFSGKT
KSVGAGLYASAMFDSGAYIDLIGKYVHHDNEYTATFAGLGTRDYSTHSWY
AGAEAGYRYHVTEDAWIEPQAELVYGSVSGKQFAWKDQGMHLSMKDKDYN
PLIGRTGVDVGKSFSGKDWKVTARAGLGYQFDLLANGETVLRDASGEKRI
KGEKDSRMLMSVGLNAEIRDNVRFGLEFEKSAFGKYNVDNAVNANFRYSF
>pO157p02 etpC, EtpC
MLFFLSFRGDRGLFIKDIVLKMLTPNRLLCVILLIAGYQLVSVIHHFWLT
QAASVPGLSRVSAPETAVTGDQTEERFVFTLFGRASPLSSEGRAQETMPS
LSDDLLSGEDLDVRGILYSSVAEHSVAIFAHNNRQFSLSVGEKVPSYDAT
ISAIFSDHIVINYQGKTVSLPLRYDNTEKKNAYDNNNLTVGDVITQDNFR
VESVFDIMSFSAVTVNNTLSGYRLIPGKHSSLFYNAGLHDNDLAVSVNGS
ELRDTRQAQQIMKQLPELKEIKITVERDGQLYDAFIAVGEN
>pO157p03 etpD, EtpD
MLNEEQYYQFFLSVLDVYGFAVVDMHNGILKVVRSKDAKTSAVPVASDVS
PGTGDEVVTRVVPVSNVAARDLAPLLRQLNDNAGAGSVVHYEPSNVLLMT
GRAAVMKRLMEIVERVDKVGNRSVATVPLTYASATDVARLVTELTKETDK
TAIPAWMTAKLVADERTNSVLVSGEPISQQRIISIIKQLDRQEDVQGNTK
VIYLKYAKAKDLVEVLTGISSSIENDSKKSPSTEALRKGVTIKSHEQTNA
LILTGAPDVIRDLENVISQLDIRRPQVLVEAIIAEIQDADGLNLGIQWVN
KHAGVAQFTSTGLPITTMVQTRQNEILDSDQSNALSMFNGIAAGFYQGNW
AMLLTALSTSSKNDILATPSIVTLDNMEATFNVGQEVPVLSGSQTTSGDN
IFNTVERKTVGIKLRVKPQINEGDSVLLEIEQEVSGVADTAVATTTDLGA
TFNTRTVTNAMLVGNGETVVVGGLLDKSIRGSESKVPLLGDIPVLGHLFR
AKSEQTAKRNLMLFIRPTIIRERDGFRHASAEKYQSFNQEQVQSRGKETT
ALTLNEEQLRLSPDQDDTAFRKVKAAIAAFYAQEM
>pO157p04 etpE, EtpE
MSRVVQNVSESRPLLPFSFSRTQRILLLREQEGNRVFCMEDTPASALLEV
RRVAEGPLNVTTVSAEAFEKQLVSSYQRDSDEARQMMAEIGNEMDFYTVA
GELPDREDLLDANDDAPIIRLINAMLTEAIKEKASDIHIETYERHLQVRF
RIDGVLREILRLHRNLASLLISRIKVMARLDIAEKRVPQDGRMVLRIGGR
AVDVRVSTLPSNHGERIVLRLLDKNSVSLDLAALGMSQQNQRHIDALIRR
PHGIILVTGPTGSGKSTTLYAALSLLNPRDRNIMTVEDPVEYELDGISQT
QVNPKVDMTFARSLRAILRQDPDVVLVGEIRDGETAQIAVQASLTGHLVL
STLHTNSAAGALSRLQDMGIPPFLLSTSLLAVLAQRLVRTLCPRCRQPCQ
VSTELAMDMDIPPETTIWQPAGCQHCSFTGYHGRTGIHELLLIDDRIRTA
IYQGEGELGITRLAGSRYLTLRGDGRQKVLAGETSWEEVVRVTESRLQEE
E
>pO157p05 etpF, EtpF
MALFHYQASDIHGRKRSGILEADSARHARQLLREQALIPVRLDEKQVHHK
HSLRSILRFRPRGGSSAELALLTRQLATLVAASLPLEEALDALLRQSEKP
RQRNLIAAVRTKVLEGHSLAAAMGMFPGTFERLYCAMVAAGETSGRLDVV
LSRLADYTEQRQIMRNRLLQALLYPCVLTLVAVGVIAILLTAVVPKVVEQ
FIHMKQTLPLSTRVLMGAAEVSQTWGPWLLLAAALGGIAGRMILHQPSQR
LAFHHLLLRLPVVGRISRGLNTARYARTLSILNASAVPLLQAMHISGDVL
SNDWARHQLATAAELVREGVSLHQALEQTSLFPPMMRHMIASGENSGELD
SMLERAADNQDREFSTQMQLALGLFEPLLVVGMAGVVLFIVLAILQPLLQ
LNNMMNM
>pO157p06 etpG, EtpG
MRKQHQRGFTLLEIMVVIVILGVLASLVVPNLMGNKDKADRQKVMSDLVA
LESTLDMYRLDNNRYPTTEQGLRALVSKPTVQPEPRNYRQDGYIRRLPQD
PWGGDYQLLNPGQYSDIDIFSPGPDGVPNTEDDIGNWTLGNAQP
>pO157p07 etpH, EtpH
MSQRGFTILEMMLILLLMGITAGLVLMSFPDSAQNHLQQQRERLQAQLDY
ALDRSQQDGLLMGIQVRFDGWKFKVLQRGTAESPPTLAEGGDIWQGYVWQ
TWQPRRAAMGGKLPDDVRLELQYLRGLQWMSEHDDGAEPDILLLPGGEVT
PFRLLFRQVGEEAVVGLQVDENGLMTLFEGEVSL
>pO157p08 etpI, EtpI
MTLLEVIVALVVFALAGMALMQASTQQAAGIGRMEEKVLAGWLADNQMVQ
LQLEKTWPENGWGEKTISFAGTEWYLRWGGPDSDVPQPRSLEVEVRRTKE
ETSALVSLRSSVVRE
>pO157p09 etpJ, EtpJ
MSQQRVKGFTLLEMLLALAVFAALSISAFQVLQGGIRAHELSRDKVQRLA
ELQRGISQMERDLTQMLPRHSRGNEGLLLAAPHLLKSDDWGISFTRNSWL
NPAGMLPRPELQWVGYRLRQQKLERLSYFHVDHPSGVSPDVRVMLDGVHA
FRLRFFVNGDWQARWDSTGILPQAVEVTLVMDDFAELPRLFLVSKETAE
>pO157p10 etpK, EtpK
MKLREQGVALLVVLLILSLMVTIAAVIAERNGRTFLRTVAQLDQLQAKWD
GYTAETIAKQILQRSRQESPRKTHLAQNWAQSERQFETRGGDVRGQIVDA
QACFNLNAINYGVVDLTSIPYAARIFQQLLINLQVELLQARQVTAALRDW
IDRDDKPVRGGAEDEVYMGMEPPYLAANQPMQDVSELRLIRGIDARLYRK
LLPYVCVLPTSDLSVNVNTLLDSQAPLLAALFLTKPDSLPVTELLQRRPR
TGWESVAAFLDAPPLKDIDTSAAMPVLAVSSNYFLVRLHVRSGEHLFSQQ
TLMQWREERFRIIQRQYGLTMREVP
>pO157p11 etpL, EtpL
MHRWLAYCGQLGVREVRLLPDVLMLPLATEGWSAVKLAGQWLFRRDRYAG
MTVELSWLTHFLSLTAPPVIESYSVPPVPCPGPKTTEWRNQPGRDLLQLV
AEGDGYDGADLRQGEFARQGTWYARFRPWRYVAGALLACVFLAGSNAGLA
HYRLWQQAQFWRYESVRVYQQLFPSEKAVKDPRRQMLRHLQQSTGNNIPE
LGSVMRQLQQLLSETSTIRLQALAWDSSSQTLKIDLQAASFQALEHFQQI
AGEKYLIQPGEIRQQPQGVESRLILRVNNERA
>pO157p12 etpM, EtpM
MNELKKRLQYVSPRERRLLIGCVALLVVVFVYYVLWQPWLIREDKWRTVV
TREKNTVEWMKQQVPNIKRREASMPEQGEPLGLSAAVTRTSAAYGVSVTR
LQPQGERLAVTLAPVEFNALMQWLTQLERRHRIRTMVFDVAAQGNPPGQV
TVNRLVLSQDDVNKDASLSR
>pO157p13 etpN, EtpN
MLERRWRREALTLLAHSVPESGPAFNLMIPQSHCPCCFHPLGIRDNIPLL
SYLFLKGKSRCCGERIPVYYPLVEITNSVLFILAASRFPPGLTLAAAWLF
ISMLLVLAVIDCHTALLPDVLTLPLLWLGLLFSLQRGVVTLEEAVVGAVS
GYLCLWGLYWLFRFATGREGLGYGDFKLAAALGAWMGWQALPSILLFASV
SGLIVTVLLRIVTATDFTRPLPFGPWLALSGVCHFLLM
>pO157p14 etpO, EtpO
MMGNILKKLNCIASLLVLVTISGCHQSPSIHKQATVPPSEQLEQMASIVS
ATRYLKMRCNRSDLPDEQSILNVANRIAIGKGWQSLTQEDIRKHSDDIYV
RLTRDSTPEYIKCREFNRRLVPFIGELLARGRG
>pO157p64 finO, fertility inhibition protein
MAEQKRPVLTLKRKTEGETPVRSRKTIINVTTPPKWKVKKQKLAEKAARE
AELAAKKAQARQALSIYLNLPTQDEAVNTLKPWWPGLFDGDTPRLLACGI
RDVLLEDVAQRNIPLSHKKLRRALKAITRSESYLCAMKAGACRYDTEGYV
TEHISQEEEAYAAARLDKIRRQNRIKAELQAVLDEK
>pO157p18 hlyA, hemolysin A
MTVNKIKNIFNNATLTTKSAFNTASSSVRSAGKKLILLIPDNYEAQGVGI
NELVKAADELGIEIHRTERDDTAIANQFFGAAEKVVGLTERGVAIFAPQL
DKLLQKYQKVGSKIGGTAENVGNNLGKAGTVLSALQNFTGIALSGMALDE
LLRKQREGEDISQNDIAKSSIELINQLVDTVSSINSTVDSFSEQLNQLGS
FLSSKPRLSSVGGKLQNLPDLGPLGDGLDVVSGILSAVSASFILGNSDAH
TGTKAAAGIELTTQVLGNVGKAVSQYILAQRMAQGLSTTAASAGLITSAV
MLAISPLSFLAAADKFERAKQLESYSERFKKLNYEGDALLAAFHKETGAI
DAALTTINTVLSSVSAGVSAASSASLIGAPISMLVSALTGTISGILEASK
QAMFEHVAEKFAARINEWEKEHGKNYFENGYDARHAAFLEDSLSLLADFS
RQHAVERAVAITQQHWDEKIGELAGITRNADRSQSGKAYINYLENGGLLE
AQPKEFTQQVFDPQKGTIDLSTGNVSSVLTFITPTFTPGEEVRERKQSGK
YEYMTSLIVNGKDTWSVKGIKNHKGVYDYSKLIQFVEKNNKHYQARIISE
LGDKDDVVYSGAGSSEVFAGEGYDTVSYNKTDVGKLTIDATGASKPGEYI
VSKNMYGDVKVLQEVVKEQEVSVGKRTEKIQYRDFEFRTGGIPYDVIDNL
HSVEELIGGKHDDEFKGGKFNDIFHGADGNDYIEGNYGNDRLYGDDGDDY
ISGGQGDDQLFGGSGNDKLSGGDGNNYLTGGSGNDELQAHGAYNILSGGT
GDDKLYGGGGIDLLDGGEGNDYLNGGFGNDIYVYGQNYGHHTIADEGGKG
DRLHLSDISFDDIAFKRVGNDLIMNKAINGVLSFNESNDVNGITFKNWFA
KDASGADNHLVEVITDKDGREIKVDKIPHNNNERSGYIKASNIASEKNMV
NITSVANDINKIISSVSGFDSGDERLASLYNLSLHQNNTHSTTLTTTV
>pO157p19 hlyB, hemolysin B
MMSKCSSHNSLYALILLAQYHNITVNAETIRHQYNTHTQDFGVTEWLLAA
KSIGLKAKYVEKHFSRLSIISLPALIWRDDGKHYILSRITKDSSRYLVYD
PEQHQSLTFSRDEFEKLYQGKVILVTSRATVVGELAKFDFSWFIPSVVKY
RRILLEVLTVSAFIQFLALITPLFFQVVMDKVLVHRGFSTLNIITIAFII
VILFEVILTGARTYIFSHTTSRIDVELGAKLFRHLLALPVSYFENRRVGE
TVARVRELEQIRNFLTGQALTSVLDLFFSVIFFCVMWYYSPQLTLVILLS
LPCYVIWSLFISPLLRRRLDDKFLRNAENQAFLVETVTAINTIKSMAVSP
QMIATWDKQLAGYVASSFRVNLVAMTGQQGIQLIQKSVMVISLWMGAHLV
ISGEISIGQLIAFNMLAGQVIAPVIRLAHLWQDFQQVGISVERLGDVLNT
PVEKKSGRNILPEIQGDIEFKNVRFRYSSDGNVILNNINLYISKGDVIGI
VGRSGSGKSTLTKLLQRFYIPETGQILIDGHDLSLADPEWLRRQIGVVLQ
ENILLNRSIIDNITLASPAVSMEQAIEAARLAGAHDFIRELKEGYNTIVG
EQGVGLSGGQRQRIAIARALVTNPRILIFDEATSALDYESENIIMKNMSR
ICKNRTVIIIAHRLSTVKNANRIIVMDNGFISEDGTHKELISKKDSLYAY
LYQLQA
>pO157p17 hlyC, hemolysin C
MGKVAWLWACSPLHKKWPLSVFAINVIPAIQTNQFALLIKDELPVAFCSW
ASLDLECEVKYINDVTSLYAKDWMSGERKWFIDWIAPFGHNMELYKYMRK
KYPYELFRAIRLDESSKTGKIAEFHGGGIDKKLASKIFRQYHHELMSEVK
NRQDFNFNIEKEN
>pO157p20 hlyD, hemolysin D
MRFYMKGLWDLVCRYKTVFSDVWKIRHTLDAPVREKDEYAFLPAHLELIE
TPVSRRSHFVVWSILLFVIISLLLSVLGKVEVVSVANGKFTHSGRSKEIK
PIENAIVEKIMVKDGSFVKKNDPLVELTVPGVESDILKSEASLLYEKTEQ
YRYAILSESIQRNELPEIRITDFPGGEDNAGGEHFQRVSSLIKEQFMTWQ
NRKNQKQLTLNKKIVERDAALARVSLYEHQVSQEGRKLNDFKYLLNKKAV
SQHSVMEQENSYIQAKNEHAVWLAQVSQLEKEIELVREELALETNIFRSE
IIEKHRKSTDNIVLLEHELEKNRQRKASSFIKAPVSGTVQELNIHTEGGV
VTTAETLMIIVPDNDILEVTASVLNKDIGFIQPGQEVVIKVDAYPYTRHG
YLTGKVKNITADSVSVPDTGLVFNVIISVDRNDIQGERKKIPVTAGMTVM
AEIKTGVRSVISYLLSPLKETINESLRER
>pO157p42 klcA, KlcA
MMPEYQGGFWHFIRLPDGGGYMMPDGDRFHMVNGANWFDRTVSADACGII
LTSLVINRQLWLYHDSGDAGLTQLYRMRDAQLWRHIEFHPECNAIYAALD
>pO157p28 letA, LetA
MKQRITVTVDSDSYQLLKAYDVNISGLVSTTMQNEARRLRAERWKAENQE
GMAEVARFIEMNGSFADENRDW
>pO157p29 letB, LetB
MQFKVYTYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLSDKVSREL
YPVVHIGDESWRMMTTDMASVPVSVIGEEVADLSHRENDIKNAINLMFWG
I
>pO157p54 nikB, NikB
MSIAGLREEPAQDWLSLHKRLASDGLYITMQEGELVVKDGWDRAREGVAL
SSFGPLWTAEKLGRKLGEYQPVPTDIFSQVGTPGRYDPEAINVDIRPEKV
AETESLKQYACRHFAERLPEMARNGELESCLDVHRTLATAGLWMGIQHGH
LVLHDGFDKQQTPVRADSVWPLMTLDYMQDLDGGWQPVPKDIFTQVIPGE
RFRGRNLGTQAVSDYEWYRMRMGTGPQGAIKRELFSDKESLWGYTTVQCE
SLIEDMIAGGNFSWQACHEMFARKGLMLQKQHHGLVIVDAFNHELTPVKA
SSIHPDLTLSRAEPQAGPFEIAGADIFERVKPECRYNPELAASDEVEPGF
RRDPVLRRERREARAAAREDLRARYLAWKEHWRKPDLRYGERLREIHAAC
RRRKAYIRVQFRDPQLRKLHYHIAEVQRMQALIRLKESVKEERLSLIEEG
KWYPLSYRQWVEQQAVQGDRAALSQLRGWDYRDRRKDKRRTTNADRCVIL
CEPGGTPLYEDTGVLEARLQKDGSVRFRDRRNGELVCVDYGDRVVFYHHQ
DRNELVDKLNLIAPVLFDREPGMGFEPEGSYQQFNDVFAEMVAWHNAAGI
TGNGHFVISRPDVDLHRQRSEQYYHEYIRQQKSISGGHGASYAPVQDNEW
TPPSPGM
>pO157p21 papX, PapX
MLLALLSSTDNFCLSSTELSERLDVSRTYITRACDSLEKFGFIKRMESKE
DRRSKNIYLTSDGNLYLQRTTRIYGRYLKKYGATLQMMKSKHLK
>pO157p53 parB, stable plasmid inheritance protein
MKLPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYES
GK
>pO157p52 psiA, PsiA
MSARSQALVPLSTEQQAAWRAVAETEKRRNQGNTLAEYPYAGAFFRCLNG
SRRISLSDLRFFMPSLTAEELHGNRLQWLYAIDVLIETQGEVCLLPLPGD
AAERLFPSVRFCVRERSRHKSALVMQKYSRQQAREAEQKARAYQALVAQA
EIELAFHSPETVGSWYARWSDRVAEHDLETLFWQWGERFPSLAGMERWQW
QDMPFWQVIAEASLAAKEAGHAVREMERWMVPNKLREVRDAATITRIARK
QPE
>pO157p51 psiB, PsiB
MKTELTLNVLHTMNAQEYEDIRAAGSDARRELTHAVMRELDAPDNWTMNG
EYGSEFGGFFPVQVRFTPAHERFHLALCSPGDVSQVWVLVLVNAGGEPFA
VVQVQRRFAPEAVSHSLALAASLDTQGYSVNDIIHILMAEGGQV
>pO157p68 repA1, replication protein
MTDLQQTYYRQVKNPNPVFTPREGAGTLKFCEKLMEKAVGFTSRFDFAIH
VAHARSKGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIE
CGLATESAAGKLSITRATRALTFLSELGLITYQTEYDPLIGCYIPTDITF
TPALFAALDVSEDAVAAARRSRVEWENRLRKKQGLDTLGMDELIAKAWRF
VRERFRSYQTELKSRGIKRARARRDAGRERQDIVTLVKRQLTREISEGRF
TANREAVKREVERRVKERMILSRNRNYSRLATASP
>pO157p67 repA2, replication protein
MSQTENAVTSSSGAKRAYRKGNPLSDAEKQRLSVARKRASFKEVKVFLEP
KYKAMLMQMCHEDGLTQAEVLTALIKSEAQKRCV
>pO157p24 repFIB, RepFIB
MDKSSGELVTLTPNNNNTVQPVALMRLGVFVPTLKSLKNSKKNTLSRTDA
TEELTRLSLARAEGFDKVEITGPRLDMDNDFKTWVGIIHSFARHNVIGDK
VELPFVEFAKLCGIPSSQSSRRLRERISPSLKRIAGTVISFSRTDEKHTR
EYITHLVQSAYYDTERDIVQLQADPRLFELYQFDRKVLLQLKAINALKRR
ESAQALYTFIESLPRDPAPVSLARLRARLNLKSPVFSQNQTVRRAMEQLR
EIGYLDYTEIQRGRTKLFCIHYRRPRLKAPNDESKENPLPPSPAEKVSPE
MAEKLALLEKLGITLDDLEKLFKSR
>pO157p33 sopA, SopA
MKLMETLNQCINAGHEMTKAIAIAQFNDDSPEARKITRRWRIGEAADLVG
VSSQAIRDAEKAGRLPHPDMEIRGRVEQRVGYTIEQINHMRDVFGTRLRR
AEDVFPPVIGVAAHKGGVYKTSVSVHLAQDLALKGLRVLLVEGNDPQGTA
SMYHGWVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCWPGLDIIPSCLAL
HRIETELMGKFDEGKLPTDPHLMLRLAIETVAHDYDVIVIDSAPNLGIGT
INVVCAADVLIVPTPAELFDYTSALQFFDMLRDLLKNVDLKGFEPDVRIL
LTKYSNSNGSQSPWMEEQIRDAWGSMVLKNVVRETDEVGKGQIRMRTVFE
QAIDQRSSTGAWRNALSIWEPVCNEIFDRLIKPRWEIR
>pO157p34 sopB, SopB
MKRAPVIPKHTLNTQPVEDTSLSTPAAPMVDSLIARVGVMARGNAITLPV
CGRDVKFTLEVLRGDSVEKTSRVWSGNERDQELLTEDALDDLIPSFLLTG
QQTPAFGRRVSGVIEIADGSRRRKAAALTESDYRVLVGELDDEQMAALSR
LGNDYRPTSAYERGQRYASRLQNEFAGNISALADAENISRKIITRCINTA
KLPKSVVALFSHPGELSARSGDALQKAFTDKEELLKQQASNLHEQKKAGV
IFEAEEVITLLTSVLKTSSASRTSLSSRHQFAPGATVLYKGDKMVLNLDR
SRVPTECIEKIEAILKELEKPAP
>pO157p49 ssb, single-strand binding protein
MAVRGINKVILVGRLGKDPEVRYIPNGGAVANLQVATSESWRDKQTGEMR
EQTEWHRVVLFGKLAEVAGEYLRKGAQVYIEGQLRTRSWEDNGITRYVTE
ILVKTTGTMQMLVRAAGAQTQPEEGQQFSGQPQPQPPEGDDYGFSDDIPF
>pO157p01 tagA, ToxR-regulated lipoprotein
MNTKMNERWRTPMKLKYLSCTILAPLAIGVFSATAADNNSAIYFNTSQPI
NDLQGSLAAEVKFAQSQILPAHPKEGDSQPHLTSLRKSLLLVRPVKADDK
TPVQVEARDDNNKILGTLTLYPPSSLPDTIYHLDGVPEGGIDFTPHNGTK
KIINTVAEVNKLSDASGSSIHSHLTNNALVEIHTANGRWVRDIYLPQGPD
LEGKMVRFVSSAGYSSTVFYGDRKVTLSVGNTLLFKYVNGQWFRSGELEN
NRITYAQHIWSAELPAHWIVPGLNLVIKQGNLSGRLNDIKIGAPGELLLH
TIDIGMLTTPRDRFDFAKDKEAHREYFQTIPVSRMIVNNYAPLHLKEVML
PTGELLTDMDPGNGGWHSGTMRQRIGKELVSHGIDNANYGLNSTAGLGEN
SHPYVVAQLAAHNSRGNYANGIQVHGGSGGGGIVTLDSTLGNEFSHEVGH
NYGLGHYVDGFKGSVHRSAENNNSTWGWDGDKKRFIPNFYPSQTNEKSCL
NNQCQEPFDGHKFGFDAMAGGSPFSAANRFTMYTPNSSAIIQRFFENKAV
FDSRSSTGFSKWNADTQEMEPYEHTIDRAEQITASVNELSESKMAELMAE
YAVVKVHMWNGNWTRNIYIPTASADNRGSILTINHEAGYNSYLFINGDEK
VVSQGYKKSFVSDGQFWKERDVVDTREARKPEQFGVPVTTLVGYYDPEGT
LSSYIYPAMYGAYGFTYSDDSQNLSDNDCQLQVDTKEGQLRFRLANHRAN
NTVMNKFHINVPTESQPTQATLVCNNKILDTKSLTPAPEGLTYTVNGQAL
PAKENEGCIVSVNSGKRYCLPVGQRSGYSLPDWIVGQEVYVDSGAKAKVL
LSDWDNLSYNRIGEFVGNVNPADMKKVKAWNGQYLDFSKPRSMRVVYK
>pO157p58 toxB, toxin B
MIHPGSSLDKAINNTRVKNVSTDVKHGQIQERKRNFIYKKNDDISSRFKL
YSSLVKQKNATEDVVLIGKMILDEVRSYRTIHNDRNIVSNSGNWKTSFLC
NLARLLYSIFNGSNYFCSREGENNSSPSSTLLTIHQPEKQELLQQKSIKH
LPTSNNIDGYIKIRKTRGAEDQTTTITQSLIINELLNGVDRNTIPFQKIS
ELNDIIHSYENMQIKNSRKGIEILVKQGELLSSLINVNKGNKQLSDNASK
IINLLGIEYQSHKVDIEPFIHAVWVAGAPPDNTFSYITAFLNTYKDYTYL
LWIDPNAFGAAKFSGILKNIAMNYAIMRLRRTNPHLAEEMNEVILKIQNI
QNETIEFKETRERLKELENRYKSLTSETKEKFNVFFLESMIGMQDNYFTY
CISNGISNTDDISRLDFLTNVLKLSPEVQNDFKSTVEKNKRDIDLLKNTI
SQKFGDRFQLRDINTLESFKKPQDYFFYQQEMLLRWNYAAASDQVRINIL
KEYGGIYTDTDILPAYSDKVSQIINEKSDDKRFFEDLKLRRIISESILSL
IKGEKYSIKHDGLDETTLNQLNNILSEIEKLTIDDYFKPVETKVVRDTFK
IFKRYQKWTENTWNIRGNNNFMLTHKGSKCIDFILSGQKKQYLELQRIRD
NISYNNLFYTTEDLKSLNNVAIGGIPAKKYLEHGLFSEYRQDGTIPYVVS
TLNISGPDMIMRQMKKYYKSLGRIGEVHIKDNKLSDVNFLGVYASSNKDN
KSFNWLNPVSVGINDITPDDESSWAVRNNDINKILFEKINCHVPEKLPTS
LYYEIDSRSFFQGWDNKSIKHVTEINKDLIKDINLLLTSSNIDVKLLIKL
DRELYAISSKIDNPLALRSIRTLQLQLANYVTSNTFEPENTINFIYDFYR
KKQDDLLSAIKLFSRNDADTKIIVWYNSVMEKNVFLREVISCVLRSKKVD
SYINENKKNLSKEDAGALRDYAKLKMKELFSMLDDDGYKKIITTNAYIKE
RDKLSGIIYNIENSIISGHESFDIIRSNQHEWGDLSTVEQFKKFEFYVKS
ELSSAKSIFDDIKNKYITDPETKRNVLYHQLDSDIKERIAFLDISHYAYP
GSLLEKLQLSGYVFSDINIIAEYLLASYGVSGHYSHGVVYPAPSDKLLEL
LRRHTKSNSEWIEKITPYVYDILSDNVSNVLRPPLSEEQKKILNDIKLEI
SKSVSEQYFMKLTEQKSSVIGIKYSVDFDRYNENLFLSLPINQNLTLPFM
YRYFEMLYDIHIGIIENKANREFIYSKFSSLNLDFLINDERVLNLEGLIK
KYKYLSLSEIHRTLTNSTSFADISIPLLQTICPSITTIIKKTEYYGHQLT
NAMTVASVVKPYDFSNLGAINSIDKSVSDVPALHTIVEQAKYNLLSWNDF
YNTHASIWDTIARQHKSTNIEFHPQSLLFDRDSKGKCLGLSLLYLDTGGY
GGGYQKLRHNIDTASTLYQTKYNDNLKLSNRDDFFLRKTQRIITMSNELG
NNRLKNAQLEVLELKDPILTEGILYQRRISSLLITTEYHSLALQQISSFW
RVTDPNFGHCDFHSLAQALTFIKNITSNRNFSSLYGSGIVKIYFSESLNN
WKYIKLPLVQTGSLLRDIYLTTPEKLSTSGGSLNIMGHLVPVSFIYDIGG
VINGNRISESTDVKNKIRSLKINGDILQHYINTHYLSEEQTQKIKDIVDF
LGIQDNTIKVKLESDIKPISEIQQPLHSILSRQKEHVKNLLSGLLDEFSN
KLRKQGLSLKTNVLSVNNFKESKINSDTVEVTVTDLQGRLYRVDIDTRVI
GLTFKEGINSLSEALEHMNIDAIMSVIGLVQYARMIKMNDNISAIDHAGA
VSDIKNIVDKFLGGILTLTNNRVYNPGGVSGASLEGFTSSGLEVCASRMG
GTAGRYLSNVAKVIKLPLLDIGINIWSLYDSSLNHAKATTQIEYISTAID
VSFSSINTALSIGAIAYPPLAIAIVPITIFSHEVKNYAVYVNQINERHKL
WLEAEKYLDNGSAKVLSINKATGIIDLSNNQVLGNIYLDMRENPPILHGE
KSYNSGKNIGSHPDMTDREIMESRAYNFACTKLSDAGEPDIFGWGDKEIC
NSMELSESQLANGYSNRQWPSQIPVIPEGIYNTVYLGYGETLRANTEVTL
SSTGYFYEIARAYTDDELSEPLLTVCNQHSHVIGGKEPLTIIIPAIEHGM
LGSNLHMIERFKNYNFSISGGKGGIKLMVGGIGDYNIECTPGVRNIISFE
QLSRDFNLDLDLSDGRKQNFNFHHPSGFYSGKVMSITQKGINAVVGTKLG
YDKIRGNNLDNTFSLGSGGGVIYSGGGSNTYFVPATLQDNLHIYISEKSN
GNHIILGDMHSRLSIECHFSNNEREFIKMGYYNGCDVILESDTIQKIKSF
AKNITIQTADGVMANWDDKLNTLSVYSIDMIAWRDKNKTAKEPLPVDVIQ
LTNWRMHNTCSLFYDKYQVDIEANKLTYTVLFPDTELPVQLNYTSIIYGN
HGAKYTFLNSGSKTIDIHILDKNSDYDTFDFRNIIFEHYTNEIFISFDNQ
GGFVISILNNATSEAANINVFRKNMTSLDSSGSLIYLPSGDIYHISDIYK
MSRGRKSFKLNVEKKPDIDDIINVAILETSYLQIKKIPNNDDSDYILCLD
NPNLSSYTLNFNDLSGYISSLWDNIRGSFTPFHKNTVNIAPNEKKYISLI
GLDKLSFNIDVFRQALEVKNKNSYKISKFTWETYGDIVVSPEDRISHLEL
DGFNYFSQPELDTPISDSFSYLYDNFQIVDSDVHIKLLHLNRETKQITPH
RIILKRYFIDSFAKTSITDREKNIYPVICDSPDHFTSDIYRHPFRIVLGN
KTLYPSEELVKFISTSKEYLSNMDVINNVIVPQKTTKKNKLSIVSLNSNI
KNDIVLSGVMTGTSKIFHLNNSGDLLLTTSKTHGGGVVVIFKDFINNWWK
YNLTLITVPIDNKLSDNRINITPMGIKIQETVSGNDRLFFYPTPLKNGCF
ILHNPLYSNSFPSLYSHELFDLIEAYRNPTYSYLQNNIINRYIHTVSEYA
GKDGIANMSLSIYAYSTCRFWSKESVAVGDTIKLSNYEHIEFSFNSFSKD
YFREQNSDIYKIFFKIASSSNSVVVDKNTLTQKNFLENNTFFSITGIDES
LYEKHIIFITLKVIVPSKK
>pO157p62 traX, TraX
MTTDNTNTTRNDSLAARTDTWLQSLLVWSPGQRDIIKTVALVLMVLDHAN
RILHLDQTWMFLAGRGAFPLFALVWGLNLSRHAHIRQSAINRLWGWGIIA
QFAYYLAGFPWYEGNILFAFAVAAQVLTWCETRSGWRTAAAILLMALWGP
LSGTSYGIAGLLMLAVSHRLYRAEDRTERLVLVACLLAVIPALNLASSDA
AAVAGLVMTVLTVGLVSCAGKSLPRFWPGDFFPTFYGCHLAVLGILAL