TitleGenColors Logo

Gene list

Applied filters:

COG category: Unclassified
Gene type: CDS
Genomic element: chromosome

Number of genes found: 746

Free access
Sort by:

 



# Salmonella enterica subsp. enterica serovar Typhi Ty2, Ty2

>t1377 putative lipoprotein
MWHLVKKLPWRGILLAILINAFLVGLYAMGYRSGHDSAKRDGDTALSQLQ
SAFDAYKTEQATLENAALRAWARRYQEQVAAGQRAEAGYLEQIAQLESRN
KQLQGQINDVTQRWIDEKGKSHPIECVFTRGFVRQYNAALGYDNASVDTG
HSDSVAAAGTRSGTATGQPETTDTRLRDSGVSQRDVLANIIDNAGQCRRW
RNQINALLDEREGLQK
>t1024 hypothetical protein
MPPLKKIVLRLFVGAMVATVTTPALALVCLEDHSAKECAISCAEVMWFMF
PICF
>t3516 hypothetical protein
MPPLFTINACKSAGCRNLGLPDSPDYVWPDYRLGYPALHCRACGSYPPLF
NEGEFRRWASAYIAQYAKEHGHFCPDCYQRTWIRYGRNPGGTQRLQCQYC
KKVWTPKQHALNVAETPEQICSIPLLVPFQGANAFQQLYFLFSFDAVRGN
ILHLSSNFTLLSAGKSLHYHWKGIAPPEGEKGDIIHRIAIKERQFLQRSQ
FDEIQYGPAALKRNAQGTILRPVITAHGHFRVLKNRFPDVATHIIAHECF
LRGAVITAWAERFRQRLSSLWFVEEEINDDDCRAEWQLLGKTWQGWWQNQ
WQLWGQGHNRKMVCSLTGSHLEQGVAVNLAASRRFVTWLWQQPEFQQSAH
CSAKRVTQILYLLTEKYNSQWNHI
>t2330 hypothetical protein
MKRIHPLMASRQSADYRQPWQMSGVWRRSINFVAKSGELLTLHRRGSGFS
PGGWILRGGDFDALRDVLTDGEKPQLTAAGIQIGEFLLCEPERRCSLRVP
RYLASPRLNATYMQRSEETGLFGPLTVAAAQPLCPELRQFCHCFQSALAG
AATDWRLWLGKGPGLTPSHEDTLTGMLLAAWYFGAIDERAGRNFFAQSGS
LDRATTLVSVSYLRYAAMGYFASPLLHFIHALRRDARTETAIDGLLALGH
TSGADTLLGFWLGQQIVKG
>t4130 hypothetical protein
MMKRTISALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQP
RLATSWWPAAVIGEEQATVTARQQQQELLGRLAALSAEEDGDAAAINTLR
RQIQAVKVTGRQFVNLDPDVVRVSERGNPPLQGHYTLWVGPQPTQVTLFG
LISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAWVVYPDGRSQKAPVAY
WNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE
>t1897 hypothetical prophage protein
MFTVKTIINGVTHICEQPSISIARAGSETFADTLKLTHNSACPDFAYWLP
AIYEDPEMTKALQEEELVISDRTDVLDTDAIAIIIEEYPSENFPGAGDGC
RYQFIYPGDQVYVMNSHGSTIETVK
>t0535 hypothetical protein
MPICTRLTAQIVKVAVNQPVAIRSSMWAKNNQNIHFMNLHVKKLEKLLLF
>t0723 hypothetical protein
MCRFVNQGMAVALFFNEVSSEVELTIADDTQY
>t1067 hypothetical protein
MSERPDLVDPLPEDEPLPGENIPDTPEGDDPLDPDTQNEVEDPRR
>t2770 hypothetical protein found within S. typhi pathogenicity island 1
MIPGTIPTSYLVPTADTEVTGVVSLSARAAMLNNVDSAPLSNGGDVDLYD
AFYQRLLALPESASSETLKDSIYQEMNAFKDPKSGDSAFVSFEQQTAMLQ
NMLAKVEPGTHLYEALNGVLVGTMNAQSQMTSWMQEIILSGGENKEAIDW
>t1205 hypothetical protein
MASGDITRYVITVTFHEDSLTEINELNNHLTRSGFLLTLTDDEGNVHELG
TNTFGFVSAQSADEIKALVAGLAKSALDKDVDITVATWEAWSKNAQ
>t2004 probable lipoprotein
MRYSKLSLLIPCALLLSACTTVTPAYKDNGPRIGACVEGGPDSVAQQFYD
YRIQHHSNDIAALRPYLSDNLAKLLTDASRDTEHRQLMSGDMFSSRATLP
DSANVASASTIPNRDARNIPLRVALKQGDQSWQDEVLMIHEGSCWVIDDV
RYLGGDVHAPAGTLRQSIENR
>t3147 hypothetical protein
MLKPRITARQLIWISAFLLMLTILMMTWSTLRQQESTLAIRAVNQGASMP
DGFSVLHHLDANGIHFKSITPKNDMLLITFDSPAQSAAAKTVLDQTLPHG
YVVAQQDDDNETVQWLSRLRESSHRFG
>t3459 possible exported protein
MKRNNKSAIALIALSLLALSSGAAFAGHHWGNNDGMWQQGGSPLTTEQQA
TAQKIYDDYYTQTSALRQQLISKRYEYNALLTASSPDTAKINAVAKEMES
LGQKLDEQRVKRDVAMAQAGIPRGAGMGYGGCGGYGGGYHRGGGHMGMGN
W
>t0575 hypothetical protein
MPDGTYALRVRFSANRYSLAIRQEVCAMMALNMLRRWLNGEDITSEHGWI
DVVESLTA
>t4182 hypothetical protein
MKFFKKVIRVRKSEKRGSFVQVTNSKVVFYIHLQLIKDNILSVVICYT
>t2149 hypothetical protein
MPWRDHVNPDGEVVSCCDNVIITGYHRLFVHPLWIFIMAFDVHSRQSEAG
>t2945 endonuclease fragment
MHKKHMCPWLLPGLLGLARCAPVPHTYAAIIEAGFYPEGTDLQLVLKIIE
TARQEIRLMDYSFTSWEVVRFEHSVSTKFSMSK
>t0740 hypothetical protein
MTETSSHRYKPRNIINAPNVKSSIFSRSQQRGDSENIQRWLSNHFYRWII
GDFPHVYPVRSVADYAVYFSPDAEIPAWVAMSAFTTLTFSTRNLWQWKEI
>t4569 hypothetical protein
MKKSTTSTPHDAVFKTFLRHPETARDFMEIHLPVSLRQLCDLRTLKLEPG
SFIEKNLRAYYSDVLWSLKTREGDGYIYVVIEHQSTPDAHMAFRLMRYAM
AAIQQHLDAGHKHLPLVVPMLFYHGVDSPYPFSLCWLDEFANPEVARRLY
AAAFPLVDITVVSDDDIMQHRRIALLELIQKHIRQRDLLGLVERIASLLV
TGCANDRQLKALFNYLMIQHGHTPRFTTFIRDVVGHVPHTKERLMTLIER
IRAADRRKGERQGRQLGLEEGLAEGLEKGLEKGQHVAALRIARQMLADGL
DRETVQRFTGLTAEELQDVSH
>t4267 putative lipoprotein
MTDNQKITSRTVKITPVLFFIAGMFSVALGGCSSSKDELLPPGDSSMMEL
WQGNAPTSHAVVKGRETLRRSLSSQETLASGRIDESYSRTQENELSQIFP
RLPNPDMVMYIFPHLANGNTPVPGYSTVFPFYAQTQYALPGERTEAL
>t1278 putative pathogenicity island protein
MLQLYRYFWQPARYAVPEWLDKLGFHLSNCWRYGDRPKLDRLLDRALNRL
RGSSVIPACLNDRQKRQIRLAPRISAFAFGLGLFKLRCSDYFMLPEYRQL
LLQWFSEDEIWQLYGWLGQRDGKLLPPQVMQQTALQIGTAILNREAHDDA
VLHALLVLLPPPQRILWPKTSLTEIIFMEHLL
>t1789 putative secreted protein
MKINRYLLGMVSFIAFSSYLQAATLDYRHEYADRTRINKDRIAIIEKLPN
GIGFYVDASVKSGGVDGEQDKHLSDLVANAIELGVSYNYKVTDHFVLQPG
FIFESGPDTSIYKPYLRAQYNFDSGVYMAGRYRYDYARKTANYNDDEKTN
RFDTYIGYVFDELKLEYKFTWMDSDQIKFDNKKTNYEHNVALAWKLNKSF
TPYVEVGNVAVRNNTDERQTRYRVGLQYHF
>t3718 hypothetical protein
MVKILGGVVFKPLIASLILTSAVVYAKPMPLTAARYAQQLGVGMDVDWAR
TERGIREFDPLVVRDFKAKGLTHVRIRVAGAPTEARLIHLRKLVEACEYY
GVIPIIAYQADAYKTDPSASHEKELINWWSVVARYFGQTSPLLGFDLIYE
PADKLNHNMASLNRVYDKTIRLIHAIDPQRMIFVAPRMRAAPEDLSALKL
PAQSQNYVLAEWHIFPWGPLKSGGKYPWTSGTTAEKAAIRARINAAVRWQ
HKTGHASWVGGWAPGESIKMTPIASQFAFARFMACELKKAHIPYAINADT
QFYDGEEGAWRPAQEPLLNAMIAPECETPGKKPGEGNVKPSVPDAISVTP
AAASTPRSAIP
>t0938 hypothetical protein
MVKRRNLLHFNNGARSERDLGDLVTKVFEKAAKKEPQPLYTFSLPLLSVQ
DEIRVYCKKKISK
>t4506 hypothetical protein
MSSFCLPQGRLYSGQNIMKMTFPARQQLDAGRMTTTNFADGDSVKDKNAN
L
>t3440 putative phage tail protein
MSDKKTEKTIQLDTPIKRGKTEITEIVLRKPQSGALRGTRLQAIMDMDVN
AMMTVIPRISSPALTAQEIAEMDPADLTAMSVEVVTFLLKKSVLAGLPTA
>t2652 hypothetical protein
MVWRVVCRAGMILFAIACYATESMVAQAGQPPGWPVSDNAGILTPVWAIA
IERENSGDSVIYAVIGGCLMATTLTPSHPEFVFVFAAVRRADRHPRICML
RTVAGDERSARRSLVRDYVLSLAARLPVVEVSRA
>t1015 hypothetical protein
MFMFLPFLLALSVAMGAINRKDKVSYILWAVLLIVTILSFIHHMTNSLTL
SF
>t0733 hypothetical protein
MVSSTYGEENYKNIHFKNATINIPARWVANKKDDCLLISKNHINVFSYLY
VCTDAATNKNSFFTKNDDGEWEAVTDGVPVLADVNITPKFIGMSAIVSCR
YKDDAGYHIDQCFQAVIVLSTNIMFVFIGRGDSSLFNNYKEIYRSFKVK
>t3969 hypothetical protein
MKRLLLITALLPFAVLAQPINTMNNPNQPGYQIPSQQRMQTQMQTQARSQ
QQNLQSQLNANTQRVQQGQPGNGMLGQQTLPNTQGGMLSGSGNPDRMLNH
SQPMLQQDSGTPQPDIPLKTISP
>t2553 hypothetical protein
MMPTDGNDNTRKQDGAVFGQPEVKPMPETLQCTEKWR
>t3913 hypothetical protein
MTNSASQATRAPFEHSLGIIRQASIEILLLLGIHTTEGKEPRWFMEQLEQ
ARLNLGGWGAVAKKLRINDAQLSQFMLQLRHLQQHVPQYDSGQEVSENQL
LAALRFVTSLEHLRQQQPLLTYQTELEDPDQEAHLEAQRQLRAIELTLKA
LIARAWPDRASLNHYLKQHFGPDRLRQWLKQGEDQHALEGMLFSELALMV
VDKKLFARHYVRIFNDASALTLFAESRTTLRMFLDDCRLARNEVIARQPL
TSAQLMLLNVQYQQIVRPIQRAYAEKRTRVNPASFLLADERELRQFWETA
RLKDRQAGGDKHEISESIEPPRKRPPRTPEEREQLISGALWGGVGVMTLA
ILAGAFWLFSSSSPSSDNGQAPAMAQDEPPREAPSARETLNHMGITWDAF
TMRAAIERNDTRVTALFLQGGMNWQLAWTEQAFAAGHTEVLQLLLRYPAL
MDEVKPCRRFITTLSHDMSSGAPLTAMHKTYLQTFCTVPAVVTRQQYDTE
QAQLRAQARPSADNKKWLKIQSAIYDAIH
>t0491 putative bacteriophage protein
MAKEWFTVKECLGLPGFPGSEPAVRERLYKYSEGKAGVRRKRVKSKAEEF
HISVFPLYVHRYLDDSGEEPTPEPISLQEAEPEDIWEMMFRLLTPEQRKQ
VTGRFKVRGMKAVFLFLFDDTPPR
>t3891 hypothetical protein
MPLQQGGYMTLTQLGVAFWHDLAAPIIAGIIASVIVNWLRDRK
>t1861 hypothetical protein
MMRIKPDDNWRWYYDEEHDRMMLDLANGMLFRSRFSRKMLTPDAFCPTGF
CVDDAALYFSFEEKCRDFELTKEQRAELVLNALVAIRYLKPQMPKSWHFV
AHGEMWTPGTGDAASVWLSDTAEQVNLLVVEPGENAALCLLAQPGVVIAG
RTMQLGDAIKIMNDRLKPQVHCHSFSLEQAV
>t2216 2-keto-3-deoxygluconate permease
MKIKKTLERFPGGMMVVPLIIGALFKTFAPEALEIGGFVISISHGAMAIL
GMFLVCMGADIQFKAAPKALKKGAAITFAKFASGVIIGILVGKFCGPDGL
LGLSALAIISAMTNSNSGLYAALVGEYGDETDGGAIAVISLNDGPFFTML
ALGSAGMVSIPFMNLVAVIIPIIIGMILGNLDEDMRKFLKQGSVVTIPFF
AFGLGYSIDFARLITAGSSGILLGLMTVAIGGFFNIFADRVTGGSGVAGA
AVSTTSGNAVATPAAIALLDPHFTDLASTAAAQVAASTIITALCAPFLTV
WIKKRYDRKLNPAAAGG
>t1479 hypothetical protein
MFTYHSANTSAAQPALVNAIEQGLRAELGVVTEDDILMELTKWVEASDND
ILSDIYQQTINYVVSGQHPTL
>t4294 putative positive regulator of late gene transcription
MMNCPKCGHSAHTRSSFQVTDSTKERYCQCQNINCGSTFVTHETVVRFIV
TPALVNNAPPHPTVSGQGHMNF
>t2590 hypothetical protein
MPKELSGAQWVYGVTKFVGGNTDKPHWSIDGH
>t2746 hypothetical protein
MKIIVWSGLLFLCLAARALATGVVGYLPMSDGEYAQKRALKPLLILPYSV
SPDQTWHFRQVGVSSVTLLPEPKKDNEWRISGKDRAGNSWVVPVGRLINL
AGNAQFYRADLDRNGIQDLVIWLGNPGLGLAPSAQYIIFTFLKNGRPCVF
EPWGFYTATDTGVDDLLDLQGNGRTQLLDMQFDSGYWITNLYQVKDARWQ
RVHGWFGRLSYPALTRFNHYPGRKLIIKPIAGRNPQTDDLSLTQRCLIRG
NVLPGVNQD
>t0867 hypothetical protein
MTAKEETKGVSVRAAAVNMNPLKRKENYLSATIIAQKEGYWYSHLLSSLQ
LVI
>t1108 putative pertussis-like toxin subunit
MKKLIFLTLSIVSFNNYAVDFVYRVDSTPPDVIFRDGFSLLGYNRNFQQF
ISGRSCSGGSSDSRYIATTSSVNQTYAIARAYYSRSTFKGNLYRYQIRAD
NNFYSLLPSITYLETQGGHFNAYEKTMMRLQREYVSTLSILPENIQKAVA
LVYDSATGLVKDGVSTMNASYLGLSTTSNPGVIPFLPEPQTYTQQRIDAF
GPLISSCFSIGSVCHSHRGQRADVYNMSFYDARPVIELILSK
>t4333 hypothetical protein
MAIEGDAATVPLSAGLRLNGLNHIAELRAKVFGLNIDSELDRFISDMRDQ
RDINHEQNKRALAAIFFMAKIPAERHSVNVSELTTDEKRELIKAMNHFRT
VVSLFPNRLAMPN
>t3157 hypothetical protein
MSKKSAKKRQPVVKPAVQEAMSAAVPLGYEEMLTELEAIVADAEARLAEE
EAAA
>t4325 hypothetical protein
MAENGPIEDLAKRISEDLLSRFKWQQHGPCDRDFLCDDEAKHKPEGKKQK
HTHPVDVVFSYKDPYLNKVIYLNTDLKSYKAGSINASKIESALASLAKTI
ECARYSPEWSEKYNFSQIDCEVRGLLFVFNHDNQLQHDFYEFFNPPKPAK
GRRDKAVNLEKIPLSAGQQIHIIDPFLINYMLAITNDMNDLIAKKEFPDE
EYGFYYPQLTFHKVAVTEKYLPATIEVLSSPFMVIKHGAVYKFNRAKGIE
EEVYPEGFVVYYNKKGNSDNEFFYLLDILSNYQILDGINKIRIRLAYREK
DERILSHFQRGVEKYAHEYGLDEEAKKRLEDLDVKVVSTVKEFFSAEVIS
WEPK
>t0352 hypothetical protein
MRFFFILMVLPGADRRVDETSKEEEKTDKQYDTGHASVKSMSFSHTVGLT
YKGFLNVGPAVWAGSGWVLQRYYQGEERNKPAFTLH
>t4276 hypothetical protein
MTTSSYLEYILTILGWVVNNGLWDVLTGTGLFALPLGFKIIGIWIKTREE
GDDEGNKGILSARRMEHAIYGAFIVIIACCVPTQTVSLTTLTFDTTRAKQ
CGFWTPVKPGDSGYGSVVSTMSGMTASAPVWWAFMHMISKGVTQAAVASL
PCRPDLRQVRFDVQHTQITNPALAQELQDFTSDCYAQAFALWKRQDAGRT
TDIDVLRDIEWLGSKIFLKDFYPQLHSKLPRSAFPWSQSRDDGYANTGQG
GYPTCSEWWSQNKTGLKSRVLDTVNTTTMTRMAAAFKGMVSKEEYTEALI
RRLVSPTSLSVSQNGRTYAGYGGNADFTLDNAVNRMASVLGTSVGALGAF
PAFDAMRQALPMVQALLLMAMYVLIPLVLVFGTYEYKTVVTLTFVTFGMH
FLTFWWELARWLDSWLLEIMYGSDSHSRWNVAGFQNSADDLIMNFVMGTM
FLVLPAVWMGALSWAGVRIGGIVDTNLNKGVQPAGQHGGKVGDKIGSGIA
K
>t3415 hypothetical protein
MTNTNSVYVAWQAPDTRDWHVVGNLQERKSGYVFRYTKGALKSTKFTKFS
GMSDVRETYVSEELFPLFKNRLLSPRRPEYPSFIKWLGFEEDKVNPIDIL
ARSGGLRSTDQLQIFKKIDVDSDGNFEHFFFLHGLGYLNSLANARVSELK
PGQILRLCLDLQNEYDGDAVVVRADKPAEIVGYCPRYLSNDIKKMLLDDP
KSITLTVEKISDDAPHNYRLLCKLSGVLSQACQSTLIPQDEFEPIE
>t0375 hypothetical protein
MDWLAKYWWILVLVFLVGVLLNVIKDLKRIDHKKFLANKPELPPHRDFND
KWDDEDDWPKKDQPKK
>t3434 hypothetical protein
MKPVFDENGLATVPGDMRCYYYDAVTSEYTGWSDEYINTGVSMPACSTGI
DPGEYIPGRVAVFTGKGWSHEEDHRNETVYSTENGAAVTVDYIGAIKDGY
VTLSPLTPYDKWDGEKWVTDTEAQHSAAVEAAEAQRQSLIDAAMASISLI
QLKLQAGRKLTQPENTRLNAVLDYIDAVTATDTSTAPDVIWPELPEA
>t3407 hypothetical protein
MRNIETLSTKTGPDDAGLNILLTEARLEERRARAEAMVARLDSLACHITS
RQLTHVEAAELLRVTAEAIQNEAQEIH
>t1120 hypothetical protein
MIAYHNPLSVLMLVRYYVQRHYDEESMTKREKTGRYVLLVKKPCFVTGGA
EGEPSA
>t0727 hypothetical protein
MKYIIFAFIFLIPLRCMSEKLVFLDEVQTKTMNVAFSHFREHTNWGLFNT
MIQDDDENIKIAFYCKSHLEEARGGGMEQIIYIISKKDFKIEKINKSYSK
>t4131 putative lipoprotein
MKKTHLLSVLALGISAACHAETYPAPVGPSQSDFGGVGLLQTPTARMARE
GEMSLNYRDNDQYRYYSASVQLFPWLETTLRYTDVRTKKYSSVESFSGDQ
TYKDKAFDVKLRLWEESYWMPQVAVGARDIGGTGLFDAEYIVASKAWGPF
DFSLGLGWGYLGTSGNVSNPFCSYSDKFCSRDNSYKEAGSVDGSDMFHGP
ASLFGGVEYQTPWQPLRLKLEYEGNNYQQDFAGKLEQKSKFNVGAIYCVT
DWADVNLSYERGNTFMFGVTLRTNFNDLRPAYHDNSRPQYRPQPQDAILQ
HSVVANQLTLLKYNAGLADPKIQVKGDTLYVTGEQVKYRDSREGIVRANR
IVMNDLPEGIRTIRVTENRLNLPQVTTETDVASLKRHLEGEPLGHETPLA
QKRVEPIVPESTEQGWYIDKSRIDFHLDPVLNQSVGGPENFYMYQLGVMG
TADLWVTDHLLTTGSVFANIANNYDKFNYTNPPKDSHLPRVRTHVREYVQ
NDVYVNNLQANYFQYFGNGFYGQVYGGYLETMFGGAGAEVLYRPVDSNWA
FGLDANYVKQRDWRSAQDMMKFTDYSVKTGHLTAYWTPSFAQDVLVKASV
GQYLAGDKGGTLEIAKRFDSGVVVGGYATITDASPDEYGEGDFTKGVYVS
VPLDLFSSGPTRSRAAIGWTPLTRDGGQQLGRKFGLYDMTSDRSVNFR
>t3355 hypothetical protein
MFFVITHHIDVLPLATLPTHETCPYCNNQDVWLIIRQKRTRTCGLSQARK
RDKFGLAICNHCSNEIKEKRWSPALRQLFTEQKSLFNLTFWQRYGFWVGW
LSVWPALFLATWLYFTFRGW
>t3581 putative lipoprotein
MKKDLLSSIIIAMLMTAGLSACDEKKADEQPVAQSADSSASNTQSTSAES
ADANDVLNQKLNVYIDCYNNLQADIYRAVNRYANTFDDFRTGPTGKEDDP
SPLVPVYPAFIQDCRKDIKAAAELKPAFASLDSAALAFINAAGPLAETIN
SMNKYYDQDNFKDDAFAGAKAFHKTFIKQFDEFDPIAKKYIAEITIMSKQ
HAANEIKATEKKEGKSIKYYTLLTMQEAETLNDAVADASFDVAAVSKQLA
DFEEHTQKLNEKINVDIDKHRSFPGFISELEKFQGKVKKRIRRVRDNVAY
TSHEQDYLNSGSGDMVDGSYKAVVKAYNELIDTYNGYHLEREF
>t4232 hypothetical protein
MRDASITTERTMQTPAQVMEEPGSSVHRLPADDIIDYTLSKMQARLDGTP
GSSGQERNGLLFTGNIHDAYPRALILDMRLSPLDKMCWIMIRQYALQNDG
AIFPSYDELQKLLSSPGSGQASRDTVSRALTMLRLTGWLSLCKRVRDTGG
RVRGNIYAQHDEPVSAKDAEMFDPGWLDMVGKACQHKYREVSDTAFHVLN
GILDDPMMRHRHSRMTEIAERLTRPSKVGDVARTHQNPGTGLSLNSIKNE
HKSLSPVNRPSSERGEKSPSPDSRLSLKSDSYDRVRQPDCNNVRSFTQSV
IKKTYVSPQSGKNAVAGIPEGFVWPDGLQSILPESEQPMLAQQLNQLAQK
SPVQAEQIALSVVNGWHQKRISNPVGYLLTTLRQARAGLYRLEPAVTQPV
KSRSVVPAEPSAAGKIPTCSAEADVPAGQEVVKAMVEQIRQRMNSSQ
>t0870 hypothetical protein
MPNGAVKLLKSKEPQNRLFVMLPSKKPLKSTKHSANLILKSSLMSEALRL
HILLNQKKLFYQKRDPALLQMKNKDLYW
>t1393 hypothetical protein
MTDQDKHIEKLKKLLALAASGNPHEAALALRRARKLMDVHGITHSDIAMS
DIDETISHYWPTGSLRPPRYMLGLMNIIREAFGVNSIIHPGTYPGVGFYG
NRERAALAAYTWEVLARQLKKARQQYISAQNKRIKTATRTSRGDQFAEGW
VLAVISEIQSFALTDDERELMQQWLEHKYPQTQTTRARKPGRSRNGDASR
YAGFREGQNVRLHRPVSGREQQKLEAR
>t4546 hypothetical protein
MVPAFFLPACLQRSDMELNNEYQNYEDEDDESADDLPPSHHPRSFQSIIG
WLYDKAVSGIAGIESAEMLAARYLEDAKG
>t3091 hypothetical protein
MQDASCYQLASVLTLAAEHRDTPENLLRYLAKIRGISPFLTAAESCPEAA
IPNAEFLYTPFATALRQHNVPIVRFFSQQLVGETSSARENRNIVARKENP
LLTLYKSNYISQYREQYRLEISQLLLNIMPELLNDTVYIYPIIQRNTELV
AYFWQKHPPTIPLRRLEAMVLLAKTESLISEVTHNPEILITPPIERWDRE
NLLTFILSNGDLVMIQSLIDANVVDWKRAMEDGNNEPLHQAILRLRGGAL
ENALLIQIIKAMQAQKALSNEQIAHYLPWTPTFPAAFLQAGLSCEQLREV
LNALVVGSEQVLHDTRQRLNALCPVAK
>t1468 hypothetical protein
MMENAFSEKYKYAFRNLELKELRRFILKELISSPELDNQLTNLIVLLNKF
WDTHAAALITEVRASKSYRLLLPIQRFGYNISNHLRSLGIYFDSIIMVDP
LHFPSLSSLHSFLSLPPDNSYVRMRRLVLLEHVSNLFRAIPFMTIDDDYP
IFLIVPELLDFDKERNHEQSAAFLSQLFFKDRKMDYATYLAFLEHFGRNE
ETFQKILINKELLKTLLDNFNNIEKEVWVYDTDLKSFTTREVDLLDYDLP
SAIGTLLGKVEGAIYAQRSTQLSATLLGIDPVIYSNHMFLHEWTTEQLVK
DYGKLNPLSSEEQAITMGLSAAEVKFLTALSDLELKKIRERGQLESLRHE
LRISRHDLQGKRVQDKRVQINRIRAASRQGRDQTYGPEVF
>t4323 hypothetical protein
MSQLRLKWMRLKIRTSPEAVFDFIKNTPYSDAIGAGFTKYETIHNGMTAT
FNKKSVVLEPIADPFGEVLEFERVVFDQISFSIQTLSNKICLLTFYNPPK
TVKPFIDFLSQAEGLNVAYGNLTVDLKAFMRVIRENFGVKVFGISKVKVS
NLPVTEKTRACLELNSSGDALHDLKIFVGDSEFKLDKIKAGGFYLDSKIS
FELTSGASAVVPDEHFSIFNDAISCMEIDKF
>t3045 hypothetical protein
MEPGLGEVLWANGLLKLFFTVYQDHHKTNPYRIIYVICLVWVRFLLLSDN
CWVSFFLIGYQQGRLSKFDIQQESV
>t2074 hypothetical protein
MKWQQRVRVATGLGCWQIMLHLLVVAMLVIGWMSGTLVRVGLGLCVLYGV
TVLLMLALQRHHEQRWRDVADVLEELTTTWYFGTALIVLWLLSRVLQNNV
LLALAGLAILAGPAVVSLLAKDKKLHHFASKHRIRR
>t1868 putative bacteriophage tail protein
MAKNDFKPFATGKGANVTSQPDWEALPALLSGFTAGKASSAQVNKALRQA
SFIAAALAQYTASKSGKDVLDDGDLSGFIAKMSAAFGKDFQTLDATLTAL
AGLATGADKLPYFTGNDTAGQTDLTSVGRDIIGKASIADILTYLGLGETI
NLAKNAVPATRRVNSKPLTGDITLWASDVGAISADAVGEITDNGTMASAN
APGWWKVAVSNSDTVVDFPTYPGGSKLYSYGYLFVEKIGDVWFQHYYAHI
GANAKRQDWGTVPNTSRPWVIDYNTANKPSASDVGALPITGGRLNGPLSI
GTDNALGGNSIVLGDNDTGFKQNGDGILDTFANSQHTVRVAPGEMQVLGA
IRAGDAKRMTMTSSNNSVLNAQFHLWGDGNRPTVIELDDDQGWHLYSQRN
TDGSIQFVVNGQVIPDNYGNFDARYLTSGNVYTKGESDNRYVQNIQRGAP
VWPGKVDEYGPAEAPAGCFLTQARHDPTTAYGVTFAYRPLQMWVGNGWRT
ING
>t2809 hypothetical protein
MKTTLSQPFIINKLSINVKSALSRSGKIVFEANPAQKLYIVFDDHRQAPA
GFGVKASLTKKTYVIQRRVASSDRNVSEGRKPSSVLKVKVENVFDFPNID
ETRQSAGN
>t2145 hypothetical protein
MHQSTVTSLLFGSPLCERGDDLGTEQWAFAGNGLINFELAANQERSNPSN
VTCLSLITVLKEWVNIIVLLHNLLTGERVTP
>t3118 hypothetical protein
MTLFVEYNSPYLFAIAFVFFIGVLEMISLIFGHFLSGALDAHLDHYDALS
SGPAGQALHYLNIGRVPALVVLCLLAGYFGLFGILIQHGGIMLWQAPLSN
LLLVPLSIVLSVFAVHYSEKILAPWLPRDESSALREEEFIGGMAIITGHA
AVAGTPCEGKFTDKFGQIHYLLLEPEKGKEFKKGDKVLIVCRLSATRYLA
ERTFYV
>t4148 hypothetical protein
MNKNVKLSLIAIAVSLFMTKQAGAANTWTEARNDAMGGTGVASANYGSGV
LLNPALLAKAKPEDNITVVLPAVGVQITDKDNLQDEIDDISDKVDYYDEV
VDNLTLGQILLNPRGVLNQFQGAARDLADELEYLNGKTARANAGAGLAVS
IPGQTLSVAFIAKGYAHGRVSSSIDQNDIQYLRDIQHDERVALREAGRAA
LLGSDEITKHLNSTALGRVAIVSDYGIALAKQFVVGEVPVSIGVTPKLQK
TWLYNYTTSIYNYDSSDWNSSRYRNDDTGFNIDAGLAADIGENWTLGLSG
QNLVSRDIDTKDIYITNGMTGETTNYKETYQIRPLVTAGIAWHNDLLTVS
ADGDLTETKGFKSEDNSQYVGVGAEVRPLSWLAVRAGYRADVKNNDSNVV
TGGLGFAPFNRAHLDLMGLYGEDETWGAGAQLTMTF
>t3752 hypothetical protein
MKKSPTSTPHDAVFKTFLRHPDTARDFLNIHLPHLLRIRCDLTTLKLAPD
SFIEKNLRAFYSDVLWSLKTCEGDGYIYVVIEHQSTPDAHMAFRLMCYAT
AAMQRHLDAGHKTLPLVIPMLFYHGAKSPYPFSLCWLDEFGDPALARQLY
ATAFPLVDITVVPDNEIMQHRRIAMLELVQKHIRQRDLMGLVERLAVLLI
TGNANDSQLKALFNYLLIQHGSTPRFGKFIREVARCVPQHKERLMTIVDR
IRESGRRKGKREGVQQGIQQGIHQGKQEEALRIAHTMLEQGIEREMVLMI
TGLSDEEIKAKRH
>t0569 hypothetical protein
MFLEWFLSALIPQRLAVQFCRFGEFFFMCLLERVKMKKSILLGFAGMLFV
SASAQAISISGQAGEDYTNIGVGFGTESTGLALSGNWMHNDDDGDAAGVG
LGLNIPVGPLLATVGGKGIYTNPKDSDEGYAAAVGGGLQWKIGNSFRLFG
EYYYSPDSLSSGIESYEEANAGARFTIMRPLSIEAGYRYLNLAGKDGNRD
NAIADGPYVGVNASF
>t2680 hypothetical protein
MKGYCFNNHATELLHVGSHNHKNLLPNLKNPDVRREDKK
>t1021 hypothetical protein
MSKDEISYQILYRYSLEKLYSTLTRRVDNVLSFALVFLGVGVTINVGSPF
ILGPGIVGIAILKRVLRFGTRSTQADRQSRAWLKLFNTQHRFPSDKTLFL
AFTSLEQDASEVWSMLIGPAIVMTETALGKTPIEPLTAGEKLCAFLSGAT
KSQPAARN
>t0717 hypothetical protein
MKRCALIALPFFFLLYAAVSEAWFKANGDKRDAGMRIE
>t0162 probable secreted protein
MKKVMLSALLLSLPLLGYAQERFPSPEAAASAFAAAVAGKNETQLTALLG
DDWRQFLPPEGADPEAVARFNRDWREGHRIVQKDNTAHLNVGREDWQLPV
PMVKETGGWRFDMAAAGNEILTRTIGRNELSTLQAMHAYVDAQQDYYLQN
HRWAHRIISSEGQKDGLYWPTKAGDVPSPLGPNFSPAAPDEGYHGYHFRI
ISDNDGHGAALLAWPMHYGETGVMSFMVNQDDRIYQADLGKETESKVQAI
TRFAPDAQWQVAE
>t4167 large repetitive protein
MEDLAGNVKESAPLEVRIDTTTTINNIVLLNDTGVQNDQLTNVAKPSFRI
DVPGDVVQVRVTLDGGANWNVIRKNADGQWIFDSPNTLVDGTYTLRVEAT
DEAGNIANKDLVFNIDTNIQVPTIALDAGQDTGANTADNITNISRPTFTI
GNVDPDVIKVVVTIDGHDYNATKVGAGWQFTPGNAIPDGSYNITVTVEDK
AGNTATSKPLPVVIDTTAEIESVTLVTDSGDSDVDNITKVDKPQFSIVTA
DDITHVRVKIDNAANWIELTKGGDGRWIFNVGSALPDGQHTLLVDVTDIA
GNVAQETLQFTIDTTLREPTIVLDPTHDTGDDTNDNLTRINKPVFIIGNV
DNDVSHIVVHLDGRDYTIENKGGNLTFTPDQPLSDGQHTISVTVTDIAGN
TKTSAELQIEIDTQVQIDSVTLTTDSGVNDHDNVTNATRPSFEIATPDDV
TSVLVSFDGVNWTPISKNAAGQWQFTAGSALSDGHYTLHVQATDRAGNTA
NSTLGFTVDTQIDGLSVVMLDDAGKDSTDGITNITSPRFEISAREQLQSV
TVILNGKSSTLTQGAGNKWLFTPDTPLVDGTYKIEIVAEDIAGNKISKEV
SFTIDTIVSDPSIDLLDADDTGESAVDNITSVTTPRFVIGNVPADIDTVV
IRINGVSYPVTANGNNLWEFQVPVALNDGVYEAVVVFRDIAGNTSETKLP
FTIDTTTSVSVRMEPASDTGSSNSDNLTNKQNPKFEGTAEPNAKLVITIV
DDKSGREVLKHTITVGADGNWSVTPNILPDGMYTINVVATDVAGNTAQTQ
ERFTIDTVTIDPTIRLSDPSIDDQYEATSLRPEFKGLAEAFSTIMIQWDG
KVVGSANANANGEWSWTPPSVLAPGSYVVSIVAKDKAGNESSQVDFPVVI
PVIDVTPPTIKLSEESDSGALGDFTTNNKTPTLIGSTLPNTIVSIYVDGV
KVGEATADTAGRYTFQLSEMKDGHYVVQVGIVNPRDNSELRSTAVDVTID
TEVAELVWNISGMHEGGYINTVTPEIGGTSEPNSKITIFVNGVEKAIAYT
TGAGHWGVVLPALGNDGNYELTFKVEDVAGNIREFGPQNVILDTVISPLT
VVLREADDSGKVGDWITNKSHVTIDGTAEAGSTLTIRNPQGVVIATLVVG
NDGRWSAELDLREGSNAFVVVSEDKAGNGQQKDILIEHDTQIEISDISLS
RDTNSGDKYDLITNNKSPVLVAMTDPGATVQVYINGVLQGTVEASSSGNI
SYTMPANSADGEYQVQFVATDTAGNRVESAITTVTIDSQIAVFDIDEDSL
PALSNNRALSVSGVGEAGSQVSIFVDGKLVNVVMVEADGSWRAPILLQDD
GTFNIHFSITDVAGNTQVSKNYSVDVDSSTDFPTLNLEDASNSGSLDDLI
TNHNKPVLVGTAEAGATIHIYVDEKIVANVLVLEDGTWSYQFDNALKDGE
YSIRVVAEDPAGNTAESPRLLVTIDTSTFIDNPAMVAGSDNGIFSNDSIT
SQTRPTFSIFGEMNQSVQIFIDGVLVDTITVTDRNQVYRPESPLGDGSHS
IYYVITDKAGNTATSKTLNFTIDTFNTTPVAIDSIGGQTLAEMTGSDGKI
YITDTTRNLLFSGSAEPNSKIEIIINGLNVGEVWVNEKGHWQMPVNPLYF
TEGQLDITVKSTDRAGNVNQEKYSIWVDTHIQVFTSELDDNKSSSKTDWW
SNSSTITMRGMGEIGATVSLIVAGVTLATAVVAANGQWELSTDQLPEGKY
DITLSIEDNAGNRKEEVHEIFIDRTPPNAPVVTYSDIVNDLIIMQGTAEA
KSQLIITDSNGNTYTLTVPDNGKWSMAIPYPSEGKFTITSVDAIGNRSDD
VPLDIMKEVPVISLSPDSDSGTVGDNITRDKQPTFIIGNLESDVVVVQVD
INGTVYNAEKNADGVWFFTPGTPLADGSYTISVIASDAAGNQKNSLPITV
TIDSTLTVPEIALAAGEDNGVSDSDNVTNHTQPKFTLQHIDADVTGVTVN
VTHNGVTDTYQATQGADGWTFTPPAAWNDGTYTLSVTVVDRAGNSQQSAS
LAVTVDSTVTVTADSQHDDASDDATPTAVTPPESETVNAESDTHLRTVPS
AAEESVVKETAYSITLLNANSGDEIDRSISQTPSFEISVPENIVNVSVMF
EGEEFTLPITNQKAIFEVPLSLEDGEYTMDVKFIDKDDDFLIKEKTFSVD
HSSADIVNAMNARGKTEDDINDSPSTSSVGHNNNGAIDVFAVNEVTLPVD
NQEEHA
>t1967 hypothetical protein
MQAEYAFSRRQSKEYNRGGEGAYKSIEKKTKLKEP
>t2841 hypothetical protein
MKEYLVFQLYAPLASWGEEASGEIRHSATVPTRSALLGLLAAALGIRRDE
EARLNNFNRHYHLAVHALASQDRWLRDYHTVSAPRENKKYRYYTRRDELT
LASDEVGTLISQREYRCDGYWHVAISATPDAPYSLSELREALLTPHFPLY
LGRKSCPLALPLAARLMTGTLKEVFTHAVEEISAAELSGFTLREGICYWD
DPDEESLVWQQKQHSNNQPVSRQRWQFGGYTRFNGPLQERT
>t4523 bacteriophage gene regulatory protein
MFHCPFCKKTAHVRTSRYLSENVKQRYHQCTNIECSATFRTIESVDGVIR
AAPEKPDPAPVTPPPPRKVQGCYSSPFRH
>t1883 putative bacteriophage protein
MPQGLPVSNVVNVDVIIGPRAATGRNFGSLLILGTSTVIPVKERLRLYSS
KEDIGSDFGVDSPEYEAATVYFSQSPRPKEVYVGRWAKTLATGEAGAAEK
LMDAVNAVMGYTNWYGLGIADKEDIADDDWLKVAAAVEASGVSRILAITT
SDPATFDATSTGDLSYKLKAAKYGRTFVQYSSSSKYAALSAFGRAFTVNF
NGSNTTITLKFKQEPGITYETLTTDQAAALDAKNCNVFVYYQNDTAILQQ
GVMSSGDFFDERHGLDWLQNYVQTNLYNLLYTSTTKVPQTDAGVTRLLSN
VEQSMDQSVTNGLVAAGVWNGGPIGQLDSGDALTKGYYVYAQPISEQAQA
DREARKAPVIQVSCKLAGAVHFADVQINVVR
>t1574 putative lipoprotein
MYFKKSLLALAICASVVSVEAVANIRANDMASSATPASVGHRPVATTNAK
KIEVSGELTTGSTLQIADVFIEDEDGDPLSLDKMNNADDIKWYLVDNEAA
DPSGTPAATGTAFTIPAYAGGKKIKIVYRIKTATGTPDSAFLPATVLLTS
ASSGVSGDSAADGTISNKLRSVTINVVNESGKPTDELNGTNDANTPVVGG
TLEVVLECAATAAAECEVSNYNFTWQMADAGTPDRFTDLNNTNAEKHKYI
IKGTEQNKLFRVHVTPKTTKATPENKRAIRR
>t2635 hypothetical protein
MSEDYVIEWDKNFADDLNVVANVFLSHNPTLWPTIFSQLSTQPEIFEDED
EDEYGLQDVLDCSGGDLGNNELAQAFLQVLRGEGFIHLVDWKGEDEEGEL
ANFAADRFYELTKNLTDSEELRNLLVEITQEDEISDVCEAGDRYLDEIFE
RIQTELNKRGFQIFDLNEGSDTYNVVVLPMSEYKK
>t0875 hypothetical protein
MKYFVKMVDTGSLTPVAEMADKFLVQAKRFPSISQAELWQTSFTSKRVWL
RVVFSKRE
>t4361 hypothetical protein
MVDRYFELAQAPFDPVRIWQWIGNLNFHHQCQADQSKSVQVLRENDTLRQ
GIIAYVFGPLTDRNEIFNTRVEKFDGHLHSHSGLHLWRNDYKFILNLAFE
ADNVDLWASFLVNHQRYRNKEEQGPDDLRAQMRRHALSKPAFMREWARYN
NAMKLSGQKSQLRRFRHSRKMKRRDRKQREIHARNIKFVNENRDIVERGR
HWRCLLRFAELVLMHPERIELEFGDEKLVRTALRNCLDFITPEVPTLPEL
AALQCESKYRHSETVLYAACLEILRAEGNLECVNIELLTALRTNIHRGYH
SVSKEERDALQTEIDRLIFPYSESAEKYLRQYVEPQLTQPCPHPEIGVLS
RQEVFRHSRAKLSIEWLRRFPDVSLNSVDTLFEIAAQYGDRENLKEIIAE
RCSEMMSGWPNPKENEDIERKRIFWLVREFYFMENIADTYWAWLKSDKNN
LLHFYERSGRMNRSEHRAWPELASIKVEAILDAFIEHWPRVDLPDSWGSD
SPKEEKAYRFLTDLIWSINSDTSDDAIPVLDRLLNDPRFTNLHKELQSIY
ADQIRKKALRDFEPPTPDEIVQRLDCDSVVTVEGLRQLVLQELHDFQKAI
DGGEFNSADRFYEKNERLDEVKSTEIIAERLSLRLQPQGIAITPEHQLKG
QNRSDFTASKLIGGRRRLLVTEVKGQWHRELYSAASAQLYDRYSIHPDAE
QQGIFLVIWFGESETVAGRKTHGIKTAQELKISIEAVLPADLRDLIDVFV
LDVSRNGGHQR
>t4231 hypothetical protein
MTFSLAQGANSLLFNLVMELRGGNIRRCEALGLKPEEMRLLRSLTMEELH
YLSGSPVSVLNVAIHHENLKRMLDQANREQKRSERIDRAIALGASIEMMG
IFFGLNASDVSSKRRLEGIQTRQGRCQSPDEECEAKIWRLWHEANISDIE
SLNSLETMMLIAEEADVSLTVIWGLIKNWCTGEVA
>t2844 hypothetical protein
MDLTKEKWLPVIFSNGDKKKISLRDLLDNRIQDLAYPRADFQGAAWQMLI
GILQCTVAPEDKEEWADIWHESIEFEQWEKALNTISLALQFGEQKPSFLQ
SFDPLDSEYGSIAGLLVDAPGGNALKLNKDHFVKRGNVEQICPHCAAIAL
FAIQTNSPAGGAGYRVGMRGGGPLTTLVVPQEEDKYPLWKKLWLNVLPQE
EPPNVTQHPLIFPWLAPTKTSEKAGNVVTPDNAHPLQAYWGMPRRIELDF
THTVAGICDLCGEHHESLLLQMRSKNYGVQYDSWLHPFSPYRQALKDPSA
PWLAFKGQPGGLSYKDWLGLMLNREDKFNKMQPAKVVRAAGQRNKMSLWC
FAWDMNKAKVRCWYQHRIPLISVSHEEQFLAALNIVLVLASESLSLLRNA
LKSAKFDCPKEAKMDFSMVDIAFWQETEPAFRTLQEALAVDPLRQDTQTR
HAVSQWEAELAHYLFHVFDRDALTNPDCPDDILQRQLTARQDLASSYRKH
KARKDVLALVE
>t4118 hypothetical protein
MAHDVARVTGNARSLKVTPGQICHLVSPEKLQV
>t1677 putative invasin
MRGENPGTLCGKNKDIPGNTRKGCKTCPDTLWFTLMPFLLSFVVEVADTL
SRIVFRSFSLSLLLLAASGTIRAQAQDPFDQNRLPDLGMMPESHEGEKHF
AEMAKAFSEASMKNNDLDTGEQARQFAFGQVRDVVSEQVNQQLESWLSAW
GSASVDINVDNEGHFNGSRGSWFIPLQDKQRYLTWSQLGLTQQTDGLVSN
VGIGQRWAQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANYYQ
PFADWQTHTATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVDL
FDSGTGYHNPVALKLGLNYTPVPLLTMTAQHKQGESGVSQNNLGLTLNYR
FGVPLKKQLAASEVAQSQSLRGSRYDTPQRNSLPTMEYRQRKTLTVFLAT
PPWDLTPGETVALKLQVRSVHGIRHLSWQGDTQALSLTAGTDTRSTEGWT
IIMPAWDHREGAANRWRLSVVVEDEKGQRVSSNEITLALTEPFITMPDDN
PHWQPFQEQ
>t3418 putative capsid scaffolding protein
MTVKAKRFRIGVEGATTDGREIQREWLEQMAASYNPAVYTALINLEHIKS
YLPDSTFNRYGKVTALFAEEITEGPLAGKMALYADVEPTESLVELVKKGQ
KLFTSMEVSPKFADTGKAYLVGLAATDDPASLGTEMLTFSASAAHNPLAN
RKQNPANLFTAAEETVIELEEIQEDKLSLFARVTALFTKKEQSDDARFSD
VHKAVELVATEQQNLSARTEKSLSEQEERLSELETALQAQQTAFNELVNK
LSHEDSRQDYRQRATGGNAPADTLTNC
>t2064 hypothetical protein
MSFKNFEQGRQFYSILSFAKNRDTHHIDGTSSHKQNNLREINHENH
>t4310 putative phage tail completion protein
MDELQKVDDWLTALLANLEPAARNRMMRQLAQQLRRTQQQNIRLQRNPDG
IGYEPRRVTARSKKGRIKRQMFAKLRTTKYLKTAASADSASVQFDGKVQR
IARVHHYGLRDRVSRKGPEVRYAERRLLGVNDEVETITRDTLLRWLAG
>t0629 hypothetical protein
MSKGIRQRPPVNFSLFRQVLYAHIVAFLMMLMLGMVFTVLSLVLFYTYGA
NWLLSLFICPLFLLSGLFITGFAFKSTWSSIRYYYDKGQLKRYGLNLDAT
LTHKEKVEIRIDNAKRQVRVDELELHVLFDFQFDSKTWSCGDLLTNEKVF
DALNDGQTIPIRILPWKPESASVRQRALFNRLKGMNTAAETTDPRLGEAL
IECGEV
>t2952 hypothetical protein
MALYGPVLHSLGCHVVSYGLMPSVAQRPLRDPPSSKSIRINTNQYDLLIG
DTLRTTHHVSLKVPPLALHHTVAPHSYDTIIVIHSFNLKVILFASCCPAG
LPSLGDFE
>t2808 hypothetical protein
MFATKRNHNKIKRETDASELKTVIKIVLHEGKPIHFATTVIVLYSDRYLE
LCDLYSSQAIE
>t4273 hypothetical protein
MKSLRLKRALLSVMLITGGMEVQAALNTASIVASSISPSCISWRISGICY
WLMCTPFGCTVKTSIKVHHFIPEAVVSTYQASGGNPWTEMSLVSQTAGGV
ENAVTGALSGLAAGGGNQEQKFPGTRKSNVRFKYADAIGHPSTSIIGGQI
PGYSCNSAATPLVPYFLSTLDTLAWRTGVPESVYPEALIPGRREIGSTSA
QNIWGNLYPRSGFVTQQDDYKSGAIVAQRVADIITRTNQIHVYKPLTGNP
SAGYWPPEPVKENTGTQNHKWQQLSPTLSMSCSVFPDTGKIAENGNYAWT
LWQPYSCCKRRGQTFLYSTDIN
>t0410 hypothetical protein
MKSTEFHPADYDVHGCLRLPFLFWCVLLLQARAWVLFVIAGSSRGQGNTL
LNFFYPDHDNFWLGLLPGVPAVVAFLLSGRREAVPGVWRWLRGLLILAQL
VSLCWLPVMWLGGDPVNGVGLALLLADIVALIWLLTNQRLRACFSLEKE
>t4288 hypothetical protein
MSIIDEYISQHFSERLCLDVTEEDITWQLRGSRSDYVNTRIQFDREKLMA
VMDVMLSGLDSDETTLARCRQVLTLWIAGLDMLSKEAEQPDWLPRVHPHS
SGQCDLLLKGNPAALTEADEETYLRVTGQQDLPAHRRIPQVIFSKTVRYW
HRFESWLAQQLQDITQHCYQKLKCFVANCTTEPRQLREFRGEYGSLRLFV
GPQDIDEIDILEFNPEYIVSWVDKVADGLFTPVCFVVNVYYKNGILLESF
TWDSEVDNINRMTSSDYGEAMSQAISWVREQFEQPVIDQPVPQQPRLAA
>t1475 putative secreted protein
MNKFSLATAGIIVAALVTSVSVNAATDTTKTNVTPKGMSCQEFVDLNPQT
MAPVAFWVLNEDEDFKGGDYVDFQETETTAVPLAVELCKKNPQSELSKIK
DEIKKELSK
>t2593 hypothetical protein
MKQDMWELRKIQSQGITDNAQYPAGTYIVFTAAAPYIPLFEQHGKDDIAL
SLVRHEEAWWIVNHSDELCCAVNEQVMEPHHRMRLNDGDTIEWGLSSWCL
ARTNDESRPDVSFPQLVQSSESVAEYLDLDWFKQQQLNPQNPFDIIPVRE
TAPSYTGHEADSTLHQLYQEYQQALRPSGQEKPLRPKPFPRNEDAVTQDL
TSLYDKKGDTDTLQDMVAGAPGIDAILDTLDTTGEGEMHWLAMESMPDIL
QLLSPEMGGKTAHSEILPDLTRREHRIIGIDSHYRITPTQKNGNTAHEKN
>t4527 hypothetical protein
MRHTTITARDLECLEHMRNVGQLVNELMQVQDCATVRRDPVQQSQLTSVI
YLMTAQLDGVVERCNQRWLTGEGNV
>t1706 putative secreted protein
MKNVKNLIAAAVLSSLSFASFAAVEVQATPEGQQKFGTISANGGTNLGSL
EDQLAQKAQEMGAKSFRITSVTGPNTLHGTAVIYK
>t4451 hypothetical protein
MLRIKQFCLSILLMLAPVLALACSDIPYALFHYQPWGQGTKYVVEEDYGY
SINFKDIEIREVKFSQLRHVPDRIVNEYNDRGKMSNDFVHVAAFSSLIKN
DFWQFYWLTDGQHILWAGKIVQNPPGKPPVDAATFRAYGRFAADKDSLYF
DGERTDDNHGEKRVDMDSLQQVGGNIKLRDSGDVLKDRRNLYFQGRWLAS
AQGYSILGVKSWEQRNILFSPTSCDTTSNPGPWDTILRTATQVFVNGVAI
DADPNSFHVVRWVAGVQLLYRDKAGVKRYYFGKDCLNTFNVQRDKVTWMT
REASRRGDDCRIATIPNVDPEYFHPIHYNWTVAQYKDLLYQVKYVSPVDR
VLTITRLPDPQVQLKAGANLVGKQIYFIGDGGVQIIDLIGPLVWFKSPDG
SFSDVFAHDDRYLYFFGDTLVDSGQRRNVTRTLDTAHIDKSGYYLVTEEG
KYEVINNAFTSR
>t0350 hypothetical protein
MKKVILGAVLFTLSGSVLSSSLQDQLAAVAQAEQQGKNEENRQRDALQAK
RDQEAQQERQRQANAAAVAKQRAKAAEAERKARQAKLAAEAAQDKARDQS
YEDELRRLEIQKQKLALAREEARVKRENEFIDQELKSKAAQTDVIQSHAD
ANRNLSEGGRDLLQSEGKAREEKASGWFN
>t1872 putative bacteriophage protein
MIDFSKLIMELRVMGEKLPNWKFLLIWIVFFLFGLSSLIGAVRWW
>t4636 putative secreted protein
MKKSPLCCYLFCAAMSASSAALAATAPVSAGVIHFKGQIVEYGCNLAPHD
RNIEVSCLRNPPFADSRHTVWKRYFVDEWNRHGTARVVKRSSGDSKPDRP
VPLTAYGSRREEFRQAAISLSKTPPACILYFSLIRILQQRCSSLRYPVRS
TRSLSTVPMIAVIK
>t0981 hypothetical protein
MKIGWVFYFSYIFVAIIFHLFISFCYCLSIDMAEANTVVFLLKPGTISLL
FLLLPARRFRTRLLATLSSVFIALVFNQWHLVAGNKELVLCMQAACFMAF
LAMTSVKKSGWMISASLFLVCAAGTIRQCWLEQLFNAADIYIVDDGRSCG
ASGHCFQYIAAKGRGLAAKRQALFSSEEYVNIYYSYSEGIPAVNFDGMKN
EFAQYLLCHGELKYVVRDDKTTCD
>t0864 hypothetical protein
MDTVEELNGTYFYAGKSNLTAAELLL
>t1516 putative lipoprotein
MRTHTLFKVAVLTGLLALSGCASKVTQPDKYSGFLKNYSDLKETTSATGK
PVLRWVDPHFNDSNYDSIVYNPITYYPIPKPTTQVGQQVLDKLLAYTNTK
VKSAIEQRKPLVTTAGPRSLIFREAITGVDTSKEGLQFYEVIPVALIVAG
TQMATGHRTMDTHLYFEGELIDAATNKPVVKVVRQGEGKDLSNSSTPMAF
ETLKQVVDDMATDTSMFDTNKK
>t4121 hypothetical protein
MALPRITQKEMTEREQRELKTLLDRARIAHGRPLTNSETNSVKKEYIDKL
MALREAEAKKARQLKKKQAYKPDTEASFSWSANTPTRGRR
>t3428 putative phage tail protein
MAELQKVDDWLSALLANLEPATRSRMMRQLAQELRRTQQQNIRMQRNPDG
SSYEPRRVTARSKKGRIKRQMFAKLRTTKYLKTAASADSASVQFEGKVQR
IARVHHYGLRDRVSRKGPEVRYAERRLLGVNDDVEAMTRDMILQWLAG
>t2509 hypothetical protein
MRYLFVIFLYIAGKMLAKIDKSLHIINKAYWHMICHIFYGDYLTQRKAKR
G
>t4159 putative lipoprotein
MENTMKLPYAITLLLCLFLAACTLPDRFSAVAFQQLTLLQARSTRFLQDA
ARIPWQKETLLKDDRDIRQTFFQAERVARQGGDKHRLDNLALLKNHYLRL
YARVMKCKQPLTYIQAERYQQQNNQVWKLAIQGECLHWGARCTQGEENGV
Y
>t1449 hypothetical protein
MGITIMQKTSLFFSAAAITLTLMASSASAATPTAAQNEATSTVKQEITES
INRYLYSIDKADPTLGKQLFYVSPETSFIHPRGHERGWSQIAENFYGTTM
GKTFSKRTLKLDAPPAIHVYGNAAVAEFDWHFTAVRRDNGQTQHTTGRES
QVWAKIPNTGWRIVHVHYSGPAKTGVGEGY
>t3242 hypothetical protein
MILLSKQTPLGAGRHRKCYTHPDNARRCIKVIYNRDHGGDKEIRRELSYY
AHLSRYLTDWSAIPRYYGTVETDCGTGYVYDMITDFNGAPSITLTEFAAQ
CRYEEDVAVLRRLLKKLKRYLLDNHIVTMSLKPQNILCQRISESEVVPVV
CDNLGESTFIPLATWSTWCCERKLERVWQRFIAQPALAVALERDAQPKDK
KRLALTSHEA
>t3441 hypothetical protein
MADIATIFHWPPSVTDVMPLTEVLEWRYKAIQRSGANDE
>t1576 hypothetical protein
MATVYGKSKVHTLTINGSMLAGRLAGPSCSHIYNVGMITFSLYFTVLTFY
LVEIPFNLNVLFLMYIIRGDIIHIFEIRADDMYTTIRNTTLAMVACFSYI
AHASTHPPLIITRGAGGDASGAAVIHDNWRHGTPDLVNLTDIPIDKIRPE
KYRCVLIIGQGAIKEMLLANNASAILSGKTVGLYTHLIDQNTLRLLRQLQ
NKVRFNLFFTRSQITLLKLRNISEYNFLSSKVNNVWGQDSLAIETVAPDR
GNIPEKALPLKTTDYVIWLGGNYTTSSGTQRIFTNDQIVVALKPLHNVIS
PNASIAIMLSPRFFDNSMSKEAKVKRLKAVLNTFSQNRVTFYMSKEMLAN
LKEFDLPVQLSPSYAELMRMPWASATRHFASVDQYNLFADLIPKVTPFLL
EPNDADQALYATDYLNTRRVSLTQNILNHGCD
>t0935 hypothetical protein
MLICFMPALLKRVGLPVKYTDNAKLFISFINIGIVSCALLLFYFV
>t2694 hypothetical protein
MIEVNEDHIVAVQVGNEPGEKLILKAADVTPYCEKGDFGVC
>t1106 putative bacteriophage protein
MLKPIFYSGSVKVPECLETDKEKNVGRTPLSSDIQQIKNVVEDIPPFPES
RAVGGSVSAAYRLSFDEVFCGLSNEERKKVYGRLFGKQVLAHIHSRCQRD
ADIIREKALRRISRECGTEIDCALLLNKMVDILQNARLTINFNAAKIDFV
SLLKNKEYLNSYALGCRPGDLPAYNVGRDSVETKAFELEKLADSPYAPYG
QTGGFSVAYTPNSKTFSPTSRPIYAALDFLNGENGGASAYGKSFFELNDN
VKTNCTFSPFDIYGHRFGLDTSKLSTFCHMENLIASCQNDFFGYNCFKSL
VKMAKGEKFLAHSNYGTGYEGNYIEAHIHGDVCLFRDIKHVYLSLQENSY
SESQLYDYAKQINQALNRDCIILY
>t2658 hypothetical protein
MEEQVMDTNELGLVKARVELITAMLKCATAFVGLVGAVYAVLNMAFNYNN
LTHDNRSSKVGR
>t2991 hypothetical protein
MARRPFSSQSLVLIVIAIAINMIGGQLISMLKLPIFLDSIGTLISAVLLG
PFIGMLTGLLTNLLWGLLTDPIAAAFAPVAMVIGLVAGWLARAGWFRTLP
KVIVSGVVITLAVTLVAVPLRTALFGGVTGSGADLFVAWMHSMGQNLVES
VAITVIGANLVDKILTAIIVWVLLRQLPLRTTRHFPAMSAVR
>t1038 hypothetical protein
MKKFRWVVLGIVVVVCLLLWAQVFNIMCDQDVQFFSGICAINKFIPW
>t3317 possible transferase
MKFKYALTSLALSVAILSSVPSTAFAIGGASGAKVDYQVQGKIGEVVMNP
YDIAPLTAVIRNGGYQLRDVHVRIVPKENGQEIAYKVNNKYLLTYGGIPV
FGLYPDYVNTVEVEYTRIQGSKTENIKESYKMYAPPAYSESAGTKEEQSA
LFTIDVKKVSPEFKDRLYLLNNTKDKSGNGTRTVWNNPTGGALEWNFTTA
NAIIDTSGDIRWFMNPSSIYDLKSIYRAGVMMGFKQNQDGALSWGYGQRY
VKYDIMGREIFNRRLPDNYNDFSHSMDNAPNGHYFLRVASSNYKRPDGKN
VRTVRDVIAEVDQNGVVVDEWRLFDILDPYRDVIMKTLDQGAVCLNIDAS
QSGHTLSEEDLAALDSSDKFGDIVGSGAGRNWAHVNSVDYDSEDDSIIIS
FRHQSAIIKIGRDKKVKWILGTPAGWKAPFNAAILTPVDSKGQKISCQES
GCEGDFDWTWTQHTAFKIDSKSKGDILYLSAFDNGDGRGLEQPAMQSMKY
SRSVIYKIDQKNKTVQQIWQYGKERGNEWFSPVTSITEYQTDKNSVFVYS
ATAGGEFDLSVGAFTSLPNPYLEEFRWGEKEPAVEMQIHGARGYQAMPFS
LTKALTE
>t0731 hypothetical protein
MYKKIVILVITLIIIFFGGGWYMHKSQQQMATLVISDSENALDYPNKRKW
FDASRWLSTSQYIKIDDFYLLNLKHHPVNNINDAGIIVILHFAIRDAIKK
FPELSKLSQMDNKEFFHFMQNKLSNEYLRTKFNEDTLEPTDDYFLFFFTY
NEISYEVELLRKVTEHGMMFVPYGYQVNKKGDWHRMHPSTYSCFNDIQSN
>t1761 hypothetical protein
MRRLFHFLMNNIREHFMLYVILWSLLAVMDIVYLLFF
>t1364 hypothetical protein
MMKEYRYSGPASGVTLSDGTEILLWPGKTVSLPEEHDYVKVLVALKHLTP
VPEDTKPAVTPAVQSPKRRSSSDSEVKTEDSHGS
>t2545 hypothetical protein
MVKYHPVHVTKNRIKTDDVTDIYFVEPFWKKGENHIIESLMFFEELQKSF
NLFNHKYGLNKYILRIPWEFVLIHMERISKLNSVGLFAVSVDFNNNHKFL
SEYIRSRRDYGMEVWFDFCGKHSYSSEIKNLGFFFQACVVPRDPNFISSV
YHYHKFQKILVGDINDVEQRAVYQNEVDYMYGMQWPSSYDGFFFRDHKKN
ETWCI
>t1179 hypothetical protein
MRRSLMVAVMAIMPMVGLANQNSRPDIQVNVPPEVFSTRGQSSQPCIQCC
VYQDQNYSEGAVIKVEGVVLQCQRDEKTISTNPLVWRRVKP
>t1107 putative pertussis-like toxin subunit
MYMSKYVPVYTLLILIYSFNASAEWTGDNTNAYYSDEVISELHVGQIDTS
PYFCIKTVKANGSGTPVVACAVSKQSIWAPSFKELLDQARYFYSTGQSVR
IHVQKNIWTYPLFVNTFSANALVGLSSCSATQCFGPK
>t4128 hypothetical protein
MMKKVLYGIFAITALAATSVSAAPVQVGEAAGSAATSVSAGSSSATSVST
VSSAVGVALAATGGGDGSNTGTTTTTTTSTQ
>t4233 hypothetical protein
MMSEKEQVSVSRKAGALRSALNIALNTDYAINLWQGRPPEEEGEKEGKKK
ERRQSLIFSMPAFIQKAGRINADSFNDNPYADQKMLELETLLQSASVKMN
EELVALKKSMSMLPPQATISEVNCASPLNIGVFSRTPLGYRCVWLLVGFD
QLAMQAFQAAHYGFISQQELHRSLRRGGHLIRQIYGAAQKYRFFQVNRRD
FALQNAQYHEAIRNAGEIDEAVLLGQKRSSFSPPVSKESIELLLAANAKA
DKADIVQLL
>t4317 putative capsid completion protein
MKFVAPEQAPEQAEVIKNTPFWPDVNLSEFRSVMRTDGTVTQPRLKQVVL
TAISEVNAELYDFRNRQQMLGWRTLAEVPAEMLDGKSERIRHYHNAVFCW
ARAVLNERYQDYDATASGVKRGEELAEASGDLWRDARWAISRVQDAPHCT
VELI
>t2020 hypothetical protein
MISLVVPTLDTLRQWLDDLGMNFFECDTCQALHLPHMQNFDGVYDAKINL
VDNTVLFSAMAEVRPSALLPLAADLSAINASSLTVKAFLDMQDDNLPKLV
VCQSLSVMQGVTYEQFEWFVRQSEEQISMVILEAGAHQLLFNAEEDAQKT
SAVDHFLH
>t1869 putative bacteriophage protein
MSKYTDLITNYHATKPKFVEHIDLVTRPLAETSAAINGLISAFDIDYATG
IQLDILGQWIGLSRVVSQPISGVYFSWDTDGLGYDQGVWQGPYDPDSGYT
SLSDETYRIVLKTKIAINNWDGRNDSLPPILDAALDGSGLKMQIVDNQDM
TIGIWVFPETDISSVSLELIAAIRQGYLTVKAAGVWGGSIEIPSVETPSE
GNRFFGFDMDNEYISGFDAGSWGTLL
>t4605 hypothetical protein
MVKERLMFRWGIIFLVIALIAAALGFGGLAGTAAGAAKIVFVVGIVLFLV
SLFMGRKRP
>t0718 hypothetical protein
MNVTSGVNAQTPLLPPSEWGDDEKPVAEIVEFNAYGNKPRCLMCLGTTAL
FTGAFSGVCSGAVASVSSGAAYTTALTILGASFGMGGIGMMGICAGLYLS
ANGVRTRPAWP
>t0576 hypothetical protein
MSDEALALLFSAVENGDQNCIDLLCNLALRNDNLGHRVEKFLFDLFSGKR
SGSPDIDKKINQACLVLHQIANNDITKDNTEWKKLHAPSRLLYMAGSATT
DLSKKIGIAHKIMGDQFAQTDQEQVGVENLWCSARMLSSDELAAATLGLV
QESPLLSVNYPIGLIHPTTKENILRTQLLEKMAQSGLSENEVFLINTGDH
WLICLFYKLAEKIKCLIFNTYHDLNENTKQEIIEAAKIAGISENEDIDFI
ETNLQNNVPNGCGLFCYHTIQLLSNAGQNDPATTLREFAENFLTLSVEEQ
TLFNTQTRRQIYEYSLQ
>t0488 putative lipoprotein
MKTFSLVALILLLCSCSAPHHDSTQAVKQFYTSWMTTFMNDVNTPDDTTA
LMQRYVAKEVIHRLALIQSLYEQEIVGADYFMYAQDYAPEWIPQLRVGKA
HPFLGGEKVDVLLATESTPIHLEVYTRWEEGRWKIYRVRDADKDYEQPIY
DAGAITQAEAWSAKVAPEYEKH
>t0694 hypothetical protein
MRVAKIGVIALFLLMAIGGIGGVMLAGYSFILRAG
>t0737 hypothetical protein
MEQDDRLLNAMFEMCNHKNPLNDGQREWHIADIPGLLREERYDELDERYN
QALTESFTSREAEKRYFFAWNQMDNPFYDMDTLVEAGPQGLALIKNWQRA
RPRSTHAWLAEAQYWNHRAWLYRSYGWARETTRAMWICAAACNERMVIAA
LNAIDCEPRQWMAAALTSTNSKVFGQPDWLVEFLEGADVAGQPLMEDLAE
YHRHSPQEVDALMAHSGLSFADAVCPNLPRPSVLPECNDDAGQKYWLAVC
LAIFPTAFYVLDEYIPFRMPRWGGSHEEIREFLESSVCDHLSAAEREHLE
LLIWWDDHRDLRIKEVDSPAEQERIIAKAEEISLRAHIQESRHNTLKWLR
VCYSDLDDNDALWRTLQRSIVEKVKFNNYFFDDTIKFALRDFPDTWWMYN
FLCQNAQQTEFAVPKIRRGYFQYAGLLGFEKDEAQGLAWLDSVADIQYNH
NWRAAIKNFDWFGLPEHFVPLAELGAQRNIPAALNLLGLEHNNKENNGLL
PYDPAIALGYFQRAAEILHRRLALRESTPYKLIDNGGYTDYENDLQNIHF
SIGICNQRLSKQELDTEKRSAYEKELLDNLWLAHQFGHKEAWGLFLLNIF
EVKDITLAHKHLELVQQEANKGTLHAMVTLSRLHGNKHDRTLFNMKLSAR
WAHFAFTLYPDNEIVMDCLDHLHFDSFWKRFRFAWYTVRIPNSELPGQVN
SMV
>t1599 hypothetical protein
MKPGGADREDKSPAIIIEAWTAGENRKQYVYIYVLKNELITKIKA
>t3032 hypothetical protein
MSQNGLRFTLDVDGLTPAATTVVSFTLQRYQQFNRCKLE
>t3090 hypothetical protein
MDYINRWLGSELLMFCVLPWGYAAAVASLLILMFSKKRSRQILLWVLLPQ
WAFVVLLLPTLQYTQLLSQTGTVWMLMLLLPILSWAGLLPALLLGTWLRK
PWPAWLLCHIVFIGMLCPVMPELWRAISYQWQQQNIAQLLRQVQAGDLRQ
LESIHDNSTLEQTLVQAVKAPGISEKNLRDLTARVASPFRFSREDGYFVN
APFFAAFESGNITAVRIFSEQLMGDSPQAQANRTIVRRQNPLEYLPTPRF
KPEGFRQTFFEMADILLRVMPDLLTDEAYSGAIQLQDKETLAFFWQRREA
QNPLYRAYYFLLQGQTKALLAQIKLTPQVLGQSVYPNKNLLASLFIDADG
ETLRALVKGQMLNWQHIPQDKLTDGWNFLISRTLHTASKEDALPPDILAG
ILQSMQQQHTALSEALIVASLDYQDERHSLMTAYRMAWLDCNKLNAMIDK
VYPPEDTRRTNVRIKLAQQCADLD
>t3166 hypothetical protein
MTKSREPSILRAEADQTVAASSSSFRGDGRRGGKSGLHRAGCQVTPGGGN
PRPVQQRANRRWPTQVGSGKGERVR
>t1910 putative bacteriophage protein
MVLNCPQPLKVIINSLDVFTSYNLRKHCQTCMRVIIMGHALKKADRLYIP
PRDKSMVAKPRAAISKACSHTGQVKNAFEFGFARYEKAMEELSKV
>t4299 putative phage tail protein
MSDKLTEKTVKLDTPIMRGKTEITEIVLRKPQSGALRGTRLQAIMDMDVG
AMMTVIPRISSPTLTAQEMAELDPADLTAMAVEVVTFLLPKSVLADLPTT
>t1502 hypothetical protein
MSHLEEVSARVDAAIAESVIAHMNELLIALSDDAELRHEDRYVQQQRLRT
AIAHHGRQYQEDRDARREQLTKGGTIL
>t4258 hypothetical protein
MVSILLTTSSMAALNVIADLGGEPTAPLFDAINNKNNEFTPPRSLNATGA
SVTAAPADVSDMLPVTTPEMRAGKVEARELNLNGMTPVFLVGDDSLSREW
LTMRRDELKRLHATGLVVNVSDKDALSELQQIIPDVILLPASASEIARRL
HLNHYPVLITATGLAQ
>t3000 hypothetical protein
MKKKPVAQAEHQRYLLENPLVYGLLSRLRIAIVVNCFTLANKN
>t3001 hypothetical protein
MATRRKRRVLLYEPRVAFYTQNAEKKQSINAHNVSIIRNVYRLILTDIAT
AFKKSWKLGEEVT
>t1329 hypothetical protein
MTVQDYLLKFRKISSLESLEKLFDHLNYTLTDDMDIVNMYRAADHRRAEL
VSGGRLFDVGQVPQSVWRYVQ
>t1152 hypothetical protein
MGYKLNHSVLIKIIVIINDCNPDIYALLSTTLKGDGYHKLHFPDDVLFFL
TANKPLTLRHK
>t1094 hypothetical protein
MKRKNASLFGNVLMGLGLVVMVVGVGYSILNQLPQFNLPQFFAHGAILSI
FVGAVLWLAGARVGGHEQVSDKY
>t4256 hypothetical protein
MKPMKSTLNCLRQWTASVMLAVIVSLPALASVSTGRSVSAPQQQNTTTEG
RTQTSGTAWGLSEKEWQQYQQVLKGPRAFQSPGIDPLMALGLEAGSASER
RRFAEMWVKEEYARGEKELAFQREVDAAWKRLYPNTLSVNMGNASGIAHD
TQGRLALFVKRNCVPCDARVSAVLADNRPVDIYLVDSGGSDDTIRQWALA
HHIPVDKVRSRQITLNHDNGNWLKYGQGYMPVMLQQGVSGWQIAAF
>t2015 hypothetical protein
MGFIKQSTSSHARLNVPTLVQVAAMAIILIRGLDVLMIMNTLGIRGMGEF
IHRSVQTWSLTLVFLASLVLVFIEIYCAFSLVKGRSWARWVYLAAQIIVS
GYLWAASLGYGYPELFSIAGESKRDILHSLVMQKLPDLLILFLLFIPAPS
RRFFRLQ
>t4293 hypothetical protein
MNKPLIALLLITSFSASADKIPSSIQNLIAVYDTRTHSLEDGGLTIRYNK
RLLLVDAAESMFQGICNDYYMNKWKPGTIKRITLLNVTSDQGFEINAGGD
ECRKAGTMKDEQARTYRTSFIKPLQ
>t4313 hypothetical protein
MKKKVMSVFFQLAWAALLVISLLYPRSGAPVLVGASVQVSCFLAWLLAAL
CAVGWFAGDRARDEVRAALIKFRAHPVKPVRTWAIRLLIVLCLAFSGWVI
TLVFYLLTLVLYQIARSQLHEPMAA
>t3357 hypothetical protein
MADAFILLGIVMAMVSLGFILINKLFCFISAGCLLSLCASMASLQLWDAS
YWGRWGKVCPGLDVIISCDNYHSLYDLGWGLYGIAFLFFTALMLTCVVII
LINMIMALERYCAGWRR
>t3921 putative lipoprotein
MFHYGKCVVRTLFLLLVCCSLSGCLSFGLILATDKDGNRHQSTWKSDTVT
ALSLGKDSNGKTGWVFVGEHFDYLLTQGGDNVVALLKDATIRRDKMRVKD
GVKFLIDTDKKEFTGEVNVTYAWVDEKDKLAAVGYGFICADGAVNCTLSV
MDLKGTIHQKNKEQSVTQQLSFYHPFTVEFYQYQRSMSGAKFGRVLLPVT
LALDIVTMPLQLLIFSR
>t2663 putative bacteriophage protein
MADIATIFHWPPSVTDVMPLTEVLEWRYKAIQRSGANDE
>t1150 hypothetical protein
MGKATYTVTVTNNSNGVSVDYETEAPMTLLVPEVTAEVVKDLVNTVRSYD
TENEHDVCGW
>t2840 hypothetical protein
MYLSRIQLRFNNLRPEMLAKWNSARPYASHQWLWQLFPEQELRQFLFREE
AHGGFFMLSAIPPLLQHSLFLIETKLFNPQLTNGLELDFQLRANPVITRN
GKRSDVMMNAKHQAKANGVEKERWWELQQQAAQAWLEQQGQQHGFRLIAP
EPDDFAMWAGDEYSELQAHCGCVQAYQQHRFVRKDQETPITFSSVDFSGA
LCITDAALFKQALFSGLGKSKALGCGMLMVKRKR
>t1557 hypothetical protein
MRITVSDISTRESQQTVQIQAIRSWDTIPYLSMLDGLYQDDIFHEQISNL
PEEYIKLDEMAKDEEKNSLNIYEFFFEPTHEIICEEIQSTLDFYYSNSAT
FRRLVNYKVERSINDDVDTNKCEVKISLNYSYENIDGGRGCLSLPFDQNG
YPIPPDFHNCVNGITSKKMLLDLFLKHILHDHLNASGEVANVYANVIYKE
IDPAAIAHTSSCFSQVTMSGEHELFNVDSVTVPPKSTEQIIFEGKEIQKQ
IFLSSHSRNHLQPVTIHKMDRSIKNIAVTGLLLSSRLAVTSGDYRIKNGN
SEGENDFLPVKELQRYERALPEDHPAPTTEPNLAGKVLDGLFEHVTSMAG
GRYRKQASYPPLSSEAPNTIKYGDRELVLTKEPGSETYQATYSDSGKNSA
ITFYRSSDGRFYQASGLKGGGLIRHIDKPYSELREGDTGYDEELLDITDD
SPLLEDILASLSENLYPTSEENVQSIYKKYQSGDAVAGETEVVLCRGTIG
PQAENIVSFKTAGGIEGAM
>t2648 hypothetical protein
MESTALQQAFDTCQNNKAAWLQRKNELAAAEQEYLRLLSGEGRNVSRLDE
LRNIIEVRKWQVNQAAGRYIRSHEAVQHISIRDRLNDFMQQHGTALAAAL
APELMGYSELTAIARNCAIQRATDALREALLSWLAKGEKINYSAQDSDIL
TTIGFRPDAASVDDSREKFTPAQNMIFSRKSAQLASRQSV
>t1383 hypothetical protein
MIDTPRDFMAGLVCQLESTARSLRSTFDLPDEPTGNAAPSWLTEPTPQIN
GLEA
>t3089 hypothetical protein
MKKCFLFIFMCLFIFSANAELKFRPEIENKKIYFQGKVTDYTLNDFIFFG
DSREPFYGSENDDYTATADEWLRFYAELPDVRKWQRVVPDDFSMMSGAPW
CDIQFFEQENDHSVITGSEHTRCIDFLVTPKRKGLIPMGTKGTLLDYGSY
LAFAPQIEHITLEGRFPEASRTWLQVRSVQTSATSIADLKIMLNEIYARH
GYIFRPGGEMDRYFRQQPWYRPRYANVDYLITETEHRNIKLLKELLSPES
QRKKALYIQNTTDFCEALKANNIKYVMDNTAEYLTTGADQPINLEKVWQK
YRDLMINGGGNCQRDIESYKRYSFLYSEPLGIYYNSTFEFDEKKGKYVLA
YVNFDSSCNKEGDTIELEGMMVKREHVIEDNQGQHSDKDLELKTINLQCV
AGAMMDQPDWDRYVQLILPPEKYNYYQRFIGKKITLRGKVMIAESMYHVT
PVLLNLLEEQNPIIKVADMPARP
>t1945 hypothetical protein
MEQLRAELSHLLGEKLSRIECVNEKADSALWSLYDSQGNPMPLMARSFTT
PGVAQQLAWKTSMLARSGTVRMPVIYGVLTHEEHPGLDVLLLERLRGVPV
EAPARTPERWEQLKDQIVEGLLAWHRQDSRGCVGAVDHTQENIWPSWYRQ
RVEVLWTTLNQFSNTGLTMQDKRILFRTREYLPSLFEGFNDNCVLVHGNF
CLRSMLKDARSDQLLAMVGPGLMLWAPREFELFRLMDNPLAEGLLWHYLQ
RAPVAESFIWRRWLYVLWDEVAQLVNTGRFNRTNFDLAAKSLLPWLA
>t2842 hypothetical protein
MIMTTFIQLHLLTAYAPANLNRDESGRPKTAFMGGVERLRVSSQSLKRAW
RVSETFEAAMDGFMGKRTRRIGVDYVYRPMKDAGIEEKIAKSSSELIAKQ
FGKLKSDKDAKPEKNLEIEQIVHVSNHEISLIKQLVDTLIADKREPNDEE
VKLLRKEQRSVDMALFDRMLASSPEFNVEAACQVSHALGVSAVTVESDFF
TAVDDLNNKEEDAGSGHMGEQGFASALFYTYVCISRDLLVENLGGNEELA
KRTIAALTETALTVSPTGKQNSFASRAYATYALAEVGQKQPRSLAAAFFQ
PVRDTDQIPAAITRLKQQRASFDSVYGNCADDYRELNVQEGTGSLAELLA
FVSQ
>t1388 hypothetical protein
MIFKCIQCERDITALRFHSAIAVMSGKYHIPAVRVTLVCPYCSQHFSADV
PVMEFSRPDREDAQ
>t2592 hypothetical protein
MNLVKIIYELKLPKRFLQKNKYI
>t1209 hypothetical protein
MLKNGVKKHAPDIIEYPFFIRYLSARNYWLHDYAYNTMFFGINMNIMLYS
FELIFFDGFDVYLLLIFTVIVLSLMMSASNGWQGNITKLLPSELLGQLLL
RKKK
>t1577 putative lipoprotein
MQKCSLITVLSLSVLMLAGCTTTYTMTTRTGEIIETQGKPEVDTATGMTK
YADVYGYHRVIKTSEIVQTTEGASKLDW
>t1395 hypothetical protein
MTTESVVCALFWYCFVGWCTAELHRRSGFYSRYSGAGYWISWSVMFLCWP
VALPLYVDYIGDAGRRSNNDD
>t3427 putative phage tail protein
MNKPQSLRHALNKAVPYVRNNPDKLHLFVDNGSLVATGASSMSWEYRYTL
NVVIEDFSGDQNLLMAPVLLWLRDNQPDAINNPALREKLFTFDVDILRND
VCDISLNLQLTERVLVSTDGSVSSVEAVAEPDEPEEMWTVKRG
>t4230 hypothetical protein
MVPGEITSETCRNIPSNALSVPGWLNALVTVVCMTAGCRSYYGWYQNSDH
KKRRSVTFYGLGERPAVAGYLFGVVSRQLKRDAEHYLHTACAHPLLKLST
IRRRMDEYRLYWVAGVWTVLESFEPETSEKVLLDRWLTQQQRNLTPAKVR
QPEGCRNAKKVRQAAWTAGRQAEIYPAMDYRSEEHSLQEVSNG
>t4595 hypothetical protein
MLQRTLGSGWGVLLPGVIIVGLAFIGLSAYALKLLIVSGLLLSALMLYHK
QLRHFVLLPSCIALIGGMMLAMMNWNQG
>t4545 hypothetical protein
MSQSEYASILKCTPWLAKFLTRRGLKQPDHRPLYEYHATSEEYDELKRLL
RAIGVPDGYKSDKGYAACFTLFCSEWYRRDYEREYGWAWEPIYKTIGISA
SSSKMGKIIPKGLDGYWGRPVRFYDTERRNFLGSLFSEGGLPFRLLKESN
SRFQSMFSLILNQYDQAKSSNISTFALVHAAVEKSSLPVVFKEDTSVELI
SRMAEQLVSLVQIYDLSNHTEPVKELERVHPKWRDSFPVPLDDDTGTSFL
NGLLRTASTESKPRLQKNKTTLCQFLWSENHPEALQALISLPEELSFSID
IEPSTTRFELAIYEDGNEIASLGPAYATLSNSQAKIKVRKREIKFYRRNP
TVSLFIVARAGGMFFGSNLLEGSEVAVGDVPLVFVSDKNEWLLQGQASCS
VRGSHVLIVLPKDGCLASEHEDCDSGFSALGCHALTIKGRQDIIIKGDET
YRIKIGRDQIIHTGFSFQGKRLNWTSYPDELFLGVPGITQHSENLSTRHY
KRFFNGTFIENCDVQEKMGAQFISVRNENDETLLRKKIGILPNDFSLEIK
NGQQANEGSVIITTQHPCLYSLKEKTLEVGRKRLPDSTEIMMKAEGIPPA
SISLQITPNLTANPIVIWLPFPARGCLAFDKDEKPLPKNLTINDLLGARA
YLFGKNGEPTRYQLELRLRSRSGMQAWYEWHYSAGECPVELTLYSLREHI
DNLLSLEEGIDQTVDMRIKGGGSSFTWQIRRYKYSLDYDRGRQILLANSI
SNRTGQIPSPVIMLLSEPERKVVLLTSRMSEGVPVGEFELSSIIQKNGPW
LVLPKPGEEASFRPCFIAGEPVIQSDATAIQSLQKATQLFNPRSDVNTIM
LVLEQMASDPAHSGWQFLRNLYDQFGYLPLATFEVWRALVQHPQALAMSL
FKFEMSIDYLSRIESEFPVFWEFLSITEVKRSATRFRAFLTHKGAPEEMQ
IRLLYRMYQQLGTTFPTYASEVQLWLSQGKLPPVFPELTMKGIILEWYQE
LLREHGESRWPEFGGPGLLRWYMSQQNPVIDISPDASYRYSVTLLPVFAA
AVASGKTTFESVFENKPGAVFFLRQVRDFDSRWFNAIFQYCLLRNVTEK
>t0015 hypothetical protein
MSIPNHVSTTEVVLLELEILLTIISIGAWGGFVSYLLRKDKTEYNSSHES
IKYCLTQIVISCFTSFLLSAIAIEKECSFNIVLLAAGLGGVFASPILKIL
GRRIKKIIGGNNSD
>t4235 hypothetical protein
MSKETNYFNLNINGLGYITNVRQVVNGNSKFTCCTLNALSGPTDNADYTR
FDVTVAGKDATSLINRCQKSCDEDKKVMIGFVLSGIKSDIFTLAKGDHAG
ENRVSLKTRLIRVDWIKIDGAIVYKAEKPDSTPPAQSQPAQKQYAEDSF
>t3343 hypothetical protein
MPPLVRGVAYCHANDVTQHMDVKLMLSVFIPSSERCVSRCRYLLSFALIN
IIFSILVGVLLYLSFVILAILFTILLHYLVINLNCQRFRDSGFEYIKFYV
WGTLVIYIASFVIMVAEDFACDGFGMPLFLIWYFATFSLLLLAPPDSNSL
NK
>t2946 hypothetical protein
MKVNAWTILLMSAHLTACAVPGTEKYQTSMDSVTAEKISRIIQSDVIPYK
GENHGEVISRVSSAFLGTPYQADTLIGGPGIPEVLVANFNGVDCFTLADY
VEALARSDNQKSFLHNLARTRYAAGKVAYLSRRHFFSDWFAAAPRNARDV
TPDISPDYVVVDKQLNHKPDGGEYIPGLGIHPRKINYIPGRAINQQVMNH
LKNGDYIGVYSPLDGLDVSHVGIVVRHDEQVWFRNASSLAANRKVVDTPF
MEYMHSRLGIVVLRAE
>t4319 putative major capsid protein
MKKKTRFAFNAYLQQLARLNGVDVEELSSKFTVEPSVQQTLEDHIQQSAA
FLTLINITPVTEQSGQLLGLGVGSTIAGTTDTTTKEREPTDPTLMEDVEY
KCEQTNFDTVLTYAKLDLWAKFQDFQVRIRNAIVKRQALDRIMIGFNGVK
RAKTSNRAENPLLQDVNKGWLQKIREDAPDHVMGSKTAEDGTTTAEPVKV
GPGGKYVNLDAVVMDTVNELIDVEYQDDDELVVVCGRELLSDKYFPLVNK
EQDNSEKIAADLIISQKRMGGLQAVRAPFFPANALLITRLDNLSIYWQED
TRRRSVIDNPKRDRIENFESVNEAYVVEDYRCAALVENIEIGDFSAPAAP
ESGE
>t3727 hypothetical protein
MTTRITLWRELFSEQPRILLENDDFTVTAFRYASGVEGLKIQNSRGHLVI
LPWMGQMIWDVQFDGHDLTMRNMFRQPKPAAEVVATYGCFAFHSGLLANG
CPSPEDTHPLHGEMPCAAMDDAWLELEGDSLRVTGRYEYVMGFGHHYQAQ
PAVVMRKSSALFDIQMTVTNLASVAMPLQYMCHMNYAYVPNATFRQNIPD
TALKLRESVPAHVKPTAQWLAFNQRLLQGEASLATLNEPGFYDPEIVFFA
DELDKYTDTPEFSMIAPDGTTFVTRFASAELNYVTRWILYNGDQQVAAFA
LPATCRPEGFLAAQRNGTLLQLEPQQTRTFTVTTGIV
>t3044 possible membrane protein
MKYNEGTFKYMLKKIIGALASIIGGLLAFLCVYWLFRLDTWTERGVYIGI
SIFLALILIKILSFFNDE
>t3038 bacteriocin immunity protein
MELKNNLEDYTEDEFIEFLNNFFEPPEELTGDELSKFIDNLLRHFNKITQ
HPDGGDLIFYPSEEREDSPEGVIEELKRWRKSQRLPCFKENK
>t2936 hypothetical protein
MCPDNTHAKKQYLIPGNDIHYPGQTNHDACFIPVSVRQYAGEPLYIIVAH
WCLLQQNWVQRNQIAEAFHITARRASYLIAYLRSKTSRVVSICRHQTLPN
KARRYEIYVIRVLDSPTPSTRREKAGPPPVSKRRVGNGDRSMANELWNRL
CSNRNAGKILKKREDEDDRT
>t0722 hypothetical protein
MAAYLYLSAIRMLWHPLNIQNLNKAFPEYETNSLKNKGEITSATCIKENT
KGHLTYYSVSDPVFN
>t4169 hypothetical protein
MAAITTGVVLLRWQLLSAVLMFLASTLNIRFRKSDYIGLAVISSGLGVVS
ACWFATGLLGITMMDLAAIWHNIEAVMVETMSQTPPEWPMVLT
>t2573 hypothetical protein
MAVRLTFDGQKLTWPGIGIFKATTGLPDLQWPDKQCVPDAAIPEGNYKLF
IQFQGEAPIRNAADCDLGPSWGWSTIPRGQAAGTCEIYWANWGYNRIRLE
SADEKTRKACGGKRGGFYIHDSTKGYSHGCIEVEPVFFRILKQET
>t2664 putative phage tail protein
MSDKKTEKTIQLDTPIKRGKTEITEIVLRKPQSGALRGTRLQAIMDMDVN
AMMTVIPRISSPALTAQEIAEMDPADLTAMSVEVVTFLLKKSVLAGLPTA
>t2564 hypothetical protein
MEQRAFLIEIKKLIASITSKNMTVKGCSTEDILYLEENYGELPKSYKLFL
SLLGVESGDFKEGTDLLFKDINDINKYTVELMQENNISIPDGMYSFLLHQ
GYSALFFIERDDDPSVYCYTEGKEIKKTKYVFSEYVLAEIELYNRYQ
>t2389 hypothetical protein
MTEIQRLLSETIDDLNVREKRDNRPRFSISFIRKHPGLFIAMYAAWFATL
AVMLQSETLVGSVWLLVVLFIAFNGFFFFDIAPRYHYNDIDVLDLRVCYN
GEWYNTRFVPPTLIETILQSPQVDNEHKVQLQKMVARKGELSFYDIFTLA
RAEASR
>t3419 major capsid protein
MKKNTRFAFNAYLQQLARLNGVAVEELSSKFTVEPSVQQTLEDQIQQSAA
FLTLINVTPVTEQSGQLLGLGVGSTIAGTTDTTAKEREPVDPTLMVDVEY
KCEQTNFDTVLTYAKLDLWAKFQDFQVRIRDAIVKRQALDRIMIGFNGVK
RAKTSNRSENPLLQDVNKGWLQKIREDAPDHVMGSTTTGGETTPGAVKVG
KGGEYANLDAVVMDAVNELIDVVYQDDDDLVVICGRELLSDKYFPLVNKE
QENSEKLAADMIISQKRMGGLQAVRAPFFPPNALLITRLDNLSIYWQEGT
RRRSVIDNPKRDRIENFESVNEAYVVEDYRCAALVENIQIGDFSAAAAEA
GA
>t4254 hypothetical protein
MNATYNALIALMRSGEIPLETGASVSAPSAQFSVRIRPETRVFLDRCAEH
LGISRAAMFGMCIDGIVAEARESIADRTSTLYERFCLLLDAHGLNVIEQA
QLLASWGIRASVLASQDRTLDLLNKPLLQQLSTWFDVDVNWLLGNSSYPV
DMADGQRDWAVQTPSLLCRLQSIQSVDPVELIFFWQQGRKPQSVGLCLRY
RPLISGEPIGLLRVYQALDWQDEQVQQAYNQLRRITSITLGGYTKDKVEK
YPPIRLRYFTFSARQMQALDNGAVIPAMIFDKPKGEFPSIS
>t1802 hypothetical protein
MPTQEAKAHRVGEWASLRNTSPEIAEAIFEVAHYDEKLAEKIWEEGSDEV
LIKAFEKTDKDSLFWGEQVIERKNV
>t2650 hypothetical protein
MSDNTIPEYLQPALAQLEKARVAHLENARLMDETVTAIERAEQEKNALTQ
ADGNDADDWRTAFRAAGGVLSDELKQRHIERVARRELVQEYDNLAVVLNF
ERERLKGACDSTATTYRKAHHHLLSLYAEHELEHALNETCEALVRAMHLS
ILVQENPLANTTGHQGYVAPEKAVMQQVKSSLEQKIKQMQISLTGEPVLR
LTGLSAATLPHMDYEVAGTPAQRKVWQDKIDQQGAVLKARGLLS
>t0646 hypothetical protein
MCGFYLMKKYTFAARAFIFTLVLLFVISVFGLTISVLKGM
>t2048 hypothetical protein
MTTPFTYETLPADPKAAIRQMKQALRAQIGDVQAVFDRLSATIAARVAEI
NDLKAQGQPVWPIIPFSELATGNISDATRAEVKRRGCAVIKGHFPREQAL
AWDQSMLDYLDKNHFDEVYKGPGDNFFGTLSASRPEIYPVYWSQAQMQAR
QSEEMALAQSFLNRLWQVERDGKRWFNPDISIIYPDRIRRRPPGTTSKGL
GAHTDSGALERWLLPAYQQVFASVFNGNVEQYDPWNAAHRTEVEEYTVDN
TTKCSVFRTFQGWTALSDMLPGQGLLHVVPIPEAMAYILLRPLLDDVPED
ELCGVAPGRVLPISEQWHPLLMAALTSIPPLEAGDSVWWHCDVIHSVAPV
ENQQGWGNVMYIPAAPMCEKNLAYARKVKAALETGASPDDFPREDYETTW
EGRFTLRDLNIHGKRALGMDV
>t1878 putative bacteriophage protein
MEAVEIPLVADNQTFATTINGTVYHLSVIWRGEYWVLDLADSNGSAIISG
MPMITGADLLAQYRYMNLGFSLVVLCDVAGQENPTQFDLGTFSHLYVFTE
>t2510 hypothetical protein
MSHFLWGLSHAKKSEAWLAYRDRPVARPIAIDDYLSRGGKPAFCRAAAIC
SADSDVSMSTRRVAGSVLTVALASRVRIVFFTVLAQPPQVMSSIWNCIVN
SFCVGTVASLGLVSVGRSSVISGLTDALPVRGDEDGAIPLRYAPPSTGCA
AQTPG
>t3043 hypothetical protein
MKANFYQRLNNPLYVYYNDADLIKMIHIIKWNRKMIIALTWSFKPLVCVK
DRTKAA
>t1389 hypothetical protein
MITPQEARQRTRALVEHYVNECECRDLTDVKHVLTALISMATQAIVATNG
KAAALQVLVNTLTHTAENEVPYRMETTAEGGLHITVSRKH
>t2161 hypothetical protein
MTTYQEIEARLIRCFTITSWIACQIASYMDFKLFTHSNFFSEN
>t3831 hypothetical protein
MAKIGENVPLLIDKAVDFMASSQAFREYLNKTPPRDYVPSEVPSESTPIY
LQRLEYYRRLYRPKEERG
>t1382 hypothetical protein
MMQASTKYLSPALKPNVPTLAHLGKAKLFELMTEDDEELAELADGGTVAG
LTLDDVDRMSVRELRQALREARETNAAQQRVLADKNEKIDSLSTRLEKKS
RIQPPEPDEEVKKLRAEVTALAVEAESAIAVRLSSAFETLCAY
>t0085 probable secreted protein
MSNKKNLSAEETDLTRRKLLTSAGILAAGGMLSGAVKADEKCAVKAKPTW
DKPFTGEIPEKLPEGYNILLVVTD
>t1602 hypothetical periplasmic protein
MKKKLKVLTLALASISSVCYAAMADYDTYVSNVQINNLSYGVYTSGGKET
QFFCIGLKHGSEAISINAMCKVDVYGNHKQGFDNMLNTAKYYYTTGGDVR
IYYKENVWRDPDFKSAFSSRELIAITTCSSSSYCMGPTVTN
>t2036 hypothetical protein
MVVDRLRTDLLNKLINARIDLAAYLQLRKAKGYMSVSESDTLRDNFFELN
RELHDHALRQGLHLDQEEWNALRRAEGALAAAAVCLMSGHHDCPTFIAVN
ADKLENCLTTLTLSIQSLKAHSPLTQV
>t4507 hypothetical protein
MKRLLSAIVFPAMFISISNVYALDIQPGEWKMENIEMRTINPDTKEVLMD
EKNSGIATLMCYTPKMSEDSKKMVKGFSTSAGGCTTTFVESTDTKLINET
VCNNPDVKSHSIVETTKISDTEFAMTMKSDVDAGGNKTTSINKIKQTFVG
KTCSEASKGVKQ
>t0384 hypothetical protein
MKINNGPVLCPHCGCLSAYYEIDRLAAIREKVNKEGGSAAWDSTLQAHKK
KAFCLMCHKSIDEVVIGQSDAPESTK
>t3403 hypothetical protein
MRPNISITLTTPHVTIERYSELTGLSIDTINDMLADGRLICHRLRKDKKR
EKVMINIAAMTVDALSECNLNLN
>t4565 hypothetical protein
MVFSVINLSIILYTLPTSSGLCVGYGLPGPSMGLALTGRCSPQSHRYYAP
GDSLPCRLYATRIILAIYSIMMIIAPSQIITIINKSHHEINSIIRFLLPA
RRILRAQADPSAATDLTVTIPLKA
>t1474 putative secreted protein
MKILPLALFIIPFLAGCGANNTPPQTPIPGEKTSAKLRTLETGAAAIQSR
PPVDAISTYLDGFHFYSGDKNGQMEAHHYVTVLNEDVMQAVIYDGNTKNA
RLMGVEYIISERLFKTLPPEEKKLWHSHQYEVKSGSLVAPGLPQVADKAL
MSKIVNTYGKTWHTWHTDRDKTLPMGIPALMMGFTGDGQLDPALLADRDR
RLGIDTQAIKRERQDLPEHPVIKGANAWEQGEVIQLQRVQGSGEHGRGDT
AHFGISEQSRQ
>t4558 hypothetical protein
MALKGAFLMQVIYYCDNTARDKNIKYIYIVNMFYWNGCFRYDKSLKNPLM
>t4275 hypothetical protein
MTLRILIKTTSFLRFILTLVFMLVMLTCLSLALADLGFRYQSEIAQFRIW
MKETWLLWLLWRLMLYSVTGWGIWRCYHSHGSSQALRQSLLRIAVVSVGF
VAICEYSVLR
>t4378 hypothetical protein
MIQITVGILNQHMISFIRPMEGIIVGQKTFEITYILCVLK
>t2520 possible transmembrane regulator
MNESIFLLDKRVVFDSTKMTLSHGNEIIRISEADTHLLLAFWHGLYKKED
IIHFVWENRGGCVSESSYYKLINQMRNDFSSIGLQLSGIVTRPRVGVSLS
VAIEPIKKITSLKVSDENVKGPSTREKIFYKIKRHSVFVVLTGAILLALF
YGVFTIYKTPVRNSPDSFFTYLGEYNDYAIYKTKEDKVTLSEVVFAFNSL
KIKIYRQNGRHLYYIREPNMNIFLQCLNPVEMAVPKCITVKERY
>t2673 hypothetical protein
MRFRVHRRFPVDGKPFLQFAVRVMAGSDTIVVIRYMTMKVMLAPKYKAPG
NNYQALCRQKTIKHRFLVLQDLLENMATHS
>t3766 hypothetical protein
MTLLLNEQYHLCCYSEIRADLRGLGYHIEHVENKSQQPGRTFDYQNLAAS
ALDSENGLPQLGINTFGGHVQGKNKSVDMAQFIHCHLPDCSRYFAYLSNG
HVVPSIDLTEQEAEYAQYTIDHLNLNSGFLQTERRNHWEELEQLFEEHIE
KDWDLHQLLQLDLVPSPDHKLHEFFSITRQFFQQEAEQVLQSHAPALI
>t4209 hypothetical protein
MDNTYIGEQIRVSTRHTNNLKKWRDNDHISHILFNLSRDLPDGETQKLNN
KRLFLKQTNHNLLLSLTPPAHRQRVLPSEHHHRKKIIT
>t3929 hypothetical protein
MHGMTTDFVLTMGNADGSFRYKVVAVKPDKLVSAREPE
>t3998 hypothetical protein
MATSTTSTPHDAVFKQFLCHPDTARDFLEIHLPSTLRQICNLNTLRLESG
SFIEEDLRPHYSDILWSLETSEGDGYIYVVIEHQSTPDAHMAFRLMRYAM
AAMQRHLEAGHKTLPLVVPMLFYHGNRSPYPFSLCWLDEFADPVMARKLY
ATAFPLVDITVVPDDEIMRHRRVALLELIQKHIRQRDLMGLVEQLVALLV
KGYANDTQLQSLFNYMMHTGDAARFNTFIRQVAMRIPQHKEKIMTIAERL
RQEGHRNGLQKGLQQGKQEGQRLAALRIARSMLNDGFDRDTVLRVTGLAP
ADLASESH
>t2649 hypothetical protein
MCTDPGCGLVFKTLQTITRFIVRPVTPDELAESLHEKQELPPVRLKTQSY
SLRLE
>t4318 putative phage terminase
MSLSPARQHRLRIQAEQAAREGGSVRHASGYDLMLLQLAEDRRRLKGVQS
TVKKAEIKVELLPKYSAWAEGVLAAGGAQQDDVLMYVMLWRIDAGDYAGA
LEIGRHALRHGWVMPLGNRNVQTVLAEEMADAAQSALLAAAGFDADPLLQ
TLDLTTDLDMPDQSRARLHKAIGAVLSESNPASALNHLNHALQLDPRCGV
KKEKQQLERRLRNDSR
>t3426 putative regulatory protein
MNRLLLVVLALLLAALGWQTWRLADASQTISTQADELQSKSQALAKSNSQ
LISLSILTETNNREQARLYAEAEQTSAQLRQRQRRIEELKRENEDLRHWA
DTPLPADIIRLRERPALTGGAAYRQWLSASDAVSAGAGSTAH
>t1005 hypothetical protein
MTDNTPGFFNHLRHKLENLASSSILKEQGDLACINANF
>t1010 hypothetical protein
MSSRRHQLTVEYSWCCEQGDIHNTTVLTEFKCQTDDYDIRYLCKVILFDF
HADFASR
>t4572 hypothetical protein
MRLKVKENITLWRNEGLIAYVALGFLCTFFMEVNALLPYYLQQSIFFETL
MSYMAFNTLFSLALSEIFFAMLVVICHNTKLERLTNSILQELHKRIMQGS
FIISFLCFGIFLFCVMAFCIGSLTTNNNYYGKHVINFAYPFVLFLSFPYL
IHKGITILCTILKAFPKGKIHAAIIILLIIAAAIIIPGIFNKKEKMTITI
DKEKYHAIEQKTKIKPEKYIEKKINDININDIK
>t3896 hypothetical protein
MYNNEPGAQSDPTLGYTFQNDFLALSQAFSLPEIDYTDISQREQLAAAIK
RWPLLAEFAQPHSLRKP
>t0928 hypothetical protein
MHHAATFILWVTPLLFDQKTTTAIFSVAYGILRRLNICYRTFSLKHNEEN
K
>t0724 hypothetical protein
MGDFIKYLFIFPCLWSANSFAITQTQWDGNFRVEELGEQLNDRSQVFLQY
NLKIDSKNNRASLSMTTWHAGITCIGDYSLKINSGVLVLYYNGDEENACP
YPSPQFEISNKGKEYYIKGKMFSYSQPGEWLPLKRITLK
>t0910 hypothetical protein
MIKIIFISIIGVVCLAAARYSPVTVVNAHPSAIARCLADNLSPYGYILDL
HETDIPGISKEFTVYYRNGAYIAGTFWIANSMTIASELIVGMNGVSKQAY
PIFNKSLQQCATRLSIK
>t1905 hypothetical prophage protein
MAKLPRRKCANKECRQWFHPIREGQIVCSYQCASTVGKEQTRKAREAAQR
KAQSLQRAAEKKERAAWRQRKAAVKPLKHWIDLTQRAVNDICRETELAEG
LGCISCGTKTAFAWHAGHYRSTAAAGHLRFTRFNIHLQCDVCNVYKSGNI
EAYRAALVERYGEAAVLALENNNTPHRWTVEELKEIRLAALSDLRALKKL
EAA
>t3862 putative lipoprotein
MIMKYFCTVMIAIALVGCTATPPPTQKAQQSKVSPTRTLDMEALCKAQAA
QRYNTGAQKIAVTGFEQFQGSYEMRGNTFRKESFVCSFDADGQFLHLSMR
>t1311 hypothetical protein
MKCILNATGLPLQDLMLGASVYFPPFFKAFALGFVIWLFIHRLLRDRIYS
DEIWHPLLMDLSLFTLCVCLGLVLLIVW
>t1328 hypothetical protein
MTTTPPQRIGGWLLGPLAWLLVALLSASLALLLYVMALATPQTFKTLSGQ
ETGNLLLWGISFITAIAMWYYTLWLTIAFFKRRRCVPKHYIIWLLVSVLL
AVKAFAFSPVSDAFAVRQLLFPLLATALIVPYLKRSARVKTTFVNP
>t2653 hypothetical protein
MRNKKAPQTVSARHDAREHLSIEAYHKLNRASAVSQFVGGDLIHRELSGL
HQLYIPHIFSYLNEDIDFVLNELKAKGLCRDFLAQQKDRGDRTHV
>t4340 hypothetical protein
MAANEVKKEKAKVAFFFSSPTATDQNLMRADKFTYWQ
>t3486 hypothetical protein
METCVLMQLKNCIKCSIAHILRYKRQKCSNFKQESINFRHFTNRLTRIRR
GISTLRR
>t2643 large repetitive protein
MRLLAVVSKLTGVSTTLESSAVTLNAPSIVKLSVARDEISQLTRINQDLV
VRLHSGETITIKNFYVTNDLGASQLVLAENDGTLWWVENPQAGLHFEQIA
DINELLVTSGASHEAGGAVWPWVLAGAVAAGGIAAIASSGGGDSHHHSDG
DNPPPDNTNPDGNPPDNSNPGGSNPNGNTPGSSNPVDTTPPLAPGELLIS
ADGKTVSGEAEAGSLITIKDPSGNVVGEGKADSDGKFSIDLTAPQISGEQ
LTVTATDDAGNTGPSATIDAPNIPLPDTPVITAAIDDAAPLTGTLSNNQF
TNDNTPTLEGTGSAGTVIHIYANGQEIGSTTVDTSGNWHFAITSALADGE
NHFTAIATNVKGESSESARFTLTIDTLSPDAPRVELMADNTGLLTGPLQN
NDRTDEAKPLFSGQGEAGNTITIKEGSTVIGSATVDENGRWTFTPTTPLS
DGEHTFTVEQSDKAGNASRVTTTPTIIVDTTPPDAAIIDNVAKDGTTVSG
TAEAGSTVSIYDPAGNYLGSTITGENNHFSITLNPAQTHGERLEARIQDA
VGNIGPATEFTASDSQYPAQPTILTVTDDAGAVTGLLKNGDATDDNRPTL
SGTAEPGSTISINDNGFPVPSFPPIVADADGKWSFTPSLALADGDHVFTA
TATNDRGTSGQSVAFTIDIDTQPPVLEGLAVSDVGDRLTGTTEAGSTVVI
KDSLGNTLGSGTAGDDGTFSIGISPAKINGETLSISVTDKAANSGPVETL
NAPDKTAPAAPNGLIVATDGLSVSGQAEAGATVTIRDSSNTVLGSAVANG
NGQFIVPLNAAQTNGQALIATATDIANNESAAATVDAPDSTAPEMPKNVV
ISEDGASISDTAEPGSSITITTPDGTPLGSGKADGEGHFTLPLAPAQTNG
EQVTVTATDSANNVSPPTTAQAPDITAPDKPIITQVLDDVESFTGPLVNG
QTTNDNRPTLSGTAEAGARVEIFDNGVSLGLATLQPNGGWTFTPSQNLGE
GAHRLTVIATDAKGNASPAGNESPESISFTLRIDTQAPDAPQIVSAAITG
GEGEVLLANGSITNQRMPTLSGTGEPGAIITLYNNGVELATVQVNPQGSW
TYPLTRNLSEGLNILTATATDAAGNSSPTSGVFSVTLDTQPPAQPDAPLI
SDNVAPVIGNIGNNGATNDTTPTFSGTGEIGSTIILYNNGSEIGRTTVGD
NGSWNFTPAALTPETYTITVTETDIAGNISPPSASVTFTLDTTAPANPVI
TFAEDNVGEVQDTIVSGATTDDNTPVIHGTGDIGSVITLYNGSSVVGVVT
VDETGTWTLPVTSALPDGVYTLTAIAADAAGNSSGVSNSFTFTVDTVPLQ
PPVVNEILDDVAPVTGPLTDGAFTNDRTLTINGSGENGSTVTIYDNGVAI
GTALVTDGVWTFNTSELSEASHALTFSATDDAGNTTAQTQPITITVDITA
PPAPTIQTVADDGTRVAGLADPYATVEIHHADGTLVGSAVANGTGEFVVT
LSPAQTDGGTLTAIAIDRAGNNGPATNFPASDSGLPAVPAITAIEDDVGS
IQGNIAAGGATDDTMPTLRGTTDIGSTVEVFIDGDSAGFATVDASGNWIF
EIATPLSESTHYFTVQATNANGPGGLSAPVGITVDLSAPAQPVITSATDD
VPGMTGTLDNGALTNDSRPTLNGTGEAGATIRILDNGVEIGSATVDQSGN
WRFTPNTPLESNAHIFTAVATDPAGNSGQLSDGFTLNIDAQAPDVPVITS
VIDDNNQPTVPVLPGQSTDDRQPILNGTGEPGATITIFDNGTPLGTAQVG
ENGSWTFPVPRNLSEGSHNLTVSATDPAGNTSAVSAPWTIVVDITPPAIP
VLTSVVDDQPGITGNLVSGQLTNDATPTLNGRGEAGATINVYLDGNPASI
GTTTVNSDGTWSFTPQTPLANGSHTFTLSATDPAGNSSAVSSGFVLTIDT
TPPAAPVIASVADNTAPVTGIVPNGGSTNETRPTLSGTGEAGTTISIYNG
SALVGTAQVQANGSWSFTPSTSLGAGVWNLTATATDAAGNTSAASEIRSF
TIDTTAPAAPVIDTVYDGTGPITGNLSSGQITDEARPVISGTREANTTIR
LYDNGTLLAEIPADNSSSWRYTPDASLATGNHVITVIAVDAAGNASPVSD
SVNFVVDTTPPLTPVITSVSDDQAPGLGTIANGQNTNDPTPTFSGTAEAG
ATITLYENGTVIGTTTAQPDGAWSVSTSTLASGTHVITAVATDAAGNSSP
NSTAFTLTVDTTAPQTPILTSVVDDVAGGVTGNLANGQITNDNRPTLNGT
AEAGSVVSIYDGDTLLGVTSANASGAWSFTPTTGLNDGTRTLTVTATDPA
GNVSPATSGFTIVVDTLAPTVPLITSIVDDVPNNTGAIGNGQSTNDTQPT
LNGTAEANSAVSIFDNGALVATVNANASGNWSWTPTASLGQGSHAYSVSA
ADAAGNVSAASPSTTIIVDTIAPGAPGNLVINATGNRVTGTAEAGSTVTI
TSETGVVLGTATADGTGSFTATLTPAQTNGQPLLAFAQDKAGNTGIAAGF
TAPDTRVPEAPIITNVVDDVGIYTGAIANGPVTNDAQPTLNGTAQAGATV
SIYNNGALLGTTTANASGNWSFTPTGNLTEGSHAFTATATNANGTGSVST
AATVIVDTLAPGTPSGTLSADGGSLSGQAEANSTVTVTLAGGVTLTTTAG
SNGAWSLTLPTKQIEGQLINVTATDAAGNASGTLGITAPILPLAARDNIT
SLDLTSTAVTSTQNYSDYGLLLVGALGNVASVLGNDTAQVEFTIAEGGTG
DVTIDAAATGIVLSLLSTQEIVVQRYDTSLGTWTTIVNTAVGDFANLLTL
TGSGVTLNLNGLGEGQYRVLTYNTSLLATGSYTSLDVDVHQTSAGIISGP
TISTGNVMADDTAPTGTTVTAITNANGVSTPVGAGGVDILGQYGTLHINQ
DGSYTYTLTKPTAGYGHKESFTYTITQNGVGSSAAQLVINLGPAPVPGSV
IATDNNASLVFDTHVSYVNNGPSTQSGVTVLSVGLGNVLNANLLDDMTNP
IIFNVEEGATRTMTLQGTVGGVSLVSTFDLYVYRFNDAIQQYEQFRVQKG
WINTLLLAGQSQPLTLTLPGGEYLFVLNTASGISVLTGYTLAISQDHTYA
VDSITANTTGNVLTNDVAPTDALLTEVNGVAIAATGTTEVNGLYGSLIID
ARGNYTYTLKNGVGADSIKTPDSFIYTLKAPNGDTDTASLNITPTARALD
AINDVSDTLSVATLQDTAAWLDSSVGSASWGLLGKSGSGSGTFDVATGTV
LKGASLVFDVSTLITLGNLNISWAIQENGTVIRNGTVPVANITLGSATVT
VNLSGLELDAGTYTLNFTGTNTLAGAATITPRVIGTTVDLDNFETSGTHT
VLGNIFDGSDAAGAMDQLNTVNTRLSISGYNGSAATLDAAANTTSATIQG
HYGTLQINLDGAYTYTLNNGVAMSSITSKEVFTYQLDDKIGHTDSATLTI
DMAPQIVSTNQNDVLIGSAYGDTLIYHLLNGADATGGNGADRWQNFSTAQ
GDKIDIHELLTGWDHQAATLGNFVQVHTSGANTVISVDRDGAGSAFKSTD
LVTLENVQLTLNDLLQNNHLITGG
>t1888 putative bacteriophage protein
MAKEKLVTIHVHTPFTLTLGDQSKQEFGRGRHNVPEEVASHWFTRAHAEL
SESGSNETDDQQPVIDSLQAQIADKDKLIADLKDALLKLQEQNDSLQAQI
TSARTGGNGAKDAKESKPANSK
>t3281 hypothetical protein
MSLFPVIVVFGLSFPPIFFELLLSLAIFWLVRRMLVPTGIYDFVWHPALF
NTALYCCLFYLISRLFV
>t0645 hypothetical protein
MASSRVVKHMSPVGCKPEHTFGRKGEPKIRHKLSSESCTHLVFIGLPFGI
PGIGDEIEGAIQQAPQPLRQSMLSPSIYNQMKDNRICPKKNPDLKCGRHH
DQACPQTFRRQGWKQKGQHSTLPPSRRHAIAYSR
>t0489 hypothetical protein
MSWDKRVAVNYAKTHAGSHSQGRCAEFTRKAIQAGGITLGHTYHAKDYGP
MLRSAGFTAIGTYEMPREGDVIIIQPYAGGNPSGHMAIYDGAEWYSDFKQ
RDMWAGPGYRAARPSYTIYRKN
>t0577 hypothetical protein
MNNNKIRSMPDLIYLPLTIIFLQIIIHDIIFVTFTYQETEQNEYMCKFTL
PIEHTAISQFIFRRGER
>t1877 putative bacteriophage protein
MSKNWMRHFELLLVDDKGDGIKISELKVTFNIQKMPATIFNGFVGNFKVY
NLSPTTQNRIMQKEFSRIQVIAGYKGQPDATGNYPDENVGMIFNGDIRFT
VTGKDNATDSWIMLQCIDSWEGHLNASVKTTVAAGWKYSDLFSLGMKSFG
PYGIESGAVPDMPETVFPRGRVVYQSTARLMNHIAGQCNAYWWYENNQVN
IVPEDNYIGVATVLNANTGLIGRPQQTMGAGVNVRCLINPNIKLGGLIRL
DQASVYRTSLSNDQIAQSPARLDESESDGNLYVNGLPGMSQPASINTDGD
YIVGSIDYTGDTRGQAWYMDLLCLAKDGKELLNSKGIDAAKYT
>t0728 hypothetical protein
MTFIFDVNKEYHAGANLTDKFLCLETYSGLGRYSSDPDYPCQLLSIDSDD
VCIGHELLQALKNSRTYTPEESEEYLSLEKTQVEYDEWVTVLMAKYNYRT
RRALFKNMKYCSIICVNNIIKIQPTRHTKLEGWSWAGHDKDVIRLPVTSE
PEKIGSALRQAFECCD
>t2451 hypothetical protein
MRRRVAPTSHYFRRCHQPKNRPVAIPTSVGAARSCQIVMLNSVSVILPPL
LQWQRFAAILNLSGVKLAPVMLPVNTEQCAGVFSFQRQGHRGSSPRNIAP
HRVAPAYR
>t0422 hypothetical protein
MKSYRLVVRQQGRIVGHFETSGLDALEDICVARAMFGITGGYQCELQVSD
SERRILESGPDGMKILMREKCFRPVTSQL
>t0046 hypothetical protein
MQHSLRSDGAGFYQLACCEYSLSMRKIALSGGFWGGVCRMAMKSIFFMFH
QGNRRLTLTAVQGILLRFSLF
>t1203 hypothetical protein
MTYQQAGRIAILKRVVGWVIFIPALLSTLISVLKFMYAHSEKQEGINAVM
LDFTHVMIDMMRVNTPFLNVFWYNSPTPNFQGSLNIGFWLIFILIFVGLA
MQDSGARMSRQSRFLREGVEDQLILEKAKGAEGLTREQIESRIVVPHHTI
FLQFFPLYILPVIIIVLGYFFFSLLGFM
>t3421 putative capsid completion protein
MAKGTATSKPRPPPFISGESSMKFVAPEQAPEQAEIIRNTPFWPDVDLSE
FRSMMRTDGTVTQPRLKQVALSAISEVNAELYEFRRRQQMLGYASLAEVP
AEQLDGKSERIQHYFNAVYCWARAMLNERYQDYDATASGVKRGEELAEAS
GDLWRDARWAISRVQDAPHCTVELI
>t3072 hypothetical protein
MNNHFGKGLMAGLHAPYAYSAHHAVNFCSEYKRGFVLGFTHRMFEKTGDR
QLSAWEAGILTRRYGLDKEMVMDFFKENHSGMAVRFFMAGYRLEG
>t1985 hypothetical protein
MYPPNQGGMRTTDRGSTTGAASWTDRARSERAAPKGRAKRVNPPLTAIFK
KELVRKYGLLYLNFLFLL
>t0039 hypothetical protein
MYCRQCNKTFISYTAIRSDARQENLATLIGEGASLVEIRAALAIDSTGFS
RELQKLSRRANQAERDFVFPAFDIAMSTRAFRVKFNGGDSSLYVLVTAEE
ESGKVVAISTNYSAQPVEADYQYHSDYEERLPSGTLAHLVQRKEALTMRR
NVLFDVDYGPAILYKNDPGMLVKPVLPAYRHFELVQALTDERSLNVQHYL
DHECFILGGCMMANFSYLRQGRCHISFVRERGVTPPKRDLPPRLFLSGGI
RNNVWRTFSTRDYAMAVCNLTGNKKVSLLRHATLNSATAFIRYVHNHPFL
PHLNRMSPGNVVAVLDYLKFEYDASCN
>t3744 putative secreted protein
MKRKTLLLIATLVALPGVTYADSPFSSLQSAHEKNTILKDLRKMCTPKGA
LTDEAWEKKIMASEGNQQHIREAMIAIERNNQHNYWQALGKVECPEM
>t1122 hypothetical protein
MFLANHIDCGMVHREAPGTAIFYKMKFTPYGSELCFYIRWASIMQVNLKD
TSNNRNQ
>t4272 hypothetical protein
MKRILSSLILISSFSASAQIVVYTDQGHPPVGVTPAIRVVYLDASERWLQ
QQFGELSSDPEQAVRQAHSLLNSPDWKRRQKALIEHYRGMIEAWQLGLLK
YPAVVFDNRDVVYGTVDISRAVALRRGENQ
>t4218 hypothetical protein
MGRLEAHATIKLTIDTQPSAEPTLGSTVSSLDNTVSLKGVVEAHQDIQVK
VTVDDTDYTASVDNDTGNWHVSVPQSAILSGENQYTITATDPAGNSAAAS
VTFTGTSPTENPVNKSSGVDEMIMPDN
>t1126 putative lipoprotein
MKKWLIGGILIASFLTGCLMWHNIDKWFNKDIEVFYAGDDN
>t3080 hypothetical protein
MAFLMRKRKEKAVKVRQYVNSNENDYQFDVVLILLCSDFVICVLEIQSG
>t1871 putative bacteriophage protein
MRYRREDTEGDYTFGSGDDTWLINSPEAVAQAVKTRFALWYGQWFLDKTE
GTPWIQSVLGKQKPETYNLAIRKRILETRGVKSILSFNTTVNTTTRRVQF
FAEIDTIYGTTTVTSEA
>t4229 hypothetical protein
MHVLSGMARIRRQKTMKSLKKPRSHYQWVGATVVTTQELSSGLAVIPVGS
RGVVNAAKRGLSVIFDACPCCGVQLRLTRIRPEMLDIVAYPEIEEVTRVG
E
>t2297 hypothetical secreted protein
MLKVTTLIASLLAAPLAFSASAQPLTDVEYISVSAVSATPSMLEDAIARL
AQSKQASSWKITSMRIDNTGYATAILYK
>t1368 hypothetical protein
MSDYLKGKRVVDPVLTSIARGYKNAAFIGERIFPVVLTDKEGVRVPTFGK
TAFVEYDTERAVGADSNVLVREKTGTLDLVLGEHDLAAPVDYREQAESMF
NEESKAIRRATNGVNLRRELIAARLAQDEKVYRTGHVKKLTAGDRWAGGK
GDPIGVIEAGMEAVRTATGLRPNLMTMGAGVMALLKFHPAIQAAIGANER
KRITTEILQDLFQIEEIVIGAPVSLPSMKAAMDKDSVPTDIWGDNLMLHY
VGKPQPGADSADENEPSFGYTLRRKGMPVADKYDGAGGKVKYCRYTDIYK
VAVVGGDAGYLITGISK
>t1566 hypothetical protein
MIMMKLKSAKGKKFLLCLLAVFIVAASVVTRATIGGVIEQYHIPLSEWTS
SMYAIQSAMIFVYSLVFTILLAIPLGIYFLGGDE
>t1880 putative bacteriophage protein
MNAETIKDFLVSLGFGIDEAGYEKFESVLAGVTANAIKTGLAVEGAALSV
VAFTAKIASGLDNLYWASQRTGATVQGIQSIGYAVSQVGGSVDAARTSLE
SLSRFVRNNPGAEGFLNRLGVQTRDASGNMRDMAAIFTGVGQELSSMPYY
RANQYAQMLGIDENTLMAMRRGLGGFSGQYSAMAKAIGFNADEAAKSSNR
FMTSLREFGAMAGLARDKIGSNLAGGLAGSLDTLRRHILDNFPRIEQTLT
KAIKGILTLGDIIGRVAFRIVEGVGDIIDWWGKLDKETKTLIEVIGGLVV
AMRILNSTFWMSPVGLITGLIVAFGLLWEDYKTWKEGGNSLIDWEKWQPA
IDKAKDAMVWLRDHLLELKDSIGGWKTSLELLATFIAGAWISKVTGAFAR
LAGIPMPPWLKGWMAYAAYLYDDRENIVASAQSSIDYAKQNIGDGMRALG
IDTDFGRNPHTVKGANIQPDIPGAEPVQHAQSAKRTLADRNNNPGNIRPV
GGNGFRFFESALQGWEAMKNQLMRYFTGKTTGRALQTIQDIVSTWAPAGD
NNDPKKYAQDVAKWMGVSPNTVLNLANPETMAALMQSMARKEGYSNWNSP
LAYQAAGGSLNQQTVINVHGVNNPQEAANLIADKQGAVNARAVQQLKGPA
>t1018 hypothetical protein
MIKHDAIFNQKHMFRSTRTLPRNLQTYTEQASTEELHTDLNNMQSTGLYS
IQQMVLQL
>t2947 hypothetical protein
MSLQELFELVLCVHLVCQMLYRE
>t2331 hypothetical protein
MAQANAAVIEQIRRARPHWLDVKPASSLISVLNQGKTLLHAGPPMRWQEM
TGPMKGACIGACLFEGWAKDEMSALALLEQGKVNFIPCHHVNAVGPMGGI
TSASMPMLVVENITDGNRAYCNLNEGIGKVMRFGAYGEDVQQRLRWMRDV
LMPVLSAALGRLERGLDLTAMMAQGITMGDEFHQRNIASSALLMRTLAPH
IARLQHDKQQIAEVMDFLSVTDQFFLNLAMAYCKAAMDAGAQIRAGSIVT
AMTRNGDMFGIRVSGLGDRWFTAPVNTPQGLFFTGFSQDQANPDMGDSAI
TETFGIGGAAMIAAPGVTRFVGAGGMEAAKSVSEEMAEIYLERNMQLQIP
GWDFQGACLGLDIRRVVETGITPLINTGIAHKEAGIGQIGAGTVRAPLAC
FEQALEALAESMGVS
>t3689 hypothetical protein
MKKRSLLLLALPAMAFSTWAATSPPANLSPVYSYQDDVPSAATPPVATTP
VAKPASTSVLPFLGDEARKRGYELPEPFGLNINYMNIGQNINVDSINFNG
LALGPNGGIPLDNAFKINVGHTREKSKTETVKLDAWLLPFMNVYGLVGYT
DGHSVSQIGVGLMTKNGHVFHPADLQNLKFKLDFKGTTYGIGTTLVGGVG
NWFTAVDANYTQTQFDILDGSIDAFTVSPRVGYRFTTPGIDKMHLPSGKL
NVWVGSMYQDVQQEFKGSLDDLTMPSATLQRLVSMANYNNNGRFDVKQHL
QSPWNMLLGAQYEITRHFNVTTEFGFAERNSFFVAGEYRF
>t2159 hypothetical protein
MQPKSRTGQGVREQCLDGARRHFSYRHGMAPGPRKGADMLFPQHEAAVAG
AVPGRQWPEENNLMTQEYAEMSTGRGGTSKCRYSMGAKIFLLLLAVSTVA
ALFSLYQVLTY
>t4531 hypothetical secreted protein
MGAQKGAHEHVFFFISFIFNLLCCVLSPAFAPYKAFFKSTNVYFNPQKQ
>t3092 hypothetical protein
MLGLSLLLALLFSHRLRQPHHLWAGCYVVVLLLLLAHMGDILDRHHRRDA
YQAQQVAEETLLRKIDTTDDRVFLNHLMSQAMQPQNAGDWGTNRRIEHLA
KRISPFDIAGGTEKIWLVLAIDRLNRSAVGTFASWFIGDSVQAKQYRHQL
LQNNPLLDLLNRVFNDSTADEQTFLQQQLLARDICTSLISVVPELLTDEL
YAQAVAFDNSNKPEPFSWQFEFDVFYHQKK
>t3568 hypothetical protein
MDSVMRKSLFLLLPLVVTNAHAVYVDVRHEYLDDSKANYDRAYISHRFAN
GVGFAIEAISKSGGDDTNKAFNDLETQGNEYTISYQFKTGDVAWQPGFVL
ETGNGYSTYKPYFRATWTLNESWWVGARYRFEYVRRSSDIRDDDTINRMD
VWAGYKWNNFDWTIEGIYKKADKYDLYDGGKDNYEYNFRTAYIIDQWSPF
VEVGNVSVNSNSDERQTRFRVGIGYTF
>t4166 large repetitive protein
MGNKSIQKFFADQNSVIDLSSLGNAKGAKVSLSGPDMNITTPHGSVIIVN
GALYSSIKGNNLAVKFKDKTITGAKILGSVDLKDIQLERIDSSLVDSAQV
EKKGNGKRRNKKEEEELKKQLDEAENAKKEADKAKEEAEKAKEAAEKTLN
EAFEVQNSSKQIEEMLQNFLADNVAKDNLAQQSDASQQNTQAKATQASKQ
NDAEKVLPQPINKNTSTGKSNSSKNEENKLDAESVKEPLKVTLALAAESN
SGSKDDSITNFTKPQFVGSTAPNATVIIKINGIAVGQAVADSLGNFTFTA
PETLTDGTYNLEAEAKTADGSGSAKLVITIDSVTDKPTFELSPESSVSGH
KGLTPTLTPSIVGTAEENAKVDIYVDNKLVASVDVDKDGNWSYEFKDNEL
SEGENSIKVVAVDKAGNKNETTDSIITDTIPPEKPTIELDDSSDSGIKND
NITNSTLPTFIGVAEPGSTVSIYLGLKHLGEVIVAKDGTWSYTLTTPLKD
GEYNITATATDIAGHTSATANLPFTIDTRISYFSAEIETTDDSGIVGDNV
TNNTRPTFTGKTEPNAIISVINSETGEEVIFKANDKGEWTFNFTSDSVEG
VNNLTFTVEDVAGNKKDFSFSYVIDTVAPVPPTVSLEDFVVLPNGIILSG
NDLPALVGTAEPKSTILLMRDGKLYDSIEVDSNGTWNYQFSNKFLQGAYD
IEIISQDAAGNKSSTVKYSFTIQTEVVPPKAELDASDDSGAKGDWITNKH
NALTLLGTADRFATINILIDGKTIGVTTADADGNWNFDISRNLSDNVYKI
TVESIDPLGRTSSVDYQLTIDSFTPIPTVMLHDSAGSGVKGDMITKINTP
LFTGMAEANAKVSIYVDGVLSGEAIAGDDGVWNFQFTTALSDGSHDVTVK
VEDIAGNTASSSAYNFQIVTQTQKPTIELVNDTGVDNTDHIINEKNPALT
GTAAPYSTVKLYVDGALIAEVRTNKDSRWEYTLKADQGLVDGDHRITASV
EDIAGNIAHSDPFLISVDTAISIPIVSLSPDSDSGIADDNLTNIVNPTLH
LKDIDPDIISVQVWDAASDTQIGVATQQPDGSWTYTFTSDLTEGLHQVYV
KVEDIAGNKANSAVFDFTIDTTVSTPVISLLSKDDTGVTGDNLTNINKPG
FAISGVDADAHRVVVQVMHNGVSEEIELSHLNGSWLFTPGNTWADGSYTL
TVKVEDKAGNTSYSAPLTVVIDTQIAIDGVELVNDSGVKGDNMTNDDRPH
FRVTVPTDVNEVRLSIDGGNSWVQATPGVAGSWEYIWPTDLADGQYTLTV
EATDKAGNTVTKTIDFAVDTTLSVPVIVLNSADDTGVQGDNMTNRTQPTF
ALQHIDDDAVRVTVSVEHGGVTTTFDATKGTGGWTFTPPTSWADGDYTLS
VSVEDKAGNTSHSASLTVTVDTQIAINNIELVNDSGIPNDNLTNNVRPHF
QVTVPTDVNVVRLSIDGGKTWFNATQSATPGVWDYIWPDDVADGGYTLTV
EATDEAGNKATQTLDFTIDTTLSVPTLSLDSADDSGIAGDNITSVKTPGF
TLNNIDTDVSRVIVEVMHNGIKQEVPLVQTGGQWRFAPTSDWADGGYILT
VKVEDRAGNVKQSAPLTVTVDTHIAIDRIELVNDSSIPDDNLTNEARPHF
QVTVPADVNGVRLSIDGGKTWFDATQSATSGVWDYTWLTNVANGPHTLMV
EATDKAGNKTTQKLDFIIDTLLSEPTITLDSADDSAAGDNITNVKMPGFT
LGNIDADVTKVVVTVAHDGKNQQIELIKNGGVWRFTPGAAWTDGDYTLTV
KVEDKAGNTNYSAPLTVTIDTQTSIDRIGLLNDTGIVGDNLTNEARPQFH
ITVPTDVNSVQLSLDGGINWVNATLTSDGVWEYIWPTDLVENTYTLTVKA
TDVAGNTATETLNFIIDTTLSTPTITLDSADDSGTANDNKTNVKTPGFII
GGIDSDVTQVVVQVMRDGHSEEVELTQTNGQWRFVPGSAWTDGDYTLTVT
VKDEAGNIRHSAPLTVTIDTQITIDHIELVNDSGIPDDNLTNNVRPHFQV
TVPTDVNVVRLSIDGGKTWFNATQSATPGVWDYTWLADVGEGKHTLTVEA
TDKAGNKTTQQLDFIIDTLLSEPTIVLDSTDDSGTKGDNLTNVNKPTFLL
GNIDADARYVTVEVQHGGTKEVLTATKDATGNWSVTPIGTWADGDYTLTV
RVEDEAGNEKHSASLTVTVDTQITIDVIELVNDNGIPGDNMTNDAHPQFR
VTVPGDVNEVSLSIDGGVTWVKAMQSATPGVWNYTWPKTVADGDYTLTVK
ATDNAGNTVTRTLDFTIDTTLSTPVIVLDSADDSGVHGDNMTNRTQPTFA
LQHIDDDAVRVTVSVEHGGVTTTFDATKDAGGWTFTPTGAWADGDYTLSV
SVEDKAGNTSHSASLTVTVDTQIAINNIELVNDSGIPNDNLTNNVRPHFQ
VTVPTDVNVVRLSIDGGKTWFNATQSATPGVWDYTWLADVGEGKHTLTVE
ATDKAGNKTTQQLDFIIDTLLSEPTIVLDNTDDSGTKGDNLTNVNKPTFL
LGNIDADARYVTVEVQHGGTKEVLTATKGATGIWSVTPTGTWADGDYTLT
VRVEDDAGNVKYSAPLTVTVDTQITIDVIELVNDNGIPGDNLTNDVRPHF
RVTVPGDVNEVRLSIDGGNTWVRATQGTAGIWDYTWPKDVTDGLHTLTVE
ATDKAGNKTTQTLDFTIDTRLSTPTIAMDSRDDTGAIGDHITSVKRPGFT
IGNIDADAHSVILRITQGGNSQEVTLTQVGGQWRFTPDADWADGSYTLTV
EVTDNAGNVRQSTPLVVTVDTQTSITDITLVNDHGVPDDNLTNSTRPQFE
ITVPADVNSVQLSIDGGANWVSATQGIEGVWGYTWPTDMGDGKHTLTVMV
TDRAGNTATQTLEFFIDTRLSTPTIALDSTDDTGTPGDDMTNRTRPTFIL
QNIDSDVINVTVSVTHNGTTTSFTATQGAGGWSFTPPAPWGDGDYTLTVT
VEDRAGNTRPSTPLTVTVDTQIAIDHIELVNDSGVPGDNVTKHVRPQFQI
SVPDDVEKVLLSIDGGTTWVTAIKSSTVGIWDYTWPTDMPEGQHTLIVEV
TDGAGNKMTGTLDFTIDITLLTPTIELAPDQDTGQNKNDNLTSVTQPVFV
LGSIDKDVRHVELSIEHNGTFKTVVLTESADGWRYRPDSALADGSYTFTV
TVTDVAGNQQTSAPLKVTIDGTLTTPVIELAAGEDSGTVGDRLTNHDRPV
FDIRQIDSDVTRVMVKVTYNGKTHEEAAVFTNG
>t3151 hypothetical protein
MSSKGEREKRKALLLSQIQQQRLDLSASRRDWLETTGAYDRGWNTVLSLR
SWALVGSSVMAIWTIRHPNMLVRWAKRGLGIWSAWRLVKTTLRQQQLRG
>t3420 terminase, endonuclease subunit
MSLSPARQHRLRVQAEQAAREGGSVRHASGYDLMLLQLAEDRRRLKGVQS
TVKKAEIKVELLPKYAAWAEGVLAAGGAQQDDVLMYVMLWRIDAGDYAGA
LEIGRHALRHGWVMPLGNRNVQTVLAEEMADAAQSAMLAATGFDADLLLQ
TLELTDGLDMPDQSRARLHKAIGAVLSESNPASALNHLNHALQLDPRCGV
KKDKQQLERRLRNDSR
>t1394 hypothetical protein
MITLSGNSRKLKACRISARYLFARAFFKNVRPGITIGVIAGREQVEKYMS
GAWWNKDPVIAARNIHISWGDIQNDD
>t2579 hypothetical protein
MKPLFSVLKSNHNSSSFESPDFVDSKDFYAGIGYDQGKLGAQFENTCAAR
MSVALIKSGVKFKGRLLPIKEGKWKGRSIETGAKNLADILSQATVFGKPA
VWRDPTKFQTELGNKKGVVFFWKIDGYNGGSGSHIDLIEPTSAGAVCHSH
CYFSCKQIWFWELR
>t1826 hypothetical protein
MTLQTNISRETKAAKTQEVYKHVVLFKMAAPSYIRR
>t3034 hypothetical protein
MVTKKFSTQTEYSCYIQIHRKKKEKLYLITNFKKISSALRKPIHTFPDGI
MPSNRRENKLINILNSHKKDSEFRFLLKELISYIGVTVSTMFLSLINS
>t1898 putative secreted protein
MNLLSALLKRYWLQLAFILLMAGAFIAGNVWSDRGWQKKWADRDSAESSQ
EVNAQTAARIIEQGRIIARDEAVKDAQAQAAKSAATAAGLSATVSQLRTE
AKKLATRLDAAKHTANLAAAVRSKTAGADATVLADMFGSLAEEAQYYAER
SDESYRAGMTCERIYNSVRESTNNPIAPH
>t0869 hypothetical protein
MTTQKEMEEIRAALSWFDVRRYDGLKDLTLEQIYAELERRMLAYKTRQQW
ETAGKDNQMQAIYHDATIRCGDVILKSDWISDNHRLSHSYAVRPMTRVQL
FNYGRAMYRLETSEQTDPVAVSSDYISEYLKQGGMNPANKMLIEIDLEEA
SSDDLAEHLKVLIALWQKHLKTPKPPKRKFRFGHKTFERILDYKVIPLMD
LISWEQLYNEGKNIPFTILADILHGNNGVRSSENIKDTDYDYAKSYLDND
EYFKVLNDFYIKNNMLKDWGIVDVITLNDKSSDKNKK
>t1915 conserved hypothetical bacteriophage protein
MSNIDKRALREVAERATPGNWRRTSSLFNGITVTPFSLCGEEVTLAHTVE
KRDAEFIAAANPATMLALLDELETKEEQRANWFRMAQKLGEDLDTAERLI
AELDQRLIEYAGIATREARRVAELEARKVNLSKLSVGEVMHMTGFSRDYA
EGWCAGNDNAIHEIRTAGIKVKES
>t2575 probable secreted protein
MPRKIMLVFFLFISEFCYAQAVVSEFNLSDINRGGMTKAQAEKLLIIALK
YQKYDLSLDGVFVDGDLQDKHGNPPHPGYYDFSLGYDTPTAGAIDYWGLF
SVSSQTGDIWEINKCERIIFPQLQKIQQEIMKKTGATFASEVVQRRGLGC
TDE
>t1548 putative lipoprotein
MKMSIAMLSALASFIVVGCTPRIEVAAPEQPITINMNVKIEHEIHIKVDK
DVEELLKSRSDLF
>t3521 hypothetical protein
MKNLFIAWLSLLLYGCTTFPQAVKPTTGGQPSPTEIYIVSHGWHTGIIAP
AHDVNTVLPQLKKRFAQETQWYEIGWGDKGFYQSQEITTALTLQAMFWSS
GAVMHIVAFSGQPERYFAGSEIKSLLLHTNQRNSLMRYLGRSFARDAEGN
LIPLKQGIYGDSQFYAANGRYGILNTCNKWTAKGLESAGLTINPSLKLTA
GSVMKAITDHRMCLNKYCYP
>t3619 hypothetical protein
MSLPQKSKVMHIARTLTGSTTQDEGQERQEAKTQDCQEDKRPETLVKGNG
NNMECSRLREKQGVLIANRDGGTR
>t0052 hypothetical protein
MGKIKYRLPTLLAIGFWMDAASATVLELPAWERNYTGTIAGKPVNVNITR
FGNTLYGHYCYEPCNQHKAVIVLRGSLQDKEVHLEERVKALSGYWNAEIS
SSEIKGEWTSADKKRHFPVALIYYKPKNSPDIVLVTNTNDAGGYDPSKEI
DCGNTPAISAIKLYRDGKLIQTLDTASVGTCSPFMPQWGDVNFDGYPDLS
IVTELLAGPDAPVQTWLYDPAKQRYVDAPASYQEITSPEIDAEHKQIVSY
WRGGCCSHGVNVYRWKGKTIELIDRGESYFQPVISKGKMYNCYMTPSYAD
GRIIYPLVRKNGHLTPPLSLDGTCQSFWLTGNVRTVIQAEKPGAEPESLE
IQWQENKASPGRFCPLVPFVEGDKLSPRLVTDDDVPDTCISRAEYEDIKQ
>t0036 hypothetical protein
MKNKKYINSLLAACVLFSCFNGQAAELKRVYGKLSFGYGDWNKGFVNVDR
GEVWKAVADFGAVFDRGEFASFYEMNVLNHPVEGRNHVTQFLGHYRVVEG
SNFTAMMKLYMSMENKFGDELNMMYGVGYLGLTSPSGFIKPYFAVHNLSN
DYTSKKYGQATGFNGYVLGWVAAYNFDMFNEKFVISNWNEVEMDRNDAYA
EQQGGTTGLNGAVTFTWKFMPRWTASVSYRYFNNKLGYNGYGDRMNYLIG
FKF
>t1040 hypothetical protein
MCGIFSKEVLSKHVDVEYRFSAEPYISASSSNVSVLSMLCLRAKKTL
>t2580 hypothetical protein
MIKKSTYDVSHHSAICGVTGDYYRISATYHIKRSIRVFLIILCCLLPGGV
FAGSLINAGLISPDNVNLSIRDFLGFYASDNLQEKDNTLMYVLGVADATE
GKTWCGYGQVDSITINHTVLAWLDRYSVKKPDARASVLIEEALVKNFPCQ
GTEPSVKIASRSSPVLSLTPDALNLSGNDFFKFWVSGNQLDKLRAGIYLL
GVEDATEKKLWCGYDLFKTLTLNEIVYVFLKIKHIKNLILARLSLLWIN
>t0860 hypothetical protein
MSVMAGASFSQRTARPPIKPPKTMDAKKPGKPYQVNRVKISSYFLNVSGL
AGISRN
>t4538 probable regulatory protein
MGESIITNIISIIRERQSTDNAPVKIRDIADAAGLSIYQVRSYLEQLRAV
GVLEKVNAGKGAPGLWRLL
>t1156 hypothetical protein
MFQLKPGALAIVIGAKTPAGRRNIGKSVELFCLCQPGDEFINPVNGHVTL
LPKEAPRALWLVTGDVASADGQHGFAWVRAEHLMPLSPDRQPGQAAARQS
QLS
>t1211 hypothetical protein
MDTQLTVNKFIFSLYVILRNKYNPLFTSILVRLNLFHQFRVITGNSGYIC
FIPFYIDPFRAVRCTYLI
>t3401 hypothetical protein
MLEIIPAQGKREVFLTVPQMANSFAMMELLVNPKKSVLKTHDNFCFYICV
>t3149 hypothetical protein
MSKDNTTEHLRAELKSLTDTLEEVLSSSGEKSKEELSKIRSKAERALKES
RYRLGETGDVIAKQTRVAAARADDYVRENPWTGVGIGAAVGLVLGVLLTR
R
>t1039 hypothetical protein
MNDVLNSGAFSLASLIVSMVVLVVGLALWFFVNRASSRANEQIELLEALL
DQQKRQNALLRRLCEANEPEKEAEPATAASEPKEDEDIIRLVAER
>t1145 hypothetical protein
MTTITLVNEQNSTGNPFSAHMLCEQRAIQEITYELLQSQQHVSNKDIIAK
LIEKLETEKDVVQLDIYRNALEAVFFQTPDDI
>t2970 hypothetical protein
MAANGENNPSNLLGAPVNTAISGGSVLPEGKLLTAVNSSFRDKDHQIEGH
GSPDVYSQIWLLKIRYGLTDRLELSTVGSYINNKRDNLSPEHIEGMSDQS
VGVNYALMSQRRGDPFWVTVGGAVLLPTGQDGDNHLPGNSAWGGRVTLTL
TKLFTPNIKGDMGFVYQGPFERGNQDVKRGNEFQWNTQVRYMFSDLPLDL
GLESAYSNNASGTKKLSNGSVINNHSGTTEWVVGPSFNIAVDSLKLWFGA
GAFFPVMQEAKSPTKMEDVRWEFKIGKTW
>t3606 hypothetical protein
MNKRFNIDWDNELTQEQLINLILTDEDLPKLRSLTIGNWGDCWEDETCQP
IIDMIVENAPRFSHLESLFIGDMESEDCEISWIKQGDYSRLYAALPNLKE
LIIKGASDLRLGAIHHEKLEHLEIISGGIPSNVLAELQNAQLPALKTLKL
FLGVEEYGFDGSLDNVMALVSKDLFPQLNHLGLMNSEEQDDIARRVLESN
ILPQLNVLELSCGTLTDSGAEALLEHKDRIAHLETLDLHHHYLTPEMQEK
LKAALPIPLNLSEALEPDDYDGDIYMNAMYTE
>t1375 hypothetical protein
MVDITRVRRESLRWSLLVALNKTRPYTASETLLLDVSRAIYPDTTPLELR
RELDYLADRKMVDLEKKPSGDWFADLTRLGVDLVEYTVECGPGIARPEKY
WSE
>t0939 hypothetical protein
MEITFSADREAVDELIKHFFTENKLYLRGRFYLSAAVV
>t4045 hypothetical protein
MDEQWTTNVLFQRVQRLGGAGDVTLLHKHHEDGELAEGNVIGDRQVHRPP
RALEPIPVGVIVEASLGPDSAQKP
>t2891 possible lipoprotein
MKKTAAIISACMLTFALSACSGPNYVMHTNDGRSIVTDGKPQTDNDTGMI
SYKDANGNKQQINRTDVKEMVALEN
>t4216 hypothetical protein
MKLNLITVSLATLVAAGAFPAHAGPQAHVVCGYHHTLGDDAIMMFGKANQ
AMWHDFFGNTHTDAVSTYQTLRAQPDTTCDNKADSSGYWAPSMKLPDGEI
VNPAYQKTYYQSTNVAQYPLHPFPAGLELLAGDHHGTGPSSAITFLCANG
KGYTNKVGEICGLRKAGDAVQFNIGIAFPNCWDDVNLKPTHSHNNAIYAD
HGKCSADYPVKIPTVNMNIAWVLPQISSLDTSKVELSMDPVMHGETREER
WGSIYTAHADFMNGWTEDGAQFMTDLCMNQGLDCGTAVPYAYSKAEENTW
VSSEDDKPHASVDTLYVQDDWTNGGRTQHPETLTLVKFKIPPLPANMDAS
LFKYRIRLFGGKTETNGADQIFFYPTSSDWHASSVSWNNKPAINYRSDAV
LYLNHSHEYRMVDVDKAVRKALAEGKTEISWYIGGDRQGNHYDFMPADSK
QSLVLMLTGFKKTPEL
>t3767 hypothetical protein
MSYKYVGKHGCDVALRMGYKEYPDENAYGDAYYIKDGLKWIFNITGLKKR
LGVYSDDDLRKQNYDVDTYYRVENQQEESADDEMQSLYHNLAVEEGEPVY
LEGGMYLYPDGSIR
>t4326 hypothetical protein
MKSHDRLRIVFAQARQNWRAFAYVMQVHENHYTKRAGVAGIRARAQRLTL
GPKSLQAKKLTFIFQLELSAKK
>t4324 hypothetical protein
MKNIHSVTDKALYDALNQKQITLNEIQDLFLERGTIICKKTPRKELARNY
SRMTHDYYEHQKIATLLGGQTRTEKITCVRIETGIDKKGIIDAAEKLKKE
ITGQDDYCKIIVDGPRVLINIRYLSTNYGKSDFKQAINKEALIEIEPLDD
GYSIRRPDNENLDDYEGLLLGHISAIQNEQADDSNLDLKLNEISLSHNTS
ADVRTLFFDKLIRTLDGYELLDVTDAYVYHPKPETIEAEDGNTETGVHVS
RASLRGEGVLKSDELSDLYDRGFYIWKIKWKVKENLADPDIFELEAQFGD
PLNCTNFSYLVKGVRKYKANGQYFSKPQKLSGRETERFNKLIENRAYSII
MEIS
>t3412 possible lipoprotein
MKRITFSFTLVAIIIALAGCALPDKDGDFGAYVHTCQQYAYGKSYAFENR
DFAYKVCKDAAKLWSDEVPGYVIRQIQLHPEIPSEEIKYAAMAGSLGNN
>t2568 hypothetical protein
MKYKKIRVSYDTEVTGNVNGVYSVEIKDSLSFKSKEDKKYFEGFFLKNSK
DYNRVMIDDFKCIDVNKIQEICFFPVRKKIKEIDMIDFCPFKLGLDFLIS
KKLFDIMNNFNLPPVNKIPTRINTFNTEYFLIGFPMIPQERIDLNKSIFF
DTKKRSEFNLKSYDAFINTDFSVKPRKIYPDVFYDVDTIGFQGKGLFFSD
RLIDAIQDAGIVGLHVDDTEMEMNP
>t4282 hypothetical protein
MNVYQLKNRTDVLIWLSLIEGDLLSIQASLNAGLYPLYDDRQEEPEFECA
VFNCGMAFGEFMERLESEDIDVLTTAGHELTGSIEHMGRMLCEPVWTQAV
LLGRNEARADRAICAAEADGWLYDPA
>t4553 hypothetical protein
MHTFRPYLLRHSDLLYEDIPLEIREQIILLIINTLGNCSSFYDMTLYCYH
NSHSDEVYRRICKTLRKEYGLFTLYAHSTSYLDEMSNLLLKTDDKRKHID
TIELAFNYIDTYLRTYEVTLGLEPDKAISELNNIFHEHSLKYRYENGRIV
RLRRIKRLKNICYYLYSPGEYGFVEYDLMEAYNRLMLNDFACVVRECHAV
FRSVLLRIHERKSIVYHEQDSLNTLMANLMARGVISAEYVHKFHFLIDVL
ESEIFLPMAPEKSHHHHAMMLRISEELACSIYYLTERSIFFLTQRAEEDG
VAP
>t2505 hypothetical protein
MLNRYYRFAVEKDDPSEAPESGDKPDSQPQKKS
>t1887 putative bacteriophage protein
MPKNQSLPTVSDFRRDFPQFADPARYPEAQIEFRLNLADELLSENVTGKK
LFPYFAELFVAHYMTLWAADSRAMLVGGPGGSTNGVQSSKSVDKVSVSYD
TSATLNPDAGFWNNTRYGAEFYQLITMFGAGGRQL
>t4287 hypothetical protein
MLASFYIQRQLSKALGLSVNAEEVFYQVDDRESDYVNTDMVLNRDRLLSV
MQFMLDDVALNPELRERCRQAERILTLWIRGLDALATVSEDMSILPRTIT
ECSGRVDRLLPGDPQALLALPDDAFLRLTAQCHLMSGEQFPREQLAATMP
YWTRFMAWIARELYQVEDRCLVQLGRLYRRLHVEPRKIRCFNLAFGRIEL
HMSGRDIDECQYLYAYDDASLEDYLEEIMAGNLTPVRFEVRVIYRNDSEL
NVFRRDADVIDVEHPHVSDWQNVVSEALDWLRQERTSLVEIPLTRPALKL
AA
>t0082 probable secreted protein
MKLKLIPFYLLALFSAASGATEINACKDLIGTWKTTADNPPYTMTILPPV
EGCGEKCVKLNVQYELDVTHRNALYCHEGQEGVKGQGPMVIAFEGAYGGH
AIGTYNRQLQLLWAGVIPKNKKGKWITKMENYWFRQVKAH
>t1118 hypothetical protein
MLNIMVNRITVPAMACNGISSGASREDCEAQCDMFWLYGEFA
>t3042 hypothetical protein
MFLILPLKANIIPSISLPDKLSESKYSVRMLRIDARIDYRVETPECKNVK
FNDYFFLNENIKDLNDKNHVYDPEFYPITLWSCSYSREVYDVGEYGSHIQ
HIVFDKNKNIWVIDDDATKTNNEFKPSFLYNISSVNAQGFLIIDENISEM
DKTIYEDDNIKKKFYFCMIDGSHALCGAGSLIRKIDNQLIDFTPYILKSL
ESMEIGVN
>t1026 putative secreted protein
MFALVLFVCYLDGGCEDIVVDIYDTEQQCLYSMDDQRIRHGGCFPVEDFI
DGFWRPAQQYSDF
>t1117 hypothetical protein
MGRHWHFLFNEELTTLNNNGFYSKIDFFIECSIFASVIIFLECKFSLNTG
EIYV
>t3607 hypothetical protein
MCTFPYTPIKGFIVITLIGPHQSKRTHYFMQAAAILGESVTALTHPQALD
ASLTSGIVKIDPPVHNETNIRKINAIGLDYIHFLQNMAARPGIT
>t1151 hypothetical protein
MIMKLDTRLTSSALTLALAAVVIPFTADWQLPLLNGVVVRWIENGQALWL
LFGALFTAWYIRPLSRPEGAKQFWLWAVVWWVVLLGRSTSWGRDYFPDEP
RMLFRTISVLLIAALVLPVLFFAGLRKEIVRRLRDVPLPLWLFTVTACSY
LISDTVEHHRWLSPIFLHNARYTDLIEELYEVPFMIGLFMVTVGFMQQDK
QDECTALELTPYHAK
>t1906 putative bacteriophage protein
MAHELQLIKQSSGILIPATPETSDILQSKIKLGAVLVAEFRQVRNPAFHR
RFFALLNLGFEYWEPTGGAISANERKLVNGYAKFLAAYGGNESALLDAAE
QYLEQIANRRVTNGISLCKSFDAYRAWVTVEAGHYDAIQLPDGTLRKHPR
SIAFSSMDEVEFQQLYKSALDVLWRWILSRTFRTQREAENAAAQLMSFAG
>t1091 hypothetical protein
MKNNIRFDLSDYLIHFFRDVDLETGSHIYLPEHCGFNNQHHACSIDAKYL
LRLSLRSHKIFSSWSYRNGQRTVYGDSPVVCFTDMPIAAYLETGVRRLER
NENIGLYAIVLPKEQMFNYGARPVIYGLDQHNNARCSQGRYGERILDETA
LPLIEQYRYVTYVPGKIDWTHEREWRWPYRGDIKKFLNHIKEYGIPENIE
STPGFDFRSSEISGAGIIVPFAEDIPTVAHDILTLIDRGVIGRNTFKFII
AVESLQSWTQLSEPGALLSYINDNTFEFESFFDLSASKVKNYADSINDYV
SELYSKKDFLNDNYAMEFGNAWVWIHDNQSQVVRALLQAGMIKVNKEGRY
LLDVNLASVDWPLRRKEAFASHVAGWLKHRFDIEAGRYSVRGKDDYDAIP
SYETPLKDQHPFYNHTVNVDW
>t4236 hypothetical protein
MLTSPSSSTGLLAAKFAGLSLPLQVLRSGAGYYIGTQNEEGPVSRESVEY
FSTQSQAEHALKQGTWSQREQA
>t2659 putative bacteriophage late gene regulator
MMICPLCGSAAHTRSSFQVSSLTKERYNQCQNINCSHTFVTHETFVRSIA
TPKESNPVQPHPHKFKQVGLPI
>t1668 hypothetical protein
MTADLQSAPFGRSGTPPRDKFIMLLAFYSGKDYSELKLLTLRAVACGNVL
SLLLESNLEVDDGRFAVSSLMIAREPHHGVMLNTGPLPFGKRGASYQMTR
RCKAFLCLK
>t4264 hypothetical protein
MLSKSKHGLRVPLPCLLILLLLVASFSSHAGLPTVNIPGGGGGGDMMQTI
KLLFSGFLVLMGLIICAVGFYMVSGAILATFSEIRSGKAEWGKFVAIVVV
GIAILVVIVWLCTEAAKILN
>t3406 hypothetical protein
MAINGAAATVPLSPGERLNGLNHIAELRAKVFGLNIESELERFIKDMRDP
RDINNEQNKRALAAIFFMAKIPAERHSISINELTTDEKRELIKAMNHFRA
VVSLFPRRLTMPN
>t0865 hypothetical protein
MIFCENTASQFGIGVADFGAVVALVSGRNNLWTRAKPAGATKGTSYASRT
ARAVFKKSKFPFGISLPTWLGGYTPWTARRVMVRNIAPFVGRSIPLIGEI
ILAADVSQIAYRTIRDYNTIARGNDKLC
>t2585 hypothetical protein
MQKPILSLNSTCIVGGILWLLPVIVVHADEIVCPAVIHFSQAPIISSAEG
WNVSSRAARMLNDGGFLSSGDPKEQADLKGEDAVVDGKKGHLWEIDPEDN
RRGIWFSCSYGQWAVLLARKVQGTVATCWSPDVLPAKLICK
>t3148 hypothetical protein
MKYRIALAITLFTLSAGSYANSLCQEKEQDIQKEISYAEKHNNQRRIEGL
NKALSEVRANCTDSKLRAEHQKKIAEQKEEVAERQRDLAEAKAKGDADKI
DKRERKLAEAQDELKKLEARDY
>t1865 hypothetical protein
MFLTFPNVAITRDNRIDKLSENDLELIRDTAIQNGGRKIQVQLRDLLYEV
SNRAVEGDNNTFKVSFSTTDRAMFRERHIEWQGNAIRLERQLNTGLNVSR
GNFLNMSTTSFSNNSQELAAHSSPGAPAIFDTSNKQQLVDKIDLCSFSPN
VDELSCTEDNLTCPVMLVVPEKGVFVKTGPESNICQLFDETALIQLIIDG
ATHPVSRAPLSLDMIINKNECYFDTTKGNFIIP
>t2655 hypothetical protein
MKTPLPPVLRAALYRRAVACAWLTVCERQHRYPHLTLESLEAAIAAELEG
FYLRQHGEEKGRQIACALLEDLMESGPLKAAPSLSFLGLVVMDELCARHI
KAPVLH
>t2572 hypothetical protein
MVKRYLRLMLSMFLVSKRMVEQNNNTVKTGSNYRV
>t3894 hypothetical protein
MMTISDIVQIILFCALIFFPLGYLARHSLRRISDTTRLLFAKPRYVKPAG
TLRRATKVKADKK
>t0634 hypothetical protein
MTGAVSLNGAEECAGGVGGRYPLHDGKAKRLS
>t3352 possible exported protein
MQYRNLFCLFSLSLLLPPAWGCRLSEPPHNVYQRQGSGVVYLQPEEENNL
SLPEVNFKRLRRLPNSLINPATQDEWDREPPLTDLTTDYVHTSAQAIFPR
FSYYSDGRVILYGGKIMRNPPGTPSVDVASFRAFGKVGVDKNSLYDEGKR
TDDNGGENRVNLQSLRKVEFSSQWKPDLLGLTLRDDRFLYVDGHRLSDPD
SFTVLAQKSWDQRGKFSATFNPCIAVPFGPWDTLARTRTKIIINGDQLDA
DPDTFSVVRWMPGSLLTWRDKNGLHRHVLNQNNLNWDMQIEKHCDAFNLQ
EKRVLWRKGPGCKFAEIPGLDPEQFHPLSNAVAQSQDRLYAIRKTEFGEN
QLDVVTLDNPNLVIDKRFNAGRYHGYLLTKTRSDFKEEDGLQVFESAGPL
ILMDYHVPDETEAHLGDSPHYKKWYARDDRYVYAFDGAQLWRYPTPNPKA
LRMKWQTTHSGYGYWVDYRIAQLDGALMEDGSFIATGIPRDPSVKTAYAK
TDGTFDIGKTAVRWRKVLSPDGEWGRWQTIPNVDPSQFHLVNDRIAQYKD
RLYIARLSPFGEDQLETLELDSTAPFLNQKINAGKEHGYFMHNSQQAQDI
QVFSINGPLKVTERFAYDNRYVYTWIGSQIYRTASLCPDKTLNLNQNMYS
DNKDIIIITQAQECQKSRR
>t0353 hypothetical protein
MEDNFSIYINNKLMITKNKKTNKGCRRKVIIDLNQMLLLFNNYNR
>t3033 hypothetical protein
MITIIIFLIAQLKKLGALADNEMFGLEPAYIFGGEIKKFSLFKKLNFLMV
RLSLIMNLCHGH
>t1422 hypothetical protein
MKKPLLLTLLCMILAGCDNPKSPESFTPEMASFSNEFDFDPLRGPVKDFS
QTLMSENGEVAKQVTGTLSQEGCFDTLELHDLENNTGLALVLDANYYRDA
QTLEKKVQLQGKCQLAALPSAGVTWETDDNGFVVSATGKEMKVEYRYDSE
GYPLGKTTINSQNTLSVTAKLSTDSRKKLDYTAVSRVDDRQVGNVTQSCE
YDAYANPVDCRLVIVDESVKPAVSHHYTIKNRIDYY
>t1110 hypothetical protein
MSCFTSPAIMEMLGHYKWRVYEPFRFYLSEDKNDVIEVPVGFVTDLATVP
RIFWSLLPPDGEYAKAAIIHDYLYHYPLRNRKESDLIFLDGMKVLGVPKW
KRIIMYLAVRIFGWKYYHSHTIQHN
>t4368 hypothetical protein
MAFSGLALLPLAALYLYLAFSLLLAQSPEKNNGMLHQKPVNRLDHLETSL
LKGNRTKNFAPKIFCTKTDG
>t1573 invasin-like protein
MVFSKKPITKYITWAIVTSQIPLPVIADSDNEIQSWIAGTASSISPHLQE
GTLEDYAKGKIKALPGQAANHLVNEGMKSAFPEIIFRGGVNLEDGAKYRS
SEFDMFIPVQETTSSLLFGQLGFRDHDSSSFDGRTYVNVGVGYRQEVNGW
LLGVNTFLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWKTSVVHE
LHDERPAYGFDLRTKGTLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNP
RAAGAALVWNPVPLLEVRAGYRDAGNGGSQAEGGLRVNYSFGTPLHEQLD
YRNVGAPSNTTNRRAFVDRNYDIVMAYREQASKIRIMAMPVSGLSGTLVI
LMATVDSRYPIEKVEWSGDAELLAGLQLQGSLGSGLILPQLPLTATDGQE
YSLYLTVTDSRGTRVTSERIPVRVTQDETSFRSWINVINDDVQVEDGNFV
ISTPLPAGEEGKVIEWHYVRERSEEEWASLKPRHIKYQSDTPGLAFKALG
GTERDGHWVERVLVTHIGDDARSLKLHIEASGPDDKHPVKGTVLLQAQSD
SIAQKVTSVEVLFTQGTEEANGSVTAPVVGTEMRARTLCVNNTDCTDAFN
YQWEISDEMKSWQSVPGATKATWLLPYSLNGESLQNKYIRVRIISDKGNA
KGNTATSDAN
>t3445 putative positive regulator of late gene transcription
MMICPLCGSAAHTRSSFQVSSLTKERYNQCQNINCSHTFVTHETFVRSIA
TPKESNPVQPHPMKSGQVALSL
>t1129 hypothetical protein
MKTINRYTNLCSGIAPITGRMSRVDKNWTDLADNSNLPPSNGDEVSGTG
>t4450 hypothetical protein
MTIRRWILSATLLLLPVPAFADFQYQQDKDGVFYTADDEPQILSRLPDVS
YSHLRRIADLSHPQDPRPLIEINPDSHNCDDNHICQHAYLSDGHFILWAG
KIVQNTGDEPAVDVASFQSFGAFAADKHGLYFDGKRRDSNAGEKRVDMAT
LAETKIWNLLRDKNNLYYEGRWLGRADGFRVLRLDSTSAREFIVTTAQRV
IVNGIPITADANTFQIIRWMPGEVLIYRDKTGKHDYEIDNSSRYCGYFNI
GLREVTWLKHEATNAGSSCKVETLPGVDPEYFFRLNGNTGWYKDRIYQVS
TNALGEGVLRIFTSQEKLPALKIDRVTYNYYHLALSADGQLYRQISRDQW
QSYNPILTEWTTVSPAPTDVISLLPSDYH
>t2948 hypothetical protein
MSITNQINKASSLASLRQPQRDKDGQIIKGSIWGTDISRTDYVQMTNGQD
AQVLNVGFTDRSGRPMAFGNDSHDFVQALEDSLDETFNDGDFRTAVMENV
FPCEFRPYGTDPTPIRRQALQNRTVRFKHNGAFNPAKDAPSTTAIKGASV
TVTPVLNPETNKTGILCSAATHVSDFDMMGGWTEGALIQTVGDMQVQYSR
QMFAAVVDVLKDTPDLQIIEAAPLSGKPSDQAEDLLDTLALNLPVELGNT
LSDYAVLVPERLEAILDRAAQRAGHEDISELLGCTVCSYAGDDTGVYLLP
KRFASISFRSTKDAKTVDVKVTRNSNTAGYDLELISVVDVLATGSVKVKA
GEFDVEKDASFPLIHVIRFTTPE
>t0936 hypothetical protein
MYEPESAIVTYSLPGELSSSTCDAHNFIGYVDANDFAEYIKKVELTRYNM
YCLKKTGAGQWSIYAN
>t2696 hypothetical protein
MFSPQSRLRHAVADTFAMVVHCSVVNMLIEIFLSGMSFEQSLSSRLVAIP
VNILIAWPYGVYRDLIMRVARKASPAGWAKNLADVLAYVTFQSPVYIIIL
LTVGAGWHQIVAAVSSNIVVSMLMGAVYGYFLNYCRRLFKVSSYHQAKA
>t1027 hypothetical protein
MMKTSVRIGAFEIDDAELHGESPGERTLTIPCKSDPDLCMQLDAWDAETS
VPAILNGEHSVLFRNHYDPKSDAWVMRLA
>t1963 hypothetical protein
MQRQALDRRNVLWTHLNYVENYFCSDHKNGINHLKKEAVHKQQNQDAMDD
TLTVCIAVRALRADE
>t0104 hypothetical protein
MNVSMQARLVGSDRYLFVNTQRANPSIKTVSRFFEYKTWTEQIWRTEIIE
NGNAFFHWQGHDRKDGHRDTIINYLLNGQRWQSTIEDYIFFHALEGKAWQ
GHYDNIIEYVSSDHYVYQSAFAEYITDQIHQRAPHGTRF
>t1430 hypothetical protein
MMTYDRNRNAITTGSRVMISGTGHTGIIKAIESEGLDAGQIRRGKTVIVE
GCEGKFAPVELIRLGMN
>t3276 hypothetical protein
MKIKTTVATLSILSVLSFGAFAAEPISAEQAQNREAIGSVSVSAIGSSPM
DMNAMLSKKADEQGATAYHITEARSGSNWHATAELYK
>t4145 hypothetical protein
MVKNSEYKTEGRIPAYNYYHVQQKLQCIWANNTCYS
>t3041 hypothetical protein
MIVSWVITKKFIYIVTIAILFCSVVIYLWSGRPVEIVDVHYYSGKDINIL
ARHFPITDRGKLNWWRENERKILEKYNLPKNNFSVYIWDFGDGYQKLSPY
DAEDEFYCFPDIKSESKCIKKDIYMPAESGGNYYRMFLFSGSGYFYQPKK
DDDKLIESNNSTKLNQDKSHEKNHYH
>t1874 putative bacteriophage protein
MPVMHNCFLELARESLQHNGEQWTRNAISRSYYGMYHSALRITNNLTPTH
DTDGERLPGGSHMRLYTAFCNGEAAKVNGVDVDKVRKIGIKLKMLHAQRV
NADYRLERKINRITAISALQDAEEIDALVDRMMNNPDDSLTA
>t4263 hypothetical protein
MAMNSEQANAFKAGSGIDPTVLNKFVLGFVLSVLFLWFAWSVLVVFRGWR
AGKVTEQNALHFFISTAILVVFSVWMFAS
>t1132 putative lipoprotein
MTNKKHIFSIIFIGSLLTGCATGPSPTGIGLYTDVKGPITATSLPATKTG
KACAQTVLGIVNTGDASIDSAKKAGDISLVSSVDYETTGSYPFYGKTCVV
VRGQ
>t4487 hypothetical protein
MICIKDVCNFIDVEQGKNQSVQMAGAVAFVKDSKKNLINYSTKTDSNLEQ
RHSPCQ
>t0034 possible sulfatase
MNKKNSSMVNLPAPRESINQKIDTNNALVLNHNAIYEQRLAEITQSNTCD
KAIVTVNPYGTAPLSLYLGIWIDEAAALEINVVDSEATTEAVRYQYDVHP
GANLIPVCGMVSAVNNQITLRLASQIVGQYTVMTDALPPTDSANVSLGFP
IISVSCPAQQASLMEEGLYFSTYFDRYNLAFDHNGIVRWYVSQEIPSYNF
VRMDNGHFLATSQGINHCLNMYEFDIMGRVYTVYLLDNEFHHSILPIENN
LAIAPSEYSNGRPDGYSTGKDGVSIINLSTGLEVAYYDMLYVMDYSRSPR
PSGSAPGQDVSMDDWLHINQSYINEPNNLLICSGRHQSAIFGVNVDSGEL
RFIMANHEDWSDEFKQYLLTPVDDDGVPLYDLTSPGGIDAADKNFWTWGQ
HNIVEIPNDEPGILEFMVFDNGNYRSREDAKSLLPLDNFSRVVQFKINLN
TMTVTRPYEYGKTEVGNRGYSSFVSAKHLLTNGHLVIHFGATTVDEFEHT
ITAQPGSSDLVDPDEGQQALGRLVLQEINKETKEVLFEAMVTSGYFKNEE
TNGTNYRYDISAFRVYKMPLFA
>t2941 hypothetical protein
MVILFSGNLMAATNMTDNVTLNNDKISGQAWQAMRDIGMSRFELFNGRTQ
KAEQLAAQAEKLLNDDSTDWNLYVKSDKKAPVEGDHYIRINSSITVAEDY
LPAGQKNDAINKANQKMKEGDKKGTIEALKLAGVSVIENQELIPLQQTRK
DVTTALSLMNEGKYYQAGLLLKSAQDGIVVDSQSVQESPTHPVQHDAAH
>t0484 hypothetical protein
MIAEFESRILALIDDMVEHASDDELFASGYLRGHLTLAIAELESGDDHSV
EAVYANVSQSLEKAIGAGELSPRDQALVKAMWDNLFDKAKQ
>t2621 phe leader peptide
MKLTRFFFAFFFIFP
>t2937 hypothetical protein
MTEWIFNLKTKLTVLVMMLCSLCVTKVYAVEPGINECVVTSGQNINLRSI
NLTTDDFKPGPDSVIYSINQDAVFKCSMGYGTQFPQLVFNQGYFSKFTQT
LDAMGLGFRMSVKETENVSNVVSFSWDEIKSTQSGNELRKEFGTKLPVGT
TERKVRITLDFLYTKAYSESSAVTAFTGISNVLNIVPFPYSLRQNGFVLS
GFNVRILRNGLGKVDIVPLQVNFGHIYTTYEPSQTRQANFTVIARQVLRP
AMGQEFAIPLAITFGKGALTQDTGQTLNLVSLDGPNKGQPNGLRLSIKDD
KGKEITFDKQEVLGDITITGTVTGNVSKVYTAVITPTLGGSVKTGKFSAA
ILVTVTYN
>t3798 hypothetical protein
MKVLIINDTGNSYHWGCYGTSTAIKESLRFRGINEIVTFSCEEGSKIENS
PKKILLVYSKNKLIRRLASHYYSKHLRRKLPDLWDSLLKSDCVIINGEGT
INSIHTATRFIFFIIHVAKDILKKRFI
>t2187 putative lipoprotein
MKKILLIASMTAGLTACASSPAPEEDSRLKEAYSACINTAQGSPEKIEAC
QSVLNVLKKERKHQQFANEESVRVLDYQQCIQATRTGNDQAVKADCDKVW
QEIRSHNNVQ
>t2302 possible copper-binding protein
MESRQLSATAHEMMNNGDAFAHQQMAQTQKPSAPTTVENATPFSEMDDYE
KAMVIHQSMNNAHSFSHEIQAEEHHKQIKRN
>t1987 hypothetical protein
MPGLIGYWKQLPTKDEYIKKHNMSKISCYSCGHEKFSDVGLIQVWDNHRR
ILCAKCKTTLFREED
>t0450 hypothetical protein
MFRSLILAAALLAFTPLAANAGEITLLPSIKLQIGDRDDYGNYWDGGRWR
DRDYWHNHYEWRKNRWWRHDNGYHRGWDKRKAYERGYREGWRDRDDHRGR
GRGHKHHH
>t1918 conserved hypothetical bacteriophage protein
MSMSNTAEIYKFPAPIPTQQECRMADLENGYLRLANQIQDALCIVELSGR
EFRVLNAIIRLTYGWSKKSDRIANSLIADKTTLKVKHVSEAVLSLAYRNI
IILRRIGQTRYIGINTNLDKWAYSKPHCSKCPVSFPDDEIATWIISVPET
RDSYTRKGGRASPKTGIVIPENRDSVLPHSAIPENGDSYPRKEGRASPKT
GNTKDIIPKTNIKDLTPFNPPKGKVKFDPLSIPVPEWLNAASWNEWVTYR
QQSGKPIKTELTVTKAFRLLKECLDEGHDPVNVINTSIANGYQGLFKPKF
ALNDRRAGRDVNHISAPDKTIPTGFRG
>t3942 putative lipoprotein
MRNLVKYAGIGLLVMGLAACDNNDTKASAESAAAESQASGQPISLMDGKL
SFSLPADMTDQSGKLGTQANNMHVYSDPTGQKAVIVIVGDNTDEALPVLA
NRLLEQQRSRDPQLQVVTNKSIELKGHTLQQLDSIISAKGQTAYSSIVLG
KVDNQLLTIQVTLPADNQQKAQTTAENIINTLVIK
>t1451 putative ATP/GTP-binding protein
MQRLTVYSHPLWITWQEAPIGRLLQGATPVYAKTLISRLFTLCAQAHNAA
AALLLFPEEKPDMQAAQQELARETLRRALTDWLPLFSHRQATAEEWALLR
RGELSPLASTIFFDDDPQTWLAAGVKGWEAWFLQERSETARWLAAVQNII
TPTLPMASSPDHTLITHGPLDVSPLAIEYPLLSACCLSGKTTALRLLARC
ITLARSLSALPTLRWNRFDDGEWKIAVVETARGWLVHQARLTTSGNITKF
RQYVSP
>t1903 putative bacteriophage protein
MKMHNDPHSMDSQSIFAGSQLLPMEKTSHLALSVGFRSHLA
>t1572 hypothetical protein
MRMSKHNLIIYSLLLAAVPISVLADSSTTSAATVGMFSPPTAPSVGHRPG
TPRDTYELKFNGLDFSIEKYTPFPGMTIARDDRYLGISDPDNNYQSAETF
TTVCTYYQINKDNSQHILKREKPCEHKFNIQKEHRGSKIKLEIYNETDMA
SASGYTPVPSVSEFQYIETETISNTPSPDLTVMSIDKPVLSPGESATITT
IVKDIDGNPVNEVLIDKTVKYENSKGLWDYGPIKKENVPGKYTQVITYRG
HSNERIDISFEYAGDLFRKEISIRGRL
>t1774 hypothetical protein
MSVIKKNIPAIGLCICAFFIHSAVGQQTVQGGVIHFRGAIVEPLCDISTH
AENIDLTCLREGKKQMHRIDLRQASGLPQDIQSIATVRLHYLDAQKSLAV
MNIEYR
>t3020 hypothetical protein
MMRKTLLAAVLTFTAMAAHADYKCSVTPRDDVILSPQTVQVKGENGNLVI
TPDGNVMYNGKQYTLSAAQREQAKDYQAELRSALPWIDEGARSRVEKGRV
ALDKIIAKEVGESSNMRSRLTKLDAQLKAQMNRIIEHRTDGLTFHYKAID
QVRADGQQLVNQAMGGILQDSINEMGAKAVLKGGGNPLQGVMGSLGGLQT
AIQNEWKNQEKDFQQFGKDVCSRVVTLEDSRKALVGSLK
>t4571 hypothetical protein
MVVWGGSRKIPYQPPHRTENAGIMNVCTNNTGIKSTVSATWINKEGD
>t3920 hypothetical protein
MYNILNFFVSGIQGLYMTTLLQVIVLIFIVFFIFPGRKIFRRSKHLQDGR
KLISSSSLMLRKFGSSRGYNADYYFDMQYLYEVADGITTAIPLTSIIEAK
PGTTRVSGRSVWSVDWITAEGQRKQTRFLHNYTLFNRNFATFLKTVKQAN
PDACITSLTLFTL
>t2513 probable secreted protein
MKKLTMRECESINPTLLLAIKILISALDEKTKIRLREHIEKTEQEIASSI
HDSIMTETFLQQMKDIRYILNIVP
>t1582 putative oxidoreductase
MKQSNNQVGEKVLVLGAGQLGASVLASLVPAITQRNGSVCVIVSGRSRDK
QSKRRSSIHQQLADAGARFIPVDIADSSVAALKDQFHGFDTIINCMGFVA
GAGTQIKITRAVLEAGVKRYFPWQFGVNYDVVGKGSGQPVWDEQYDVRTL
LREQNVTEWVIVSTGLFTSFLFEPAFDVVNLDQKTINALGGWERQVTVTS
PADIGRLTTEIYLHQPRITNEVVFVAGETTSYAKLAKTVERVTQQTFTRG
VLTLPDLQEQLRLHPHDPMLRYRVAFARGDGMWWPMSDTWNAQHHLPTQD
IATWLKTQQ
>t1068 hypothetical protein
MNSEELTHKAEEEIAALISKKVAELRKKTGQEVSEIEFAPRETMKGLEGY
HVKIKLL
>t4530 hypothetical protein
MKSRLFTLHQLITTYHHDIKKKSEEVNSVNSYMEKKNFTDVQYYPG
>t2949 hypothetical protein
MSISYRKLDIALSADKETVLVFGQELSTKYFTEIVVTTMLNSTGSDMANS
NRILNDIHAAGLDAGDYGKYSRWWAQSNAQERQEAERRRKEAKAHQERMA
AIHATPEEIAKAVAERKAREEALIKRFGNKGAAFGL
>t0729 hypothetical protein
MMLKIVMLFLMFFPCYCLPMDIKNIKDCKLEEGNRVKLISLSTVDGSTPY
LIFDNVIVSAFLDGSIYSGDIILSKYIHHSLIFALNYGAPYMKGCLITGL
SASAERSYKPNGFCFAERNIPESVWFGEDHTLIIIKNNNSVGEWRGNYII
YDSRGDEAQTFNKLPDTKNYKIYRLDLYYCSCYVFWEDRQVMRIQKRTFL
KNFLVSIKIFHLLSVLKKIMKKWALFLLSYL
>t1733 hypothetical protein
MSVACISYGNTAQLSGKQPGHYSPEKILSTGKDCNPQPANCLKNQYVLRH
CCVDDRSDKMGYSAKLFVLTSFGAETASLFHC
>t1604 hypothetical protein
MAKLPRRKCKVCREWFPPAYSNVVWCCPEHGAIYALELRAKEKSKAAARC
IRGKHQADKAERQANGCMLRERQAVLYTLSRKMFRKHLR
>t3405 hypothetical protein
MYAAESEVVYQFCYRGESYSVPEDDLLCCYPSLSGDGSYFFTLKDGTFLR
GEQVKETVRKNVSPFERYRKNKER
>t0730 hypothetical protein
MGVVFTKLPLKDNTRGFIDLDMGLAFYRDKDNVLASFIENKTGDFYKPRQ
AYGDLASVNMVIYDCIDFYHSKELNIFLRKIISERNIRNEND
>t1125 hypothetical protein
MIVTTAQYYGIKTEENGTVHCLVSYGICQ
>t4370 hypothetical protein
MMKEKKTLLNNNENGTGLPSKPSIMSENKLQNTVFSAIGSASPAHKLVQG
SSAAAFLKNHESLLAGTAVSSAFATQKPAQGSSAAAFLKNHESLLAGTAV
SSAFATQKLAQGSSAAAFLKNHESLLAGTAVSSAFATQKLAQGSSAAAFL
KNHESLLAGTAVSSAFATQKLAQGSSVAAFLKSHESLFASTAVSSAFAAQ
KLVQDSGITALFKKQGGIFNENYFATSIAKQLEKLDATEIQTLQEAAILF
SRSPQGAAIIATNIHAVEVKEPDALTKLKNYIEKSPITSNFRKLPLLVQL
IIIYLVTTMYSVVVDKNKELAMNLMEQAQQYVSKSISPRTHFKELTKQLP
REVDISTLKHIRIITGENVRLRNTPSMQGDVLVKLEKYTPVIVIDKSDRK
WLYVQLSFGEQKIYGWVNRSYTKAINH
>t3047 hypothetical protein
MNMVLRDLSGWRCEKLTEHSAVLHLNAFTQVICHVQQKRLFMASIHSCEF
RVKGTINYPLQGKIRAHQPGWLKRYPVIFTGSKSTAGLISYLNRFPNLQQ
ALSELDYRRFTLVLHHKEWYCSIELWAASEVVCKMPPLRRYLRLERHQRV
LLLSVINMINQAMNQWLQQDADAR
>t4228 hypothetical protein
MAKAPLLNLNNALLQGAKESSAATTAAVAIMPTSEMPMVLTLDEVAPNPD
NPRTTRNPKYDEIKESIRARGLDTVPKVTKNPDIPGSPYIFSDGGNTRYA
ILRELFAETQDERFYRFHALFKPWPGRLECLVGHLAENDVRGDLTFIDKA
LGIKKARAIHEETLGRTVTLRELADLLGQQGYPIHNSMISRMEHTVEYLY
PFMPELLVSGLGKPQIVNLLNLRADALKIWQQYSVMTETDSDFNEIFGHV
CTQFDDPEVYSFEMFRDEFIGMLVNVLPHPSLNYDRWLLELDPKARNQRK
LFGEPEPVASHLVDADRQTWQSTASLPGTANDENQQGGSSLKPKLNNAPV
LPKQPDPDNDNGDNDDVTGEWFGSCGSPRLPKAEVQNDFLGGPSVLTGDV
IPDEYHFIPDAGNIAVSASAFAGAQVGLTADTPSDAVSIASEMPGLSLLP
TEPALSSVEFANVGLEPVTDIWVIPALQDDIEHLQDMAYRLAYELAEAMG
CEVHVSEDKNASAAGFSVSEHGGEFALFLAGLSGHVPNKQFNMFMFCLNF
FGSQSAGDTPVFDDVHVVKSLRLIRVIRRLRELQRNMAAEQNDGRRHEH
>t3075 hypothetical protein
MLPEHLVSSHFRLSSPPPEATENSKNLSQGKTKLSDYEPDILICANATGQ
HRNVLFEERDRHIKERLYCVAKIETFARLINALQAEGDIDAQTLSKILAD
KTAMINEKGNAIWLNLITRETNMPLFYSLEDKDERS
>t2684 hypothetical protein
MILMNALTAVKANTDDLAQRHTGFILAPSAQSPRLLALTFTADTTRQFLH
QVAQWPVQALEYKSFLRFKIGKILDDLCGNQLQPLLIKTLLNRAQGALLI
SAEGIDDVAQAEEMVKLATAVAHLIGRSNYDAMSGQYYARFVVKNVDNSD
SYLRQPHRVMELHNDGTYVEEVTDYVLMMKIDEQNMEGGNSLLLHLDDWE
HLESFFTHPLARRVMRWAAPPSKNVSHDVWHPVFDVDQQGRPVMRYIDQF
VQPKDFEEGVWLSELSDALETSQNILSVPVPVGKFLLINNLFWLHGRDRF
TPHPDLRRELMRQRGYFAYAASHYQTHQ
>t0161 probable secreted protein
MKLFLTTAALTATLISGMAFASDPVIPWATNSGGTESTHIAAMGEDLNAQ
HQQITHTHEGVWAANSGSIQADEAALTSNKPPVQGHPELMPHQG
>t0682 hypothetical protein
MNDFAAIRASSLHELNVAAPNKAADGLILSYTLFVFTNKGTNNENSSMGN
LNYFSDWAVGGNGRV
>t1509 hypothetical protein
MAMTSRPNYLGSRGILCVCTTAVNRNFSALSPTIDVFLTNCLPDYIVVLS
LAKQCYLVMEGDNNCTIDYQISFLVR
>t1361 hypothetical protein
MSVELTDKGGRCASLGMSNGTWFTLLDIPGVETLFNTRKTNDPIDCTRSK
ARKLADLIEAWKPPDQWFSGTGKSEGKALLIAFLRNCKGFRTC
>t4219 hypothetical protein
MVGKATLDIIFRDRSANAMDNSSLSIGWLTIDSTPPVRSMKNNSDIGAGG
DNITNINTPTFYWVA
>t4332 hypothetical protein
MRNIETRITKTGPDDAGLNQMLTDARMEERRGRADVMAARLDSLAARIVS
RQLNHTEAAELLRQEAVKIQNEAQEIH
>t4222 hypothetical protein
MFPLPLKLPDIVTPSTRRSIYLPRCMPFLAEIVNIYLQDKPYHKLTSDLI
PHANYPHEIKELFSKRIRVIIT
>t3370 4-alpha-L-fucosyltransferase
MTVLIHVLGSDIPHHNHTVLRFFNDTLAATGEHAREFMVAGEDNGFTESC
PALSLRFYGSKKALAQAVIAKAKANRRQRFFFHGQFNISLWLALLSGGIK
PAQFYWHIWGADLYEVSNGLKFRLFYPLRRIAQGRVGGVFATRGDLSYFA
RQHPGVRGELLYFPTRMDPSLNAMAKERQRAGKLTILVGNSGDRSNQHIA
ALRAVYQQFGDTVNVVVPMGYPANNQAYIDEVRQAGLALFSAENLQILSE
KMEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQADIPCVLNRDNPFWQ
DMAEQHLPVLFTTDDLNEQVVREAQRQLASVDKSGITFFSPNYLQPWHNA
LRIAAGEAE
>t1907 putative bacteriophage protein
MHIKHRGVADVSRNKVSVTGGYMLEDLPESGYAVIRCYDHCVVARFGSIP
DSGRALMYRRGDEISFVPLHPDDIVGTPTLFTQMLEKAGYRITRCFDTLQ
M
>t4283 hypothetical protein
MLFPRPRSTDKAYWPDAPAEAGGNPHAGNVVTGAASVLRRRRFTMMLTTR
LFVFLLHHLWRVMRWTGLTVYHGISVLYRRRQARLQASDWRPRTVYTGRW
QLADDGQFCGRAVVLRLRDALKPGVELLYFGEEAAEPYYSERQFVDDRES
VMVAEQMLLRFLGSRHRRQNRRNAPVSVLVTDVVTTAPVNDEAA
>t4304 putative phage tail fiber protein
MSEYYYSFKEKGFFWQPDTESDNYPDDLIPLTDEYYRELMQGQVDGKYIE
HRKGGPVLVEHREYTPEELVAQAEARKAELLAEAESVIAPLARAVKLKIA
TDEEIKRLEAWELYSVMVNRVDTANPDWPEKPAQI
>t2626 hypothetical protein
MRFSHRFILLLSLLLASLPLYAQCVTEEEKSVRAIVSGIVSYTHWPALSG
PPRLCIFSSARFVRVLSEEANWAFPYQPLVIRTTQEALSARCDGFYFGNE
SPAYQVELTRHYPVNALLLIAEQNTECIIGSAFCLIINNDEVKFSVNLDS
LSHSGVRVNPEVLMLARNQKHE
>t0458 hypothetical protein
MIYLWTFLAISILAVSGYIGQVMGAFSAVSSFTGMVILAALIYLLNVWLQ
DGDDIVSGLLLFLAPACGLIIRFMVGYGKR
>t2158 hypothetical protein
MQTLTRVLPPLRLIMFCQSGENPTQFPDTGGLCVEDCVRLRTPEGLLDRL
RRWPGAMVISAGRPSTQLLLWQQVFLRYPRTVVFCSSNAFLPVDVSVEGY
FRHLRLIKRAMSVRVLARMAELAIWSSLQTSPYEEEMKSALSVPELVMEI
NSRTLVRLLSERLPKQGRRVLGLLLSGCSPEMTARMLGTGVRQVWLAEQT
LKQRWDIPTGVPLSDAVRIRIPDVGPDISQQSGLVKTGAGNAPDLC
>t1391 hypothetical protein
MNRTEQTPQLTPEDAAQRIRVLEDENEYLRKRFEEVDLYFGRNLVVMKAA
VIEWRATGDARNGMAWIYNTLCGPGELPPQEEKEAQEYFNRETEVIDRKL
AALYHWFRKYHRTHAAPDQTTTGGTSD
>t3963 hypothetical protein
MIRSGACIMGTLLFAASFSVNALTSNDFDPRDPAIWQEETATPYPLAATT
CEVRHHPDCATWEDEKSDIQRERERRDQRRLQNLYKKYSHRD
>t4284 hypothetical protein
MMDIMALLSSTPPAVAIEEPVWEQLVKYPQAPDCENERLQRLIYHGFQAL
SQAPSGQGIVEFGYFCLPPDGDLHAPLWQNVCIKREYSDGQVTLTFDR
>t0348 hypothetical protein
MHFLCSAQLLAAKHCIDTIVGFTGSTQSNLTVGFERGADIGSFNNMTGIE
VY
>t0084 hypothetical protein
MLLVTHSLAAFKQLELFWVITSVDYLKIALFGVTVLYLVIVTSLCCCQCK
IMRCQLKRNLTPCK
>t4271 hypothetical protein
MSLLKHEQWLILGEDESHRIPPDALLGLPPQGKAWIRKEHDEDMSRLTSH
IQGALGNTYRGVCHPQRIGHRACLAIHLENLRGKKLDILITVSGKTALPA
EESYVNPRWYIDVADAADALYLALWLIR
>t4609 hypothetical protein
MPASCETALQQRCQQIVTSPVLTPEQKRHFLALEAENALPYPTLPEDARQ
ALDEGVICDMFEGHAPFKPRYVLPDYARFLANGSQWLELEGAKDLDDALS
LLTILYHHVPSVTSMPVYLGQLDALLQPYVRILTQDAIDIRIKRFWRYLD
RTLPDAFMHANIGPADTPVTRAILRADAELKQVAPNLTFIYDAEITPDDL
LLEVAKNICECSKPHISNGPVNDKIFTKGHYGIVSCYNSLPLAGGGSTLV
RLNLKAVAERSTSVDDFFSRTLPHYCRQQIAIINSRCEFLYEKSHFFENS
FLVQEGLIDPERFAPMFGMYGLAEAVNLLCENAGLTARYGKNETANELGY
RISAQLADFVENTPVKYGRKQRALLHAQSGISSDIGTTPGARLPYGDEPD
PITHLQTVAPHHAFYHAGISDILTLDETIKRNPQALVQLCLGAFKAGMRE
FTANVSGNDLVRVTGYMVRLSDLAKFRAEGSRTNTTWLGEEAARNTRILE
RQPRVVSHEQQMRFSQ
>t2964 hypothetical protein
MLLHGLVAAVILLMPWPLSYTPLWMILLSLVVFDCVRSQRRINTCQGEIK
LLMDGRLRWQGQDWTLLRPPWLLKSGMVLRLRAESGRHQHLWLAADSMEE
AEWRELRRILLQQPI
>t2526 probable secreted protein
MKLIIILLLALFPLCSSASNHYALVFENNTILLVLNLKCSPCDLNCANIQ
YQLFNKETESVISGAAKPVTTGIENNFRGYMMRNNNTFYSLIESDNKNVW
DMSVEEKGTQQNKDAVQKVKYQLNAIFDNGTC
>t2119 probable secreted protein
MKMTKLTTLLLTATLGLASGAALAAESNAQSSNGQANSAANAGQVAPDAR
QNVAPNDVNNNDINTNGNTNSTMQHPDGSTMNHDGMTKDEEHKNTMCKDG
RCPDINKKVETGNGVNNDVNTKTDGTTQ
>t2824 hypothetical protein
MICPRCADAHIELMATSPVKGVWTVYQCQHCLYTWRDTEPLRRTSREHYP
QAFRMTQKDIDDAPMVPSIPPLLAEDKR
>t4476 hypothetical protein
MMKEQFTTTVRVKGKGDAKARAFADALNHVQSAVMRESPYILLRIEPQDV
RIVQAHESVRKEAFLFFFLRRERRTYSVELDVAVNVTAINLDRVDFVAKR
>t4441 hypothetical protein
MWQKSTLTTVSIFLPQQAVYSGMRYEVNISAAKCDGTGVTFGTINNCTIT
LSLPAGASIITRHFSADL
>t1424 putative secreted protein
MNNTLSKRLCLTAMLTLAAVVYTTSAFAETSKLVIESGDSAQSRQEAAME
KEQWNDTRSLRQKVNTRAEKEWDKADAAFDNRDKCEQSANINAYWEPNTL
RCLDRRTGRVITP
>t3277 hypothetical protein
MKTKYIIASLGLATLLSFGANAAVHQVNAEQAQNLQPMGTISVSQIGSTP
MDMRQEIVAKAEKAGANSYRIIELKEGDNWHATAELYK
>t1357 putative bacteriophage tail fiber protein
MMFLEHITRDGERWDSLAWQYYGDPLGYPRIIAANPHVAITPVLPSGLLL
LIPVIEAEDARTEEDIAPWLR
>t1127 hypothetical protein
MLLVGLPPVAEKDNDDVTVAETDEYDDREFYFWRPYFLPKGRPKAALGLI
MEKGVGHCHNQPVYTNVIDNHYQLW
>t1606 hypothetical protein
MPQLNTNSARYFKPDYSPDAAARRFYKYCNILIAPYFKLVPDATVGLNDP
GPNSVQC
>t2486 hypothetical protein
MADFTLSKSLFSGKHRETSSTPGNIAYAIFVLFCFWAGAQLLNLLVHAPG
IYEHLMQVQDTGRPRVEIGLGVGTIFGLVPFLTGCLIFAVIAAFLRWRHR
HQ
>t1775 hypothetical protein
MHTLLLLAALSNQITFTTTQQGDIYTVIPQVTLNEPCVCLVQILSVRDGV
GGQSHTQQKQTLSLPANQPIELSRLSVNISSEDSVKIIVTVSDGQSLHLS
QQWPPSAQ
>t1398 hypothetical protein
MKLKNTLLASALLSAAAFSVHAATELTPEQAAALKPYDRIVITGRFNAIG
DAVSAVSRRADKEGAASFYVVDTSEFGNSGNWRVVADVYKADAPKADAPK
NRVINGIVELPKDQAVQLEPYDTVTVQGFYRSQPEVNDAITKAAKQKGAY
AFYIVRQIDANQGGNQRITVFIYKQDAKKRIVQSPDAIPADSEAGRAALA
QGGEAAKKVEIPGVATSASPSAEVGRFFETQSSKGGRYTVTLPDGTKVEE
LNKATAAMMVPFDSVKFTGNYGNMTEISYQVAKRAAKKGAKYYHITRQWQ
ERGNNITISADLYK
>t4129 putative lipoprotein
MKRLALILICLLLQACSATTKGLGDSLWDSLFGTPGVQLTDDDIQNMPYA
SQYMQLNGGPQLFVVLAFSENGQQKWVTQDGATIVTQHGRLVKTLLGGDN
LIDVNNLATDPLAKPGQIIDGATWTRTLGWTEHRQVRYATARSVFTWRGT
DRVNVGSEETAVRVLDEEVTTDQTRWRNRYWVDSEGQIRQTEQYLGANYF
PVKTTLIKAAKS
>t1895 putative prophage terminase small subunit
MAKPDWEAIETAYRAGIMSLRDIGALYGVTEGAIRKKAKKLEWVRKNITQ
VRKNGTQKNTVRTTRRPASSGAVQKHSRPESGPPADTKPEAVRKKVVTNH
PPFQPGNQYALKHGGYARRLLLKDEVVEDARALTLEDELFRLRANNLMAA
ENIGRWFTLLEDAEEEQQRKILMDNISAAEKAMMRNTVRIESIVGTLATV
SKIHADTDYRLAATDKVSLEADRLRRDAGIDDGNGERDLNDFYADIQTDA
>t3425 hypothetical protein
MKKKLISGLFLMLWMALLIAAMVYPQGIFPVLAASGVWVACLLTWAVIPV
ALAALIKNGPLWQELRASLLKTITRKENVFTSWVMRLLIVVSLAWTGWAI
TLVFYLLTVIAFWITRNQMAQQVAA
>t1902 putative lipoprotein
MKNTALGKFIFIVGTALLLGGCSGMVMPPYATHGTSVGIIAPAGGYSEWH
TDSRNHTTGDSHSQSQGNCTQSEDSQLSENGLTRTHQSNCNTRSQTHSSS
TSKTRSSSVGFSVGGPVGASIGLIKQMESMNRAPANDMSSNEMFKNFGF
>t4638 hypothetical protein
MTKVRNCVLDALSINVNNIISLVVGTFPLDPTVSKTAVILTILTAT
>t0383 hypothetical protein
MRYRVILFCLFGLLPVQLLWAAPAQRTFSDWQVTCNNQNFCVARNTGEHH
GLVMTLSRSAGARTDAVLRIDRGGLAPPDAKEAAIAPRLLLDGKPLSFNS
SHWRVSPWHLMTGDPATITAFLQTIQDAQAITLKNGVQTLSLAGLKAALL
FIDAQQKRVGSETAWIEKGNEPPLSVPPAPALKGIAVINPTPVPLSEEER
DDLLDYAAWRVNGIRCSLDPLRRETQVSALTDDKALLIVNCEAGAYNTID
LAWVVSRKKTLVSRAVRLRLPFNRGVESNDMELMNAFFDEKTRELVTLAK
GRGLTDCGIQTRWRYDGDRFRLVRYAEEPSCDSWHGPDAWPTLWITR
>t1900 putative prophage membrane protein
MVANDPSAVLNAVICGVIVIVLMFYRRGDATHRPLISLLAYVMVLVYASV
PFRFVFGLYESSHWLVVMVNILICAAVLWARGNVARLVDALRH
>t3829 putative lipoprotein
MYKNGKFIPLLALGFTFFLSGCDYFADKHLVEELKKQQKEQEAKINLLEK
QQKEQEVKINLPEKQQTTVINTTQKIAEVVGRVERKQRLFDYTELDPSQT
HYFIINNGNIGLAGRILSIEPIDNGSVIHLDLVNLLSIPVSNLAFNMTWG
TKKPSEAKDLPRWRQLLLNTKMDSTIELLPGAWTNVTLTLKGVSPNNLKY
LKIGIDMENVIFDSIQPINDTKKKPKK
>t1925 putative bacteriophage protein
MSKIGDYFFEFPASRGLQGNTVVLMMTVPARALTRVLASDNHGNTLERSQ
RELNPARAKKFYEYLRTAYEKKEPFIIPPLVGNCDSFIEFEEFGNTNVGV
ARFPMDAEIKLFDGQHRAAGIAEFCRTYGEPIHIPMMLTHQLPLKTRQQF
FSDINNNVSKPSAAINMAYNGRDKIAQDMVSFLSSHAMFSEITDFEHNVV
PAKSDLWISFKPLSDATAKFSGNGDELATSDIYDIWEAWLKLTAIKGIRH
GCTPVEYKRDYIQFHAVMINAFGYAMQELLRHRPAHIIVQMIEELVNNST
MSELENFFLISSWSGVCASTEKDRATVIASVASQKAASVRLIQAITTKSF
EVAA
>t1886 putative bacteriophage protein
MSFKSGVTTRVDNAQAILDALKSLTKKDVLVGIPAEDSDRDDVSFGNAGI
GYINEYGSPAQNIPPRPHLVPGVKSVEDQTMPQLKAAAQAALDGNAAGAE
RALNRAGTVAARGVKNHIKAANFTPLADSTVEARARRGRKGAKAELARRA
AGESPGTTLAKPLYDTGKYLASITHVVRDKDADS
>t1386 hypothetical protein
MKSTTGINQQISKVQSAIMALKATNTDVQSITIRGNKPVIRVSRSAHCMR
MLEQGKACYLYTGHDHRGYFRQGVFELHGCRVVWPESLW
>t2512 probable lipoprotein
MKKILVCFVGLALTACSANSLNYGAEQVRVMTSEPGKECSYLGDITGSQG
NFFTGGWTSNSNLETGARNDLKNKAYKMGDNTVVLLTQRAGQTGSSWHGS
GSSKQTNVTLSGNVYRCPR
>t2088 hypothetical protein
MATDYIILLKKEVMEARAFISQALQILNDLSEIMTTMASCLKLRISLSVT
LGYVQIR
>t3278 hypothetical protein
MNVYTFDFNDIKNQSDFYREFTQTFGLASEKVSDLDTLWDAVMSDILPLP
LEIEFVHLPDKLRRRYGALILLFDEAEEELEGRLRFNVRH
>t1495 putative lipoprotein
MKKLFICSGLGMMFFMLAGCTTNYVMTTKNGQTIVTQGKPQLDKETGMTS
YTDQEGNQREINSNDVAQLIKAD
>t1447 hypothetical protein
MNALEPLFARLARSTFRSRFRLGIKERQYCWDKGAEVIDKHAADFIAQRL
VPAHPANDGKQTPMRGHPVFIAQHATATCCRGCLAKWHQIPQGEPLSKAQ
QQYIVSVIHYWLVIQMNQR
>t1864 hypothetical protein
MFVELVYDKRNVEGLEGASEIILAELTKRMHQIFPDAEVRVKPMQANCLN
SDTNKSDREKLNRWLGWFQITRVGSDHSKYRRHRIRNSVMNQ
>t4261 hypothetical protein
MSTEKNSSQSSPPPRKPPGLLTLVLWIWPVRLLAFLLVSWMAGVFIEWAG
MFFFWSDQGALHSQSVMNKELGYLSADFTQSLIFSSPSVTTMGWISSAYQ
WAFVDSGLLTWIQKEQRETLSSSDSVVFFLGQVQAWLLSVLSDYLLALVY
VTVVFAVRVLILVLSIPLFVLVIIVAVIDGLCRRDLRRYGAGYESSFLYH
HAKRFVKPAVYLPCLLYLSWPASIYPNLLLLPGALLLGLAVTVVTSTFKK
YL
>t0405 hypothetical protein
MDERIPCKNPQCSHFILPATAARTEGYCMPCVQARYRQEQEEYIRKNRKT
IDAFSGITNPVEMLKLVHEPREHDPLIEWIPCPIPTDELYKKLSDDESRD
MVDYAEELFDSGWQEEAQEIALCLAAFTQANLDNFLRQVINEEELELSSP
LPFHRAPPDVRDALLQKVETDDENRDGILCALAWIGDEVVVEHFNRWRQE
PPAWSASLHILPHRYAHQAGWELTENGRRRDLYFTQCTHLVKQAPEQPAV
FRAVAEYGENCPHCSLPLINLFEVAPSAVGLSTQGWPGQIRILTCQCCTA
YNTVFATVDPQGQPRWYEKNALSTLAVENSSDWITLPLDVLHPGESRLPL
FAAEIFLPTTFSQLGGHPAWVQDTDYPTCPTCAQTMMFLAQLSYEDIEEE
EYAEGMLYGFICPSCQTTATSYQQT
>t2498 probable secreted protein
MKTGYKVMLGALAFVVTNVYAAEIMKKTDFDKVASEYTKIGTISTTGEMS
PLDAREDLIKKADEKGADVVVLTSGQTENKIHGTADIYKKK
>t1911 putative bacteriophage protein
MGRLKSERYIQRGRNTTAMLGVVHSNQDMIFVQKVLFIGFVSTLIDSQDQ
PIS
>t2134 hypothetical protein
MWYFAWILGTLLACAFGIITALALEHVEAGKTGQEES
>t4354 hypothetical protein
MTGKPSERHTGFIISGEMMVRDCFGNEYLIHAGEAFEVSENHDAWVVGDT
PCVALDFTHFLR
>t2286 hypothetical protein
MKHPLESLMTAAGILLMALLSSLLLPAPSLGLALAQKLVGIFHLMDLNQL
YTLLFCLWFLLLGTLEYYVIRFIWRRWFSLAD
>t0019 hypothetical protein
MIQAPVYRGFSPPGTEEEQTMTISIHASAFDVNSWYQKITLTFINESGNP
VDMNHAAISFTASGHIDPWGNSGGTLKGNPPLTLNDSSYGTLETNNIIIN
NSDALLLQPGERGTLSFSLAATQVPVKMSAVTLTLASSSSEDAESATPSD
QETPAIPAADEQPAESDVPEKDNDLQERGLTLNVSELNAASWYQHVTFTL
TNLYAQAVDLNQLQLNFTASAHPDPYSPFQGTMLGNQAVTLASDGGWPIE
KNTITINHDGALMLAAGDIAELQCYLAATQTPVAISDLNATLAHDPAHQG
KICVHFPAMTQTVALKPVIELLFPAGETRRFVGEWGEVLTISDLSAGTYR
LIVPVLANDEMQIAPVESSFTVTLQSGDAAAQV
>t0732 hypothetical protein
MKYMKHFFLILFGISSPFICLATSVEFNVTKGIKASITWVDNKKVEYEIT
GSDRVAKRGYYDVDTENNIHVKYGDYNFDGKEDFVIWYTDDGMGIYDIYR
VFLYSEKMADFKEIKPSCGDDFINLNLNKKKRELISLYYSHNEAQRCITN
VFVGENKLK
>t2586 hypothetical protein
MPYVYANAKALQDTEKVGNHHQCVELIQHYIRVGQASTWQQGAAVFGNKN
IEVGTVIATFVNGRYPNHNSGNHAAFFLGQDTGGIWVMDQWKDDIAKPRV
SKRYIRKLHNGSVRSDGTYIRMSNNAEAYFIVE
>t3956 hypothetical protein
MTAGRSAKRKPDGADAYPAYMPGEKIKTTSQFSKNFALQAGGKL
>t3608 hypothetical protein
MEEKRVASVFIKPSLGFGAAGVIALRRHPDGAKQVLYSAIAISGQQLFNS
KKIQQYRQKEDIQLIIDGVLQQENLVEQWLPKASVKHKTYDLRIVCLDGE
IIWRVVRTSSQPITNLHLQNQAYRFESLELSAAKVAEIDTLCQNAMRLFP
GIRLAGIDVLLTTSLTPYIIEINGQGDLIYQDAQQNNLIYQAQIRAMRKQ
NV
>t0619 hypothetical protein
MGISKDVALVHFAGIYTGIGLGAYIKSKSRDDMRVNSAFTFGEKAFLGWN
FGAFSTEAYIRHFSNGSLTDKNSGHNFVGASISYNF
>t1912 putative bacteriophage protein
MRIITRKKPAFTDLYQTGVLTRIAAVKTDSGGWRLFGVWRDQDIAVFVEA
ARGGIREWSGLNYLAEFVFSCGISLWEVHNKTDRKTPA
>t0884 hypothetical protein
MSTREDNATRSMGGKLALWVFYTFCGYFIWAMARCVWLMSAIQTEPVLGP
ISTPGSATEKWLNALSLGVVWLILGSIAWYTRPRKNRGYPADTQPETRKH
ARM
>t2863 hypothetical protein
MPDGLESTLEFDPSTSFINTTAVDNGITYSFTGEAVKATAHASCTANQAT
NSAICVWDKSFSLPLTLLTKQNGITINQQNLTVSVIPGGGNNDTNQYSSL
SGHIIIDESVSSYTNYEPETSGIENLKLKWTVTVTEDTEGDVGIWNGSLV
LNGSPASTLHRYLLPEILDASFSENAIAHSTEVAMTGAKNNVELASVYLY
HGNTPCDDTSLCNYSLNKTSVSLVCHDLNAQLNFSNTCNASGTTHCLNGK
VGTIKGTWERVNIDTTCAITVLIPYE
>t1369 hypothetical protein
MSAIHIFKAGTHTDMHGKKLPFTPDDLAACVKAYDPSVHEAPLVIGHPRT
EDPAWGWVKALSLSGVDLMAEPAQLDPQFAEMVTDGRFKKVSASFYLPDS
PSNPKPGVLYLRHVGFLGAQPPSVKGLKQVSFSEQEEGVVEFADWQAITN
ASLWGKLRDFLIARFSLDEAEKVLPEWQLNSLREEAYRDTLSQDAAGAQF
SETGPGPSSASNEESSMTKEEIEALQEENRRLKQQAADRDARDAQVRQEQ
LHKDNVAFAEKLVAEGRLAPRASSVVVALLDAVAGGDKPVEFAEGESRTP
LATAFRSLLSDGEPVMNFAEQATKDRVGDAVKVDVAEFAEADPERLALHQ
KAVALSKKEGISYEAAVARCL
>t1882 putative bacteriophage protein
MATYSFMDVTASISGPTGEIDLGYGSASSEEGITVAMGGPKNTMTIGADG
EVMHSLHADKSGTVTVNLLKTSPTNKKLSLAYNAQSQSSGTWGNNVIVIR
NKVSGDIITARSVAFQKQPDNANAKAGNTMPWVFDCGKIDQVLGEF
>t4528 hypothetical P4 phage protein
MKNPLPPVLRAALYRRAVACAWLTLCERQHRYPHLTLDALESAIAAELEG
FYLRQHGEEKGRQIACALLEDLIEAGPLKAAPSLSFLGLTVMDELCARHI
TAPVLH
>t3039 hypothetical protein
MAKYIFLFIWIVTFSVSAGERGYYLFVWGNPEGKEYFKEYRADERIYAVN
KSCWNERAGNSIRIVYVDTYPHGITDSLINSFLAGNNKSIINIRVSLSNF
SDDQILHGFDGMLIINKKNEEIEIFTIPVVGANYSYKDKFLVNVHDFELF
DGKICNALMPIDSYFSP
>t2935 hypothetical protein
MTEHNRIPARQIIVYGDCWPVTIAVAHLVRRFLPGCNCETAYRLPVLLQQ
LRRKPEAILILCLRPREHLFLFYSLRQILPDYPVMIISDELFFSDRVVLK
VYGGIPALLEQELAEILIRWRRDEQWAGGARLRRTGGLDAFLLSPDPVTG
FLEVPPIFNNPKRLMNYMDQLMHREILACGVSLAQLRLLQEVYRGRGRLS
ALCGRLNTQEKQIWQDKYRLLVKLGMRNRLRELLFGTRFCKSLQRTPFIA
PQ
>t1351 bacteriophage tail fiber protein
MANLPETPQWESGIYQIEVSDPVLGGPDGISNRQAKQLASRTSYLKQKVE
KSGTDLAAHIAAVDPHTQYATKASPTFTGTPTAPTPANGDNSKKLATTEF
VAKALAALAGSAPETLDTLKELADALGNDPNFATTVLNKLAEKLAKDQNG
ADIPEPALFVKNLGLGEGSALPVGVPVPWPSATPPAGWLKCNGAAFSSEM
YPNLAKAYPTNKLPDLRGEFIRGWDDGRGVDAGRALLRLQDDSFEAHRHE
SFFYAGISRNEIPLKNLPSSDEMLTLSSTTNALSPDGIDATNSLIGNDDY
NCLIEGNKNNKRTATGLSTSIVGATETRPRNIAFNYIVRAA
>t0721 hypothetical protein
MMKMSRTAIALSLLGMAAHPVCAAPSFEETARQVIIAFQQRDNAKINALI
DKKVGMYVLYRIGAGFDYKWMKKFDINKPIPDFNYLLGQVGWFSEHIPVD
KEFDHHTEVEYVCEKGWDHAGFFVSYTGSDNALLTFSMVNGADSGDQASD
TRIANARRLELQSERVVAVPKEWGDGLIFHLSELHGLGKGWSLTLLDLVT
EDCSA
>t2701 hypothetical protein
MTHCCRIALMSANSVMREPAGRIGRSRRHPAKISHAAPTPLRSSAALKSI
ENSHILYG
>t2013 hypothetical protein
MKNSDRSWLYITLFLQIKSQIITLNLLMTNFDYYSACKAKLDKVVYRVTF
NSISEAIRNVSKGVNYDNE
>t4262 hypothetical protein
MFSLPEMVSAAEKDELALALRQLDQVQSALERAKIVAVQDNSDGRFFFDY
ERATRDLKTMKQGIETYLEPSRAQPRDKGSLVGQYRKEQP
>t3579 hypothetical protein
MGNMTLFIIGIALLSTGTYLMRLGGAKLGSRLALSERSQALLSDAATVLL
FSVALATTFYEGEHFAGMARVLGVGFAVFLA
>t2304 hypothetical protein
MVNNRLKMVIAILIVFSLVYSIGFITPMNSDDYTYALRELSLSSVKMHYL
GWSGRVVSDTLSTSLLKFFSPHIYNAINSAALTLMVLCWTMIPATLTKSS
PSPYVMIFLFFLYFIANPALGQTNFWLVGLANYLWTNMFIAIYILISIYL
SNAKKSNLILFVYAISSIFAGCSNENTSLVVVLISVAYFFIMNRNKYLLI
GVFGSAIGAGVLLLAPGNLSRASTIQDWYNQPLAWRVLEHFSERLPSAMG
AYWQVYIAFIILLISVVLSRNSSSKLMFGSFLFILGAIAANVAFLASPAM
PSRALNGALCFMILSISFVAHSAFTKFNKASIYLSITTYAMAFLYFIPSY
ILYYSSIKSISKQTEIREEIIDRAKDNKQDQAIIPDYYFPPVLHAGPSLD
TFNSEAMSRYYGIDVKITAPGFFDYSRAFNLKPLNINAKICNNVYIKSLW
IYKQQMGIKTFVIFEFNKNPADSLDENTAMFISLKTKDGKVINADVDKKT
FQIDGRWLSGRAINGIDSNELESITSGTWDVRTGARTNENITEIIK
>t0874 hypothetical protein
MPRHGATLSVIAQVKCCDLTAMAMYQRLMRREKLAGTWQIILRHNAVEIK
GRGQNEL
>t1782 hypothetical protein
MCVISLMCRFSVMMRAMNILLSIAITTGILSGIWGWVAVSLGLLSWAGFL
GCTAYFACPQGGFKGLLISACTLLSGMVWALVIIHGSALAPHLEIVSYVL
TGIVAFLMCIQAKQLLLSFVPGTFIGACATFAGQGDWRLVLPSLALGLIF
GYAMKNSGLWLASRREQHSANTAVTK
>t2782 hypothetical protein
MKIITHVVPGSGMAAIYDDIADSSRFVIKGKLRHVENDPKELLICVPMRS
EWLFYWIKGEKYCARRWARKNIKTLCNQIKFEEAIL
>t1612 hypothetical protein
MPFTFQIGNHSCQISERHLRDIIDHKREHVFSTYEKFIDFFRNIFTSRSL
ISDYREIYNLLCQKNERPDITKPFSLRPFSKRDEDCTRWRPLLGYIKLID
ASRPETRDKYTVEVLAHQENMLLLQMFYDGMLVTETECSERCVDFLKETM
FNYNSGEITLAALDNDHLTPSEAGSNGIYEAFEHRLIDFLTTPATPATAS
GDESGAIDQTDTSQPAAIEAFINSPEFQKNIRMRDIEKNKIGSGSYGTVY
RLHDDFVVKIPINERGIKVDVNSPEHRNCHPDRVSKYLNMANDDKNFSRS
ASMNINGKDVTVLVSKYIQGQEFDIEDEDNYRMAEALLESRGVYMHDIDN
VGNILIKEGVLFFVDGDQIVLSQESRQQRSVSLATRQLEEQIKARYLVKL
KQSETEGNTEDIEYYKSLITELDELIGEEEQAPAPGRRFKLAAPEEGTLV
AKVLKDELKK
>t1019 hypothetical protein
MKFDNAWNQGVWYALRSATGRLSPQPFSVEDLPALGEECRRIMKITSAVY
TSKKTLSERWFASAELLNLYLTKYGLSCLNCSRKKTTA
>t1365 hypothetical protein
MNVLPVLDAVLARLREKLPQLQVEYFPEKPAEYRLNHPVGALLLSYAGSR
FDRPDDTGAVIQSQTIQLCVTVVFRQLNGKKGAINVLDAVRRILGGHTPP
GCRRRIWLTREVFIGEVRGLWQYALDFATESVFIEDSDLPSGPLLTEVNY
EESE
>t3869 hypothetical protein
MSKSSWLLLLGLCASGSALAASSESAFLAQHGLAGKTVEQIVDTIDQTPQ
SRPLPYSASITSTELKLSDGEQIYTLPLGDKFYLSFAPYEWRTHPCFNHS
LSGCQGEMPNKPFTVKVTDSKGAVIVQKEMQSYRNGFIGVWLPRNMEGTL
EVSYNGKTASHAIATRDDSQTCLTELPLR
>t3076 hypothetical protein
MALDNHKSDEFILKQNLAALIASKNATLEKVTQEVVSIPAALVRLKWQNR
REMYALQVKEEIYGATINAIIEQHPELRDKIMSRLESDWQHLLARETATL
RLTRKLSDGDYRTRNVTTVARQEK
>t0338 hypothetical protein
MEIICPVCHHALERNGDTAHCETCAKDFSLQALCPDCRQPLQVLKACGAV
DYFCQNGHGLISKKRVNFVISDQ
>t1628 hypothetical protein
MHKETQPIDRETLLLEANKIIREHEDTMAGIVATGVTQRNGVLVFSGDYF
LDEQGLPTPKSTAVFNMFKHLAHVLSEKYHLVD
>t0879 hypothetical protein
MGAGQKINCCQGNALARGDNLRCGTVDAVQKIWPPDWLKYAHMTIQPENL
QNA
>t0255 hypothetical protein
MAIKPFNYQQDFSSIDFRQQPELYQVGRGEQGGLLVEPYKSEILPFWRYK
DEASAMKSAEQIYQLFEAYRQQDDFVGMDMARKFIQMGYTRARRYANYKG
GKKYAEDGSLNTRGNDPIKAAAATVFKGWWDKIRQDEDYLKRKRQHQARW
G
>t4298 hypothetical protein
MADIATIFHWSPSITDVMPLTDVLEWRHKAIQRSGASDE
>t2567 hypothetical protein
MTAMLFFLLLVSKPAIGSNTLKIPTHHRNIPGVIQRTGKIFQPCFNVSLV
DIGHAHPFNTAQQFTGILQRDHDAVFHHPNFNGHSVNKPGLRHPFTA
>t1380 hypothetical protein
MKNLKKFIPPVKKPRLSGWLLTSVLLLGIIALVSPQQLPVVIYKLALITL
AAVLGYWLDRSLFPKARPGQYLKHDDRLMAEGRFPVQTGLHLVFSAALIR
RALIVAAVCLAVATGL
>t1827 hypothetical protein
MPAKPRTSKTVTKNIRFSYSMLEQIEFALKSEKTRNFSAWVKEACREKLC
NTGHKL
>t1759 hypothetical protein
MMEKNNEVIQTHPLVGWDISTVDSYDALMLRLHYQTPNRPEPEGTEVGQT
LWLTTDVARQFISILEAGIAKIESGDYQENEYRHH
>t4522 phage polarity suppression protein
MTTVTIQQAFEACQTNKNTWLKRKAELADLEREYREQLLAGDEQIPRRMQ
DLRDNIDVKKWEINQAAGRYIRSHEEVQHISIRNRLHDFMQQHGAELAAT
LAPELMGYHEQIPAVKQSAMQHSVDYLREALSVWLAAGEKINYSVQDNDM
LTTIGFRPDAASRDDNREKFTPAQNLIYTRRRAELATR
>t0735 hypothetical protein
MPIMNNYRLTFSLCLLRSAILFGIILISVNCNNQKSKELVLPVDSTQFTQ
LACYYKDINSDDNIMELKLPDEYKEKTNLFNQQEIKVPVKGESFLSPYIG
DGVRYYELGYFEHDGNTYKLIIYNKIGESDTLLLNVQINSYDAKGNLVDA
LLLSSFFAYEDIVRFSDFVIRQDYTISIDSYVIYRWYEDSKDGHLVTIKF
KDQAPQIYIKEQYQMENGRFKLISRNAVSQGEKRSER
>t1009 putative DNA-binding protein
MSRNYTPAQKAEIQKRLTELVRTHGRMTFGELRKITGLTIFTARHYLEKA
ESCGDLYQAGRSGIFPSEQAFLLWK
>t2843 hypothetical protein
MLNSNTAVLCRILHPDAQKALLDWFATLSERYERKDGKRVNGRAWRAELK
RMAPPYGVMICEGHDALRQALLKHMRLQPLDEMALALFVSVAVHIKSHKE
NISFAAQLGEKLKGSTSCVSGLRFERLQKASDPETFCQLLIQAVKIRGTE
GVNVLSLADGIFLWMEEWQRRENHQPEFRNPFERSRIRWANEYLSTSRGK
>t2025 hypothetical protein
MKHKHGWASVVCCFVLFIVVCLSLTMHVQGAFRAAGHPEIGLLFFTLLGA
VASFCSHRREVIRPLIGAMLAAPFCLVLMRVVFMPTRSFWQELAWLFSAV
FWCALGALCYLFISSLFSHRRKKRR
>t3562 hypothetical protein
MTEAENAVAIVKEFLVASMIPDAERAATYMHPEVKITFTGGRAMAGAADI
AQFNGARYKWVKKALGKFDAVQHDDYVVIYSNGTLYGEWPDGRPFADNRF
IDRFEVRDGKITRMDVWNDSAEWILAPDISR
>t1123 putative secreted protein
MKRKLIPFTLFLAALSASTTSIAASQEISKSIYTCNDNQVMEVIYVNTEA
GNAYAIISQVNEMIPMRLMKMASGANYEAIDKNYTYKLYTKGKTAKLVEG
DDKPVLSNCSLAN
>t1146 hypothetical protein
MLIRLNWSEKKEQAVDSAESSPSFKVYKIYNKSHKCDAHHVFMIFYYSFM
I
>t0322 hypothetical protein
MDISLTNLMELVKKVNRNKVPNPMPAEEISCLRVRKYRDPQNTETTELPE
SLKALLAYDRDLLSNYNMPVIETLQRFIDNEGVIHSYSPDEEAYYGAGMD
SSGIDIEDLMPVWSNDPRLPALIRIDHVGDQAIFIYITERDANGEYPIAR
MERNEFWLAESSLVEYLYNIISGAKDIGFTEEDLHLPQWKAQQKMNEQRD
AALLDLEDYHEAFWAKLDALVD
>t1342 hypothetical protein
MNQRHPNLNNIISANANFSYMLLINTKVTIGNAYIYGEYVCYYIFMVVIC
LISQFAKKDGNIAAGRHSVINFVNWLRVFFNLIFLSEKDVIMNRNAEKTV
IKGLFRLIKIALMYTVSVF
>t1037 hypothetical protein
MRLIIRAIVLFALVWIGLLMSGYGILVGSKVNAAGLGLQCHYLTARGTST
AQYLHTNSGIIGFSDCPIFRKIATVVDNG
>t2738 hypothetical protein
MQEGGDAANPLERTSIRDRGERGKPTHMQREVLRVNT
>t3411 hypothetical protein
MQDYFLESLKLQRIDFFLKLVAASECSDEEKGLALQWVSELTDELMAKIR
SHEYSRSMDVIS
>t3414 hypothetical protein
MPYQLVELSPVANDLEQLGTKEKFWFYFSDDTVNLQLFKYSRLGTGEHWS
EKCAAELCHLLNIPHASYDLARYNGRFGVVTQNIIPSGFRMVMGNEVLHS
STFDYPGPLQAGEKPVRVREHTVTRVLGCLDRESIKPPPSVYDLTGLNAA
DVFCGFLMLDALVSNQDRHHENWAIMLNNETGEQFLCPTYDHAASLGREM
LDDERNERLNTKDKNRQIPCFVRKARSELFKAKTDKKPLLTVEAFQHAVE
GRVAARDHWLGKLSVLTEDSITDVFNQVPSSCISDSARRFATLMVMENRR
RLLE
>t0620 homolog of virulence protein msgA
MPTIKKLADARDVILHELILHELIGHLSCAVVTVKSMQATNVNHVCTKTE
KAHLHPILEKMFAVAGK
>t2767 hypothetical protein
MKSYQRRKVNSSVQKINTTVVVESHPISVVGQYGKQLSLNFFTTSVKISC
FTPHVFPVSKFFQSLANPA
>t1013 hypothetical protein
MNHATTSYCHLMKKAVHNRIRTDWHQGHVLQQQIHKMKQDIKNEGKIEYE
KRTELSEVTSLQH
>t4265 possible exported protein
MSRFRHALKDRDQHILTLRLACGVLLLLLILVITGWMRAPSDLTIHNPPD
LRSGSTRKWWEVPPSTVYSFAFYIFQQINAWPKDGEKDYPMKIAQLSPYL
TPSCQDFLNKDAELRSQKGELLDRVRVVYEIPKRGYKPESVIIESDDSWE
VSLDLVVDEYYHTEPVKRALARYPLHIVRWEGDPERNAFGLALDCYKGVP
QRLEAAVVPEPEKKGMF
>t4044 hypothetical protein
MSKTLLQIHFNFSGPFGEEMTQQLVGLAESINEEPGFIWKIWTESEKNQQ
AGGIYLFESEETAQAYIKKHTARLKNLGVDEVTFTLFGVNDALTKINHGN
LCR
>t4334 hypothetical protein
MLTKEPSFASLLVKQSPAMHCGHGWIMGKDGKRWHPCRSQDALLAELSAK
KQGKPWLLKVMLRLFR
>t0040 possible sulfatase
MNTLTATSVVLPAPRPAINQGIDINNEIVLNHTAIYENCLAQVTQENTVE
NALMLLDPYGTAPLSAYAGVWSLEPAEIMVTVQDAAKTAMPVEHLYTLTA
GANLLPVLGLVAETENRIVFSQADTPLAVYTLTTQPLPPVDSAEVVLGFP
IINVTQPATDVDKMAPGFYFVTHFDRYNYALDQNGLVRWYVTQDYPSYNF
VRIDNGHFLTTSEAKNTYLDMYEFDMMGRLHTFYNLDNQFHHSIWPWDSN
TIVAPSEYTSGRPDDLKTNEDGVSVVDLATGLETAYYDMAKVLDTTRVSR
PSGTAPGEDPTVKDWLHINQSYVNETNQLLIASGRHQSAVFGVDLQTQAL
RFILSTHEDWDDAYQPYLLTPVDSEGVALYDFSKQEDIDAADRDFWTWGQ
HNVVEIANNTPGIVEFMVFDNGNYRSRDDSKSLLPPDNYSRIVHFVVNMN
EMTVMRPFEYGKELGARGYSSCVSAKAIQQNGNIVVHFADCTFDENGRAI
SCQPGESDIIDPQAGSEAMGLLILQEIAPTEKTVLFEATMTSGYYKNAET
NGEGYRYDITSFRVYKMDLYA
>t0021 hypothetical protein
MKKMMNDAFAKDNNENLLHSFLFSQQAKPHAAIDALFSALLPFGQPFTLG
IGDEFYLQANDEHYIVLLESGIVSFCRDDNRLHISSSFAPSVVGMVDSYG
ATYNVPARPEHFLLAETVCSGRFVRLPDFIKIADECDLWHDVARCLAYRL
MVMSARDRELVGVDSYLKVRALLIEIWAYPQAYRENIIVLNFIQRRTGIS
RSRTMKILSELKKGGYIHIDNGRLTALGKLPVAY
>t2563 hypothetical protein
MARLCGILTYSRVERLSVRLYGEWMPQAGFINGMPVKVRVMRDCIVITPQ
HTRGLFVCIEGMGVTFINQKKVKAWLKTFPGALNDTGDIPVIKRSRLEGG
I
>t1349 putative bacteriophage tail fiber assembly protein
MNNAAAVLDQNGIAITAGDITVYNYDAENREYLSATVEYLAYGVGIPAHS
CIDAPPEKRPGFAVCRDTDQNIWKYVPDHRGETVYSTENGNAVQITQPGD
YPPDTTVKQPATIYDVWDGETWVTDAGRQHAAELEAAGAHRQQLEEQAMA
SVELINLKLRAGRRLTPQETEKLNAVLDFIDVLNATDISTAPDINWPEMP
LAAAS
>t1360 hypothetical protein
MSQTQSDTFTLSYPFTTAAGTRIEQIELKRLTVKDLKQVRKINKDPADWD
EPLIARSTGILPEDLDNMDLADYMELQKRFQKITGLGKSDKNTDAGAGPA
GEMVQVSTGGD
>t1004 hypothetical protein
MKKILLPALLLATSGVALAAPQVITVSRFEVGKDKWAFNREEVMLTCRPG
QALYVINPSTLVQYPLNAIAEQQVAEGKTRAQPIAIIQIDNPAKPGEKMS
LAPFIERAQKLCDPSNS
>t1921 prophage Kil protein
MTNYGTTTLPRTSVVPGMLVKYQGRTYRASANVGKGLYLFTLFERLRTTN
DEIEVYLNQHGKPATH
>t1884 putative bacteriophage protein
MSNNSSTEPGWLTPVSGDPDYDEALDRLLSQWVRNVSGLPTGMVRPRWQK
DQPPLLPAETNWCAFGVTGWPIDNSPAFTNQTEEGAQLWRHETFECMASF
YGPAGMTFASRFRDGISVAQNNAELNALGLSMGDYTGLTPFPELINQQWV
RRYDITVRLRRKVVREYGIKSLVDAPVSFFGD
>t2502 hypothetical protein
MVAASRMDGRDMVRQRASAGGRRNVVQTAHDLRRISLLTDTPGRAFVSRE
SAFLSTDRRLSITFARLISPLNSG
>t3881 hypothetical protein
MTEDDLPLRAVADATFSRFARVEPLAPGGSHPRQASDTKNPAR
>t1124 hypothetical protein
MNAVVFLFTDEKCARKAYYNLSITDICDE
>t0944 putative lipoprotein
MKPLIFTLSLLALTGCTITRQAQVSEASPISGIVRLTYNQPLFFTSRTDD
YVSHGTATRECQQMGYADAVSFGQPVGTCSIYAGSLCLNTRFTLSWQCRG
VSVPQIMPLYY
>t4290 hypothetical protein
MRINVLLLTSLLVAGPALAGEAHVCKSQTVANSAANAELTDNTVFKCGES
ISGTIPSLAREGWKIVQQTDQADVTDPSKTYAQLIIQKD
>t4365 hypothetical protein
MGAVDDSMEQEGDYASAAGAERKNCLYVKKIICEPELPRFACLLPIF
>t1512 hypothetical protein
MLTKTLSVVLLTCALFSGQLLAKQQDHAFVWFATGGHQLRHEADSDELRA
AAEESAEGLREHHNWQKSRKPESYFR
>t0719 hypothetical protein
MYSRADRLLRQFSLKLNADSIAFDENRLCSFIIDNRYRILLTSTNSEYIM
IYGFCGRPPDNNNLAFEFLNSNLWFAENNGPHLCYDNNSQSLLLALNFSL
NESSVEKLECEIEVVIRSMENLYHILQDKGINLDTDYT
>t3849 hypothetical protein
MKNNTGYIIGAYPCAPSFHQKSEEEETEFWRQLSDTPDIRGLEQPCLEHL
HPLGDEWLLRHTPGNWQIVVTAIMETMRRRSENGGFGLASSDEEQRKACV
EYYRHLHQKINKINGNNTGKVIALELHAAPLAGNPNVAQATDAFARSLKE
IANWDWSCDLVLEHCDAMTGPAPRKGFLPLVNVLETIADYDISVCINWAR
SAIEGRDTSLPLIHTQQAKQAGKLGALMFSGTTLDGEYGEWQDLHAPFAP
FCPQSLMTEKHVKELITAAAPELLQFTGIKLLEINASADINHRINILRDG
INMMKKATRR
>t2519 hypothetical protein
MVDLCWILNPAGLPYTTMLLPIRKRNGFFFRDICISKLRRTGMKTFTFKV
NSVKTFESDTAGDLFSWLRLLQPGTINELKIVKIGKNTYMFSLNRHLYNV
CTTSSNVEL
>t4285 hypothetical protein
MSVMTTNETPASTVAEPEVFRRTRNRFNQWLQAEFDRHYHTMRDGGYRSF
LKKNHPTELLRYDEACEALRREEFARFAELQTLGLYLHLQVQQKEAQFRR
RRNRLLLGMTLTGVVTSVLLYGHFHPEQFLLIGQELATLPGRILGFVGRL
VP
>t0951 hypothetical protein
MINYDVLHINVALAHCRNAINRVKLKHNLIFLQSRSEL
>t2490 hypothetical protein
MLSLNKPLQEFNRLDKCLSKHGTRFEFVNDKEIICSPDESNTHTFVILEG
VVSLVRGDKVLIGIVQAPFIFGLADGVAKKEAQYKLIAESGCIGYRLSSS
QTLAIIEQNQLWREAFCWIVWKSQVLELRDKQLIGNNSYDQIRATLMTMI
EWDEELRSRIGVMNYIHQRTRVSRSVVAEVLAALRKGNYIEMNKGKLISI
NRLPSEY
>t3040 hypothetical protein
MQDHFSLDSNDITHWLYSHWDVFKIWFVLQHYEKEGYEFKPFIHKIILDA
LRRQGKKSPGA
>t0738 hypothetical protein
MFLLLPPYFLAGATTKKNKPLIAARIAEEKEKDRLKDLRRDTRRRQWALA
LADILARRNGLPIKGVELVFKLDDDKHRYLAQQVKKELGLSENLNGAALR
HKVEDILRRWPAGIGSSPSTFYHHLAAQGQVRDALAFDCMRTAFLTRCIA
GLGWCDENEAWLVLLLNAQRAQDCFDSWEDYATAYVRARRVWLTLRDTPI
ALAGRDLQEATHYLQDPVSRWRQLPWNEFKIFEPI
>t2950 hypothetical protein
MTNNTNDTIKIDPRTPEGRKALRLMVVPPKALIATLGLPAKENRPYYSKA
ALCLMAVDAGLTPRDFM
>t1927 putative bacteriophage protein
MMMKLINRSKQSPVGRRACDIALAAHHEKFGDYGRQKHVTNYTVVVDGVK
VPVEVVNRATSYVATAMIGVRKLRNLPAQAN
>t4281 hypothetical protein
MGNLPTVAGFIPVFRVQTVDHFGPVSSLKERSFVSLALRRLNHKSYILPI
GESLPDTVMVFSVLLLEKTMNIFLTSLVSILRKALPRIRHGKSEWIANHT
GYLRFQAEVWLDDNDHFHAVVNKRSGWMNPRYEQVVDCGKFDSFHCAMNT
AYSQALELAHLRYAWELTD
>t4548 hypothetical protein
MSASGIYEARHLKEHREIVSSPILLAFAYLADGIRRMKHENYLMFYIRIQ
HHIKTGSSQPFKAMCQIKWQLININLKTYISGQIDFFITAL
>t4360 hypothetical protein
MEQCKNDAILEHIKNYSKHIDEFRSQANSQGIWLFISTLGCWSVNIPLIQ
VIAAILLFCIFIFNSKQDMTEKRAFHKIEEDIAKDIDSNLIGDSRKARLY
DLGLVEKYRKAIKPVLKTSPIFIVCYIFYSISFLVFFSNLFPRMKLIFNF
>t3036 bacteriocin immunity protein
MKLKENISDYTESEFIDFLRVIFSENESDTDETLDPLLEYFEKITEYPGG
TDLIYYPETESDGTPEGILNIIKEWRESQGLPCFKKSK
>t1392 hypothetical protein
MTDTIMEIVVWAFLLTGIAVCICAGFALVALLTHVVTQWLWEKLKAAYSL
KELSDAVRAWEQQKNTGDTEQ
>t0030 hypothetical protein
MKSMRCVIPVILLSFIVHEGTAKPTAQIHFMGSVVEAGCWNDVGTLEIQC
YNKEGVERYIIVENIITPISSPHATVKRDYLDEDKQLTVLRIVYD
>t1803 agp, glucose-1-phosphatase precursor
MKKSLLAVAVAGAVLLSSAVQAQTTPEGYQLQQVLMMSRHNLRAPLANNG
NVLAQSTPNAWPAWDVPGGQLTTKGGVLEVYMGHYTREWLVAQGLIPSGE
CPAPDTVYAYANSLQRTVATAQFFITSAFPGCDIPVHHQEKMGTMDPTFN
PVITDDSAAFRQQAVQAMEKARSQLHLDESYKLLEQITHYQDSPSCKEKH
QCSLIDAKDTFSANYQQEPGVQGPLKVGNSLVDAFTLQYYEGFPMDQVAW
GGIHTDRQWKVLSKLKNGYQDSLFTSPTVARNVAAPLVKYIDKVLVAERV
SAPKVTVLVGHDSNIASLLTALDFKPYQLHDQYERTPIGGQLVFQRWHDG
NANRDLMKIEYVYQSARQLRNAEALTLKSPAQRVTLELKGCPVDANGFCP
LDKFDNVMNTAAK
>t0567 ais, Ais protein
MLAFTLRFIKNKRYFAILAGALVIIAGLASQHAWSGNGLPQINGKALAAL
AKQHPVVVLFRHAERCDRSDNTCLSDSTGITVNGAQDARALGKAFSADIQ
NYNLYSSNTVRTIQSATWFSAGRSLTADKKMMDCGSGIYASINTLLKKSQ
NKNIVIFTHNHCLTYIAKNKRGVKFDPDYLNALVMYAENGKLFLDGEFVP
G
>t4336 apl, phage regulatory protein
MSTDISIRVPKEMATPAEFAEWEGISRGSVYQKIHHGQLAKYMVKKEKNK
GRVSLRYLMYKTDQVRESLGHSNFRVIVGQ
>t1405 asr, acid shock protein precursor
MNENQIEGIKMKKVLALVVAAAMGLSSAAFAAETATPAKTAAPAKTTQTT
QHHKKQHKKTVEQKAQAAKKHQKKDGKKAPAKSTSKTTSQPAA
>t3112 assT, probable arylsulfate sulfotransferase
MFDQYRKTILAGAVALTCGLTAASTFAAGFQPAQPAGKLGAVVVDPYGNA
PLTALVELDSHIISDVKVTVHGKGEKGVPVTYTVGKESLETYDGIPIFGL
YQKFANNVTVEYKENGKAMKDDYVVQTSAIVNHYMDNRSISDLQQTKVIK
VAPGFEDRLYLVNTHTFTPQGAEFHWHGEKDKNAGILDAGPAGGALPFDI
APYTFVVDTQGEYRWWLDQDTFYNGHDMNINKRGYLMGIRETPRGTFTAV
QGQHWYEFDMMGQILADHKLPRGFLDASHESIETVNGTVLLRVGKRDYRK
EDGIHVHTIRDQIIEVDKSGRVVDVWDLTKILDPMRDALLGALDAGAVCV
NVDLAHAGQQAKLEPDTPYGDALGVGAGRNWAHVNSIAYDAKDDSIILSS
RHQGIVKIGRDKQVKWILAPSKGWNKQLASKLLKPVDDHGKPLTCDENGK
CKDTDFDFTYTQHTAWLSSKGTLTVFDNGDGRGLEQPALPTMKYSRFVEY
KIDEKKGTVQQVWEYGKERGYDFYSPITSVVEYQKDRDTMFGFGGSINLF
DVGKPTVGKLNEIDYKTKEVKVEIDVLSDKPNQTHYRALLVHPTQMFK
>t4526 cI, phage immunity repressor protein
MHLLLAQWFHDLCSYRVMVAQAGPTSVGPVSVRAGISTPVWATTTERGNS
GGSITCYLTEVALMATILTPSYPQFVFVFAAVRRADRKPRICMLRTVAGD
EQAARLSLVRDYVLSFAGRLPVAEVRA
>t3404 cII, regulatory protein cII
MFDYQVSKHPHFDEACRAFALRHNLVQLAERAGMNVQILRNKLNPAQPHL
LTAPEIWLLTDLTEDSTLVDGFLAQIHCLPCVPINEVAKEKLPHYVMSAT
AEIGRVAAGAVSGDVKTSAGRRDAISSINSVTRLMALAAVSLQARLQANP
AMASAVDTVTGLGASFGLL
>t4335 cII, phage regulatory protein
MFDYKISKHPHFDEACRAFALRHNMAKLAERAGMKVQTLRNKLNPEQPHQ
LTPSEIWLLTDITEDSTLVDGFLAQIHCLPCIPMNEVAKEKLPHYVMSAT
AEIGRVAAGAVSGDVKTTAGRRDVISSINSVTRLMALAAVSMQARLQANP
AMASAVDTVTGLGASFGLI
>t0069 caiF, transcriptional activator CaiF
MCEKYVERPLYLLIADWMMAENRWITAREISRQFDIEHCKAINTLSYILS
EVGEIVCEVKMIPNQIAGRGCQCQRLVKVVSIDSQLYRRLNHNLQERKVS
VAKAPRLSAVPPTELNREQKWQMMLSKSMRR
>t1111 cdtB, putative toxin-like protein
MKKPVFFLLTMIICSYISFACANISDYKVMTWNLQGSSASTESKWNVNVR
QLLSGTAGVDILMVQEAGAVPTSAVPTGRHIQPFGVGIPIDEYTWNLGTT
SRQDIRYIYHSAIDVGARRVNLAIVSRQRADNVYVLRPTTVASRPVIGIG
LGNDVFLTAHALASGGPDAAAIVRVTINFFRQPQMRHLSWFLAGDFNRSP
DRLENDLMTEHLERVVAVLAPTEPTQIGGGILDYGVIVDRAPYSQRVEAL
RNPQLASDHYPVAFLARSC
>t1199 cedA, cell division activator CedA
MMKPLRQQNRQIISYIPRVEPAPPEHAIKMDTFRDVWILRGKYVAFVLTG
ESFQRSPAFSVPESAQRWANQVRQENEIAD
>t1777 csgB, nucleation component of curlin monomers
MKNKLLFMMLTILGAPGIATATNYDLARSEYNFAVNELSKSSFNQAAIIG
QVGTDNSARVRQEGSKLLSVISQEGENNRAKVDQAGNYNFAYIEQTGNAN
DASISQSAYGNSAAIIQKGSGNKANITQYGTQKTAVVVQKQSHMAIRVTQ
R
>t1780 csgF, assembly/transport component in curli production
MRVKHAVVLLMLFSPLTWAGNMTFQFRNPNFGGNPNNGSFLLNSAQAQNS
YKDPAYDNDFGIETPSALDNFTQAIQSQILGGLLTNINTGKPGRMVTNDF
IIDIANRDGQLQLNVTDRKTGRTSTIEVSGLQTQSTDF
>t1758 dinI, damage-inducible protein
MRIEVTIAKTSPLPAGAIGALAGELSRRISHHFPENLGNVTVRYATANNL
SVIGASKEDKERISEILQETWESADDWFINE
>t0493 div, Div protein
MHPISGAPAQPPGEGRNPLSAASEQPLSMQQRTVLERLITRLISLTQQQS
AEVWAGMKHDLGIKNDAPLLSRHFPAAEQNLTQRLGVAQQNHANRQVLSQ
LTELLGVGNNRQAVSDFIRQQYGQTALSQLTPDQLKNVLTLLQQGQLSIP
QPQQRPATDRPLLPAEHNTLNQLVTKLAAATGESNKLIWQSMLELSGVKS
GELIPAKQFTHLATWLQARQTLSLQHAPTLHTLQAALKQPLEPDELTAIK
EYAQHTYQIQPQTVLTTAQVQDLLNHIFLRRVEREADELEPLSIQPIYRP
FAPMIETVKNLSARPGLLFIALIIVLALFWLVS
>t4587 dnaT, primosomal protein I
MSSRILTSDVIGIDVLLHDHHAVLAKSTGGAVAVFANNAPAFYAVTPARM
AELLALEEKLSRPGSDVALDAQFYEEPEAAPVAIPCGKFAMYPAWQPDAD
FQRQAALWGVALREPVTAEELAAFIAYWQAEGKVFHHIQWQQKLARSVQI
SRSSNGGMPQRDINSVSEPDNHIPPGFRG
>t0894 dsrB, DsrB protein
MMKVNDRVTVKTDGGPRRPGVVLAVEEFSEGTMYLVSLEDYPLGIWFFNE
SGHQDGIFVEKAEQD
>t4387 ecnA, entericidin A precursor
MMKRFIGLVALVLLTGTLLTACNTARGFGEDIQHLGHAISRAAS
>t1114 envE, putative lipoprotein
MTLLSGKNTLVLCLSSILCGCTTNGLPAPYSINLSFPVITQNQINSGGYY
INDAEQIRTTDGLCLDTGPDQQNRLTLRECKHVQSQLFSFHRDRITQGEK
CLDAAGQGTKEGTPIILYSCTGNDNQRWLTDDNKIKGKQSRKCLGTNSII
VRKGDPVVLADCDFSRALEFTIR
>t1924 exo, exonuclease
MTPEIILARTGIDVSNIEQGDDAWHRLRLGVITASEVHNVISRPKSGKKW
TDMKMSYFLTLLAEVCTGVAPEVNARALAWGKQYEDDARTLFEFTTDVKV
TGSPILFRDEDMRTACSPDGLCSDGRGLELKCPFTSRDFMKFRLGGFEAI
KSAYMAQVQFSMWVTGIDAWYFANYDPRMKREGIHHVVVERDDKYMSLFN
EMVPEFIEKMDEALKEIGFTFGEQWR
>t3759 fidL, hypothetical protein
MLACLTLLFLGVGLGHLFHLYTEKNRDPEKCTAPVIVFYNNTQANLTLDF
MYSLKKRTGVVSISGTYYVDNKMSGVIRRDVSYVWSENKDSTHFISTDIN
KVTRDETLSDAVIETVLPDFYVYPGKSISYTILTQGHRGFMFTIGKRPIF
FCTH
>t2316 fimH, FimH protein precursor
MKIYSALLLAGTALFFTHPALATVCRNSNGTATDIFYDLSDVFTSGNNQP
GQVVTLLKKSDWCGVNATCPAGTTVNYTYRSYVSELPVQSTEGNFKYLKL
NDYLLGAMSITDSVAGVFYPPRNYIRMGVDSNVSQQKPFGVQDSKLVFKL
KVIRPFINIVTIPRQTMFTVYVTTSTGDALSTPVYTISYSGKVEVPQNCE
VNAGQVVEFDFGDIGAPLFSQAGAGNRPQGVTPQAKTIAIKCTNVAAQAY
LSMRLEAEKASGQAMVSDNPDLGFVVANSNGTPLTPNNLSSKIPFHLDDN
AAARVGIRAWPISVTGNKPAEGPFTARGYLRVDYD
>t0953 flhC, flagellar transcriptional activator
MIMSEKSIVQEARDIQLAMELINLGARLQMLESETQLSRGRLIRLYKELR
GSPPPKGMLPFSTDWFMTWEQNIHASMFCNAWQFLLKTGLCSGVDAVIKA
YRLYLEQCPQPPEGSLLALTRAWTLVRFVESGLLELSSCNCCGGNFITHA
HQPVGSFACSLCQPPSRAVKRRKLSRDAADIIPQLLDEQIEQAV
>t0952 flhD, transcriptional activator FlhD
MGTMHTSELLKHIYDINLSYLLLAQRLIVQDKASAMFRLGINEEMANTLG
ALTLPQMVKLAETNQLVCHFRFDDHQTITRLTQDSRVDDLQQIHTGIMLS
TRLLNEVDDTARKKRA
>t0965 flhE, flagellar protein FlhE precursor
MRKWLALLLFPLTVQAAGEGAWQDSGMGVTLNYRGVSASSSPLSARQPVS
GVMTLVAWRYELNGPTPAGLRVRLCSQSRCVELDGQSGTTHGFAHVPAVE
PLRFVWEVPGGGRLIPALKVRSNQVIVNYR
>t0915 fliT, flagellar protein FliT
MTSTVEFINRWQRIALLSQSLLELAQRGEWDLLLQQEVSYLQSIETVMEK
QTPPGITRSIQDMVAGYIKQTLDNEQLLKGLLQQRLDELSSLIGQSTRQK
SLNNAYGRLSGMLLVPDAPGAS
>t0921 fliZ, FliZ protein
MTVQQPKRRPLSRYLKDFKHSQTHCAHCHKLLDRITLVRRGKIVNKIAIS
QLDMLLDDAAWQREQKEWVALCRFCGDLHCKKQSDFFDIIGFKQYLFEQT
EMSHGTVREYVVRLRRLGNYLSEQNISHDLLQDGFLDESLAPWLPETSTN
NYRIALRKYQQYKAHQQIAPRQKSPFTASSDIY
>t1922 gam, host-nuclease inhibitor protein
MNAYLTYDRIEAQDWTRHYQQIAREEKESELADDLEKGLSLHMLESLCMD
ELPRHGANKKAISRAFDDDVEFQERASEFVRYMAETFSRHQIDIESEE
>t3117 glgS, glycogen synthesis protein GlgS
MNNNNVYSLNNFDFLARSFARMQAEGRPVDIQAVTGNMDEEHRDWFCKRY
ALYCQQATQAKKLELEH
>t2387 hha, haemolysin expression modulating protein
MSDKPLTKTDYLMRLRRCQTIDTLERVIEKNKYELSDNELAVFYSAADHR
LAELTMNKLYDKIPSSVWKFIR
>t0803 hisL, his operon leader peptide
MTRVQFKHHHHPD
>t1477 hlyE, haemolysin HlyE
MIMTGIFAEQTVEVVKSAIETADGALDLYNKYLDQVIPWKTFDETIKELS
RFKQEYSQEASVLVGDIKVLLMDSQDKYFEATQTVYEWCGVVTQLLSAYI
LLFDEYNEKKASAQKDILIRILDDGVKKLNEAQKSLLTSSQSFNNASGKL
LALDSQLTNDFSEKSSYFQSQVDRIRKEAYAGAAAGIVAGPFGLIISYSI
AAGVIEGKLIPELNNRLKTVQNFFTSLSATVKQANKDIDAAKLKLATEIA
AIGEIKTETETTRFYVDYDDLMLSLLKGAAKKMINTCNEYQQRHGKKTLF
EVPDV
>t1001 holE, DNA polymerase III theta subunit
MKTNLAQLEQAEMDKVNVDLAAAGVAFKERYNMPVVAEAVEREQPEHLRA
WFRERLIAHRLASVSLSRLPYEPKVK
>t1085 hyaF, hydrogenase-1 operon protein HyaF
MSNAFFHLLGPGTQPDDASFSMNPLPLTCQVNGDPSMAALERCAHSPAVM
ALLTDLRGQLARRIPEVGDVLGWELSPLNADDLSFLNTLLGEGEVSVRIQ
HPDGSESEIQETIFCGLWRVRHLHNRRLLTDRLEAGSAPLTLWQAATADT
LPDDSLLPPPVAGLMNGLPLAHELLAHVRDPALQPHSINLTQLPLSEADR
LFLARLCGHGNIQIRISGYGESQINATALRHLWHVRCLDALKGPLLDSYE
ICPLPELVLAAPEDLADSRQRLDEVCRWLETR
>t3066 hybE, hydrogenase-2 component protein
MSETFSGFDTAPVARVQAAFEEIAHRSMHDLSFLHPTMPVHVSDFTLFEG
QWTGTVITPWMLSALIFPGPDQIWPGRTVGEKLGLQLPYGTMTFTVGELE
GVSQYLACSLMSPLSRSLSPEEGVRLADDCARMLLSLPVSNPDAPQTSRR
ALLFGRRSCENA
>t2755 hycA, formate hydrogenlyase regulatory protein
MTIWEISEKADYIAQRHRRLQDQWHIYCNSLVQGITLSKARLHHAMSCAP
ERDLCFVLFEHFRIYVALADGFNSHTIEYYVETKDGEDKQLIAQAQLDID
GKVDERVNNRDREQVLEHYLEKIASVYDSLYTAVETNSPVNLRQLVKGHS
PAV
>t2748 hycH, formate hydrogenlyase maturation protein
MSEQVVFSQLSRKFIDENDATPAEAQQVVYYSLAIGHHLGVIDCLEAALT
CPWDEYLAWIATLEAGSDARRKMEGVPKYGEIVIDFNHVQMLARAFDEAR
AAQTPQQQEWSKLMLSMLHDIHQESAIYLMVRRLRD
>t3398 ilvL, ilvGMEDA operon attenuator peptide
MTALLRVISLVVISVVVIIIPPCGAALGRGKA
>t2799 invE, cell invasion protein
MIPGSTSGISFSRILSRQASHQDATQHTDAQQAEIQQAAEDSSPGAEVQK
FVQSTDEMSAALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKL
ISVHGGALEDFLRQARSLFPDPSDLVLVLRELLRRKDLEEIVRKKLESLL
KHVEEQTDPKTLKAGINCALKARLFGKTLSLKPGLLRASYRQFIQSESHE
VEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSRLEFGQLLRRL
TQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL
LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALR
SMTDIAYKHEMAEQRRTIEKLS
>t2802 invH, cell adherance/invasion protein
MKKFYSCLPVFLLIGCAQVPLPSSVSKPVQQPGAQKEQLANANSIDECQS
LPYVPSDLAKNKSLSNQNADNSASKNSAISSSIFCEKYKQTKEQALTFFQ
EHPQYMRSKEDEEQLMTEFKKVLLEPGSKNLSIYQTLLAAHERLQAL
>t3721 ivbL, ilvBN operon attenuator peptide
MNPSMLNATLLTTAPSRAVVVVRVVVVVGNAP
>t0118 leuL, leu operon leader peptide
MSYIVRFTGLLLLNAFIVRGRPVGGIQH
>t4138 malM, maltose operon periplasmic protein
MKMKKSLVALCLTAGLFASVPGISLAEVNYVPQNTSAAPAIPAAALQQLT
WTPVDQSKTQSTQLATGGQRLDVAGITGPVAAYSVPANIGELTLTLTSEV
NKQASVFAPNVLILDQNMTPSAFFPSSYFTYQQPGVMSADRLEGVMRLTP
ALGQQKLYVLVFTTEKDLQQTTTLLDPAKAYAKGVGNSIPDIPDPVARHT
TDGVVKLKVKTNSSSSVLVGPLFGSSGTGPVTVGNTAAPVAAPAPVAPKK
SEPMLNDTESYFNKAIKDAVAKGDVDKALKLLDEAERLGSTSARSTFISS
VKGKG
>t1439 marB, multiple antibiotic resistance protein MarB
MKMLFPALPGLLLIASGYGIAEQTLLPVAQNSRDVMLLPCVGDPPNDLHP
VSVNSDKSDELGVPYYNDQHL
>t2676 mig-14, putative transcriptional regulator
MKIQEVKRILTRWQPSSFTLYREVFTQYGGSINMHPDIVDYFMKRHNWHF
KFFHYKEDDKIKGAYFICNDQNIGILTRRTFPLSSDEILIPMAPDLRCFL
PDRTNRLSALHQPQIRNAIWKLARKKQNCLVKETFSSKFEKTRRNEYQRF
LKKGGSVKSVADCSSDELTHIFIELFRSRFGNTSSCYPADNLANFFSQLH
HLLFGHILYIEGIPCAFDIVLKSESQMNVYFDVSNGAIKNECRPLSPGSI
LMWLNISRARHYCQERQKKLLFSIGILKPEWEYKRMWSTPYFTGKSIC
>t1113 msgA, putative virulence protein
MFVELVYDKRNVEGLPSAREIILNELTKRVHQLFPDAQVKVKPMQANALN
SDCTKTEKERLHRMLEEMFEEADMWLVAE
>t1767 msyB, acidic protein MsyB
MTMYATLEEAIDAAREEFLADHPGLEQDEANVQQFNVQKYVLQDGDIMWQ
VEFFADEGEDGECLPMLSGEAAQSVFDGDYDEIEIRQEWQEENTLHEWDE
GEFQLEPPLDTEEGRTAADEWDER
>t4184 nrfB, cytochrome c-type protein NrfB precursor
MSVLRSLLTAGVLASGLFWSLSGITATPTPQESDQRWTVTQQRNPDAACL
DCHKPDTEGMHGKHTGAINPNNKLPITCTNCHGQPSLHHREGVKDVMRFN
DPMYTVEQQNSVCMSCHLPEQLQKAFWPHDVHVTKVTCASCHSLHPQQDT
MQTLNEKGRIKICVDCHSDQRTNPHFNPASVPLLKEQP
>t3423 nucE, possible secretory protein
MTLERISAFITYCIAVVLAWLGDLSIKDASTLGGLMIGVLMLAINWYYKH
KAYQLLRDGQISREDYESINR
>t4315 nucE, possible secretion protein
MTLERISAFITYCIAVLLAWLGDLSLKDASTVGGVLIGVLMLAINWYYKH
QSFKLLRGGKISRGEYEYFNR
>t3268 oadG, putative oxaloacetate decarboxylase subunit gamma
MTNAALLLGEGFTLMLLGMGFVLAFLFLLIFAIRGMSAVITRFFPEPVAA
PAPRAVPAVDDFTRLKPVIAAAIHHHRLNA
>t2772 orgA, oxygen-regulated invasion protein
MNRQPLPIIWQRIIFDPLSYIHPQRLQIAPEMIVRPAARAAANELILATW
RLKNGEKECIQNSLTQLWLRQWRRLPQVAYLLGCHKLRADLARQGALLGL
PDWAQAFLAMHQGTSLSVCNKAPNHRFLLSVGYAQLNALNEFLPESLAQR
FPLLFPPFIEEALKQDAVEMSILLLALQYAQKYPNSVPAFAC
>t1618 osmB, osmotically inducible lipoprotein B precursor
MFMMSKKMAAAVLAITVAMSLSACSNWSKRDRNTAIGAGAGALGGAVLTD
GSTLGTLGGAAVGGVIGHQVGK
>t1190 osmE, osmotically inducible lipoprotein E precursor
MNKNVAGILSAAAVMTMLAGCTAYDRTKDQFVEPVVKDVKKGMSRAQVAQ
IAGKPSSEVSMIHARGTCQTYILGQRDGKAETYFVALDDTGHVINSGYQT
CAEYDTDPQAPKQ
>t1116 pagD, putative outer membrane virulence protein
MKHHAFMLWSLLIFSFHVLASSDHCSGLQQASWEIFIYDFGSKTPQPPTN
TDKKQARQISSPSCPTAKPMMSAPTNDARKGNTFSRT
>t2239 pagP, antimicrobial peptide resistance and lipid A acylation protein
MYVAMIIRKYFLIIALLVMPWLAIPSVSAADKGWFNTFTDNVAETWRQPE
YYDLYVPAITWHARFAYDKEKTDRYNERPWGVGFGQSRWDDKGNWHGLYM
MAFKDSFNKWEPIGGYGWEKTWRPLEDDNFRLGLGFTAGVTARDNWNYIP
IPVLLPLASIGYGPATFQMTYIPGSYNNGNVYFAWMRFQF
>t0830 pduH, PduH protein
MDSNHSAPAIVITVINDCASLWHEVLLGIEEEGIPFLLQHHPAGDVVDSA
WQAARSSPLLVGIACDRHSLVVHYKNLPASAPLFTLMHHQDSQAQRNTGN
NAARLVKGIPFRDLNS
>t0826 pduM, hypothetical protein
MNGETLQRIVEEIVSRLQRRALSTATLSVAQLRDADCPALFCQHASLRIL
LVDLPLLGQLADAETDDAAARKIHDALAFGIRVQLSLHSQLLPVIPVKKL
ARLPLVFTDEHGLPLVLHAGSVLGYRDVALLSRGRVVVHRKCIVTAMARD
AANARNIQLIKQE
>t1217 pheM, phenylalanyl-tRNA synthetase operon leader peptide
MNAAIFRFFFYFST
>t4244 pilP, pilus assembly protein
MPASEPATEAAASISSADLSDTVPIFPGNSVTAGQLEALQGKNLLLEAKV
QAARLLKELTSAQTPGDVGNVPTTSPFVMGMPAEMNATPVHTPPATGRIT
VLEVSGRGNALQATLSFPDGRQSLVQTGSVIPGTSLKVKAITLSSVTLSD
GQQLTF
>t4247 pilS, prepilin
MLSNSMKNETEGKMMNEVSTLNPCNRPDRGMSADAGATALFILVIIGVIA
AAVWSMWGKKDAGTELTNYQTLATNTIGMMKGVDGYAFTSGAKMTDTLIQ
AGAAKGMTVSGDPASGSATLWNSWGGQIVVAPDTAGGTGFNNGFTITTNK
VPQSACVSISTGMSRSGGTSGIKINGNNHTDAKVTAEIASSECTADNGRT
GTNTLVFNYNG
>t1831 pipA, hypothetical protein
MLPVTYRLIPQSGVSTYGLNTADTPVFPDIPEHAPNPSRLRLAHDSLAIN
SEFRLEPECVVEYLISGAGGIDPDTEIDDDTYDECYDELSSVLQNAYTQS
ETFRRLMNYAYEKELHDVEQRWLPGAGEAFETTVAQEHFKLSEGRKVICL
NLDDSDDSYTEHYESNEGRQLFDTKRSFTHEVVHALTHLQDKEENHPRGP
VVEYTNIILKEMGHPSPPRMVYIFNK
>t0559 pmrD, polymyxin B resistance protein
MEWLVKKSHYVKKRACHVLVLCDSGGSLKMIAEANSMILLSPGDILSPLQ
DAQYCINREKHQTLKIVDARCYSCDEWQRLTRKPL
>t2905 ppdC, prepilin peptidase dependent protein C precursor
MSHPLNFQRGFSLPEVLVAMVLMVMIVTALSGYQRVLMHTFALRHQYLQI
WRQAWQQTALYPFSPADGWNANRMQTTQSGCVSISVTMVSPSGRQGQMTR
LHCPNR
>t2776 prgH, pathogenicity 1 island effector protein
METSKEKTITSPGPYIVRLLNSSLNGCEFPLLTGRTLFVVGQSDALTASG
QLPDIPADSFFIPLDHGGVNFEIQVDTDATEIILHELKEGNPESRSVQLN
TPIQVGELLILIRPESEPWVPEQPEKLETSAKKNEPRFKNGIVAALAGFF
ILGIGTVGTLWILNSPQRQAAELDSLLGQEKERFQVLPGRDKMLYVAAQN
ERDTLWARQVLARGDYDKNARVINENEENKRISTWLDTYYPQLAYYRLHF
DEPRKPVFWLSRQRNTMSKKELEVLSQKLRALMPYADSVNITLMDDVTAA
GQAEAGLKQQALPYSRRNHKGGVTFVIQGALDDVEILRARQFVDSYYRTW
GGRYVQFAIELKDDWLKGRSFQYGAEGYIKMSPGHWYFPSPL
>t2774 prgJ, pathogenicity 1 island effector protein
MSIATIVPENAVIGQAVNIRPMETDIVSLDDRLLQAFSGSAIATAVDKQT
ITNRIEDPNLVTDPNELAISQEMISDYNLYVSMVSTLTRKGVGAVETLLR
S
>t2480 psiF, phosphate starvation-inducible protein PsiF
MKITLLVTLLFGLVFLTAVGATEKPLTPQQQRMTTCNQQATAQALKGDAR
KTYLSDCLKNSQSAPGEKSLTPQQQKMRECNVQATEQSLKGDDRSKFMSA
CLKKAA
>t1594 pspB, phage shock protein B
MSALFLAIPLTIFVLFVLPIWLWLHYSNRAGRGELSQSEQQRLLQLTDDA
QRMRERIQALEDILDAEHPNWRER
>t1592 pspD, phage shock protein D
MNTRWQRAGQKVKPGFKIAGKLVLLTALRYGPAGVAGWAVKSVARRPLKM
LLAFALEPVLRKAANKISQRYK
>t4150 pspG, phage shock protein G
MLELLFVLGFFLMLMVTGVSLLGILAALVVATAVMFLGGMFALMIKLLPW
LLLAVAVVWVIKAVKTPKIPQYQRNNRRFY
>t4496 pyrL, pyrBI operon leader peptide
MVQCVRHSVLPRLKKDAGLPFFFPLKTNTKPLN
>t0341 ratA, hypothetical protein
MDRNRQVNKVVHFLLTLLIMFVASIAPAQALLKGGTWQELNSATAAVNGT
VPRADGAIIPVYQGSMLLDPTKTYDVAFTAMPRDFSADATSTSMRAVNST
DTEGDLFSDPPTIAWENQQPPSVGLLWADAATPDTPLSPQPTPNQTFCAQ
NLAGRKLVVWPQPDDETTVPALWLYTHTGVPNNAAIPLLSPKVTVNIAQA
VGNPVSVSGDHVDASFKASKVKVGESITLTVTTRGCDGKLVKNAPFVIRR
EDAKNRQGVVNNTNPVHVGDAELTTVQTEYRGVTDAEGKATVVVTQDEGP
GVKTHLIVASQSYPTLTDGVDVIFTTITSPDTAQANMYGHMLESASATLN
GATYTFTRPKLAVEAGNADESVNDTNETWAQFTWSGADNHCDVLPDAEQL
VALRHAHSTLATYTGWPASGDAEYWSSTKDQMNNYHAAVHMNSASVVRAP
NSDTLLVSCVDKAQPAAHPQITLSPEGPYKAQVGESIDLVMTVVDKDTQK
PLPYRYMELFIDPAKNRKGEHQDAWDNQRVTVSSEDMRASSPEHYTGVTD
VNGQAHLTLQHDSGMGGETPIRIVMPDDEGGNVELPFSVIFTVITSPDVD
GANMWGHMRGVVDAGNLYKRPLLAVEASHKDGQFSENNEEWATFNSVASA
TAQCGVGQVPDQSSLAHLYSEHPSGKMEIEHGWPTEDYFIAADSDASGTV
HVNLANGDSNKFTNQPNYLTCSANEMVAVLDVYFNDDPTTKNADMTAKVG
EQIKLNIHSRNALNDMAIAYTDFTITMANGKRRDGLTTGFTDPSNGEMQF
DGVGYLAGQVYHGITDANGDATIILTQNKGVGLLTPLSIAPVDSLIKTPI
SRSVKFTVATSPDTVKAKMWGHMADTLTVGDWTFERPKLASEVNSPLRTQ
EESNETWTRVAHIDAAGNPDAGGCAANRLPRIDQLEALYSANSGGAMKKT
QGWPTLINYWSSTYQSATTWKLIALASGSEFPGSNTSVYTSCLASDNPVP
AAITIEPVDPSQWYDGSGVHALKVKKGDTLQLKVTVKDASGKPVPEAPFV
LTRGDGYDRKGEKYTAQDGSDLQNIVTPVVIDGESLAWTTTKMGSQTGPD
GTRIISVTRPDTHGTRTAITATLYENVAVSASIDTIFTVVTSPDVSVARM
WGHMTPSLTAADSAVYKRPLLYDELASKTGAAEYPEDNERWVVFYGPNTT
KTVSPEACSKGYFPSVEQLDSLYSKYPNGAIKTAQGWPIIRSYWSGTNAG
TITPGAPPYDYYTVDLNDDAHRKVPNISDSDRQYQICAATPQPLAGRITL
TSTLATDSDIQAVKAKNSDSIPLVITTTDAAGNPVPYTPFSLIRDAGTAR
NTSYTFTGSTNMMLAPPTGSAQQFYYNGYTIYGATGADGTAVLTLTQAAG
PGVKNVITAALTDTPTVTSTLPVVFTTVTSPDSPQANMYGHMPETFTASN
GAEFKRPLLYSELASTLGVKSYADTNENWPIVNNFDTSNYGACSINQMAT
LDDLKALYGDHPSGKVTTDIGLPVRKKWWAGDSLLQGQVIYWRYIDLSTG
IDYSMSGTPGNYYYQLCLTKPRQMNIALSTDAWNADKSAAVAKKGETIPM
TVKVTNAAGQPVSNATVKITRGDALTRAGSVYTTNNADDITLSNIQPSGT
ATYLLGSVDKYMYAQTDAQGQITFSVSQNNTMGLKTPIRVTVANDITATS
SKDVIFTVLTSPDVASANYWGHMPETVEGPDGLRYQRPHLQAEAPSGVNY
ITVNGEKWAAPTGVQTYTAGQSACDVEYMPLMNDLKALQQLYPDGALEDQ
FGWPVKTGKLWWSADLNSSKAHQAINLKTGQISAPTSTSLQACLVNARNV
PASITLTSTAMDAEKGAAVAKKGEAIPLTVTVKNRAGVPIANEPFTLKRG
DANDRLDIKYTWNTTADDLTLQELTPSPTTKSMTASGNVFSGVTGADGTA
TFTVNQDGSVGLKTELTASATGDVTQSTNTALGVIFNVITSPDTDKAQYW
GHMPDTLTVGGVTLHRPLLMKEAPAGATDSRKENNETWVSVYTKADGLTY
DMSKNCGGVAGFPAKGVLEKIRDEQMAVANGWPTVSLPYASSTPGTYNYC
WVSLAKGGTTHCPTTSSDYSIGYAACLVQP
>t0343 ratC, hypothetical protein
MPIRLGRGNYSQNRAGGSDSTNNSDMLLTPIAPPADAKVFAYHYSGEQLW
YWYGTTDESGRVQFELTQDNTPGLKTALQAMLSDNPPTVSNMDVIFTVIT
SPDSDKAKMWGHMPETAANSAGVKFHRPLLAAEMTSNSGTYYYNNETWPL
VTIANTQKAGATTCDAAYQPLFNDLQTLYSDHPDSALNTAFGWPVGAGKS
WLAVDQEPGTGYYQYLRLDTGAKGHTSSTSVSGAQVCLVEPHTSTPANIT
LTSTAMDSAKNAAVVEKGSAMPLTVTVKDSSGNPVANVGFTLSRGDSKNR
AGTVVTDGDVAADAGADDLMLKELTPASASRSMTTTGIVFTGTTGSDGTA
TFTLNQDKSLGLKTPLTVKLTDNTALHASLDVIFMVLTSPDTDKALFWGN
MSDTTSVNGKTLHRPWLQAELPSGVTPVFTNGVHANNEYWAMTHTVDNTK
WDIAKQCGSLSKAPDNNDLLTLYHSISSLGWPTLGYPYLSKSTSGGGMYC
GVDENTKQQNCAIQPASTAGYATCVE
>t0247 rcsF, rcsF protein
MRALPICLLALMLGGCSMLSRSPVEPVQSTATPPKAEPEKPKAPRAAPVR
IYTNAEDLVGKPFRDLGEVSGESCQATNQDSPPNIPTARKRMQINASKMK
ANAVLLHSCEITSGTPGCYRQAVCIGSALNISAK
>t3373 rffC, lipopolysaccharide biosynthesis protein
MQAKVPAENIAWLSALQSLGFSLVEGEVDFALPVKGHRDQHGAEIAHLTD
IPALRQLAGEAFTQSRFRAPWYAPDASARFYAQWIENAVRGTFDHQCLVL
RTETGAIRGYVSLRELNDTDVRIGLLAGRGAGAELMQAAICWAQSRGKAT
LRVATQLGNTAALKRYIQSGANIESTAYWLYR
>t3761 rmbA, hypothetical protein
MMRKPSQIVHCISCDLSCQLFPDSAVRVQYCHNAAFSIWPDGNAFLKKGF
IEKLLLDRHNHLSSGFIFVDFSFPNLRRFTDLQWADSLANSGMHIVLISD
RSLTPLANYWILKSNKIQGIIYSDDDDIVQQQKMHRLFTGRLANSKRGRT
LNYTEFILLKHFVSGISIQQIVNIDNIDIKKLYVHKLRLENKLGHSIHKI
ISNIL
>t2561 safA, probable lipoprotein
MKNIKKLIIASVLSMITASCYAGSIVVGSEQQSSVDIGFASPQQLSVTFA
PVAGLKAGVIKSNTEIATIAVSSVAAKQFAIAADFKAKNVMNGDTWTLYG
KNTGKGIKVYFYGETTSPKGNVNYNGHQWIIYDINDKLGVKLAGDQNVPA
DVFPMTVNIAAYQA
>t2557 safD, putative fimbrial structural subunit
MWMKIQRVKTVIYSVSLLVAASSLVPIANAAEKLQTTLRVGTYFRAGHVP
DGMVLAQGWVTYHGSHSGFRVWSDEQKAGNTPTVLLLSGQQDPRHHIQVR
LEGEGWQPDTVSGRGAILRTAADNASFSVVVDGNQEVPADTWTLDFKACA
LAQEDT
>t1807 scsA, membrane protein, suppressor for copper-sensitivity A
MAKQQRMGWWFLCLACVVVMVCTAQRMAGLHALQMQATASAAVVSAPSST
DDGSPVTPCELSAKSLLAAPPVLFEGAILALYLLLSLLAPVRVMRLPFSP
PRAISPPTLRVHLRFCVFRE
>t2781 sicP, chaperone
MGLPLTFDDNNQCLLLLDSDIFTSIEAKDDIWLLNGMIIPLSPVCGDSIW
RQIMVINGELAANNEGTLAYIDAAETLLFIHAITDLTNIYHIISQLESFV
NKQEALKNILQEYAKV
>t1696 sifA, secreted effector protein
MPITIGNGFLKSEILTNSPRNTKEAWWKVLWEKIKDFFFSTGKAKADRCL
HEMLFADRTPTRERFTEIFFELKELACASQRDRFQVHNPHENDDTIILRI
MDQNEENELLRITQNTDTFSCEVMGNAYFLIKDRPDILKPHPQMTAMINR
RYSEIVDYPLPSTLCLNPAGAPTLSVPLDNIEGYLYSEWRKGNLDEWKTQ
EKATYLAAKIQSGIEKTTRILQHANISESTQQNAFLETMTMCGLKQLEIP
PPHTHIPIEKMVEEVLLADKTFQAFLVTDPSASLSMLAEIVEAISDQVFH
AIFRIAPQSIQKMAEEQLTTLHVRSEQQSGYLCCFL
>t1511 sifB, secreted effector protein
MPITIGRGFLKSEMFSQSAISQRSFFTSLWEKIKDFFCDTQRSTADQYIK
ELCDVASPPDAQRLFDLFCALYELSSPSCRGNFHFQHYKDAECQYTNLCI
KDGEDIPLCIMIRQDHYYYEIMNRTVLCVDTQSAHLKRYSDINIKASTYV
CEPLCCLFPERLQLSLSGGITFSVDLKNIEETLIAMAEKGNLCDWKEQER
KAAISSRINLGIAQAGVTAIDDAIKNKIAAKVIENTNLKNAAFEPNYAQS
SVTQIIYSCLFKNEILMNMLEESSSHGLLCLNELTEYVALQIHNSLFSED
LSSLVETTKNEAHYQS
>t1828 sigD, cell invasion protein
MQIQSFYHSASLKTQEAFKSLQKTLYNGMQILSGQGKAPAKAPDARPEII
VLREPGATWGNYLQHQKTSNHSLHNLYNLQRDLLTVAATVLGKQDPVLTS
MANQMELAKVKADRPATKQEEAAAKALKKNLIELIAARTQQQNGLPAKEA
HRFAAVAFRDAQVKQLNNQPWQTIKNTLTHNGHHYTNTQLPAAEMKIGAK
DIFPSAYEGKGVCSWDTKNIHHANNLWMSTVSVHEDGKDKTLFCGIRHGV
LSPYHEKDPLLRQAGAENKAKEVLAAALFSKPELLNRALEGEAVSLKLVS
VGLLTASNIFGKEGTMVEDQMRAWQSLTQPGKMIHLKIRNKDGDLQTVKI
KPDVAAFNVGVNELALKLGFGLKASDSYNAEALHQLLGNDLRPEARPGGW
VGEWLAQYPDNYEVVNTLARQIKDIWKNNQHHKDGGEPYKLAQRLAMLAH
EIDAVPAWNCKSGKDRTGMMDSEIKRELISFHQTHMLSAPGSLPDSGGQK
IFQKVLLNSGNLEIQKQNTGGAGNKVMKNLSPEVLNLSYQKRVGDENIWQ
SVKGISSLITS
>t1829 sigE, cell invasion protein
MESLLNRLYDALGLDAPEDEPLLIIDDGIQVYFNESDHTLEMCCPFMPLP
DDTLTLQHFLRLNYASAVTIGADADNTALVALYRLPQTSTEEEALTGFEL
FISNVKQLKEHYA
>t0340 sinI, hypothetical protein
MQATVKRRLTKVALALVVAGYCAAPAVAANGNLKSGQWQIVSEQTGTIQG
TVPWITRAADKTADTDKDHVTVTIDRGDRKIVTEGDKQFHVGDKVTVNWA
IGDTEGDLDVDNAATKLTVQWMRYSDQNGSNPEEIGAKGSDTYEIQAGDA
DHYIGIKITPTTTTGDPAVATELLLKDLSTDAGGGADGDDIPEGPVVDEN
VHVVIHEKDSNTNLLKNSGTTLKTNTTYQVLLWSDKNGNGTYDAGENVTD
QYDYRWKFVGTSKIAGTGTGGIVNENWNDKDLVIPLTNAEAKEAFEGAEG
GVTVGSDGVQGFGLSIDYKRK
>t2784 sipA, pathogenicity island 1 effector protein
MVTSVRTQPPVIMPGMQTEIKTQATNLAANLSAVRESATATLSGEIKGQQ
LEDFPALIKQASLDALFKCGKDAEALKEVFTNSNNVAGKKAIMEFAGLFR
SALNATSDSPEAKTLLMKVGAEYTAQIIKDGLKEKSAFGPWLPETKKAEA
KLENLEKQLLDIIKNNTGGELSKLSTNLVMQEVMPYIASCIEHNFGCTLD
PLTRSSLTQLVDKAAAKAVEALDMCHQKLTQEQGTSVGREARHLEMQTLI
PLLLRNVFAQIPADKLPDPKIPEPAAGPVPDGGKKAEPTGINININIDSS
NHSVDNSKHINNSRSHVDNSQRHIDNSNHDNSRKTIDNSRTFIDNSQRHG
ESHHSTNSSNVSHSHSRVDSTTHQTETAHSASTGTIDHGIAGKIDVTAHA
TAEAVTNSSSESKDGKVVTSEKGTTGETTSFDEVDGVTSKSIIGKPLQAT
VHGVDDNKQQSQTAEIVNVKPLASQLAGVENVKIDTLQSDSTVITGNKAG
TTDNDNSQTDKTGPFSGLKFKQNSFLSTVPSVTNMHSIHFNAREAFLGVI
RKALEPDASTPFPVRRAFDGLRGEILPNDTIKSAALKAQCSDIDKHPELK
AKMETLKEVITHHPQKEKLAEIALQFAREAGLTRQKGETDYVLSNVLDGL
IGDGSWRAGPAYESYLNKPGVDRVITTVDGLHMQR
>t2785 sipD, pathogenicity island 1 effector protein
MLNIQNYSASPHPGIVAERPQTPSASEHAEIAVVPSTTEHRGTDIISLSQ
AATKIQQAQQTLQSTPPISEENNDERTLARQQLTSSLNALAKSGVSLSAE
QNENLRSTFSAPTSALFSASPMAQPRTTISDAEIWDMVSQNISAIGDSYL
GVYENVVAVYTDFYQAFSDILSKMGGWLSPGKDGNTIKLNVDSLKSEISS
LINKYTQINKNTILFPSQTGSGMTTATKAEAEQWIKELNLPDSCLKASGS
GYVVLVDTGPLSKMVSDLNGIGSGSALELDNAKYQAWQSGFKAQEENLKT
TLQTLTQKYSNANSLYDNLVKVLSSTISSSLETAKSFLQG
>t4303 sopE, invasion-associated secreted protein.
MTKITLSPQNFRIQKQETTLLKEKSTEKNSLAKSILAVKNHFIELRSKLS
ERFISHKNTESSATHFHRGSASEGRAVLTNKVVKDFMLQTLNDIDIRGSA
SKDPAYASQTREAILSAVYSKNKDQCCNLLISKGINIAPFLQEIGEAAKN
AGLPGTTKNDVFTPSGAGANPFITPLISSANSKYPRMFINQHQQASFKIY
AEKIIMTEVAPLFNECAMPTPQQFQLILENIANKYIQYTP
>t2795 spaM, virulence-associated secretory protein
MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDRGLQAEEEAILEQIAGLK
LLLDTLRAENRQLSREEIYTLLRKQSIVRRQIKDLELQIIQIQEKRSELE
KKREEFQKKSKYWLRKEGNYQRWIIRQKRHYIQREIQQEEAESEEII
>t2794 spaN, antigen presentation protein SpaN
MGDVSAVSSSGNILLPQQDEVGGLSEALKKTVEKHKTEYSGDKKDRDYGD
AFVMHKETALPVLLAAWRHCAPAKSEHHNGNVSGLHHNGKGELRIAEKLL
KVTAEKSVGLISAEAKVDKSAALLSPKNRPLESVSGKKLSADLKAVESVS
EVTDNATGISDDNIKALPGDNKAIAGEGVRKEGAPLARDVAPARMAAANT
GKPDDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLAAQSKPMMTIF
PTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH
DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA
>t2797 spak, virulence-associated secretory protein
MQHLDIAELVRSALEVSGCDPSLIGGIDSHSTIVLDLFALPSICISVKDD
DVWIWAQLGADSMVVLQQRAYEILMTIMEGCHFARGGQLLLGEQNGELTL
KALVHPDFLSDGEKFSTALNGFYNYLEVFSRSLMR
>t1261 spiC, putative pathogenicity island secreted effector protein
MLAVLKGIPLIQDIRAEGNSRSWIMTIDGHPARGEIFSEAFSISLFLNDL
ESLPKPCLAYVTLLLAAHPDVHDYAIQLTADGGWLNGYYTTSSSSELIAI
EIEKHLALTCILKNVIRNHHKLYSGGV
>t1503 srfA, putative virulence effector protein
MVDCLAIPQLNDNGDRVDWYSPIEGQAIAWKAADEETRSRALRYLASTFE
SAAALSRKSLQSGKTALQLFGSLLEKATQFPGENHVFLVNGKPVITFWGF
VNLNENTRDDVLDCLRVTEAISDIPLVEPEPQPEEKPLVEAAFSQADEPL
LTSVIEPPKMPEEPAAPPVIVSEPKPAAPIPVAEAKRARRLPLWSLPVAA
VVIAAVATPFFWPSSSVDGASPVRTAPVATAKTDVTPMPELTAHLPLHRA
EVTSAPKAAPLPEAPVIIAAIPKDALVMDNTQMKLGTTRFLNGSWRVSVD
VKDPITGKPPSLRYQIQNNKGIARVVHGDNVVCRAEIFSGLHQTGELMIK
SRGNARCTDGSRYPMPEITCKAGVNDVATCTARYGDHAAIPLTFKKIGA
>t1263 ssaD, putative pathogenicity island protein
MAYLMVNPKSSWKIRFLGHVLQGREVWLNEGNLSLGEKGCDICIPLAINE
KIILREQADSLFVDAGKARVRVNGRRFNPNKPLPSSGVLQVAGVAIAFGK
QDCELADYQIPVSRSGYWWLAGVFLIFIGGMGVLLSISGQPETVNDLPLR
VKFLLDKSNIHYVRAQWKEDGSLQLSGYCSSSEQMQKVRATLESWGVMYR
DGVICDDLLIREVQDVLIKMGYPHAEVSSEGPGSVLIHDDIQMDQQWRKV
QPLLADIPGLLHWQISHSHQSQGDDIISAIIENGLVGLVNVTPMRRSFVI
SGVLDESHQRILQETLAALKKKDPALSLIYQDIAPSHDESKYLPAPVAGF
VQSRHGNYLLLTNKERLRVGALLPNGGEIVHLSADVVTIKHNDTLINYPL
DFK
>t1264 ssaE, putative secretion system protein
MTTLTRLEDLLLHSREEAKGIILQLRAARKQLEENNGRLQDPQQYQQNTL
LLEAIEQAENIINIIYYRYHNSALVVSEQE
>t1274 ssaG, putative pathogenicity island protein
MDIAQLVDMLSHMAHQAGQAINDKMNGNDLLNPESMIKAQFALQQYSTFI
NYESSLIKMIKDMLSGIIAKI
>t1275 ssaH, putative pathogenicity island protein
MFAGVNHSLISQVHAMLPALTVIVPDKKLQLVCLALLLAGLNEPLKAAKI
LSDIDLPEAMALRMLFPAPNEGFEN
>t1276 ssaI, putative pathogenicity island protein
MSIVPVSTQSYVKSFAEPSQEQINFFKQLLKDEASTSNASALLPQVMLTR
QMDYMQLTVGVDYLARISGAASQALNKLDSMT
>t1280 ssaL, putative secretion system protein
MNIKINEIKMTPPTAFTPGLVIEEQEVISPSMLALHELQETAGAALYETM
EEIGMALSGKLRESNKFTDAEKLERRQQALLRLIKQIQEDNGAALRPLTE
ENSDPDLQNAYQIIALAMALTAGGLSKKKKRDLQSQLDTLTAEEGWELAV
FSLLELGEVDTATLSSLKRFMQQAIDNDEMPLSQWFRRVADWPDRGERVR
ILLRAIAFELSICIEPSEQSRLAAALVRLRRLLLFLGLEKECQREEWICQ
LPPNTLLPLLLDIICERWLFSDWLLDRLTAIVSSSKMFNRLLQQLDAQFM
LIPDNCFNDEDQREQILETLREVKINQVLF
>t1281 ssaM, putative pathogenicity island protein
MDWDLITERNIQLFIQLAGLAERPLATNMFWRQGQYETCLNYHNGRIHLC
QILKQTFLDEELLFKALANWKPAAFQGIPQRLFLLRDGLAMSCSPPLSSS
AELWLRLHHRQIKFLESQCVHG
>t1284 ssaO, putative type III secretion protein
METLLEIIARREKQLRSKLTVLDQQQQAIITEQQICQTRALAVTTRLKEL
MGWQGTLSCHLLLDKKQQMAGLFTQAQSFLTQRQQLENQYQQLVSRRSEL
QKNFNALMKKKEKITMVLSDAYYQS
>t1285 ssaP, putative type III secretion protein
MRITKVEGSLGLPCQSYQDDNEAEAERMDFEQLMHQALPIGENNPPAALN
KNVVFTQRYRVSGGYLDGVECEVCESGGLIQLRINVPHHEIYRSMKALKQ
WLESQLLHMGYIISLEIFYVKNSE
>t0321 sseB, SseB protein
MIPMSETKNELEILLEKAATEPAHRSAFFRTLLESTVWVPGSAAEGEAIV
EDSALDLQHWEKEDGTTVIPFFTSLEALQQAVEDEQAFVVMPARTLFEMT
LGETLFLNAKLPTGKEFMPREISLLLAEEGSPLSTQEVLEGGESLILSEV
AEPPSQMIDSLTTLFKTIKPVKRAFLCAIKEHADAQPNLLIGIEADGEIE
EIIHAAGNVATDTLPGDEPIDICQVRKGAQGISHFITEHIAPFYERRWGG
FLRDFKQNRII
>t1269 sseD, putative pathogenicity island effector protein
MEASNVALVLPAPSLLTPSSTPSPSGEGMGTESMLLLFDDIWMKLMELAK
KLRDIMRSYNVEKQRLSWELQVNVLQTQMKTIDEAFRASMITAGGAMLSG
VLTIGLGAVGGETGLIAGQAVGHTAGGVMGLGSGVAQRQSDQDKAIADLQ
QNGAQSYNKSLTDIMEKATEIMQQIIGVGSSLVTVLAEILRALTR
>t1272 sseF, putative pathogenicity island effector protein
MKIHIPSAASNIVDGNSPPSGIQAKEASFPPPEIPAPGTPTVPVVLTPEQ
IRQQRDYAIHFMQYTIRALGATVVFGLSVAAAVISGGAGLPIAILAGAAL
VIAIGDACCAYHNYQLICQQKAPLQTASDSVALVVSALALKCGASLNCAN
TLANCLSLLIRSGIAISMLVLPLQFPLPAAENIAASLDMGSVITSVSLTA
IGAVLDYCLARPSGDDQENSVDELHADPSVLLAEQMALLCQSATTPALMD
SSGHTSRGEP
>t1273 sseG, putative pathogenicity island effector protein
MKPVSPNAQVGGQRPVNAPEESPPCPSLPHPETNMESGRIGPQQGKERLL
AGLAKRVIECFPKEIFSWQTVILGGQILCCSAGIALTVLSGGGAPLVALA
GIGLAIAIADVACLIYHHKHHLPMAHDSIGNAVFYIANCFANQRKSMAIA
KAVSLGGRLALTATVVTHSYWSGSLGLQPHLLEHLNDLTYGLMSFTRFGM
DGMAMTGMQVSSPLYRLLAQVTPEQRAPE
>t0190 staA, putative fimbrial protein
MKKAILAAAMVMAMGSTSAMAVEGGQIEFHGLVSATTCSKVVSSSRGNQA
TDGDVYLTTAAPGDITEGVAANAYGALPAPFSIILDCSGAADVTDATKAS
LVMDSSFSNTTGTLDNDTTLSVAGETGAENVNIAIHDADTKTQVKIDGAE
IHEASFKNKVATYNFMASYVRADASKEVTTGHVTTNAMYTFTYQ
>t0187 staD, putative fimbrial protein
MKRNKYLLAASVLAVIFPLTSQAENINVDFTATVLATTCSMSIAALDGSN
LSGDATSGYTLNVGDVGLDKIIKKSAESQKNFKFVAKDCSAALTKITTTL
ASSTDASGNFIKNQSAASGAATNVGMGFKRKSTTDETYFTPGSGSFSWTS
DERAANEVEMTVALRELTDGAGTTGAFSSTATFNFTYQ
>t0186 staE, putative fimbrial protein
MIMKKQILRVVIFSSLIATGAGISTMTYADGTNSLDLTVNANITAGTCSA
SVVEGDTITDTIAFGNVYISEVYAKSKIKLFKLRFSDCVGLKDKKAKFRL
APNNVACPGSSGTDGQFANASTSTTKAAMVAMEVWTTETPGGTGAVKLHC
WSKPEQTVDLSGASVTTPVDFPLSAMMVAQSGGTLQNMTAGDFYSPTTFT
ITYQ
>t0185 staF, putative fimbrial protein
MVIPMRRLREPTLATLFSGLATALFSATLYADTNVNFTASVQKDTCQIKI
DGNGTVNFATIAPAYFADGITAETDYEGGKEFTIKLISCPISDGKITNVT
FNFAPLNGQFSPENQQVFPNDIATDAGGVDNVGVVIFTTDSPRTNVLNTD
GSSRATFAASTYSDTVWTFYSRMQKIRSAEKVTTGELSSRVLVNVSYE
>t0184 staG, putative fimbrial protein
MLLKNTTWFAAFFLMMAIMSNCYAINTTLAVGDYASSEHDGPSGDSVFTD
NSHNFGQTIAIHKETALRQITVFNWSGIQYVMEMFCNGSGNHTYLQLTHN
YISAGKSYNGHPLYKTSIPGFYFTIEMTFLQPAENMTSSTFWFDKTSTPI
TSEFTEFPSACSRTNVYSNLGKLMYGLKIYAYVDSDFAPTEAQLQSFTLS
KNGDSDFYIDNPGSGLSNYKMKFNLAATGLKAVWPTCSASTISGTNVSGS
TVKLGSFYPKQIMEGLSPTKFQINLSSCQYINNIEVKLASNNVGTKNTSL
LTNNSTSNTKASGIGVLIEGLKSSSSAQMVLKPNDSSSIYKDTTNNTGDG
SPVGSATKSLYFQATLKPDGDNPTINPGDFKATAQFSITYP
>t2524 stbD, putative fimbrial protein
MLFSFRTLLFITSLFVSAGTWSSCIKVTDKSALSDAAIKAGYTAQNWIGA
LDTNTGNIGLPTVISISNSETFQPSGTLLASGIGNFLTAATGTPYSSKQV
LYRCDSADAGKLYEMYSTNGDSAFAGAFFTPEVEGAYYDVERNVAVRMTN
LSTGEYYSRFWKERQLTADSWFQDDKYIYIPASAFSNVLYEMFKIDSRKY
FAYQNPMDRDTWTQPRGYIAFKGPGLITERIKAGLDHASDYYGWPSYWPG
AWSTYNSVTYVRGALCKITDYPAIVKIPPVAVGILAAGGNSQAPFHVSLE
CESGAVSSALPSTSAANVAMGFVVNQPTAVAAARRLGLTTSAGGLSWLLD
THYGEPGVASGVGIRIYNDAGTPINLLPDRIKTGTGNARGWYGYKDLTTR
VSSGSVETYSGDFTASLEALGGQTVTAGSVNAQLQAVVSFQ
>t0704 stcA, putative fimbrial subunit protein
MKRSLIAASVLSAVFMSAGAFAVDEYDSGVLNINGKVVGTTCQFLGTNTA
EIRLNEIGADKIINLTPGQIYDAVTNQTQMPLKIKCQQGVAPRITFSSTQ
FDSHDITFNNGSAKGVGFAVYYGSTDNQIDPETGVTLDPNSSGEYDLTFL
ARYARLDGDVASGDVSSTLTLTVVTD
>t0707 stcD, hypothetical protein
MKLFLFIVMLLILPETYAACTGEITYQDNLIIREDFTINPNQSATYSHNF
NDTTCSGTYKITRMDPSDIIVGLYNDTVKLKLKIAWADNNTLTMPFTTGY
TVTVEPASSGANVNISAGSGNSVLINGVVSITSASSATQFTASLRFLGCL
LAGRGWNACAADYNSYLRGAGLYSFDLFVSYDRKQTTCKPEDLTITLPNI
ALSELYNTGKVSNKNAADNIRLQCDNLFGNAKQTSRKMTVYLSSSDLIPD
SYSVLRGAVNNGVGFILESGGKTVNISNTAEQGNASTLWKVDQVGTPLNS
DMITIPIIASYYVYDRDNIKPGDLKATALIYVKYD
>t2860 steE, fimbrial subunit
MKRVLILTLLITRFACADNLTFHGKLINPPACTINNGETLEVSFGSVIID
NIDGVNYLTEIPWTLTCDSSFRDDALTFTLSYLGTATPYSAKALTTNVPG
LGIELQQNGTVFPPGTSLTINESSLPTLKAVPVKQPGKEPAEGDFEAFAT
LQVDYQ
>t3662 stgD, probable fimbrial protein
MRLWTIILFSCFMVLISPVCRAGDGICHAVKGTYIYNLNLNDARIPAEKN
KAGTEVRDLETLTSSESYKVGCSCLTHFSSTFREVYYTARSPLSIDTTRN
GYTYYTLNDNLSIATSIAVLGRGFIAVPFEAEPNVVKDGMNCYTDTVEGN
LATLYTGSEVKVSFLINKPFVGQVAIPGTIVANLYGGLDASSSTASTDKL
AEVRIVGDIVAPQSCEIDSGQVIEVNFGKIPVADFSTTQGTAAAGHKVTK
TVQVKCTGMLDENIVYSTFNADPVDSSANMMKVLGNDDVGIMIYDKWDRM
VKVTGGKMDMDMGVNNGGAETNSLTFSAAPASATGARPQPGTFEAYATIT
LEITN
>t2875 syd, SecY interacting protein Syd
MDELTAQALKAFTTRYCDAWQEKHGSWPLSEELYGVPSPCIISSTRDAVY
WQPQPFEGEENVNAVERAFDIMVQPALHAFYTTQFAGDMPAQFADEKLTL
LQTWSQDDFRRVQENLIGHLVTQKRLKLPPTLFIATQENELEVISVCNLS
GEVIKETLGTRNRTVLAATLAEFLTQLNPLL
>t2549 tcfB, putative fimbrial subunit
MYTECTYITVINNKARLFFMNMKTSFIAAAVALATVYSFSVSAVQKDITV
TANIDSTLELLQADGSSLPSTMKLDFMPGKGLVHKSLQTRLYSNDQTKSV
NVKLLNAPQLINVLDPTKTIDMEVTLGGRSLTTTNSVLEAKTLFPDGKTG
DASALLNLDIGQKAGAALQNLPAGEYSGLVSLVISQAVTAG
>t2547 tcfD, putative fimbrail protein
MSNKMKWTSMTAHWSAIINFIRKYVYPARIIAILLMAGATLPQVADAITV
DLNYDKNNVAVITPVWSQEWSVANVLGGWVCRSNRNENEGACEETHLVWW
YAFGAYSKIRLRFREQISHAEITLILLGSVRDACYTGVINMNAAACQWGR
SLKLRIPSEELAKIPTSGTWKATLVLDYLQWGGDDPLGTSTTDITLNVTD
HFAENAAIYFPQFGTATPRVDLNLHRMNASQMSGRANLDMCLYDGGVKAR
SLQMKIEGSNKSGTGFQVIKSDSADTIDYAVSMNYGGRSIPVTRGVEFSL
DNVDKAATRPVVLPGQRQAVRCVPVPLTLTTQPFNIREKRSGEYQGTLTV
TMLMGTQTP
>t0001 thrL, thr operon leader peptide
MNRISTTTITTITITTGNGAG
>t2548 tsaC, outer membrane fimbrial user protein
MSRYFWMYYLLGLCSFTSQATLIPPPGFESLLEGQTEQIEVLLPGHSLGL
FPVVVKPDTVQFMSPLMVLESSGLAALPAAERQKALAALSRPLLRNSNLV
CGVSEAKDSSECGYVATDKEDVAVIFDENNAQLSLFLNRDWLPDEERRDK
RWLTPTPEGVSAFIHRQTLYLSDDLHSRNMTLNGSGALGLGDGRYLGGDW
AAIWNQSEHYNNSQAWFDNLFVRQDLGNQYYLQAGRMDQRNLSSATGGDF
GFSLLPLSRFDGLRTGTTQAYVNHEVDHNATPVMVQVTRNARIDIYRGSE
LLGSQFLTPGMHTLDTHSLPPGSYPLALRVYEDGILRRTETQPFSKGGNS
FSAQTQWFIQGGLEDTGDKASHYDGETVMAAGFQTGLRKNISLTEGISLA
HEAWYSETRLNSQHAVLDGTLDLSAGILHGTDSTSGNTEQVTYNDGFSAS
LWRNHTESDACSGRHPQSVHASMTCQTSMNASLSVSVGNWYALLGYSTSR
TEGRPVYRGYDDNSDKENVFWRQAYIPASHRESAQASATYSLNMAGMNIN
THGGVWRTRNDGMNDDGLFMSVSVSYASQPPTMTGSNRYTSAGTDIHSSR
NQKTQTSWNVNHVRSWQQDLYRELSVGFSGYNDDSWSGSLGGRMSGRMGE
LSATISNSHQRNAGSASSLTAGYSSSLALSRNGLFWGGGQDGEPASGMAV
NVESEGDEGSSGKVVSVRGSSQPFSLGFGQQSLLLMEGYNATEVTIEDAG
VSSQGMAGVKAGGGSRCYFLTPGHLLVHNISASMSRLYVGRVLDKDGRPL
LDAQPLNYPFLSLGPSGRFSLQSEHKESSLWLLSKNRILRCPMSVHKRRD
VMQVVGDVRCELSDVDALPQALQISPRVIRLLNVAGLLRHSVQEA
>t1338 tus, DNA replication terminus site-binding protein
MSRYDLVERLNGTFRQIEQHLAALSDNLQQHSLLIASVFSLPQVTKEAEH
APLDTIEVTQHLGKEAEALALRHYRHLFIQQQSENRSSKAAVRLPGVLCY
QVDNATQLDLENQVQRINQLKTTFEQMVTVESGLPSAARFEWVHRHLPGL
ITLNAYRTLTLINNPATIRFGWANKHIIKNLSRDEVLSQLKKSLASPRSV
PPWTREQWQFKLEREYQDIAALPQQAKLKIKRPVKVQPIARIWYKGQQKQ
VQHACPSPIIALINTDNGAGVPDIGGLENYDADNIQHRFKPQAQPLRLII
PRLHLYVAD
>t4353 tviA, Vi polysaccharide biosynthesis protein
MRFHHFWPPNDIYFGVGAAGIIEEVSLITNDRNYLFVNLNRYSLLNALNF
FTRMSDINKIIVIISSSRLMPLARFWLTECKNVIAVFDAATSVQDIIRNV
SQHQSGEKILTEQRDYRFRINRKDIVKMKYFLSESGMEELQDRFMNSSST
MYRWRKELAVKFGVREPRYLLLPDSVTLL
>t1510 ugtL, hypothetical protein
MKKSDGEIHEKTASWGILQSEWLRKCGRLLLLLLYRFVIGWAFFQLLAMI
VAGIFLLGVLLFHPIIFVQTIAITEKLNHASLDLWHILKLCLWHYGIIAG
FIFMAECTLSKSIRQVQRLSKKFGAQDFSSRP
>t3926 uspB, universal stress protein UspB
MISTVSLFWALCVVCIVNMARYFSSLRALLVVLRGCDPLLYQYVDGGGFF
TTHGQPNKQVRLVWYIYAQRYRDHHDEEFIRRCERVRRQFLLTSALCGLV
VVSLIALMIWH
>t3806 waaL, O-antigen ligase
MLTTSLTLNKEKWKPIWNKALVFLFVATYFLDGITRYKHLIIILMVITAI
YQVSRSPKSFPPLFKNSVFYSAAVLSLILVYSILISPDMKESFKEFENTV
LEGFLLYTLLIPVLLKDETKETIAKIVLFSFLTSLGLRCLAESILYIEDY
NKGIMPFISYAHRHMSDSMVFLFPALLNIWLFRKNSIKLVFLVLSAIYLF
FILGTLSRGGWLAVLIVGVLWAILNRQWKLIGVGAILLAIIGALVITQHT
NKPDPEHLLYKLQQTDSSYRYTNGTQGTAWILIQENPIKGYGYGNDVYDS
VYNKRVVDYPTWTFKESIGPHNTILYIWFSAGILGLASLAYLYGAIIRET
ASSTFRKVEISPYNAHLLLFLSFVGFYIVRGNFEQVDIAQIGIITGFLLA
LRNR
>t3797 waaP, lipopolysaccharide core biosynthesis protein
MVELKAPLTTLWRGKDAFEEVKTLQGEVFRELEMRRTLRFELDGKSYFLK
WHKGTSLKEIVKNLISLRMPVLGADREWHAIHRLHELGVDTMHGVGFGEK
GVNPLTRTSFIITEDLTPTISLEDYCVDWAVNPPDAQVKWMIIKRVATMV
RKMHAGGINHRDCYICHFLLHLPFTGREEDLKISVIDLHRAQIRQHVPLR
WRDKDLIGLYFSSMNIGLTQRDIFRFMREYFSLPLREILQKDSGLIHQAD
VKAARIKERTIRKNL
>t3804 waaZ, lipopolysaccharide core biosynthesis protein RfaZ
MGSVNFITHADVLQLIAKRTAEDCIIFLSGPTSRKTPLSLLRMKDVIAVN
GSVQYLLNNNVKPFLYLLTDVRFLHRRREDFYNFSRNSQFTIVNLDVYEQ
ASVDDQKYIEENCLIIRSFYRREKGGFLKKIKFNILKRVHKALLISVPLS
KRGRLAGFCKDISIGYCSYHTIAYTAIQVAYSLKYGRIICSGLDLTGSCP
RFYDESTSPMPSELSKDLFKILPFFTFMRKNVSDLNIFNLSDDTAIHYDI
IPYITASELEDEIYYDKIV
>t1928 xis, excisionase
MSNIIQLTPNKWVSEKVLIAVTGLKPGTITRARKESWMLGREYLHISPDG
NPKPSSECIYNREAVDQWIEAQKKNQPGAKTT
>t0011 yaaI, hypothetical protein
MRSVLTISVGLLFGLALSSVAHANDHKILGVIAMPRNETNDLALKIPVCR
IVKRIQLTADHGDIELSGASVYFKTARSASQSLNVPSSIKEGQTTGWINI
NSDNDNKRCVSKITFSGHTVNSSDMARLKVIGDD
>t0139 yacA, SecA regulator SecM
MVAASFGLPALSNAAETNTPARTTASTASKVNFSHLALLEASNRRPNFTV
DYWHQHAIRTVIRHLSFAMAPQTLPVADAPSPLQAHHIALLNTLSAMLTQ
EGTPPAIVRRLSLAYFAPQTAFSIPAWISQAQGIRAGPQRLS
>t0172 yacC, hypothetical protein
MKTFFRPVLFGSLMALCANSYALTESEAEDMADLTAVFVFLKNDCGYQNL
PNSQIRRALVFFAQQNQWDLSNYDTFDMKSLGEDSYRDLSGIGIPVAKKC
KALARDSLSLLAYVK
>t0163 yacH, hypothetical protein
MTLPFKPHLIALVCSAGLFAASGVLYVKSRAPEAPAQAAAPTSEPTQTAV
PAPVAKTTFTTAQIDQWVAPVALYPDALLSQVLMASTYPANVVQAVQWSR
DNPTLQGDAAIQAVASQPWDPSVKSLVAFPQLMALMGENPQWVQNLGDAF
LAQPQDVMDAVQRLRLLAQQTGSLKSTPQQTVTSVPKSSASTAVTTTTTT
SASTPATPSTVIKIEPANPQVVYVPNYNPTTVYGAWPNTAYPPVYLPPPP
GQQFADSFVKGFGYSLGVATTYALFSSIDWDDDDHHHHDDDHHDDDYHHG
GNGYQHNGDNININVNNFNRISGQNLPGQTMGWQHNPAWRNGVPYPNNTV
AQRFHPTNVSGGLSATQQAPVSRDSQRQAAMAQFQQRSHTSPANVSGETS
RDRQRKAASQQLNQIAQRNNYRGYDGTQNSSRREAAQQTLNKSTTQQHRS
ELKAKAQQHPVSQQQRDTARQRIESSTPQQRQAFRQNMHANVFSGNDSRS
PSWQSQQLRGLESRRGSHLNTEQRAAAREHFSEHHEFHRR
>t0212 yaeH, hypothetical protein
MYDNLKSLGITNPEEIDRYSLRQEANNDILKIYFQKDRGEFFAKSVKFKY
PRQRKTVVADGIGQGYKEVQEISPNLRYVIDELDQICQRDRSELDLKRKI
LDDLRHLESVVANKISEIEADLDKLTRK
>t0240 yaeP, hypothetical protein
MLTGGHVEKYCELVRKRYAEIASGDLGYVPDALGCVLKVLNEVAADSALS
ESVREKAAYAAANLLVSDYVNE
>t2475 yaiA, hypothetical protein
MPTRPPYPREAYIVTIEKGTPGQTVTWYQLRADHPKPDSLISEHPTAEEA
MDAKKRYEDPDKS
>t2481 yaiB, hypothetical protein
MVMKNLIAELLLKLAQKEEESKELVAQVEALEIIVTAMLRNMAQNEQEML
IRQVEGALEGVKPDASVPDHDTELLRQYVKKLLRHPRH
>t2449 yajI, putative lipoprotein
MTRRYLRILLVGSLLSLTACAPQSEVRQMHQSISTLNKEMTQLNQETVKI
TQQNKLNADSTRGVYLLPGANTPARLESQIGTLRMTLLEITPVADGAHAT
LRIQGKSRDPLPAFSATVEYGQIQGTTENYQEVNAQSLLVNAPASLLAPS
DVNIPLPLKGITPAQLGFIRIHDIQPVNQ
>t2386 ybaJ, hypothetical protein
MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAVNLQLNE
LIEHIATFALNYKIKYNEDNKLIAQIDEYLDDTFMLFSSYGINTQDLQKW
RKSGNRLFRCFVNATRANPVSLSC
>t2380 ybaM, hypothetical protein
MSLENAPDEVKLAVDLIVLLEENRLPARTVLRALEIVMRDYENKLKSTED
DSQTE
>t2168 ybfA, hypothetical protein
MEHYKDYPAHVIFARRAFAVTAGVLALPVMLFWKDRARFYSYLHRVWSKT
SDKPVWMAQAEKATCDFY
>t2188 ybfM, putative outer membrane protein
MRTFSGKRSTLALAIAGITAMSGWIVVPQAQASGFFDDSTLTGGIYYWQR
ERDRKDVTDGDKYKTNLSHATWNANLDFQSGYAADMFGLDIAAFTAIEMA
ENGDSGHPNEIAFSKKNKGYDEDYSGDKSGISLYKAAAKFKYGPVWARAG
YIQPTGQTLLAPHWSFMPGTYQGAEAGASFDYGDAGALSFSYMWTNEYKA
PWHTEMDKFYQADKKTNVDYLHSIGAKYDFKNDLVLEAAFGQSEGYVDQY
FAKASYKFDLGGNPFTTSYQFYGARDKVDDRSVNDIYDGTAWLQALTFGY
KVAEVVDLRLEGTWVKADGQQGYFLQRMTPTYASSNGRLDIWWDNRSDFN
ANGEKAVFFGAMYDLKNWNLPGWAVGASYVYAWDAKPATWQSNPDAYYDK
NRTIEESSYSLDAVYTLQEGRAKGTMFKLHFTEYDNHSNIPSWGGGYGNI
FQDERDVKFIVIAPFTIF
>t2107 ybhT, hypothetical protein
MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGIGQKDQSRQNR
>t2065 ybiJ, hypothetical protein
MKTIKYAVAAIALSTLSFGAFAAEPVSASQTQNLHKIGVVSADGATTLDG
LEAKLAEKAAAAGASAYNITSAVGNDKMSGTAVIYK
>t2023 ybjC, hypothetical protein
MRTIGVLPKSVLILECLGMILLALALLSLNHYLTLPAPFNTSLAGVLMVF
LGVVLILPAAVAMMWRIAQLLAPQLMKRPPDISSRSDREKHNESDH
>t1809 yccD, hypothetical protein
MANITVTFTITEFCLHTGVTEEELNEIVGLGVIEPYEDDNADWQFDDRAA
SVVQRALRLREELALDWPGIAVALTLLEENSRLREENRLLLQRLSRFISH
P
>t1756 yceB, hypothetical protein
MSAGATCCFVSRLGHRQHAKERHEKVFFAAALVVSGLLVGCNQLTQYTIS
EQEINQALEKRNNFSKDIGLPGIADAHIVLTNLASQIGREEPNKVTLTGD
ARLDMNSLFGSQKATMKLKLKALPVFDKEKGAIYLQEMEVVDATVTPEKM
QSVLQTLLPYLNQSLRSYFNQRPAYVLREDSSKGEALAKKLAKGIEVKPG
EIVIPFTN
>t1646 yciC, hypothetical protein
MSITAKSVYRDAGNFFRNQFITILLVSLLCAFITVVLGHAFSPSDAQIAQ
LSEGEHLAGSAGLFELVQNMTPEQQQILLRASAASTFSGLIGNAILAGGI
ILMIQLVSAGHRVSALRAIGASAPALPKLFILIFLTTLLVQIGIMLIVVP
GIIMAIVLALAPVMLVEEKMGVFAAMRSSMRLAWANMKLVAPAVIGWLLA
KTLLLLFAPSFAVLTPNVGAVLANTLSNLISAVLLIYLFRLYMLIRQ
>t1256 ydhZ, hypothetical protein
MTKPTKDDELYREMCRVVGKVVLEMRDLGQEPKYIVIAGVLRTALANQRI
QRSALEKQAMETVINALARS
>t0980 yebB, hypothetical protein
MKKHYPVQYETGDIVFTCIGAALFGQISTASQCWSNHVGIIIGHNGDDYL
VAESRVPLSTVTTLSLFIQRSAGQRYAVRRLCGGLTVEQKLAIMEQVPAR
LNKFYHTGFKYESSRQFCSKFVFDIYKEALCIPVGDIETFEELLHSNPDA
KLTFWKFWFLGSIPWDRKTVTPASLWQHPNLELISACGIENPQREAEGE
>t0996 yebF, hypothetical protein
MNKRGALLSLLLLSASVSAFAASTESKSVKFPQCEGLDAAGIAASVKRDY
QQNRIVRWADDQKKVGQADPVAWVNVQDVVGQNDKWTVPLTVRGKSADIH
YQVIVDCKAGKAEYKPR
>t0927 yecF, hypothetical protein
MSTPDFSTAENNQELATEVNCLKAMLTLMLQAMGQADAGRVILKMEKQIA
QMDDEAQAAVFSSTVKQIKQAYRQ
>t0942 yecH, hypothetical protein
MDSIHGHEVLNMMIESGEQYTHTSLEAAIKARFGERARFHTCSASDMTAA
ELVAFLAAKGKFIAVEDGFSTHESKICRH
>t0913 yedD, putative lipoprotein
MKKVAIVGALLVLAGCAEVENYNDVVKTPAPAGLEGYWQSKGPQRKLVSP
EAIASLVVTKEGDTLDCRQWQRVIALPGKLTMLSDDLTNVTVKRELYEIE
RDGNTLEYDGMTLQRVARPTPECAAALEKTPLPTPLP
>t0703 yehE, hypothetical protein
MKKYLLMGIIVSAYGISVPVFASDTATLTISGKVTAPTCSTEVVNAQLQQ
RCGNTIHVSTLQTPAATPMRGVTTQLYTVPGDSTRQIVVNHYD
>t0635 yejG, hypothetical protein
MNTLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGQSTENSLVALKLLSPA
GDSAWSVMHKLSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNF
GVAIAEPFSNNNPF
>t0444 yfeC, hypothetical protein
MFKERMTPEELANLTGYSRQTINKWVRKEGWATSPKPGVQGGKARLVHVN
EQVREYIRSAERSVDHHADTFTPASNASLEALLMTLAKEMTSSEQKQFTS
LLVREGITGLLQRLGIRDSK
>t0443 yfeD, hypothetical protein
MKRLRSKMTTEELAECLGVARQTVNRWIREQHWKTEKFPGVKGGRARLIH
IDASVREFILNIPAFRKLPAFYQAEEAFAEYANATHSHAYRQIIDAVENM
SAQEQEKLALFLSREGIRGFLTRLGINEAD
>t0294 yfhG, hypothetical protein
MNLSLVRMSHVFVQTIKRCLLRWGIPVGISCLALTACVPHASQQLPGSAA
QDTLPHYQLADYLPTACADIWSLRGQAVETNPLYWLRTIDCADRLMPVQS
RAEARALTDDNWQNAFRRGILLADAKITPPERRAIVTRLEALSAQIPAQV
RPVYQIWHDGQALQLALSAERQRYSKLQQTSDSELDALRQQQQALQTQLD
LTTRKLESLTDIERQLSTRKPAGNYNADTPHTNDKPATSEDGAAPSPSQD
EVTP
>t2697 ygaC, hypothetical protein
MYLRPDEVARVLEKAGFTVDVVTNKTYGYRRGENYVYVNREARMGRTALI
IHPRLKDRSSSLADPASDIKTCDHYQNFPLYLGGETHEHYGIPHGFSSRI
ALERYLNGLFGDEKSD
>t2762 ygbA, hypothetical protein
MPGKRIAREKLTIKKMIALYESQCPQASAVQGHYDALFAYAQKRLDKCVF
GEEKPACKQCPVHCYQPAKREEMKQIMRWAGPRMLWRHPVLTVRHLIDDK
RPVPELPEKYQRKK
>t2833 ygbE, hypothetical protein
MPGMVKVTGFNMRNSHNITFTRSDAFMVDDDATSAFPGAVVGFVSWLLAL
GIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMF
TLLAVGALFGALFIWLLG
>t2838 ygbF, hypothetical protein
MSMLVVVTENVPPRLRGRLAVWLLEIRAGVYVGDVSAKIREMIWQQVSVL
ADEGNVVMAWATNTESGFEFQTFGVNRRIPVDLDGLRLVSFLPVENQ
>t2906 ygdB, hypothetical protein
MNRERGASSLILALLILILGSLLLQGVNQQQASYAARVTTQSMAIQRQAL
VQSALEWGRGQLWSGVTEMECRRYSSSGARVCLRRLSGDEVVMAAQDDGM
TLWRLGNVIQGSIVFSPHGWSDFCPLKEVALCRIP
>t4092 yhdN, hypothetical protein
MWLLDQWAERHIIEAQRKGEFDNLPGRGEPLILDDDSHVPAELRAGYRLL
KNAGCLPPELEQRRDAIQLLDILNSIREDDPQYHQVSRQLSLLELKLRQA
GLSTDFLHGEYAEKLLHKINDN
>t3305 yhdV, possible lipoprotein
MKRLIPVALLTTLLAGCAHDSPCVPVYDDQGRLVHTNTCMKGTTQDNWET
AGAIAGGAAAVAGLTMGIIALSK
>t4034 yhfG, hypothetical protein
MKKLTDKQKSRFWEQRRNVNFQQSRRLEGIEIPLVTLTADEALARLDELR
RHYER
>t4026 yhfL, hypothetical protein
MFKFVKIAVVAGVLATLTACTGHIENKKNNCSYDYLLHPAISISKIIGGC
GPAADQ
>t4007 yhgE, hypothetical protein
MESVALSRTTRWGMLLTGLLQGVLCYLLMAWLVPQNSDWLFYGMPATIAL
SSMLLLTVVSFKQRALWGWLGLIFVVVLAMSGWLKWQVEAVEKWRLAELL
WLYGLRLVLMAMLVLPWMQYQLHSQTGSARYPQFYLRLWHNVLTLFIVLV
ANGLFWLVLLLWSALFRLVGIRFFSTLFFETEAFIYVTIGLITALAVILA
RTQSRLVAAVQKLLTLIATGLLPVVSLLALLFIVTLPFTGLEAISARVSA
TGLLSTLTLMLLLLVAIVNEPQKRVLPYPRVLRGMISASLCVAPIYMLLA
GWALWVRIQQYGWTPDRLYGALTASVLLVWSFGYLIGLLRRGRDPDEWQG
KVILSVSLLTLVILLLLASPVLDVWRISVNSHMARYHSGKITADQISLYM
LDHSGKPGQEALKSLRDDEAFTQNRKRNRELMTFLQRNKVPPTADDLARV
VMIAPGSQKPDAAFWAFVKEQSYSDASCLEPDACVLVSQDLNGDGQPEQV
LYNFIVAESQVYGLKEGKWTQKAFARLPDGFSKTQLLHAIAGHRLDSAPK
AWRDIIVDGQRLDVDYYNE
>t3999 yhgG, hypothetical protein
MASLIQVRDLLALRGRMEATQISHTLHAPQPMIDAMLNQLEIMGKAVRIP
EEPDGCLSGSCKSCPEGKACLREWWALR
>t3895 yhjS, hypothetical protein
MRDTVDPVFSLGISSLWDELRHMPTGGVWWVNADRQQDAISLVNQTIASQ
TENANVAVIGMEGDPGKVIKLDESHGPEKIRLFTMPDSEKGLYSLPHDLL
CSVNPTHYFFILICANNTWRNITSESLHKWLEKMNKWTRFHHCSLLVITP
CNNSDKQSSLLMGEYRSLFGLASLRFQGDQHLFDIAFWCNEKGVSARQQL
LLCQQDERWTLSHQEETAIQPRSDEKRILSHVAVLEGAPPLSEHWTLFDN
NEALFNDARTAQAATIIFSLTQNNQIEPLARRIHTLRRQRGSALKIVVRE
NIASLRATDERLLLGCGANMIIPWNAPLSRCLTLIESVQGQQFSRYVPED
ITTLLSMTQPLKLRGFQPWDTFCDAIHTMMSNTLLPADGKGVLVALRPVP
GIRVEQALTLCRPSRTGDIMTIGGNRLVLFLSFCRVNDLDTALNHIFPLP
TGDIFSNRMVWFEDKQISAELVQMRLLSPELWGTPLPLAKRADPVINAEH
DGRIWRRIPEPLRLLDDTAERAS
>t3893 yhjU, hypothetical protein
MTQHTQTPSMPSPLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVF
MAFLLMPIPKYRLHRLRHWIAIPVGFALFWHDTWLPGPQSIMSQGTQVAE
FSSGYLLDLIARFINWQMIGAIFVLLVAWLFLSQWIRVTVFVVAIMVWLN
VLTLTGPVFTLWPAGQPTDTVTTTGGNAAATVATAGDKPVIGDMPAQTAP
PTTANLNAWLNTFYAAEEKRKTTFPAQLPPDAQPFDLLVINICSLSWSDV
EAAGLMSHPLWSHFDILFKHFNSGTSYSGPAAIRLLRASCGQPSHTRLYQ
PANNECYLFDNLAKLGFTQHLMMDHNGEFGGFLKEVRENGGMQSELMNQS
GLPTALLSFDGSPVYDDLAVLNRWLTGEEREANSRSATFFNLLPLHDGNH
FPGVSKTADYKIRAQKLFDELDAFFTELEKSGRKVMVVVVPEHGGALKGD
RMQISGLRDIPSPSITNVPAGVKFFGMKAPHEGAPIDINQPSSYLAISEL
VVRAVDGKLFTEDSVNWNKLTSNLPQTAPVSENANAVVIQYQGKPYVRLN
GGDWVPYPQ
>t3878 yhjY, hypothetical protein
MIVRKRRGRRTLRCLAGLMACSFFINTTYAWQQEYIAEAAPGHTTERYTW
DSDHQPNYNDILAERIQSTQNTVGPVLSLADETPLDATSGISMGWNFSLS
RRVTTGPVAALHYDGSTSSMYNEYGDSATTLALTDPLWHASVSTLGWRVN
SQFGDVRPWAQISYNQQFGENIWKAQSGLSRMTAGNQAGNWLDVTVGADV
LLNPHLAAYAAFSQAENSATDSDYLYTLGVSARF
>t3860 yiaB, hypothetical protein
MDDHVARRKRIFGLGMLVVGAVVYLVGLWPGCHTLSEKGYFFAAIVMCGF
PVLIRQEHTGNDRLLSRCKSLLLLGIGMVAVGVFNLALAGALKILCLVAL
GVSIYGTDLYASYSDDE
>t3872 yiaF, hypothetical protein
MATGKSCSRWFAPVVALLMVFSLSGCFDKEGDQRKAFVDFLQNTAMRSGE
RLPTLTADQKKQFGPFVSDYAILYGYSQQVNQAMDSGLRPVVDSVNAIRV
PQDYMTQREPLRQANGSLGVLAQQLQNAKLQADAAHGALKQADDLKPVFD
QVYKKVVTVPADALQPLIPAAQIFTQQLVQVGDYIAQQGEQVSFVANGIQ
FPTSQQASQYNALIGPLASQHQAFNQAWTAAVNATQ
>t3771 yicH, hypothetical protein
MKLIGRLLLYVLIACLVVIFGFYFLLQTRWGADHVSNWVSENSGYHLTFD
VMDHRFSAPSHLLLENVTFGRDGQPATLVAKTVDIGLSIRQLTAPLHVDT
ILLQDGTLNISVQTAPFPFEADRLQLRNMALNSPGSEWRLSAQRVNGGVM
PWRPEAGRVLGNKAQIQLSAGSLTLNDVPATNVLIEGSIDHDQMMLNTVG
ADMARGALTGVARRNADGSWVVENLRLNDIRLQSDKSLSEFFAPLTTVPS
LQIGRLEVTDSSLQGPDWAVTDLDLSLRNLTLRKEDWQSQEGKLSMNASE
FIYGSLHLLDPILNAEFSPQGVALRQFTTRWEGGMVRTSGAWLRESKALI
LDDTAIAGLEYTLPENWKQLWMKPLPDWLNSLTLKKFSASRNLVIDIDPT
FPWQITALDGYGANLELVQHHQWGVWSGNATLNAAAATFNRVDVRRPSLS
LTANASTVNISDLSAFTEKGILEATASVSQQPQRQTQISLNGRGVPMDVL
QQWGWPALPIAGDGNIQFTASGNIQADAPLKPTVNGQLHAVNAQKQQIMQ
TMQAGVVSGGEVTSTEPAL
>t3732 yicN, hypothetical protein
MRMIWLILATFVVVFIVGFRVLTSDTRRAIRRLSERLNIDVVPIESMIDQ
MGKTAGGEFLQYLHRPDESHLQNAAQVLLIWQMVIVDGGDQNLQRWHRLL
QKARLAAPITDTQVRLALGFLREMEPDMQEINAFQLRYNAFFQPEEGVHW
LH
>t3713 yidG, hypothetical protein
MPDSRKARRQNDPGLQPERTSLAWFRTLLGYGALMALAVRHYWHQAGFLF
WVSISVLAIVAVILWRYTLSRNLMDVAHSDFSETHVMRDKFLISLAVLSL
AILFAVTHIRQLIAFIGDFI
>t3344 yigF, hypothetical protein
MDKDYINDGSLSEKWKYRFSFYDQHGFPGFWKVSPEYKQAFKALKPRQRL
TIQINFIAFFFSWIYLFVLGLWKKAIIVILLGIVAIFIGALIGVNILGLV
VAAYVGVNTNKWFYEKEVKGINTWSL
>t3536 yiiQ, hypothetical protein
MKPGCTLFLLLFSALTASITAHAQLSSSTTTAPYLLAGSPTFDLSISQFR
ENFNRQNPDLPLNEFRAIENSRDKANLTRAASKINENLYASTALERGTLK
VKSMQITWLPIQGPEQKAAKAKALEYMAAIIRTVAPLLTKEQSQKKLQKM
LIAGKGKHYYAETEGAVRYVVADNGEKGLTFAVEPIKLALSENLEGAN
>t3460 yjaH, hypothetical protein
MNSFIEGACQPLLSVWRRAFLFSGALLLTACSHNASPPPFTASGFAGDHG
AVRIWRKDTNDEVHLLSVFSPWHSGSTTTSEYRWQGDTLSLIELNIYSKP
PEHIRARFDAHGELSFMQREVGGQKQQLSNDQIALYRYRAEQIRQTSDAL
RLGRVILRQGRWHADHTVTTCEGETLKPDLDSWAISHIERRQNHSSVEVS
VAWLEAPEGSQLLLVANSDFCHWQPQAKTF
>t4383 yjeI, hypothetical protein
MIMGNNMHVKYLAGIVGAALLMAGCSSSNELTAAGQNVRFVEDKPGAECQ
LIGTATGKQSNWFSGQHGEEGGSMRGAANDLRNQAAAMGGNVLYGVSSPS
QGMLSSFVPTASEMNGQVYKCPN
>t4384 yjeJ, hypothetical protein
MALTIKGLNTGVIRHNDKFIALALKVKSLRNKETLLFFPVLALRDLLIGL
EHRLYLQYSLPEQEQEKRQKAKSSHVLKMHENIPAILREELENADVNQRV
ESLALSDNTEKVLTFTLKLHNGSHLDLQVGEWQVEVLVMAIIHAINNAEM
RELALRISSMLDFLPLYDADCLDNGNIEFDTYNQPDWKHNLYNHYLALVY
RYTDEAGQSHDCGTIIKTRSQSGSKEAEAISRRLLNFSPRLKKLEGKPCK
VFVRTLGTGKAARLTQDQCMRALHNLRMASSQEKR
>t4398 yjeN, hypothetical protein
MPDKAPRVASGVYCPMENVMNDSSRDPIITEDEIRALNFTPEDILEIEKV
ILSSVHVARRKVAMVVGMTIGTLRDRDEDKWKHVSDIYCAYVVRCLVFRG
ELVGYGDLFRMRYSEINLPVADLDA
>t4424 yjfK, hypothetical protein
MARDRGSPMMQFFQRLLGKTSTPAPIRGPLGLHLNAGFTLDTLAFRLLES
SLLVALPGEKYTVAAASRIDLGGGSQIFRYYTSGDEFLQINTTGGTDVDD
IDDIKLFVYEESFGINEERHWRSAIAPAAIGPMTLNWQERRWQRFFNHEE
PGNIEPVYMLEKVENQQAEKWDVHNFTMGFQRQVTDDAWEYLLLNGEESF
NERGEPEWVFSRALGVDIPLTSLTVIG
>t4426 yjfM, hypothetical protein
MAKKRKSRSTSGVGHSAIRRIAEPVNPFERQRNRYTPKYLTLAIMGGAAF
FLLKGYGDGSNSDNDGDGTFYATVNDCIDDGNSASVCADGWNNAKTEFYA
NVPKQLTQESCQSQFGDCYFDITERSWVPVLSGFLLSRAIRQNRDEQYIY
SSGGSSYVSRPVWRTTSGDYAWRSGTGKSDTVTSPGYIIRKASTVSRGGY
GRSSSARGHWGG
>t4429 yjfN, hypothetical protein
MMERPVKCGVCNKELTMKQELALSSLLLSAGLVSTTAQSAEFASADCVTG
LNEIGLISVSNVAGNPQDVERIIALKADEQGASWYRIIQMYEEQQSNNWR
VQAILYA
>t4430 yjfO, hypothetical protein
MIIRKRDRVMRRFAPLIAALLLSACSMLQGTPQPAPPVADHPQEIRRDQT
QGLQRMGTVSALVRGSPDDAIDEIRAKAAAAKADYYVILMVDETVVTGQW
YSQAILYRQ
>t4440 yjfY, hypothetical protein
MRARIMLFLAALLPGITATAAIELNNHQATNMDDVRSLGVIYINHNFATE
SEANLALNDEADARNAMYYHAILIREPGSNGNIHASANIYR
>t4573 yjiW, hypothetical protein
MAVREHGHIAHHNHYSRRIAIMTTVHSIADPCDPEVSPTNNRHLTVSYAS
RYPDYTRIPALTMKGQWLEAAGFATGTEVDVRVMNGCIVLTAQQPQPEES
ELMQSLRQACKLSARKQKQVQAFISVMAGSK
>t4585 yjjA, putative secreted protein
MNTAKHVLCCAAIASVLISTSGIAASQLSAQGANAQQGGMSLSALTGLLS
RGAQSLSADNMNNAAGILQYCAKQKLASATNVENVKNQILNKLGLDTTQQ
EQDTNYLNGLQGLLKTKDGQQLNLNNIGSTPLAEKVKTKACDLVLQQGLN
FLS
>t0893 yodD, hypothetical protein
MKTAKEYSDTAKREVSVDVDALLAAINEISESEVHRSQEDPERVSVDGRE
YHTWHELAEAFELDIHDFSVTEVNR
>t4104 yrdB, hypothetical protein
MNQAIQFPDREEWDTAASAIIFPALVNGMQITCAIKKDVLAYRFGGETAE
QWLAIFREYRWDLEEEAEALILAQQEDDHGWIWLS
>t4017 yrfA, hypothetical protein
MKVSRGVLLCFCLLTLTGMRDPFRPPEDRCRIAELSQWRYQGAVRKGERW
TGILKDSQQKWRRVEEGQTLENGWTIVRLTAEALTLTTGKNCAPPQWRWL
RQGADNEAMDSHNTDSLDARRAGGKSGESDAGG
>t4015 yrfB, hypothetical protein
MNALFDIWYGMSRRGRVFCWCAGVLCLTLTVALSVGYPGWKTLDTQHARL
SQQREAARQQWRHLRRLSVAAEPLFGRTIENPRPFSPLDFQAPPLRLLHW
QPSAQGGEMALKTSWDAVPSLFVRLAESEMSVSRFSLRREGAELLITLQL
ERLANEG
>t4014 yrfD, hypothetical protein
MAFKTWQIGLHIQQHEALAIAVIRGASGWSLQRWWRLPLMNASTAEGTIP
DPQSLAHVLRPWSRELPLRHRIYLSFPANRTLQRAFPHPPMRLREREQVA
WLSQTMARELDMDPDLLRFDFQDDALSPAFNVTAVQSKEISELLTLAQTL
NVRIAAVTPDACALQRLLPFIPSGRQCLVWRDESQWLWATRYAWGRKSAR
EAMTLHDLAATLSVVPEHISLCAEGEFDPWRAVTVRQPPVPPDGYRFAIA
LGLAMGEIR
>t4459 ytfK, hypothetical protein
MRLKTVGPIVAPQGRNTLQETTMKIFQRYNPLQVAKYVKILFRGRLYIKD
VGAFEFDKGKILIPKVRDKQHLSVMSEVNRQVMRLQTEMA