Gene list
Applied filters:
COG category: Unclassified
Gene type: CDS
Genomic element: pSB4_227
Number of genes found: 40
![]() | ||||||
Show UniProt / TrEMBL protein name | ![]() |
View in Fasta format (DNA) | ![]() |
View as list | ![]() |
|
![]() |
# Shigella boydii Sb227, Sb227 >SBO_P080 hypothetical protein MGKNSEINKYSGALDKVKIFIFRSCNKIDIREKTYCLNLAEIVIRNECPR QISMFCHARNWLGGSRHQDLASC >SBO_P135 hypothetical protein MAHILALPLRHNRSYIAFINGLNLRPHNEARYSFARISGGPIFDSFGHPL TEVPEYFSTGTTPKNAAACSRIGN >SBO_P114 hypothetical protein MAQVNMSVRIDAELKDAFMAAAKSMDRNGSQLIRDFMRQTVERQHNSWFR DQVAAGRQQLECGDVLPHDMVESSAAAWRDEMSRKIADK >SBO_P147 hypothetical protein MISPIKNIKNVFPINTANTEYIVRNIYPRVEHGYFNESPNIYDKKYISGI TRSMAQLKIEEFINEKSRRLNYMKTMYSPCPEDFQPISRDEASIPEGSWL TVISGKRPMGQFSVDSLYHPDLHALCELPEISCKIFPKENSDFLYIIVVF RNDSPQGELRANRFIELYDIKREIMQVLRDESPELKSIKSEIIIAREMGE LFSYASEEIDSYIKQMNDRFSQIKARMSVT >SBO_P027 hypothetical protein MRNCSACSSVKAQKNFAQKPNGRYWKHRSESAHFRKKWRKRWVSNMTRYC HLPCASLQPVNRYLPHFPVKPGSSVRKRNAARPVVANSVLWDVMCQSSWS LSAAPLRLSKHNVRNWPVAGVTISCRHRCLQNPLHAVMPERGFWPMLSLE NMQAICRYTASQKYNVVRAWS >SBO_P058 hypothetical protein MDEKKLKALAAELAKGLKTEADLNQFSRMQTKLTVETVLNEELTDHLGHE KSYIR >SBO_P026 putative IS orf, fragment MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL VAKLQRMQFG >SBO_P101 hypothetical protein MINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQ KMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLE QIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGF NDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHY ANSSSWKSKRLC >SBO_P020 IS91 ORF MLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQDEIGLRYNSHRTKRE ENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEAAITGRCGVRHNGDSE KNGEANHKERDVSAVTEG >SBO_P087 IS1294 transposase MVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQMVKQFLSRDPF ECVLCGGRMVYLRAIAGLNVEGLKKNARDISLLRYMPA >SBO_P073 IS1294 transposase MVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQMVKQFLSRDPF ECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >SBO_P097 hypothetical protein MLTRVKYPTLASICLTESHTALRINKIILITFDKRANELWRNQLSFMSKI QRYGEPKSVHPHRLPLQQLRADMLPETASV >SBO_P071 ISSfl1 ORF1 MINIIFNRLLSLLDANGFIDWSATALDGSNIRALKCAAGAQKNIPISTEI MARVALAAVLAPKSIWQQTEVASR >SBO_P136 IS1294 transposase MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIGVEAVTKMLACGTRILGV KEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQLLL KAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLNFRYLDHKTGETATETLTQRELVAR LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >SBO_P088 putative IS91 ORF2 MVCNYRYKNRQCHCLSGGYMARSAKPRKRKPASQRSKLPRYVVKLHEDDF FDEEDAEVLRFDSFDDAVECCADLNIPFFVDAGNKKLVFWFVRVDDEGYP EIARCTEREFATILSGISAGGMYCPECGTVHWPDGVAPPF >SBO_P007 hypothetical protein MWCCRGTVCTIPYVDQYNRNDNFRFRAQPKYILGHLSNRLPDTAPFFNKK INHF >SBO_P075 IS1294 transposase MTRSGGDFQPRPLKRLFTANQCWTSFLDAGGLRDIGVEAVTKMLACGTRI LGVKEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ LLLKAWSERLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN TARYLGRYLKKPPIAASRLAHYNGEASLSFRYLDHKTGETATETLTQREL VARLKQHIPE >SBO_P090 IS91 ORF MACGTTLMGYTQWCCSSPDCCHTKKVCFRCKSRSCPHCGVKAGAQWIQYL LSLVPDCPWQHIVFTLPCQYWSLVFHNRWLLAEMSRIAADVILEICHQAD VEPRIFTVIHTWGRDQQWHPHIHLSTTAGGVTSGHTWKNLHFYARKVMSM WRYRITRLLSRKYPDLVMPDALAAEGSSKREWNRFLDTHYRRGWNVNVSR VMDNATHVAVYFGSYLKKPPVPVSRLEHYAGQDEIGLRYNSHRTKREEYL LMSGDEFMERFSWHVADKGFRMVRYYGFLSPAKRRLLEEVVYIITETVRK TAMQITWRGMYQRLLKVDPLKCVLCGSQMRFTGLKRGYRLAEQVLMHEPL ARMRWCG >SBO_P079 IS1294 transposase MAKVCYAQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDIS LLRYIPG >SBO_P139 IS1294 transposase MTRSGGDFQPRPLKRLFTANQCWTSFLDAGGLRDIRVEAVTKMLACGTRI LGVKEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ LLLKAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN TARYLGRYLKKPPIAASRLAHYNGGASLNFRYLDHKTGETATETLTQREL VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC YAQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM PA >SBO_P019 hypothetical protein MKVSFKSLGYIFHDIYNKKHTIDEFNDVVRKAVLSGKINELNACHKVAIF LAEKDNEITKKDKAKIIDTLTENYSIEFQQLMNISERTLNSSLYITPGES GFVSFVNREGKICHTAYVKSSDNSMAYYHANYSSIDKYITDMCGLICMRH IESTCIIFYMLDEKVLSAIAEFMNEKGWRAAFCSAKNLYKCV >SBO_P032 hypothetical protein MPGATVADEFDKTLAFLEAIVNADNETTIGEIRSFADTLGAVRFNRNKIN RQLSKPNLASLALEHEVI >SBO_P042 putative protein encoded within IS MKSTCSTDWKHYPRLLRQSRHRRPLPEHVSREIHHLEPEESCCPECGGEL DYSGEISVFLPERHTILQ >SBO_P110 hypothetical protein MVREPGYNLQSVVERLTTGNQPGQQPRGLTENQNLNQPEIIPGAVLHHQS PIPVSVLRGLTRSPDSVLLSPPPVPFSVGPYTTTEKGARPGAQPYYTSGK SLYAEKSDHAVPAFM >SBO_P021 putative IS91 ORF2 MDAGNKKLVFWFVRVDDEGYPEIARCTEREFATIPAGINADGMYCPECGT VHWPDGVIPPF >SBO_P015 hypothetical protein MQQRSAALHAAGAAYPGNIFVDTTFRPYPDQWAFLASMIPMNAHDIEPTI LRATGNTHPLDVTFIHEEDLAIPWKPEQSSVYAHVNPYGIFELDMETRLP IEVVA >SBO_P112 ccdB, post-segregation toxin MRTGTGEMQFKVYAYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLS DKVSRELYPVVHVGDESWRMMTTDMASVPIFVIGEEVADLSHRENDIKNA INLMFWGI >SBO_P128 finO, FinO MTEQKRPVLTLKRKTEGTATVRSRKTIINVTTPPKWKVKKQKLAEKAARE AELAAKKAQARQALSIYLNLPTLDEAVNTLKPWWPGLFDGDTPRLLACGI RDVLLEDVAQRNIPLSHKKLRRALKAITRSESYLCAMKAGACRYDTEGYV TEHISQEEEAYAGARLAKIRRQNRIKAELQAVLDEK >SBO_P131 hmo, putative regulator MAKTKQEWLYQLRRCSSVNTLEKIIHKNRDSLSNSERESFNSAADHRLAE LITGKLYDRIPKEIWKYVR >SBO_P033 ipgB2, IpgB2 MLGTSFNNFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSILSS VSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHSRKIGDNLRKQIFK QVEKDYHISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKNDTTS NVANLISDQFFEKNVQYIDLKKLRGNMSDYITNLESPF >SBO_P017 mkaD, mouse killing factor MPIKKPCLKLNLDSLNVVRSEIPQMLSANERLKNNFNILYNQIRQYPAYY FKVASNVPTYSDICQFFSVMYQGFQIVNHSGDVFIHACRENPQSKGDFVG DKFHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIG AQFTLYVKSDQECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDV RPEDWKYVSYRNELRSDRDGSERQEQMLREEPFYRLMIE >SBO_P077 ospC1, OspC1 MNISETLNSANTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDEKNLKDS ANIIKDFFRKTIAAQSYSRMFSQGSNFKSLNIAIDAPSDAKASFKAIEHL DRLSKHYISEIREKLHPLSAEELNLLSLIINSDLIFRHQSNSDLSDKILN IKSFNKIQSEGICTKRNTYADDIKKIANHDFVFFGVEISSHQKKHPLNTK HHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFLDKFSEV NKEVSRYVHGSKGIIDVPIFNTKDMKLGLGLYLIDFIRKSEDQSFKEFCY GKNLAPVDLDRIINFVFQPEYHIPRMVSTENFKKVKIREISLEEAVTASN YEEINKQVTNKKIALQTLFLSITNQKEDVALYILSNFEITRQDVISIKHE LYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYE NAEMIKLLLKYGATSDNKYI >SBO_P138 ospC2, OspC2 MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR SLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLS NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIRLRDISLED AIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSKFNFTKQDI AEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFINNGLVDVNKRFQKAN SGDTMLDNAMKSKDSKTIDFLLRNGAVSGKRFGR >SBO_P104 ospC3, OspC3 MKIPEAVNHINVQNNIDLVDGKTNPNKATKALQKNILRVTNSSSSGISEK HLDHCANTVKNFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR SLEHLDKVSRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLS NNILNIKSFDKIQSENIQTHKNTYSEDIKKISNHDFVFFGVEISNHQEKL PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQG FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLED AIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV AEMEKMNNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKAN SGDTMLDNAMKSKNPKMIDFLLKNGAILGKRFEI >SBO_P016 ospD2, OspD2 MPLNKTFSSSIFSTKNSLSTDMSVNRDNRTITSSIMRVSNSSELIQFKNK TAPYFSEKRNVKVNINGVAKDIYGRQIVCRHLASYWEMNFMETNGKVNYQ LLSTPDAIAKNVCLEKTEDFSKSPAYIYFVENKKWGTVITNFFYNMKKNG DFVRTLSACTLNHQMALGLKIKRVQESEKWVVQFFDPNRTVTHKRTVFTC DSHFELSQLSAKDFFDDFYWKIYGLEQPGQVIFEDRHNSPLTNTVKLLPD ELINSRVIYHAITKNLTEVLFILMEKYKNGEISQSKLVNLLATRSSDGTP AFYIALQNGCSDIIQVYGKILNMCNLSQETILTLLAAVGANNVPGLCMSF MNGHVDTIKAYGEIVFKTPLTSDKRLYLLAAKDSHDLPGLFFALQNGHAD SIRMFGSLLNKKMLSSEQIKELLKVKHGLFMALQNGHTKAIMAYGDILKI LPPHQEYIDELLWIKNPNGTSGLFMAFYNGHTETIRAFCNILKNYSFTTR RLVEMLSATNKDGIPGVFVSVVNRDKETILEYCRIIKENNLEPDTIAEQF SKKMKKTFIEIINRFNHFL >SBO_P076 ospD3, OspD3 MPSVNLIPSRKICLQNMINKDNVSVETIQSLLHSKQLPYFSDKRSFLLNL NCQVTDHSGKLIVCRHLASYWIAQFNKSSGHVDYHHFAFPDEIKNYVSVS EEEKAINVPAIIYFVENGSWGDIIFYIFNEMIFHSEKSRALEISTSNHNM ALGLKIKETKNGGDFVIQLYDPNHTATHLRAEFNKFNLAKIKKLTVDNFL DEKHQKCYGLISDGMSIFVDRHTPTSMSSIIRWPNNLLHPKVIYHAMRMG LTELIQKVTRVVQLSDLSDNTLELLLAAKNDDGLSGLLLALQNGHSDTIL AYGELLETSGLNLDKTVELLTAEGMGGRISGLSQALQNGHAETIKTYGRL LKKRAINIEYNKLKNLLTAYYYDEVHRQIPGLMFALQNGHADAIRAYGEL ILSPPLLNSEDIVNLLASRRYDNVPGLLLALNNGQADAILAYGDILNEAK LNLDKKAELLEAKDSNGLSGLFVALHNGCVETIIAYGKILHTADLTPHQA SKLLAAEGPNGVSGLIIAFQNRNFEAIKTYMGIIKNENITPEEIAEHLDK KNGSDFLEIMKNIKS >SBO_P134 repA, RepA MTDLQQTYYRQVKNPNPVFTPREGARTLPFCGKLMEKAVGFTSRFDFAIH VAHARSLGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIE CGLATESAAGKLSITRATRALTFLAELGLITYQTEYDPLIGCYIPTDITF TSALFAALDVSEEAVAAARRSRVEWENRQRKKQGLDTLGMDELMAKAWRF VRERFRSYQTELKSRGMKRARARRDADRQRQDIVTLVKRQLTREISEGRF TASREAVKREVERRVKERMILSRNRNYSRLATASP >SBO_P133 repB, RepB MSQTENAVTSSLSQKRFVRRGKPMTDSEKQMAAVARKRLTHKEIKVFVKN PLKDLMVEYCEREGITQAQFVEKIIKDELQRLDILK >SBO_P127 traX, F pilin acetylation protein MTTDNTNTTRNDSLAARTDTWLQSFLVWSPGQRDIIKTVALVLMVLDHIN LIFQLKQEWMFLAGRGAFPLFALVWGLNLSRHAHIRQPAINRLWGWGIIA QFAYYLAGFPWYEGNILFAFAVAAQVLTWCETRSGWRTAAAILLMALWGP LSGTSYGIAGLLMLAVSNRLYRAEDRAERLALVACLLAVIPALNLATSDA AAVAGLVMTVLTVGLVLCAGKSLPRFWPGDFFPTFYACHLAVLGVLAL >SBO_P129 yigA, hypothetical protein MNGFRNSSRNGQVWRYQRAGGRAVILEVSGRWMEAAEAWRRAACVAPRTD WQQFARKRAEHCHRRCRGRG