Gene Information

Name : tsh (APECO1_O1CoBM73)
Accession : YP_001481228.1
Strain :
Genome accession: NC_009837
Putative virulence/resistance : Virulence
Product : Tsh
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 58775 - 62908 bp
Length : 4134 bp
Strand : +
Note : temperature sensitive hemagglutinin; similar to AAA24698; identified by match to protein family HMM PF02395

DNA sequence :
ATGAACAGAATTTATTCTCTTCGCTACAGCGCTGTGGCCCGGGGCTTTATTGCCGTATCTGAGTTTGCTAGGAAATGTGT
TCATAAGTCTGTCAGACGTCTGTGTTTCCCGGTTTTATTACTGATCCCGGTACTATTCTCTGCAGGAAGTCTTGCGGGAA
CGGTCAATAATGAACTCGGGTATCAGTTATTTCGTGATTTTGCTGAAAATAAGGGGATGTTCCGCCCGGGGGCAACGAAT
ATCGCTATTTATAATAAGCAGGGAGAATTTGTCGGTACGCTGGATAAGGCAGCTATGCCTGATTTCAGTGCTGTGGATTC
GGAAATCGGTGTGGCGACACTGATAAACCCGCAGTATATCGCCAGCGTGAAACATAACGGGGGATATACAAACGTTAGCT
TTGGTGATGGTGAAAACCGTTACAATATCGTGGACCGGAATAATGCGCCGTCACTGGATTTTCATGCCCCCCGGCTGGGT
AAACTGGTGACAGAGGTTGCCCCTACTGCGGTGACGGCGCAGGGGGCAGTGGCTGGCGCATATCTGGATAAGGAGCGCTA
TCCTGTTTTTTATCGTCTGGGGTCTGGTACTCAGTATATTAAGGACAGTAACGGACAGCTGACAAAAATGGGAGGTGCAT
ATTCCTGGCTGACCGGCGGGACTGTCGGTAGCCTGTCATCCTATCAGAATGGAGAAATGATTAGCACCAGTTCAGGTCTG
GTTTTTGATTACAAACTTAATGGTGCAATGCCCATTTATGGCGAGGCCGGTGACAGCGGTTCGCCTTTATTTGCTTTTGA
TACTGTTCAGAATAAATGGGTGCTGGTCGGTGTTCTTACTGCGGGGAATGGCGCGGGGGGCAGGGGAAATAACTGGGCTG
TTATTCCACTGGATTTTATCGGGCAGAAATTTAATGAAGACAATGATGCCCCGGTCACGTTCAGAACATCGGAAGGTGGT
GCACTGGAGTGGAGCTTTAACAGCAGTACCGGAGCTGGTGCGCTGACACAGGGAACCACCACATATGCCATGCACGGGCA
GCAGGGAAATGACCTGAATGCTGGTAAGAACCTGATATTTCAGGGGCAGAATGGTCAGATTAACCTTAAGGATTCGGTTT
CTCAGGGGGCGGGTTCCCTGACGTTCCGTGATAATTACACAGTAACAACCTCTAACGGAAGTACCTGGACCGGTGCCGGT
ATTGTTGTGGACAACGGGGTGTCCGTAAACTGGCAGGTTAATGGTGTTAAGGGCGATAACCTGCATAAAATTGGTGAAGG
TACGCTGACGGTACAGGGTACAGGTATTAATGAAGGTGGCCTGAAGGTCGGGGACGGAAAGGTTGTACTGAACCAGCAGG
CGGACAATAAAGGACAGGTGCAGGCGTTCAGCAGTGTTAATATTGCCAGTGGCCGGCCGACCGTGGTACTGACTGATGAG
CGGCAGGTAAATCCGGATACCGTCTCATGGGGATATCGTGGGGGCACACTGGATGTTAATGGTAACAGTCTGACGTTTCA
TCAGTTGAAGGCGGCAGATTATGGTGCCGTGCTGGCGAATAACGTTGATAAACGGGCCACTATCACGCTGGACTATGCCC
TGCGGGCTGACAAAGTAGCACTGAATGGCTGGTCGGAATCAGGTAAAGGAACTGCCGGAAATTTATATAAATACAATAAC
CCGTACACAAATACGACGGATTACTTCATCCTGAAGCAGAGCACCTATGGTTATTTCCCCACGGACCAGAGCAGCAACGC
CACCTGGGAGTTTGTGGGGCACAGTCAGGGGGATGCACAGAAACTGGTAGCTGACCGTTTCAATACTGCAGGGTATCTGT
TTCACGGACAACTGAAAGGCAATCTGAATGTGGACAATCGCCTGCCTGAAGGCGTTACCGGTGCTCTGGTGATGGACGGA
GCTGCGGATATCTCCGGTACATTCACCCAGGAAAACGGGCGTCTGACGCTGCAGGGGCATCCGGTTATCCATGCATACAA
TACTCAGTCTGTGGCTGACAAACTGGCTGCCAGTGGAGACCATTCGGTTCTGACTCAGCCTACGTCATTCAGTCAGGAGG
ACTGGGAGAACCGCAGTTTTACCTTTGACAGGCTGTCACTGAAGAACACTGATTTTGGTCTTGGTCGCAATGCCACACTG
AACACAACCATCCAGGCAGATAACTCCAGCGTCACGCTGGGCGACAGCCGGGTATTTATCGACAAAAACGATGGCCAGGG
AACAGCCTTTACCCTTGAAGAAGGCACATCTGTTGCAACTAAAGATGCAGATAAAAGTGTCTTCAACGGCACCGTCAACC
TGGATAATCAGTCAGTGCTGAATATCAATGATATATTCAATGGCGGAATACAGGCGAACAACAGTACCGTGAATATCTCC
TCAGACAGTGCCGTTCTGGGGAACTCAACACTGACCAGTACCGCCCTGAATCTGAACAAGGGAGCAAATGCTCTGGCCAG
TCAGAGTTTTGTTTCTGACGGTCCAGTGAATATTTCTGATGCCACCCTGAGTCTGAACAGCCGTCCTGATGAGGTATCTC
ACACACTTTTACCTGTATACGATTATGCCGGTTCATGGAACCTGAAGGGAGACGATGCCCGCCTGAACGTGGGGCCGTAC
AGTATGTTGTCAGGTAATATCAATGTTCAGGATAAAGGGACTGTCACCCTCGGAGGGGAAGGGGAACTGAGTCCTGACCT
GACTCTTCAGAATCAGATGTTGTACAGCCTGTTTAACGGGTACCGCAATATCTGGAGCGGGAGCCTGAATGCACCGGATG
CCACCGTCAGCATGACAGACACCCAGTGGTCGATGAACGGAAACTCCACGGCAGGAAATATGAAACTTAACCGGACAATA
GTCGGTTTTAACGGGGGAACATCACCGTTCACGACACTGACAACAGATAATCTGGACGCGGTTCAGTCAGCATTTGTCAT
GCGTACAGACCTTAACAAGGCAGACAAACTGGTGATAAACAAGTCGGCAACAGGTCATGACAACAGCATCTGGGTTAACT
TCCTGAAAAAACCTTCTAACAAGGACACGCTTGATATTCCACTGGTCAGCGCACCTGAAGCGACAGCTGATAATCTGTTC
AGGGCATCAACACGGGTTGTGGGATTCAGTGATGTCACCCCCATCCTTAGTGTCAGAAAAGAGGACGGGAAAAAAGAGTG
GGTCCTCGATGGTTACCAGGTTGCACGTAACGACGGCCAGGGTAAGGCTGCCGCCACATTCATGCACATCAGCTATAACA
ACTTCATCACTGAAGTTAACAACCTGAACAAACGCATGGGCGATTTGAGGGATATTAATGGCGAAGCCGGTACGTGGGTG
CGTCTGCTGAACGGTTCCGGCTCTGCTGATGGCGGTTTCACTGACCACTATACCCTGCTGCAGATGGGGGCTGACCGTAA
GCACGAACTGGGAAGTATGGACCTGTTTACCGGCGTGATGGCCACCTACACTGACACAGATGCGTCAGCAGACCTGTACA
GCGGTAAAACAAAATCATGGGGTGGTGGTTTCTATGCCAGTGGTCTGTTCCGGTCCGGCGCTTACTTTGATGTGATTGCC
AAATATATTCACAATGAAAACAAATATGACCTGAACTTTGCCGGAGCTGGTAAACAGAACTTCCGCAGCCATTCACTGTA
TGCAGGTGCAGAAGTCGGATACCGTTATCATCTGACAGATACGACGTTTGTTGAACCTCAGGCGGAACTGGTCTGGGGAA
GACTGCAGGGCCAAACATTTAACTGGAACGACAGTGGAATGGATGTCTCAATGCGTCGTAACAGCGTTAATCCTCTGGTA
GGCAGAACCGGCGTTGTTTCCGGTAAAACCTTCAGTGGTAAGGACTGGAGTCTGACAGCCCGTGCCGGCCTGCATTATGA
GTTCGATCTGACGGACAGTGCTGACGTTCATCTGAAGGATGCAGCGGGAGAACATCAGATTAATGGCAGAAAAGACAGTC
GTATGCTTTACGGTGTGGGGTTAAATGCCCGGTTTGGCGACAATACGCGTCTGGGGCTGGAAGTTGAACGCTCTGCATTT
GGTAAATACAACACAGATGATGCGATAAACGCTAATATTCGTTATTCATTCTGA

Protein sequence :
MNRIYSLRYSAVARGFIAVSEFARKCVHKSVRRLCFPVLLLIPVLFSAGSLAGTVNNELGYQLFRDFAENKGMFRPGATN
IAIYNKQGEFVGTLDKAAMPDFSAVDSEIGVATLINPQYIASVKHNGGYTNVSFGDGENRYNIVDRNNAPSLDFHAPRLG
KLVTEVAPTAVTAQGAVAGAYLDKERYPVFYRLGSGTQYIKDSNGQLTKMGGAYSWLTGGTVGSLSSYQNGEMISTSSGL
VFDYKLNGAMPIYGEAGDSGSPLFAFDTVQNKWVLVGVLTAGNGAGGRGNNWAVIPLDFIGQKFNEDNDAPVTFRTSEGG
ALEWSFNSSTGAGALTQGTTTYAMHGQQGNDLNAGKNLIFQGQNGQINLKDSVSQGAGSLTFRDNYTVTTSNGSTWTGAG
IVVDNGVSVNWQVNGVKGDNLHKIGEGTLTVQGTGINEGGLKVGDGKVVLNQQADNKGQVQAFSSVNIASGRPTVVLTDE
RQVNPDTVSWGYRGGTLDVNGNSLTFHQLKAADYGAVLANNVDKRATITLDYALRADKVALNGWSESGKGTAGNLYKYNN
PYTNTTDYFILKQSTYGYFPTDQSSNATWEFVGHSQGDAQKLVADRFNTAGYLFHGQLKGNLNVDNRLPEGVTGALVMDG
AADISGTFTQENGRLTLQGHPVIHAYNTQSVADKLAASGDHSVLTQPTSFSQEDWENRSFTFDRLSLKNTDFGLGRNATL
NTTIQADNSSVTLGDSRVFIDKNDGQGTAFTLEEGTSVATKDADKSVFNGTVNLDNQSVLNINDIFNGGIQANNSTVNIS
SDSAVLGNSTLTSTALNLNKGANALASQSFVSDGPVNISDATLSLNSRPDEVSHTLLPVYDYAGSWNLKGDDARLNVGPY
SMLSGNINVQDKGTVTLGGEGELSPDLTLQNQMLYSLFNGYRNIWSGSLNAPDATVSMTDTQWSMNGNSTAGNMKLNRTI
VGFNGGTSPFTTLTTDNLDAVQSAFVMRTDLNKADKLVINKSATGHDNSIWVNFLKKPSNKDTLDIPLVSAPEATADNLF
RASTRVVGFSDVTPILSVRKEDGKKEWVLDGYQVARNDGQGKAAATFMHISYNNFITEVNNLNKRMGDLRDINGEAGTWV
RLLNGSGSADGGFTDHYTLLQMGADRKHELGSMDLFTGVMATYTDTDASADLYSGKTKSWGGGFYASGLFRSGAYFDVIA
KYIHNENKYDLNFAGAGKQNFRSHSLYAGAEVGYRYHLTDTTFVEPQAELVWGRLQGQTFNWNDSGMDVSMRRNSVNPLV
GRTGVVSGKTFSGKDWSLTARAGLHYEFDLTDSADVHLKDAAGEHQINGRKDSRMLYGVGLNARFGDNTRLGLEVERSAF
GKYNTDDAINANIRYSF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
vat YP_851472.1 vacuolating autotransporter Not tested PAI III APEC-O1 Protein 0.0 79
unnamed CAD66214.1 putative hemoglobin protease Not tested PAI III 536 Protein 0.0 79
vat AAO21903.1 vacuolating autotransporter toxin Virulence Not named Protein 0.0 78
she AAB58244.1 mucinase Virulence SHI-1 Protein 0.0 50
pic NP_708747.3 serine protease Not tested SHI-1 Protein 0.0 50
pic AAK00464.1 Pic Virulence SHI-1 Protein 0.0 50
pic NP_838464.1 serine protease precurser Virulence SHI-1 Protein 0.0 49
unnamed CAC39286.1 hypothetical protein Not tested LPA Protein 0.0 44

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
tsh YP_001481228.1 Tsh VFG1689 Protein 0.0 79
tsh YP_001481228.1 Tsh VFG0904 Protein 0.0 79
tsh YP_001481228.1 Tsh VFG0861 Protein 0.0 50
tsh YP_001481228.1 Tsh VFG0903 Protein 0.0 50
tsh YP_001481228.1 Tsh VFG0635 Protein 0.0 50