Gene Information

Name : ECH74115_B0018 (ECH74115_B0018)
Accession : YP_002268404.1
Strain :
Genome accession: NC_011350
Putative virulence/resistance : Virulence
Product : RTX C- domain protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 713 - 3709 bp
Length : 2997 bp
Strand : +
Note : identified by match to protein family HMM PF00353; match to protein family HMM PF02382; match to protein family HMM PF08339
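
The Position and Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the stated length) and that the final codon is a stop codon. The function name `feature_length` is hypothetical and is not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Values taken from the Position field of this record.
start, end = 713, 3709

length_bp = feature_length(start, end)   # should match the Length field
codons = length_bp // 3                  # number of complete codons
protein_aa = codons - 1                  # assumes the last codon is a stop codon

print(length_bp, codons, protein_aa)     # prints: 2997 999 998
```

Under these assumptions the gene spans 2997 bp, that is 999 codons, giving an expected product of 998 amino acids.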

DNA sequence :
ATGACAGTAAATAAAATAAAGAACATTTTCAATAATGCGACATTGACTACAAAATCAGCATTTAATACAGCATCATCAAG
CGTACGTTCCGCTGGAAAAAAACTCATATTATTAATACCTGATAATTATGAAGCTCAGGGCGTGGGTATTAATGAGTTGG
TCAAAGCTGCTGATGAGCTTGGAATAGAAATACACCGTACTGAACGAGATGATACAGCGATTGCAAACCAGTTTTTTGGT
GCAGCAGAAAAAGTTGTAGGATTAACTGAACGTGGTGTTGCAATATTCGCACCACAACTTGACAAACTTCTGCAGAAGTA
TCAGAAAGTTGGGAGTAAAATAGGAGGAACCGCTGAAAATGTAGGTAATAATCTGGGAAAAGCCGGAACAGTTCTCTCAG
CACTACAGAATTTTACGGGGATTGCTTTATCAGGCATGGCTCTTGATGAATTGCTGAGAAAACAACGGGCAGGAGAGGAT
ATAAGTCAGAATGATATTGCCAAAAGTAGTATTGAACTTATTAATCAGCTTGTAGATACAGTATCAAGTATAAACAGTAC
CGTTGATTCATTTTCTGAGCAGCTTAACCAGCTTGGCTCATTTTTATCCAGTAAACCTCGATTAAGTTCTGTTGGTGGGA
AATTACAAAATTTACCAGACCTGGGCCCCCTGGGGGATGGGCTGGATGTTGTCTCCGGAATTCTTTCTGCTGTATCAGCA
AGCTTTATTCTGGGAAACAGTGACGCACATACAGGAACAAAAGCTGCAGCGGGTATCGAACTGACAACTCAGGTTCTTGG
AAATGTTGGTAAAGCTGTTTCGCAATATATTCTGGCTCAGAGAATGGCACAGGGGTTATCGACAACAGCTGCAAGTGCGG
GTCTGATCACATCGGCTGTTATGCTGGCTATCAGTCCTCTTTCTTTCCTGGCTGCTGCAGATAAATTTGAGCGAGCTAAG
CAGCTTGAATCATATTCTGAACGATTTAAAAAATTGAATTATGAAGGGGATGCTTTACTCGCAGCCTTTCATAAAGAAAC
CGGAGCTATAGATGCAGCCCTGACAACAATAAATACTGTCCTGAGTTCTGTATCTGCGGGAGTTAGTGCAGCCTCCAGTG
CATCCCTCATAGGGGCCCCGATAAGCATGCTGGTGAGTGCATTAACCGGTACGATATCTGGCATTCTGGAAGCATCAAAA
CAGGCTATGTTTGAGCACGTTGCAGAGAAATTCGCTGCTCGGATCAATGAATGGGAAAAGGAGCATGGCAAAAATTATTT
TGAGAATGGATATGACGCAAGACATGCTGCGTTTTTAGAAGACTCTCTGTCTTTGCTTGCTGATTTTTCTCGTCAGCATG
CAGTAGAAAGAGCAGTCGCAATAACCCAGCAACATTGGGATGAGAAGATCGGTGAACTTGCAGGCATAACCCGTAATGCT
GATCGCAGTCAGAGTGGTAAGGCATATATTAATTATCTGGAAAATGGAGGGCTTTTAGAGGCTCAACCGAAGGAGTTTAC
ACAACAAGTGTTTGATCCTCAAAAAGGGACCATAGACCTTTCAACAGGTAATGTATCAAGTGTTTTGACATTTATAACAC
CAACATTTACCCCAGGAGAAGAAGTTAGAGAAAGAAAACAGAGTGGTAAATATGAATATATGACATCTCTTATTGTAAAT
GGTAAGGATACATGGTCTGTAAAAGGCATAAAAAATCATAAAGGTGTATATGATTATTCAAAATTGATTCAGTTTGTTGA
AAAGAATAACAAACACTATCAGGCGAGAATAATTTCTGAGCTCGGAGATAAAGACGATGTGGTTTATTCTGGAGCAGGCT
CATCAGAAGTATTTGCTGGTGAAGGTTATGATACCGTATCTTATAATAAGACGGATGTTGGTAAACTAACAATTGATGCA
ACAGGAGCATCAAAACCTGGTGAGTATATAGTTTCAAAAAATATGTATGGTGACGTGAAGGTATTGCAGGAAGTCGTTAA
GGAACAGGAGGTGTCAGTAGGGAAGCGAACAGAGAAAATACAATATCGTGATTTTGAATTCAGAACCGGTGGAATTCCTT
ATGATGTAATAGATAATCTTCATTCTGTTGAAGAGCTCATTGGCGGAAAACATGATGATGAATTCAAAGGCGGTAAGTTT
AATGATATATTCCATGGCGCAGATGGGAACGATTATATCGAAGGTAATTATGGTAATGATCGACTATACGGCGATGATGG
GGATGATTATATATCCGGAGGACAGGGAGACGACCAGTTATTTGGTGGTAGTGGAAACGATAAATTGAGTGGAGGGGATG
GTAATAATTATCTGACAGGAGGAAGCGGTAATGATGAGCTTCAGGCACACGGAGCTTATAATATTCTGTCAGGTGGTACT
GGTGATGATAAACTTTATGGTGGTGGTGGTATTGATCTTCTGGATGGAGGGGAAGGTAATGACTATCTGAATGGTGGTTT
TGGTAATGATATTTATGTTTATGGGCAAAACTATGGTCATCATACAATTGCAGATGAAGGAGGTAAAGGAGATCGTTTGC
ACTTATCTGATATTAGCTTTGATGATATCGCATTTAAGAGAGTTGGAAATGATCTTATCATGAATAAAGCCATTAATGGT
GTACTTTCATTTAATGAGTCAAATGATGTCAATGGGATAACATTTAAAAACTGGTTTGCGAAAGATGCCTCAGGAGCAGA
TAATCATCTTGTTGAGGTTATAACAGATAAAGATGGTCGAGAGATAAAAGTTGATAAGATACCTCATAATAATAATGAAC
GGTCAGGTTATATAAAAGCCAGTAATATAGCATCTGAAAAAAACATGGTTAATATCACCAGTGTTGCCAATGATATTAAT
AAGATTATTTCTTCAGTTTCAGGGTTCGATTCAGGTGATGAACGATTAGCATCTTTATATAATTTATCCTTACATCAAAA
CAACACACACTCAACAACTTTAACGACAACTGTCTGA

Protein sequence :
MTVNKIKNIFNNATLTTKSAFNTASSSVRSAGKKLILLIPDNYEAQGVGINELVKAADELGIEIHRTERDDTAIANQFFG
AAEKVVGLTERGVAIFAPQLDKLLQKYQKVGSKIGGTAENVGNNLGKAGTVLSALQNFTGIALSGMALDELLRKQRAGED
ISQNDIAKSSIELINQLVDTVSSINSTVDSFSEQLNQLGSFLSSKPRLSSVGGKLQNLPDLGPLGDGLDVVSGILSAVSA
SFILGNSDAHTGTKAAAGIELTTQVLGNVGKAVSQYILAQRMAQGLSTTAASAGLITSAVMLAISPLSFLAAADKFERAK
QLESYSERFKKLNYEGDALLAAFHKETGAIDAALTTINTVLSSVSAGVSAASSASLIGAPISMLVSALTGTISGILEASK
QAMFEHVAEKFAARINEWEKEHGKNYFENGYDARHAAFLEDSLSLLADFSRQHAVERAVAITQQHWDEKIGELAGITRNA
DRSQSGKAYINYLENGGLLEAQPKEFTQQVFDPQKGTIDLSTGNVSSVLTFITPTFTPGEEVRERKQSGKYEYMTSLIVN
GKDTWSVKGIKNHKGVYDYSKLIQFVEKNNKHYQARIISELGDKDDVVYSGAGSSEVFAGEGYDTVSYNKTDVGKLTIDA
TGASKPGEYIVSKNMYGDVKVLQEVVKEQEVSVGKRTEKIQYRDFEFRTGGIPYDVIDNLHSVEELIGGKHDDEFKGGKF
NDIFHGADGNDYIEGNYGNDRLYGDDGDDYISGGQGDDQLFGGSGNDKLSGGDGNNYLTGGSGNDELQAHGAYNILSGGT
GDDKLYGGGGIDLLDGGEGNDYLNGGFGNDIYVYGQNYGHHTIADEGGKGDRLHLSDISFDDIAFKRVGNDLIMNKAING
VLSFNESNDVNGITFKNWFAKDASGADNHLVEVITDKDGREIKVDKIPHNNNERSGYIKASNIASEKNMVNITSVANDIN
KIISSVSGFDSGDERLASLYNLSLHQNNTHSTTLTTTV
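
The protein sequence above corresponds to a translation of the DNA sequence on the + strand. The sketch below shows one way to reproduce that check, assuming the standard genetic code (NCBI translation table 1) and a reading frame that starts at the first base. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()      # drop line breaks and whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the DNA sequence in this record.
print(translate("ATGACAGTAAATAAAATAAAGAACATTTTC"))  # prints: MTVNKIKNIF
print(translate("ACAACTGTCTGA"))                    # prints: TTV
```

Both fragments agree with the start (MTVNKIKNIF) and end (TTV) of the listed protein sequence. Passing the full DNA sequence to `translate` would allow the same comparison over the whole product.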

• Homologs from PAI DB

Gene  GenBank Accession  Product      Virulence or Resistance  PAI or REI    Alignment Type  E-value  Identity (%)
hlyA  NP_755445.1        hemolysin A  Virulence                PAI I CFT073  Protein         0.0      62
hlyA  CAD33759.1         hemolysin A  Virulence                PAI I 536     Protein         0.0      62
hlyA  CAD42039.1         HlyA protein Virulence                PAI II 536    Protein         0.0      62

• Homologs from VFDB (virulence genes)

Gene            GenBank Accession  Product                ID of source DB  Alignment Type  E-value  Identity (%)
ECH74115_B0018  YP_002268404.1     RTX C- domain protein  VFG0840          Protein         0.0      99
ECH74115_B0018  YP_002268404.1     RTX C- domain protein  VFG0906          Protein         0.0      62
ECH74115_B0018  YP_002268404.1     RTX C- domain protein  VFG1558          Protein         0.0      62
ECH74115_B0018  YP_002268404.1     RTX C- domain protein  VFG1500          Protein         0.0      62