Gene Information

Name : UM146_21845 (UM146_21845)
Accession : YP_006113209.1
Strain : Escherichia coli UM146
Genome accession: NC_017632
Putative virulence/resistance : Virulence
Product : hemolysin A
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4566289 - 4569363 bp
Length : 3075 bp
Strand : -
Note : COG2931 RTX toxins and related Ca2+-binding proteins

DNA sequence :
ATGCCAACAATAACCACTGCACAAATTAAAAGCACACTACAGTCTGCAAAGCAATCCGCTGCAAATAAATTGCACTCAGC
AGGACAAAGCACGAAAGATGCATTAAAAAAAGCAGCAGAGCAAACCCGCAATGCGGGAAACAGACTCATTTTACTTATCC
CTAAAGATTATAAAGGACAGGGTTCAAGCCTTAATGACCTTGTCAGGACGGCAGATGAACTGGGAATTGAAGTCCAGTAT
GATGAAAAGAATGGCACGGCGATTACTAAACAGGTATTCGGCACAGCAGAGAAACTCATTGGCCTCACCGAACGGGGAGT
GACTATCTTTGCACCACAATTAGACAAATTACTGCAAAAGTATCAAAAAGCGGGTAATAAATTAGGCGGCAGTGCTGAAA
ATATAGGTGATAACTTAGGAAAGGCAGGCAGTGTACTGTCAACGTTTCAAAATTTTCTGGGTACTGCACTTTCCTCAATG
AAAATAGACGAACTGATAAAGAAACAAAAATCTGGTAGCAATGTCAGTTCTTCTGAACTGGCAAAAGCGAGTATTGAGCT
AATCAACCAACTCGTGGACACAGCTGCCAGCATTAATAATAATGTTAACTCATTTTCTCAACAACTCAATAAGCTGGGAA
GTGTATTATCCAATACAAAGCACCTGAACGGTGTTGGTAATAAGTTACAGAATTTACCTAACCTTGATAATATCGGTGCA
GGGTTAGATACTGTATCGGGTATTTTATCTGCGATTTCAGCAAGCTTCATTCTGAGCAATGCAGATGCAGATACCGGAAC
TAAAGCTGCAGCAGGTGTTGAATTAACAACGAAAGTACTGGGTAATGTTGGAAAAGGTATTTCTCAATATATTATCGCAC
AGCGCGCTGCACAGGGGTTATCTACATCTGCTGCTGCTGCCGGTTTAATTGCTTCTGTAGTGACATTAGCAATTAGTCCC
CTCTCATTCCTGTCCATTGCCGATAAGTTTAAACGTGCAAATAAAATAGAGGAGTATTCACAACGATTCAAAAAACTTGG
ATACGATGGTGACAGTTTACTTGCTGCTTTCCACAAAGAAACAGGAGCTATTGATGCATCATTAACAACGATAAGCACTG
TACTGGCTTCAGTATCTTCAGGTATTAGTGCTGCTGCAACGACATCTCTTGTTGGTGCACCGGTAAGCGCACTGGTAGGT
GCTGTTACGGGGATAATTTCAGGTATCCTTGAGGCTTCAAAGCAGGCAATGTTTGAACATGTTGCCAGTAAAATGGCTGA
TGTTATTGCTGAATGGGAGAAAAAACACGGTAAAAATTACTTTGAAAATGGATATGATGCCCGCCATGCTGCATTTTTAG
AAGATAACTTTAAAATATTATCTCAGTATAATAAAGAGTATTCTGTTGAAAGATCAGTCCTCATTACTCAACAACATTGG
GATATGCTGATAGGTGAGTTAGCTAGTGTCACCAGAAATGGAGACAAGACACTCAGTGGTAAAAGTTATATTGACTATTA
TGAAGAGGGAAAGCGGCTGGAAAGAAGGCCAAAAGAGTTCCAGCAACAAATCTTTGATCCATTAAAAGGAAATATTGACC
TTTCTGACAGCAAATCTTCTACGTTATTGAAATTTGTTACGCCATTGTTAACTCCCGGTGAGGAAATTCGTGAAAGGAGG
CAGTCCGGAAAATATGAATATATTACCGAGTTATTAGTCAAGGGTGTTGATAAATGGACGGTGAAGGGGGTTCAGGACAA
GGGGTCTGTATATGATTACTCTAACCTGATTCAGCATGCATCAGTCGGTAATAACCAGTATCGGGAAATTCGTATTGAGT
CACACCTGGGAGACGGGGATGATAAGGTCTTTTTATCTGCCGGCTCAGCCAATATCTACGCAGGTAAAGGACATGATGTT
GTTTATTATGATAAAACAGACACCGGTTATCTGACCATTGATGGCACAAAAGCAACCGAAGCGGGTAATTACACGGTAAC
ACGTGTACTTGGTGGTGATGTTAAGGTTTTACAGGAAGTTGTGAAGGAGCAGGAGGTTTCAGTCGGAAAAAGAACTGAAA
AAACGCAATATCGGAGTTATGAATTCACTCATATCAATGGTAAAAATTTAACAGAGACAGATAACTTATATTCCGTGGAA
GAACTTATTGGGACCACGCGTGCCGACAAGTTTTTTGGCAGTAAATTTACTGATATCTTCCATGGCGCGGATGGTGATGA
CCATATAGAAGGAAATGATGGGAATGACCGCTTATATGGTGATAAAGGTAATGATACGCTGAGGGGCGGAAACGGGGATG
ACCAGCTCTATGGCGGTGATGGCAATGATAAGTTAATTGGGGGGACAGGTAATAATTACCTTAACGGCGGTGACGGAGAT
GATGAGCTTCAGGTTCAGGGGAATTCTCTTGCTAAAAATGTATTATCCGGTGGAAAAGGTAATGACAAGTTGTACGGCAG
TGAGGGAGCAGACCTGCTTGATGGCGGAGAAGGGAATGATCTTCTGAAAGGTGGATATGGTAATGATATTTATCGTTATC
TTTCAGGATATGGCCATCATATTATTGACGATGAAGGGGGGAAAGACGATAAACTCAGTTTAGCTGATATAGATTTCCGG
GACGTTGCCTTTAAGCGAGAAGGGAATGACCTCATTATGTATAAAGCTGAAGGTAATGTTCTTTCTATTGGCCACAAAAA
TGGTATTACATTTAAAAACTGGTTTGAAAAAGAGTCAGATGATCTCTCTAATCATCAGATAGAGCAGATTTTTGATAAAG
ACGGCAGGGTAATCACACCAGATTCTCTTAAAAAAGCATTTGAATATCAGCAGAGTAATAACAAGGTAAGTTATGTGTAT
GGACATGATGCATCAACTTATGGGAGCCAGGACAATCTTAATCCATTAATTAATGAAATCAGCAAAATCATTTCAGCTGC
AGGTAACTTCGATGTTAAGGAGGAAAGATCTGCCGCTTCTTTATTGCAGTTGTCCGGTAATGCCAGTGATTTTTCATATG
GACGGAACTCAATAACTTTGACAGCATCAGCATAA

Protein sequence :
MPTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQGSSLNDLVRTADELGIEVQY
DEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQKYQKAGNKLGGSAENIGDNLGKAGSVLSTFQNFLGTALSSM
KIDELIKKQKSGSNVSSSELAKASIELINQLVDTAASINNNVNSFSQQLNKLGSVLSNTKHLNGVGNKLQNLPNLDNIGA
GLDTVSGILSAISASFILSNADADTGTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASVVTLAISP
LSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVG
AVTGIISGILEASKQAMFEHVASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW
DMLIGELASVTRNGDKTLSGKSYIDYYEEGKRLERRPKEFQQQIFDPLKGNIDLSDSKSSTLLKFVTPLLTPGEEIRERR
QSGKYEYITELLVKGVDKWTVKGVQDKGSVYDYSNLIQHASVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDV
VYYDKTDTGYLTIDGTKATEAGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVE
ELIGTTRADKFFGSKFTDIFHGADGDDHIEGNDGNDRLYGDKGNDTLRGGNGDDQLYGGDGNDKLIGGTGNNYLNGGDGD
DELQVQGNSLAKNVLSGGKGNDKLYGSEGADLLDGGEGNDLLKGGYGNDIYRYLSGYGHHIIDDEGGKDDKLSLADIDFR
DVAFKREGNDLIMYKAEGNVLSIGHKNGITFKNWFEKESDDLSNHQIEQIFDKDGRVITPDSLKKAFEYQQSNNKVSYVY
GHDASTYGSQDNLNPLINEISKIISAAGNFDVKEERSAASLLQLSGNASDFSYGRNSITLTASA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hlyA CAD33759.1 hemolysin A Virulence PAI I 536 Protein 0.0 98
hlyA NP_755445.1 hemolysin A Virulence PAI I CFT073 Protein 0.0 98
hlyA CAD42039.1 HlyA protein Virulence PAI II 536 Protein 0.0 97

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
UM146_21845 YP_006113209.1 hemolysin A VFG0906 Protein 0.0 98
UM146_21845 YP_006113209.1 hemolysin A VFG1500 Protein 0.0 98
UM146_21845 YP_006113209.1 hemolysin A VFG1558 Protein 0.0 97
UM146_21845 YP_006113209.1 hemolysin A VFG0840 Protein 0.0 63