Gene Information

Name : iroC (UTI89_C1121)
Accession : YP_540136.1
Strain : Escherichia coli UTI89
Genome accession : NC_007946
Putative virulence/resistance : Virulence
Product : IroC protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG1132
EC number : -
Position : 1112788 - 1116573 bp
Length : 3786 bp
Strand : -
Note : -
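
The Position and Length fields above are consistent with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. The sketch below checks the arithmetic; the function name feature_length is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 1112788, 1116573

length_bp = feature_length(start, end)   # should match the Length field
codons = length_bp // 3                  # count includes the terminal stop codon
residues = codons - 1                    # expected protein length in amino acids

print(length_bp, codons, residues)
```

Under this assumption the computed length matches the Length field (3786 bp), and the codon count less the stop codon matches the 1261 residues in the protein sequence listed in this record. The "-" in the Strand field indicates that the gene is encoded on the reverse strand of the genome at these coordinates.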

DNA sequence :
ATGATAATCATATGCCAGCACCCATCCAGGAGTCATGCATCCATAACAATGCGGGACCGGAGCGTGCTGGCTTTTTTGCG
GAACGATCCGGCCAGTGGATCGTATTTTACAGACGGAGGAGGCTTAATGCCTGCGAATCACACTCCCACACCGGCTCAGT
CATGGATAGTTCGCCTGGCGCGCGTGTGCTGGGAACGTAAGAAACTTAGTGTCATCGTGGTGGTAGCGTCAGTATCGACT
ATTTTGCTGGCTGCGCTGACGCCACTGCTGACAAGACAGGCTGTGAATGACGCACTGGCGGGCAATCCGGCCCGCCTGCC
GTGGCTGGCCTGCGGGTTACTGTTGATCGCTTTTTTTGATTTCATCGGTAACTATGTGCGCCGTGGTTATGCCGGGATGC
TCTCACTCTGGGTGCAGCATACCCTCAGAGGACGGGTATTCGACAGTATTCAGAAACTTGACGGCGCAGGCCAGGATGCG
CTGCGCACCGGGCAGGTGATTTCACGGACCAACAGCGATCTGCAGCAGGTGCATACCCTGCTGCAGATGTGCCCGGTGCC
GCTGGCAGTGTTCACTTATTACATTGCCGGCATTGCCGTGATGCTGTGGATGTCCCCTGCCATGACGCTTATCGTCGTGT
GCGTACTGGTATGCCTGGCGATCACCGCGCTTCGTGCGCGTCGTAGGGTCTTCGCGCAAACCGGGCTGGCCTCGGACCAA
CTGGCGAATCTCACCGAACATATACGCGAGGTGCTGGCACAGATCTCAGTGGTAAAATCCTGTGTGGCAGAGATGCGTGA
AACGCACTGGCTCGATAGGCAGTCGCGGCAGATTGTGCGTGTACGCATCGGTGCGGTTATCTCGCAGGCGATGCCTGGGG
CCACCATGCTGGCGCTACCGGTGCTCGGGCAAATCGTCCTGCTGTGCTACGGCGGGTGGTCGGTCATGCACGGGCGGATC
GATCTCGGTACCTTCGTTGCATTCGCCAGCTTCCTCGCGATGCTGACCGGGCCAACCCGCGTACTGGCATCGTTTCTGGT
TATCGCACAGCGCACTCAGGCGTCCGTGGAGCGGGTGTTTGCACTGATCGACACCCGTTCACAGATGGAGGACGGGACGG
AGTCGATTAACAGTCAGGTTGTCGGACTGGAACTGGAGAATATGAGCTTTGACTACCACCATGGCGACAGACATATCCTC
AGCAATATCTCCTTTTCCCTGCGCGCCGGTGAAACCGTGGCGGTGGTGGGCGCATCGGGTTCAGGAAAATCGACCCTGTT
GATGCTACTGGCGCGTTTTTATGATCCCTGCTCCGGAAAGATATGGCTCAACACCAGCGAAGGCCGACAAAATCTTCGCG
ATATCAGACTGGAGGCGCTTCGTCGCCGGGTAGGCATCGTATTTGAAGATGCTTTTCTGTTTGCCGGTACGGTGGCGGAA
AATATCGCCTATGGCCACCCTCAGGCAACGGCGGACGACATTCGCCGTGCGGCAGCTGCTGCAGGAGCCAGCGATTTTAT
TAACGCCCTGCCGAAAGGCTTCGATAGCCTGTTAACCGAACGGGGTACGAATCTTTCCGGCGGGCAGAGGCAGCGAATAG
CGCTGGCGCGGGCGCTCATTACTGCACCGGACGTGTTAATCCTGGATGATACTACCTCAGCGGTTGATGCTGTTACGGAA
GCGGAGATTAATACCGCGCTGGGTCGCTATGCTGACGAAGGGCATATGCTGCTGGTGATTGCCCGACGGCGTTCAACACT
TCAGCTAGCCAGCCGGGTTGTGGTGCTGGATAAGGGCCGTATGGTGGATACCGGAACCCCGGCAGAACTTGAAGCGCGCT
GTCCGGCGTTCCGCGCACTGATGACCGGCGACAGCGATTTTCTGGCCACGTCCCACAATAGCCACAACGAATTGTGGCCG
GCTGAACCAGCGACACAAGACGATGTAACGGATACGGGGGATAAAGGTTTTGTCGCCCGTATGACCCGCGTACCGGAAAA
TGCAGTACAGCAGGCGCTGGCCGGTAAAGGTCGCAAAGTCACGTCACTACTGAAGCCTGTGGCGTGGATGTTCGTCATCG
CCGCTCTGCTGATCGCACTCGATTCTGCGGCAGGCGTAGGGGTACTGATACTGTTGCAGCACGGCATTGACTCCGGTGTC
GCCGCAGGCGATATGTCGATCATCGGCCTCTGTGCCCTGCTCGCCCTGTGCCTGGTCATTGTGGGCTGGTGCAGTTATTC
TCTGCAGACGGTCTTCGCCGCCAGAGCGGCGGAATCAGTTCAGCATTCGGTGCGCTTGCGCAGCTTCGGCCATATGCTGC
GTCTTGGACTCCCCTGGCATGAAAAGCATGCCGATTCGCGTCTTACCCGCATGACCGTTGATGTGGACTCTCTCGCCCGC
TTTCTGCAAAACGGCCTTGCCGGTGCGGCCACCAGTCTGGTGACGATGTTCGCAATCGCCGCCACCATGTTCTGGCTCGA
CCCGTTCCTGGCGCTGACGGCATTAAGCGCAGTGCCAGTGGCCGCACTGGCAACCATGATTTATCGCCGCCTCAGTACCC
CTGCTTATGCACAGGCACGGCTGGAAATAGGCAAAGTCAACAGCACCCTGCAGGAAAAAGTCTCTGGCCTGCGTGTCGTG
CAATCGCATGGTCAGCAGGAACTGGAGGGCGCCCGGCTGCGCGCGTTATCGGAGCGTTTCCGCGCAACCCGTGTGCGAGC
ACAAAAATACCTTGCAGTCTATTTTCCGTTCCTGACATTCTGCACCGAGGCCTCCTATGCCGCTGTTCTGTTAGTGGGAG
CTTCGCAGGTCGCCGCTGGAGAAATGACTGCCGGGGTACTGGCGGCTTTCTTCCTGTTGCTGGGGCAATTCTATGGGCCA
GTGCAGCAGTTATCAGGGATTGTCGACGCCTGGCAGCAGGCGACAGCCAGCGGCAAACATATTGATGAACTACTGGCGAC
AGAAGGCACTGAGAACCTCGGGTCCTCTTCGGTCCTCCCTGTCACCGGTGCACTGCATCTTGATGAGGTCACGTTCAGTT
ATCCCGACAGTCACGAGCCAGCTCTGAACAAACTTACCCTGACGATCCCTGAGGGAATGGTTGTCGCGGTCGTCGGTCGC
AGCGGTGCGGGTAAGTCGACGCTGATTAAGCTGATTGCCGGGTTGTATTTCCCCACGCACGGCAACATCAGAATCGGTGT
GCAAATGCTCGATGATGCCTCGCTCACTGAGTATCGTCGCCAGATTGGGCTTGTCGATCAGGATGTAGCACTGTTTAGTA
GTGATATTGCAGAAAACATTCGTTATTCACGGCCATCCGCCACCAATGAAGACGTTGAAATTGCCTCACAGCGGGCAGGG
CTGTATGAGATGGTGTGCAATCTGCCGCAGGGATTCCGGACACCGGTGAATAACGGCGGAGCCGATCTGCCCGCAGGTCA
GCGCCAGTTGATTGCGCTGGCCCGCGCGCAACTGGCGAATGCCCACATCCTGCTGCTCGACGAAGCCACGTCATGTCTGG
ATCGCACATCCGAAGAACGACTGATGTCATCGTTAACAGATGTCGTGCATGCCGGGAAGCACTCGGCGCTGATTGTTGCA
CATCGTCTGACCACCGCGCAACGCTGCGATCTGATTGCCGTTATTGATAAGGGGTTACTTGCGGAATACGGAACCCACGA
ACAGCTGTTATCTGCGGGCGGCCTCTATACCCGCTTATGGCATGACAGCGTCAGCAGTACTGCTCTCCATCGCCAGCACA
ACATGAAGGAGGAAACCCCGGGATAG

Protein sequence :
MIIICQHPSRSHASITMRDRSVLAFLRNDPASGSYFTDGGGLMPANHTPTPAQSWIVRLARVCWERKKLSVIVVVASVST
ILLAALTPLLTRQAVNDALAGNPARLPWLACGLLLIAFFDFIGNYVRRGYAGMLSLWVQHTLRGRVFDSIQKLDGAGQDA
LRTGQVISRTNSDLQQVHTLLQMCPVPLAVFTYYIAGIAVMLWMSPAMTLIVVCVLVCLAITALRARRRVFAQTGLASDQ
LANLTEHIREVLAQISVVKSCVAEMRETHWLDRQSRQIVRVRIGAVISQAMPGATMLALPVLGQIVLLCYGGWSVMHGRI
DLGTFVAFASFLAMLTGPTRVLASFLVIAQRTQASVERVFALIDTRSQMEDGTESINSQVVGLELENMSFDYHHGDRHIL
SNISFSLRAGETVAVVGASGSGKSTLLMLLARFYDPCSGKIWLNTSEGRQNLRDIRLEALRRRVGIVFEDAFLFAGTVAE
NIAYGHPQATADDIRRAAAAAGASDFINALPKGFDSLLTERGTNLSGGQRQRIALARALITAPDVLILDDTTSAVDAVTE
AEINTALGRYADEGHMLLVIARRRSTLQLASRVVVLDKGRMVDTGTPAELEARCPAFRALMTGDSDFLATSHNSHNELWP
AEPATQDDVTDTGDKGFVARMTRVPENAVQQALAGKGRKVTSLLKPVAWMFVIAALLIALDSAAGVGVLILLQHGIDSGV
AAGDMSIIGLCALLALCLVIVGWCSYSLQTVFAARAAESVQHSVRLRSFGHMLRLGLPWHEKHADSRLTRMTVDVDSLAR
FLQNGLAGAATSLVTMFAIAATMFWLDPFLALTALSAVPVAALATMIYRRLSTPAYAQARLEIGKVNSTLQEKVSGLRVV
QSHGQQELEGARLRALSERFRATRVRAQKYLAVYFPFLTFCTEASYAAVLLVGASQVAAGEMTAGVLAAFFLLLGQFYGP
VQQLSGIVDAWQQATASGKHIDELLATEGTENLGSSSVLPVTGALHLDEVTFSYPDSHEPALNKLTLTIPEGMVVAVVGR
SGAGKSTLIKLIAGLYFPTHGNIRIGVQMLDDASLTEYRRQIGLVDQDVALFSSDIAENIRYSRPSATNEDVEIASQRAG
LYEMVCNLPQGFRTPVNNGGADLPAGQRQLIALARAQLANAHILLLDEATSCLDRTSEERLMSSLTDVVHAGKHSALIVA
HRLTTAQRCDLIAVIDKGLLAEYGTHEQLLSAGGLYTRLWHDSVSSTALHRQHNMKEETPG
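
The protein sequence above corresponds to the DNA sequence read codon by codon from its first base. The sketch below illustrates this with a minimal translator built on the standard genetic code; it is not the tool used to produce this record, and the names CODON_TABLE and translate are hypothetical. Only the first and last few codons of the DNA sequence are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the DNA sequence in this record.
print(translate("ATGATAATCATATGCCAGCAC"))     # MIIICQH
# Last eight codons of the DNA sequence, ending in the TAG stop codon.
print(translate("ATGAAGGAGGAAACCCCGGGATAG"))  # MKEETPG
```

Both outputs agree with the start (MIIICQH...) and end (...MKEETPG) of the protein sequence listed above.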

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
iroC | CAC43427.2 | ABC transport protein | Virulence | PAI III 536 | Protein | 0.0 | 99

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
iroC | YP_540136.1 | IroC protein | VFG1653 | Protein | 0.0 | 99