Gene Information

Name : iroC (c1253)
Accession : NP_753167.1
Strain : Escherichia coli CFT073
Genome accession: NC_004431
Putative virulence/resistance : Virulence
Product : ABC transporter ATP-binding protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG1132
EC number : -
Position : 1202758 - 1206495 bp
Length : 3738 bp
Strand : -
Note : Residues 27 to 1231 of 1245 are 80.16 pct identical to residues 1 to 1209 of 1218 from GenPept.129 : >emb|CAD05883.1| (AL627276) ABC transporter protein [Salmonella enterica subsp. enterica serovar Typhi]
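
The Position, Length and Note fields can be cross-checked against one another. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(nt_length: int) -> int:
    """Residues encoded by a CDS of nt_length bp, excluding one stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


length_bp = feature_length(1202758, 1206495)
print(length_bp)                           # 3738, matching the Length field
print(expected_protein_length(length_bp))  # 1245, matching "of 1245" in the Note
```
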

DNA sequence :
ATGCGGGACCGGAGTGTGCTGGCTTTTTTGCGGAACGATCCGGCCAGTGGATCGTATTTTACAGACGGAGGAGGCTTAAT
GCCTGCGAATCACACTCCCACACCGGCTCAGTCATGGATAGTTCGCCTGGCGCGCGTGTGCTGGGAACGTAAGAAACTTA
GTGTCATCGTGGTGGTAGCGTCAGTATCGACTATTTTGCTGGCTGCGCTGACGCCACTGCTGACAAGACAGGCTGTGAAT
GACGCACTGGCGGGCAATCCGGCCCGCCTGCCGTGGCTGGCCTGCGGGTTACTGTTGATCGCTTTTTTTGATTTCATCGG
TAACTATGTGCGCCGTGGTTATGCCGGGATGCTCTCACTCTGGGTGCAGCATACCCTCAGAGGACGGGTATTCGACAGTA
TTCAAAAACTTGACGGCGCAGGCCAGGATGCGCTGCGCACCGGGCAGGTGATTTCACGGACCAACAGCGATCTGCAGCAG
GTGCATACCCTGCTGCAGATGTGCCCGGTGCCGCTGGCAGTGTTCACTTATTACATTGCCGGCATTGCCGTGATGCTGTG
GATGTCCCCTGCCATGACGCTTATCGTCGTGTGCGTACTGGTATGCTTGGCGATCACCGCGCTTCGTGCGCGTCGTAGGG
TCTTCGCGCAAACCGGAATGGCCTCGGACCAACTGGCGAATCTCACCGAACATATACGCGAGGTGCTGGCACAGATCTCA
GTGGTAAAATCCTGTGTGGCAGAGATGCGTGAAACGCACTGGCTCGATAGGCAGTCGCGGCAGATTGTGCGTGTACGCAT
CGGTGCGGTTATCTCGCAGGCGATGCCAGGGGCCACTATGCTGGCGCTACCGGTGCTCGGGCAAATCGTCCTGCTGTGCT
ACGGCGGGTGGTCGGTCATGCACGGGCGGATCGATCTCGGTACCTTCGTTGCATTCGCCAGCTTCCTCGCGATGCTGACC
GGGCCAACCCGCGTACTGGCATCGTTTCTGGTTATCGCACAGCGCACTCAGGCGTCCGTGGAGCGGGTGTTTGCACTGAT
CGACACCCGTTCACAGATGGAGGACGGGACGGAGTCGATTAACAGTCAGGTTGTCGGACTGGAACTGGAGAATATGAGCT
TTGACTACCACCATGGCGACAGACATATCCTCAGCGATATCTCCTTTTCCCTGCGCGCTGGGGAAACCGTGGCGGTGGTG
GGCGCATCGGGTTCAGGAAAATCGACCCTGTTGATGCTGCTGGCGCGTTTTTATGATCCCTGCTCCGGAAAGATATGGCT
CAACACCAGCGAAGGCCGACAAAATCTTCGCGATATCAGACTGGAGGCGCTTCGTCGCCGGGTAGGCATCGTATTTGAAG
ATGCTTTTCTGTTTGCCGGTACGGTGGCGGAAAATATCGCCTATGGCCACCCTCAGGCAACGGCGGACGACATTCGCCGT
GCGGCAGCTGCTGCAGGAGCCAGCGATTTTATCAACGCCCTGCCGAAAGGCTTCGATAGCCTGTTAACCGAACGGGGTAC
GAATCTTTCCGGCGGGCAGAGGCAGCGAATAGCGCTGGCGCGGGCGCTCATTACTGCACCGGACGTGTTAATCCTGGATG
ATACCACCTCAGCGGTTGATGCTGTTACGGAAGCGGAGATTAATACCGCGCTGGGTCGCTATGCTGACGAAGGGCATATG
CTGCTGGTGATTGCCCGACGGCGTTCAACGCTTCAGCTAGCCAGCCGGGTTGTGGTGCTGGATAAGGGCCGTATGGTGGA
TACCGGAACCCCGGCAGAACTTGAAGCGCGCTGTCCGGCGTTCCGCGCACTGATGACCGGCGACAGCGATTTTCTGGCCA
CGTCCCACAATAGCCACAACGAATTGTGGCCGGCTGAACCAGCGACACAAGACGATGTAACGGATACGGGGGATAAAGGT
TTTGTCGCCCGTATGACCCGCGTACCGGAAAATGCAGTACAGCAGGCGCTGGCCGGTAAAGGTCGCAAAGTCACGTCACT
ACTGAAGCCTGTGGCATGGATGTTCGTCATCGCCGCTCTGCTGATCGCACTCGATTCTGCAGCAGGCGTAGGGGTGCTGA
TACTGTTGCAGCACGGCATTGACTCCGGTGTCGCCGCAGGCGATATGTCGACCATCGGCCTCTGTGCCCTGCTCGCCCTG
TGCCTAGTCATTGTGGGCTGGTGCAGTTATTCTCTGCAGACGGTCTTCGCCGCCAGAGCGGCGGAATCAGTTCAGCATTC
GGTGCGCTTGCGTAGCTTCGGCCATATGCTGCGTCTTGGACTCCCCTGGCACGAAAAGCATGCCGATTCGCGTCTGACCC
GCATGACCGTTGATGTGGACTCTCTCGCCCGCTTTCTGCAAAACGGCCTTGCCGGTGCGGCCACCAGTCTGGTGACGATG
TTCGCAATCGCCGCCACCATGTTCTGGCTCGACCCGATCCTGGCGCTGACGGCATTAAGCGCAGTGCCAGTGGCCGCACT
GGCAACCATGATTTATCGCCGCCTCAGTACCCCTGCTTATGCACAAGCACGGCTGGAAATAGGCAAAGTCAACAGCACCC
TGCAGGAAAAAGTCTCCGGCCTGCGTGTCGTGCAATCGCATGGTCAGCAGGAACTGGAGGGCGCCCGGCTGCGCGCGTTA
TCGGAGCGTTTCCGCGCAACCCGTGTGCGAGCACAAAAATACCTTGCAGTCTATTTTCCGTTCCTGACATTCTGCACCGA
GGCCTCCTATGCCGCCGTTCTGTTAGTGGGAGCTTCGCAGGTCGCCGCTGGAGAAATGACTGCCGGGGTACTGGCGGCTT
TCTTCCTGTTGCTGGGGCAATTCTATGGGCCAGTGCAGCAGTTATCAGGGATTGTCGACGCCTGGCAGCAGGCGACAGCC
AGCGGCAAACATATTGATGAACTACTGGCGACAGAAGGCACTGAGAACCTCGGGTCCTCTTCGGTCCTCCCTGTCACCGG
TGCACTGCATCTTGATGAGGTCACGTTCAGTTATCCCGACAGTCACGAGCCAGCTCTGAACAAACTTACCCTGACGATCC
CTGAGGGAATGGTTGTCGCGGTCGTCGGTCGCAGCGGTGCGGGTAAGTCGACGCTGATTAAGCTGATTGCCGGGTTGTAT
TTCCCCACGCACGGCAACATCAGAATCGGTGTGCAAATGCTCGATGATGCCTCGCTCACTGAGTATCGTCGCCAGATTGG
GCTTGTCGATCAGGATGTAGCACTGTTTAGTAGTGATATTGCAGAAAACATTCGTTATTCACGGCCATCCGCCACCAATG
AAGACGTTGAAATTGCCTCACAGCGGGCAGGGCTGTATGAGATGGTGTGCAATCTGCCGCAGGGATTCCGGACACCGGTG
AATAACGGCGGAGCCGATCTGTCCGCAGGTCAGCGCCAGTTGATTGCGCTGGCCCGCGCGCAACTGGCGAATGCTCACAT
CCTGCTGCTCGACGAAGCCACGTCATGTCTGGATCGCACATCCGAAGAACGACTGATGTCATCGTTAACAGATGTCGTGC
ATGCCGGGAAGCACTCGGCGCTGATTGTTGCACATCGTCTGACCACCGCGCAACGCTGCGATCTGATTGCCGTTATTGAT
AAGGGGTTACTTGCGGAATACGGAACCCACGAACAGCTGTTATCTGCAGGCGGCCTCTATACCCGCTTATGGCATGACAG
CGTCAGCAGTACTGCGCTCCATCGCCAGCACAACATGAAGGAGGAAACCCCGGGATAG

Protein sequence :
MRDRSVLAFLRNDPASGSYFTDGGGLMPANHTPTPAQSWIVRLARVCWERKKLSVIVVVASVSTILLAALTPLLTRQAVN
DALAGNPARLPWLACGLLLIAFFDFIGNYVRRGYAGMLSLWVQHTLRGRVFDSIQKLDGAGQDALRTGQVISRTNSDLQQ
VHTLLQMCPVPLAVFTYYIAGIAVMLWMSPAMTLIVVCVLVCLAITALRARRRVFAQTGMASDQLANLTEHIREVLAQIS
VVKSCVAEMRETHWLDRQSRQIVRVRIGAVISQAMPGATMLALPVLGQIVLLCYGGWSVMHGRIDLGTFVAFASFLAMLT
GPTRVLASFLVIAQRTQASVERVFALIDTRSQMEDGTESINSQVVGLELENMSFDYHHGDRHILSDISFSLRAGETVAVV
GASGSGKSTLLMLLARFYDPCSGKIWLNTSEGRQNLRDIRLEALRRRVGIVFEDAFLFAGTVAENIAYGHPQATADDIRR
AAAAAGASDFINALPKGFDSLLTERGTNLSGGQRQRIALARALITAPDVLILDDTTSAVDAVTEAEINTALGRYADEGHM
LLVIARRRSTLQLASRVVVLDKGRMVDTGTPAELEARCPAFRALMTGDSDFLATSHNSHNELWPAEPATQDDVTDTGDKG
FVARMTRVPENAVQQALAGKGRKVTSLLKPVAWMFVIAALLIALDSAAGVGVLILLQHGIDSGVAAGDMSTIGLCALLAL
CLVIVGWCSYSLQTVFAARAAESVQHSVRLRSFGHMLRLGLPWHEKHADSRLTRMTVDVDSLARFLQNGLAGAATSLVTM
FAIAATMFWLDPILALTALSAVPVAALATMIYRRLSTPAYAQARLEIGKVNSTLQEKVSGLRVVQSHGQQELEGARLRAL
SERFRATRVRAQKYLAVYFPFLTFCTEASYAAVLLVGASQVAAGEMTAGVLAAFFLLLGQFYGPVQQLSGIVDAWQQATA
SGKHIDELLATEGTENLGSSSVLPVTGALHLDEVTFSYPDSHEPALNKLTLTIPEGMVVAVVGRSGAGKSTLIKLIAGLY
FPTHGNIRIGVQMLDDASLTEYRRQIGLVDQDVALFSSDIAENIRYSRPSATNEDVEIASQRAGLYEMVCNLPQGFRTPV
NNGGADLSAGQRQLIALARAQLANAHILLLDEATSCLDRTSEERLMSSLTDVVHAGKHSALIVAHRLTTAQRCDLIAVID
KGLLAEYGTHEQLLSAGGLYTRLWHDSVSSTALHRQHNMKEETPG
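
Although the gene lies on the minus strand, the DNA sequence above starts with ATG and ends with TAG, so it appears to be listed in coding orientation. The protein sequence can therefore be reproduced by translating it directly. The sketch below is a minimal illustration, assuming the standard genetic code (the usual choice for Escherichia coli); the `translate` helper is hypothetical, and only the first 21 nucleotides of the record are used as input.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 nucleotides of the DNA sequence in this record.
print(translate("ATGCGGGACCGGAGTGTGCTG"))  # MRDRSVL
```

Passing the full 3738 bp sequence instead of the prefix should yield the 1245-residue protein listed above, provided the sequence is copied without the line breaks altering any bases.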

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
iroC | CAC43427.2 | ABC transport protein | Virulence | PAI III 536 | Protein | 0.0 | 99

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
iroC | NP_753167.1 | ABC transporter ATP-binding protein | VFG1653 | Protein | 0.0 | 99