A robust SNP barcode for typing Mycobacterium tuberculosis complex strains

Francesc Coll, Ruth McNerney, José Afonso Guerra-Assunção, Judith R. Glynn, João Perdigão, Miguel Viveiros, Isabel Portugal, Arnab Pain, Nigel Martin, Taane G. Clark

Research output: Contribution to journal › Article › peer-review

416 Scopus citations


Strain-specific genomic diversity in the Mycobacterium tuberculosis complex (MTBC) is an important factor in pathogenesis that may affect virulence, transmissibility, host response and emergence of drug resistance. Several systems have been proposed to classify MTBC strains into distinct lineages and families. Here, we investigate single-nucleotide polymorphisms (SNPs) as robust (stable) markers of genetic variation for phylogenetic analysis. We identify ∼92k SNPs across a global collection of 1,601 genomes. The SNP-based phylogeny is consistent with the gold-standard regions of difference (RD) classification system. Of the ∼7k strain-specific SNPs identified, 62 markers are proposed to discriminate known circulating strains. This SNP-based barcode is the first to cover all main lineages, and classifies a greater number of sublineages than current alternatives. It may be used to classify clinical isolates to evaluate tools to control the disease, including therapeutics and vaccines whose effectiveness may vary by strain type. © 2014 Macmillan Publishers Limited.
Original language: English (US)
Journal: Nature Communications
Issue number: 1
State: Published - Sep 1 2014

ASJC Scopus subject areas

  • General Biochemistry, Genetics and Molecular Biology
  • General Chemistry
  • General Physics and Astronomy

