Large public datasets of the human microbiome now exist, but combining them for large-scale analysis is difficult due to a lack of standardization. We developed curatedMetagenomicData (cMD) 3, a ...