Steps for submitting a new dataset and/or task

A machine learning repository for microbiome datasets

Steps for submitting a new dataset and/or task

  1. If you have either the raw FASTQ or processed FASTA file, please deposit it into a public repository. We list large files via publicly accessible URLs and do not support uploading of any large files. If you need assistance, please contact us.

    If starting with FASTQ, we recommend processing with SHI7 and OTU-picking with BURST, with NCBI RefSeq Prokaryote files and Green genes 97

  2. Fork our repository.
  3. Add new tasks and datasets directly into tasks and datasets. Make sure to fill out all sections.

    We expect you to apply rigorous standards in filtering, subsetting, and selecting samples for your classification and regression tasks.

  4. When ready, submit a pull request for our review.