Microbiome Shotgun Metagenomics Technical Documentation
Technical details
Plasmidsaurus Metagenomics enables you to comprehensively profile diverse communities.
Sequencing technology
We sequence each sample using the newest short-read sequencing technology from Illumina. We use transposase-mediated library preparation chemistry to profile the total genomic DNA present in your sample. For sequencing, we use a 2 x 150 pair-end read configuration.
Bioinformatics analysis
All samples are processed through our automated pipeline:
- Quality control | Raw paired-end reads are filtered using fastp with the following parameters:
- Minimum Phred quality score: 15
- Minimum read length: 50 bp
- Automatic adapter detection for paired-end reads
- Poly-G tail trimming (removes artifacts from NovaSeq two-color chemistry)
- Samples must yield at least 1M reads after QC to proceed to taxonomic analysis
- Host removal | Reads are aligned against human and mouse reference genomes using minimap2 to remove host-derived sequences. Only unmapped (non-host) read pairs are retained for downstream analysis.
- Taxonomic profiling | Non-host reads are profiled using Sylph with abundance-corrected MinHash, against curated reference databases covering bacteria, archaea (GTDB r226), fungi (RefSeq), protists, and eukaryotic DNA viruses. Taxonomy is assigned via sylph-tax.
Service levels
| Sample inputs | |||
|---|---|---|---|
| Service Level | Sequencing Configuration | Raw Data Target | Sample Submission |
| Standard | Illumina 2 x 150 paired-end reads | 2 Gb | 20 µL at 10-20 ng/µL normalized concentration |
| Big | 10 Gb | ||
Data deliverables and file types
- Raw reads (.fastq.gz) — unfiltered paired-end Illumina reads
- QC report (.html) — read quality, filtering, and adapter trimming summary from fastp
- Taxonomic profile (.tsv) — species-level abundance table across bacteria, archaea, fungi, protists, and DNA viruses
Troubleshooting
Likely causes of microbiome sequencing failure
For Metagenomic samples, "failure" means that your sample did not produce the minimum raw data target. The outcomes of the taxonomic analyses will be highly variable depending on the specific microbial community, so these analyses are not used to determine the sequencing status (complete or fail) for this service. Status is strictly based on raw data yield.
Although we do not provide definitive reasons for failure, common reasons are:
- Samples contain sequencing inhibitors or contaminants
- Depending on the biological source material and extraction method, extracted metagenomic DNA may still contain abundant or persistent impurities that require additional purification before library prep in order to obtain high quality sequencing results. Please purify your DNA using a spin column (we highly recommend Zymo OneStep PCR Inhibitor Removal Kit for metagenomic gDNA), with AMpure XP beads, or an equivalent method, and submit the purified gDNA in elution buffer (10 mM Tris, pH 8.5) or nuclease-free water
- Samples are not prepared at the required DNA concentration of 10 ng/µl
- The most common cause of this is using a Nanodrop to quantify DNA concentration. We strongly recommend using a Qubit or equivalent fluorometric method
To achieve optimal sequencing results, please follow our recommended sample prep instructions.
Guarantees and rerun policy
In cases where the DNA concentration and quality are adequate, but we do not hit the service level data target, we will evaluate the results of the initial sequencing attempt to determine whether additional sequencing may produce a more successful outcome, and if so we will repeat the sequencing at no additional charge. We will also combine the data from the two runs together to increase chances of success on the repeat attempt.
If we are not able to achieve the data target after the free rerun, we will not perform further reruns on the sample. We still charge for failed samples.