Ultrafast genome-wide gene expression analysis. Try RNA-Seq now. Learn More

Genotyping Analysis (Beta)

Characterize allelic composition and frequency at a single locus.

Genotyping PDP Header

Genotype with ease and speed

Accurate characterization of allelic composition and frequency at defined loci is critical, yet traditional workflows remain limited. Sanger sequencing lacks the ability to phase variants, while both Sanger and Illumina are constrained by amplicon length.
 

Plasmidsaurus redefines the genotyping workflow. We’ve eliminated the bottlenecks: no more PCR cleanup, no more library prep, and no sequencing primers. Whether your amplicons are short or massive, we deliver high-quality readouts of allelic composition and frequency in just 1–2 days. Send your samples today; get your insights tomorrow.


Note: Genotyping Analysis detects alleles present at >10% frequency and substitutions or small insertions/deletions under 200 bp in size. For more complex samples or to assess larger insertions/deletions, please consider our premium PCR or custom sequencing services.

Get clear answers, fast

Confidently characterize the sequence and frequency of each allele in your sample across amplicons of any length. Understand sequence changes at the nucleotide and amino acid level with data that you can’t get from Sanger or Illumina.

Say goodbye to data wrangling2

Say goodbye to data wrangling

Eliminate complex and tedious analysis pipelines. See visualizations comparing aligned alleles and get sequences and allelic frequencies in 1-2 days.

Key applications

Cell line, model organism, or germplasm QC

Confirm your cell line or model system contains a desired allele after gene editing or after receiving samples from an external source.

Locus-specific genotyping

Profile variants at defined loci to characterize their sequence. For example:

  1. You have a new mouse model showing a disease phenotype. You want to understand the variation at a specific gene.
  2. You discovered a new strawberry variant. You want to know if there is allelic variation at a specific locus.

Characterize your genetic crosses

Genotype specific loci in progeny or hybrids.

Biomarker-based stratification

Use locus-specific variants to group cell lines and assess genotype–phenotype relationships.

Phase variants

Understand if variants are present on the same strand of DNA (cis) or different strands (trans).

Screen transgenics

Identify founders with a target genotype.

Thumbs Up 2

“I've got two distinct indels, so it's a knockout! This tool is awesome! I'm definitely going to recommend this service to friends in my department!”
 

Dean Rosenthal
PhD Student, Harvard University
Thumbs Up 2

“Many times our CRISPR-edited mouse founders have multiple variants, and Sanger just can’t deconvolute the linked mutations. This new allele calling pipeline does the work for us. It clearly connects mutations in cis, gives us frequency of alleles, and saves us time digging through noisy raw reads. A huge improvement!”

Kyle Kaufmann
Scientist, TransViragen Inc.

Data deliverables & bioinformatics

Allele specific files

Image 102
The “alleles” folder contains the sequence of your alleles in fasta format. The “variants” folder contains frequencies of base calls, for each allele, displayed in tables (.tsv). The “allele-counts” folder contains total counts supporting each allele, displayed in tables (.tsv).

Virtual gel

Frame 1171276157
Shows the distribution of read lengths in your sample - great for quickly identifying samples with unexpected PCR products.

Per base data, prior to allele calling

Per base data
The “ab1-files” folder contains chromatograms (.ab1) and the “per-base-data” folder contains tables (.tsv) for the combined reads of all alleles within a similar-length group.

Feature annotations and reference alignments

Feature annotations
Plasmidsaurus-provided annotations are automatically provided in the feature map (online only). Automatic reference alignments also identify mismatches and associated protein consequences, between your reference and alleles, so you can immediately characterize your genetic variation.

Sequencing statistics including read length and coverage

Image 8
Histograms characterizing read length and coverage distribution across the consensus sequence as well as other sequencing metrics.

Raw reads

Frame 1171276161
Raw reads mapping to your sample are provided in fastq format.

Product specs & service levels

  • Long read sequencing with ONT
  • Optimized for linear, double-stranded DNA
  • Add sample cleanup for $5
Service LevelSizeTarget Read Depth/Sample*Minimum VolumeConcentration*CostTarget Turnaround Time
Standard100 bp - 25 kbUp to 3,000 reads10 μL1 ng/μL per 100 bp$301-2 days

* Sequencing is based on the molarity of your DNA insert, therefore the required concentration (ng/uL) will vary depending on your insert size.

Ready to sequence?

Put your DNA in a tube and drop it off. Click below for requirements and suggestion for optimal results.

Step 1

FAQs

We sequence each sample with Oxford Nanopore long reads to very high depth before generating reads using the latest basecalling and polishing software. Then we cluster these reads into phased alleles. 

  • We construct an amplification-free long-read sequencing library using the newest v14 library prep chemistry
  • We sequence the library with a primer-free protocol using the most accurate R10.4.1 flow cells.
  • We remove noise, phase alleles, and report frequencies only for alleles >10%. Frequencies are adjusted so that reported alleles total 100%.
  • If we detect a sequence at a frequency below 10%, the tool assumes it is likely due to PCR or sequencing error and assigns those reads to a “major”  allele (one that has >10% allele frequency) that is most similar in sequence.
    • Example: When a major allele is identified at 48% frequency and a closely related sequence is identified at 2%, the lower-frequency sequence is merged with the major allele. The final reported frequency of the major allele is therefore 50%.

Our pipeline phases variants into assembled sequences called alleles. If you provide a reference sequence, we align these alleles to your reference.

In some samples, your provided reference sequence may be the same as one of the observed alleles (for example one allele is WT and one allele is a mutant). If this is the case, we will denote this allele with a label eg Allele 1 (Your reference sequence name).

There may be cases when the provided reference is not one of the observed alleles, for example, in cases when all alleles are mutants. In these cases, we will only show your provided reference in the nucleotide and amino acid comparison view.

Finally, if you choose not to provide a reference sequence, we will align all sequences to the sequence with the highest frequency (denoted as Allele 1).

We have tested the genotyping analysis at a range of different allelic ratios for SNPs and small indels. In general we find a strong correlation between experimentally prepared ratios and pipeline-predictions (R2 > .98). If you notice significant discrepancies between our analysis and your expected results, please reach out to us at support@plasmidsaurus.com. To learn more about our data deliverables, see the Genotyping Analysis product page.
 

If your sample includes amplicons that differ significantly in length, for example due a large insertion or deletion on one allele, the pipeline will perform separate analyses on each amplicon and thus report allele frequencies for each amplicon separately. For example, in the case of a simple heterozygous locus with a large indel, you will see two amplicon maps, one for each sized amplicon, each with one allele at  100% allelic frequency.

Premium PCR and Genotyping Analysis return several of the same core data files.

  • FASTQ files: These contain the raw reads generated from your sequencing run in FASTQ format.
  • Histograms and AB1 files: We provide read-length and coverage histograms as well as chromatogram files for each amplicon in your sample. Amplicons of substantially different length are managed separately.
  • Virtual gel: The virtual gel displays the distribution of read lengths across your sample and provides a quick check on the length of amplicons present.

How Genotyping Analysis output files differ from Premium PCR

Unlike Premium PCR, Genotyping Analysis does not generate or report a single consensus sequence. Instead, reads are segregated into distinct allelic groups that are analyzed separately. As a result, you will not receive files associated with generating a consensus, including: the GenBank file, interactive feature map, comparison-results.tsv, or fasta assembly files that accompany Premium PCR samples.

Genotyping Analysis files

Genotyping analysis returns unique files that are not returned in the Premium PCR service:

  • Per-base data (.tsv): Reports base calls and their frequencies for each allele.
  • Allele count tables (.tsv): Summarizes the total read counts supporting each reported allele.
  • FASTA files: FASTA files for your reference sequence and each additional reported allele.