Home

Gene Fragments in NGS Workflows: From Synthesis to Sequencing

2026. 06. 11

Next-generation sequencing (NGS) has transformed genomics research, clinical diagnostics, and drug discovery by enabling massively parallel, high-throughput sequencing of DNA and RNA at unprecedented scale. Yet the reliability of any NGS result depends not only on the sequencing platform and bioinformatics pipeline, but on the quality and integrity of the DNA inputs at every stage of the workflow — from library preparation through target enrichment to final sequencing.

Synthetic gene fragments occupy a strategically important role across the entire NGS workflow. Their precisely defined sequences, known concentrations, and synthesis-verified accuracy make them uniquely suited to serve as controls, calibrators, assembly intermediates, and validation reagents at each stage where biological sample variability would otherwise introduce uncertainty. This article examines how gene fragments integrate into NGS workflows end-to-end, with particular attention to their intersection with hybridization-based target capture using NGS custom probes and whole exome sequencing (WES) probe panels. 

Stage 1: The NGS Workflow — A Brief Orientation

A standard NGS experiment proceeds through four core stages:

1.        Nucleic acid extraction — genomic DNA, cfDNA, or cDNA is isolated and quality-assessed

2.        Library preparation — DNA is fragmented, end-repaired, A-tailed, ligated with platform-specific adapters, and optionally PCR-amplified

3.        Target enrichment (optional) — for targeted sequencing panels or whole exome sequencing, specific genomic regions are captured using hybridization probes before sequencing

4.        Sequencing and data analysis — libraries are loaded onto the sequencing platform; base calls are generated and aligned to a reference genome

Synthetic gene fragments can contribute at Stages 2, 3, and 4, providing independent, ground-truth reference sequences that are not subject to the biological variability, pre-analytical degradation, or extraction biases affecting real patient or research samples.

Stage 2: Gene Fragments in Library Preparation

As Spike-In Sequencing Controls

Library preparation is the most technically variable stage of the NGS workflow. Differences in DNA fragmentation efficiency, end-repair completeness, adapter ligation efficiency, and PCR amplification bias all introduce quantitative and qualitative errors that are difficult to detect without a built-in reference. Synthetic gene fragments of known sequence and concentration serve as ideal spike-in controls:

          Concentration standards: Adding a defined mass of a synthetic gene fragment (e.g., 1 pg of a 500 bp construct per 100 ng of genomic DNA input) creates a known-copy-number reference that can be used to calculate absolute library yield and library preparation efficiency

          Adapter ligation controls: A gene fragment without prior adapter sequences will only produce sequencing reads if adapters are successfully ligated — its read count directly reports ligation efficiency

          PCR amplification bias monitors: By designing spike-in fragments with extreme GC contents (e.g., 20%, 50%, and 80% GC each at known concentration), differential PCR amplification of GC-rich versus GC-poor sequences can be directly quantified and corrected in downstream analysis

As UMI-Tagged Molecular Standards

Unique Molecular Identifier (UMI) technology is increasingly standard in high-sensitivity NGS applications — particularly liquid biopsy and minimal residual disease (MRD) monitoring. Synthetic gene fragments pre-appended with defined UMI sequences of known diversity allow laboratories to:

          Calibrate UMI deduplication efficiency (the fraction of PCR duplicates correctly collapsed per unique molecule)

          Establish the lower limit of detection (LOD) for rare variant detection at defined input DNA concentrations

          Verify that the UMI extraction and deduplication pipeline functions correctly before clinical sample analysis

As PCR Amplicon Controls for Amplicon-Based Libraries

In amplicon-based NGS library preparation — common for targeted gene panels in oncology diagnostics — gene fragments encoding known variants at defined allele frequencies serve as positive controls for each amplicon in the panel. A synthetic gene fragment carrying a BRAF V600E mutation at 5% allele frequency, for example, verifies that the panel can reliably detect mutations at the intended sensitivity threshold.

Stage 3: Gene Fragments in Target Enrichment — Hybridization Capture

How Hybridization Capture Works

Hybridization capture (also called target enrichment) is the method of choice for targeted sequencing panels and whole exome sequencing. The workflow involves:

1.        Preparing a whole-genome DNA sequencing library (fragmented, adapter-ligated)

2.        Denaturing the library and hybridizing it with biotinylated oligonucleotide probes that bind to the target regions of interest

3.        Pulling down probe-library hybrids using streptavidin magnetic beads

4.        Washing away non-target sequences and eluting the enriched library

5.        Sequencing the enriched library to achieve deep coverage of the targeted regions

The efficiency of this process — measured as on-target rate, uniformity of coverage, and sensitivity for rare variants — depends critically on the quality and specificity of the capture probes used. Dynegene's NGS Custom Probes are optimized for both Illumina and MGI sequencing platforms.

Gene Fragments as Probe Performance Standards

Synthetic gene fragments play a direct role in validating capture probe performance before probes are deployed in research or clinical settings:

          On-target rate validation: A synthetic gene fragment carrying the exact sequence targeted by a given probe is spiked into a library at known concentration. After capture, sequencing reads from this fragment should be enriched proportionally to reads from off-target sequences. The enrichment ratio quantifies on-target capture efficiency for that specific probe.

          Probe specificity testing: Multiple gene fragments with sequences similar but not identical to probe targets can be included to measure cross-hybridization — reads from near-miss sequences indicate probe non-specificity that may confound clinical variant calls

          Hybridization condition optimization: Gene fragments at varying concentrations enable dose-response characterization of capture efficiency under different hybridization temperatures, blocker concentrations, and wash stringency conditions

          Lot-to-lot probe consistency: Testing each new probe lot against a panel of gene fragment standards provides objective, quantitative documentation of inter-lot performance consistency — critical for IVD-regulated assays

Gene Fragments as Reference Material for Targeted Panel Development

When a laboratory designs a custom targeted sequencing panel, the process requires demonstrating that each probe in the panel captures its intended target with adequate efficiency and specificity. Synthetic gene fragments encoding each targeted region provide the reference sequences needed for this validation:

Validation Parameter

Gene Fragment Role

Minimum Requirement

On-target capture rate

Spike-in fragment per target region

> 50% of reads on target

Coverage uniformity (0.2× median)

Fragment at uniform input concentration

> 95% bases covered at 0.2×

Variant detection sensitivity

Fragment with known variant at defined VAF

LOD ≤ 1% VAF for somatic panels

Cross-reactivity

Off-target fragments with sequence similarity

< 1% reads from off-target fragments

Reproducibility (CV%)

Triplicate fragment spike-ins per lot

CV < 15% across replicates

 

Stage 3 (continued): Gene Fragments in Whole Exome Sequencing

The Role of WES in Genomics

Whole exome sequencing (WES) focuses sequencing effort on the approximately 2.5% of the human genome that encodes protein-coding exons — the regions where the vast majority of disease-causing variants are found. By capturing only this fraction of the genome, WES achieves deep per-exon coverage at a fraction of the cost of whole-genome sequencing, making it the preferred approach for rare disease diagnosis, cancer gene panel analysis, and population genetics studies.

Dynegene's WES probe panels — including the QuarXeq Human All Exon Probes series — target the complete human exome using biotinylated capture probes designed against curated databases including RefSeq, GENCODE, and ClinVar-annotated disease genes.

Synthetic Exon Reference Libraries

Validating a WES panel across the full ~20,000-gene human exome requires reference sequences for every targeted exon — a challenge that biological samples cannot reliably address due to:

          Uneven coverage across GC-extreme exons in biological samples

          Absence of rare variant alleles (pathogenic variants) in healthy control samples

          Sample-to-sample variability in DNA quality and extraction efficiency

Synthetic gene fragments provide an orthogonal solution. A synthetic exon reference library — a pool of gene fragments, each encoding one or more targeted exons at uniform concentration — enables:

          Pan-exome coverage validation: Every probe in the panel can be assessed for capture efficiency using a single reference library run, without requiring biological samples carrying rare variants

          GC-stratified performance analysis: Exons grouped by GC content (< 30%, 30–50%, 50–70%, > 70%) can be assessed independently, identifying probes that underperform at GC extremes before clinical deployment

Application: Tumor-Normal Somatic Variant Calling Validation

For oncology WES applications, somatic variant calling requires distinguishing true tumor variants from sequencing artifacts and germline polymorphisms. Synthetic gene fragments encoding cancer hotspot mutations (e.g., TP53 R175H, KRAS G12D, PIK3CA H1047R) at defined tumor allele fractions serve as ground-truth samples for:

          Establishing the LOD for somatic SNV detection

          Calibrating tumor content (purity) estimation algorithms

          Validating bioinformatics pipelines for somatic copy number variation (SCNV) calling

          Qualifying laboratory staff on variant interpretation workflows

Stage 4: Gene Fragments in Sequencing Data Validation

Sequencing Platform Performance Standards

Different NGS platforms (Illumina, MGI, Oxford Nanopore, PacBio) have distinct error profiles, read length capabilities, and systematic biases. Synthetic gene fragments of identical sequence submitted to multiple platforms enable:

          Cross-platform concordance analysis: The fraction of variants called identically across platforms for the same synthetic sequence quantifies platform-level discordance

          Systematic error profiling: Sequencing errors that appear at the same position across multiple libraries from the same synthetic fragment identify platform-specific sequence-context errors (e.g., Illumina's known issues with GGC trinucleotides, or homopolymer errors on Oxford Nanopore)

          Phased variant calling validation: Long synthetic gene fragments (1,000–3,000 bp) encoding multiple phased variants allow validation of haplotype phasing algorithms using known ground-truth phase relationships

Liquid Biopsy: cfDNA Assay Development

Circulating cell-free DNA (cfDNA) liquid biopsy assays face extreme analytical challenges: cfDNA molecules are highly fragmented (typically 150–200 bp in plasma), present at very low concentrations, and the tumor-derived fraction may represent less than 0.1% of total cfDNA. Synthetic gene fragments:

          Mimic cfDNA fragment size: Gene fragments of 150–200 bp replicate the size distribution of cfDNA, enabling realistic spike-in controls without requiring patient plasma

          Serve as reference materials for ctDNA assay validation: Regulatory guidance for clinical liquid biopsy assays (e.g., FDA guidance for somatic mutation detection) recommends the use of reference materials with known variant allele fractions — synthetic gene fragment-based standards meet this requirement

          Enable multiplexed variant panel controls: A pool of synthetic gene fragments, each carrying a different clinically relevant somatic mutation, provides comprehensive panel-level QC in a single tube

The Dynegene End-to-End Solution

The applications described above illustrate how gene fragments, NGS capture probes, and WES panels form an integrated, mutually reinforcing system rather than three independent product categories. Dynegene's microarray synthesis platform generates gene fragments with up to 350 nt per oligo, supporting constructs from short cfDNA-mimicking fragments (150–200 bp) through large synthetic exon reference sequences (up to 3,000 bp). With throughput reaching 4.35 million unique sequences per chip and 1 Gb of DNA per run, the platform is equipped to generate comprehensive reference libraries at the scale demanded by modern WES panel validation and large targeted sequencing programs.

Practical Recommendations: Integrating Gene Fragments into Your NGS Workflow

For laboratories establishing or upgrading NGS workflows, the following phased integration strategy maximizes the value of synthetic gene fragment controls:

Phase 1: Library Preparation QC (Immediate Impact)

          Order 3–5 gene fragments of known sequence and concentration (200–400 bp each)

          Use as spike-in controls in every library preparation batch

          Track recovery rates across batches to detect reagent degradation, protocol drift, or operator variability

Phase 2: Panel Validation (Before Clinical or Publication Use)

          Design gene fragments covering 10–20% of panel targets, spanning the full GC content range of the panel

          Generate on-target enrichment data for each fragment to document panel performance

          Include at least one fragment per variant type (SNV, indel, fusion) at the intended LOD

Phase 3: Comprehensive Reference Library (Full Validation Program)

          Commission a synthetic exon or target reference library from Dynegene covering all panel targets

          Perform annual re-qualification using the reference library as a fixed performance benchmark

          Submit reference library data as supporting documentation for regulatory submissions or accreditation audits

Conclusion

Synthetic gene fragments are not merely a synthesis convenience — they are a foundational quality assurance tool that enables NGS laboratories to operate with measurable, documented, and reproducible accuracy across every stage of the sequencing workflow. From the earliest library preparation QC steps through full WES panel validation and clinical liquid biopsy assay development, gene fragments provide the ground-truth reference sequences that make high-confidence genomics possible.

For organizations seeking to implement this integrated approach using a single, high-throughput synthesis platform, Dynegene provides all three essential components: high-fidelity gene fragments, validated NGS custom capture probes, and comprehensive whole exome sequencing probe panels — supported by the technical expertise to help design the right solution for each application.

 

 

Build your integrated NGS quality system. Contact Dynegene at info2@dynegene.com or visit dynegene.com to discuss your gene fragment and probe requirements with the technical team.

Contact Us

Tel: 400-017-9077

Address: Floor 2, Building 5, No. 248 Guanghua Road, Minhang District, Shanghai

Email:info2@dynegene.com

Dynegene Next-Gen Synthesis: Powering Biotech Revolution With Nucleic Acids

Contact Us

Tel: 400-017-9077

Address: Floor 2, Building 5, No. 248 Guanghua Road, Minhang District, Shanghai

Email: info2@dynegene.com

Follow Us

Copyright © 2026 Shanghai Dynegene Technologies Co., Ltd.
All rights reserved.   

Website Map I Privacy Policy