Appearance
This report is written by MaltSci based on the latest literature and research findings
What are the applications of long-read sequencing?
Abstract
The advent of long-read sequencing technologies has significantly transformed the field of genomics, providing researchers with enhanced capabilities to explore the complexities of the human genome and other organisms. Unlike traditional short-read sequencing methods, which are limited in read length, long-read sequencing can generate reads extending tens of thousands of base pairs. This ability is critical for accurately resolving complex genomic regions, structural variations, and repetitive sequences that are often difficult to analyze with shorter reads. The implications of long-read sequencing extend to personalized medicine, as it enhances our understanding of the genetic basis of diseases, leading to more effective therapeutic strategies. Recent advancements in long-read sequencing technologies, particularly those developed by Pacific Biosciences and Oxford Nanopore Technologies, have demonstrated their efficacy in various fields, including cancer genomics, rare genetic disorders, and infectious disease diagnostics. This review systematically explores the diverse applications of long-read sequencing, highlighting its strengths in genome assembly, transcriptome analysis, epigenomics, and clinical research. Key applications include the ability to produce comprehensive genome assemblies, characterize full-length transcripts, and assess DNA methylation and chromatin accessibility. In clinical settings, long-read sequencing shows promise in improving diagnostic rates for complex genetic diseases and facilitating the integration of genomic insights into personalized medicine. As these technologies continue to evolve, their applications are expected to expand, ultimately enhancing our understanding of genetic diversity and disease mechanisms.
Outline
This report will discuss the following questions.
- 1 Introduction
- 2 Overview of Long-Read Sequencing Technologies
- 2.1 Types of Long-Read Sequencing Technologies
- 2.2 Comparison with Short-Read Sequencing
- 3 Applications in Genome Assembly
- 3.1 De Novo Genome Assembly
- 3.2 Structural Variation Detection
- 4 Applications in Transcriptome Analysis
- 4.1 Full-Length Transcript Sequencing
- 4.2 Alternative Splicing Analysis
- 5 Applications in Epigenomics
- 5.1 DNA Methylation Studies
- 5.2 Chromatin Accessibility
- 6 Applications in Clinical Research
- 6.1 Disease Association Studies
- 6.2 Personalized Medicine
- 7 Conclusion
1 Introduction
The advent of long-read sequencing technologies has ushered in a new era in genomics, significantly enhancing our ability to explore the complexities of the human genome and other organisms. Unlike traditional short-read sequencing methods, which typically generate reads of only a few hundred base pairs, long-read sequencing allows for the production of reads that can extend tens of thousands of base pairs. This capability is crucial for accurately resolving complex genomic regions, structural variations, and repetitive sequences that are often challenging to analyze with shorter reads. As such, long-read sequencing has become a powerful tool in various fields of biological and medical research, facilitating a deeper understanding of genome architecture, gene expression, and the molecular underpinnings of diseases.
The significance of long-read sequencing extends beyond mere technical advancements; it holds the potential to transform personalized medicine and enhance diagnostic capabilities. By providing comprehensive insights into genomic structures and variations, long-read sequencing can improve our understanding of the genetic basis of diseases, paving the way for more effective therapeutic strategies and targeted treatments. Recent studies have demonstrated the efficacy of long-read sequencing in elucidating complex genetic diseases, including various forms of cancer, where traditional sequencing methods often fall short in identifying critical alterations [1][2]. Furthermore, as the cost of sequencing continues to decline and throughput increases, the integration of long-read technologies into clinical settings is becoming increasingly feasible [3].
Current research highlights the rapid evolution of long-read sequencing technologies, particularly those developed by Pacific Biosciences and Oxford Nanopore Technologies. These platforms have demonstrated their ability to produce high-quality genomic data, enabling researchers to conduct comprehensive analyses that were previously unattainable [4][5]. The ability to detect structural variants, characterize full-length transcripts, and explore epigenomic landscapes has positioned long-read sequencing as a critical component of modern genomic research [6][7].
This review will systematically explore the diverse applications of long-read sequencing across various domains, structured as follows:
- An overview of long-read sequencing technologies, including a comparison with short-read sequencing methods, will provide foundational knowledge on the strengths and limitations of each approach.
- The applications of long-read sequencing in genome assembly will be examined, focusing on de novo assembly and structural variation detection, which are pivotal for understanding genomic architecture.
- The role of long-read sequencing in transcriptome analysis will be discussed, emphasizing full-length transcript sequencing and alternative splicing analysis, both of which are crucial for comprehending gene expression dynamics.
- We will explore the applications of long-read sequencing in epigenomics, highlighting its utility in DNA methylation studies and chromatin accessibility, which are essential for understanding gene regulation.
- Finally, the review will address the applications of long-read sequencing in clinical research, including disease association studies and personalized medicine, underscoring its potential to revolutionize diagnostic practices.
In conclusion, as long-read sequencing technologies continue to advance, their applications in biomedical research are expected to expand, ultimately enhancing our understanding of genetic diversity and disease mechanisms. This review aims to provide a comprehensive overview of these technologies, informing researchers and clinicians about their potential to advance both basic and applied sciences in genomics and medicine.
2 Overview of Long-Read Sequencing Technologies
2.1 Types of Long-Read Sequencing Technologies
Long-read sequencing (LRS) technologies have emerged as transformative tools in genomics, enabling the comprehensive analysis of complex genomic regions that are often inaccessible to traditional short-read sequencing methods. The applications of LRS span various fields, including clinical diagnostics, cancer genomics, rare genetic disorders, and microbial studies, among others.
In clinical settings, long-read sequencing is particularly advantageous for molecular diagnosis and therapy selection. It excels in identifying structural variants (SVs), such as insertions, deletions, and duplications, which are critical for understanding the genetic basis of diseases. For instance, studies have shown that LRS can accurately detect SVs throughout the genome, including regions that are challenging to analyze with short reads, such as repetitive sequences and segmental duplications. This capability is crucial for diagnosing rare genetic disorders, where structural variants are often the underlying cause [8].
In cancer genomics, LRS is rapidly being recognized for its ability to provide a comprehensive view of the genome, transcriptome, and epigenome. It allows for the detection of alterations that previous technologies have overlooked, including complex rearrangements and variations within repetitive regions. The ability to resolve these alterations can lead to improved understanding of cancer biology and potential therapeutic targets [1]. Moreover, LRS facilitates the assembly of complete, gapless genomes, enabling the study of previously intractable regions like centromeres [5].
Additionally, LRS has shown promise in the field of infectious disease diagnostics. It enables the rapid and accurate identification of pathogens directly from clinical samples, providing essential information such as antimicrobial resistance profiles. This capability is particularly valuable in managing infectious diseases where timely and precise identification of the causative agent is critical [9].
In the realm of transcriptomics, long-read RNA sequencing (RNA-seq) has revolutionized the study of transcriptome complexity. By enabling the sequencing of full-length RNA molecules, LRS provides a more accurate characterization of transcript isoforms, allowing researchers to explore variations that cannot be reliably assessed with standard short-read methods [10]. This technology is crucial for understanding gene expression patterns in various diseases, including cancer [11].
The advancements in LRS technologies, particularly those developed by companies like Oxford Nanopore Technologies and Pacific Biosciences, have significantly improved the accuracy, throughput, and cost-effectiveness of sequencing. These developments have opened new avenues for research and clinical applications, with ongoing improvements aimed at addressing challenges such as computational complexity and the need for efficient interpretation of genomic data [3].
In summary, long-read sequencing technologies are versatile tools with a wide range of applications in clinical diagnostics, cancer research, rare disease identification, and microbial genomics. Their ability to provide high-resolution insights into complex genomic regions is poised to enhance our understanding of genetic diversity and disease mechanisms, paving the way for advancements in precision medicine and targeted therapies.
2.2 Comparison with Short-Read Sequencing
Long-read sequencing technologies have emerged as a transformative approach in genomics, providing significant advancements in the analysis of complex genomic regions that are often inaccessible with traditional short-read sequencing methods. These technologies, notably from Pacific Biosciences and Oxford Nanopore Technologies, allow for the generation of reads that can span tens to thousands of kilobases, enabling a comprehensive understanding of genomic architecture.
One of the primary applications of long-read sequencing is in the detection and characterization of structural variants (SVs). Long-read technologies have demonstrated a remarkable ability to identify complex structural rearrangements, including large deletions, duplications, and gene fusions, which are critical in understanding various cancers and genetic disorders. For instance, studies have shown that long-read sequencing can successfully uncover SVs in cancer genomes that are often missed by short-read approaches, thus providing insights into the molecular etiology of tumors (Sakamoto et al. 2021; Iyer et al. 2024) [2][7].
Moreover, long-read sequencing enhances the resolution of genomic analyses by facilitating the assembly of diploid genomes, thereby revealing the full spectrum of human genetic variation. This capability is particularly important in population-scale studies and rare disease research, where understanding genotype-phenotype correlations is essential (Rausch et al. 2025) [6]. Additionally, long-read sequencing technologies enable the analysis of epigenetic modifications, such as DNA methylation, directly from the sequencing data, further enriching the understanding of gene regulation and expression (Mahmoud et al. 2025) [12].
In clinical settings, long-read sequencing has been recognized for its potential to improve diagnostic rates for complex genetic diseases. For example, it has been shown to cover regions of the genome that are typically difficult to interpret with short-read sequencing, thus facilitating the identification of previously unrecognized variants associated with diseases (Kobayashi et al. 2022) [13]. Furthermore, the ability of long-read sequencing to detect small variants alongside larger structural changes enhances its utility in precision medicine (Xu et al. 2025) [14].
In comparison to short-read sequencing, long-read technologies offer distinct advantages. Short-read sequencing, while cost-effective and high-throughput, often struggles with repetitive regions and complex genomic structures due to its limited read length (Warburton et al. 2023) [5]. Long reads can bridge these gaps, providing more contiguous assemblies and a clearer picture of genomic architecture. However, long-read sequencing typically has higher error rates and may be more expensive, which poses challenges in achieving sufficient coverage for certain applications (Mahmoud et al. 2025) [12].
In summary, long-read sequencing technologies represent a significant advancement in genomic research and clinical applications, offering enhanced capabilities for detecting structural variants, understanding genetic diversity, and improving diagnostic rates for complex diseases. Their ability to generate comprehensive genomic data positions them as a critical tool in both research and clinical settings, paving the way for deeper insights into human health and disease.
3 Applications in Genome Assembly
3.1 De Novo Genome Assembly
Long-read sequencing technologies have significantly advanced the field of de novo genome assembly, enabling the reconstruction of complex genomes with improved accuracy and continuity. These technologies, such as those developed by Pacific Biosciences and Oxford Nanopore, provide extended DNA sequences capable of spanning intricate and repetitive regions of genomes, which are often challenging to assemble using traditional short-read sequencing methods.
One of the primary applications of long-read sequencing in de novo genome assembly is its ability to generate more contiguous and complete genome assemblies. For instance, the hybrid assembly approach, which combines long reads for scaffolding with short reads for error correction, has demonstrated remarkable improvements in assembly quality. This method allows for the correction of errors in long read sequences using complementary short reads, resulting in higher-quality assemblies with increased contiguity [15]. Furthermore, tools like Fast-SG utilize ultrafast alignment-free algorithms to construct scaffolding graphs from either short or long reads, facilitating the assembly of large genomes with lower computational demands and costs [16].
Long-read sequencing has proven particularly beneficial for assembling the genomes of non-model organisms, which often exhibit high polymorphism and complex genomic architectures. A study evaluating ten long-read assemblers on Pacific Biosciences datasets revealed that long-read technologies could produce superior quality assemblies compared to those generated using second-generation sequencing [17]. This capability is critical for advancing our understanding of the genetic diversity and evolutionary biology of various species.
Moreover, the application of long-read sequencing extends to the resolution of structural variants (SVs) within genomes, which are essential for linking genotypes to phenotypes in both population-scale studies and rare disease research. The ability to accurately detect SVs has been highlighted as a transformative aspect of long-read sequencing, enabling researchers to explore genomic features that were previously inaccessible [6].
In addition to its role in genome assembly, long-read sequencing has been instrumental in enhancing the characterization of genomic variation at the single-cell level. For example, a study demonstrated that long-read sequencing could provide insights into the genomic architecture of individual cells, revealing a substantial number of high-confidence SVs that were undetectable using short-read approaches [18].
The ongoing advancements in long-read sequencing technologies continue to shape the landscape of genomic research, offering robust solutions for de novo genome assembly and facilitating deeper insights into complex genomic structures and variations. As the technology evolves, it holds promise for addressing the remaining challenges in genome assembly and expanding its applications across diverse biological contexts.
3.2 Structural Variation Detection
Long-read sequencing technologies have significantly advanced the field of genomics, particularly in the detection and characterization of structural variations (SVs) within genomes. These applications are crucial for understanding genetic diversity, disease mechanisms, and the complexities of genomic architecture. The following outlines key applications of long-read sequencing in structural variation detection:
Detection of Structural Variants: Long-read sequencing enables the precise identification of various structural variants, including insertions, deletions, duplications, inversions, and complex rearrangements that are often missed by short-read sequencing technologies. This capability is particularly beneficial in cancer genomics, where SVs can play a critical role in tumorigenesis. For instance, studies have demonstrated that long-read sequencing can accurately characterize complex structural aberrations in cancer genomes, such as large deletions and gene fusions, which are essential for understanding the molecular etiology of cancers [19].
Characterization of Non-Coding Regions: Long-read sequencing has proven to be effective in identifying SVs impacting non-coding elements of the genome, which are crucial for regulatory functions. A study involving families with autism spectrum disorders highlighted how long-read sequencing could detect a higher number of SVs in non-coding regions compared to short-read methods, thereby uncovering potential risk factors associated with these conditions [20].
Comprehensive Genome Assembly: The ability of long-read sequencing to produce longer reads allows for better assembly of complex genomic regions that are typically challenging to resolve with short reads. This technology has facilitated the generation of complete, gapless assemblies of human genomes, including repetitive regions that are often problematic for conventional sequencing approaches [5]. Long-read sequencing has also enabled the assembly of diploid genomes, which is critical for revealing the full spectrum of human genetic variation [4].
Population-Scale Studies: Recent advancements in long-read sequencing technologies have allowed for their application in population-scale genomic studies. This has been pivotal in detecting structural variants across large cohorts, which is essential for linking genotypes to phenotypes in various genetic conditions and for understanding the evolutionary dynamics of populations [21].
Improved Diagnostic Rates: Long-read sequencing has shown significant promise in clinical settings, particularly for rare genetic disorders. Its ability to detect structural variants that traditional methods overlook can lead to improved diagnostic rates. For example, in pediatric endocrine disorders, long-read sequencing has identified pathogenic variants that were previously undetected, thereby enhancing the diagnostic yield [22].
Integration with Bioinformatics Tools: The development of specialized bioinformatics tools for long-read data analysis has further enhanced the ability to detect and interpret structural variants. Tools like Sniffles2 and Dysgu have been designed to handle the unique challenges posed by long-read sequencing data, improving the sensitivity and precision of SV detection [8][23].
In summary, long-read sequencing has revolutionized the detection of structural variations by providing comprehensive insights into complex genomic architectures, thereby playing a crucial role in advancing both research and clinical applications in genomics. Its ability to reveal previously inaccessible genomic information positions it as a transformative tool in understanding genetic diversity and disease mechanisms.
4 Applications in Transcriptome Analysis
4.1 Full-Length Transcript Sequencing
Long-read RNA sequencing (RNA-seq) has emerged as a transformative technology in the field of transcriptome analysis, enabling the comprehensive study of full-length transcripts and providing significant advantages over traditional short-read sequencing methods. The applications of long-read sequencing in transcriptome analysis are vast and impactful, particularly in the context of understanding complex transcript structures, alternative splicing events, and gene expression regulation.
One of the primary applications of long-read RNA-seq is its ability to generate full-length transcript sequences. This capability allows researchers to identify and characterize novel isoforms, alternative splicing events, and fusion transcripts that are often missed by short-read approaches. For instance, long-read sequencing has been utilized to uncover extensive transcript diversity in various biological contexts, such as in human-derived cortical neurons, where a study identified 15,072 transcripts in stem cell-derived cortical neurons alone, alongside numerous differential expression events and usage analyses related to neurodevelopmental and neurodegenerative diseases [24].
Additionally, long-read RNA-seq plays a crucial role in enhancing the accuracy of transcriptome annotation. By enabling the sequencing of longer reads, researchers can more effectively assemble full-length transcripts, thereby improving the completeness and accuracy of transcriptome assemblies. A study on human pancreatic cancer cell lines reported the generation of a high-coverage long-read transcriptome dataset, which not only facilitated isoform discovery but also supported downstream applications such as transcriptome annotation and integration with other omics data [11].
Long-read sequencing technologies also provide insights into transcriptome complexity by enabling the detection of structural variations and epigenetic marks. This is particularly relevant in plant pathology, where long-read sequencing has revolutionized the study of plant and pathogen genomes and transcriptomes, contributing to the understanding of plant-pathogen interactions and disease management [25].
Moreover, the integration of long-read sequencing with novel computational tools enhances its applications in transcriptome analysis. For example, the introduction of TranSigner, a tool designed for accurate read assignment and abundance estimation, demonstrates how long-read data can be effectively utilized to improve transcript quantification [26]. Similarly, the SQANTI tool allows for extensive characterization of long-read transcript sequences, ensuring quality control in full-length transcriptome identification and quantification [27].
In summary, long-read RNA sequencing has a multitude of applications in transcriptome analysis, including the characterization of full-length transcripts, enhancement of transcriptome annotation, investigation of transcript diversity, and integration with advanced computational tools for improved data analysis. These applications are paving the way for a deeper understanding of gene expression regulation and the complexities of transcriptomic landscapes in various biological contexts.
4.2 Alternative Splicing Analysis
Long-read sequencing (LRS) technologies have emerged as a transformative tool in transcriptome analysis, particularly in the study of alternative splicing (AS). The inherent ability of LRS to produce full-length transcripts enables researchers to gain deeper insights into the complexity of splicing mechanisms and their implications in various biological contexts.
One significant application of LRS in alternative splicing analysis is its capacity to uncover previously unannotated isoforms and to characterize isoform expression across different cell states. For instance, a study utilizing Oxford Nanopore Technologies for long-read sequencing on the neuroblastoma cell line SH-SY5Y identified a novel transcript of the voltage-gated calcium channel subunit gene, CACNA2D2, revealing differential expression and usage of transcripts during differentiation. This highlights the potential of LRS to facilitate the identification of candidates for future research into state change regulation (Wright et al., 2022) [28].
Furthermore, LRS has been shown to provide a comprehensive mapping of transcript diversity, enabling the identification of extensive transcript variants across different cell types. For example, a study profiling human fibroblasts, induced pluripotent stem cells, and stem cell-derived cortical neurons identified 15,072 transcripts in cortical neurons, with a total of 35,519 differential transcript expression events. This underscores the complexity of transcriptomic regulation and the utility of LRS in advancing our understanding of neurodevelopmental and neurodegenerative diseases (Xu et al., 2025) [24].
LRS also allows for detailed profiling of alternative splicing events at the isoform level, which is critical for understanding the functional consequences of splicing variations. The integration of LRS with computational tools, such as IsoTools, facilitates the reconstruction and quantification of transcripts, enabling researchers to identify alternative splicing events with high accuracy (Lienhard et al., 2023) [29].
Moreover, the use of LRS in combination with other sequencing technologies enhances the resolution of AS analysis. For instance, a comparative study on the use of short-read and long-read sequencing for detecting differential exon usage demonstrated that long-reads excelled in identifying AS events at untranslated regions (UTRs), while short-reads provided better quantification of expression levels. This complementary approach allows for a more comprehensive understanding of AS dynamics during processes such as cell differentiation (Leshkowitz et al., 2022) [30].
In summary, long-read sequencing plays a crucial role in advancing the field of alternative splicing analysis by enabling the identification of novel isoforms, providing insights into transcriptomic complexity, and facilitating the exploration of splicing dynamics in various biological contexts. The integration of LRS with existing technologies and computational frameworks continues to enhance our understanding of the functional implications of alternative splicing in health and disease.
5 Applications in Epigenomics
5.1 DNA Methylation Studies
Long-read sequencing technologies have significantly advanced the field of epigenomics, particularly in the study of DNA methylation. This approach allows for a more comprehensive understanding of the methylome, which is crucial for elucidating the regulatory mechanisms underlying gene expression and various biological processes. The applications of long-read sequencing in DNA methylation studies can be categorized as follows:
Enhanced Methylation Profiling: Long-read sequencing facilitates the direct detection of DNA methylation patterns at single-base resolution without the need for bisulfite treatment, which can introduce biases and degrade DNA quality. This capability allows researchers to analyze methylation in complex genomic regions, including repetitive elements and transposable elements, which are often challenging to characterize using traditional short-read sequencing methods. For instance, nanopore sequencing can provide insights into the methylation status of 5-methylcytosine and 5-hydroxymethylcytosine across long DNA fragments, enhancing the accuracy of methylation level estimations in transposable elements associated with cancer [31].
Single-Molecule Analysis: Long-read sequencing enables the analysis of DNA methylation at the single-molecule level, revealing heterogeneity in methylation patterns that bulk-level analyses might overlook. Studies have shown that single-molecule methylation profiling can uncover large-scale variations in methylation across the genome, particularly within heterochromatin regions [32]. This granular approach is essential for understanding the epigenetic landscape of various cell types and how they may differ in aging or disease states [33].
Epigenetic Aging Studies: The ability to profile DNA methylation with long-read sequencing has opened new avenues for research into epigenetic aging. By capturing full CpG contexts and accommodating diverse genomic regions, long-read sequencing provides insights into age-associated methylation patterns, facilitating the development of biomarkers for aging [34]. This methodology allows for tracking cellular aging dynamics at a resolution that was previously unattainable.
Clinical Applications: Long-read sequencing has significant potential in clinical settings, particularly in cancer diagnostics. By analyzing DNA methylation changes in cancerous tissues, researchers can identify specific methylation signatures associated with tumorigenesis. For example, a targeted long-read methylation analysis method has been developed that integrates hybridization capture to enhance the detection of methylation patterns in clinical specimens, allowing for haplotype-aware analysis of cancers [35].
Computational Method Development: The growth of long-read sequencing technologies has prompted the development of advanced computational tools to analyze the resulting methylation data. These tools aim to address the unique challenges posed by long-read data, such as alignment and methylation signal calling, enhancing the overall efficiency and accuracy of methylation studies [6].
Integration with Genomic Insights: Long-read sequencing allows for simultaneous measurement of genetic and epigenetic information, providing a holistic view of the genome's regulatory mechanisms. This integration is crucial for understanding complex disease mechanisms, as it helps to link variations in DNA methylation with genetic alterations [36].
In summary, long-read sequencing technologies have revolutionized DNA methylation studies by providing deeper insights into the epigenome, enhancing the resolution of methylation profiling, and enabling new clinical applications. These advancements underscore the transformative potential of long-read sequencing in both basic and applied epigenomic research.
5.2 Chromatin Accessibility
Long-read sequencing (LRS) technologies, particularly those developed by Oxford Nanopore Technologies and Pacific Biosciences, have significantly advanced the field of epigenomics, particularly in the study of chromatin accessibility. These technologies provide several key advantages over traditional short-read sequencing methods, enabling a more comprehensive understanding of chromatin dynamics and regulatory mechanisms.
One of the primary applications of long-read sequencing in epigenomics is the simultaneous profiling of chromatin accessibility and DNA methylation. For instance, a study utilizing nanopore sequencing demonstrated the ability to evaluate CpG methylation and chromatin accessibility concurrently on long strands of DNA. This approach, termed nanoNOMe, was applied to four human cell lines, revealing insights into nucleosome occupancy and the combinatorial epigenetic signatures of promoters at single-molecule resolution. Such capabilities allow researchers to generate fully phased human epigenomes, which include chromosome-level allele-specific profiles of chromatin accessibility and methylation, crucial for understanding the differences between cancerous and noncancerous cells [37].
Moreover, long-read sequencing enables high-resolution profiling of chromatin states, as seen in studies employing techniques like Simultaneous Accessibility and DNA Methylation Sequencing (SAM-seq). This method leverages long-read nanopore sequencing to obtain detailed accessibility and methylation landscapes in plant genomes, revealing interactions between chromatin accessibility and DNA methylation across various genomic regions, including genes and transposable elements [38].
In mammalian systems, long-read sequencing has been pivotal in elucidating chromatin accessibility dynamics during early embryonic development. The development of scNanoATAC-seq2 allows researchers to assess chromatin accessibility at single-cell resolution in mouse preimplantation embryos. This technique has uncovered distinct chromatin signatures associated with lineage segregation and provided insights into the reprogramming of chromatin accessibility during early development [39].
Furthermore, long-read sequencing facilitates the exploration of chromatin accessibility in complex systems, such as whole organisms. For example, research in Caenorhabditis elegans has shown that chromatin accessibility changes throughout development can reveal key regulatory elements, emphasizing the importance of studying chromatin dynamics in vivo [40].
The ability of long-read sequencing to directly detect DNA modifications, such as methylation, enhances its utility in profiling chromatin accessibility. This feature allows for the interpretation of molecular phenotypes and provides insights into the parent-of-origin effects of mutations, which is critical for understanding the genetic basis of diseases [6].
In summary, long-read sequencing has revolutionized the field of epigenomics by enabling comprehensive profiling of chromatin accessibility and associated modifications across various biological contexts. Its applications span from basic research in developmental biology to potential clinical applications in cancer genomics, highlighting its transformative potential in understanding the regulatory landscape of the genome.
6 Applications in Clinical Research
6.1 Disease Association Studies
Long-read sequencing (LRS) has emerged as a transformative technology in clinical research, particularly in the context of disease association studies. This technology addresses significant limitations associated with short-read sequencing, enabling a more comprehensive analysis of complex genomic regions that are often associated with various diseases.
One of the primary applications of long-read sequencing is in the diagnosis of rare genetic disorders. Long-read sequencing effectively detects and clarifies additional disease-associated variants that may be missed by standard diagnostic workflows, particularly for structural variants (SVs), which include insertions, deletions, duplications, inversions, and complex rearrangements. These SVs are crucial in understanding the etiology of many rare diseases, as they often underlie the genetic basis of these conditions. Studies have shown that LRS can yield an additional diagnostic rate of 7%-17% following negative short-read genome sequencing, significantly enhancing the diagnostic yield for undiagnosed rare diseases (Del Gobbo & Boycott, 2025) [41].
Moreover, long-read sequencing is particularly adept at resolving complex genomic regions that are challenging for short-read technologies, such as repetitive sequences and areas with high sequence similarity. This capability allows for a more accurate characterization of genomic alterations, thereby facilitating the identification of disease mechanisms. For instance, long-read sequencing has been successfully applied to detect structural variants in cancer genomes, where it has enabled precise identification of alterations that were previously overlooked by traditional sequencing methods (Sakamoto et al., 2021) [2].
In the context of cancer research, long-read sequencing provides a comprehensive view of genomic, transcriptomic, and epigenomic alterations. This technology not only enhances the detection of SVs but also allows for the evaluation of epigenetic modifications, which are crucial for understanding cancer biology. By utilizing long-read sequencing, researchers can uncover alterations in complex rearrangements and repetitive regions that contribute to tumorigenesis (Li et al., 2025) [1].
Long-read sequencing also plays a pivotal role in elucidating the genetic basis of common diseases by enabling the identification of genetic variants associated with complex traits. The ability to generate high-resolution insights into the human genome has significant implications for linking genotype to phenotype in population-scale studies. Recent advancements in sequencing throughput and computational methodologies are paving the way for integrating long-read sequencing into clinical cohort studies and disease diagnostics (Rausch et al., 2025) [6].
Furthermore, long-read sequencing has applications beyond genetic disorders and cancer. It has been shown to improve the accuracy of pathogen identification in infectious diseases, allowing for the direct sequencing of pathogens from clinical samples and providing critical information such as antimicrobial resistance profiles (Hoang et al., 2021) [9]. This capability is particularly important for managing infectious diseases, where rapid and accurate identification of causative agents is essential for effective treatment.
In summary, long-read sequencing is revolutionizing clinical research by enhancing the diagnostic yield for rare genetic disorders, improving cancer genomics, and facilitating the identification of disease-associated variants across various conditions. Its ability to provide comprehensive genomic insights positions it as a cornerstone technology for future disease association studies and precision medicine initiatives.
6.2 Personalized Medicine
Long-read sequencing (LRS) has emerged as a transformative technology in clinical research, particularly in the realm of personalized medicine. Its unique capabilities address several limitations associated with traditional short-read sequencing, making it a powerful tool for enhancing molecular diagnostics and therapy selection.
One of the primary applications of long-read sequencing in personalized medicine is its ability to provide comprehensive insights into complex genomic regions that are often challenging to analyze with short reads. Long-read technologies, such as those developed by Pacific Biosciences and Oxford Nanopore Technologies, enable the detection of structural variations, long-range haplotype phasing, and the identification of base modifications. This capability is particularly beneficial in cases where short-read sequencing fails to yield conclusive results, as it allows for better detection of structural variants and more accurate resolution of repetitive or non-unique genomic regions (Conlin et al., 2022; Hoang et al., 2021) [9][42].
Furthermore, long-read sequencing has shown significant promise in the molecular diagnostics of constitutional genetic disorders. It has been reported that LRS can increase the diagnostic yield for patients with genetic disorders, thereby facilitating more precise treatment plans and enabling families to make informed decisions regarding care. The technology's ability to uncover previously undetected variants and improve the understanding of genetic disease mechanisms is pivotal for the advancement of personalized medicine (Mastrorosa et al., 2023) [43].
In the context of cancer genomics, long-read sequencing provides a comprehensive view of the genomic landscape, enabling the identification of alterations that previous technologies have overlooked. This includes complex rearrangements and structural variants that are critical for understanding tumor biology and for the development of targeted therapies (Li et al., 2025; Sakamoto et al., 2021) [1][2]. The ability to analyze the epigenomic status surrounding structural variants further enhances the understanding of their functional implications in cancer progression.
Long-read sequencing is also poised to play a significant role in the diagnosis and management of rare diseases. Its application can help resolve cases that remain unsolved after conventional testing methods, thus increasing the diagnostic yield and expediting the identification of genetic causes (Xu et al., 2025) [14]. Additionally, LRS can provide valuable insights into the genetic basis of complex diseases by allowing for the characterization of large structural variations and repeat expansions that are often implicated in disease etiology.
Moreover, as LRS technologies continue to advance, the integration of these methods into routine clinical practice is expected to enhance the precision of genetic diagnostics. This integration will not only improve the identification of pathogenic variants but also facilitate the exploration of the interactions between genetic and environmental factors in disease development (Warburton & Sebra, 2023) [5].
In summary, the applications of long-read sequencing in personalized medicine are vast and multifaceted. By improving the accuracy and efficiency of genetic diagnostics, LRS holds the potential to revolutionize patient care through more personalized and targeted therapeutic strategies. Its ability to elucidate complex genomic structures and variations is essential for advancing our understanding of genetic diseases and enhancing the effectiveness of precision medicine.
7 Conclusion
Long-read sequencing technologies have revolutionized the field of genomics, offering unprecedented capabilities in the analysis of complex genomic regions and structural variations. The primary findings indicate that long-read sequencing excels in detecting structural variants, characterizing full-length transcripts, and elucidating epigenomic landscapes, which are often challenging for traditional short-read methods. The current state of research highlights the rapid evolution of these technologies, with significant advancements made by companies like Pacific Biosciences and Oxford Nanopore Technologies. As the cost of long-read sequencing continues to decrease and its accuracy improves, its applications in clinical settings are expanding, particularly in personalized medicine and rare disease diagnostics. Future research directions should focus on enhancing computational tools for data analysis, integrating long-read sequencing into routine clinical workflows, and exploring its potential in population-scale studies. Overall, long-read sequencing holds the promise to further transform our understanding of genetic diversity and disease mechanisms, paving the way for advancements in precision medicine and targeted therapies.
References
- [1] Qiuhui Li;Ayse G Keskus;Justin Wagner;Michal B Izydorczyk;Winston Timp;Fritz J Sedlazeck;Alison P Klein;Justin M Zook;Mikhail Kolmogorov;Michael C Schatz. Unraveling the hidden complexity of cancer through long-read sequencing.. Genome research(IF=5.5). 2025. PMID:40113261. DOI: 10.1101/gr.280041.124.
- [2] Yoshitaka Sakamoto;Suzuko Zaha;Yutaka Suzuki;Masahide Seki;Ayako Suzuki. Application of long-read sequencing to the detection of structural variants in human cancer genomes.. Computational and structural biotechnology journal(IF=4.1). 2021. PMID:34527193. DOI: 10.1016/j.csbj.2021.07.030.
- [3] Josephine B Oehler;Helen Wright;Zornitza Stark;Andrew J Mallett;Ulf Schmitz. The application of long-read sequencing in clinical settings.. Human genomics(IF=4.3). 2023. PMID:37553611. DOI: 10.1186/s40246-023-00522-3.
- [4] Glennis A Logsdon;Mitchell R Vollger;Evan E Eichler. Long-read human genome sequencing and its applications.. Nature reviews. Genetics(IF=52.0). 2020. PMID:32504078. DOI: 10.1038/s41576-020-0236-x.
- [5] Peter E Warburton;Robert P Sebra. Long-Read DNA Sequencing: Recent Advances and Remaining Challenges.. Annual review of genomics and human genetics(IF=7.9). 2023. PMID:37075062. DOI: 10.1146/annurev-genom-101722-103045.
- [6] Tobias Rausch;Tobias Marschall;Jan O Korbel. The impact of long-read sequencing on human population-scale genomics.. Genome research(IF=5.5). 2025. PMID:40228902. DOI: 10.1101/gr.280120.124.
- [7] Shruti V Iyer;Sara Goodwin;William Richard McCombie. Leveraging the power of long reads for targeted sequencing.. Genome research(IF=5.5). 2024. PMID:39567237. DOI: 10.1101/gr.279168.124.
- [8] Efthalia Moustakli;Panagiotis Christopoulos;Anastasios Potiris;Athanasios Zikopoulos;Despoina Mavrogianni;Grigorios Karampas;Nikolaos Kathopoulis;Ismini Anagnostaki;Ekaterini Domali;Alexandros T Tzallas;Peter Drakakis;Sofoklis Stavros. Long-Read Sequencing and Structural Variant Detection: Unlocking the Hidden Genome in Rare Genetic Disorders.. Diagnostics (Basel, Switzerland)(IF=3.3). 2025. PMID:40722552. DOI: 10.3390/diagnostics15141803.
- [9] Minh Thuy Vi Hoang;Laszlo Irinyi;Yiheng Hu;Benjamin Schwessinger;Wieland Meyer. Long-Reads-Based Metagenomics in Clinical Diagnosis With a Special Focus on Fungal Infections.. Frontiers in microbiology(IF=4.5). 2021. PMID:35069461. DOI: 10.3389/fmicb.2021.708550.
- [10] Isabelle Heifetz Ament;Nicole DeBruyne;Feng Wang;Lan Lin. Long-read RNA sequencing: A transformative technology for exploring transcriptome complexity in human diseases.. Molecular therapy : the journal of the American Society of Gene Therapy(IF=12.0). 2025. PMID:39563027. DOI: 10.1016/j.ymthe.2024.11.025.
- [11] Shengnan Luo;Jiling Feng;Yu Zeng;Hao Wu;Yingxia Zheng;Shengli Li. Long-read RNA sequencing dataset of human pancreatic cancer cell lines.. Scientific data(IF=6.9). 2025. PMID:41115920. DOI: 10.1038/s41597-025-05939-0.
- [12] Medhat Mahmoud;Daniel P Agustinho;Fritz J Sedlazeck. A Hitchhiker's Guide to long-read genomic analysis.. Genome research(IF=5.5). 2025. PMID:40228901. DOI: 10.1101/gr.279975.124.
- [13] Erica Sanford Kobayashi;Serge Batalov;Aaron M Wenger;Christine Lambert;Harsharan Dhillon;Richard J Hall;Primo Baybayan;Yan Ding;Seema Rego;Kristen Wigby;Jennifer Friedman;Charlotte Hobbs;Matthew N Bainbridge. Approaches to long-read sequencing in a clinical setting to improve diagnostic rate.. Scientific reports(IF=3.9). 2022. PMID:36210382. DOI: 10.1038/s41598-022-20113-x.
- [14] Ran Xu;Mengmeng Zhang;Xiaoming Yang;Weiming Tian;Changyan Li. Decoding complexity: The role of long-read sequencing in unraveling genetic disease etiologies.. Mutation research. Reviews in mutation research(IF=4.2). 2025. PMID:39788369. DOI: 10.1016/j.mrrev.2024.108529.
- [15] Jeremy R Wang;James Holt;Leonard McMillan;Corbin D Jones. FMLRC: Hybrid long read error correction using an FM-index.. BMC bioinformatics(IF=3.3). 2018. PMID:29426289. DOI: 10.1186/s12859-018-2051-3.
- [16] Alex Di Genova;Gonzalo A Ruz;Marie-France Sagot;Alejandro Maass. Fast-SG: an alignment-free algorithm for hybrid assembly.. GigaScience(IF=3.9). 2018. PMID:29741627. DOI: 10.1093/gigascience/giy048.
- [17] Vasanthan Jayakumar;Yasubumi Sakakibara. Comprehensive evaluation of non-hybrid genome assembly tools for third-generation PacBio long-read sequence data.. Briefings in bioinformatics(IF=7.7). 2019. PMID:29112696. DOI: 10.1093/bib/bbx147.
- [18] Joanna Hård;Jeff E Mold;Jesper Eisfeldt;Christian Tellgren-Roth;Susana Häggqvist;Ignas Bunikis;Orlando Contreras-Lopez;Chen-Shan Chin;Jessica Nordlund;Carl-Johan Rubin;Lars Feuk;Jakob Michaëlsson;Adam Ameur. Long-read whole-genome analysis of human single cells.. Nature communications(IF=15.7). 2023. PMID:37620373. DOI: 10.1038/s41467-023-40898-3.
- [19] Yoshitaka Sakamoto;Liu Xu;Masahide Seki;Toshiyuki T Yokoyama;Masahiro Kasahara;Yukie Kashima;Akihiro Ohashi;Yoko Shimada;Noriko Motoi;Katsuya Tsuchihara;Susumu S Kobayashi;Takashi Kohno;Yuichi Shiraishi;Ayako Suzuki;Yutaka Suzuki. Long-read sequencing for non-small-cell lung cancer genomes.. Genome research(IF=5.5). 2020. PMID:32887687. DOI: 10.1101/gr.261941.120.
- [20] Ghausia Begum;Ammar Albanna;Asma Bankapur;Nasna Nassir;Richa Tambi;Bakhrom K Berdiev;Hosneara Akter;Noushad Karuvantevida;Barbara Kellam;Deena Alhashmi;Wilson W L Sung;Bhooma Thiruvahindrapuram;Alawi Alsheikh-Ali;Stephen W Scherer;Mohammed Uddin. Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome.. International journal of molecular sciences(IF=4.9). 2021. PMID:33669700. DOI: 10.3390/ijms22042060.
- [21] Wouter De Coster;Matthias H Weissensteiner;Fritz J Sedlazeck. Towards population-scale long-read sequencing.. Nature reviews. Genetics(IF=52.0). 2021. PMID:34050336. DOI: 10.1038/s41576-021-00367-3.
- [22] Yoko Kuroki;Atsushi Hattori;Keiko Matsubara;Maki Fukami. Long-read next-generation sequencing for molecular diagnosis of pediatric endocrine disorders.. Annals of pediatric endocrinology & metabolism(IF=3.3). 2024. PMID:38956752. DOI: 10.6065/apem.2448028.014.
- [23] Kez Cleal;Duncan M Baird. Dysgu: efficient structural variant calling using short or long reads.. Nucleic acids research(IF=13.1). 2022. PMID:35100420. DOI: 10.1093/nar/gkac039.
- [24] Jishu Xu;Michaela Hörner;Elena Buena Atienza;Kalaivani Manibarathi;Maike Nagel;Stefan Hauser;Jakob Admard;Nicolas Casadei;Stephan Ossowski;Rebecca Schuele. Long-read RNA-sequencing reveals transcript-specific regulation in human-derived cortical neurons.. Open biology(IF=3.6). 2025. PMID:40735840. DOI: 10.1098/rsob.250200.
- [25] Islam Hamim;Ken-Taro Sekine;Ken Komatsu. How do emerging long-read sequencing technologies function in transforming the plant pathology research landscape?. Plant molecular biology(IF=3.8). 2022. PMID:35962900. DOI: 10.1007/s11103-022-01305-5.
- [26] Hyun Joo Ji;Mihaela Pertea. Enhancing transcriptome expression quantification through accurate assignment of long RNA sequencing reads with TranSigner.. Genome biology(IF=9.4). 2025. PMID:40877901. DOI: 10.1186/s13059-025-03723-2.
- [27] Manuel Tardaguila;Lorena de la Fuente;Cristina Marti;Cécile Pereira;Francisco Jose Pardo-Palacios;Hector Del Risco;Marc Ferrell;Maravillas Mellado;Marissa Macchietto;Kenneth Verheggen;Mariola Edelmann;Iakes Ezkurdia;Jesus Vazquez;Michael Tress;Ali Mortazavi;Lennart Martens;Susana Rodriguez-Navarro;Victoria Moreno-Manzano;Ana Conesa. SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification.. Genome research(IF=5.5). 2018. PMID:29440222. DOI: 10.1101/gr.222976.117.
- [28] David J Wright;Nicola A L Hall;Naomi Irish;Angela L Man;Will Glynn;Arne Mould;Alejandro De Los Angeles;Emily Angiolini;David Swarbreck;Karim Gharbi;Elizabeth M Tunbridge;Wilfried Haerty. Long read sequencing reveals novel isoforms and insights into splicing regulation during cell state changes.. BMC genomics(IF=3.7). 2022. PMID:35012468. DOI: 10.1186/s12864-021-08261-2.
- [29] Matthias Lienhard;Twan van den Beucken;Bernd Timmermann;Myriam Hochradel;Stefan Börno;Florian Caiment;Martin Vingron;Ralf Herwig. IsoTools: a flexible workflow for long-read transcriptome sequencing analysis.. Bioinformatics (Oxford, England)(IF=5.4). 2023. PMID:37267159. DOI: 10.1093/bioinformatics/btad364.
- [30] Dena Leshkowitz;Merav Kedmi;Yael Fried;David Pilzer;Hadas Keren-Shaul;Elena Ainbinder;Bareket Dassa. Exploring differential exon usage via short- and long-read RNA sequencing strategies.. Open biology(IF=3.6). 2022. PMID:36168804. DOI: 10.1098/rsob.220206.
- [31] Alexandra Chera;Mircea Stancu-Cretu;Nicolae Radu Zabet;Octavian Bucur. Shedding light on DNA methylation and its clinical implications: the impact of long-read-based nanopore technology.. Epigenetics & chromatin(IF=3.5). 2024. PMID:39734197. DOI: 10.1186/s13072-024-00558-2.
- [32] Lyndsay Kerr;Ioannis Kafetzopoulos;Ramon Grima;Duncan Sproul. Genome-wide single-molecule analysis of long-read DNA methylation reveals heterogeneous patterns at heterochromatin that reflect nucleosome organisation.. PLoS genetics(IF=3.7). 2023. PMID:37782664. DOI: 10.1371/journal.pgen.1010958.
- [33] Alec Eames;Mahdi Moqri;Jesse R Poganik;Vadim N Gladyshev. Profiling Epigenetic Aging at Cell-Type Resolution Through Long-Read Sequencing.. Aging cell(IF=7.1). 2025. PMID:40599132. DOI: 10.1111/acel.70084.
- [34] Elena-Cristina Găitănaru;Roua Gabriela Popescu;Andreea-Angelica Stroe;Sergiu Emil Georgescu;George Cătălin Marinescu. Exploring Epigenetic Ageing Using Direct Methylome Sequencing.. Epigenomes(IF=3.5). 2025. PMID:40700041. DOI: 10.3390/epigenomes9030025.
- [35] Keisuke Kunigo;Satoi Nagasawa;Keiko Kajiya;Yoshitaka Sakamoto;Suzuko Zaha;Yuta Kuze;Akinori Kanai;Kotaro Nomura;Masahiro Tsuboi;Genichiro Ishii;Ai Motoyoshi;Koichiro Tsugawa;Motohiro Chosokabe;Junki Koike;Ayako Suzuki;Yutaka Suzuki;Masahide Seki. Targeted long-read methylation analysis using hybridization capture suitable for clinical specimens.. Cell reports methods(IF=4.5). 2025. PMID:41187747. DOI: 10.1016/j.crmeth.2025.101215.
- [36] Carolina Montano;Winston Timp. Evolution of genome-wide methylation profiling technologies.. Genome research(IF=5.5). 2025. PMID:40228903. DOI: 10.1101/gr.278407.123.
- [37] Isac Lee;Roham Razaghi;Timothy Gilpatrick;Michael Molnar;Ariel Gershman;Norah Sadowski;Fritz J Sedlazeck;Kasper D Hansen;Jared T Simpson;Winston Timp. Simultaneous profiling of chromatin accessibility and methylation on human cell lines with nanopore sequencing.. Nature methods(IF=32.1). 2020. PMID:33230324. DOI: 10.1038/s41592-020-01000-7.
- [38] Basile Leduque;Alejandro Edera;Clémentine Vitte;Leandro Quadrana. Simultaneous profiling of chromatin accessibility and DNA methylation in complete plant genomes using long-read sequencing.. Nucleic acids research(IF=13.1). 2024. PMID:38676941. DOI: 10.1093/nar/gkae306.
- [39] Mengyao Li;Zhenhuan Jiang;Xueqiang Xu;Xinglong Wu;Yun Liu;Kexuan Chen;Yuhan Liao;Wen Li;Xiao Wang;Yuqing Guo;Bo Zhang;Lu Wen;Kehkooi Kee;Fuchou Tang. Chromatin accessibility landscape of mouse early embryos revealed by single-cell NanoATAC-seq2.. Science (New York, N.Y.)(IF=45.8). 2025. PMID:40146829. DOI: 10.1126/science.adp4319.
- [40] Aaron C Daugherty;Robin W Yeo;Jason D Buenrostro;William J Greenleaf;Anshul Kundaje;Anne Brunet. Chromatin accessibility dynamics reveal novel functional enhancers in C. elegans.. Genome research(IF=5.5). 2017. PMID:29141961. DOI: 10.1101/gr.226233.117.
- [41] Giulia F Del Gobbo;Kym M Boycott. The additional diagnostic yield of long-read sequencing in undiagnosed rare diseases.. Genome research(IF=5.5). 2025. PMID:39900460. DOI: 10.1101/gr.279970.124.
- [42] Laura K Conlin;Erfan Aref-Eshghi;Deborah A McEldrew;Minjie Luo;Ramakrishnan Rajagopalan. Long-read sequencing for molecular diagnostics in constitutional genetic disorders.. Human mutation(IF=3.7). 2022. PMID:36086952. DOI: 10.1002/humu.24465.
- [43] Francesco Kumara Mastrorosa;Danny E Miller;Evan E Eichler. Applications of long-read sequencing to Mendelian genetics.. Genome medicine(IF=11.2). 2023. PMID:37316925. DOI: 10.1186/s13073-023-01194-3.
MaltSci Intelligent Research Services
Search for more papers on MaltSci.com
long-read sequencing · genome assembly · personalized medicine · structural variation · transcriptome analysis
© 2025 MaltSci
