-
PDF
- Split View
-
Views
-
Cite
Cite
Minghao Li, Xuaoyu Guo, Jin Zhao, VirDiG: a de novo transcriptome assembler for coronavirus, Bioinformatics Advances, 2025;, vbaf075, https://doi.org/10.1093/bioadv/vbaf075
- Share Icon Share
Abstract
The discontinuous transcription mechanism of coronaviruses contributes to their adaptation to different host environments and plays a critical role in their lifecycle. Accurate assembly of coronavirus transcripts is vital for understanding the virus’s biological traits and developing precise prevention and treatment strategies. However, existing de novo assembly algorithms are primarily designed for alternative splicing events in eukaryotes and are not suitable for assembling coronavirus transcriptome, which consists of both genomic RNA and subgenomic mRNAs. Coronavirus transcriptome reconstruction from short reads remains a challenging problem.
In this work, we present VirDiG, a de novo transcriptome assembler specifically designed for coronaviruses. VirDiG utilizes a discontinuous graph to facilitate accurate transcript assembly by incorporating information from paired-end reads, sequence depth, and start and stop codons. Experimental results from both simulated and real datasets show that VirDiG exhibits significant advantages in reconstructing the transcriptome of coronaviruses when compared to traditional de novo assemblers tailored for classical eukaryotic transcriptome assembly.
VirDiG is freely available at https://github.com/Limh616/VirDiG.git.