Skip to Main content Skip to Navigation
New interface
Book sections

viroCapt: A Bioinformatics Pipeline for Identifying Viral Insertion in Human Host Genome

Abstract : Introduction: The implication of viruses in human cancers, as well as the emergence of next generation sequencing has permitted to investigate further their role and pathophysiology in the development of this disease. One such mechanism is the integration of portions of viral genomes in the human genome, as well as the specific action of viral oncogenes.inding integration sites and preserved oncogenes is still relying on heavy manual intervention. Methods: We developed an analysis and interpretation pipeline to determine viral insertions. Using data from directed viral capture, the pipeline conducts a crude genotyping phase to select reference viral genomes, identifies chimeric reads, extracts the putative human sequences to locate in the human reference genome, scores and ranks candidate junctions, and exports tabular and visual results. Results: We leverage common bioinformatics tools (bowtie2, samtools, blat), and a dedicated filtering and ranking algorithm, implemented in R, to infer candidate junctions and insertions. Static results (tables, figures) are produced, as well as an interactive interpretation tool developed as a shiny web app. Discussion: We validated this pipeline against published results of HPV, HBV, and AAV2 insertions and show good information retrieval.
Complete list of metadata
Contributor : Jessica Zucman-Rossi Connect in order to contact the contributor
Submitted on : Thursday, September 8, 2022 - 8:54:52 AM
Last modification on : Wednesday, September 28, 2022 - 4:20:12 PM


Publication funded by an institution



Maxime Wack, David Veyer, Camille Peneau, Sonia Lameiras, William Digan, et al.. viroCapt: A Bioinformatics Pipeline for Identifying Viral Insertion in Human Host Genome. Challenges of Trustable AI and Added-Value on Health, 294, IOS Press, pp.834-838, 2022, Studies in Health Technology and Informatics, ⟨10.3233/SHTI220602⟩. ⟨inserm-03772185⟩



Record views


Files downloads