WIND (Workflow for pIRNAs aNd beyonD): a strategy for in-depth analysis of small RNA-seq data [version 3; peer review: 2 approved]

Current bioinformatics workflows for PIWI-interacting RNA (piRNA) analysis focus primarily on germline-derived piRNAs and piRNA-clusters. Frequently, they suffer from outdated piRNA databases, questionable quantification methods, and lack of reproducibility. Often, pipelines specific to miRNA analys...

Full description

Bibliographic Details
Main Authors: Konstantinos Geles, Domenico Palumbo, Assunta Sellitto, Giorgio Giurato, Eleonora Cianflone, Fabiola Marino, Daniele Torella, Valeria Mirici Cappa, Giovanni Nassa, Roberta Tarallo, Alessandro Weisz, Francesca Rizzo
Format: Article
Language:English
Published: F1000 Research Ltd 2021-07-01
Series:F1000Research
Online Access:https://f1000research.com/articles/10-1/v3
id doaj-dd1edb0bdde94f56b64faa62aaef6b63
record_format Article
collection DOAJ
language English
format Article
sources DOAJ
author Konstantinos Geles
Domenico Palumbo
Assunta Sellitto
Giorgio Giurato
Eleonora Cianflone
Fabiola Marino
Daniele Torella
Valeria Mirici Cappa
Giovanni Nassa
Roberta Tarallo
Alessandro Weisz
Francesca Rizzo
spellingShingle Konstantinos Geles
Domenico Palumbo
Assunta Sellitto
Giorgio Giurato
Eleonora Cianflone
Fabiola Marino
Daniele Torella
Valeria Mirici Cappa
Giovanni Nassa
Roberta Tarallo
Alessandro Weisz
Francesca Rizzo
WIND (Workflow for pIRNAs aNd beyonD): a strategy for in-depth analysis of small RNA-seq data [version 3; peer review: 2 approved]
F1000Research
author_facet Konstantinos Geles
Domenico Palumbo
Assunta Sellitto
Giorgio Giurato
Eleonora Cianflone
Fabiola Marino
Daniele Torella
Valeria Mirici Cappa
Giovanni Nassa
Roberta Tarallo
Alessandro Weisz
Francesca Rizzo
author_sort Konstantinos Geles
title WIND (Workflow for pIRNAs aNd beyonD): a strategy for in-depth analysis of small RNA-seq data [version 3; peer review: 2 approved]
title_short WIND (Workflow for pIRNAs aNd beyonD): a strategy for in-depth analysis of small RNA-seq data [version 3; peer review: 2 approved]
title_full WIND (Workflow for pIRNAs aNd beyonD): a strategy for in-depth analysis of small RNA-seq data [version 3; peer review: 2 approved]
title_fullStr WIND (Workflow for pIRNAs aNd beyonD): a strategy for in-depth analysis of small RNA-seq data [version 3; peer review: 2 approved]
title_full_unstemmed WIND (Workflow for pIRNAs aNd beyonD): a strategy for in-depth analysis of small RNA-seq data [version 3; peer review: 2 approved]
title_sort wind (workflow for pirnas and beyond): a strategy for in-depth analysis of small rna-seq data [version 3; peer review: 2 approved]
publisher F1000 Research Ltd
series F1000Research
issn 2046-1402
publishDate 2021-07-01
description Current bioinformatics workflows for PIWI-interacting RNA (piRNA) analysis focus primarily on germline-derived piRNAs and piRNA-clusters. Frequently, they suffer from outdated piRNA databases, questionable quantification methods, and lack of reproducibility. Often, pipelines specific to miRNA analysis are used for the piRNA research in silico. Furthermore, the absence of a well-established database for piRNA annotation, as for miRNA, leads to uniformity issues between studies and generates confusion for data analysts and biologists. For these reasons, we have developed WIND (Workflow for pIRNAs aNd beyonD), a bioinformatics workflow that addresses the crucial issue of piRNA annotation, thereby allowing a reliable analysis of small RNA sequencing data for the identification of piRNAs and other small non-coding RNAs (sncRNAs) that in the past have been incorrectly classified as piRNAs. WIND allows the creation of a comprehensive annotation track of sncRNAs combining information available in RNAcentral, with piRNA sequences from piRNABank, the first database dedicated to piRNA annotation. WIND was built with Docker containers for reproducibility and integrates widely used bioinformatics tools for sequence alignment and quantification. In addition, it includes Bioconductor packages for exploratory data and differential expression analysis. Moreover, WIND implements a "dual" approach for the evaluation of sncRNAs expression level quantifying the aligned reads to the annotated genome and carrying out an alignment-free transcript quantification using reads mapped to the transcriptome. Therefore, a broader range of piRNAs can be annotated, improving their quantification and easing the subsequent downstream analysis. WIND performance has been tested with several small RNA-seq datasets, demonstrating how our approach can be a useful and comprehensive resource to analyse piRNAs and other classes of sncRNAs.
url https://f1000research.com/articles/10-1/v3
work_keys_str_mv AT konstantinosgeles windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT domenicopalumbo windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT assuntasellitto windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT giorgiogiurato windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT eleonoracianflone windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT fabiolamarino windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT danieletorella windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT valeriamiricicappa windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT giovanninassa windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT robertatarallo windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT alessandroweisz windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
AT francescarizzo windworkflowforpirnasandbeyondastrategyforindepthanalysisofsmallrnaseqdataversion3peerreview2approved
_version_ 1721295150969257984
spelling doaj-dd1edb0bdde94f56b64faa62aaef6b632021-07-19T09:29:35ZengF1000 Research LtdF1000Research2046-14022021-07-011010.12688/f1000research.27868.358313WIND (Workflow for pIRNAs aNd beyonD): a strategy for in-depth analysis of small RNA-seq data [version 3; peer review: 2 approved]Konstantinos Geles0Domenico Palumbo1Assunta Sellitto2Giorgio Giurato3Eleonora Cianflone4Fabiola Marino5Daniele Torella6Valeria Mirici Cappa7Giovanni Nassa8Roberta Tarallo9Alessandro Weisz10Francesca Rizzo11Genomix4Life, via S. Allende 43/L, Baronissi, Salerno (SA), 84081, ItalyLaboratory of Molecular Medicine and Genomics, Department of Medicine, Surgery and Dentistry ‘Scuola Medica Salernitana’, University of Salerno, Baronissi, Salerno (SA), 84081, ItalyLaboratory of Molecular Medicine and Genomics, Department of Medicine, Surgery and Dentistry ‘Scuola Medica Salernitana’, University of Salerno, Baronissi, Salerno (SA), 84081, ItalyLaboratory of Molecular Medicine and Genomics, Department of Medicine, Surgery and Dentistry ‘Scuola Medica Salernitana’, University of Salerno, Baronissi, Salerno (SA), 84081, ItalyDepartment of Medical and Surgical Sciences, Magna Graecia University, Viale Europa, Catanzaro, 88100, ItalyDepartment of Experimental and Clinical Medicine, Molecular and Cellular Cardiology, Magna Graecia University, Viale Europa, Catanzaro, 88100, ItalyDepartment of Experimental and Clinical Medicine, Molecular and Cellular Cardiology, Magna Graecia University, Viale Europa, Catanzaro, 88100, ItalyLaboratory of Molecular Medicine and Genomics, Department of Medicine, Surgery and Dentistry ‘Scuola Medica Salernitana’, University of Salerno, Baronissi, Salerno (SA), 84081, ItalyGenomix4Life, via S. Allende 43/L, Baronissi, Salerno (SA), 84081, ItalyLaboratory of Molecular Medicine and Genomics, Department of Medicine, Surgery and Dentistry ‘Scuola Medica Salernitana’, University of Salerno, Baronissi, Salerno (SA), 84081, ItalyLaboratory of Molecular Medicine and Genomics, Department of Medicine, Surgery and Dentistry ‘Scuola Medica Salernitana’, University of Salerno, Baronissi, Salerno (SA), 84081, ItalyLaboratory of Molecular Medicine and Genomics, Department of Medicine, Surgery and Dentistry ‘Scuola Medica Salernitana’, University of Salerno, Baronissi, Salerno (SA), 84081, ItalyCurrent bioinformatics workflows for PIWI-interacting RNA (piRNA) analysis focus primarily on germline-derived piRNAs and piRNA-clusters. Frequently, they suffer from outdated piRNA databases, questionable quantification methods, and lack of reproducibility. Often, pipelines specific to miRNA analysis are used for the piRNA research in silico. Furthermore, the absence of a well-established database for piRNA annotation, as for miRNA, leads to uniformity issues between studies and generates confusion for data analysts and biologists. For these reasons, we have developed WIND (Workflow for pIRNAs aNd beyonD), a bioinformatics workflow that addresses the crucial issue of piRNA annotation, thereby allowing a reliable analysis of small RNA sequencing data for the identification of piRNAs and other small non-coding RNAs (sncRNAs) that in the past have been incorrectly classified as piRNAs. WIND allows the creation of a comprehensive annotation track of sncRNAs combining information available in RNAcentral, with piRNA sequences from piRNABank, the first database dedicated to piRNA annotation. WIND was built with Docker containers for reproducibility and integrates widely used bioinformatics tools for sequence alignment and quantification. In addition, it includes Bioconductor packages for exploratory data and differential expression analysis. Moreover, WIND implements a "dual" approach for the evaluation of sncRNAs expression level quantifying the aligned reads to the annotated genome and carrying out an alignment-free transcript quantification using reads mapped to the transcriptome. Therefore, a broader range of piRNAs can be annotated, improving their quantification and easing the subsequent downstream analysis. WIND performance has been tested with several small RNA-seq datasets, demonstrating how our approach can be a useful and comprehensive resource to analyse piRNAs and other classes of sncRNAs.https://f1000research.com/articles/10-1/v3