Motivation MicroRNAs (miRNAs) are small RNA molecules (∼22 nucleotide long) involved in post-transcriptional gene regulation. Advances in high-throughput sequencing technologies led to the discovery of isomiRs, which are miRNA sequence variants. While many miRNA-seq analysis tools exist, the diversity of output formats hinders accurate comparisons between tools and precludes data sharing and the development of common downstream analysis methods. Results To overcome this situation, we present here a community-based project, miRNA Transcriptomic Open Project (miRTOP) working towards the optimization of miRNA analyses. The aim of miRTOP is to promote the development of downstream isomiR analysis tools that are compatible with existing detection and quantification tools. Based on the existing GFF3 format, we first created a new standard format, mirGFF3, for the output of miRNA/isomiR detection and quantification results from small RNA-seq data. Additionally, we developed a command line Python tool, mirtop, to create and manage the mirGFF3 format. Currently, mirtop can convert into mirGFF3 the outputs of commonly used pipelines, such as seqbuster, isomiR-SEA, sRNAbench, Prost! as well as BAM files. Some tools have also incorporated the mirGFF3 format directly into their code, such as, miRge2.0, IsoMIRmap and OptimiR. Its open architecture enables any tool or pipeline to output or convert results into mirGFF3. Collectively, this isomiR categorization system, along with the accompanying mirGFF3 and mirtop API, provide a comprehensive solution for the standardization of miRNA and isomiR annotation, enabling data sharing, reporting, comparative analyses and benchmarking, while promoting the development of common miRNA methods focusing on downstream steps of miRNA detection, annotation and quantification. Availability and implementation https://github.com/miRTop/mirGFF3/ and https://github.com/miRTop/mirtop.

Unification of miRNA and isomiR research: the mirGFF3 format and the mirtop API / Desvignes, Thomas; Loher, Phillipe; Eilbeck, Karen; Ma, Jeffery; Urgese, Gianvito; Fromm, Bastian; Sydes, Jason; Aparicio-Puerta, Ernesto; Barrera, Victor; Espin, Roderic; Londin, Eric; Telonis, Aristeidis G; Ficarra, Elisa; Friedlander, Marc R; Postlethwait, John H; Rigoutsos, Isidore; Hackenberg, Michael; Vlachos, Ioannis S; Halushka, Marc K.; Pantano, Lorena. - In: BIOINFORMATICS. - ISSN 1367-4811. - ELETTRONICO. - 36:3(2019), pp. 698-703. [10.1093/bioinformatics/btz675]

Unification of miRNA and isomiR research: the mirGFF3 format and the mirtop API

Urgese, Gianvito;Ficarra, Elisa;
2019

Abstract

Motivation MicroRNAs (miRNAs) are small RNA molecules (∼22 nucleotide long) involved in post-transcriptional gene regulation. Advances in high-throughput sequencing technologies led to the discovery of isomiRs, which are miRNA sequence variants. While many miRNA-seq analysis tools exist, the diversity of output formats hinders accurate comparisons between tools and precludes data sharing and the development of common downstream analysis methods. Results To overcome this situation, we present here a community-based project, miRNA Transcriptomic Open Project (miRTOP) working towards the optimization of miRNA analyses. The aim of miRTOP is to promote the development of downstream isomiR analysis tools that are compatible with existing detection and quantification tools. Based on the existing GFF3 format, we first created a new standard format, mirGFF3, for the output of miRNA/isomiR detection and quantification results from small RNA-seq data. Additionally, we developed a command line Python tool, mirtop, to create and manage the mirGFF3 format. Currently, mirtop can convert into mirGFF3 the outputs of commonly used pipelines, such as seqbuster, isomiR-SEA, sRNAbench, Prost! as well as BAM files. Some tools have also incorporated the mirGFF3 format directly into their code, such as, miRge2.0, IsoMIRmap and OptimiR. Its open architecture enables any tool or pipeline to output or convert results into mirGFF3. Collectively, this isomiR categorization system, along with the accompanying mirGFF3 and mirtop API, provide a comprehensive solution for the standardization of miRNA and isomiR annotation, enabling data sharing, reporting, comparative analyses and benchmarking, while promoting the development of common miRNA methods focusing on downstream steps of miRNA detection, annotation and quantification. Availability and implementation https://github.com/miRTop/mirGFF3/ and https://github.com/miRTop/mirtop.
File in questo prodotto:
File Dimensione Formato  
btz675.pdf

non disponibili

Descrizione: Articolo principale
Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 694.36 kB
Formato Adobe PDF
694.36 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
505222v1_pre_print.pdf

accesso aperto

Descrizione: Articolo principale Preprint
Tipologia: 1. Preprint / submitted version [pre- review]
Licenza: Creative commons
Dimensione 670.5 kB
Formato Adobe PDF
670.5 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2796252