Automation of functional annotation of genomes and transcriptomes

Cadavid Gutiérrez, Luis Fernando; Pérez Castillo, José Nelson; Rojas Quintero, Cristian Alejandro; Vera Parra, Nelson Enrique

Automation of functional annotation of genomes and transcriptomes

Autores

Cadavid Gutiérrez, Luis Fernando

Pérez Castillo, José Nelson

Rojas Quintero, Cristian Alejandro

Vera Parra, Nelson Enrique

Editor

Universidad Distrital Francisco José de Caldas. Colombia

Compartir

Altmetric

Descripción

Functional annotation represents a means to investigate and classify genes and transcripts according to their function within a given organism.This paper presents Massive Automatic Functional Annotation (MAFA - Web), which is an online free bioinformatics tool that allows automation, unification and optimization of functional annotation processes when dealing with large volumes of sequences. MAFA includes tools for categorization and statistical analysis of associations between sequences. We have evaluated the performance of MAFA with a set of data taken from Diploria-Strigosatranscriptome (using an 8-core computer, namely E7450 @ 2,40GHZ with 256GB RAM), processing rates of 2,7 seconds per sequence (using Uniprot database) and 50,0 seconds per sequence (using Non-redundant from NCBI database) were found together with particular RAM usage patterns that depend on the database being processed (1GB for Uniprot database and 9GB for Non-redundant database).. Aviability: https://github.com/BioinfUD/MAFA.
Functional annotation represents a means to investigate and classify genes and transcripts according to their function within a given organism.This paper presents Massive Automatic Functional Annotation (MAFA - Web), which is an online free bioinformatics tool that allows automation, unification and optimization of functional annotation processes when dealing with large volumes of sequences. MAFA includes tools for categorization and statistical analysis of associations between sequences. We have evaluated the performance of MAFA with a set of data taken from Diploria-Strigosatranscriptome (using an 8-core computer, namely E7450 @ 2,40GHZ with 256GB RAM), processing rates of 2,7 seconds per sequence (using Uniprot database) and 50,0 seconds per sequence (using Non-redundant from NCBI database) were found together with particular RAM usage patterns that depend on the database being processed (1GB for Uniprot database and 9GB for Non-redundant database). Aviability: https://github.com/BioinfUD/MAFA.

Palabras clave

Annotator, Functional annotation, Gene ontology, High Throughput Sequencing., Annotator, Functional annotation, Gene ontology, High Throughput Sequencing.

URI

http://hdl.handle.net/11349/20854

Colecciones

Tecnura

Página completa del ítem

Automation of functional annotation of genomes and transcriptomes

Fecha

Autores

Autor corporativo

Título de la revista

ISSN de la revista

Título del volumen

Editor

Compartir

Director

Altmetric

Resumen

Descripción

Palabras clave

Citación

URI

Colecciones