Combining Strengths for Multi-genome Visual Analytics Comparison

Bibliographic Details
Main Authors: Sergio Diaz-del-Pino, Pablo Rodriguez-Brazzarola, Esteban Perez-Wohlfeil, Oswaldo Trelles
Format: Article
Language: English
Published: SAGE Publishing 2019-01-01
Series: Bioinformatics and Biology Insights
Online Access: https://doi.org/10.1177/1177932218825127
Volume: 13
Source: DOAJ
ISSN: 1177-9322
Description: The emergence of data acquisition technologies has shifted the bottleneck in molecular biology research from data acquisition to data analysis. Such is the case in comparative genomics, where sequence analysis has moved from genes to genomes several orders of magnitude larger. This shift has revealed the need to adapt software to handle very large experiments efficiently and to incorporate new data-analysis strategies for managing the results of such studies. In previous work, we presented GECKO, a software tool for comparing large sequences; here we address the representation, browsing, exploration, and post-processing of the massive amount of information derived from such comparisons. GECKO-MGV is a web-based application organized as a client-server architecture. It is aimed at visual analysis of results from both pairwise and multiple sequence comparison studies, combining a set of common commands for image exploration with improved state-of-the-art solutions. In addition, GECKO-MGV integrates different visualization analysis tools while exploiting the concept of layers to display multiple genome comparison datasets. Moreover, the software can contact external proprietary and third-party services for further data post-processing, and it presents a method to display a timeline of large-scale evolutionary events. As a proof of concept, we present 2 exercises using bacterial and mammalian genomes that demonstrate the capabilities of GECKO-MGV to perform in-depth, customizable analyses on the fly using web technologies. The first exercise is mainly descriptive and is carried out over bacterial genomes, whereas the second aims to show the ability to deal with large sequence comparisons. In this case, we display results from the comparison of the first Homo sapiens chromosome against the first 5 chromosomes of Mus musculus.