DCMS: A Data Analytics and Management System for Molecular Simulation

Although molecular simulation (MS) systems are a major research tool in many scientific and engineering fields, systems for effective data management and fast data retrieval and processing are still lacking. This is mainly due to the nature of MS, which generates a very large...


Bibliographic Details
Main Author: Berrada, Meryem
Format: Others
Published: Scholar Commons 2015
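Subjects: Scientific database; Molecular Dynamics; Big Data; Quadtree; SP-GIST; Computer Engineering; Computer Sciences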
Online Access: https://scholarcommons.usf.edu/etd/5453
https://scholarcommons.usf.edu/cgi/viewcontent.cgi?article=6647&context=etd
id ndltd-USF-oai-scholarcommons.usf.edu-etd-6647
record_format oai_dc
spelling ndltd-USF-oai-scholarcommons.usf.edu-etd-6647 2019-10-04T05:09:13Z 2015-03-16T07:00:00Z text application/pdf default Graduate Theses and Dissertations Scholar Commons
collection NDLTD
format Others
sources NDLTD
topic Scientific database
Molecular Dynamics
Big Data
Quadtree
SP-GIST
Computer Engineering
Computer Sciences
description Although molecular simulation (MS) systems are a major research tool in many scientific and engineering fields, systems for effective data management and fast data retrieval and processing are still lacking. This is mainly because MS generates very large amounts of data: a simulated system usually encompasses millions of data items, and a single query usually runs over tens of thousands of time frames. To address this, we designed and developed a new application, DCMS (A Data Analytics and Management System for Molecular Simulation), intended to speed up discovery in the medical and physics fields. DCMS stores simulation data in a database and provides a user-friendly interface to upload, retrieve, query, and analyze MS data without having to handle any raw data. We also created a new indexing scheme, the Time-Parameterized Spatial (TPS) tree, which accelerates query processing through indexes that exploit the locality relationships between atoms. The tree was implemented directly inside the PostgreSQL kernel, on top of the SP-GiST platform. Along with the tree, two new data types were defined, as well as new algorithms for five data-point retrieval queries.
author Berrada, Meryem
title DCMS: A Data Analytics and Management System for Molecular Simulation
publisher Scholar Commons
publishDate 2015
url https://scholarcommons.usf.edu/etd/5453
https://scholarcommons.usf.edu/cgi/viewcontent.cgi?article=6647&context=etd