ChemWARD : extracting chemical structure from printed diagrams

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009. === Cataloged from PDF version of thesis. === Includes bibliographical references (p. 117-118). === Over the years, a vast amount of literature in the field of chemistry has accumulat...

Full description

Bibliographic Details
Main Author: Moscicki, Angelique (Angelique E.)
Other Authors: Randall Davis.
Format: Others
Language:English
Published: Massachusetts Institute of Technology 2011
Subjects:
Online Access:http://hdl.handle.net/1721.1/61299
id ndltd-MIT-oai-dspace.mit.edu-1721.1-61299
record_format oai_dc
spelling ndltd-MIT-oai-dspace.mit.edu-1721.1-612992019-05-02T15:51:39Z ChemWARD : extracting chemical structure from printed diagrams Extracting chemical structure from printed diagrams Moscicki, Angelique (Angelique E.) Randall Davis. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009. Cataloged from PDF version of thesis. Includes bibliographical references (p. 117-118). Over the years, a vast amount of literature in the field of chemistry has accumulated, and searching for documents about specific molecules is a formidable task. To the extent that the literature is textual, services like Google enable relatively easy search. While search indexes like Google are very good at finding such things, its difficult to describe molecules completely using text because text can't easily indicate molecular structure, and molecular structure defines chemical properties. ChemWARD is a system that extracts the molecular structure from the printed diagrams that are ubiquitous in chemistry literature and converts them to a machine readable format in order to allow chemists to search the literature by drawing a molecular structure instead of typing a chemical formula. We describe the architecture of the system and report on its performance, demonstrating its ability to achieve an overall accuracy rate of 85.5% on printed diagrams extracted from published chemical literature. by Angelique Moscicki. M.Eng. 2011-02-23T15:01:05Z 2011-02-23T15:01:05Z 2009 2009 Thesis http://hdl.handle.net/1721.1/61299 702651513 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 118 p. application/pdf Massachusetts Institute of Technology
collection NDLTD
language English
format Others
sources NDLTD
topic Electrical Engineering and Computer Science.
spellingShingle Electrical Engineering and Computer Science.
Moscicki, Angelique (Angelique E.)
ChemWARD : extracting chemical structure from printed diagrams
description Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009. === Cataloged from PDF version of thesis. === Includes bibliographical references (p. 117-118). === Over the years, a vast amount of literature in the field of chemistry has accumulated, and searching for documents about specific molecules is a formidable task. To the extent that the literature is textual, services like Google enable relatively easy search. While search indexes like Google are very good at finding such things, its difficult to describe molecules completely using text because text can't easily indicate molecular structure, and molecular structure defines chemical properties. ChemWARD is a system that extracts the molecular structure from the printed diagrams that are ubiquitous in chemistry literature and converts them to a machine readable format in order to allow chemists to search the literature by drawing a molecular structure instead of typing a chemical formula. We describe the architecture of the system and report on its performance, demonstrating its ability to achieve an overall accuracy rate of 85.5% on printed diagrams extracted from published chemical literature. === by Angelique Moscicki. === M.Eng.
author2 Randall Davis.
author_facet Randall Davis.
Moscicki, Angelique (Angelique E.)
author Moscicki, Angelique (Angelique E.)
author_sort Moscicki, Angelique (Angelique E.)
title ChemWARD : extracting chemical structure from printed diagrams
title_short ChemWARD : extracting chemical structure from printed diagrams
title_full ChemWARD : extracting chemical structure from printed diagrams
title_fullStr ChemWARD : extracting chemical structure from printed diagrams
title_full_unstemmed ChemWARD : extracting chemical structure from printed diagrams
title_sort chemward : extracting chemical structure from printed diagrams
publisher Massachusetts Institute of Technology
publishDate 2011
url http://hdl.handle.net/1721.1/61299
work_keys_str_mv AT moscickiangeliqueangeliquee chemwardextractingchemicalstructurefromprinteddiagrams
AT moscickiangeliqueangeliquee extractingchemicalstructurefromprinteddiagrams
_version_ 1719030180250386432