Extracting Illustrated Pages from Digital Libraries with Python

Machine learning and API extensions by HathiTrust and Internet Archive are making it easier to extract page regions of visual interest from digitized volumes. This lesson shows how to efficiently extract those regions and, in doing so, prompt new, visual research questions.

Bibliographic Details
Main Author: Stephen Krewson
Format: Article
Language: English
Published: Editorial Board of the Programming Historian 2019-01-01
Series: The Programming Historian
Online Access: https://programminghistorian.org/en/lessons/extracting-illustrated-pages
id doaj-5b068a3babfc4308b09b158123f99dab
record_format Article
spelling doaj-5b068a3babfc4308b09b158123f99dab | 2020-11-25T00:27:03Z | eng | Editorial Board of the Programming Historian | The Programming Historian | 2397-2068 | 2397-2068 | 2019-01-01 | 8 | Extracting Illustrated Pages from Digital Libraries with Python | Stephen Krewson | 0 | Yale University | Machine learning and API extensions by HathiTrust and Internet Archive are making it easier to extract page regions of visual interest from digitized volumes. This lesson shows how to efficiently extract those regions and, in doing so, prompt new, visual research questions. | https://programminghistorian.org/en/lessons/extracting-illustrated-pages
collection DOAJ
language English
format Article
sources DOAJ
author Stephen Krewson
title Extracting Illustrated Pages from Digital Libraries with Python
publisher Editorial Board of the Programming Historian
series The Programming Historian
issn 2397-2068
publishDate 2019-01-01
description Machine learning and API extensions by HathiTrust and Internet Archive are making it easier to extract page regions of visual interest from digitized volumes. This lesson shows how to efficiently extract those regions and, in doing so, prompt new, visual research questions.
url https://programminghistorian.org/en/lessons/extracting-illustrated-pages