Automatic Transcription of Historical Documents : Transkribus as a Tool for Libraries, Archives and Scholars

Digital libraries and archives are major portals to rich sources of information. They undertake large-scale digitization to enhance their digital collections and offer users valuable text data. When it comes to handwritten documents, usually these are only provided as digitized images and not accomp...

Full description

Bibliographic Details
Main Author: Milioni, Nikolina
Format: Others
Language:English
Published: Uppsala universitet, Institutionen för ABM 2020
Subjects:
HTR
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-412565
id ndltd-UPSALLA1-oai-DiVA.org-uu-412565
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-uu-4125652020-06-24T03:32:33ZAutomatic Transcription of Historical Documents : Transkribus as a Tool for Libraries, Archives and ScholarsengMilioni, NikolinaUppsala universitet, Institutionen för ABM2020Digital HumanitiesHTRHistorical DocumentsDigital ArchivesAutomatic TranscriptionTranskribusOther Humanities not elsewhere specifiedÖvrig annan humanioraDigital libraries and archives are major portals to rich sources of information. They undertake large-scale digitization to enhance their digital collections and offer users valuable text data. When it comes to handwritten documents, usually these are only provided as digitized images and not accompanied by their transcriptions. Text in non-machine-readable format restricts contemporary scholars to conduct research, especially by employing digital humanities approaches, such as distant reading and data mining. The purpose of this thesis is to evaluate Transkribus platform as a linguistic tool mainly developed for producing automatic transcriptions of handwritten documents. The results are correlated with the findings of a questionnaire distributed to libraries and archives across Europe to expand our knowledge on the policy they follow regarding manuscripts and transcription provision. A model for a specific writing style in Latin language is trained and the accuracy on various Latin handwritten pages is tested. Finally, the tool’s validation is discussed, as well as to what extent it meets the general needs of the cultural heritage institutions and of humanities scholars. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-412565Local 3Theses within Digital Humanities ; 3application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Digital Humanities
HTR
Historical Documents
Digital Archives
Automatic Transcription
Transkribus
Other Humanities not elsewhere specified
Övrig annan humaniora
spellingShingle Digital Humanities
HTR
Historical Documents
Digital Archives
Automatic Transcription
Transkribus
Other Humanities not elsewhere specified
Övrig annan humaniora
Milioni, Nikolina
Automatic Transcription of Historical Documents : Transkribus as a Tool for Libraries, Archives and Scholars
description Digital libraries and archives are major portals to rich sources of information. They undertake large-scale digitization to enhance their digital collections and offer users valuable text data. When it comes to handwritten documents, usually these are only provided as digitized images and not accompanied by their transcriptions. Text in non-machine-readable format restricts contemporary scholars to conduct research, especially by employing digital humanities approaches, such as distant reading and data mining. The purpose of this thesis is to evaluate Transkribus platform as a linguistic tool mainly developed for producing automatic transcriptions of handwritten documents. The results are correlated with the findings of a questionnaire distributed to libraries and archives across Europe to expand our knowledge on the policy they follow regarding manuscripts and transcription provision. A model for a specific writing style in Latin language is trained and the accuracy on various Latin handwritten pages is tested. Finally, the tool’s validation is discussed, as well as to what extent it meets the general needs of the cultural heritage institutions and of humanities scholars.
author Milioni, Nikolina
author_facet Milioni, Nikolina
author_sort Milioni, Nikolina
title Automatic Transcription of Historical Documents : Transkribus as a Tool for Libraries, Archives and Scholars
title_short Automatic Transcription of Historical Documents : Transkribus as a Tool for Libraries, Archives and Scholars
title_full Automatic Transcription of Historical Documents : Transkribus as a Tool for Libraries, Archives and Scholars
title_fullStr Automatic Transcription of Historical Documents : Transkribus as a Tool for Libraries, Archives and Scholars
title_full_unstemmed Automatic Transcription of Historical Documents : Transkribus as a Tool for Libraries, Archives and Scholars
title_sort automatic transcription of historical documents : transkribus as a tool for libraries, archives and scholars
publisher Uppsala universitet, Institutionen för ABM
publishDate 2020
url http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-412565
work_keys_str_mv AT milioninikolina automatictranscriptionofhistoricaldocumentstranskribusasatoolforlibrariesarchivesandscholars
_version_ 1719323686207488000