Discovering Implant Terms in Medical Records

Implant terms are terms like "pacemaker" which indicate the presence of artifacts in the body of a human. These implant terms are key to determining if a patient can safely undergo Magnetic Resonance Imaging (MRI). However, to identify these terms in medical records is time-consuming, labo...

Full description

Bibliographic Details
Main Author: Jerdhaf, Oskar
Format: Others
Language:English
Published: Linköpings universitet, Institutionen för datavetenskap 2021
Subjects:
AI
EMR
NER
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-181885
id ndltd-UPSALLA1-oai-DiVA.org-liu-181885
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-liu-1818852021-12-18T05:49:54ZDiscovering Implant Terms in Medical RecordsengJerdhaf, OskarLinköpings universitet, Institutionen för datavetenskap2021AIMachine LearningMedical RecordsPatient RecordsMedicalRecordElectronic RecordsElectronic Medical RecordsBERTEMRImplant TermsImplantsTermTermsTerm DiscoveryArtificial IntelligenceWordSimilarityWord Similarityword-similarityembeddingsword embeddingsword-embeddingstransformersKDTREEBALLTREENERAIArtificiel IntelligensMaskininlärningPatient JournalMedicinsk JournalElektronisk Medicinsk JournalTermerBERTKDTREEBALLTREENERliknande ordtransformersEMRLanguage Technology (Computational Linguistics)Språkteknologi (språkvetenskaplig databehandling)Implant terms are terms like "pacemaker" which indicate the presence of artifacts in the body of a human. These implant terms are key to determining if a patient can safely undergo Magnetic Resonance Imaging (MRI). However, to identify these terms in medical records is time-consuming, laborious and expensive, but necessary for taking the correct precautions before an MRI scan. Automating this process is of great interest to radiologists as it ideally saves time, prevents mistakes and as a result saves lives. The electronic medical records (EMR) contain the documented medical history of a patient, including any implants or objects that an individual would have inside their body. Information about such objects and implants are of great interest when determining if and how a patient can be scanned using MRI. This information is unfortunately not easily extracted through automatic means. Due to their sparse presence and the unusual structure of medical records compared to most written text, makes it very difficult to automate using simple means. By leveraging the recent advancements in Artificial Intelligence (AI), this thesis explores the ability to identify and extract such terms automatically in Swedish EMRs. For the task of identifying implant terms in medical records a generally trained Swedish Bidirectional Encoder Representations from Transformers (BERT) model is used, which is then fine-tuned on Swedish medical records. Using this model a variety of approaches are explored two of which will be covered in this thesis. Using this model a variety of approaches are explored, namely BERT-KDTree, BERT-BallTree, Cosine Brute Force and unsupervised NER. The results show that BERT-KDTree and BERT-BallTree are the most rewarding methods. Results from both methods have been evaluated by domain experts and appear promising for such an early stage, given the difficulty of the task. The evaluation of BERT-BallTree shows that multiple methods of extraction may be preferable as they provide different but still useful terms. Cosine brute force is deemed to be an unrealistic approach due to computational and memory requirements. The NER approach was deemed too impractical and laborious to justify for this study, yet is potentially useful if not more suitable given a different set of conditions and goals. While there is much to be explored and improved, these experiments are a clear indication that automatic identification of implant terms is possible, as a large number of implant terms were successfully discovered using automated means. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-181885application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic AI
Machine Learning
Medical Records
Patient Records
Medical
Record
Electronic Records
Electronic Medical Records
BERT
EMR
Implant Terms
Implants
Term
Terms
Term Discovery
Artificial Intelligence
Word
Similarity
Word Similarity
word-similarity
embeddings
word embeddings
word-embeddings
transformers
KDTREE
BALLTREE
NER
AI
Artificiel Intelligens
Maskininlärning
Patient Journal
Medicinsk Journal
Elektronisk Medicinsk Journal
Termer
BERT
KDTREE
BALLTREE
NER
liknande ord
transformers
EMR
Language Technology (Computational Linguistics)
Språkteknologi (språkvetenskaplig databehandling)
spellingShingle AI
Machine Learning
Medical Records
Patient Records
Medical
Record
Electronic Records
Electronic Medical Records
BERT
EMR
Implant Terms
Implants
Term
Terms
Term Discovery
Artificial Intelligence
Word
Similarity
Word Similarity
word-similarity
embeddings
word embeddings
word-embeddings
transformers
KDTREE
BALLTREE
NER
AI
Artificiel Intelligens
Maskininlärning
Patient Journal
Medicinsk Journal
Elektronisk Medicinsk Journal
Termer
BERT
KDTREE
BALLTREE
NER
liknande ord
transformers
EMR
Language Technology (Computational Linguistics)
Språkteknologi (språkvetenskaplig databehandling)
Jerdhaf, Oskar
Discovering Implant Terms in Medical Records
description Implant terms are terms like "pacemaker" which indicate the presence of artifacts in the body of a human. These implant terms are key to determining if a patient can safely undergo Magnetic Resonance Imaging (MRI). However, to identify these terms in medical records is time-consuming, laborious and expensive, but necessary for taking the correct precautions before an MRI scan. Automating this process is of great interest to radiologists as it ideally saves time, prevents mistakes and as a result saves lives. The electronic medical records (EMR) contain the documented medical history of a patient, including any implants or objects that an individual would have inside their body. Information about such objects and implants are of great interest when determining if and how a patient can be scanned using MRI. This information is unfortunately not easily extracted through automatic means. Due to their sparse presence and the unusual structure of medical records compared to most written text, makes it very difficult to automate using simple means. By leveraging the recent advancements in Artificial Intelligence (AI), this thesis explores the ability to identify and extract such terms automatically in Swedish EMRs. For the task of identifying implant terms in medical records a generally trained Swedish Bidirectional Encoder Representations from Transformers (BERT) model is used, which is then fine-tuned on Swedish medical records. Using this model a variety of approaches are explored two of which will be covered in this thesis. Using this model a variety of approaches are explored, namely BERT-KDTree, BERT-BallTree, Cosine Brute Force and unsupervised NER. The results show that BERT-KDTree and BERT-BallTree are the most rewarding methods. Results from both methods have been evaluated by domain experts and appear promising for such an early stage, given the difficulty of the task. The evaluation of BERT-BallTree shows that multiple methods of extraction may be preferable as they provide different but still useful terms. Cosine brute force is deemed to be an unrealistic approach due to computational and memory requirements. The NER approach was deemed too impractical and laborious to justify for this study, yet is potentially useful if not more suitable given a different set of conditions and goals. While there is much to be explored and improved, these experiments are a clear indication that automatic identification of implant terms is possible, as a large number of implant terms were successfully discovered using automated means.
author Jerdhaf, Oskar
author_facet Jerdhaf, Oskar
author_sort Jerdhaf, Oskar
title Discovering Implant Terms in Medical Records
title_short Discovering Implant Terms in Medical Records
title_full Discovering Implant Terms in Medical Records
title_fullStr Discovering Implant Terms in Medical Records
title_full_unstemmed Discovering Implant Terms in Medical Records
title_sort discovering implant terms in medical records
publisher Linköpings universitet, Institutionen för datavetenskap
publishDate 2021
url http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-181885
work_keys_str_mv AT jerdhafoskar discoveringimplanttermsinmedicalrecords
_version_ 1723964860226600960