Discovering Implant Terms in Medical Records
Implant terms are terms like "pacemaker" which indicate the presence of artifacts in the body of a human. These implant terms are key to determining if a patient can safely undergo Magnetic Resonance Imaging (MRI). However, to identify these terms in medical records is time-consuming, labo...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
Linköpings universitet, Institutionen för datavetenskap
2021
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-181885 |
id |
ndltd-UPSALLA1-oai-DiVA.org-liu-181885 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-liu-1818852021-12-18T05:49:54ZDiscovering Implant Terms in Medical RecordsengJerdhaf, OskarLinköpings universitet, Institutionen för datavetenskap2021AIMachine LearningMedical RecordsPatient RecordsMedicalRecordElectronic RecordsElectronic Medical RecordsBERTEMRImplant TermsImplantsTermTermsTerm DiscoveryArtificial IntelligenceWordSimilarityWord Similarityword-similarityembeddingsword embeddingsword-embeddingstransformersKDTREEBALLTREENERAIArtificiel IntelligensMaskininlärningPatient JournalMedicinsk JournalElektronisk Medicinsk JournalTermerBERTKDTREEBALLTREENERliknande ordtransformersEMRLanguage Technology (Computational Linguistics)Språkteknologi (språkvetenskaplig databehandling)Implant terms are terms like "pacemaker" which indicate the presence of artifacts in the body of a human. These implant terms are key to determining if a patient can safely undergo Magnetic Resonance Imaging (MRI). However, to identify these terms in medical records is time-consuming, laborious and expensive, but necessary for taking the correct precautions before an MRI scan. Automating this process is of great interest to radiologists as it ideally saves time, prevents mistakes and as a result saves lives. The electronic medical records (EMR) contain the documented medical history of a patient, including any implants or objects that an individual would have inside their body. Information about such objects and implants are of great interest when determining if and how a patient can be scanned using MRI. This information is unfortunately not easily extracted through automatic means. Due to their sparse presence and the unusual structure of medical records compared to most written text, makes it very difficult to automate using simple means. By leveraging the recent advancements in Artificial Intelligence (AI), this thesis explores the ability to identify and extract such terms automatically in Swedish EMRs. For the task of identifying implant terms in medical records a generally trained Swedish Bidirectional Encoder Representations from Transformers (BERT) model is used, which is then fine-tuned on Swedish medical records. Using this model a variety of approaches are explored two of which will be covered in this thesis. Using this model a variety of approaches are explored, namely BERT-KDTree, BERT-BallTree, Cosine Brute Force and unsupervised NER. The results show that BERT-KDTree and BERT-BallTree are the most rewarding methods. Results from both methods have been evaluated by domain experts and appear promising for such an early stage, given the difficulty of the task. The evaluation of BERT-BallTree shows that multiple methods of extraction may be preferable as they provide different but still useful terms. Cosine brute force is deemed to be an unrealistic approach due to computational and memory requirements. The NER approach was deemed too impractical and laborious to justify for this study, yet is potentially useful if not more suitable given a different set of conditions and goals. While there is much to be explored and improved, these experiments are a clear indication that automatic identification of implant terms is possible, as a large number of implant terms were successfully discovered using automated means. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-181885application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
AI Machine Learning Medical Records Patient Records Medical Record Electronic Records Electronic Medical Records BERT EMR Implant Terms Implants Term Terms Term Discovery Artificial Intelligence Word Similarity Word Similarity word-similarity embeddings word embeddings word-embeddings transformers KDTREE BALLTREE NER AI Artificiel Intelligens Maskininlärning Patient Journal Medicinsk Journal Elektronisk Medicinsk Journal Termer BERT KDTREE BALLTREE NER liknande ord transformers EMR Language Technology (Computational Linguistics) Språkteknologi (språkvetenskaplig databehandling) |
spellingShingle |
AI Machine Learning Medical Records Patient Records Medical Record Electronic Records Electronic Medical Records BERT EMR Implant Terms Implants Term Terms Term Discovery Artificial Intelligence Word Similarity Word Similarity word-similarity embeddings word embeddings word-embeddings transformers KDTREE BALLTREE NER AI Artificiel Intelligens Maskininlärning Patient Journal Medicinsk Journal Elektronisk Medicinsk Journal Termer BERT KDTREE BALLTREE NER liknande ord transformers EMR Language Technology (Computational Linguistics) Språkteknologi (språkvetenskaplig databehandling) Jerdhaf, Oskar Discovering Implant Terms in Medical Records |
description |
Implant terms are terms like "pacemaker" which indicate the presence of artifacts in the body of a human. These implant terms are key to determining if a patient can safely undergo Magnetic Resonance Imaging (MRI). However, to identify these terms in medical records is time-consuming, laborious and expensive, but necessary for taking the correct precautions before an MRI scan. Automating this process is of great interest to radiologists as it ideally saves time, prevents mistakes and as a result saves lives. The electronic medical records (EMR) contain the documented medical history of a patient, including any implants or objects that an individual would have inside their body. Information about such objects and implants are of great interest when determining if and how a patient can be scanned using MRI. This information is unfortunately not easily extracted through automatic means. Due to their sparse presence and the unusual structure of medical records compared to most written text, makes it very difficult to automate using simple means. By leveraging the recent advancements in Artificial Intelligence (AI), this thesis explores the ability to identify and extract such terms automatically in Swedish EMRs. For the task of identifying implant terms in medical records a generally trained Swedish Bidirectional Encoder Representations from Transformers (BERT) model is used, which is then fine-tuned on Swedish medical records. Using this model a variety of approaches are explored two of which will be covered in this thesis. Using this model a variety of approaches are explored, namely BERT-KDTree, BERT-BallTree, Cosine Brute Force and unsupervised NER. The results show that BERT-KDTree and BERT-BallTree are the most rewarding methods. Results from both methods have been evaluated by domain experts and appear promising for such an early stage, given the difficulty of the task. The evaluation of BERT-BallTree shows that multiple methods of extraction may be preferable as they provide different but still useful terms. Cosine brute force is deemed to be an unrealistic approach due to computational and memory requirements. The NER approach was deemed too impractical and laborious to justify for this study, yet is potentially useful if not more suitable given a different set of conditions and goals. While there is much to be explored and improved, these experiments are a clear indication that automatic identification of implant terms is possible, as a large number of implant terms were successfully discovered using automated means. |
author |
Jerdhaf, Oskar |
author_facet |
Jerdhaf, Oskar |
author_sort |
Jerdhaf, Oskar |
title |
Discovering Implant Terms in Medical Records |
title_short |
Discovering Implant Terms in Medical Records |
title_full |
Discovering Implant Terms in Medical Records |
title_fullStr |
Discovering Implant Terms in Medical Records |
title_full_unstemmed |
Discovering Implant Terms in Medical Records |
title_sort |
discovering implant terms in medical records |
publisher |
Linköpings universitet, Institutionen för datavetenskap |
publishDate |
2021 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-181885 |
work_keys_str_mv |
AT jerdhafoskar discoveringimplanttermsinmedicalrecords |
_version_ |
1723964860226600960 |