Image Transfer Between Magnetic Resonance Images and Speech Diagrams
Realtime Magnetic Resonance Imaging (MRI) is a method used for human anatomical study. MRIs give exceptionally detailed information about soft-tissue structures, such as tongues, that other current imaging techniques cannot achieve. However, the process requires special equipment and is expensive...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | en |
Published: |
Université d'Ottawa / University of Ottawa
2020
|
Subjects: | |
Online Access: | http://hdl.handle.net/10393/41533 http://dx.doi.org/10.20381/ruor-25757 |
id |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-41533 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-415332020-12-05T05:38:56Z Image Transfer Between Magnetic Resonance Images and Speech Diagrams Wang, Kang Lee, Wonsook Generative adversarial network Super resolution MRI Speech diagram Realtime Magnetic Resonance Imaging (MRI) is a method used for human anatomical study. MRIs give exceptionally detailed information about soft-tissue structures, such as tongues, that other current imaging techniques cannot achieve. However, the process requires special equipment and is expensive. Hence, it is not quite suitable for all patients. Speech diagrams show the side view positions of organs like the tongue, throat, and lip of a speaking or singing person. The process of making a speech diagram is like the semantic segmentation of an MRI, which focuses on the selected edge structure. Speech diagrams are easy to understand with a clear speech diagram of the tongue and inside mouth structure. However, it often requires manual annotation on the MRI machine by an expert in the field. By using machine learning methods, we achieved transferring images between MRI and speech diagrams in two directions. We first matched videos of speech diagram and tongue MRIs. Then we used various image processing methods and data augmentation methods to make the paired images easy to train. We built our network model inspired by different cross-domain image transfer methods and applied reference-based super-resolution methods—to generate high-resolution images. Thus, we can do the transferring work through our network instead of manually. Also, generated speech diagram can work as an intermediary part to be transferred to other medical images like computerized tomography (CT), since it is simpler in structure compared to an MRI. We conducted experiments using both the data from our database and other MRI video sources. We use multiple methods to do the evaluation and comparisons with several related methods show the superiority of our approach. 2020-12-03T20:25:02Z 2020-12-03T20:25:02Z 2020-12-03 Thesis http://hdl.handle.net/10393/41533 http://dx.doi.org/10.20381/ruor-25757 en application/pdf Université d'Ottawa / University of Ottawa |
collection |
NDLTD |
language |
en |
format |
Others
|
sources |
NDLTD |
topic |
Generative adversarial network Super resolution MRI Speech diagram |
spellingShingle |
Generative adversarial network Super resolution MRI Speech diagram Wang, Kang Image Transfer Between Magnetic Resonance Images and Speech Diagrams |
description |
Realtime Magnetic Resonance Imaging (MRI) is a method used for human
anatomical study. MRIs give exceptionally detailed information about soft-tissue
structures, such as tongues, that other current imaging techniques cannot achieve.
However, the process requires special equipment and is expensive. Hence, it is not quite
suitable for all patients.
Speech diagrams show the side view positions of organs like the tongue, throat,
and lip of a speaking or singing person. The process of making a speech diagram is like
the semantic segmentation of an MRI, which focuses on the selected edge structure.
Speech diagrams are easy to understand with a clear speech diagram of the tongue and
inside mouth structure. However, it often requires manual annotation on the MRI
machine by an expert in the field.
By using machine learning methods, we achieved transferring images between
MRI and speech diagrams in two directions. We first matched videos of speech diagram
and tongue MRIs. Then we used various image processing methods and data
augmentation methods to make the paired images easy to train. We built our network
model inspired by different cross-domain image transfer methods and applied
reference-based super-resolution methods—to generate high-resolution images. Thus,
we can do the transferring work through our network instead of manually. Also,
generated speech diagram can work as an intermediary part to be transferred to other
medical images like computerized tomography (CT), since it is simpler in structure
compared to an MRI.
We conducted experiments using both the data from our database and other MRI
video sources. We use multiple methods to do the evaluation and comparisons with
several related methods show the superiority of our approach. |
author2 |
Lee, Wonsook |
author_facet |
Lee, Wonsook Wang, Kang |
author |
Wang, Kang |
author_sort |
Wang, Kang |
title |
Image Transfer Between Magnetic Resonance Images and Speech Diagrams |
title_short |
Image Transfer Between Magnetic Resonance Images and Speech Diagrams |
title_full |
Image Transfer Between Magnetic Resonance Images and Speech Diagrams |
title_fullStr |
Image Transfer Between Magnetic Resonance Images and Speech Diagrams |
title_full_unstemmed |
Image Transfer Between Magnetic Resonance Images and Speech Diagrams |
title_sort |
image transfer between magnetic resonance images and speech diagrams |
publisher |
Université d'Ottawa / University of Ottawa |
publishDate |
2020 |
url |
http://hdl.handle.net/10393/41533 http://dx.doi.org/10.20381/ruor-25757 |
work_keys_str_mv |
AT wangkang imagetransferbetweenmagneticresonanceimagesandspeechdiagrams |
_version_ |
1719368569201885184 |