Summary: | Master's === National Sun Yat-sen University === Department of Computer Science and Engineering === 103 === In recent years, cloud computing, distributed computing, and big data technology have become increasingly important. Many data analysis methods have emerged, and a wide range of industries now apply big data analysis.
Big data technology can also be applied in medicine. Taiwan's National Health Insurance program began in 1995 and holds complete medical records for the people of Taiwan: more than 20 years of health-care information for at least 20 million people. A data set of this size is well suited to big data analysis. Such analysis lets researchers better understand diseases and uncover details that previously went unnoticed, helping to answer more open questions in medicine.
This thesis addresses the needs of medical researchers by implementing a health insurance database analysis system. The system uses the Hadoop Distributed File System (HDFS) to store large amounts of health-care information. Based on the conditions a researcher specifies, it uses Hive or Impala to filter the data, then applies a Java program for further filtering. Finally, it presents the resulting data to the researchers.
In the past, researchers commonly used statistical software such as SPSS or SAS. With large data volumes and complex query conditions, these tools could not complete the required calculations. Today, big data technology can be used to overcome these problems.
|