A Lucene-based Platform for Contextual Analysis from Texts

Master's === Yuan Ze University === Department of Information Management === 103 === Since the invention of the PC, the development of Internet and cloud technology has tied computing closely to people's lives, and since 2012 "Big Data" has become one of the most watched new concepts. Big Data is also known as big dat...


Bibliographic Details
Main Authors: Yu-Ju Shen, 沈育儒
Other Authors: Liang-Chih Yu
Format: Others
Language: zh-TW
Online Access: http://ndltd.ncl.edu.tw/handle/76906838030621695468
id ndltd-TW-103YZU05396049
record_format oai_dc
spelling ndltd-TW-103YZU05396049 2016-09-25T04:04:59Z http://ndltd.ncl.edu.tw/handle/76906838030621695468 A Lucene-based Platform for Contextual Analysis from Texts 以Lucene為基礎之文脈分析平臺 Yu-Ju Shen 沈育儒 Master's Yuan Ze University Department of Information Management 103 Liang-Chih Yu 禹良治 Degree thesis ; thesis 33 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === Yuan Ze University === Department of Information Management === 103 === Since the invention of the PC, the development of Internet and cloud technology has tied computing closely to people's lives, and since 2012 "Big Data" has become one of the most watched new concepts. Big Data, also called massive data, keeps growing, in part because information is collected extensively from many sources, such as mobile devices, high-altitude sensing technology, social media on the Internet, and software logs. By 2020, this data is expected to double every two years. Its importance, however, lies not in how much data there is but in how tools are used to find clues and trends across a variety of sources. 60% of respondents believed their organization could use more data for analysis to drive organizational innovation and achieve differentiation, which is the real key to competition. The purpose of this thesis is to use Lucene as the basis for data indexing and search, developed in a Java environment. The platform indexes text and calculates TF-IDF values, reporting the count and weight of each word. It can also count the words that appear with a query keyword in the text and select those matching a given part-of-speech feature, giving analysts material for follow-up work such as word clouds or emotion-word analysis. The number of times a keyword appears along a timeline can be calculated through different APIs to support forecast analysis.
author2 Liang-Chih Yu
author_facet Liang-Chih Yu
Yu-Ju Shen
沈育儒
author Yu-Ju Shen
沈育儒
title A Lucene-based Platform for Contextual Analysis from Texts
url http://ndltd.ncl.edu.tw/handle/76906838030621695468
_version_ 1718385613524172800