Discuss the Performance of R Language under Big Data

碩士 === 銘傳大學 === 應用統計與資料科學學系碩士班 === 107 === With the development of science and technology, access to data is also more and more convenient way, and various industries have accumulated and compiled many data. If we reuse the technology of data mining, we can extract valuable information from the...

Full description

Bibliographic Details
Main Authors: CHANG, KUNG-CHI, 張宮綺
Other Authors: LEE, YUE-SHI
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/7xc6e3
Description
Summary:碩士 === 銘傳大學 === 應用統計與資料科學學系碩士班 === 107 === With the development of science and technology, access to data is also more and more convenient way, and various industries have accumulated and compiled many data. If we reuse the technology of data mining, we can extract valuable information from the data. The R language is a free open source software that is good at data analysis. It can be enhanced by downloading and installing a free kit, so it is a complete and convenient analysis software. At present, many industries are entering the era of big data, but when R language faces too large amount of data, it often exceeds the load, resulting in lower performance, allowing us to spend a lot of time waiting. Therefore, this study hopes to explore how to improve the performance of the R language. This study mainly uses a large amount of data from the retail industry to explore and at the same time try to solve the performance problems of the R language. We hope that through this technology, users of R language will be more efficient in their use and operation.