Summary: | Master's === National Chiao Tung University === Department of Applied Mathematics, Master's Program in Mathematical Modeling and Scientific Computing === 107 === Machine learning now performs remarkably well in many fields, and its methods generally give better results as more data become available. In some cases, however, data owners are unwilling to share their data because it contains private information. In other cases, a dataset is too large to store on a single machine. To address both problems, we propose the distributed consensus reduced support vector machine (DCRSVM) for binary classification. Consider a setting with many local working units and a central master, where each working unit holds its own data. The DCRSVM has two merits. First, it preserves data privacy: local data are never disclosed to the central master. Second, when a dataset is too large to store on a single server, the central master can still derive a good machine learning model even though the data reside only on the local devices. Our method solves both problems described above and produces competitive results.
|