Using Machine Learning Methods to Classify and Retrieve Web APIs
碩士 === 國立臺灣海洋大學 === 資訊工程學系 === 106 === Nowadays, the retrieval of Web APIs to build applications is a must for developers. However, existing API systems are not able to help developers to precisely locate Web APIs. We identified two weaknesses of current API systems: 1) API category is an important...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2018
|
Online Access: | http://ndltd.ncl.edu.tw/handle/65m3ch |
id |
ndltd-TW-106NTOU5394045 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-106NTOU53940452019-11-01T05:28:50Z http://ndltd.ncl.edu.tw/handle/65m3ch Using Machine Learning Methods to Classify and Retrieve Web APIs 運用機器學習方法達成Web API分類與檢索 Pan, Tian-Yun 潘天允 碩士 國立臺灣海洋大學 資訊工程學系 106 Nowadays, the retrieval of Web APIs to build applications is a must for developers. However, existing API systems are not able to help developers to precisely locate Web APIs. We identified two weaknesses of current API systems: 1) API category is an important way to find relevant APIs. Current systems usually classify APIs into only one category. The single-category design is not reasonable and hinders the retrieval of relevant Web APIs, since most APIs belong to multiple categories; 2) Current systems expect users to issue keywords when searching APIs, not considering semantics. It makes that the search results are very likely few or unrelated. Therefore, in this thesis, we propose a new system, referred to as API Locator, to address the above issues. In API Locator, we apply the techniques of Machine Learning and most commonly used NLP (Nature Language Processing) mechanisms to devise a set of classification and recommendation algorithms. These algorithms are divided into three parts: 1) classification identification, which applies the traditional machine learning (SGD Classifier) and deep learning (LSTM) to identify the classification tags; 2) classification exclusion, which uses Word2Vec to exclude unrelated tags semantically; 3) API recommendation, which retrieves and sorts all the possible APIs with TF-IDF and Cosine Similarity. Notably, unlike the traditional search mode, the API Locator provides users with a way of searching by descriptions and supplies more possible classification information to the user query. Besides, the experimental results show high precision to multi-classification and API retrieval and demonstrate the feasibility and efficacy of API Locator. 馬尚彬 2018 學位論文 ; thesis 48 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣海洋大學 === 資訊工程學系 === 106 === Nowadays, the retrieval of Web APIs to build applications is a must for developers. However, existing API systems are not able to help developers to precisely locate Web APIs. We identified two weaknesses of current API systems: 1) API category is an important way to find relevant APIs. Current systems usually classify APIs into only one category. The single-category design is not reasonable and hinders the retrieval of relevant Web APIs, since most APIs belong to multiple categories; 2) Current systems expect users to issue keywords when searching APIs, not considering semantics. It makes that the search results are very likely few or unrelated.
Therefore, in this thesis, we propose a new system, referred to as API Locator, to address the above issues. In API Locator, we apply the techniques of Machine Learning and most commonly used NLP (Nature Language Processing) mechanisms to devise a set of classification and recommendation algorithms. These algorithms are divided into three parts: 1) classification identification, which applies the traditional machine learning (SGD Classifier) and deep learning (LSTM) to identify the classification tags; 2) classification exclusion, which uses Word2Vec to exclude unrelated tags semantically; 3) API recommendation, which retrieves and sorts all the possible APIs with TF-IDF and Cosine Similarity. Notably, unlike the traditional search mode, the API Locator provides users with a way of searching by descriptions and supplies more possible classification information to the user query. Besides, the experimental results show high precision to multi-classification and API retrieval and demonstrate the feasibility and efficacy of API Locator.
|
author2 |
馬尚彬 |
author_facet |
馬尚彬 Pan, Tian-Yun 潘天允 |
author |
Pan, Tian-Yun 潘天允 |
spellingShingle |
Pan, Tian-Yun 潘天允 Using Machine Learning Methods to Classify and Retrieve Web APIs |
author_sort |
Pan, Tian-Yun |
title |
Using Machine Learning Methods to Classify and Retrieve Web APIs |
title_short |
Using Machine Learning Methods to Classify and Retrieve Web APIs |
title_full |
Using Machine Learning Methods to Classify and Retrieve Web APIs |
title_fullStr |
Using Machine Learning Methods to Classify and Retrieve Web APIs |
title_full_unstemmed |
Using Machine Learning Methods to Classify and Retrieve Web APIs |
title_sort |
using machine learning methods to classify and retrieve web apis |
publishDate |
2018 |
url |
http://ndltd.ncl.edu.tw/handle/65m3ch |
work_keys_str_mv |
AT pantianyun usingmachinelearningmethodstoclassifyandretrievewebapis AT pāntiānyǔn usingmachinelearningmethodstoclassifyandretrievewebapis AT pantianyun yùnyòngjīqìxuéxífāngfǎdáchéngwebapifēnlèiyǔjiǎnsuǒ AT pāntiānyǔn yùnyòngjīqìxuéxífāngfǎdáchéngwebapifēnlèiyǔjiǎnsuǒ |
_version_ |
1719285335967399936 |