Using Machine Learning Methods to Classify and Retrieve Web APIs

碩士 === 國立臺灣海洋大學 === 資訊工程學系 === 106 === Nowadays, the retrieval of Web APIs to build applications is a must for developers. However, existing API systems are not able to help developers to precisely locate Web APIs. We identified two weaknesses of current API systems: 1) API category is an important...

Full description

Bibliographic Details
Main Authors: Pan, Tian-Yun, 潘天允
Other Authors: 馬尚彬
Format: Others
Language:zh-TW
Published: 2018
Online Access:http://ndltd.ncl.edu.tw/handle/65m3ch
id ndltd-TW-106NTOU5394045
record_format oai_dc
spelling ndltd-TW-106NTOU53940452019-11-01T05:28:50Z http://ndltd.ncl.edu.tw/handle/65m3ch Using Machine Learning Methods to Classify and Retrieve Web APIs 運用機器學習方法達成Web API分類與檢索 Pan, Tian-Yun 潘天允 碩士 國立臺灣海洋大學 資訊工程學系 106 Nowadays, the retrieval of Web APIs to build applications is a must for developers. However, existing API systems are not able to help developers to precisely locate Web APIs. We identified two weaknesses of current API systems: 1) API category is an important way to find relevant APIs. Current systems usually classify APIs into only one category. The single-category design is not reasonable and hinders the retrieval of relevant Web APIs, since most APIs belong to multiple categories; 2) Current systems expect users to issue keywords when searching APIs, not considering semantics. It makes that the search results are very likely few or unrelated. Therefore, in this thesis, we propose a new system, referred to as API Locator, to address the above issues. In API Locator, we apply the techniques of Machine Learning and most commonly used NLP (Nature Language Processing) mechanisms to devise a set of classification and recommendation algorithms. These algorithms are divided into three parts: 1) classification identification, which applies the traditional machine learning (SGD Classifier) and deep learning (LSTM) to identify the classification tags; 2) classification exclusion, which uses Word2Vec to exclude unrelated tags semantically; 3) API recommendation, which retrieves and sorts all the possible APIs with TF-IDF and Cosine Similarity. Notably, unlike the traditional search mode, the API Locator provides users with a way of searching by descriptions and supplies more possible classification information to the user query. Besides, the experimental results show high precision to multi-classification and API retrieval and demonstrate the feasibility and efficacy of API Locator. 馬尚彬 2018 學位論文 ; thesis 48 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣海洋大學 === 資訊工程學系 === 106 === Nowadays, the retrieval of Web APIs to build applications is a must for developers. However, existing API systems are not able to help developers to precisely locate Web APIs. We identified two weaknesses of current API systems: 1) API category is an important way to find relevant APIs. Current systems usually classify APIs into only one category. The single-category design is not reasonable and hinders the retrieval of relevant Web APIs, since most APIs belong to multiple categories; 2) Current systems expect users to issue keywords when searching APIs, not considering semantics. It makes that the search results are very likely few or unrelated. Therefore, in this thesis, we propose a new system, referred to as API Locator, to address the above issues. In API Locator, we apply the techniques of Machine Learning and most commonly used NLP (Nature Language Processing) mechanisms to devise a set of classification and recommendation algorithms. These algorithms are divided into three parts: 1) classification identification, which applies the traditional machine learning (SGD Classifier) and deep learning (LSTM) to identify the classification tags; 2) classification exclusion, which uses Word2Vec to exclude unrelated tags semantically; 3) API recommendation, which retrieves and sorts all the possible APIs with TF-IDF and Cosine Similarity. Notably, unlike the traditional search mode, the API Locator provides users with a way of searching by descriptions and supplies more possible classification information to the user query. Besides, the experimental results show high precision to multi-classification and API retrieval and demonstrate the feasibility and efficacy of API Locator.
author2 馬尚彬
author_facet 馬尚彬
Pan, Tian-Yun
潘天允
author Pan, Tian-Yun
潘天允
spellingShingle Pan, Tian-Yun
潘天允
Using Machine Learning Methods to Classify and Retrieve Web APIs
author_sort Pan, Tian-Yun
title Using Machine Learning Methods to Classify and Retrieve Web APIs
title_short Using Machine Learning Methods to Classify and Retrieve Web APIs
title_full Using Machine Learning Methods to Classify and Retrieve Web APIs
title_fullStr Using Machine Learning Methods to Classify and Retrieve Web APIs
title_full_unstemmed Using Machine Learning Methods to Classify and Retrieve Web APIs
title_sort using machine learning methods to classify and retrieve web apis
publishDate 2018
url http://ndltd.ncl.edu.tw/handle/65m3ch
work_keys_str_mv AT pantianyun usingmachinelearningmethodstoclassifyandretrievewebapis
AT pāntiānyǔn usingmachinelearningmethodstoclassifyandretrievewebapis
AT pantianyun yùnyòngjīqìxuéxífāngfǎdáchéngwebapifēnlèiyǔjiǎnsuǒ
AT pāntiānyǔn yùnyòngjīqìxuéxífāngfǎdáchéngwebapifēnlèiyǔjiǎnsuǒ
_version_ 1719285335967399936