Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding

Knowlege is important for text-related applications. In this paper, we introduce Microsoft Concept Graph, a knowledge graph engine that provides concept tagging APIs to facilitate the understanding of human languages. Microsoft Concept Graph is built upon Probase, a universal p...

Full description

Bibliographic Details
Main Authors: Ji, Lei, Wang, Yujing, Shi, Botian, Zhang, Dawei, Wang, Zhongyuan, Yan, Jun
Format: Article
Language:English
Published: The MIT Press 2019-06-01
Series:Data Intelligence
Online Access:https://www.mitpressjournals.org/doi/abs/10.1162/dint_a_00013
id doaj-d497b5fa0aa443979e5ec843027e877c
record_format Article
spelling doaj-d497b5fa0aa443979e5ec843027e877c2020-11-25T03:17:44ZengThe MIT PressData Intelligence2641-435X2019-06-011323827010.1162/dint_a_00013Microsoft Concept Graph: Mining Semantic Concepts for Short Text UnderstandingJi, LeiWang, YujingShi, BotianZhang, DaweiWang, ZhongyuanYan, Jun Knowlege is important for text-related applications. In this paper, we introduce Microsoft Concept Graph, a knowledge graph engine that provides concept tagging APIs to facilitate the understanding of human languages. Microsoft Concept Graph is built upon Probase, a universal probabilistic taxonomy consisting of instances and concepts mined from the Web. We start by introducing the construction of the knowledge graph through iterative semantic extraction and taxonomy construction procedures, which extract 2.7 million concepts from 1.68 billion Web pages. We then use conceptualization models to represent text in the concept space to empower text-related applications, such as topic search, query recommendation, Web table understanding and Ads relevance. Since the release in 2016, Microsoft Concept Graph has received more than 100,000 pageviews, 2 million API calls and 3,000 registered downloads from 50,000 visitors over 64 countries. https://www.mitpressjournals.org/doi/abs/10.1162/dint_a_00013
collection DOAJ
language English
format Article
sources DOAJ
author Ji, Lei
Wang, Yujing
Shi, Botian
Zhang, Dawei
Wang, Zhongyuan
Yan, Jun
spellingShingle Ji, Lei
Wang, Yujing
Shi, Botian
Zhang, Dawei
Wang, Zhongyuan
Yan, Jun
Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding
Data Intelligence
author_facet Ji, Lei
Wang, Yujing
Shi, Botian
Zhang, Dawei
Wang, Zhongyuan
Yan, Jun
author_sort Ji, Lei
title Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding
title_short Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding
title_full Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding
title_fullStr Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding
title_full_unstemmed Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding
title_sort microsoft concept graph: mining semantic concepts for short text understanding
publisher The MIT Press
series Data Intelligence
issn 2641-435X
publishDate 2019-06-01
description Knowlege is important for text-related applications. In this paper, we introduce Microsoft Concept Graph, a knowledge graph engine that provides concept tagging APIs to facilitate the understanding of human languages. Microsoft Concept Graph is built upon Probase, a universal probabilistic taxonomy consisting of instances and concepts mined from the Web. We start by introducing the construction of the knowledge graph through iterative semantic extraction and taxonomy construction procedures, which extract 2.7 million concepts from 1.68 billion Web pages. We then use conceptualization models to represent text in the concept space to empower text-related applications, such as topic search, query recommendation, Web table understanding and Ads relevance. Since the release in 2016, Microsoft Concept Graph has received more than 100,000 pageviews, 2 million API calls and 3,000 registered downloads from 50,000 visitors over 64 countries.
url https://www.mitpressjournals.org/doi/abs/10.1162/dint_a_00013
work_keys_str_mv AT jilei microsoftconceptgraphminingsemanticconceptsforshorttextunderstanding
AT wangyujing microsoftconceptgraphminingsemanticconceptsforshorttextunderstanding
AT shibotian microsoftconceptgraphminingsemanticconceptsforshorttextunderstanding
AT zhangdawei microsoftconceptgraphminingsemanticconceptsforshorttextunderstanding
AT wangzhongyuan microsoftconceptgraphminingsemanticconceptsforshorttextunderstanding
AT yanjun microsoftconceptgraphminingsemanticconceptsforshorttextunderstanding
_version_ 1724630304988069888