The proximity of ideas: An analysis of patent text using machine learning.
This paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent's abstract tex...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2020-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0234880 |
id |
doaj-277b0701d91145b7b446662f61d2ed1c |
---|---|
record_format |
Article |
spelling |
doaj-277b0701d91145b7b446662f61d2ed1c2021-03-03T21:55:35ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01157e023488010.1371/journal.pone.0234880The proximity of ideas: An analysis of patent text using machine learning.Sijie FengThis paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent's abstract text using Document Vectors (Doc2Vec), and using cosine similarity to measure their proximity in ideas space. I illustrate the potential uses of this method with an application to geographic localization in knowledge spillovers. For patents in the same technology field, their normalized text similarity is 0.02-0.05 S.D.s higher if they are located within the same city, compared to patents from other cities. This effect is much smaller than when knowledge transfers are measured using normalized patent citations: local patents receive about 0.23-0.30 S.D.s more local citations than compared to non-local control patents. These findings suggest that the effect of geography on knowledge transfers may be much smaller than the previous literature using citations suggests.https://doi.org/10.1371/journal.pone.0234880 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sijie Feng |
spellingShingle |
Sijie Feng The proximity of ideas: An analysis of patent text using machine learning. PLoS ONE |
author_facet |
Sijie Feng |
author_sort |
Sijie Feng |
title |
The proximity of ideas: An analysis of patent text using machine learning. |
title_short |
The proximity of ideas: An analysis of patent text using machine learning. |
title_full |
The proximity of ideas: An analysis of patent text using machine learning. |
title_fullStr |
The proximity of ideas: An analysis of patent text using machine learning. |
title_full_unstemmed |
The proximity of ideas: An analysis of patent text using machine learning. |
title_sort |
proximity of ideas: an analysis of patent text using machine learning. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2020-01-01 |
description |
This paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent's abstract text using Document Vectors (Doc2Vec), and using cosine similarity to measure their proximity in ideas space. I illustrate the potential uses of this method with an application to geographic localization in knowledge spillovers. For patents in the same technology field, their normalized text similarity is 0.02-0.05 S.D.s higher if they are located within the same city, compared to patents from other cities. This effect is much smaller than when knowledge transfers are measured using normalized patent citations: local patents receive about 0.23-0.30 S.D.s more local citations than compared to non-local control patents. These findings suggest that the effect of geography on knowledge transfers may be much smaller than the previous literature using citations suggests. |
url |
https://doi.org/10.1371/journal.pone.0234880 |
work_keys_str_mv |
AT sijiefeng theproximityofideasananalysisofpatenttextusingmachinelearning AT sijiefeng proximityofideasananalysisofpatenttextusingmachinelearning |
_version_ |
1714814280553988096 |