A Bimodal Domain-Specific Deep Learning Approach for Automated Patent Classification

碩士 === 國立臺北科技大學 === 電子工程系 === 107 === Patent classification is fundamental to search patents in huge patent database. An automatic patent classification system is beneficial for subject technology to be effectively compared with previous cases of relevant technology, and further improve the efficien...

Full description

Bibliographic Details
Main Authors: LIAO, WEN-HUI, 廖文慧
Other Authors: HSIAO, RONG-SHUE
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/wtu56h
id ndltd-TW-107TIT00427099
record_format oai_dc
spelling ndltd-TW-107TIT004270992019-11-10T05:31:30Z http://ndltd.ncl.edu.tw/handle/wtu56h A Bimodal Domain-Specific Deep Learning Approach for Automated Patent Classification 一種基於雙模態領域特定深度學習的自動專利文本分類方法 LIAO, WEN-HUI 廖文慧 碩士 國立臺北科技大學 電子工程系 107 Patent classification is fundamental to search patents in huge patent database. An automatic patent classification system is beneficial for subject technology to be effectively compared with previous cases of relevant technology, and further improve the efficiency of patent analysis. In this research, we present a bimodal domain-specific deep learning approach for automated patent classification. Classification is based on International Patent Classification (IPC), which is published and managed by the World Intellectual Property Organization (WIPO). The system comprises word embedding pre-trained by US patent documents to convert word into word vector, and adopts CNN, CNN-BiLSTM and Hierarchical Attention Model in deep learning for text classification. The research further uses a bimodal data processing method to fuse the image-processed features of patent with the text-embedded vector-converted features to try to improve the classification accuracy. Training and testing data came from patent documents granted by the United States Patent and Trademark Office, which was classified into 8 categories to IPC at section level, 131 categories at class level, and 646 categories at subclass level. The experimental results showed that the system, comprising word embedding pre-trained by US patent documents and Hierarchical Attention Model for feature extraction and classification, obtained a better performance compared with previous researches. Among them, the system reaches a micro-average precision of 61.66% at IPC class level and 43.25% at IPC subclass level using claim as input. HSIAO, RONG-SHUE 蕭榮修 2019 學位論文 ; thesis 57 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺北科技大學 === 電子工程系 === 107 === Patent classification is fundamental to search patents in huge patent database. An automatic patent classification system is beneficial for subject technology to be effectively compared with previous cases of relevant technology, and further improve the efficiency of patent analysis. In this research, we present a bimodal domain-specific deep learning approach for automated patent classification. Classification is based on International Patent Classification (IPC), which is published and managed by the World Intellectual Property Organization (WIPO). The system comprises word embedding pre-trained by US patent documents to convert word into word vector, and adopts CNN, CNN-BiLSTM and Hierarchical Attention Model in deep learning for text classification. The research further uses a bimodal data processing method to fuse the image-processed features of patent with the text-embedded vector-converted features to try to improve the classification accuracy. Training and testing data came from patent documents granted by the United States Patent and Trademark Office, which was classified into 8 categories to IPC at section level, 131 categories at class level, and 646 categories at subclass level. The experimental results showed that the system, comprising word embedding pre-trained by US patent documents and Hierarchical Attention Model for feature extraction and classification, obtained a better performance compared with previous researches. Among them, the system reaches a micro-average precision of 61.66% at IPC class level and 43.25% at IPC subclass level using claim as input.
author2 HSIAO, RONG-SHUE
author_facet HSIAO, RONG-SHUE
LIAO, WEN-HUI
廖文慧
author LIAO, WEN-HUI
廖文慧
spellingShingle LIAO, WEN-HUI
廖文慧
A Bimodal Domain-Specific Deep Learning Approach for Automated Patent Classification
author_sort LIAO, WEN-HUI
title A Bimodal Domain-Specific Deep Learning Approach for Automated Patent Classification
title_short A Bimodal Domain-Specific Deep Learning Approach for Automated Patent Classification
title_full A Bimodal Domain-Specific Deep Learning Approach for Automated Patent Classification
title_fullStr A Bimodal Domain-Specific Deep Learning Approach for Automated Patent Classification
title_full_unstemmed A Bimodal Domain-Specific Deep Learning Approach for Automated Patent Classification
title_sort bimodal domain-specific deep learning approach for automated patent classification
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/wtu56h
work_keys_str_mv AT liaowenhui abimodaldomainspecificdeeplearningapproachforautomatedpatentclassification
AT liàowénhuì abimodaldomainspecificdeeplearningapproachforautomatedpatentclassification
AT liaowenhui yīzhǒngjīyúshuāngmótàilǐngyùtèdìngshēndùxuéxídezìdòngzhuānlìwénběnfēnlèifāngfǎ
AT liàowénhuì yīzhǒngjīyúshuāngmótàilǐngyùtèdìngshēndùxuéxídezìdòngzhuānlìwénběnfēnlèifāngfǎ
AT liaowenhui bimodaldomainspecificdeeplearningapproachforautomatedpatentclassification
AT liàowénhuì bimodaldomainspecificdeeplearningapproachforautomatedpatentclassification
_version_ 1719289380556767232