Analytical research on application of improved cosine similarity to patents of LED reading lamp

Bibliographic Details
Main Authors: De-Wei Wu, 吳德偉
Other Authors: Zone-Ching Lin
Format: Others
Language: zh-TW
Published: 2013
Online Access: http://ndltd.ncl.edu.tw/handle/60555997033073312324
Description
Summary: Master's === National Taiwan University of Science and Technology === Department of Mechanical Engineering === 101 === The study first uses the substrate hydrolysis process as a test case to establish a patent search and patent analysis method built on an improved cosine similarity measure. It then uses the same measure to improve the probability method of patent attribution (PJ value). In this method, every important term of a patent document is treated as a separate vector dimension; the value along each dimension is the weight of that term, and together the weights form the document's vector. The improved cosine similarity developed in the study takes the normalized numerical values of technical words, part and component words, and functional words as the term weights, and groups the patents highly related to a given technique or function into a word cluster. When a new patent is added, the study extracts its clusters of technical and functional words and uses improved cosine similarity to compare them with the word clusters of the highly related patents. After comparison, the highly related word clusters of the technical and functional hierarchies are expanded. Once these expanded clusters are established, and before the probability method of patent attribution (PJ value) is applied, the improved cosine similarity comparison can first be used to rule out irrelevant technical or functional hierarchies, giving a faster and more accurate judgment of patent attribution.
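The vector representation described above can be illustrated with a minimal sketch. The abstract does not give the exact form of the improved measure, so the code below uses the standard cosine formula over normalized term weights as a stand-in; the function name `cosine_similarity`, the sample terms and the weights are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Standard cosine similarity between two sparse term-weight vectors.

    Each vector is a dict mapping a term (one dimension) to its weight.
    This is the textbook formula, not necessarily the thesis's improved variant.
    """
    shared_terms = set(a) & set(b)
    dot = sum(a[t] * b[t] for t in shared_terms)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as unrelated to everything
    return dot / (norm_a * norm_b)


# Hypothetical normalized weights, for illustration only.
word_cluster = {"led": 1.0, "heat sink": 0.5, "illumination": 0.75}
new_patent = {"led": 0.8, "illumination": 0.6, "clamp": 0.4}

print(cosine_similarity(word_cluster, new_patent))
```

Storing each vector as a dictionary keeps only the terms that actually occur, which suits patent text where most vocabulary terms are absent from any one document.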
After establishing and testing the above methods on the substrate hydrolysis process, the study takes the LED reading lamp as its research subject. It uses the patent search method based on improved cosine similarity to retrieve Traditional Chinese, Simplified Chinese and English patent documents relating to LED reading lamps, and conducts an IPC analysis of them. The study then uses improved cosine similarity to compare the technical words, part and component words, and synonyms of functional words across these patents, and searches for the relevant Chinese and English patents. Combining semantic analysis of the patents and a term and word segmentation system with the concept of improved cosine similarity, the paper induces, partly automatically and partly manually, the technical and component word clusters and the functional word clusters of the relevant patents, counts the frequency of each technical, component and functional word, and calculates their normalized numerical values. Subsequently, the study collects the relevant synonyms from the retrieved patents to establish a check list of Traditional Chinese, Simplified Chinese and English synonyms. It then establishes the first- and second-hierarchy technical/functional matrices of the patents relating to LED reading lamps, and analyzes these patents from a technical viewpoint.
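As a rough sketch of the synonym check list and normalized frequency steps described above: the abstract does not state how frequencies are normalized, so dividing by the highest count is an assumption, and the `SYNONYMS` table and the function `normalized_frequencies` are hypothetical names introduced for illustration.

```python
from collections import Counter

# Hypothetical check list mapping Traditional Chinese, Simplified Chinese
# and English variants to one canonical term.
SYNONYMS = {
    "檯燈": "reading lamp",
    "台灯": "reading lamp",
    "散熱": "heat dissipation",
    "散热": "heat dissipation",
}


def normalized_frequencies(terms, synonyms):
    """Merge synonyms, count terms, and scale the counts into the range (0, 1].

    Assumption: normalization divides each count by the largest count.
    """
    canonical = [synonyms.get(term, term) for term in terms]
    counts = Counter(canonical)
    if not counts:
        return {}
    peak = max(counts.values())
    return {term: count / peak for term, count in counts.items()}


# Segmented terms from a hypothetical set of patents.
terms = ["檯燈", "台灯", "reading lamp", "散熱"]
weights = normalized_frequencies(terms, SYNONYMS)
print(weights)
```

Merging synonyms before counting means a concept expressed in three languages contributes to a single weight rather than three smaller ones.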
First, the paper takes the method developed on the substrate hydrolysis process for judging whether a patent belongs to a technical or functional category and applies it to patents relating to LED reading lamps. Using the check list of Traditional Chinese, Simplified Chinese and English synonyms established above, together with the frequency of the keywords in the part and component word cluster, the technical word cluster and the functional word cluster, the paper establishes fuzzy values for the relevant keywords and applies the concept of normalized frequency. The paper first screens the patents with improved cosine similarity, and then uses the probability method (PJ value) to calculate the probability that each patent belongs to a technical or functional category. Then, focusing on the rearranged patents of LED reading lamps, the paper carries out IPC analysis, inventor analysis, company analysis, country analysis and overall trend analysis. After term and word segmentation of the component words, technical terms and functional terms of the patents, the paper establishes the technical/functional word clusters and produces the first- and second-hierarchy technical/functional matrices. From these matrices, the paper derives the first and second technical hierarchies of the different techniques and companies, an analysis chart of their core competitiveness, a trend chart of competitor R&D activity, and a trend chart of technical-hierarchy activity. Finally, the paper verifies in practice whether relevant patents belong to the first or second technical hierarchy, demonstrating the feasibility of the research approach.
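The screening step that precedes the probability calculation might look roughly like the sketch below. The abstract does not define the PJ value or a screening cutoff, so only the filter is shown; the standard cosine formula stands in for the improved measure, and the `threshold` value, the function `screen_hierarchies` and the sample clusters are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Standard cosine similarity between two term-weight dicts."""
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def screen_hierarchies(patent, clusters, threshold=0.3):
    """Keep only the hierarchies whose word cluster resembles the patent.

    The threshold is an illustrative assumption, not a value from the thesis.
    Surviving hierarchies would then go on to the probability (PJ value) step.
    """
    scores = {name: cosine_similarity(patent, cluster)
              for name, cluster in clusters.items()}
    return {name: score for name, score in scores.items() if score >= threshold}


# Hypothetical word clusters for two technical hierarchies.
clusters = {
    "thermal": {"heat dissipation": 1.0, "led": 0.5},
    "mounting": {"clamp": 1.0},
}
patent = {"led": 1.0, "heat dissipation": 0.5}

print(screen_hierarchies(patent, clusters))
```

Filtering first means the more expensive attribution calculation only has to run on hierarchies that share vocabulary with the patent.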
The Traditional Chinese, Simplified Chinese and English patents on LED reading lamps retrieved above, together with the results of the IPC analysis and technical analysis, can serve as a reference for enterprises and engineers to save R&D time in producing new patents, and for enterprises to understand the core competitiveness of their competitors.