Identifying Technological Topic Changes in Patent Claims Using Topic Modeling

Hongshu Chen*, Yi Zhang, Donghua Zhu

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

    4 Citations (Scopus)

    Abstract

    Patent claims usually embody the core technological scope and the most essential terms to define the protection of an invention, which makes them the ideal resource for patent topic identification and theme changes analysis. However, conducting content analysis manually on massive technical terms is very time-consuming and laborious. Even with the help of traditional text mining techniques, it is still difficult to model topic changes over time, because single keywords alone are usually too general or ambiguous to represent a concept. Moreover, term frequency that used to rank keywords cannot separate polysemous words that are actually describing a different concept. To address this issue, this research proposes a topic change identification approach based on latent dirichlet allocation, to model and analyze topic changes and topic-based trend with minimal human intervention. After textual data cleaning, underlying semantic topics hidden in large archives of patent claims are revealed automatically. Topics are defined by probability distributions over words instead of terms and their frequency, so that polysemy is allowed. A case study using patents published in the United States Patent and Trademark Office (USPTO) from 2009 to 2013 with Australia as their assignee country is presented, to demonstrate the validity of the proposed topic change identification approach. The experimental result shows that the proposed approach can be used as an automatic tool to provide machine-identified topic changes for more efficient and effective R&D management assistance.

    Original languageEnglish
    Title of host publicationInnovation, Technology and Knowledge Management
    PublisherSpringer
    Pages187-209
    Number of pages23
    DOIs
    Publication statusPublished - 2016

    Publication series

    NameInnovation, Technology and Knowledge Management
    ISSN (Print)2197-5698
    ISSN (Electronic)2197-5701

    Keywords

    • Patent analysis
    • Tech mining
    • Topic modeling

    Fingerprint

    Dive into the research topics of 'Identifying Technological Topic Changes in Patent Claims Using Topic Modeling'. Together they form a unique fingerprint.

    Cite this