A key capability in managing patent applications or a patent portfolio is comparing claims to other text, e.g. a patent specification. Because the language of claims is different from language used elsewhere in the patent application or in non-patent text, this has been challenging for computer based natural language processing. We test two new LLM-based approaches and find that both provide substantially better performance than previously published values. The ability to match dense information from one domain against much more distributed information expressed in a different vocabulary may also be useful beyond the intellectual property space.
翻译:在管理专利申请或专利组合时,一项关键能力是将权利要求与其他文本(例如专利说明书)进行比较。由于权利要求的语言与专利申请其他部分或非专利文本中使用的语言存在差异,这对基于计算机的自然语言处理一直构成挑战。我们测试了两种基于大型语言模型的新方法,发现这两种方法均提供了显著优于以往公布值的性能表现。将来自某一领域的高密度信息与以不同词汇表达的更分散信息进行匹配的能力,在知识产权领域之外也可能具有应用价值。