We introduce OneKE, a dockerized schema-guided knowledge extraction system, which can extract knowledge from the Web and raw PDF Books, and support various domains (science, news, etc.). Specifically, we design OneKE with multiple agents and a configure knowledge base. Different agents perform their respective roles, enabling support for various extraction scenarios. The configure knowledge base facilitates schema configuration, error case debugging and correction, further improving the performance. Empirical evaluations on benchmark datasets demonstrate OneKE's efficacy, while case studies further elucidate its adaptability to diverse tasks across multiple domains, highlighting its potential for broad applications. We have open-sourced the Code at https://github.com/zjunlp/OneKE and released a Video at http://oneke.openkg.cn/demo.mp4.
翻译:本文介绍了OneKE,一种基于Docker容器化、模式引导的知识抽取系统,该系统能够从互联网及原始PDF书籍中抽取知识,并支持多种领域(如科学、新闻等)。具体而言,我们设计了包含多智能体与可配置知识库的OneKE架构。不同智能体执行各自角色,从而支持多样化的抽取场景。可配置知识库便于模式配置、错误案例调试与修正,进一步提升了系统性能。在基准数据集上的实证评估验证了OneKE的有效性,案例研究进一步阐明了其在跨领域多任务中的适应能力,凸显了其广泛的应用潜力。我们已在https://github.com/zjunlp/OneKE开源代码,并在http://oneke.openkg.cn/demo.mp4发布了演示视频。