In this work, we present \skfp, a Python package for computation of molecular fingerprints for applications in chemoinformatics. Our library offers an industry-standard scikit-learn interface, allowing intuitive usage and easy integration with machine learning pipelines. It is also highly optimized, featuring parallel computation that enables efficient processing of large molecular datasets. Currently, \skfp~stands as the most feature-rich library in the open source Python ecosystem, offering over 30 molecular fingerprints. Our library simplifies chemoinformatics tasks based on molecular fingerprints, including molecular property prediction and virtual screening. It is also flexible, highly efficient, and fully open source.
翻译:本文介绍了\skfp,一个用于化学信息学中分子指纹计算的Python软件包。我们的库提供了行业标准的scikit-learn接口,支持直观使用并易于与机器学习流程集成。该库经过高度优化,具备并行计算能力,能够高效处理大规模分子数据集。目前,\skfp是开源Python生态系统中功能最丰富的库,提供超过30种分子指纹。我们的库简化了基于分子指纹的化学信息学任务,包括分子性质预测和虚拟筛选。该库同时具备灵活性、高效率和完全开源的特点。