Applying the maximum entropy principle to neural networks enhances multi-species distribution models

The rapid expansion of citizen science initiatives has led to a significant growth of biodiversity databases, and particularly presence-only (PO) observations. PO data are invaluable for understanding species distributions and their dynamics, but their use in a Species Distribution Model (SDM) is curtailed by sampling biases and the lack of information on absences. Poisson point processes are widely used for SDMs, with Maxent being one of the most popular methods. Maxent maximises the entropy of a probability distribution across sites as a function of predefined transformations of variables, called features. In contrast, neural networks and deep learning have emerged as a promising technique for automatic feature extraction from complex input variables. Arbitrarily complex transformations of input variables can be learned from the data efficiently through backpropagation and stochastic gradient descent (SGD). In this paper, we propose DeepMaxent, which harnesses neural networks to automatically learn shared features among species, using the maximum entropy principle. To do so, it employs a normalised Poisson loss where for each species, presence probabilities across sites are modelled by a neural network. We evaluate DeepMaxent on a benchmark dataset known for its spatial sampling biases, using PO data for calibration and presence-absence (PA) data for validation across six regions with different biological groups and covariates. Our results indicate that DeepMaxent performs better than Maxent and other leading SDMs across all regions and taxonomic groups. The method performs particularly well in regions of uneven sampling, demonstrating substantial potential to increase SDM performances. In particular, our approach yields more accurate predictions than traditional single-species models, which opens up new possibilities for methodological enhancement.

翻译：公民科学计划的快速扩张显著促进了生物多样性数据库的增长，特别是仅出现（PO）观测数据。PO数据对于理解物种分布及其动态具有重要价值，但其在物种分布模型（SDM）中的应用受限于采样偏差和缺乏缺失信息。泊松点过程被广泛用于SDM，其中Maxent是最流行的方法之一。Maxent通过最大化跨站点的概率分布熵来实现，该分布是预定义变量变换（称为特征）的函数。相比之下，神经网络和深度学习已成为从复杂输入变量中自动提取特征的有前景技术。通过反向传播和随机梯度下降（SGD），可以从数据中高效学习任意复杂的输入变量变换。本文提出DeepMaxent方法，该方法利用神经网络自动学习物种间的共享特征，并应用最大熵原理。为此，它采用归一化泊松损失函数，其中每个物种在各站点的出现概率由神经网络建模。我们在一个以空间采样偏差著称的基准数据集上评估DeepMaxent，使用PO数据进行校准，并利用出现-缺失（PA）数据在六个具有不同生物群组和协变量的区域进行验证。结果表明，DeepMaxent在所有区域和分类群组中的表现均优于Maxent及其他主流SDM。该方法在采样不均匀区域表现尤为突出，显示出显著提升SDM性能的潜力。特别值得注意的是，我们的方法比传统单物种模型产生更准确的预测，这为方法学改进开辟了新的可能性。

相关内容

SDM

关注 11

数据挖掘是从数据中发现有价值的知识的计算过程，是现代数据科学的核心。它在许多领域有着巨大的应用，包括科学、工程、医疗保健、商业和医学。这些字段中的典型数据集是大的、复杂的，而且通常是有噪声的。从这些数据集中提取知识需要使用复杂的、高性能的、有原则的分析技术和算法。这些技术反过来又需要在高性能计算基础设施上的实现，这些基础设施需要经过仔细的性能调优。强大的可视化技术和有效的用户界面对于使数据挖掘工具吸引来自不同学科的研究人员、分析师、数据科学家和应用程序开发人员以及利益相关者的可用性也至关重要。SDM确立了自己在数据挖掘领域的领先地位，并为解决这些问题的研究人员提供了一个在同行评审论坛上展示其工作的场所。SDM强调原则方法和坚实的数学基础，以其高质量和高影响力的技术论文而闻名，并提供强大的研讨会和教程程序(包括在会议注册中)。官网地址：http://dblp.uni-trier.de/db/conf/sdm/

美陆军研究报告《基于熵引导的深度神经网络加速收敛与性能提升方法》最新26页

专知会员服务

17+阅读 · 2025年7月3日

大语言模型在多模态推荐系统中的应用综述

专知会员服务

17+阅读 · 2025年5月17日

使用多模态大语言模型进行深度学习的图像、文本和语音数据增强：综述

专知会员服务

28+阅读 · 2025年2月4日

大规模语言模型在生物信息学中的应用

专知会员服务

18+阅读 · 2025年1月16日