SENECA: Small-Sample Discrete Entropy Estimation via Self-Consistent Missing Mass

Lucas H. McCabe, H. Howie Huang

Discrete entropy estimation is a classic information theory problem, wherein the average information content of a discrete random variable is estimated from samples alone. Naive approaches, such as the plugin method, fail to account for the probability mass associated with members of the random variable's support that are unobserved in a given sample, known as the "missing mass." The resulting systematic underestimation is particularly problematic when data is time-consuming or costly to gather. We propose SENECA, an entropy estimation scheme based on a novel "self-consistent" missing mass calculation. Extensive numerical experiments indicate that our approach outperforms many state-of-the-art alternatives overall in the small-sample setting. We then apply SENECA to two practical use cases, namely biodiversity estimation and the detection of incorrect large language model responses, where our method is competitive with domain-specific approaches. Our work advances SENECA as an effective drop-in replacement for small-sample entropy estimation, with broad utility across several domains.
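
To make the two background notions in the abstract concrete, the sketch below computes the plugin entropy estimate and a classical Good-Turing estimate of the missing mass (the fraction of samples that are singletons). This is not SENECA's self-consistent calculation, which the abstract does not specify; Good-Turing is swapped in here purely as a well-known illustration, and the function names `plugin_entropy` and `good_turing_missing_mass` are hypothetical.

```python
import math
from collections import Counter


def plugin_entropy(samples):
    """Plugin (empirical-frequency) entropy estimate, in nats."""
    n = len(samples)
    if n == 0:
        raise ValueError("need at least one sample")
    counts = Counter(samples)
    # Only observed symbols contribute, so unseen support is ignored entirely.
    return -sum((c / n) * math.log(c / n) for c in counts.values())


def good_turing_missing_mass(samples):
    """Good-Turing missing-mass estimate: share of samples seen exactly once.

    Illustrative stand-in only; not the self-consistent estimate from the paper.
    """
    n = len(samples)
    if n == 0:
        raise ValueError("need at least one sample")
    counts = Counter(samples)
    singletons = sum(1 for c in counts.values() if c == 1)
    return singletons / n


sample = ["a", "a", "b", "c"]
h_plugin = plugin_entropy(sample)
m0 = good_turing_missing_mass(sample)
print(f"plugin entropy: {h_plugin:.4f} nats, missing mass: {m0:.2f}")
```

A large missing-mass estimate on a small sample, as here, is a signal that the plugin value is likely to understate the true entropy, which is the gap that corrected estimators aim to close.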
