MLCBART: Multilabel Classification with Bayesian Additive Regression Trees - 专知论文

会员服务 ·

0

贝叶斯 · 多标签分类 · BART · 结构 · 不确定 ·

MLCBART: Multilabel Classification with Bayesian Additive Regression Trees

翻译：MLCBART：基于贝叶斯加性回归树的多标签分类方法

Jiahao Tian,Hugh Chipman,Thomas Loughin

Multilabel Classification (MLC) deals with the simultaneous classification of multiple binary labels. The task is challenging because, not only may there be arbitrarily different and complex relationships between predictor variables and each label, but associations among labels may exist even after accounting for effects of predictor variables. In this paper, we present a Bayesian additive regression tree (BART) framework to model the problem. BART is a nonparametric and flexible model structure capable of uncovering complex relationships within the data. Our adaptation, MLCBART, assumes that labels arise from thresholding an underlying numeric scale, where a multivariate normal model allows explicit estimation of the correlation structure among labels. This enables the discovery of complicated relationships in various forms and improves MLC predictive performance. Our Bayesian framework not only enables uncertainty quantification for each predicted label, but our MCMC draws produce an estimated conditional probability distribution of label combinations for any predictor values. Simulation experiments demonstrate the effectiveness of the proposed model by comparing its performance with a set of models, including the oracle model with the correct functional form. Results show that our model predicts vectors of labels more accurately than other contenders and its performance is close to the oracle model. An example highlights how the method's ability to produce measures of uncertainty on predictions provides nuanced understanding of classification results.

翻译：多标签分类（MLC）涉及对多个二元标签同时进行分类。该任务具有挑战性，不仅因为预测变量与每个标签之间可能存在任意不同且复杂的关系，而且在考虑预测变量的影响后，标签之间仍可能存在关联。本文提出一个贝叶斯加性回归树（BART）框架对该问题进行建模。BART是一种非参数且灵活的模型结构，能够揭示数据中的复杂关系。我们提出的改进模型MLCBART假设标签产生于对底层数值尺度的阈值化处理，其中多元正态模型允许显式估计标签间的相关性结构。这使得模型能够发现各种形式的复杂关系，并提升MLC的预测性能。我们的贝叶斯框架不仅能够量化每个预测标签的不确定性，而且通过MCMC抽样可以为任意预测变量值生成标签组合的条件概率分布估计。仿真实验通过将所提模型与一组模型（包括具有正确函数形式的理想模型）进行性能比较，证明了该模型的有效性。结果表明，我们的模型在标签向量的预测精度上优于其他竞争模型，且其性能接近理想模型。通过具体案例展示了该方法生成预测不确定性度量的能力，如何为分类结果提供细致入微的理解。

0

相关内容

贝叶斯

《不完全多标签学习综述：最新进展与未来趋势》

《不完全多标签学习综述：最新进展与未来趋势》

专知会员服务

26+阅读 · 2024年6月11日

《深度学习多标签学习》最新综述

《深度学习多标签学习》最新综述

专知会员服务

47+阅读 · 2024年1月31日

监督和半监督学习下的多标签分类综述

监督和半监督学习下的多标签分类综述

专知会员服务

46+阅读 · 2022年8月3日

【ICML2021】基于稀疏标签编码的多维分类

专知会员服务

15+阅读 · 2021年9月29日

多标签文本分类研究进展

专知会员服务

40+阅读 · 2021年5月18日

【WWW2021】大规模层次结构中的元数据感知文本分类

专知会员服务

17+阅读 · 2021年2月17日

注意力图神经网络的多标签文本分类

注意力图神经网络的多标签文本分类

专知会员服务

112+阅读 · 2020年3月28日

【AAAI2020】多模态注意力语义图嵌入多标签分类（Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification）

【AAAI2020】多模态注意力语义图嵌入多标签分类（Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification）

专知会员服务

92+阅读 · 2019年12月22日

【KDD2019|讲座推荐】成本敏感多类多标签分类研究进展：Advances in Cost-sensitive Multiclass and Multilabel Classification

【KDD2019|讲座推荐】成本敏感多类多标签分类研究进展：Advances in Cost-sensitive Multiclass and Multilabel Classification

专知会员服务

20+阅读 · 2019年12月9日

【ECML-PKDD 2019】可解释序列分类的背景知识注入（Background Knowledge Injection forInterpretable Sequence Classification）

【ECML-PKDD 2019】可解释序列分类的背景知识注入（Background Knowledge Injection forInterpretable Sequence Classification）

专知会员服务

15+阅读 · 2019年12月3日

ICCV 2019 论文解读：用图神经网络改善视频的多标签分类

ICCV 2019 论文解读：用图神经网络改善视频的多标签分类

AI科技评论

11+阅读 · 2019年11月28日

【资源】NLP多标签文本分类代码实现工具包

【资源】NLP多标签文本分类代码实现工具包

专知

40+阅读 · 2019年11月20日

周志华团队：深度森林挑战多标签学习，9大数据集超越传统方法

周志华团队：深度森林挑战多标签学习，9大数据集超越传统方法

新智元

18+阅读 · 2019年11月20日

标签间相关性在多标签分类问题中的应用

标签间相关性在多标签分类问题中的应用

人工智能前沿讲习班

23+阅读 · 2019年6月5日

BAM！利用知识蒸馏和多任务学习构建的通用语言模型

BAM！利用知识蒸馏和多任务学习构建的通用语言模型

机器之心

15+阅读 · 2019年3月18日

【干货】用BRET进行多标签文本分类（附代码）

【干货】用BRET进行多标签文本分类（附代码）

专知

276+阅读 · 2019年2月9日

【GitHub项目推荐】文本分类最好的几个深度学习方法 TensorFlow 实践

【GitHub项目推荐】文本分类最好的几个深度学习方法 TensorFlow 实践

专知

39+阅读 · 2018年11月27日

手把手教你用Keras进行多标签分类（附代码）

手把手教你用Keras进行多标签分类（附代码）

数据派THU

11+阅读 · 2018年7月17日

深度学习文本分类方法综述（代码）

深度学习文本分类方法综述（代码）

中国人工智能学会

28+阅读 · 2018年6月16日

一文读懂贝叶斯分类算法（附学习资源）

一文读懂贝叶斯分类算法（附学习资源）

大数据文摘

12+阅读 · 2017年12月14日

基于分类能力结构度量与类相关性关系保留的特征选取方法研究

国家自然科学基金

1+阅读 · 2017年12月31日

多标记文本数据流分类方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于形状信息和结果反馈的多图谱图像分割方法

国家自然科学基金

0+阅读 · 2015年12月31日

基于多样化查询的多标记主动学习研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于概率语义分析的多关系图多类标分类方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

模糊认知集群优化的聚类算法

国家自然科学基金

9+阅读 · 2015年12月31日

基于异构信息网络的分类算法推荐方法研究

国家自然科学基金

7+阅读 · 2015年12月31日

对具有非平衡多标签特性的蛋白质功能类型分类预测研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于狄利克雷过程的潜变量模型贝叶斯半参数分析

国家自然科学基金

2+阅读 · 2014年12月31日

蛋白质结构类预测中的特征信息提取与分类算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

Multi-Integration of Labels across Categories for Component Identification (MILCCI)

Arxiv

0+阅读 · 2月4日

Bayesian Additive Regression Trees for functional ANOVA model

Arxiv

0+阅读 · 2月4日

Layered Modal ML: Syntax and Full Abstraction

Arxiv

0+阅读 · 2月3日

Multivariate Bayesian Last Layer for Regression with Uncertainty Quantification and Decomposition

Arxiv

0+阅读 · 1月30日

Hierarchical Text Classification with LLM-Refined Taxonomies

Arxiv

0+阅读 · 1月26日

Variable Splitting Binary Tree Models Based on Bayesian Context Tree Models for Time Series Segmentation

Arxiv

0+阅读 · 1月22日

InstructTime++: Time Series Classification with Multimodal Language Modeling via Implicit Feature Enhancement

Arxiv

0+阅读 · 1月21日

Bayesian Additive Regression Tree Copula Processes for Scalable Distributional Prediction

Arxiv

0+阅读 · 1月13日

Noise-Adaptive Regularization for Robust Multi-Label Remote Sensing Image Classification

Arxiv

0+阅读 · 1月13日

Bayesian Additive Regression Tree Copula Processes for Scalable Distributional Prediction

Arxiv

0+阅读 · 1月8日

VIP会员

文章信息

相关主题

多标签分类

最新内容

【斯坦福博士论文】语言模型的机械可解释性与控制

【斯坦福博士论文】语言模型的机械可解释性与控制

专知会员服务

0+阅读 · 今天13:13

大语言模型智能体长期记忆安全性综述：迈向记忆主权

大语言模型智能体长期记忆安全性综述：迈向记忆主权

专知会员服务

0+阅读 · 今天13:08

美军被摧毁的空战装备：伊朗战争如何重创美国空中力量

美军被摧毁的空战装备：伊朗战争如何重创美国空中力量

专知会员服务

3+阅读 · 今天7:11

人工智能赋能无人机：俄乌战争（万字长文）

人工智能赋能无人机：俄乌战争（万字长文）

专知会员服务

5+阅读 · 今天6:56

国外海军作战管理系统与作战训练系统

国外海军作战管理系统与作战训练系统

专知会员服务

2+阅读 · 今天4:16

美军条令《海军陆战队规划流程（2026版）》

美军条令《海军陆战队规划流程（2026版）》

专知会员服务

10+阅读 · 今天3:36

《压缩式分布式交互仿真标准》120页

《压缩式分布式交互仿真标准》120页

专知会员服务

4+阅读 · 今天3:21

《电子战数据交换模型研究报告》

《电子战数据交换模型研究报告》

专知会员服务

6+阅读 · 今天3:13

美军运用水下无人机与机器人系统竞速清除霍尔木兹海峡水雷

美军运用水下无人机与机器人系统竞速清除霍尔木兹海峡水雷

专知会员服务

4+阅读 · 今天2:55

《基于Transformer的异常舰船导航识别与跟踪》80页

《基于Transformer的异常舰船导航识别与跟踪》80页

专知会员服务

8+阅读 · 今天2:45

《美国太空系统司令部实验室原型作战管理系统的数据与决策可追溯性》

《美国太空系统司令部实验室原型作战管理系统的数据与决策可追溯性》

专知会员服务

6+阅读 · 今天2:41

《低数据领域军事目标检测模型研究》

《低数据领域军事目标检测模型研究》

专知会员服务

6+阅读 · 今天2:37

《为韧性而设计：在战略不确定时代提升军事空军基地的生存能力》

《为韧性而设计：在战略不确定时代提升军事空军基地的生存能力》

专知会员服务

6+阅读 · 今天2:32

【CMU博士论文】物理世界的视觉感知与深度理解

【CMU博士论文】物理世界的视觉感知与深度理解

专知会员服务

10+阅读 · 4月22日

多智能体系统：从经典范式到大基础模型驱动的未来

多智能体系统：从经典范式到大基础模型驱动的未来

专知会员服务

18+阅读 · 4月22日

相关VIP内容

《不完全多标签学习综述：最新进展与未来趋势》

《不完全多标签学习综述：最新进展与未来趋势》

专知会员服务

26+阅读 · 2024年6月11日

《深度学习多标签学习》最新综述

《深度学习多标签学习》最新综述

专知会员服务

47+阅读 · 2024年1月31日

监督和半监督学习下的多标签分类综述

监督和半监督学习下的多标签分类综述

专知会员服务

46+阅读 · 2022年8月3日

【ICML2021】基于稀疏标签编码的多维分类

专知会员服务

15+阅读 · 2021年9月29日

多标签文本分类研究进展

专知会员服务

40+阅读 · 2021年5月18日

【WWW2021】大规模层次结构中的元数据感知文本分类

专知会员服务

17+阅读 · 2021年2月17日

注意力图神经网络的多标签文本分类

注意力图神经网络的多标签文本分类

专知会员服务

112+阅读 · 2020年3月28日

【AAAI2020】多模态注意力语义图嵌入多标签分类（Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification）

【AAAI2020】多模态注意力语义图嵌入多标签分类（Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification）

专知会员服务

92+阅读 · 2019年12月22日

【KDD2019|讲座推荐】成本敏感多类多标签分类研究进展：Advances in Cost-sensitive Multiclass and Multilabel Classification

【KDD2019|讲座推荐】成本敏感多类多标签分类研究进展：Advances in Cost-sensitive Multiclass and Multilabel Classification

专知会员服务

20+阅读 · 2019年12月9日

【ECML-PKDD 2019】可解释序列分类的背景知识注入（Background Knowledge Injection forInterpretable Sequence Classification）

【ECML-PKDD 2019】可解释序列分类的背景知识注入（Background Knowledge Injection forInterpretable Sequence Classification）

专知会员服务

15+阅读 · 2019年12月3日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型智能体长期记忆安全性综述：迈向记忆主权

人工智能赋能无人机：俄乌战争（万字长文）

【斯坦福博士论文】语言模型的机械可解释性与控制

美军被摧毁的空战装备：伊朗战争如何重创美国空中力量

相关资讯

ICCV 2019 论文解读：用图神经网络改善视频的多标签分类

ICCV 2019 论文解读：用图神经网络改善视频的多标签分类

AI科技评论

11+阅读 · 2019年11月28日

【资源】NLP多标签文本分类代码实现工具包

【资源】NLP多标签文本分类代码实现工具包

专知

40+阅读 · 2019年11月20日

周志华团队：深度森林挑战多标签学习，9大数据集超越传统方法

周志华团队：深度森林挑战多标签学习，9大数据集超越传统方法

新智元

18+阅读 · 2019年11月20日

标签间相关性在多标签分类问题中的应用

标签间相关性在多标签分类问题中的应用

人工智能前沿讲习班

23+阅读 · 2019年6月5日

BAM！利用知识蒸馏和多任务学习构建的通用语言模型

BAM！利用知识蒸馏和多任务学习构建的通用语言模型

机器之心

15+阅读 · 2019年3月18日

【干货】用BRET进行多标签文本分类（附代码）

【干货】用BRET进行多标签文本分类（附代码）

专知

276+阅读 · 2019年2月9日

【GitHub项目推荐】文本分类最好的几个深度学习方法 TensorFlow 实践

【GitHub项目推荐】文本分类最好的几个深度学习方法 TensorFlow 实践

专知

39+阅读 · 2018年11月27日

手把手教你用Keras进行多标签分类（附代码）

手把手教你用Keras进行多标签分类（附代码）

数据派THU

11+阅读 · 2018年7月17日

深度学习文本分类方法综述（代码）

深度学习文本分类方法综述（代码）

中国人工智能学会

28+阅读 · 2018年6月16日

一文读懂贝叶斯分类算法（附学习资源）

一文读懂贝叶斯分类算法（附学习资源）

大数据文摘

12+阅读 · 2017年12月14日

相关论文

Multi-Integration of Labels across Categories for Component Identification (MILCCI)

Arxiv

0+阅读 · 2月4日

Bayesian Additive Regression Trees for functional ANOVA model

Arxiv

0+阅读 · 2月4日

Layered Modal ML: Syntax and Full Abstraction

Arxiv

0+阅读 · 2月3日

Multivariate Bayesian Last Layer for Regression with Uncertainty Quantification and Decomposition

Arxiv

0+阅读 · 1月30日

Hierarchical Text Classification with LLM-Refined Taxonomies

Arxiv

0+阅读 · 1月26日

Variable Splitting Binary Tree Models Based on Bayesian Context Tree Models for Time Series Segmentation

Arxiv

0+阅读 · 1月22日

InstructTime++: Time Series Classification with Multimodal Language Modeling via Implicit Feature Enhancement

Arxiv

0+阅读 · 1月21日

Bayesian Additive Regression Tree Copula Processes for Scalable Distributional Prediction

Arxiv

0+阅读 · 1月13日

Noise-Adaptive Regularization for Robust Multi-Label Remote Sensing Image Classification

Arxiv

0+阅读 · 1月13日

Bayesian Additive Regression Tree Copula Processes for Scalable Distributional Prediction

Arxiv

0+阅读 · 1月8日

相关基金

基于分类能力结构度量与类相关性关系保留的特征选取方法研究

国家自然科学基金

1+阅读 · 2017年12月31日

多标记文本数据流分类方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于形状信息和结果反馈的多图谱图像分割方法

国家自然科学基金

0+阅读 · 2015年12月31日

基于多样化查询的多标记主动学习研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于概率语义分析的多关系图多类标分类方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

模糊认知集群优化的聚类算法

国家自然科学基金

9+阅读 · 2015年12月31日

基于异构信息网络的分类算法推荐方法研究

国家自然科学基金

7+阅读 · 2015年12月31日

对具有非平衡多标签特性的蛋白质功能类型分类预测研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于狄利克雷过程的潜变量模型贝叶斯半参数分析

国家自然科学基金

2+阅读 · 2014年12月31日

蛋白质结构类预测中的特征信息提取与分类算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员