Cross-Study Replicability in Cluster Analysis - 专知论文

会员服务 ·

0

簇 · 聚类分析 · cancer · Analysis · 可辨认的 ·

2023 年 5 月 9 日

Cross-Study Replicability in Cluster Analysis

翻译：聚类分析中的跨研究可复现性

Lorenzo Masoero,Emma Thomas,Giovanni Parmigiani,Svitlana Tyekucheva,Lorenzo Trippa

from arxiv, Accepted for publication in Statistical Science

In cancer research, clustering techniques are widely used for exploratory analyses and dimensionality reduction, playing a critical role in the identification of novel cancer subtypes, often with direct implications for patient management. As data collected by multiple research groups grows, it is increasingly feasible to investigate the replicability of clustering procedures, that is, their ability to consistently recover biologically meaningful clusters across several datasets. In this paper, we review existing methods to assess replicability of clustering analyses, and discuss a framework for evaluating cross-study clustering replicability, useful when two or more studies are available. These approaches can be applied to any clustering algorithm and can employ different measures of similarity between partitions to quantify replicability, globally (i.e. for the whole sample) as well as locally (i.e. for individual clusters). Using experiments on synthetic and real gene expression data, we illustrate the utility of replicability metrics to evaluate if the same clusters are identified consistently across a collection of datasets.

翻译：在癌症研究中，聚类技术被广泛用于探索性分析和降维，在识别新型癌症亚型中发挥着关键作用，通常对患者管理具有直接影响。随着多个研究团队收集的数据不断增长，研究聚类流程的可复现性——即其在多个数据集中一致地恢复具有生物学意义的聚类的能力——变得越来越可行。本文回顾了评估聚类分析可复现性的现有方法，并讨论了一个用于评估跨研究聚类可复现性的框架，该框架在两个或多个研究可用时尤为实用。这些方法可适用于任何聚类算法，并能采用不同的划分间相似性度量来全局（即针对整个样本）和局部（即针对单个聚类）量化可复现性。通过在合成和真实基因表达数据上的实验，我们展示了可复现性指标在评估同一聚类是否能在多个数据集中被一致性识别时的实用性。

0

相关内容

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

94+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

60+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

80+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

106+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

基于p-NiO/CH3NH3PbI3/n-ZnO简易三明治结构钙钛矿太阳电池的界面调控与性能优化

国家自然科学基金

0+阅读 · 2014年12月31日

基于一维有序TiO2纳米阵列的全无机耗尽体相异质结量子点太阳能电池的结构构筑和性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

准一维ZnO纳米材料/聚合物复合异质结发光器件的构筑及性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

聚集诱导发光功能化的柔性金属-有机骨架物在小分子传感中的应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

含锂石榴石薄膜材料的制备和离子传导研究

国家自然科学基金

0+阅读 · 2013年12月31日

本征空位缺陷态Ca2Nb2O7薄膜的可控制备及d0磁性研究

国家自然科学基金

0+阅读 · 2013年12月31日

过掺杂补偿缺陷与异质结能级对纳米ZnO气敏特性调节机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

石榴石相LuAG:Ce(Pr)闪烁晶体的缺陷控制和性能优化

国家自然科学基金

0+阅读 · 2012年12月31日

缺陷和非磁性元素掺杂诱导的氮化镓纳米结构磁性机理及调控研究

国家自然科学基金

0+阅读 · 2011年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

Multivariate Rank-Based Analysis of Multiple Endpoints in Clinical Trials: A Global Test Approach

Arxiv

0+阅读 · 2023年6月28日

Replicable Reinforcement Learning

Arxiv

0+阅读 · 2023年6月27日

The Dual PC Algorithm and the Role of Gaussianity for Structure Learning of Bayesian Networks

Arxiv

0+阅读 · 2023年6月27日

General multiple tests for functional data

Arxiv

0+阅读 · 2023年6月27日

Cross-Attention is Not Enough: Incongruity-Aware Hierarchical Multimodal Sentiment Analysis and Emotion Recognition

Arxiv

0+阅读 · 2023年6月27日

Challenges and Opportunities of Shapley values in a Clinical Context

Arxiv

0+阅读 · 2023年6月26日

A nonparametrically corrected likelihood for Bayesian spectral analysis of multivariate time series

Arxiv

0+阅读 · 2023年6月23日

Trading-off price for data quality to achieve fair online allocation

Arxiv

0+阅读 · 2023年6月23日

Bayesian Deep Learning for Graphs

Arxiv

23+阅读 · 2022年2月24日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

18+阅读 · 2019年10月30日

VIP会员

文章信息

相关主题

最新内容

《无人系统互操作性导论——无人系统联合架构（JAUS）》

《无人系统互操作性导论——无人系统联合架构（JAUS）》

专知会员服务

5+阅读 · 今天5:53

美空军新型反无人机部队初探

美空军新型反无人机部队初探

专知会员服务

1+阅读 · 今天5:45

《对抗性电磁环境下远程巡飞弹作战的安全指挥与控制数据链》

《对抗性电磁环境下远程巡飞弹作战的安全指挥与控制数据链》

专知会员服务

2+阅读 · 今天5:23

《北约下一代建模与仿真（NexGen M&S）计划》2026年69页

《北约下一代建模与仿真（NexGen M&S）计划》2026年69页

专知会员服务

1+阅读 · 今天5:11

《防空交战流程的概率建模研究》

《防空交战流程的概率建模研究》

专知会员服务

4+阅读 · 今天5:04

ICML 2026 教程 | 数值优化理论还重要吗？

ICML 2026 教程 | 数值优化理论还重要吗？

专知会员服务

4+阅读 · 7月26日

ICM 2026 | 陶哲轩：人工智能时代的数学

ICM 2026 | 陶哲轩：人工智能时代的数学

专知会员服务

7+阅读 · 7月26日

《面向可扩展高韧性无人机集群网络的速度感知分层通信框架》

《面向可扩展高韧性无人机集群网络的速度感知分层通信框架》

专知会员服务

7+阅读 · 7月26日

《面向概率推理的可定制战术引擎及其在军事任务规划中的应用》

《面向概率推理的可定制战术引擎及其在军事任务规划中的应用》

专知会员服务

9+阅读 · 7月26日

《先进防空系统选型战略框架：基于巴基斯坦的实证启示》

《先进防空系统选型战略框架：基于巴基斯坦的实证启示》

专知会员服务

8+阅读 · 7月26日

《反无人机交战场景下的战斗归零研究》

《反无人机交战场景下的战斗归零研究》

专知会员服务

7+阅读 · 7月26日

霍尔木兹与不对称作战时代：水雷、无人系统与海军力量的重新定义

霍尔木兹与不对称作战时代：水雷、无人系统与海军力量的重新定义

专知会员服务

4+阅读 · 7月26日

博士论文 | 用代码结构感知方法推进代码大模型

博士论文 | 用代码结构感知方法推进代码大模型

专知会员服务

5+阅读 · 7月25日

综述 | 遥感多模态大模型：领域专用还是通用模型？

综述 | 遥感多模态大模型：领域专用还是通用模型？

专知会员服务

5+阅读 · 7月25日

《面向指挥控制训练与实时北约兼容数据分发的战术模拟器》

《面向指挥控制训练与实时北约兼容数据分发的战术模拟器》

专知会员服务

5+阅读 · 7月25日

相关VIP内容

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

94+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

60+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

80+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

106+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

美空军新型反无人机部队初探

《北约下一代建模与仿真（NexGen M&S）计划》2026年69页

《无人系统互操作性导论——无人系统联合架构（JAUS）》

《对抗性电磁环境下远程巡飞弹作战的安全指挥与控制数据链》

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

Multivariate Rank-Based Analysis of Multiple Endpoints in Clinical Trials: A Global Test Approach

Arxiv

0+阅读 · 2023年6月28日

Replicable Reinforcement Learning

Arxiv

0+阅读 · 2023年6月27日

The Dual PC Algorithm and the Role of Gaussianity for Structure Learning of Bayesian Networks

Arxiv

0+阅读 · 2023年6月27日

General multiple tests for functional data

Arxiv

0+阅读 · 2023年6月27日

Cross-Attention is Not Enough: Incongruity-Aware Hierarchical Multimodal Sentiment Analysis and Emotion Recognition

Arxiv

0+阅读 · 2023年6月27日

Challenges and Opportunities of Shapley values in a Clinical Context

Arxiv

0+阅读 · 2023年6月26日

A nonparametrically corrected likelihood for Bayesian spectral analysis of multivariate time series

Arxiv

0+阅读 · 2023年6月23日

Trading-off price for data quality to achieve fair online allocation

Arxiv

0+阅读 · 2023年6月23日

Bayesian Deep Learning for Graphs

Arxiv

23+阅读 · 2022年2月24日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

18+阅读 · 2019年10月30日

相关基金

基于p-NiO/CH3NH3PbI3/n-ZnO简易三明治结构钙钛矿太阳电池的界面调控与性能优化

国家自然科学基金

0+阅读 · 2014年12月31日

基于一维有序TiO2纳米阵列的全无机耗尽体相异质结量子点太阳能电池的结构构筑和性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

准一维ZnO纳米材料/聚合物复合异质结发光器件的构筑及性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

聚集诱导发光功能化的柔性金属-有机骨架物在小分子传感中的应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

含锂石榴石薄膜材料的制备和离子传导研究

国家自然科学基金

0+阅读 · 2013年12月31日

本征空位缺陷态Ca2Nb2O7薄膜的可控制备及d0磁性研究

国家自然科学基金

0+阅读 · 2013年12月31日

过掺杂补偿缺陷与异质结能级对纳米ZnO气敏特性调节机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

石榴石相LuAG:Ce(Pr)闪烁晶体的缺陷控制和性能优化

国家自然科学基金

0+阅读 · 2012年12月31日

缺陷和非磁性元素掺杂诱导的氮化镓纳米结构磁性机理及调控研究

国家自然科学基金

0+阅读 · 2011年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员