How to Estimate Model Transferability of Pre-Trained Speech Models? - 专知论文

会员服务 ·

0

估计/估计量 · MoDELS · 秩 · 得分 · 相互独立的 ·

2023 年 6 月 1 日

How to Estimate Model Transferability of Pre-Trained Speech Models?

翻译：如何评估预训练语音模型的迁移性？

Zih-Ching Chen,Chao-Han Huck Yang,Bo Li,Yu Zhang,Nanxin Chen,Shou-Yiin Chang,Rohit Prabhavalkar,Hung-yi Lee,Tara N. Sainath

from arxiv, Accepted to Interspeech. Code will be released

In this work, we introduce a ``score-based assessment'' framework for estimating the transferability of pre-trained speech models (PSMs) for fine-tuning target tasks. We leverage upon two representation theories, Bayesian likelihood estimation and optimal transport, to generate rank scores for the PSM candidates using the extracted representations. Our framework efficiently computes transferability scores without actual fine-tuning of candidate models or layers by making a temporal independent hypothesis. We evaluate some popular supervised speech models (e.g., Conformer RNN-Transducer) and self-supervised speech models (e.g., HuBERT) in cross-layer and cross-model settings using public data. Experimental results show a high Spearman's rank correlation and low $p$-value between our estimation framework and fine-tuning ground truth. Our proposed transferability framework requires less computational time and resources, making it a resource-saving and time-efficient approach for tuning speech foundation models.

翻译：在本工作中，我们提出了一种基于评分的评估框架，用于估计预训练语音模型（PSM）在目标任务微调中的迁移性。我们利用贝叶斯似然估计和最优传输两种表示理论，通过提取的表示为候选PSM生成排名分数。通过引入时间独立假设，该框架无需实际微调候选模型或层，即可高效计算迁移性评分。我们使用公开数据在跨层和跨模型设置下评估了若干主流监督语音模型（如Conformer RNN-Transducer）和自监督语音模型（如HuBERT）。实验结果表明，我们的评估框架与微调真实值之间具有较高的斯皮尔曼等级相关系数及较低的p值。该迁移性评估框架计算时间和资源需求较低，为语音基础模型的调优提供了一种节省资源和时间的高效方法。

0

相关内容

估计/估计量

估计/估计量

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

129+阅读 · 2020年11月20日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

60+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

关于若干模型泛函不等式及其应用的研究

国家自然科学基金

1+阅读 · 2015年12月31日

LncRNA介导肿瘤相关巨噬细胞促进乳腺癌转移分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

SMAD2调控ERK通路干预M2巨噬细胞活化在糖尿病肾病小鼠肾脏纤维化中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

核酸适配体结构与动力学的非线性光谱研究

国家自然科学基金

0+阅读 · 2015年12月31日

N-乙酰氨基葡萄糖转移酶V对间充质干细胞迁移、分化的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

太赫兹量子阱探测器偏振特性研究

国家自然科学基金

1+阅读 · 2013年12月31日

非均相聚合物流体的粘弹性介观模型及其相分离研究

国家自然科学基金

0+阅读 · 2012年12月31日

不同类型强心苷抗肿瘤活性的研究

国家自然科学基金

0+阅读 · 2009年12月31日

分数布朗运动环境下金融保险中优化问题的研究

国家自然科学基金

0+阅读 · 2009年12月31日

穿膜肽Penetratin及其衍生物的解离动力学研究

国家自然科学基金

0+阅读 · 2008年12月31日

UPPLIED: UAV Path Planning for Inspection through Demonstration

UPPLIED: UAV Path Planning for Inspection through Demonstration

Arxiv

0+阅读 · 2023年7月24日

AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation

Arxiv

0+阅读 · 2023年7月24日

TransFusion: Generating Long, High Fidelity Time Series using Diffusion Models with Transformers

Arxiv

0+阅读 · 2023年7月24日

One-Shot Device Testing Data Analysis under Logistic-Exponential Lifetimes with an Application to SEER Gallbladder Cancer Data

Arxiv

0+阅读 · 2023年7月24日

RED-PSM: Regularization by Denoising of Partially Separable Models for Dynamic Imaging

Arxiv

0+阅读 · 2023年7月24日

Transfer Learning and Bias Correction with Pre-trained Audio Embeddings

Arxiv

0+阅读 · 2023年7月20日

MotionBERT: A Unified Perspective on Learning Human Motion Representations

Arxiv

0+阅读 · 2023年7月20日

Improving Pre-trained Language Models' Generalization

Arxiv

0+阅读 · 2023年7月19日

Pre-Trained Models: Past, Present and Future

Arxiv

19+阅读 · 2021年6月15日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

VIP会员

文章信息

相关主题

估计/估计量

相互独立的

最新内容

博士论文 | 面向大模型推理的内存高效算法

博士论文 | 面向大模型推理的内存高效算法

专知会员服务

2+阅读 · 7月27日

论文解读 | 从预训练到后训练：理解大模型推理能力如何形成

论文解读 | 从预训练到后训练：理解大模型推理能力如何形成

专知会员服务

3+阅读 · 7月27日

《无人系统互操作性导论——无人系统联合架构（JAUS）》

《无人系统互操作性导论——无人系统联合架构（JAUS）》

专知会员服务

9+阅读 · 7月27日

美空军新型反无人机部队初探

美空军新型反无人机部队初探

专知会员服务

5+阅读 · 7月27日

《对抗性电磁环境下远程巡飞弹作战的安全指挥与控制数据链》

《对抗性电磁环境下远程巡飞弹作战的安全指挥与控制数据链》

专知会员服务

3+阅读 · 7月27日

《北约下一代建模与仿真（NexGen M&S）计划》2026年69页

《北约下一代建模与仿真（NexGen M&S）计划》2026年69页

专知会员服务

3+阅读 · 7月27日

《防空交战流程的概率建模研究》

《防空交战流程的概率建模研究》

专知会员服务

7+阅读 · 7月27日

ICML 2026 教程 | 数值优化理论还重要吗？

ICML 2026 教程 | 数值优化理论还重要吗？

专知会员服务

6+阅读 · 7月26日

ICM 2026 | 陶哲轩：人工智能时代的数学

ICM 2026 | 陶哲轩：人工智能时代的数学

专知会员服务

9+阅读 · 7月26日

《面向可扩展高韧性无人机集群网络的速度感知分层通信框架》

《面向可扩展高韧性无人机集群网络的速度感知分层通信框架》

专知会员服务

8+阅读 · 7月26日

《面向概率推理的可定制战术引擎及其在军事任务规划中的应用》

《面向概率推理的可定制战术引擎及其在军事任务规划中的应用》

专知会员服务

11+阅读 · 7月26日

《先进防空系统选型战略框架：基于巴基斯坦的实证启示》

《先进防空系统选型战略框架：基于巴基斯坦的实证启示》

专知会员服务

8+阅读 · 7月26日

《反无人机交战场景下的战斗归零研究》

《反无人机交战场景下的战斗归零研究》

专知会员服务

7+阅读 · 7月26日

霍尔木兹与不对称作战时代：水雷、无人系统与海军力量的重新定义

霍尔木兹与不对称作战时代：水雷、无人系统与海军力量的重新定义

专知会员服务

4+阅读 · 7月26日

博士论文 | 用代码结构感知方法推进代码大模型

博士论文 | 用代码结构感知方法推进代码大模型

专知会员服务

6+阅读 · 7月25日

相关VIP内容

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

129+阅读 · 2020年11月20日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

60+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

论文解读 | 从预训练到后训练：理解大模型推理能力如何形成

美空军新型反无人机部队初探

博士论文 | 面向大模型推理的内存高效算法

《无人系统互操作性导论——无人系统联合架构（JAUS）》

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

UPPLIED: UAV Path Planning for Inspection through Demonstration

UPPLIED: UAV Path Planning for Inspection through Demonstration

Arxiv

0+阅读 · 2023年7月24日

AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation

Arxiv

0+阅读 · 2023年7月24日

TransFusion: Generating Long, High Fidelity Time Series using Diffusion Models with Transformers

Arxiv

0+阅读 · 2023年7月24日

One-Shot Device Testing Data Analysis under Logistic-Exponential Lifetimes with an Application to SEER Gallbladder Cancer Data

Arxiv

0+阅读 · 2023年7月24日

RED-PSM: Regularization by Denoising of Partially Separable Models for Dynamic Imaging

Arxiv

0+阅读 · 2023年7月24日

Transfer Learning and Bias Correction with Pre-trained Audio Embeddings

Arxiv

0+阅读 · 2023年7月20日

MotionBERT: A Unified Perspective on Learning Human Motion Representations

Arxiv

0+阅读 · 2023年7月20日

Improving Pre-trained Language Models' Generalization

Arxiv

0+阅读 · 2023年7月19日

Pre-Trained Models: Past, Present and Future

Arxiv

19+阅读 · 2021年6月15日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

相关基金

关于若干模型泛函不等式及其应用的研究

国家自然科学基金

1+阅读 · 2015年12月31日

LncRNA介导肿瘤相关巨噬细胞促进乳腺癌转移分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

SMAD2调控ERK通路干预M2巨噬细胞活化在糖尿病肾病小鼠肾脏纤维化中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

核酸适配体结构与动力学的非线性光谱研究

国家自然科学基金

0+阅读 · 2015年12月31日

N-乙酰氨基葡萄糖转移酶V对间充质干细胞迁移、分化的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

太赫兹量子阱探测器偏振特性研究

国家自然科学基金

1+阅读 · 2013年12月31日

非均相聚合物流体的粘弹性介观模型及其相分离研究

国家自然科学基金

0+阅读 · 2012年12月31日

不同类型强心苷抗肿瘤活性的研究

国家自然科学基金

0+阅读 · 2009年12月31日

分数布朗运动环境下金融保险中优化问题的研究

国家自然科学基金

0+阅读 · 2009年12月31日

穿膜肽Penetratin及其衍生物的解离动力学研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员