On the Stability of a non-hyperbolic nonlinear map with non-bounded set of non-isolated fixed points with applications to Machine Learning

This paper deals with the convergence analysis of the SUCPA (Semi Unsupervised Calibration through Prior Adaptation) algorithm, defined from a first-order non-linear difference equations, first developed to correct the scores output by a supervised machine learning classifier. The convergence analysis is addressed as a dynamical system problem, by studying the local and global stability of the nonlinear map derived from the algorithm. This map, which is defined by a composition of exponential and rational functions, turns out to be non-hyperbolic with a non-bounded set of non-isolated fixed points. Hence, a non-standard method for solving the convergence analysis is used consisting of an ad-hoc geometrical approach. For a binary classification problem (two-dimensional map), we rigorously prove that the map is globally asymptotically stable. Numerical experiments on real-world application are performed to support the theoretical results by means of two different classification problems: Sentiment Polarity performed with a Large Language Model and Cat-Dog Image classification. For a greater number of classes, the numerical evidence shows the same behavior of the algorithm, and this is illustrated with a Natural Language Inference example. The experiment codes are publicly accessible online at the following repository: https://github.com/LautaroEst/sucpa-convergence

翻译：本文研究了SUCPA（通过先验自适应进行半无监督校准）算法的收敛性分析，该算法由一阶非线性差分方程定义，最初用于校正由监督机器学习分类器输出的分数。收敛性分析被作为动力系统问题处理，通过研究算法导出的非线性映射的局部与全局稳定性。该映射由指数函数与有理函数复合定义，呈现非双曲特性，且具有非孤立不动点的无界集。因此，本文采用了一种非标准方法进行收敛性分析，即一种特设的几何方法。针对二分类问题（二维映射），我们严格证明了该映射具有全局渐近稳定性。通过两种不同分类问题进行的真实世界应用数值实验支持了理论结果：使用大型语言模型进行的情感极性分类，以及猫狗图像分类。对于更多类别的情况，数值证据显示算法呈现相同行为，并通过自然语言推理示例加以说明。实验代码可通过以下仓库公开获取：https://github.com/LautaroEst/sucpa-convergence

相关内容

Machine Learning

关注 2249

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

AAAI2024 | 关于曲率多样性的探索和研究——结合motif的多曲率图卷积网络

专知会员服务

16+阅读 · 2024年4月14日

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

【AI应用】Facebook-利用神经网络求解高等数学方程, Using neural networks to solve advanced mathematics equations

专知会员服务

34+阅读 · 2020年1月15日