(Sets of ) Complement Scattered Factors - 专知论文

会员服务 ·

0

因子 · 算法 · 包含 · 嵌入 · 构建 ·

(Sets of ) Complement Scattered Factors

翻译：（组的）互补分散因子

Duncan Adamson,Pamela Fleischmann,Annika Huch

Starting in the 1970s with the fundamental work of Imre Simon, \emph{scattered factors} (also known as subsequences or scattered subwords) have remained a consistently and heavily studied object. The majority of work on scattered factors can be split into two broad classes of problems: given a word, what information, in the form of scattered factors, are contained, and which are not. In this paper, we consider an intermediary problem, introducing the notion of \emph{complement scattered factors}. Given a word $w$ and a scattered factor $u$ of $w$, the complement scattered factors of $w$ with regards to $u$, $C(w, u)$, is the set of scattered factors in $w$ that can be formed by removing any embedding of $u$ from $w$. This is closely related to the \emph{shuffle} operation in which two words are intertwined, i.e., we extend previous work relating to the shuffle operator, using knowledge about scattered factors. Alongside introducing these sets, we provide combinatorial results on the size of the set $C(w, u)$, an algorithm to compute the set $C(w, u)$ from $w$ and $u$ in $O(\vert w \vert \cdot \vert u \vert \binom{w}{u})$ time, where $\binom{w}{u}$ denotes the number of embeddings of $u$ into $w$, an algorithm to construct $u$ from $w$ and $C(w, u)$ in $O(\vert w \vert^2 \binom{\vert w \vert}{\vert w \vert - \vert u \vert})$ time, and an algorithm to construct $w$ from $u$ and $C(w, u)$ in $O(\vert u \vert \cdot \vert w \vert^{\vert u \vert + 1})$ time.

翻译：自20世纪70年代Imre Simon的开创性工作以来，分散因子（亦称子序列或分散子词）一直是持续且深入研究的对象。关于分散因子的研究主要可分为两大类问题：给定一个词，哪些分散因子信息包含其中，哪些不包含。本文研究了一个中间问题，引入了“互补分散因子”的概念。给定词$w$和$w$的一个分散因子$u$，$w$关于$u$的互补分散因子$C(w, u)$定义为从$w$中移除$u$的任意嵌入后所能形成的所有分散因子的集合。该概念与两个词交织的“洗牌”操作紧密相关，即我们利用关于分散因子的知识，扩展了先前关于洗牌算子的研究。在引入这些集合的同时，我们提供了关于集合$C(w, u)$大小的组合结果，以及一个算法：从$w$和$u$计算$C(w, u)$，时间复杂度为$O(\vert w \vert \cdot \vert u \vert \binom{w}{u})$，其中$\binom{w}{u}$表示$u$在$w$中的嵌入数量；一个算法：从$w$和$C(w, u)$构建$u$，时间复杂度为$O(\vert w \vert^2 \binom{\vert w \vert}{\vert w \vert - \vert u \vert})$；以及一个算法：从$u$和$C(w, u)$构建$w$，时间复杂度为$O(\vert u \vert \cdot \vert w \vert^{\vert u \vert + 1})$。

0

相关内容

CVPR 2026教程｜扩散模型原理：连续、离散与实时生成

CVPR 2026教程｜扩散模型原理：连续、离散与实时生成

专知会员服务

10+阅读 · 6月11日

【牛津大学博士论文】机器学习中的组合性和函数不变量，224页pdf

【牛津大学博士论文】机器学习中的组合性和函数不变量，224页pdf

专知会员服务

45+阅读 · 2023年3月25日

【博士论文】具有关系和上下文信息的因子分解模型，178页pdf

专知会员服务

35+阅读 · 2021年9月13日

【MPG & MILA 】因果表示学习，Towards Causal Representation Learning

专知会员服务

52+阅读 · 2021年7月29日

【KDD2021】具有残差独立性的可微分因果发现

专知会员服务

35+阅读 · 2021年7月1日

结合领域知识的因子分析: 在金融风险模型上的应用

专知会员服务

31+阅读 · 2021年2月7日

可解释高效异构图卷积网络，Interpretable and Efficient Heterogeneous Graph Convolutional Network

可解释高效异构图卷积网络，Interpretable and Efficient Heterogeneous Graph Convolutional Network

专知会员服务

63+阅读 · 2020年7月12日

【KDD2020】CAST:一种基于相关关系的多尺度数据自适应光谱聚类算法,CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

【KDD2020】CAST:一种基于相关关系的多尺度数据自适应光谱聚类算法,CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

专知会员服务

20+阅读 · 2020年6月11日

【MIT】生成模型提出的分子的可合成性，48页pdf,The Synthesizability of Molecules Proposed by Generative Models

【MIT】生成模型提出的分子的可合成性，48页pdf,The Synthesizability of Molecules Proposed by Generative Models

专知会员服务

28+阅读 · 2020年2月20日

【NeurIPS2019教程】机器学习中的组合性（Compositionality In Machine Learning）

【NeurIPS2019教程】机器学习中的组合性（Compositionality In Machine Learning）

专知会员服务

17+阅读 · 2019年12月16日

【AAAI2021】对比聚类，Contrastive Clustering

【AAAI2021】对比聚类，Contrastive Clustering

专知

26+阅读 · 2021年1月30日

从模型到应用，一文读懂因子分解机

从模型到应用，一文读懂因子分解机

AI100

10+阅读 · 2019年9月6日

【初学者系列】Factorization Machines 因子分解机详解

【初学者系列】Factorization Machines 因子分解机详解

专知

37+阅读 · 2019年8月17日

【论文】Awesome Relation Classification Paper（关系分类）（PART II）

【论文】Awesome Relation Classification Paper（关系分类）（PART II）

AINLP

15+阅读 · 2019年8月12日

面试题：数组中子序列的个数

面试题：数组中子序列的个数

七月在线实验室

15+阅读 · 2019年6月26日

图分类：结合胶囊网络Capsule和图卷积GCN（附代码）

图分类：结合胶囊网络Capsule和图卷积GCN（附代码）

中国人工智能学会

36+阅读 · 2019年2月26日

详解GAN的谱归一化（Spectral Normalization）

详解GAN的谱归一化（Spectral Normalization）

PaperWeekly

11+阅读 · 2019年2月13日

相关性≠因果：概率图模型和do-calculus

相关性≠因果：概率图模型和do-calculus

论智

31+阅读 · 2018年10月29日

【学界】融合对抗学习的因果关系抽取

【学界】融合对抗学习的因果关系抽取

GAN生成式对抗网络

16+阅读 · 2018年7月14日

最新｜深度离散哈希算法，可用于图像检索！

最新｜深度离散哈希算法，可用于图像检索！

全球人工智能

14+阅读 · 2017年12月15日

结构矩阵线性互补问题的模系矩阵分裂迭代方法

国家自然科学基金

0+阅读 · 2015年12月31日

分数次椭圆型方程解的集中现象

国家自然科学基金

0+阅读 · 2015年12月31日

IVA族和IIIA-VA族半导体复合材料结构性质与弱相互作用机制的理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

扩散过程离散化形式下的若干统计问题的大偏差原理

国家自然科学基金

0+阅读 · 2014年12月31日

曲率，第二基本形式与几何算子的相似性的研究

国家自然科学基金

2+阅读 · 2014年12月31日

积分微分方程和反常扩散问题的高效谱方法

国家自然科学基金

0+阅读 · 2014年12月31日

面向基因组相关性研究的迁移学习理论与方法

国家自然科学基金

0+阅读 · 2014年12月31日

某些分形集上拉普拉斯算子的谱分析及相关问题

国家自然科学基金

0+阅读 · 2014年12月31日

迭代函数系的分离条件及其应用

国家自然科学基金

0+阅读 · 2014年12月31日

含有隐变量的因果结构学习与统计因果推断

国家自然科学基金

21+阅读 · 2013年12月31日

ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

Arxiv

0+阅读 · 4月29日

Complementarity by Construction: A Lie-Group Approach to Solving Quadratic Programs with Linear Complementarity Constraints

Arxiv

0+阅读 · 4月27日

Divergence-Guided Particle Swarm Optimization

Arxiv

0+阅读 · 4月13日

Causality-Based Scores Alignment in Explainable Data Management

Arxiv

0+阅读 · 4月3日

Attribution of Spurious Factors from High-Dimensional Functional Time Series

Arxiv

0+阅读 · 3月27日

Refactor Analysis: Predictive Evaluations of Factor Models and Dimensionality

Arxiv

0+阅读 · 3月24日

Communication Complexity of Disjointness under Product Distributions

Arxiv

0+阅读 · 3月19日

Factor Dimensionality and the Bias-Variance Tradeoff in Diffusion Portfolio Models

Arxiv

0+阅读 · 3月11日

Rethinking Disentanglement under Dependent Factors of Variation

Arxiv

0+阅读 · 2月24日

Stochastic Discount Factors with Cross-Asset Spillovers

Arxiv

0+阅读 · 2月24日

VIP会员

文章信息

相关主题

最新内容

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

3+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

4+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

5+阅读 · 6月22日

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

4+阅读 · 6月22日

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

4+阅读 · 6月22日

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

4+阅读 · 6月22日

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

4+阅读 · 6月22日

美国从乌克兰无人机战争中学习经验

美国从乌克兰无人机战争中学习经验

专知会员服务

7+阅读 · 6月21日

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

专知会员服务

5+阅读 · 6月21日

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

专知会员服务

8+阅读 · 6月21日

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

专知会员服务

21+阅读 · 6月20日

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

专知会员服务

5+阅读 · 6月19日

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

专知会员服务

8+阅读 · 6月19日

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

专知会员服务

7+阅读 · 6月18日

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

专知会员服务

9+阅读 · 6月18日

相关VIP内容

CVPR 2026教程｜扩散模型原理：连续、离散与实时生成

CVPR 2026教程｜扩散模型原理：连续、离散与实时生成

专知会员服务

10+阅读 · 6月11日

【牛津大学博士论文】机器学习中的组合性和函数不变量，224页pdf

【牛津大学博士论文】机器学习中的组合性和函数不变量，224页pdf

专知会员服务

45+阅读 · 2023年3月25日

【博士论文】具有关系和上下文信息的因子分解模型，178页pdf

专知会员服务

35+阅读 · 2021年9月13日

【MPG & MILA 】因果表示学习，Towards Causal Representation Learning

专知会员服务

52+阅读 · 2021年7月29日

【KDD2021】具有残差独立性的可微分因果发现

专知会员服务

35+阅读 · 2021年7月1日

结合领域知识的因子分析: 在金融风险模型上的应用

专知会员服务

31+阅读 · 2021年2月7日

可解释高效异构图卷积网络，Interpretable and Efficient Heterogeneous Graph Convolutional Network

可解释高效异构图卷积网络，Interpretable and Efficient Heterogeneous Graph Convolutional Network

专知会员服务

63+阅读 · 2020年7月12日

【KDD2020】CAST:一种基于相关关系的多尺度数据自适应光谱聚类算法,CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

【KDD2020】CAST:一种基于相关关系的多尺度数据自适应光谱聚类算法,CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

专知会员服务

20+阅读 · 2020年6月11日

【MIT】生成模型提出的分子的可合成性，48页pdf,The Synthesizability of Molecules Proposed by Generative Models

【MIT】生成模型提出的分子的可合成性，48页pdf,The Synthesizability of Molecules Proposed by Generative Models

专知会员服务

28+阅读 · 2020年2月20日

【NeurIPS2019教程】机器学习中的组合性（Compositionality In Machine Learning）

【NeurIPS2019教程】机器学习中的组合性（Compositionality In Machine Learning）

专知会员服务

17+阅读 · 2019年12月16日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 3D场景图：开放挑战与未来方向

21世纪的无人机战争

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

相关资讯

【AAAI2021】对比聚类，Contrastive Clustering

【AAAI2021】对比聚类，Contrastive Clustering

专知

26+阅读 · 2021年1月30日

从模型到应用，一文读懂因子分解机

从模型到应用，一文读懂因子分解机

AI100

10+阅读 · 2019年9月6日

【初学者系列】Factorization Machines 因子分解机详解

【初学者系列】Factorization Machines 因子分解机详解

专知

37+阅读 · 2019年8月17日

【论文】Awesome Relation Classification Paper（关系分类）（PART II）

【论文】Awesome Relation Classification Paper（关系分类）（PART II）

AINLP

15+阅读 · 2019年8月12日

面试题：数组中子序列的个数

面试题：数组中子序列的个数

七月在线实验室

15+阅读 · 2019年6月26日

图分类：结合胶囊网络Capsule和图卷积GCN（附代码）

图分类：结合胶囊网络Capsule和图卷积GCN（附代码）

中国人工智能学会

36+阅读 · 2019年2月26日

详解GAN的谱归一化（Spectral Normalization）

详解GAN的谱归一化（Spectral Normalization）

PaperWeekly

11+阅读 · 2019年2月13日

相关性≠因果：概率图模型和do-calculus

相关性≠因果：概率图模型和do-calculus

论智

31+阅读 · 2018年10月29日

【学界】融合对抗学习的因果关系抽取

【学界】融合对抗学习的因果关系抽取

GAN生成式对抗网络

16+阅读 · 2018年7月14日

最新｜深度离散哈希算法，可用于图像检索！

最新｜深度离散哈希算法，可用于图像检索！

全球人工智能

14+阅读 · 2017年12月15日

相关论文

ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

Arxiv

0+阅读 · 4月29日

Complementarity by Construction: A Lie-Group Approach to Solving Quadratic Programs with Linear Complementarity Constraints

Arxiv

0+阅读 · 4月27日

Divergence-Guided Particle Swarm Optimization

Arxiv

0+阅读 · 4月13日

Causality-Based Scores Alignment in Explainable Data Management

Arxiv

0+阅读 · 4月3日

Attribution of Spurious Factors from High-Dimensional Functional Time Series

Arxiv

0+阅读 · 3月27日

Refactor Analysis: Predictive Evaluations of Factor Models and Dimensionality

Arxiv

0+阅读 · 3月24日

Communication Complexity of Disjointness under Product Distributions

Arxiv

0+阅读 · 3月19日

Factor Dimensionality and the Bias-Variance Tradeoff in Diffusion Portfolio Models

Arxiv

0+阅读 · 3月11日

Rethinking Disentanglement under Dependent Factors of Variation

Arxiv

0+阅读 · 2月24日

Stochastic Discount Factors with Cross-Asset Spillovers

Arxiv

0+阅读 · 2月24日

相关基金

结构矩阵线性互补问题的模系矩阵分裂迭代方法

国家自然科学基金

0+阅读 · 2015年12月31日

分数次椭圆型方程解的集中现象

国家自然科学基金

0+阅读 · 2015年12月31日

IVA族和IIIA-VA族半导体复合材料结构性质与弱相互作用机制的理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

扩散过程离散化形式下的若干统计问题的大偏差原理

国家自然科学基金

0+阅读 · 2014年12月31日

曲率，第二基本形式与几何算子的相似性的研究

国家自然科学基金

2+阅读 · 2014年12月31日

积分微分方程和反常扩散问题的高效谱方法

国家自然科学基金

0+阅读 · 2014年12月31日

面向基因组相关性研究的迁移学习理论与方法

国家自然科学基金

0+阅读 · 2014年12月31日

某些分形集上拉普拉斯算子的谱分析及相关问题

国家自然科学基金

0+阅读 · 2014年12月31日

迭代函数系的分离条件及其应用

国家自然科学基金

0+阅读 · 2014年12月31日

含有隐变量的因果结构学习与统计因果推断

国家自然科学基金

21+阅读 · 2013年12月31日

微信扫码咨询专知VIP会员