We propose a method to explore the flavor structure of quarks and leptons with reinforcement learning. As a concrete model, we utilize a basic value-based algorithm for models with $U(1)$ flavor symmetry. By training neural networks on the $U(1)$ charges of quarks and leptons, the agent finds 21 models to be consistent with experimentally measured masses and mixing angles of quarks and leptons. In particular, an intrinsic value of normal ordering tends to be larger than that of inverted ordering, and the normal ordering is well fitted with the current experimental data in contrast to the inverted ordering. A specific value of effective mass for the neutrinoless double beta decay and a sizable leptonic CP violation induced by an angular component of flavon field are predicted by autonomous behavior of the agent. Our finding results indicate that the reinforcement learning can be a new method for understanding the flavor structure.
翻译:我们提出了一种利用强化学习探索夸克和轻子味结构的方法。作为具体模型,我们采用了一种基于值的基本算法来处理具有 $U(1)$ 味对称性的模型。通过训练神经网络处理夸克和轻子的 $U(1)$ 荷,该智能体找到了21个与实验测量的夸克和轻子质量及混合角一致的模型。特别地,正常排序的内在价值往往大于颠倒排序,且与颠倒排序相比,正常排序与当前实验数据拟合良好。通过智能体的自主行为,预测了无中微子双β衰变的有效质量的具体数值,以及由味子场角分量诱导的显著轻子CP破坏。我们的研究结果表明,强化学习可以成为理解味结构的一种新方法。