Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics

This survey paper outlines the key developments in the field of Large Language Models (LLMs), including enhancements to their reasoning skills, adaptability to various tasks, increased computational efficiency, and the ability to make ethical decisions. The techniques that have been most effective in bridging the gap between human and machine communications include the Chain-of-Thought prompting, Instruction Tuning, and Reinforcement Learning from Human Feedback. The improvements in multimodal learning and few-shot or zero-shot techniques have further empowered LLMs to handle complex jobs with minor input. A significant focus is placed on efficiency, detailing scaling strategies, optimization techniques, and the influential Mixture-of-Experts (MoE) architecture, which strategically routes inputs to specialized subnetworks to boost predictive accuracy, while optimizing resource allocation. This survey also offers a broader perspective on recent advancements in LLMs, going beyond isolated aspects such as model architecture or ethical concerns. Additionally, it explores the role of LLMs in Agentic AI and their use as Autonomous Decision-Making Systems, and categorizes emerging methods that enhance LLM reasoning, efficiency, and ethical alignment. The survey also identifies underexplored areas such as interpretability, cross-modal integration, and sustainability. While significant advancements have been made in LLMs, challenges such as high computational costs, biases, and ethical risks remain. Overcoming these requires a focus on bias mitigation, transparent decision-making, and explicit ethical guidelines. Future research will generally focus on enhancing the model's ability to handle multiple inputs, thereby making it more intelligent, safe, and reliable.

翻译：本综述论文概述了大型语言模型（LLMs）领域的关键进展，包括其推理能力的提升、对不同任务的适应性增强、计算效率的提高以及伦理决策能力的构建。在弥合人机沟通鸿沟方面最为有效的技术包括思维链提示、指令微调以及基于人类反馈的强化学习。多模态学习与小样本/零样本技术的进步进一步赋能大型语言模型，使其能够以少量输入处理复杂任务。研究特别关注效率问题，详细阐述了扩展策略、优化技术以及具有影响力的混合专家（MoE）架构——该架构通过策略性地将输入路由至专用子网络来提升预测精度，同时优化资源分配。本综述还提供了关于大型语言模型最新进展的更广阔视角，超越了模型架构或伦理问题等孤立层面。此外，文章探讨了大型语言模型在智能体人工智能中的作用及其作为自主决策系统的应用，并对提升LLM推理能力、效率及伦理对齐的新兴方法进行了分类。综述同时指出了尚未充分探索的领域，如可解释性、跨模态整合与可持续性。尽管大型语言模型已取得显著进展，但高计算成本、偏见及伦理风险等挑战依然存在。克服这些挑战需要聚焦于偏见缓解、透明化决策和明确的伦理准则。未来研究总体上将侧重于增强模型处理多模态输入的能力，从而使其更智能、安全且可靠。