Gradient Policy on "CartPole" game and its' expansibility to F1Tenth Autonomous Vehicles

Policy gradient is an effective way to estimate continuous action on the environment. This paper, it about explaining the mathematical formula and code implementation. In the end, comparing between the rotation angle of the stick on CartPole , and the angle of the Autonomous vehicle when turning, and utilizing the Bicycle Model, a simple Kinematic dynamic model, are the purpose to discover the similarity between these two models, so as to facilitate the model transfer from CartPole to the F1tenth Autonomous vehicle.

翻译：政策梯度是估计环境持续行动的有效方法。本文是关于解释数学公式和代码执行的。最后, 比较CartPole上的杆子的旋转角度和自动车在翻转时的旋转角度, 并使用自行车模式(一个简单的虚拟动力模型), 目的是发现这两种模式之间的相似性, 以便于从CartPole向F1tenth自动车的模型转移。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《行为与认知机器人学》，241页pdf

专知会员服务

55+阅读 · 2021年4月11日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

41+阅读 · 2020年9月21日

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

专知会员服务

14+阅读 · 2020年1月9日

Understanding Color and the In-Camera Image Processing Pipeline for Computer Vision 【Michael S. Brown IEEE】韩国 ICCV 2019

专知会员服务

10+阅读 · 2019年10月30日