【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

在 Hugging Face，我们正在为深度强化学习的研究人员和爱好者的生态系统做出贡献。最近，我们集成了Deep RL框架，比如Stable-Baselines3。

今天，我们很高兴地宣布，我们将Decision Transformer(一种离线强化学习方法)集成到🤗Transformer库和拥抱面部中心中。我们有一些令人兴奋的计划来提高Deep RL领域的可访问性，我们期待着在未来的几周和几个月与您分享。

什么是离线强化学习? 引入决策 Transformers 使用🤗Transformer中的Decision Transformer 结论接下来是什么? 参考文献

成为VIP会员查看完整内容

相关内容

Hugging Face

关注 7

斯坦福大学《博弈论基础简介》2017版，A Brief Introduction to the Basics of Game Theory，21页论文

专知会员服务

33+阅读 · 2022年4月1日

【EPFL-Nicolas Boumal新书】光滑流形优化导论，362页pdf，An introduction to optimization on smooth manifolds

专知会员服务

34+阅读 · 2022年3月4日

【2022新书】Transformer自然语言处理，Natural Language Processing with Transformers: Building Language Applications with Hugging Face

专知会员服务

524+阅读 · 2022年1月31日

最新《Transformers模型》教程，64页ppt

专知会员服务

326+阅读 · 2020年11月26日

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

43+阅读 · 2020年7月27日

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

专知会员服务

48+阅读 · 2020年7月5日

【CVPR2020】视觉导航的神经拓扑SLAM，Neural Topological SLAM for Visual Navigation

专知会员服务

52+阅读 · 2020年5月26日

【CMU课程：深度学习导论(Spring 2020)】“11-785 Introduction to Deep Learning | Carnegie Mellon University | Spring 2020” by Bhiksha Raj

专知会员服务

29+阅读 · 2020年2月3日

【DLBM-SS暑期课程】深度学习与贝叶斯方法 Deep Learning and Bayesian Methods

专知会员服务

67+阅读 · 2019年11月10日

【课程推荐】深度学习中的新兴挑战（Emerging Challenges in Deep Learning）

专知会员服务

17+阅读 · 2019年11月10日

【Hugging Face硬核书】Transformer自然语言处理(Hugging Face)：构建语言应用

专知

34+阅读 · 2022年4月7日

【Manning新书】Spring实战圣经，第六版，Spring in Action

专知

41+阅读 · 2022年3月13日

NLP大牛Thomas Wolf等新书《Transformer自然语言处理》，466页pdf及代码

专知

36+阅读 · 2022年2月7日

【Manning新书】Kafka实战，272页pdf，Kafka in Action

专知

23+阅读 · 2022年1月30日

【数据科学导论书】Introduction to Datascience，253页pdf

专知

1+阅读 · 2021年11月15日

中科大《数据科学导论》课程

专知

7+阅读 · 2021年10月17日

基于Hugging Face的Transformer库，300行实现命名实体识别

专知

119+阅读 · 2020年2月25日

AI Challenger 2018 第4名PPT分享---细粒度情感分析赛道

AINLP

17+阅读 · 2018年12月25日

【下载】深度强化学习实战书籍和代码《Deep Reinforcement Learning in Action》

专知

78+阅读 · 2018年8月7日

为你推荐一份深度学习书单，来学习吧~

THU数据派

12+阅读 · 2018年3月13日

互联网与数学文化传播研讨会

国家自然科学基金

1+阅读 · 2018年9月23日

天元数学交流项目图像处理中的数学理论及方法研讨会

国家自然科学基金

9+阅读 · 2017年12月31日

应用数学暑期学校（2015）

国家自然科学基金

5+阅读 · 2015年7月12日

数学天元基金统计学研究生暑期学校2015

国家自然科学基金

2+阅读 · 2015年5月31日

基于实船操纵性试验的船舶紧急情况避碰行动评估研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于多准则分区的有源配电网双层式状态估计研究

国家自然科学基金

0+阅读 · 2013年12月31日

社交网络中软安全机制研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于视觉注意机制的多尺度图像融合的研究

国家自然科学基金

1+阅读 · 2009年12月31日

不等式基础理论公理化研究与不等式机器证明

国家自然科学基金

0+阅读 · 2009年12月31日

线性微分-差分系统求解及分解的机械化算法研究

国家自然科学基金

0+阅读 · 2008年12月31日

Human-Object Interaction Detection via Disentangled Transformer

Arxiv

0+阅读 · 2022年4月20日

A posteriori error estimates for hierarchical mixed-dimensional elliptic equations

Arxiv

0+阅读 · 2022年4月19日

ITSS: Interactive Web-Based Authoring and Playback Integrated Environment for Programming Tutorials

Arxiv

1+阅读 · 2022年4月19日

Leveraging Language to Learn Program Abstractions and Search Heuristics

Arxiv

0+阅读 · 2022年4月18日

Event Transformer. A sparse-aware solution for efficient event data processing

Arxiv

0+阅读 · 2022年4月18日

Risk and optimal policies in bandit experiments

Arxiv

0+阅读 · 2022年4月18日

Higher-Order SGFEM for One-Dimensional Interface Elliptic Problems with Discontinuous Solutions

Arxiv

0+阅读 · 2022年4月15日

TubeR: Tubelet Transformer for Video Action Detection

Arxiv

0+阅读 · 2022年4月15日

Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder

Arxiv

0+阅读 · 2022年4月15日

A Survey on Visual Transformer

Arxiv

19+阅读 · 2020年12月23日

VIP会员