資源簡介
深度強(qiáng)化學(xué)習(xí)系列論文,包括最基礎(chǔ)的DQN,DQN模型改進(jìn),DQN算法改進(jìn),分層DRL,基于策略梯度的深度強(qiáng)化學(xué)習(xí)等等,論文基本源自頂會
代碼片段和文件信息
?屬性????????????大小?????日期????時(shí)間???名稱
-----------?---------??----------?-----??----
?????文件????1228584??2017-04-11?10:41??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Benchmarking?Deep?Reinforcement?Learning?for?Continuous?Control.pdf
?????文件????1251801??2017-04-11?10:40??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Combining?policy?gradient?and?Q-learning.pdf
?????文件????1090260??2017-04-11?10:26??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Compatible?Value?Gradients?for?Reinforcement?Learning?of?Continuous?Deep?Policies(1).pdf
?????文件????1090260??2017-04-11?10:21??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Compatible?Value?Gradients?for?Reinforcement?Learning?of?Continuous?Deep?Policies.pdf
?????文件?????663698??2017-04-11?10:19??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Continuous?control?with?deep?reinforcement?learning.pdf
?????文件????1708226??2017-04-11?10:41??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Continuous?Deep?Q-Learning?with?Model-based?Acceleration.pdf
?????文件?????572750??2017-04-11?10:21??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Deep?Reinforcement?Learning?in?Parameterized?Action?Space.pdf
?????文件?????343663??2017-04-11?10:18??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Deterministic?Policy?Gradient?Algorithms.pdf
?????文件?????672837??2017-04-11?10:39??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Efficient?Exploration?for?Dialogue?Policy?Learning?with?BBQ?Networks?&?Replay?Buffer?Spiking.pdf
?????文件????4728764??2017-04-11?10:18??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\End-to-End?Training?of?Deep?Visuomotor?Policies.pdf
?????文件?????443482??2017-04-11?10:39??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Gradient?Estimation?Using?Stochastic?Computation?Graphs.pdf
?????文件????1798272??2017-04-11?10:22??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\High-Dimensional?Continuous?Control?Using?Generalized?Advantage?Estimation.pdf
?????文件?????903324??2017-04-11?10:36??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Interactive?Control?of?Diverse?Complex?Characters?with?Neural?Networks.pdf
?????文件?????854278??2017-04-11?10:41??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Learning?Continuous?Control?Policies?by?Stochastic?Value?Gradients.pdf
?????文件?????881400??2017-04-11?10:13??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Learning?Deep?Control?Policies?for?Autonomous?Aerial?Vehicles?with?MPC-Guided?Policy?Search.pdf
?????文件?????693924??2017-04-11?10:21??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Memory-based?control?with?recurrent?neural?networks.pdf
?????文件?????850845??2017-04-11?10:38??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Q-Prop?Sample-Efficient?Policy?Gradient?with?An?Off-Policy?Critic.pdf
?????文件????1444030??2017-04-11?10:29??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Sample?Efficient?Actor-Critic?with?Experience?Replay.pdf
?????文件????8820782??2017-04-11?10:24??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Terrain-Adaptive?Locomotion?Skills?Using?Deep?Reinforcement?Learning?.pdf
?????文件????1024402??2017-04-11?10:11??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Trust?Region?Policy?Optimization.pdf
?????文件????4601440??2017-12-22?01:17??DQN?開山篇\Human-level?control?through?deep?reinforcementlearning.pdf
?????文件?????435604??2017-06-23?09:40??DQN?開山篇\Playing?Atari?with?Deep?Reinforcement?Learning.pdf
?????文件????8119239??2017-04-11?10:01??DQN?模型改進(jìn)\Control?of?Memory?Active?Perception?and?Action?in?Minecraft.pdf
?????文件?????316250??2017-04-11?09:58??DQN?模型改進(jìn)\Deep?Attention?Recurrent?Q-Network.pdf
?????文件?????843143??2017-04-11?09:59??DQN?模型改進(jìn)\Deep?Recurrent?Q-Learning?for?Partially?Observable?MDPs.pdf
?????文件????1372673??2017-04-11?10:01??DQN?模型改進(jìn)\Hierarchical?Deep?Reinforcement?Learning?Integrating?Temporal?Abstraction?and?Intrinsic?Motivation.pdf
?????文件?????612255??2017-04-11?09:59??DQN?模型改進(jìn)\Language?Understanding?for?Text-based?Games?Using?Deep?Reinforcement?Learning.pdf
?????文件????1024426??2017-04-11?10:00??DQN?模型改進(jìn)\Learning?to?Communicate?to?Solve?Riddles?with?Deep?Distributed?Recurrent?Q-Networks.pdf
?????文件?????404203??2017-04-11?10:02??DQN?模型改進(jìn)\Mazebase?A?Sandbox?for?Learning?from?Games.pdf
?????文件????4278630??2017-04-11?10:01??DQN?模型改進(jìn)\Progressive?Neural?Networks.pdf
............此處省略29個(gè)文件信息
-----------?---------??----------?-----??----
?????文件????1228584??2017-04-11?10:41??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Benchmarking?Deep?Reinforcement?Learning?for?Continuous?Control.pdf
?????文件????1251801??2017-04-11?10:40??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Combining?policy?gradient?and?Q-learning.pdf
?????文件????1090260??2017-04-11?10:26??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Compatible?Value?Gradients?for?Reinforcement?Learning?of?Continuous?Deep?Policies(1).pdf
?????文件????1090260??2017-04-11?10:21??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Compatible?Value?Gradients?for?Reinforcement?Learning?of?Continuous?Deep?Policies.pdf
?????文件?????663698??2017-04-11?10:19??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Continuous?control?with?deep?reinforcement?learning.pdf
?????文件????1708226??2017-04-11?10:41??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Continuous?Deep?Q-Learning?with?Model-ba
?????文件?????572750??2017-04-11?10:21??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Deep?Reinforcement?Learning?in?Parameterized?Action?Space.pdf
?????文件?????343663??2017-04-11?10:18??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Deterministic?Policy?Gradient?Algorithms.pdf
?????文件?????672837??2017-04-11?10:39??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Efficient?Exploration?for?Dialogue?Policy?Learning?with?BBQ?Networks?&?Replay?Buffer?Spiking.pdf
?????文件????4728764??2017-04-11?10:18??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\End-to-End?Training?of?Deep?Visuomotor?Policies.pdf
?????文件?????443482??2017-04-11?10:39??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Gradient?Estimation?Using?Stochastic?Computation?Graphs.pdf
?????文件????1798272??2017-04-11?10:22??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\High-Dimensional?Continuous?Control?Using?Generalized?Advantage?Estimation.pdf
?????文件?????903324??2017-04-11?10:36??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Interactive?Control?of?Diverse?Complex?Characters?with?Neural?Networks.pdf
?????文件?????854278??2017-04-11?10:41??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Learning?Continuous?Control?Policies?by?Stochastic?Value?Gradients.pdf
?????文件?????881400??2017-04-11?10:13??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Learning?Deep?Control?Policies?for?Autonomous?Aerial?Vehicles?with?MPC-Guided?Policy?Search.pdf
?????文件?????693924??2017-04-11?10:21??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Memory-ba
?????文件?????850845??2017-04-11?10:38??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Q-Prop?Sample-Efficient?Policy?Gradient?with?An?Off-Policy?Critic.pdf
?????文件????1444030??2017-04-11?10:29??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Sample?Efficient?Actor-Critic?with?Experience?Replay.pdf
?????文件????8820782??2017-04-11?10:24??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Terrain-Adaptive?Locomotion?Skills?Using?Deep?Reinforcement?Learning?.pdf
?????文件????1024402??2017-04-11?10:11??基于策略梯度的深度強(qiáng)化學(xué)習(xí)\Trust?Region?Policy?Optimization.pdf
?????文件????4601440??2017-12-22?01:17??DQN?開山篇\Human-level?control?through?deep?reinforcementlearning.pdf
?????文件?????435604??2017-06-23?09:40??DQN?開山篇\Playing?Atari?with?Deep?Reinforcement?Learning.pdf
?????文件????8119239??2017-04-11?10:01??DQN?模型改進(jìn)\Control?of?Memory?Active?Perception?and?Action?in?Minecraft.pdf
?????文件?????316250??2017-04-11?09:58??DQN?模型改進(jìn)\Deep?Attention?Recurrent?Q-Network.pdf
?????文件?????843143??2017-04-11?09:59??DQN?模型改進(jìn)\Deep?Recurrent?Q-Learning?for?Partially?Observable?MDPs.pdf
?????文件????1372673??2017-04-11?10:01??DQN?模型改進(jìn)\Hierarchical?Deep?Reinforcement?Learning?Integrating?Temporal?Abstraction?and?Intrinsic?Motivation.pdf
?????文件?????612255??2017-04-11?09:59??DQN?模型改進(jìn)\Language?Understanding?for?Text-ba
?????文件????1024426??2017-04-11?10:00??DQN?模型改進(jìn)\Learning?to?Communicate?to?Solve?Riddles?with?Deep?Distributed?Recurrent?Q-Networks.pdf
?????文件?????404203??2017-04-11?10:02??DQN?模型改進(jìn)\Mazeba
?????文件????4278630??2017-04-11?10:01??DQN?模型改進(jìn)\Progressive?Neural?Networks.pdf
............此處省略29個(gè)文件信息
- 上一篇:車牌管理軟件T16
- 下一篇:佳能相機(jī)開發(fā)包EDSDK 3.5
評論
共有 條評論