Lab 4: Q-learning (table) - hunkim.github.io/ml/rl/rl-l04.pdf

Lab 4: Q-learning (table)

Exploit & exploration and discounted future reward

Reinforcement Learning with TensorFlow & OpenAI Gym

Sung Kim <hunkim+ml@gmail.com>

Exploit VS Exploration: decaying E-greedy

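# assumes numpy imported as np, a Gym environment env, and a Q-table indexed as Q[s, a]
for i in range(1000):
    e = 0.1 / (i + 1)                  # exploration rate decays with the episode index
    if np.random.rand(1) < e:
        a = env.action_space.sample()  # explore: pick a random action
    else:
        a = np.argmax(Q[s, :])         # exploit: pick the best known action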
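The decaying E-greedy rule above can be tried without Gym. The following is a minimal sketch using only the standard library; the function name `epsilon_greedy`, the list `q_row`, and its values are illustrative assumptions, not part of the lab code.

```python
import random

def epsilon_greedy(q_row, episode, rng=random):
    """Pick an action index from one row of a Q-table (hypothetical helper)."""
    e = 0.1 / (episode + 1)                  # same decay schedule as the slide
    if rng.random() < e:
        return rng.randrange(len(q_row))     # explore: uniform random action
    # exploit: index of the largest Q-value (first index wins on ties)
    return max(range(len(q_row)), key=lambda a: q_row[a])

q_row = [0.5, 0.6, 0.3, 0.2, 0.5]            # illustrative Q-values for one state
rng = random.Random(0)
picks = [epsilon_greedy(q_row, i, rng) for i in range(1000)]
greedy_share = picks.count(1) / len(picks)   # fraction of episodes choosing action 1
```

Because e shrinks as 0.1 / (i + 1), random picks are rare even in the first episodes, and almost every later choice is the greedy action.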

Exploit VS Exploration: add random noise

[Figure: example values 0.5, 0.6, 0.3, 0.2, 0.5]

for i in range(1000):
    a = np.argmax(Q[s, :] + np.random.randn(1, env.action_space.n) / (i + 1))
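The noise-based rule can be sketched the same way. This is an assumption-laden illustration: the noise is drawn from a standard normal distribution, and `noisy_argmax` and `q_row` are hypothetical names with made-up values.

```python
import random

def noisy_argmax(q_row, episode, rng=random):
    """Add decaying Gaussian noise to each Q-value, then take the argmax."""
    scale = 1.0 / (episode + 1)              # noise shrinks as episodes progress
    noisy = [q + rng.gauss(0.0, 1.0) * scale for q in q_row]
    return max(range(len(noisy)), key=lambda a: noisy[a])

q_row = [0.5, 0.6, 0.3, 0.2, 0.5]            # illustrative Q-values for one state
rng = random.Random(0)
early = [noisy_argmax(q_row, 0, rng) for _ in range(200)]    # large noise
late = [noisy_argmax(q_row, 999, rng) for _ in range(200)]   # tiny noise
```

Unlike E-greedy, an exploratory pick here is not uniform: actions with higher Q-values are still more likely to win after the noise is added. Early on the choices vary, while at episode 999 the noise is far too small to overturn the 0.1 gap between the best and second-best value.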

Discounted reward (γ = 0.9)
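With a discount factor γ, a reward received k steps in the future is worth γ^k times its face value, so the discounted return is r_t + γ r_{t+1} + γ² r_{t+2} + ... A small sketch of that arithmetic follows; the function name `discounted_return` and the reward lists are illustrative assumptions.

```python
def discounted_return(rewards, gamma=0.9):
    """Sum rewards[0] + gamma * rewards[1] + gamma**2 * rewards[2] + ..."""
    total = 0.0
    for r in reversed(rewards):      # fold from the last reward backwards
        total = r + gamma * total
    return total

# A single reward of 1 that arrives two steps from now is worth 0.9 ** 2.
delayed = discounted_return([0, 0, 1])
# Three consecutive rewards of 1 are worth 1 + 0.9 + 0.81.
steady = discounted_return([1, 1, 1])
```

Discounting makes the agent prefer shorter paths to the goal, since the same reward is worth less the later it arrives.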

Q-learning algorithm
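The slide body is not preserved in this text, so the following is only a sketch of the standard tabular update for a deterministic world, Q(s, a) ← r + γ max_a' Q(s', a'), combined with the decaying random-noise exploration from the earlier slide. The corridor environment, the `step` and `train` functions, and all constants are hypothetical stand-ins chosen to keep the example self-contained; the lab itself uses an OpenAI Gym environment.

```python
import random

N_STATES = 5        # toy corridor with states 0..4; state 4 is the goal (assumption)
ACTIONS = (0, 1)    # 0 = left, 1 = right
GAMMA = 0.9         # discount factor

def step(state, action):
    """Deterministic transition: returns (next_state, reward, done)."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done

def train(episodes=200, max_steps=100, seed=0):
    rng = random.Random(seed)
    Q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]   # Q-table of zeros
    for i in range(episodes):
        s = 0
        for _ in range(max_steps):
            # choose an action by argmax over Q plus decaying random noise
            noisy = [Q[s][a] + rng.gauss(0.0, 1.0) / (i + 1) for a in ACTIONS]
            a = max(ACTIONS, key=lambda k: noisy[k])
            s2, r, done = step(s, a)
            # deterministic Q-learning update (no learning rate)
            Q[s][a] = r + GAMMA * max(Q[s2])
            s = s2
            if done:
                break
    return Q

Q = train()
```

After training, the value of moving right from state s should approach 0.9 ** (3 - s), that is, the goal reward discounted once per remaining step.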

Code: setup

https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-0-q-learning-with-tables-and-neural-networks-d195264329d0#.pjz9g59ap

Code: Q learning

Code: results

Code: e-greedy

Code: e-greedy results

Nondeterministic / Stochastic worlds
