RL Theory #1

eleurent · 2020-04-21T13:55:01Z

RL theory is underrepresented. A new section should be added, covering at least:

- Tabular setting
  - With a generative model
    - QVI
  - Without
    - UCRL2
    - UCBVI
  - Episodic
    - Q-learning+UCB
- Extensions to compact state-action spaces
- Extension to Kernels
- Performance measures: PAC, simple regret, cumulative regret, etc.
- RL with compatible function approximation
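
For reference, the performance measures listed above are usually stated along the following lines. This is a sketch in assumed notation, not taken from any particular paper in the list: $V^*$ is the optimal value function, $s_1$ a fixed initial state, $\hat{\pi}_n$ the policy returned after $n$ samples, and $\pi_k$ the policy played in episode $k$. Exact definitions vary between settings (discounted, episodic, average-reward).

```latex
% Assumed notation (hypothetical, for illustration only):
%   V^*          optimal value function
%   s_1          fixed initial state
%   \hat{\pi}_n  policy returned after n samples
%   \pi_k        policy played during episode k
\begin{align*}
\text{$(\varepsilon,\delta)$-PAC:} \quad
  & \mathbb{P}\left( V^*(s_1) - V^{\hat{\pi}_n}(s_1) \le \varepsilon \right) \ge 1 - \delta \\
\text{Simple regret:} \quad
  & r_n = V^*(s_1) - V^{\hat{\pi}_n}(s_1) \\
\text{Cumulative regret:} \quad
  & R_K = \sum_{k=1}^{K} \left( V^*(s_1) - V^{\pi_k}(s_1) \right)
\end{align*}
```

Simple regret and PAC guarantees only judge the final recommended policy, while cumulative regret charges for every suboptimal episode played along the way.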

Is there a difference between generative models (sample any transition) and simulators (simulate trajectories from current states only)?
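
To make the distinction in the question concrete, here is a minimal sketch of the two access models as Python interfaces, taking the definitions exactly as worded above. All class and function names (`TabularMDP`, `GenerativeModel`, `Simulator`, `empirical_qvi`) are hypothetical and chosen for illustration. The last function is a simplified model-based Q-value iteration that relies on the generative model being able to query every state-action pair, which the simulator interface does not allow directly.

```python
import numpy as np


class TabularMDP:
    """A small tabular MDP with known dynamics (illustration only)."""

    def __init__(self, P, R, rng):
        self.P = P  # P[s, a, s'] : transition probabilities
        self.R = R  # R[s, a]     : deterministic rewards
        self.rng = rng
        self.n_states, self.n_actions = R.shape


class GenerativeModel:
    """Sample a transition from ANY state-action pair chosen by the caller."""

    def __init__(self, mdp):
        self.mdp = mdp

    def sample(self, state, action):
        p = self.mdp.P[state, action]
        next_state = int(self.mdp.rng.choice(self.mdp.n_states, p=p))
        return next_state, float(self.mdp.R[state, action])


class Simulator:
    """Sample transitions only from the current state of a trajectory."""

    def __init__(self, mdp, initial_state=0):
        self.mdp = mdp
        self.initial_state = initial_state
        self.state = initial_state

    def reset(self):
        self.state = self.initial_state
        return self.state

    def step(self, action):
        p = self.mdp.P[self.state, action]
        next_state = int(self.mdp.rng.choice(self.mdp.n_states, p=p))
        reward = float(self.mdp.R[self.state, action])
        self.state = next_state  # the caller cannot jump to another state
        return next_state, reward


def empirical_qvi(model, n_states, n_actions, n_samples, gamma=0.9, n_iters=200):
    """Q-value iteration on an empirical model built from generative samples."""
    P_hat = np.zeros((n_states, n_actions, n_states))
    R_hat = np.zeros((n_states, n_actions))
    # Query every (s, a) pair the same number of times: only possible
    # with generative access.
    for s in range(n_states):
        for a in range(n_actions):
            for _ in range(n_samples):
                s_next, r = model.sample(s, a)
                P_hat[s, a, s_next] += 1.0 / n_samples
                R_hat[s, a] += r / n_samples
    Q = np.zeros((n_states, n_actions))
    for _ in range(n_iters):
        # Bellman optimality backup under the estimated model.
        Q = R_hat + gamma * P_hat @ Q.max(axis=1)
    return Q
```

A generative model can emulate a simulator by tracking the current state itself, while the converse generally requires the agent to first reach the state it wants to sample from.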

See #1

eleurent added a commit that referenced this issue Apr 28, 2020

Add a Theory section

d8ae986

See #1

eleurent added a commit that referenced this issue Apr 28, 2020

Add sample complexity of RL with generative model

e5dc789

See #1

eleurent added a commit that referenced this issue Apr 28, 2020

Add UCBVI, improving over UCRL2

79dae6b

See #1

eleurent added a commit that referenced this issue Apr 28, 2020

Add a paper on different RL theory settings and possible conversions

c39bf82

See #1

eleurent added a commit that referenced this issue Apr 28, 2020

Add LSVI with UCB for linear mdps

23aad9c

See #1
