| | Flashcard 4362124791052#reinforcement-learning | Q: What is the form of a non-linear Bellman equation? | A: v(s)=\mathbb{E}\left[f\left(R_{t+1}, v\left(S_{t+1}\right)\right) | S_{t}=s, A_{t} \sim \pi\left(S_{t}\right)\right] |
|
If you want to change selection, open document below and click on "Move attachment"
pdf
owner:
reseal - (no access) - General non-linear Bellman equations, p1
Summary
status | not read | | reprioritisations | |
---|
last reprioritisation on | | | suggested re-reading day | |
---|
started reading on | | | finished reading on | |
---|
Details