Tags
#reinforcement-learning
Question
What is the form of a non-linear Bellman equation?
$$v(s)=\mathbb{E}\left[f\left(R_{t+1}, v\left(S_{t+1}\right)\right) | S_{t}=s, A_{t} \sim \pi\left(S_{t}\right)\right]$$

Tags
#reinforcement-learning
Question
What is the form of a non-linear Bellman equation?
?

Tags
#reinforcement-learning
Question
What is the form of a non-linear Bellman equation?
$$v(s)=\mathbb{E}\left[f\left(R_{t+1}, v\left(S_{t+1}\right)\right) | S_{t}=s, A_{t} \sim \pi\left(S_{t}\right)\right]$$
If you want to change selection, open original toplevel document below and click on "Move attachment"

#### Parent (intermediate) annotation

Open it
We consider a broader class of Bellman equations that are non-linear in the rewards and future values: $$v(s)=\mathbb{E}\left[f\left(R_{t+1}, v\left(S_{t+1}\right)\right) | S_{t}=s, A_{t} \sim \pi\left(S_{t}\right)\right]$$ .

#### Original toplevel document (pdf)

owner: reseal - (no access) - General non-linear Bellman equations, p1

#### Summary

status measured difficulty not learned 37% [default] 0

No repetitions