W2D4 content discussion

In Tutorial 2, we got confused by the last statement in this image. It seems to say that a_t = a_bar_t + L_t * (s_t - g_t). Is this correct?

In the faculty Q&A, Dr. Saxena said that the controller action is a_t = L_t * s_t and that it does not depend on a_bar_t.

What are the correct equations here?

Well it’s not really about the content, but I was curious about the " *\ " operator that was used in the “belief_update” function in tutorial1 and couldn’t find anything about it.

@ido.benartzi The \ is only a line-continuation character: it splits the statement across multiple lines so that the code is easier to read. It does not affect the output, so the * is just regular multiplication.
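In case a concrete example helps, here is a minimal sketch of that behaviour. The variable names (`prior`, `likelihood`) are made up for illustration and are not taken from the tutorial's `belief_update` function.

```python
# Hypothetical values, chosen only for illustration.
prior = 0.5
likelihood = 0.8

# The product written on a single line.
one_line = prior * likelihood

# The same product split across two lines. The trailing backslash only
# tells Python that the statement continues on the next line, so "*\"
# is an ordinary "*" followed by a line break.
two_lines = prior *\
    likelihood

# Inside parentheses no backslash is needed; the result is again the same.
in_parens = (prior *
             likelihood)

print(one_line == two_lines == in_parens)
```

One thing to watch for: the backslash has to be the very last character on the line. A space or a comment after it is a syntax error.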

Hi, I have the same question… is there an answer for that?

And I’m also not sure how to interpret the sum of \bar a_t and L_t(s_t-g_t). If the former is the action to get to g_{t+1} and the latter is the action to get to g_t, then, by linearity, wouldn’t this sum let us get to g_t+g_{t+1}?
Thanks!