Reward Values
Reward Values
Description
these represent feedback reward values over the period of learning episodes. Red=failed runs, green=succeeded runs