|
|
|
Figure 1
Example episode illustrating the agent's adaptive policy after training. The information acquired through measurements y1 and y2 is used to emphasize exposure on the five patches of signal when generating exposure maps x2 and x3. By computing the weighted sum defined in equation (3) |
Open
access
access
journal menu![[Figure 1]](gy5080fig1.jpg)



