Reinforcement Learning for Partially Observable Linear Gaussian Systems Using Batch Dynamics of Noisy Observations | IEEE Journals & Magazine | IEEE Xplore