The actor-critic learning is behind the matching law: Matching vs. optimal behaviors
Y. Sakai, T. Fukai
Neural Computation
20
1
227-251