Simple statistical gradient-following algorithms for connectionist reinforcement learning. Ronald J. Williams. DOI 10.1007/bf00992696. Source: resea.org.