Papers
Topics
Authors
Recent
Search
2000 character limit reached

Distributed Reinforcement Learning via Gossip

Published 28 Oct 2013 in cs.DC, cs.AI, and math.OC | (1310.7610v1)

Abstract: We consider the classical TD(0) algorithm implemented on a network of agents wherein the agents also incorporate the updates received from neighboring agents using a gossip-like mechanism. The combined scheme is shown to converge for both discounted and average cost problems.

Citations (57)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.