Papers
Topics
Authors
Recent
Search
2000 character limit reached

Hierarchical Soft Actor-Critic: Adversarial Exploration via Mutual Information Optimization

Published 17 Jun 2019 in cs.LG, cs.AI, cs.IT, math.IT, and stat.ML | (1906.07122v1)

Abstract: We describe a novel extension of soft actor-critics for hierarchical Deep Q-Networks (HDQN) architectures using mutual information metric. The proposed extension provides a suitable framework for encouraging explorations in such hierarchical networks. A natural utilization of this framework is an adversarial setting, where meta-controller and controller play minimax over the mutual information objective but cooperate on maximizing expected rewards.

Citations (3)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (2)

Collections

Sign up for free to add this paper to one or more collections.