Skip to content

Hierarchical Actor-Critic Exploration with Synchronized, Adversarial, & Knowledge-Based Actions

Notifications You must be signed in to change notification settings

ajain-23/HAC-E-SAK

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HAC-E-SAK

In this repository, we lean into the efficiency and robustness advantages of a hierarchical learning structure with HAC-E-SAK, iterating on the Hierarchical Actor-Critic (HAC) framework. We do so through the introduction of a synchronized knowledge-based exploration paradigm motivating guided environment discovery for tasks with continuous state and action spaces. While HAC emphasizes a strictly-defined hierarchical organization for rapid learning through parallelized training of multilevel subtask transition functions, it does not extend this principle to the exploration phase of training, an oversight addressed by our approach. Further, HAC's exploration strategy consists of simple $\epsilon$-greedy based perturbations to deterministic actions generated from the DDPG algorithm. Our approach substitutes this with an alternate adversarial strategy relying on knowledge of prior agent experiences.

We successfully extend the aforementioned hierarchical organization used by leading methods in subtask learning for the parallel purpose of structured exploration, allowing for explicit synchronization between levels. We demonstrate the merits of our method experimentally through testing in sparse-reward, complex-action scenarios, showing the value of our novel approach in terms of further improved sample efficiency and consistently robust performance. The combination of our synchronization structure with our adversarial knowledge-based exploration learning strategy clearly outperforms all other presented procedures, validating the viability of HAC-E-SAK as a robust method for hierarchical learning.

About

Hierarchical Actor-Critic Exploration with Synchronized, Adversarial, & Knowledge-Based Actions

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages