MushroomRL
latest

API:

  • Agent-Environment Interface
  • Actor-Critic
  • Policy search
  • Value-Based
  • Approximators
  • Distributions
  • Environments
  • Features
  • Policy
  • Solvers
  • Utils

Tutorials:

  • How to make a simple experiment
  • How to make an advanced experiment
  • How to create a regressor
  • How to make a deep RL experiment
  • How to use the Logger
  • How to use the Environment interface
  • How to Save and Load (Serializable interface)
  • Usage Examples
    • Finite MDPs with Temporal Difference
    • TD with function approximation
    • Classical Policy Search and Actor-Critic
    • Black Box Optimization
    • Deep Critic-Only
    • Deep Actor-Critic
    • Continuos Control From Pixels
    • Others Examples (Environment and Tools)
MushroomRL
  • Usage Examples
  • Edit on GitHub

Usage Examples

In the following, we collect the links to MushroomRL scripts showing examples for most approaches available in MushroomRL.

The examples can be all found in the examples folder in the MushroomRL repository.

Finite MDPs with Temporal Difference

  • Simple Chain

  • Double Chain

  • Grid World

  • Taxi

TD with function approximation

  • Mountain Car with SARSA

  • Puddle World with SARSA

  • CarOnHill with FQI

  • CartPole with LSPI

Classical Policy Search and Actor-Critic

  • LQR with Policy Gradient

  • Pendulum with Stochastic Actor-Critic

  • Pendulum with Deterministic Actor-Critic

Black Box Optimization

  • LQR with BBO

  • Segway with BBO

  • Ship Steering with BBO

Deep Critic-Only

  • Acrobot with DQN

  • Minigrid with DQN

  • Atari

Deep Actor-Critic

  • Acrobot with A2C

  • Pendulum with A2C

  • Pendulum with Trust Region approaches

  • Pendulum with Deterministic Gradient

  • Pendulum with SAC

Continuos Control From Pixels

  • Walker (Stand Task) from Pixel

  • Walker (Stand Task) from Pixel and Shared Network

Others Examples (Environment and Tools)

  • Using Dataset Plotting callback and State Normalization

  • Habitat Navigation Task with DQN

  • Habitat Rearrange Task with SAC from Pixel and Shared Network

  • iGibson with DQN

Previous

© Copyright 2018-2021 Carlo D'Eramo, Davide Tateo. Revision 1a4f54ed.

Built with Sphinx using a theme provided by Read the Docs.