ElegantRL
stable
  • Home

HelloWorld

  • Hello, World!
  • Networks: net.py
  • Agents: agent.py
  • Environment: env.py
  • Main: run.py
  • Quickstart

Overview

  • Key Concepts and Features
  • Cloud-native Paradigm
  • Muti-level Parallelism

Tutorials

  • Example 1: LunarLanderContinuous-v2
  • Example 2: BipedalWalker-v3
  • How to create a VecEnv on GPUs
  • How to run worker parallelism: Isaac Gym
  • How to run learner parallelism: REDQ
  • How to learn stably: H-term
  • Cloud Example 1: Generational Evolution
  • Cloud Example 2: Tournament-based Ensemble Training

Algorithms

  • DQN
  • Double DQN
  • DDPG
  • TD3
  • SAC
  • A2C
  • PPO
  • REDQ
  • MADDPG
  • MATD3
  • QMix
  • VDN
  • MAPPO

API Reference

  • Configuration: config.py
  • Run: run.py
  • Worker: worker.py
  • Replay Buffer: replay_buffer.py
  • Evaluator: evaluator.py

Other

  • FAQ
ElegantRL
  • Search


© Copyright 2021, ElegantRL. Revision a59f6c8c.

Built with Sphinx using a theme provided by Read the Docs.