OpenAI Gym is a toolkit for developing and comparing reinforcement learning algorithms. This is the gym
open-source library, which gives you access to a standardized set of environments. (A static export of the old accompanying gym website can be found at https://gym.openai.com/read-only.html.)
gym
makes no assumptions about the structure of your agent, and is compatible with any numerical computation library, such as TensorFlow or Theano. You can use it from Python code, and soon from other languages.
If you're not sure where to start, we recommend beginning with the docs on our site. See also the FAQ.
A whitepaper for OpenAI Gym is available at http://arxiv.org/abs/1606.01540, and here's a BibTeX entry that you can use to cite it in a publication:
@misc{1606.01540, Author = {Greg Brockman and Vicki Cheung and Ludwig Pettersson and Jonas Schneider and John Schulman and Jie Tang and Wojciech Zaremba}, Title = {OpenAI Gym}, Year = {2016}, Eprint = {arXiv:1606.01540}, }
Contents of this document
There are two basic concepts in reinforcement learning: the environment (namely, the outside world) and the agent (namely, the algorithm you are writing). The agent sends actions to the environment, and the environment replies with observations and rewards (that is, a score).
The core gym interface is Env, which is
the unified environment interface. There is no interface for agents;
that part is left to you. The following are the Env
methods you
should know:
You can perform a minimal install of gym
with:
git clone https://github.com/openai/gym.git
cd gym
pip install -e .
If you prefer, you can do a minimal install of the packaged version directly from PyPI:
pip install gym
You'll be able to run a few environments right away:
pyglet
to render though)We recommend playing with those environments at first, and then later installing the dependencies for the remaining environments.
To install the full set of environments, you'll need to have some system packages installed. We'll build out the list here over time; please let us know what you end up installing on your platform.
On OSX:
brew install cmake boost boost-python sdl2 swig wget
On Ubuntu 14.04:
apt-get install -y python-numpy python-dev cmake zlib1g-dev libjpeg-dev xvfb libav-tools xorg-dev python-opengl libboost-all-dev libsdl2-dev swig
MuJoCo has a proprietary dependency we can't set up for you. Follow
the
instructions
in the mujoco-py
package for help.
Once you're ready to install everything, run pip install -e '.[all]'
(or pip install 'gym[all]'
).
We currently support Linux and OS X running Python 2.7 or 3.5. Some users on OSX + Python3 may need to run
brew install boost-python --with-python3
If you want to access Gym from languages other than python, we have limited support for non-python frameworks, such as lua/Torch, using the OpenAI Gym HTTP API.
To run pip install -e '.[all]'
, you'll need a semi-recent pip.
Please make sure your pip is at least at version 1.5.0
. You can
upgrade using the following: pip install --ignore-installed
pip
. Alternatively, you can open setup.py and
install the dependencies by hand.
If you're trying to render video on a server, you'll need to connect a
fake display. The easiest way to do this is by running under
xvfb-run
(on Ubuntu, install the xvfb
package):
xvfb-run -s "-screen 0 1400x900x24" bash
If you'd like to install the dependencies for only specific environments, see setup.py. We maintain the lists of dependencies on a per-environment group basis.
The code for each environment group is housed in its own subdirectory gym/envs. The specification of each task is in gym/envs/__init__.py. It's worth browsing through both.
These are a variety of algorithmic tasks, such as learning to copy a sequence.
import gym
env = gym.make('Copy-v0')
env.reset()
env.render()
The Atari environments are a variety of Atari video games. If you didn't do the full install, you can install dependencies via pip install -e '.[atari]'
(you'll need cmake
installed) and then get started as follow:
import gym
env = gym.make('SpaceInvaders-v0')
env.reset()
env.render()
This will install atari-py
, which automatically compiles the Arcade Learning Environment. This can take quite a while (a few minutes on a decent laptop), so just be prepared.
The board game environments are a variety of board games. If you didn't do the full install, you can install dependencies via pip install -e '.[board_game]'
(you'll need cmake
installed) and then get started as follow:
import gym
env = gym.make('Go9x9-v0')
env.reset()
env.render()
Box2d is a 2D physics engine. You can install it via pip install -e '.[box2d]'
and then get started as follow:
import gym
env = gym.make('LunarLander-v2')
env.reset()
env.render()
These are a variety of classic control tasks, which would appear in a typical reinforcement learning textbook. If you didn't do the full install, you will need to run pip install -e '.[classic_control]'
to enable rendering. You can get started with them via:
import gym
env = gym.make('CartPole-v0')
env.reset()
env.render()
MuJoCo is a physics engine which can do
very detailed efficient simulations with contacts. It's not
open-source, so you'll have to follow the instructions in mujoco-py
to set it up. You'll have to also run pip install -e '.[mujoco]'
if you didn't do the full install.
import gym
env = gym.make('Humanoid-v1')
env.reset()
env.render()
Toy environments which are text-based. There's no extra dependency to install, so to get started, you can just do:
import gym
env = gym.make('FrozenLake-v0')
env.reset()
env.render()
See the examples
directory.
We are using pytest for tests. You can run them via:
pytest
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。