Module `environments`

The environments module includes all necessary functionality to spawn and wrap environments.

The atari_wrappers module is a modified copy of the corresponding module from the torchbeast project.

Exposed classes:

EnvSpawner

Unexposed modules:

atari_wrappers

Exposed Classes

Environment spawner object (`environments.EnvSpawner`)

class pytorch_seed_rl.environments.EnvSpawner(env_id: str, num_envs: int = 1)

Bases: object

Class whose instances are handed to actor threads, which spawn their local environments by invoking spawn() on the instance.

Parameters

env_id (str) – The environment's identifier as registered with gym.
num_envs (int) – The number of environments spawn() returns.

Variables

self.env_info (dict) – Information about the spawned environments, as a dictionary.
self.placeholder_obs (dict) – A dictionary with the same structure as the observations returned by the spawned environments' step() method.

spawn() → List[gym.Env]

Returns a list of wrapped environments (using OpenAI’s gym).

Applies:

ClipRewardEnv
DictObservationsEnv
EpisodicLifeEnv
FireResetEnv, if env contains an action with meaning ‘FIRE’
ImageToPyTorch
MaxAndSkipEnv (skip = 4)
NoopResetEnv (noop_max = 30)
WarpFrame

Unexposed Submodules

Utility functions for wrapping (`environments.atari_wrappers`)

A collection of wrappers applicable to environments following the OpenAI gym API.

Wrappers for OpenAI gym (`environments.atari_wrappers`)

class pytorch_seed_rl.environments.atari_wrappers.AutoResetWrapper(*args: Any, **kwargs: Any)

Bases: gym.Wrapper

A wrapper that automatically resets the environment in case of termination.

Parameters: env (gym.Env) – An environment that will be wrapped.
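The auto-reset behaviour can be illustrated with a minimal sketch. It does not use gym and is not the module's implementation: `CountdownEnv` and `AutoReset` are hypothetical names, and returning the first observation of the next episode in place of the terminal one is an assumption made for illustration.

```python
class CountdownEnv:
    """Hypothetical toy environment: an episode ends after `length` steps."""

    def __init__(self, length=3):
        self.length = length
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        done = self.t >= self.length
        return self.t, 1.0, done, {}


class AutoReset:
    """Illustrative wrapper: resets the wrapped environment on termination."""

    def __init__(self, env):
        self.env = env

    def reset(self):
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        if done:
            # Assumption: hand back the first observation of the next episode.
            obs = self.env.reset()
        return obs, reward, done, info


env = AutoReset(CountdownEnv(length=3))
env.reset()
trace = [env.step(0) for _ in range(4)]
```

With this sketch the caller never has to call reset() after the first time: the `done` flag still marks the episode boundary, and stepping simply continues into the next episode.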

class pytorch_seed_rl.environments.atari_wrappers.ClipRewardEnv(*args: Any, **kwargs: Any)

Bases: gym.RewardWrapper

Clips rewards.

Parameters: env (gym.Env) – An environment that will be wrapped.

reward(reward): Bins the reward to {+1, 0, -1} by its sign.
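Binning by sign can be written in one line. The function below is a plain-Python sketch of the idea, not the wrapper's own code; `clip_reward` is a hypothetical name.

```python
def clip_reward(reward):
    """Map a scalar reward to +1, 0, or -1 according to its sign."""
    return (reward > 0) - (reward < 0)


clipped = [clip_reward(r) for r in (7.5, 0.0, -200)]
```

The magnitude of the reward is discarded, so games with very different score scales produce learning signals of comparable size.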

class pytorch_seed_rl.environments.atari_wrappers.DictObservationsEnv(*args: Any, **kwargs: Any)

Bases: gym.Wrapper

Provides observations as a dict with additional metrics.

Adds an initial() method, which returns the initial observation.

Parameters: env (gym.Env) – An environment that will be wrapped.

initial() → dict: Returns an initial observation.

class pytorch_seed_rl.environments.atari_wrappers.EpisodicLifeEnv(*args: Any, **kwargs: Any)

Bases: gym.Wrapper

Treats the loss of a life as the end of an episode, but only resets the environment on a true game over. DeepMind did this for DQN and related agents because it helps value estimation.

Parameters: env (gym.Env) – An environment that will be wrapped.

reset(**kwargs): Resets only when lives are exhausted. This way all states are still reachable even though lives are episodic, and the learner does not need to know about any of this behind-the-scenes logic.
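The interplay of step() and reset() described here can be sketched without gym. Everything in the block is hypothetical and for illustration only: `LivesEnv` is a toy game in which action 1 costs a life and action 0 is a no-op, and `EpisodicLife` follows the commonly used pattern of advancing with a no-op when reset() is called after a mere life loss.

```python
class LivesEnv:
    """Hypothetical toy game: action 1 loses a life, action 0 does nothing."""

    def __init__(self, lives=2):
        self.start_lives = lives
        self.lives = 0
        self.resets = 0  # counts true resets of the underlying game

    def reset(self):
        self.lives = self.start_lives
        self.resets += 1
        return self.lives

    def step(self, action):
        if action == 1:
            self.lives -= 1
        return self.lives, 0.0, self.lives == 0, {}


class EpisodicLife:
    """Illustrative wrapper: a lost life ends the episode, not the game."""

    def __init__(self, env):
        self.env = env
        self.lives = 0
        self.was_real_done = True

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self.was_real_done = done
        lives = self.env.lives
        if 0 < lives < self.lives:
            # A life was lost but the game continues: report episode end.
            done = True
        self.lives = lives
        return obs, reward, done, info

    def reset(self):
        if self.was_real_done:
            obs = self.env.reset()  # true game over: really reset
        else:
            obs, _, _, _ = self.env.step(0)  # continue from the current state
        self.lives = self.env.lives
        return obs


env = EpisodicLife(LivesEnv(lives=2))
env.reset()
_, _, done_after_life_loss, _ = env.step(1)
env.reset()
resets_after_life_loss = env.env.resets
_, _, done_at_game_over, _ = env.step(1)
env.reset()
resets_after_game_over = env.env.resets
```

In this run the first lost life reports `done` without restarting the toy game, and only the second reset() call, after the last life is gone, triggers a real reset.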

class pytorch_seed_rl.environments.atari_wrappers.FireResetEnv(*args: Any, **kwargs: Any)

Bases: gym.Wrapper

Takes the FIRE action on reset for environments that remain fixed until firing.

Parameters: env (gym.Env) – An environment that will be wrapped.
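A small sketch shows why such a wrapper is useful. It is not the module's code: `FrozenUntilFireEnv`, its `ACTION_MEANINGS` list, and `FireOnReset` are hypothetical, and the sketch assumes that a single FIRE step is enough to start the game.

```python
class FrozenUntilFireEnv:
    """Hypothetical toy game that does not start until FIRE is pressed."""

    ACTION_MEANINGS = ["NOOP", "FIRE", "LEFT"]

    def __init__(self):
        self.started = False

    def reset(self):
        self.started = False
        return "frozen"

    def step(self, action):
        if self.ACTION_MEANINGS[action] == "FIRE":
            self.started = True
        obs = "running" if self.started else "frozen"
        return obs, 0.0, False, {}


class FireOnReset:
    """Illustrative wrapper: presses FIRE once right after every reset."""

    def __init__(self, env):
        self.env = env
        # Look up the action index by its meaning instead of hard-coding it.
        self.fire_action = env.ACTION_MEANINGS.index("FIRE")

    def reset(self):
        self.env.reset()
        obs, _, _, _ = self.env.step(self.fire_action)
        return obs

    def step(self, action):
        return self.env.step(action)


raw_obs = FrozenUntilFireEnv().reset()
wrapped_obs = FireOnReset(FrozenUntilFireEnv()).reset()
```

Without the wrapper an agent that never selects FIRE would stay in the frozen start state indefinitely; with it, every episode begins with the game already running.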

class pytorch_seed_rl.environments.atari_wrappers.FrameStack(*args: Any, **kwargs: Any)

Bases: gym.Wrapper

Stacks the last k frames. Returns a lazy array, which is much more memory efficient.
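The memory saving comes from consecutive stacked observations sharing the same underlying frames. The sketch below illustrates this with plain Python lists standing in for image arrays; `LazyFrames`, `FrameStacker`, and their methods are hypothetical names and are not taken from the module.

```python
from collections import deque


class LazyFrames:
    """Keeps references to frames and copies them only on request."""

    def __init__(self, frames):
        self.frames = frames  # references to shared frame objects, no copies

    def materialize(self):
        # Copy the frames only when a concrete stacked observation is needed.
        return [list(frame) for frame in self.frames]


class FrameStacker:
    """Illustrative buffer that always holds the last k frames."""

    def __init__(self, k):
        self.k = k
        self.buffer = deque(maxlen=k)

    def reset(self, first_frame):
        # Fill the whole buffer with references to the first frame.
        for _ in range(self.k):
            self.buffer.append(first_frame)
        return LazyFrames(list(self.buffer))

    def push(self, frame):
        # The deque drops the oldest frame automatically.
        self.buffer.append(frame)
        return LazyFrames(list(self.buffer))


stacker = FrameStacker(k=3)
f0, f1 = [0, 0], [1, 1]
obs_a = stacker.reset(f0)
obs_b = stacker.push(f1)
```

Here `obs_a` and `obs_b` both refer to the same `f0` object, so storing many overlapping observations, for example in a replay buffer, costs roughly one frame per step instead of k.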
