Sourcify
Effortlessly find and manage open-source dependencies for your projects.

The industry-standard API for Reinforcement Learning environments and benchmarking.

Gymnasium is the community-maintained fork of OpenAI Gym, serving as the definitive interface for developing and comparing reinforcement learning (RL) algorithms in 2026. Managed by the Farama Foundation, it provides a standardized Markov Decision Process (MDP) interface that decouples the learning agent from the simulation environment. Its technical architecture is designed for high-performance research, supporting a wide array of environments ranging from classic control tasks and Box2D physics to complex robotics simulations via MuJoCo. As of 2026, Gymnasium has evolved to include native JAX-compatible functional environments and improved vectorized environment support, allowing researchers to scale simulations across thousands of CPU or GPU cores simultaneously. It remains the essential middleware for AI researchers and engineers, ensuring that RL models remain portable, reproducible, and compatible with modern deep learning frameworks like PyTorch, TensorFlow, and JAX. The platform's ecosystem is bolstered by specialized wrappers for reward shaping, observation filtering, and frame stacking, making it the most versatile tool for training agents in both discrete and continuous action spaces.
Gymnasium is the community-maintained fork of OpenAI Gym, serving as the definitive interface for developing and comparing reinforcement learning (RL) algorithms in 2026.
Explore all tools that specialize in environment simulation. This domain focus ensures Gymnasium delivers optimized results for this specific requirement.
Enables the execution of multiple independent environments in parallel, either synchronously or asynchronously.
A modular class-based system to modify observations, rewards, and actions without changing the underlying environment code.
Uses specific classes (Box, Discrete, MultiDiscrete, Dict) to define input/output shapes rigorously.
Native support for the MuJoCo physics engine for high-fidelity robotics simulation.
A stateless version of the environment API designed to work with JAX's JIT and auto-diff.
Internal tracking of environment versions (e.g., v1, v2) to ensure research reproducibility.
Support for multiple render modes including 'human', 'rgb_array', and 'depth_array'.
Verify Python 3.8+ environment installation.
Install the core library using 'pip install gymnasium'.
Install optional sub-modules like 'gymnasium[atari]' or 'gymnasium[mujoco]' for specific physics engines.
Import the library using 'import gymnasium as gym'.
Initialize an environment instance using the 'gym.make()' command with a valid environment ID.
Define the agent-environment loop using 'env.reset()' to obtain the initial observation.
Implement an action selection policy (e.g., random or neural network-driven).
Execute actions via 'env.step(action)' to receive observations, rewards, and termination flags.
Apply Environment Wrappers to transform data or normalize rewards as needed.
Close the environment using 'env.close()' to release system resources.
All Set
Ready to go
Verified feedback from other users.
"Widely praised for being the essential standard for RL research. The transition from OpenAI to Farama Foundation has improved maintenance and documentation significantly."
Post questions, share tips, and help other users.
Effortlessly find and manage open-source dependencies for your projects.

End-to-end typesafe APIs made easy.

Page speed monitoring with Lighthouse, focusing on user experience metrics and data visualization.

Topcoder is a pioneer in crowdsourcing, connecting businesses with a global talent network to solve technical challenges.

Explore millions of Discord Bots and Discord Apps.

Build internal tools 10x faster with an open-source low-code platform.

Open-source RAG evaluation tool for assessing accuracy, context quality, and latency of RAG systems.

AI-powered synthetic data generation for software and AI development, ensuring compliance and accelerating engineering velocity.