DiffMjStep: Custom Autograd Function for Differentiable MuJoCo Dynamics

Description

An efficient integration between PyTorch and MuJoCo. Enables automatic differentiation through MuJoCo simulation trajectories, allowing for gradient-based optimization of control policies directly within PyTorch.

Features

Efficient Gradient Computations: Significantly more efficient than naive Jacobian finite differencing calculations as it utilizes the built-in finite difference method in MuJoCo mjd_transitionFD.

Multi-Step Calculations: Provides the ability to estimate gradients over multiple simulation steps, by propagating gradients through the entire trajectory.

Batch Simulation Support: Enables batched simulations and gradient computations, significantly improving computational efficiency for large-scale experiments.

Execution Benchmark

Usage

import torch
import mujoco as mj
from DiffMjStep import MjStep

# Initialize MuJoCo model and data
xml_path = 'path/to/your/model.xml'
mj_model = mj.MjModel.from_xml_path(filename=xml_path)
mj_data = mj.MjData(mj_model)

# Define initial state and control input tensors
state = torch.rand(mj_model.nq + mj_model.nv + mj_model.na, requires_grad=True)
ctrl = torch.rand(mj_model.nu, requires_grad=True)

# Compute next state and gradients
next_state, dydx, dydu = MjStep.apply(state, ctrl, n_steps=4, mj_model, mj_model, mj_data)

Citation

If you use this package in your research, a citation would be appreciated:

 @software{DiffMjStep2024,
  author = {Sharony, Elad},
  title = {{DiffMjStep: Custom Autograd Function for Differentiable MuJoCo Dynamics}},
  year = {2024},
  version = {1.0},
  howpublished = {\url{https://github.com/EladSharony/DiffMjStep}},
}

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.idea		.idea
assets		assets
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
autograd_mujoco.py		autograd_mujoco.py
execution_time.svg		execution_time.svg
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.idea

.idea

assets

assets

CITATION.cff

CITATION.cff

LICENSE

LICENSE

README.md

README.md

autograd_mujoco.py

autograd_mujoco.py

execution_time.svg

execution_time.svg

main.py

main.py

Repository files navigation

DiffMjStep: Custom Autograd Function for Differentiable MuJoCo Dynamics

Description

Features

Execution Benchmark

Usage

Citation

About

Languages

License

EladSharony/DiffMjStep

Folders and files

Latest commit

History

Repository files navigation

DiffMjStep: Custom Autograd Function for Differentiable MuJoCo Dynamics

Description

Features

Execution Benchmark

Usage

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages