Actuators

Actuators#

Actuators convert high-level commands (position, velocity, effort) into low-level efforts that drive joints. Implementations use either built-in actuators (physics engine computes torques and integrates damping forces implicitly) or explicit actuators (user computes torques explicitly, integrator cannot account for their velocity derivatives).

Choosing an Actuator Type#

Built-in actuators (BuiltinPositionActuator, BuiltinVelocityActuator): Use MuJoCo’s native implementations. The physics engine computes torques and integrates damping forces implicitly, providing the best numerical stability.

Explicit actuators (IdealPdActuator, DcMotorActuator, LearnedMlpActuator): Compute torques explicitly so the simulator cannot account for velocity derivatives. Use when you need custom control laws or actuator dynamics that can’t be expressed with built-in types (e.g., velocity-dependent torque limits, learned actuator networks).

XML actuators (XmlPositionActuator, XmlMotorActuator, XmlVelocityActuator): Wrap actuators already defined in your robot’s XML file.

Delayed actuators (DelayedActuator): Generic wrapper that adds command delays to any actuator type. Use for modeling communication latency.

TL;DR#

Basic PD control:

from mjlab.actuator import BuiltinPositionActuatorCfg
from mjlab.entity import EntityCfg, EntityArticulationInfoCfg

robot_cfg = EntityCfg(
    spec_fn=lambda: load_robot_spec(),
    articulation=EntityArticulationInfoCfg(
        actuators=(
            BuiltinPositionActuatorCfg(
                joint_names_expr=(".*_hip_.*", ".*_knee_.*"),
                stiffness=80.0,
                damping=10.0,
                effort_limit=100.0,
            ),
        ),
    ),
)

Add delays:

from mjlab.actuator import DelayedActuatorCfg, BuiltinPositionActuatorCfg

DelayedActuatorCfg(
    base_cfg=BuiltinPositionActuatorCfg(
        joint_names_expr=(".*",),
        stiffness=80.0,
        damping=10.0,
    ),
    delay_target="position",
    delay_min_lag=2,  # Minimum 2 physics steps
    delay_max_lag=5,  # Maximum 5 physics steps
)

Actuator Interface#

All actuators implement a unified compute() interface that receives an ActuatorCmd (containing position, velocity, and effort targets) and returns control signals for the low-level MuJoCo actuators driving each joint. The abstraction provides lifecycle hooks for model modification, initialization, reset, and runtime updates.

Core interface:

def compute(self, cmd: ActuatorCmd) -> torch.Tensor:
    """Convert high-level commands to control signals.

    Args:
        cmd: Command containing position_target, velocity_target, effort_target
            (each is a [num_envs, num_joints] tensor or None)

    Returns:
        Control signals for this actuator ([num_envs, num_joints] tensor)
    """

Lifecycle hooks:

edit_spec: Modify MjSpec before compilation (add actuators, set gains)
initialize: Post-compilation setup (resolve indices, allocate buffers)
reset: Per-environment reset logic
update: Pre-step updates
compute: Convert commands to control signals

Properties:

joint_ids: Tensor of local joint indices controlled by this actuator
joint_names: List of joint names controlled by this actuator
ctrl_ids: Tensor of global control input indices for this actuator

Actuator Types#

Built-in Actuators#

Built-in actuators use MuJoCo’s native actuator types via the MjSpec API. The physics engine computes the control law and integrates velocity-dependent damping forces implicitly, providing best numerical stability.

BuiltinPositionActuator: Creates <position> actuators for PD control.

BuiltinVelocityActuator: Creates <velocity> actuators for velocity control.

BuiltinMotorActuator: Creates <motor> actuators for direct torque control.

from mjlab.actuator import BuiltinPositionActuatorCfg, BuiltinVelocityActuatorCfg

# Mobile manipulator: PD for arm joints, velocity control for wheels.
actuators = (
    BuiltinPositionActuatorCfg(
        joint_names_expr=(".*_shoulder_.*", ".*_elbow_.*", ".*_wrist_.*"),
        stiffness=100.0,
        damping=10.0,
        effort_limit=150.0,
    ),
    BuiltinVelocityActuatorCfg(
        joint_names_expr=(".*_wheel_.*",),
        damping=20.0,
        effort_limit=50.0,
    ),
)

Explicit Actuators#

These actuators explicitly compute efforts and forward them to an underlying <motor> actuator acting as a passthrough. This enables custom control laws and actuator dynamics that can’t be expressed with built-in types.

Important

Explicit actuators may be less numerically stable than built-in actuators because the integrator cannot account for the velocity derivatives of the control forces, especially with high damping gains.

IdealPdActuator: Base class that implements an ideal PD controller.

DcMotorActuator: Example of a more realistic actuator model built on top of IdealPdActuator. Adds velocity-dependent torque saturation to model DC motor torque-speed curves (back-EMF effects). It implements a linear torque-speed curve: maximum torque at zero velocity, zero torque at maximum velocity.

from mjlab.actuator import IdealPdActuatorCfg, DcMotorActuatorCfg

# Ideal PD for hips, DC motor model with torque-speed curve for knees.
actuators = (
    IdealPdActuatorCfg(
        joint_names_expr=(".*_hip_.*",),
        stiffness=80.0,
        damping=10.0,
        effort_limit=100.0,
    ),
    DcMotorActuatorCfg(
        joint_names_expr=(".*_knee_.*",),
        stiffness=80.0,
        damping=10.0,
        effort_limit=25.0,       # Continuous torque limit
        saturation_effort=50.0,  # Peak torque at stall
        velocity_limit=30.0,     # No-load speed (rad/s)
    ),
)

DcMotorActuator parameters:

saturation_effort: Peak motor torque at zero velocity (stall torque)
velocity_limit: Maximum motor velocity (no-load speed, rad/s)
effort_limit: Continuous torque limit (from base class)

LearnedMlpActuator: Neural network-based actuator that uses a trained MLP to predict torque outputs from joint state history. Useful when analytical models can’t capture complex actuator dynamics like delays, nonlinearities, and friction effects. Inherits DC motor velocity-based torque limits.

from mjlab.actuator import LearnedMlpActuatorCfg

actuators = (
    LearnedMlpActuatorCfg(
        joint_names_expr=(".*_ankle_.*",),
        network_file="models/ankle_actuator.pt",  # TorchScript model
        pos_scale=1.0,        # Input scaling for position errors
        vel_scale=0.05,       # Input scaling for velocities
        torque_scale=10.0,    # Output scaling for torques
        input_order="pos_vel",
        history_length=3,     # Use current + 2 previous timesteps
        saturation_effort=50.0,
        velocity_limit=30.0,
        effort_limit=25.0,
    ),
)

LearnedMlpActuator parameters:

network_file: Path to TorchScript MLP model (.pt file)
pos_scale: Scaling factor for position error inputs
vel_scale: Scaling factor for velocity inputs
torque_scale: Scaling factor for network torque outputs
input_order: pos_vel (position then velocity) or vel_pos
history_length: Number of timesteps to use (e.g., 3 = current + 2 past)
saturation_effort, velocity_limit, effort_limit: Same as DcMotorActuator

The network receives scaled inputs [pos_error[t], pos_error[t-1], ..., vel[t], vel[t-1], ...] and outputs torques that are scaled and clipped by DC motor limits.

XML Actuators#

XML actuators wrap actuators already defined in your robot’s XML file. The config finds existing actuators by matching their target joint name against the joint_names_expr patterns. Each joint must have exactly one matching actuator.

XmlPositionActuator: Wraps existing <position> actuators

XmlVelocityActuator: Wraps existing <velocity> actuators

XmlMotorActuator: Wraps existing <motor> actuators

from mjlab.actuator import XmlPositionActuatorCfg

# Robot XML already has:
# <actuator>
#   <position name="hip_joint" joint="hip_joint" kp="100"/>
# </actuator>

# Wrap existing XML actuators.
actuators = (
    XmlPositionActuatorCfg(joint_names_expr=("hip_joint",)),
)

Delayed Actuator#

Generic wrapper that adds command delays to any actuator. Useful for modeling actuator latency and communication delays. The delay operates on command targets before they reach the actuator’s control law.

from mjlab.actuator import DelayedActuatorCfg, IdealPdActuatorCfg

# Add 2-5 step delay to position commands.
actuators = (
    DelayedActuatorCfg(
        base_cfg=IdealPdActuatorCfg(
            joint_names_expr=(".*",),
            stiffness=80.0,
            damping=10.0,
        ),
        delay_target="position",     # Delay position commands
        delay_min_lag=2,
        delay_max_lag=5,
        delay_hold_prob=0.3,         # 30% chance to keep previous lag
        delay_update_period=10,      # Update lag every 10 steps
    ),
)

Multi-target delays:

DelayedActuatorCfg(
    base_cfg=IdealPdActuatorCfg(...),
    delay_target=("position", "velocity", "effort"),
    delay_min_lag=2,
    delay_max_lag=5,
)

Delays are quantized to physics timesteps. For example, with 500Hz physics (2ms/step), delay_min_lag=2 represents a 4ms minimum delay.

Note

Each target gets an independent delay buffer with its own lag schedule. This provides maximum flexibility for modeling different latency characteristics for position, velocity, and effort commands.

PD Control and Integrator Choice#

The distinction between built-in and explicit PD control only makes sense in the context of how MuJoCo integrates velocity-dependent forces. This section explains how each actuator style interacts with the integrator, and why mjlab uses <implicitfast> by default.

Built-in vs Explicit PD Control#

BuiltinPositionActuator uses MuJoCo’s internal PD implementation:

Creates <position> actuators in the MjSpec
Physics engine computes the PD law and integrates velocity-dependent damping forces implicitly

IdealPdActuator implements PD control explicitly:

Creates <motor> actuators in the MjSpec
Computes torques explicitly: τ = Kp·pos_error + Kd·vel_error
The integrator cannot account for the velocity derivatives of these forces

They match closely in the linear, unconstrained regime and small time steps. However, built-in PD is more numerically robust and as such can be used with larger gains and larger timesteps.

Integrator Behavior in MuJoCo#

The choice of integrator in MuJoCo strongly affects stability for velocity-dependent forces:

euler is semi-implicit but treats joint damping implicitly. Other forces, including explicit actuator damping, are integrated explicitly.
implicitfast treats all known velocity-dependent forces implicitly, stabilizing systems with large damping or stiff actuation.

mjlab Recommendation#

mjlab actuators apply damping inside the actuator (not in joints). Because of this, Euler cannot integrate the damping implicitly, making it less stable. The implicitfast integrator, however, handles both proportional and damping terms of the actuator implicitly, improving stability without additional cost.

Note

mjlab defaults to <implicitfast>, as it is MuJoCo’s recommended integrator and provides superior stability for actuator-side damping.

Authoring Actuator Configs#

Since actuator parameters are uniform within each config, use separate actuator configs for joints that need different parameters:

from mjlab.actuator import BuiltinPositionActuatorCfg

# G1 humanoid with different gains per joint group.
G1_ACTUATORS = (
    BuiltinPositionActuatorCfg(
        joint_names_expr=(".*_hip_.*", "waist_yaw_joint"),
        stiffness=180.0,
        damping=18.0,
        effort_limit=88.0,
        armature=0.0015,
    ),
    BuiltinPositionActuatorCfg(
        joint_names_expr=("left_hip_pitch_joint", "right_hip_pitch_joint"),
        stiffness=200.0,
        damping=20.0,
        effort_limit=88.0,
        armature=0.0015,
    ),
    BuiltinPositionActuatorCfg(
        joint_names_expr=(".*_knee_joint",),
        stiffness=150.0,
        damping=15.0,
        effort_limit=139.0,
        armature=0.0025,
    ),
    BuiltinPositionActuatorCfg(
        joint_names_expr=(".*_ankle_.*",),
        stiffness=40.0,
        damping=5.0,
        effort_limit=25.0,
        armature=0.0008,
    ),
)

This design choice reflects a deliberate simplification in mjlab: each ActuatorCfg represents a single actuator type (e.g., a specific motor/gearbox model) applied uniformly across all joints it drives. Hardware parameters such as armature (reflected rotor inertia) and gear describe properties of the actuator hardware, even though they are implemented in MuJoCo as joint or actuator fields. In other frameworks (like Isaac Lab), these fields may accept float | dict[str, float] to support per-joint variation. mjlab instead encourages one config per actuator type or per joint group, keeping the hardware model physically consistent and explicit. The main trade-off is verbosity in special cases, such as parallel linkages, where per-joint overrides could have been convenient, but the benefit is clearer semantics and simpler maintenance.

Computing Hardware Parameters from Motor Specs#

mjlab provides utilities in mjlab.utils.actuator to compute actuator parameters from physical motor specifications. This is particularly useful for computing reflected inertia (armature) and deriving appropriate control gains from hardware datasheets.

Example: Unitree G1 motor configuration

from math import pi

from mjlab.utils.actuator import (
    reflected_inertia_from_two_stage_planetary,
    ElectricActuator
)

# Motor specs from manufacturer datasheet.
ROTOR_INERTIAS_7520_14 = (
    0.489e-4,  # Motor rotor inertia (kg·m**2)
    0.098e-4,  # Planet carrier inertia
    0.533e-4,  # Output stage inertia
)
GEARS_7520_14 = (
    1,            # First stage (motor to planet)
    4.5,          # Second stage (planet to carrier)
    1 + (48/22),  # Third stage (carrier to output)
)

# Compute reflected inertia at joint output.
# J_reflected = J_motor*(N₁*N₂)**2 + J_carrier*N₂**2 + J_output.
ARMATURE_7520_14 = reflected_inertia_from_two_stage_planetary(
    ROTOR_INERTIAS_7520_14, GEARS_7520_14
)

# Create motor spec container.
ACTUATOR_7520_14 = ElectricActuator(
    reflected_inertia=ARMATURE_7520_14,
    velocity_limit=32.0,   # rad/s at joint
    effort_limit=88.0,     # N·m continuous torque
)

# Derive PD gains from natural frequency and damping ratio.
NATURAL_FREQ = 10 * 2*pi  # 10 Hz bandwidth.
DAMPING_RATIO = 2.0       # Overdamped, see note below.
STIFFNESS = ARMATURE_7520_14 * NATURAL_FREQ**2
DAMPING = 2 * DAMPING_RATIO * ARMATURE_7520_14 * NATURAL_FREQ

# Use in actuator config.
from mjlab.actuator import BuiltinPositionActuatorCfg

actuator = BuiltinPositionActuatorCfg(
    joint_names_expr=(".*_hip_pitch_joint",),
    stiffness=STIFFNESS,
    damping=DAMPING,
    effort_limit=ACTUATOR_7520_14.effort_limit,
    armature=ACTUATOR_7520_14.reflected_inertia,
)

Note

The example uses DAMPING_RATIO = 2.0 (overdamped) rather than the critically damped value of 1.0. This is because the reflected inertia calculation only accounts for the motor’s rotor inertia, not the apparent inertia of the links being moved. In practice, the total effective inertia at the joint is higher than just the reflected motor inertia, so using an overdamped ratio provides better stability margins when the true system inertia is underestimated.

Parallel linkage approximation:

For joints driven by parallel linkages (like the G1’s ankles with dual motors), the effective armature in the nominal configuration can be approximated as the sum of the individual motor armatures:

# Two 5020 motors driving ankle through parallel linkage.
G1_ACTUATOR_ANKLE = BuiltinPositionActuatorCfg(
    joint_names_expr=(".*_ankle_pitch_joint", ".*_ankle_roll_joint"),
    stiffness=STIFFNESS_5020 * 2,
    damping=DAMPING_5020 * 2,
    effort_limit=ACTUATOR_5020.effort_limit * 2,
    armature=ACTUATOR_5020.reflected_inertia * 2,
)

Using Actuators in Environments#

Action Terms#

Actuators are typically controlled via action terms in the action manager:

from mjlab.envs.mdp.actions import JointPositionActionCfg

JointPositionActionCfg(
    asset_name="robot",
    actuator_names=(".*",),  # Regex patterns for joint selection
    scale=1.0,
    use_default_offset=True,  # Use robot's default joint positions as offset
)

Available action terms:

JointPositionAction: Sets position targets (for PD actuators)
JointVelocityAction: Sets velocity targets (for velocity actuators)
JointEffortAction: Sets effort/torque targets (for torque actuators)

The action manager calls entity.set_joint_position_target(), set_joint_velocity_target(), or set_joint_effort_target() under the hood, which populate the ActuatorCmd passed to each actuator’s compute() method.

Domain Randomization#

from mjlab.envs.mdp import events
from mjlab.managers.manager_term_config import EventTermCfg
from mjlab.managers.scene_entity_config import SceneEntityCfg

EventTermCfg(
    func=events.randomize_pd_gains,
    mode="reset",
    params={
        "asset_cfg": SceneEntityCfg("robot", actuator_names=(".*",)),
        "kp_range": (0.8, 1.2),
        "kd_range": (0.8, 1.2),
        "distribution": "uniform",
        "operation": "scale",  # or "abs" for absolute values
    },
)

EventTermCfg(
    func=events.randomize_effort_limits,
    mode="reset",
    params={
        "asset_cfg": SceneEntityCfg("robot", actuator_names=(".*_leg_.*",)),
        "effort_limit_range": (0.7, 1.0),  # Reduce effort by 0-30%
        "operation": "scale",
    },
)

Actuators

Contents

Actuators#

Choosing an Actuator Type#

TL;DR#

Actuator Interface#

Actuator Types#

Built-in Actuators#

Explicit Actuators#

XML Actuators#

Delayed Actuator#

PD Control and Integrator Choice#

Built-in vs Explicit PD Control#

Integrator Behavior in MuJoCo#

mjlab Recommendation#

Authoring Actuator Configs#

Computing Hardware Parameters from Motor Specs#

Using Actuators in Environments#

Action Terms#

Domain Randomization#