Acrobot Swing-Up#

Mathematical Formulation#

Problem Statement#

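Find the optimal elbow torque \(\tau(t)\) that swings the acrobot up from rest near the hanging configuration (shoulder displaced \(\pi/6\) rad from the downward vertical, elbow straight) to the inverted equilibrium, while minimizing a weighted sum of final time and control effort: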

\[J = t_f + 0.01 \int_0^{t_f} \tau^2 dt\]
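To get a feel for the trade-off in this cost, the sketch below evaluates \(J\) for a sampled torque history with the trapezoidal rule. The helper name `objective` and the sample trajectory are hypothetical and are not part of MAPTOR. With the weight 0.01 and the torque bound of 20 N⋅m, a saturated torque costs \(0.01 \cdot 20^2 = 4\) per second, four times the unit cost of time itself.

```python
def objective(times, torques, weight=0.01):
    """Evaluate J = t_f + weight * integral(tau^2 dt) with the trapezoidal rule."""
    effort = 0.0
    for k in range(len(times) - 1):
        dt = times[k + 1] - times[k]
        effort += 0.5 * dt * (torques[k] ** 2 + torques[k + 1] ** 2)
    return (times[-1] - times[0]) + weight * effort


# Hypothetical trajectory: torque held at the 20 N*m bound for 2 s
times = [0.0, 0.5, 1.0, 1.5, 2.0]
torques = [20.0] * len(times)

J = objective(times, torques)
print(f"J = {J:.3f}")  # time term 2 plus effort term 0.01 * 400 * 2 = 8
```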

Subject to the underactuated acrobot dynamics:

\[\frac{d\theta_1}{dt} = \dot{\theta}_1\]

\[\frac{d\theta_2}{dt} = \dot{\theta}_2\]

\[\frac{d\dot{\theta}_1}{dt} = \frac{-I_2 \left( g l_1 m_2 \sin(\theta_1) + g l_{c1} m_1 \sin(\theta_1) + g l_{c2} m_2 \sin(\theta_1 + \theta_2) - l_1 l_{c2} m_2 (2\dot{\theta}_1 + \dot{\theta}_2) \sin(\theta_2) \dot{\theta}_2 \right) + (I_2 + l_1 l_{c2} m_2 \cos(\theta_2)) \left( g l_{c2} m_2 \sin(\theta_1 + \theta_2) + l_1 l_{c2} m_2 \sin(\theta_2) \dot{\theta}_1^2 - \tau \right)}{I_1 I_2 + I_2 l_1^2 m_2 - l_1^2 l_{c2}^2 m_2^2 \cos^2(\theta_2)}\]

\[\frac{d\dot{\theta}_2}{dt} = \frac{(I_2 + l_1 l_{c2} m_2 \cos(\theta_2)) \left( g l_1 m_2 \sin(\theta_1) + g l_{c1} m_1 \sin(\theta_1) + g l_{c2} m_2 \sin(\theta_1 + \theta_2) - l_1 l_{c2} m_2 (2\dot{\theta}_1 + \dot{\theta}_2) \sin(\theta_2) \dot{\theta}_2 \right) - \left( g l_{c2} m_2 \sin(\theta_1 + \theta_2) + l_1 l_{c2} m_2 \sin(\theta_2) \dot{\theta}_1^2 - \tau \right) (I_1 + I_2 + l_1^2 m_2 + 2 l_1 l_{c2} m_2 \cos(\theta_2))}{I_1 I_2 + I_2 l_1^2 m_2 - l_1^2 l_{c2}^2 m_2^2 \cos^2(\theta_2)}\]
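The two acceleration expressions above are the solution of the manipulator equation \(M(q)\ddot{q} = \tau_g(q) - C(q,\dot{q})\dot{q} + Bu\) for \(\ddot{q}\). As a sanity check, the sketch below transcribes the closed-form expressions into plain Python and confirms numerically that they satisfy that equation. The function names `accelerations` and `residual` are illustrative, and the parameter values are the ones listed under Physical Parameters.

```python
import math

# Parameter values from the Physical Parameters section of this page
m1, m2 = 1.0, 1.0
l1, l2 = 1.0, 1.0
lc1, lc2 = 0.5, 0.5
I1, I2 = m1 * l1**2 / 3, m2 * l2**2 / 3
g = 9.81


def accelerations(theta1, theta2, w1, w2, tau):
    """Closed-form (theta1_ddot, theta2_ddot), transcribed from the equations above."""
    c2, s2 = math.cos(theta2), math.sin(theta2)
    a = (
        g * l1 * m2 * math.sin(theta1)
        + g * lc1 * m1 * math.sin(theta1)
        + g * lc2 * m2 * math.sin(theta1 + theta2)
        - l1 * lc2 * m2 * (2 * w1 + w2) * s2 * w2
    )
    b = g * lc2 * m2 * math.sin(theta1 + theta2) + l1 * lc2 * m2 * s2 * w1**2 - tau
    m11 = I1 + I2 + l1**2 * m2 + 2 * l1 * lc2 * m2 * c2
    m12 = I2 + l1 * lc2 * m2 * c2
    den = I1 * I2 + I2 * l1**2 * m2 - l1**2 * lc2**2 * m2**2 * c2**2
    return (-I2 * a + m12 * b) / den, (m12 * a - b * m11) / den


def residual(theta1, theta2, w1, w2, tau):
    """M(q) q_ddot minus the right-hand side (gravity + velocity terms + B u)."""
    dd1, dd2 = accelerations(theta1, theta2, w1, w2, tau)
    c2, s2 = math.cos(theta2), math.sin(theta2)
    m11 = I1 + I2 + m2 * l1**2 + 2 * m2 * l1 * lc2 * c2
    m12 = I2 + m2 * l1 * lc2 * c2
    grav1 = -g * (
        m1 * lc1 * math.sin(theta1)
        + m2 * (l1 * math.sin(theta1) + lc2 * math.sin(theta1 + theta2))
    )
    grav2 = -g * m2 * lc2 * math.sin(theta1 + theta2)
    f1 = grav1 + m2 * l1 * lc2 * (2 * w1 + w2) * s2 * w2
    f2 = grav2 - m2 * l1 * lc2 * s2 * w1**2 + tau
    return m11 * dd1 + m12 * dd2 - f1, m12 * dd1 + I2 * dd2 - f2


# Arbitrary test state (not a point on any particular trajectory)
r1, r2 = residual(0.7, -1.2, 0.4, -0.9, 3.0)
print(f"residuals: {r1:.2e}, {r2:.2e}")
```

Both residuals should vanish to floating-point precision. The same `accelerations` function also returns zero at the hanging configuration with zero torque, as expected for an equilibrium.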

Boundary Conditions#

Initial conditions: \(\theta_1(0) = \frac{\pi}{6}\) rad, \(\theta_2(0) = 0\) rad, \(\dot{\theta}_1(0) = 0\) rad/s, \(\dot{\theta}_2(0) = 0\) rad/s
Final conditions: \(\theta_1(t_f) = \pi\) rad, \(\theta_2(t_f) = 0\) rad, \(\dot{\theta}_1(t_f) = 0\) rad/s, \(\dot{\theta}_2(t_f) = 0\) rad/s
Control bounds: \(-20.0 \leq \tau \leq 20.0\) N⋅m

Physical Parameters#

Upper arm mass: \(m_1 = 1.0\) kg
Forearm mass: \(m_2 = 1.0\) kg
Upper arm length: \(l_1 = 1.0\) m
Forearm length: \(l_2 = 1.0\) m
Upper arm COM distance: \(l_{c1} = 0.5\) m
Forearm COM distance: \(l_{c2} = 0.5\) m
Upper arm inertia about shoulder: \(I_1 = \frac{m_1 l_1^2}{3} \approx 0.333\) kg⋅m²
Forearm inertia about elbow: \(I_2 = \frac{m_2 l_2^2}{3} \approx 0.333\) kg⋅m²
Gravity: \(g = 9.81\) m/s²
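With these values, a short calculation gives the pivot inertias and the potential-energy gap between the initial and final configurations. Because both boundary states are at rest and the shoulder is passive, this gap is the net work the elbow torque has to deliver. The sketch below is illustrative only; the function name `potential_energy` and the choice of zero potential at shoulder height are assumptions made for this example.

```python
import math

m1, m2 = 1.0, 1.0      # kg
l1, l2 = 1.0, 1.0      # m
lc1, lc2 = 0.5, 0.5    # m
g = 9.81               # m/s^2

# Uniform slender rod about one end: I = m * l^2 / 3
I1 = m1 * l1**2 / 3
I2 = m2 * l2**2 / 3


def potential_energy(theta1, theta2):
    """Gravitational potential energy with zero at shoulder height.

    Angles follow the page's convention: theta1 from the downward vertical,
    theta2 relative to the upper arm.
    """
    y_com1 = -lc1 * math.cos(theta1)
    y_com2 = -l1 * math.cos(theta1) - lc2 * math.cos(theta1 + theta2)
    return g * (m1 * y_com1 + m2 * y_com2)


v_initial = potential_energy(math.pi / 6, 0.0)  # initial boundary condition
v_final = potential_energy(math.pi, 0.0)        # inverted target
energy_gap = v_final - v_initial

print(f"I1 = {I1:.3f} kg*m^2, I2 = {I2:.3f} kg*m^2")
print(f"Potential energy gap: {energy_gap:.2f} J")
```

The gap comes out at roughly 36.6 J: the final state sits at \(+2 g\) in these units and the initial state at \(-2 g \cos(\pi/6)\).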

State Variables#

\(\theta_1(t)\): Shoulder joint angle from downward vertical (rad)
\(\theta_2(t)\): Elbow joint angle relative to upper arm (rad)
\(\dot{\theta}_1(t)\): Shoulder joint angular velocity (rad/s)
\(\dot{\theta}_2(t)\): Elbow joint angular velocity (rad/s)

Control Variable#

\(\tau(t)\): Elbow joint torque (N⋅m)

Notes#

The acrobot is a classic underactuated system: only the elbow joint is actuated, while the shoulder joint is passive. Swing-up therefore has to exploit the nonlinear inertial coupling between the two links to pump energy into the shoulder motion and reach the inverted configuration. The problem illustrates how dynamic coupling can keep an underactuated system controllable despite it having fewer actuators than degrees of freedom.
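One way to probe this coupling is to check the Kalman rank condition for the dynamics linearized about the inverted equilibrium \((\theta_1, \theta_2) = (\pi, 0)\). There the model reduces to \(\Delta\ddot{q} = A_2 \Delta q + b\,u\) with \(A_2 = M^{-1} K\) and \(b = M^{-1} [0, 1]^T\), where \(K\) is the Jacobian of the gravity vector. For a system of this second-order form, the controllability matrix \([B, AB, A^2B, A^3B]\) has rank 4 exactly when \(b\) and \(A_2 b\) are linearly independent. The sketch below checks that condition in plain Python for the parameter values on this page. It only tests local controllability of the linearization and says nothing about the global swing-up; the helper `solve_m` is illustrative.

```python
# Parameter values from the Physical Parameters section
m1, m2 = 1.0, 1.0
l1 = 1.0
lc1, lc2 = 0.5, 0.5
I1, I2 = 1.0 / 3.0, 1.0 / 3.0
g = 9.81

# Mass matrix at the inverted equilibrium (theta2 = 0, so cos(theta2) = 1)
m11 = I1 + I2 + m2 * l1**2 + 2 * m2 * l1 * lc2
m12 = I2 + m2 * l1 * lc2
m22 = I2
det_m = m11 * m22 - m12 * m12

# Jacobian K of the gravity vector at (pi, 0); cos(pi) = -1 flips the signs
k11 = g * (m1 * lc1 + m2 * l1 + m2 * lc2)
k12 = g * m2 * lc2
k22 = g * m2 * lc2


def solve_m(r1, r2):
    """Return M^{-1} [r1, r2]^T using the explicit 2x2 inverse."""
    return (m22 * r1 - m12 * r2) / det_m, (-m12 * r1 + m11 * r2) / det_m


# b = M^{-1} B with B = [0, 1]^T (torque acts on the elbow only)
b = solve_m(0.0, 1.0)
# A2 b = M^{-1} K b
a2b = solve_m(k11 * b[0] + k12 * b[1], k12 * b[0] + k22 * b[1])

# b and A2 b are independent iff this 2x2 determinant is nonzero
independence = b[0] * a2b[1] - b[1] * a2b[0]
locally_controllable = abs(independence) > 1e-9

print(f"det [b, A2 b] = {independence:.3f}")
print(f"Linearization controllable: {locally_controllable}")
```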

Dynamics Derivation#

The acrobot dynamics were derived with Lagrangian mechanics in SymPy. The script builds the mass matrix, gravity vector, and Coriolis terms, and prints each one next to the corresponding expression from the literature so the two can be compared term by term:

examples/acrobot/acrobot_dynamics.py#

import sympy as sm
import sympy.physics.mechanics as me

from maptor.mechanics import lagrangian_to_maptor_dynamics


# ============================================================================
# Physical Parameters
# ============================================================================

# Link masses (kg)
m1, m2 = sm.symbols("m1 m2")

# Link lengths (m)
l1, l2 = sm.symbols("l1 l2")

# Center of mass distances (m)
lc1, lc2 = sm.symbols("lc1 lc2")

# Moments of inertia about pivots (kg⋅m²)
I1, I2 = sm.symbols("I1 I2")

# Gravity
g = sm.symbols("g")

# Control torque (only on second joint)
tau = sm.symbols("tau")

# Joint coordinates: theta1 (shoulder from vertical), theta2 (elbow relative)
theta1, theta2 = me.dynamicsymbols("theta1 theta2")
theta1d, theta2d = me.dynamicsymbols("theta1 theta2", 1)


# ============================================================================
# Reference Frames
# ============================================================================

# N: Inertial frame (Y-axis points up, X-axis points right)
N = me.ReferenceFrame("N")

# A: Link 1 frame (rotated by theta1 from vertical)
# Positive theta1 rotates counterclockwise from downward vertical (-Y direction)
A = N.orientnew("A", "Axis", (theta1, N.z))

# B: Link 2 frame (rotated by theta2 relative to link 1)
# Positive theta2 rotates counterclockwise relative to link 1
B = A.orientnew("B", "Axis", (theta2, A.z))


# ============================================================================
# Points and Velocities
# ============================================================================

# Fixed shoulder joint (origin)
O = me.Point("O")
O.set_vel(N, 0)

# Elbow joint (end of link 1)
# Link 1 extends in -A.y direction (downward in link 1 frame)
P1 = O.locatenew("P1", l1 * (-A.y))
P1.v2pt_theory(O, N, A)

# End effector (end of link 2)
# Link 2 extends in -B.y direction (downward in link 2 frame)
P2 = P1.locatenew("P2", l2 * (-B.y))
P2.v2pt_theory(P1, N, B)

# Centers of mass
# Link 1 COM: distance lc1 from shoulder along link 1
G1 = O.locatenew("G1", lc1 * (-A.y))
G1.v2pt_theory(O, N, A)

# Link 2 COM: distance lc2 from elbow along link 2
G2 = P1.locatenew("G2", lc2 * (-B.y))
G2.v2pt_theory(P1, N, B)


# ============================================================================
# Rigid Bodies
# ============================================================================

# Link 1: Inertia about shoulder pivot
I1_dyadic = I1 * me.inertia(A, 0, 0, 1)
link1_body = me.RigidBody("link1", G1, A, m1, (I1_dyadic, O))  # Inertia about pivot O

# Link 2: Inertia about elbow pivot
I2_dyadic = I2 * me.inertia(B, 0, 0, 1)
link2_body = me.RigidBody("link2", G2, B, m2, (I2_dyadic, P1))  # Inertia about pivot P1


# ============================================================================
# Forces (Passive Only)
# ============================================================================

# Gravitational forces on centers of mass (gravity acts in -N.y direction)
loads = [
    (G1, -m1 * g * N.y),  # Gravity on link 1 COM
    (G2, -m2 * g * N.y),  # Gravity on link 2 COM
]


# ============================================================================
# Lagrangian Mechanics
# ============================================================================

L = me.Lagrangian(N, link1_body, link2_body)
LM = me.LagrangesMethod(L, [theta1, theta2], forcelist=loads, frame=N)


# ============================================================================
# Control Forces
# ============================================================================

# Acrobot: only second joint (elbow) is actuated
# No torque on theta1 (shoulder), torque tau on theta2 (elbow)
control_forces = sm.Matrix([0, tau])


# ============================================================================
# Term-by-Term Verification
# ============================================================================

print("=== ACROBOT TERM-BY-TERM VERIFICATION ===")
print("Literature reference: https://underactuated.csail.mit.edu/acrobot.html#section1")
print()

# Form the equations to access components
LM.form_lagranges_equations()

# Extract mass matrix M(q)
M = LM.mass_matrix
print("MASS MATRIX M(q):")
print("Literature equation (8):")
print("M11 = I1 + I2 + m2*l1² + 2*m2*l1*lc2*cos(θ2)")
print("M12 = I2 + m2*l1*lc2*cos(θ2)")
print("M21 = I2 + m2*l1*lc2*cos(θ2)")
print("M22 = I2")
print()
print("SymPy generated:")
print(f"M11 = {sm.simplify(M[0, 0])}")
print(f"M12 = {sm.simplify(M[0, 1])}")
print(f"M21 = {sm.simplify(M[1, 0])}")
print(f"M22 = {sm.simplify(M[1, 1])}")
print()

# Extract gravity vector by setting velocities to zero
gravity_forcing = LM.forcing.subs([(theta1d, 0), (theta2d, 0)])
print("GRAVITY VECTOR τg(q):")
print("Literature equation (10):")
print("τg1 = -m1*g*lc1*sin(θ1) - m2*g*(l1*sin(θ1) + lc2*sin(θ1+θ2))")
print("τg2 = -m2*g*lc2*sin(θ1+θ2)")
print()
print("SymPy generated:")
print(f"τg1 = {sm.simplify(gravity_forcing[0])}")
print(f"τg2 = {sm.simplify(gravity_forcing[1])}")
print()

# Extract Coriolis terms by subtracting gravity from total forcing
total_forcing = LM.forcing
coriolis_forcing = sm.simplify(total_forcing - gravity_forcing)
print("CORIOLIS TERMS (velocity-dependent):")
print("Literature: C(q,q̇)*q̇ where")
print("C11 = -2*m2*l1*lc2*sin(θ2)*θ̇2")
print("C12 = -m2*l1*lc2*sin(θ2)*θ̇2")
print("C21 = m2*l1*lc2*sin(θ2)*θ̇1")
print("C22 = 0")
print()
print("SymPy generated Coriolis forcing:")
print(f"Coriolis1 = {sm.simplify(coriolis_forcing[0])}")
print(f"Coriolis2 = {sm.simplify(coriolis_forcing[1])}")
print()

# Verify equation structure
print("COMPLETE EQUATION VERIFICATION:")
print("Literature form: M(q)*q̈ + C(q,q̇)*q̇ = τg(q) + B*u")
print("Where B = [0, 1]ᵀ and u = τ (elbow torque)")
print()


# ============================================================================
# Convert to MAPTOR Format
# ============================================================================

print("=== MAPTOR DYNAMICS GENERATION ===")
lagrangian_to_maptor_dynamics(LM, [theta1, theta2], control_forces, "acrobot_dynamics.txt")

"""
=== ACROBOT TERM-BY-TERM VERIFICATION ===
Literature reference: https://underactuated.csail.mit.edu/acrobot.html#section1

MASS MATRIX M(q):
Literature equation (8):
M11 = I1 + I2 + m2*l1² + 2*m2*l1*lc2*cos(θ2)
M12 = I2 + m2*l1*lc2*cos(θ2)
M21 = I2 + m2*l1*lc2*cos(θ2)
M22 = I2

SymPy generated:
M11 = I1 + I2 + l1**2*m2 + 2*l1*lc2*m2*cos(theta2(t))
M12 = I2 + l1*lc2*m2*cos(theta2(t))
M21 = I2 + l1*lc2*m2*cos(theta2(t))
M22 = I2

GRAVITY VECTOR τg(q):
Literature equation (10):
τg1 = -m1*g*lc1*sin(θ1) - m2*g*(l1*sin(θ1) + lc2*sin(θ1+θ2))
τg2 = -m2*g*lc2*sin(θ1+θ2)

SymPy generated:
τg1 = -g*(l1*m2*sin(theta1(t)) + lc1*m1*sin(theta1(t)) + lc2*m2*sin(theta1(t) + theta2(t)))
τg2 = -g*lc2*m2*sin(theta1(t) + theta2(t))

CORIOLIS TERMS (velocity-dependent):
Literature: C(q,q̇)*q̇ where
C11 = -2*m2*l1*lc2*sin(θ2)*θ̇2
C12 = -m2*l1*lc2*sin(θ2)*θ̇2
C21 = m2*l1*lc2*sin(θ2)*θ̇1
C22 = 0

SymPy generated Coriolis forcing:
Coriolis1 = l1*lc2*m2*(2*Derivative(theta1(t), t) + Derivative(theta2(t), t))*sin(theta2(t))*Derivative(theta2(t), t)
Coriolis2 = -l1*lc2*m2*sin(theta2(t))*Derivative(theta1(t), t)**2

COMPLETE EQUATION VERIFICATION:
Literature form: M(q)*q̈ + C(q,q̇)*q̇ = τg(q) + B*u
Where B = [0, 1]ᵀ and u = τ (elbow torque)

=== MAPTOR DYNAMICS GENERATION ===
CasADi MAPTOR Dynamics:
============================================================

State variables:
theta1 = phase.state('theta1')
theta2 = phase.state('theta2')
theta1_dot = phase.state('theta1_dot')
theta2_dot = phase.state('theta2_dot')

Control variables:
tau = phase.control('tau')

MAPTOR dynamics dictionary:
phase.dynamics(
    {
        theta1: theta1_dot,
        theta2: theta2_dot,
        theta1_dot: (
            -I2
            * (
                g * l1 * m2 * ca.sin(theta1)
                + g * lc1 * m1 * ca.sin(theta1)
                + g * lc2 * m2 * ca.sin(theta1 + theta2)
                - l1 * lc2 * m2 * (2 * theta1_dot + theta2_dot) * ca.sin(theta2) * theta2_dot
            )
            + (I2 + l1 * lc2 * m2 * ca.cos(theta2))
            * (
                g * lc2 * m2 * ca.sin(theta1 + theta2)
                + l1 * lc2 * m2 * ca.sin(theta2) * theta1_dot**2
                - tau
            )
        )
        / (I1 * I2 + I2 * l1**2 * m2 - l1**2 * lc2**2 * m2**2 * ca.cos(theta2) ** 2),
        theta2_dot: (
            (I2 + l1 * lc2 * m2 * ca.cos(theta2))
            * (
                g * l1 * m2 * ca.sin(theta1)
                + g * lc1 * m1 * ca.sin(theta1)
                + g * lc2 * m2 * ca.sin(theta1 + theta2)
                - l1 * lc2 * m2 * (2 * theta1_dot + theta2_dot) * ca.sin(theta2) * theta2_dot
            )
            - (
                g * lc2 * m2 * ca.sin(theta1 + theta2)
                + l1 * lc2 * m2 * ca.sin(theta2) * theta1_dot**2
                - tau
            )
            * (I1 + I2 + l1**2 * m2 + 2 * l1 * lc2 * m2 * ca.cos(theta2))
        )
        / (I1 * I2 + I2 * l1**2 * m2 - l1**2 * lc2**2 * m2**2 * ca.cos(theta2) ** 2),
    }
)
"""

This symbolic derivation produces the coupled dynamics equations used in the swing-up implementation. Each generated term can be checked against the corresponding expression in MIT’s Underactuated Robotics course notes, and the printed output shows every intermediate quantity that goes into the final equations.
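A complementary numerical check on the generated equations is energy conservation: with \(\tau = 0\) the model has no dissipation, so \(\tfrac{1}{2}\dot{q}^T M(q)\dot{q} + V(q)\) should stay constant along any trajectory. The sketch below integrates the generated dynamics with a hand-written fixed-step RK4 scheme in plain Python and reports the energy drift. The integrator, step size, and function names are choices made for this illustration and are unrelated to how MAPTOR discretizes the problem.

```python
import math

m1, m2 = 1.0, 1.0
l1 = 1.0
lc1, lc2 = 0.5, 0.5
I1, I2 = 1.0 / 3.0, 1.0 / 3.0
g = 9.81


def inertia_terms(theta2):
    """Entries M11 and M12 of the mass matrix (M22 is simply I2)."""
    c2 = math.cos(theta2)
    return I1 + I2 + l1**2 * m2 + 2 * l1 * lc2 * m2 * c2, I2 + l1 * lc2 * m2 * c2


def derivatives(state, tau=0.0):
    """State derivative using the generated closed-form accelerations."""
    theta1, theta2, w1, w2 = state
    s1, s2, s12 = math.sin(theta1), math.sin(theta2), math.sin(theta1 + theta2)
    m11, m12 = inertia_terms(theta2)
    a = (
        g * (l1 * m2 + lc1 * m1) * s1
        + g * lc2 * m2 * s12
        - l1 * lc2 * m2 * (2 * w1 + w2) * s2 * w2
    )
    b = g * lc2 * m2 * s12 + l1 * lc2 * m2 * s2 * w1**2 - tau
    den = I1 * I2 + I2 * l1**2 * m2 - (l1 * lc2 * m2 * math.cos(theta2)) ** 2
    return (w1, w2, (-I2 * a + m12 * b) / den, (m12 * a - m11 * b) / den)


def total_energy(state):
    """Kinetic energy 0.5 * qd^T M qd plus gravitational potential energy."""
    theta1, theta2, w1, w2 = state
    m11, m12 = inertia_terms(theta2)
    kinetic = 0.5 * (m11 * w1**2 + 2 * m12 * w1 * w2 + I2 * w2**2)
    potential = -g * (
        (m1 * lc1 + m2 * l1) * math.cos(theta1) + m2 * lc2 * math.cos(theta1 + theta2)
    )
    return kinetic + potential


def rk4_step(state, dt):
    """One classical fourth-order Runge-Kutta step with zero torque."""
    def shifted(k, scale):
        return tuple(s + scale * ki for s, ki in zip(state, k))

    k1 = derivatives(state)
    k2 = derivatives(shifted(k1, dt / 2))
    k3 = derivatives(shifted(k2, dt / 2))
    k4 = derivatives(shifted(k3, dt))
    return tuple(
        s + dt / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


# Release from the example's initial condition and let the links swing freely
state = (math.pi / 6, 0.0, 0.0, 0.0)
e0 = total_energy(state)
for _ in range(2000):  # 2 s at dt = 1 ms
    state = rk4_step(state, 1e-3)

drift = abs(total_energy(state) - e0)
print(f"Energy drift after 2 s of free motion: {drift:.2e} J")
```

A drift that is many orders of magnitude below the initial energy suggests the gravity, Coriolis, and inertia terms are mutually consistent; a sign error in any of them would typically show up as steady energy growth or decay.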

Running This Example#

cd examples/acrobot
python acrobot.py
python acrobot_animate.py

Code Implementation#

examples/acrobot/acrobot.py#

import casadi as ca
import numpy as np

import maptor as mtor


# ============================================================================
# Physical Parameters
# ============================================================================

# Link masses (kg)
m1 = 1.0  # Upper arm mass
m2 = 1.0  # Forearm mass

# Link lengths (m)
l1 = 1.0  # Upper arm length
l2 = 1.0  # Forearm length

# Center of mass distances (m)
lc1 = 0.5  # Upper arm COM distance from shoulder
lc2 = 0.5  # Forearm COM distance from elbow

# Moments of inertia about pivots (kg⋅m²)
I1 = m1 * l1**2 / 3  # Upper arm about shoulder
I2 = m2 * l2**2 / 3  # Forearm about elbow

# Gravity
g = 9.81  # m/s²


# ============================================================================
# Problem Setup
# ============================================================================

problem = mtor.Problem("Acrobot Swing-Up")
phase = problem.set_phase(1)


# ============================================================================
# Variables
# ============================================================================

# Time variable
t = phase.time(initial=0.0)

# State variables
theta1 = phase.state("theta1", initial=np.pi / 6, final=np.pi)  # Shoulder: 30° from hanging to inverted
theta2 = phase.state("theta2", initial=0.0, final=0.0)  # Elbow: straight to straight
theta1_dot = phase.state("theta1_dot", initial=0.0, final=0.0)  # Start and end at rest
theta2_dot = phase.state("theta2_dot", initial=0.0, final=0.0)  # Start and end at rest

# Control variable (only elbow joint is actuated)
tau = phase.control("tau", boundary=(-20.0, 20.0))  # Elbow torque (N⋅m)


# ============================================================================
# Dynamics (Generated from acrobot_dynamics.py)
# ============================================================================

phase.dynamics(
    {
        theta1: theta1_dot,
        theta2: theta2_dot,
        theta1_dot: (
            -I2
            * (
                g * l1 * m2 * ca.sin(theta1)
                + g * lc1 * m1 * ca.sin(theta1)
                + g * lc2 * m2 * ca.sin(theta1 + theta2)
                - l1 * lc2 * m2 * (2 * theta1_dot + theta2_dot) * ca.sin(theta2) * theta2_dot
            )
            + (I2 + l1 * lc2 * m2 * ca.cos(theta2))
            * (
                g * lc2 * m2 * ca.sin(theta1 + theta2)
                + l1 * lc2 * m2 * ca.sin(theta2) * theta1_dot**2
                - tau
            )
        )
        / (I1 * I2 + I2 * l1**2 * m2 - l1**2 * lc2**2 * m2**2 * ca.cos(theta2) ** 2),
        theta2_dot: (
            (I2 + l1 * lc2 * m2 * ca.cos(theta2))
            * (
                g * l1 * m2 * ca.sin(theta1)
                + g * lc1 * m1 * ca.sin(theta1)
                + g * lc2 * m2 * ca.sin(theta1 + theta2)
                - l1 * lc2 * m2 * (2 * theta1_dot + theta2_dot) * ca.sin(theta2) * theta2_dot
            )
            - (
                g * lc2 * m2 * ca.sin(theta1 + theta2)
                + l1 * lc2 * m2 * ca.sin(theta2) * theta1_dot**2
                - tau
            )
            * (I1 + I2 + l1**2 * m2 + 2 * l1 * lc2 * m2 * ca.cos(theta2))
        )
        / (I1 * I2 + I2 * l1**2 * m2 - l1**2 * lc2**2 * m2**2 * ca.cos(theta2) ** 2),
    }
)


# ============================================================================
# Objective
# ============================================================================

control_effort = phase.add_integral(tau**2)
problem.minimize(t.final + 0.01 * control_effort)


# ============================================================================
# Mesh Configuration and Initial Guess
# ============================================================================

num_interval = 16
phase.mesh([3] * num_interval, np.linspace(-1.0, 1.0, num_interval + 1))

phase.guess(
    terminal_time=10.0,
)


# ============================================================================
# Solve
# ============================================================================

solution = mtor.solve_adaptive(
    problem,
    error_tolerance=1e-3,
    max_iterations=20,
    min_polynomial_degree=3,
    max_polynomial_degree=8,
    nlp_options={
        "ipopt.max_iter": 1000,
        "ipopt.mumps_pivtol": 5e-7,
        "ipopt.linear_solver": "mumps",
        "ipopt.constr_viol_tol": 1e-7,
        "ipopt.print_level": 0,
        "ipopt.nlp_scaling_method": "gradient-based",
        "ipopt.mu_strategy": "adaptive",
        "ipopt.check_derivatives_for_naninf": "yes",
        "ipopt.hessian_approximation": "exact",
        "ipopt.tol": 1e-8,
    },
)


# ============================================================================
# Results
# ============================================================================

if solution.status["success"]:
    print(f"Objective: {solution.status['objective']:.6f}")
    print(f"Mission time: {solution.status['total_mission_time']:.3f} seconds")

    # Final joint angles
    theta1_final = solution["theta1"][-1]
    theta2_final = solution["theta2"][-1]
    print(f"Final shoulder angle: {theta1_final:.6f} rad ({np.degrees(theta1_final):.2f}°)")
    print(f"Final elbow angle: {theta2_final:.6f} rad ({np.degrees(theta2_final):.2f}°)")

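    # End-effector position analysis
    # Initial position (read from the solution so it matches the boundary conditions)
    theta1_initial = solution["theta1"][0]
    theta2_initial = solution["theta2"][0]
    x_ee_initial = l1 * np.sin(theta1_initial) + l2 * np.sin(theta1_initial + theta2_initial)
    y_ee_initial = -l1 * np.cos(theta1_initial) - l2 * np.cos(theta1_initial + theta2_initial)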

    # Final position (both links pointing up)
    x_ee_final = l1 * np.sin(theta1_final) + l2 * np.sin(theta1_final + theta2_final)
    y_ee_final = -l1 * np.cos(theta1_final) - l2 * np.cos(theta1_final + theta2_final)

    print(
        f"End-effector moved from ({x_ee_initial:.3f}, {y_ee_initial:.3f}) to ({x_ee_final:.3f}, {y_ee_final:.3f})"
    )

    # Control statistics
    tau_max = np.max(np.abs(solution["tau"]))
    print(f"Maximum elbow torque: {tau_max:.3f} N⋅m")

    solution.plot()

else:
    print(f"Failed: {solution.status['message']}")
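The end-effector report in the results block relies on the forward kinematics implied by the angle conventions: \(\theta_1\) is measured from the downward vertical and \(\theta_2\) relative to the upper arm. A standalone sketch of that mapping, evaluated at the boundary conditions of this example, is shown below. The function name `end_effector` is illustrative and does not appear in the example script.

```python
import math

l1, l2 = 1.0, 1.0  # link lengths (m)


def end_effector(theta1, theta2):
    """Tip position with the shoulder at the origin, x to the right, y up."""
    x = l1 * math.sin(theta1) + l2 * math.sin(theta1 + theta2)
    y = -l1 * math.cos(theta1) - l2 * math.cos(theta1 + theta2)
    return x, y


start = end_effector(math.pi / 6, 0.0)  # initial boundary condition
goal = end_effector(math.pi, 0.0)       # inverted target

print(f"start: ({start[0]:.3f}, {start[1]:.3f})")
print(f"goal:  ({goal[0]:.3f}, {goal[1]:.3f})")
```

The tip starts about 1.0 m to the right of and 1.73 m below the shoulder, and the target places it 2.0 m directly above the shoulder.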