Examples

This page provides examples of using TwisteRL for reinforcement learning tasks.

Training Examples

8-Puzzle Example

The 8-puzzle is a classic sliding puzzle that consists of a 3x3 grid with numbered tiles and one empty space.
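To make the environment concrete, here is a plain-Python sketch of the 8-puzzle's state and move mechanics. This is illustrative only and does not use the TwisteRL API; the state encoding and move indexing are assumptions for the sketch.

```python
# Illustrative 8-puzzle model: state is a tuple of 9 tiles, 0 = empty space.
GOAL = (1, 2, 3, 4, 5, 6, 7, 8, 0)

# For each empty-space index, the indices a tile can slide from on a 3x3 grid.
NEIGHBORS = {
    0: (1, 3), 1: (0, 2, 4), 2: (1, 5),
    3: (0, 4, 6), 4: (1, 3, 5, 7), 5: (2, 4, 8),
    6: (3, 7), 7: (4, 6, 8), 8: (5, 7),
}

def legal_moves(state):
    """Indices of tiles that can slide into the empty space."""
    return NEIGHBORS[state.index(0)]

def apply_move(state, tile_idx):
    """Swap the tile at tile_idx with the empty space."""
    empty = state.index(0)
    assert tile_idx in NEIGHBORS[empty], "illegal move"
    s = list(state)
    s[empty], s[tile_idx] = s[tile_idx], s[empty]
    return tuple(s)

state = apply_move(GOAL, 5)  # slide tile 6 down into the empty corner
```

The small, fully discrete state and action space is what makes this environment train so quickly.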

Training:

python -m twisterl.train --config examples/ppo_puzzle8_v1.json

This example demonstrates:

  • Fast training (under 1 minute on CPU)

  • Discrete observation and action spaces

  • PPO algorithm implementation

15-Puzzle Example

A more challenging 4x4 version of the sliding puzzle:

python -m twisterl.train --config examples/ppo_puzzle15_v1.json

Inference Examples

Jupyter Notebooks

The examples/ directory contains Jupyter notebooks for interactive exploration:

  • puzzle.ipynb: Interactive example showing inference with trained puzzle models

  • hub_puzzle_model.ipynb: Loading and using models from HuggingFace Hub

These notebooks demonstrate:

  • Loading trained models

  • Running inference

  • Visualizing agent behavior

Creating Custom Environments

TwisteRL supports custom environments implemented in Rust. The examples/grid_world directory provides a complete working example.

Steps to create a custom environment:

  1. Create a new Rust crate:

cargo new --lib examples/my_env

  2. Add dependencies in Cargo.toml:

[package]
name = "my_env"
version = "0.1.0"
edition = "2021"

[lib]
name = "my_env"
crate-type = ["cdylib"]

[dependencies]
pyo3 = { version = "0.24", features = ["extension-module"] }
twisterl = { path = "../../rust", features = ["python_bindings"] }

  3. Implement the twisterl::rl::env::Env trait for your environment type.

  4. Expose it to Python using PyBaseEnv:

use pyo3::prelude::*;
use twisterl::rl::env::Env;
use twisterl::python_interface::env::PyBaseEnv;

#[pyclass(name="MyEnv", extends=PyBaseEnv)]
struct PyMyEnv;

#[pymethods]
impl PyMyEnv {
    #[new]
    fn new(/* your params */) -> (Self, PyBaseEnv) {
        let env = MyEnv::new(/* ... */);
        (PyMyEnv, PyBaseEnv { env: Box::new(env) })
    }
}

#[pymodule]
fn my_env(_py: Python<'_>, m: &Bound<'_, PyModule>) -> PyResult<()> {
    m.add_class::<PyMyEnv>()?;
    Ok(())
}

  5. Build and install the module:

pip install .

  6. Use it from Python in a config file or directly.

See the grid_world example for a complete implementation.

Python Environments

TwisteRL also supports Python environments through the PyEnv wrapper:

{
    "env_cls": "twisterl.envs.PyEnv",
    "env": {
        "pyenv_cls": "mymodule.MyPythonEnv"
    }
}

Your Python environment class must implement the required interface (reset, next, observe, etc.). See Environments for the complete list of required methods.
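As a rough sketch of such a class, the toy environment below implements the reset, next, and observe methods named above. The method signatures, the discrete action convention, and the observation encoding are assumptions for illustration; check the Environments page for the authoritative interface.

```python
# Hypothetical minimal Python environment; signatures are assumptions,
# not the authoritative TwisteRL interface.
class MyPythonEnv:
    """Toy environment: reach a target value by incrementing or decrementing."""

    def __init__(self, target=3):
        self.target = target
        self.value = 0

    def reset(self):
        # Return to the initial state.
        self.value = 0

    def next(self, action):
        # Assumed discrete actions: 0 decrements, 1 increments.
        self.value += 1 if action == 1 else -1

    def observe(self):
        # Observation encoding is an assumption; here a simple list.
        return [self.value, self.target]

env = MyPythonEnv()
env.reset()
env.next(1)
env.next(1)
obs = env.observe()  # [2, 3]
```

A class like this would then be referenced from a config via its import path, as in the "pyenv_cls" entry shown above.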

Note that Python environments may be slower than native Rust environments due to the Python-Rust interop overhead.

Use Cases

TwisteRL is particularly well-suited for:

Quantum Circuit Transpilation

Currently used in Qiskit/qiskit-ibm-transpiler for AI-based circuit optimization including Clifford synthesis and routing.

Puzzle-like Optimization Problems

Problems with discrete state and action spaces where fast inference is important.

Production-ready RL Inference

Scenarios requiring high-performance inference with portable Rust-based models.