PushT: Diffusion-Policy-based Implementation

Installation

  1. Create a Conda environment with conda_environment.yaml (we also recommend Mambaforge)

    mamba env create -f conda_environment.yaml
    # Or you can use conda
    # conda env create -f conda_environment.yaml

    Remark: the dgl package installed via the YAML file is built for CUDA 11.8 only. If you are using a different CUDA version, try:

    # Try the following if you are using CUDA other than 11.8
    ## Uninstall the cu118 ver. DGL
    mamba activate ltldogdp
    pip uninstall dgl
    ## Install the right one
    ### replace "cu1xx" with the CUDA version you use, e.g., cu116, cu117, cu121, etc.
    pip install dgl==1.1.1 -f https://data.dgl.ai/wheels/cu1xx/repo.html

    If that does not work, please refer to the official DGL site for further support.
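
    If you are unsure which CUDA build your current environment uses, one way to check (assuming PyTorch is installed in the environment) is:

    python -c "import torch; print(torch.version.cuda)"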

  2. Install diffusion_policy as a package in your newly created environment

    mamba activate ltldogdp 
    pip install -e . 

Play with the Env (Optional)

PushT Task Demo

The demo script is inherited from Diffusion Policy. Familiarize yourself with the environment by running:

python demo_pusht.py --help

Training

Dataset

  • Default data are stored under data/ (you can use symbolic links if needed; see the example after this list):

    mkdir -p ./data/pusht/LTLs  ./data/pretrained/diffusion ./data/pretrained/value 
    cp ltl_txt/*.txt data/pusht/LTLs/

    This copies the LTL formulas used in the paper.

  • Training logs and saved checkpoints are at data/outputs/.
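
If your data lives elsewhere, a symbolic link keeps the default paths working, for example (the source path here is illustrative):

    ln -s /path/to/your/storage ./data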

PushT Trajectory Dataset

  • Trajectory datasets should be put under data/pusht for training.
  • The original PushT dataset used in Diffusion Policy should be available here.
  • For the augmented trajectory dataset used in our paper, download from Google Drive.

LTL formulas

See the .txt files under data/pusht/LTLs/, and the LTL Atomic Propositions section below.

Value Dataset

For the LTLDoG-R variant, we need to train a regressor model on LTL satisfaction values. These values are computed from the trajectory dataset and the LTLf formulas.

  • Values can be calculated with our script. Remember to check the source code before executing.

    python scripts/generate_pusht_value_dataset.py
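
For intuition, here is a minimal, illustrative sketch of how per-trajectory satisfaction labels could be computed. It assumes the third-party flloat package for LTLf evaluation and made-up circular regions, and it is not the repo's actual pipeline (which may use quantitative satisfaction values rather than booleans):

    import numpy as np
    from flloat.parser.ltlf import LTLfParser  # third-party LTLf checker (assumption)

    # Hypothetical circular regions backing the APs p0 and p1.
    regions = {"p0": ((256.0, 256.0), 40.0), "p1": ((100.0, 400.0), 30.0)}

    def ap_truth(xy, center, radius):
        # An AP is True when the state lies inside its region, i.e.,
        # when the signed value (radius - distance) is positive.
        return float(radius - np.linalg.norm(xy - np.asarray(center))) > 0.0

    formula = LTLfParser()("G(!p0)")  # e.g., "always avoid region p0"

    def satisfies(trajectory):
        # trajectory: (T, 2) array of agent positions; build a propositional trace.
        trace = [{p: ap_truth(x, c, r) for p, (c, r) in regions.items()}
                 for x in trajectory]
        return formula.truth(trace, 0)  # boolean LTLf satisfaction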

Configs

The configuration pipeline is inherited from Diffusion Policy.

Training Configs

Training configs are located at diffusion_policy/config/. Adjust the dataset paths in the corresponding subconfigs under diffusion_policy/config/tasks/.

LTL Atomic Propositions

In our experiments, each atomic proposition (AP, denoted as "pX" in LTL formulas) represents a region in the ambient state space. These regions are defined in diffusion_policy/constraints/pusht_constraints.py.

Users may devise and configure customized regions, not limited to circles, by defining a proper parameterization of the regions and implementing differentiable value functions that determine the truth of an AP (positive values for True and negative for False, by default). Check the script for intuition.
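
As a minimal sketch (assuming PyTorch; the class and method names are illustrative, not the repo's actual API), a circular region following the sign convention above could look like:

    import torch

    class CircleRegion:
        """Illustrative AP region: positive value inside, negative outside."""

        def __init__(self, center, radius):
            self.center = torch.as_tensor(center, dtype=torch.float32)
            self.radius = float(radius)

        def value(self, xy: torch.Tensor) -> torch.Tensor:
            # xy: (..., 2) positions in the ambient state space. The sign of
            # the returned value encodes the AP's truth (True iff inside the
            # circle), and the expression stays differentiable w.r.t. xy.
            return self.radius - torch.linalg.norm(xy - self.center, dim=-1)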

Start Training

Simply call the training script with a desired config file. E.g.:

  • For training the vanilla Diffusion Policy (as the diffusion backbone of LTLDoG):

    python scripts/train.py --config-name=H16O2A8D100_train_diffusion_unet_lowdim_workspace
  • For training a value regressor:

    python scripts/train.py --config-name=H16O2A8_train_pusht_ef_no_value

Reminder:

  • Using the pretrained model released by Diffusion Policy is not enough; the model should be trained on an augmented dataset with more abundant behaviors. See the appendix of our paper for an explanation.
  • Move the trained checkpoints from data/outputs to data/pretrained/diffusion or data/pretrained/value for inference later.
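
    For example (the run directory and checkpoint file names below are placeholders; yours will differ):

    mv data/outputs/<run_dir>/checkpoints/<ckpt>.ckpt data/pretrained/diffusion/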

Inference

Evaluation scripts are under scripts/:

  • Run eval_H\d_pusht_*.py with a proper config file for inference.

  • Calling examples:

    # LTLDoG-S
    python scripts/eval_H16_pusht_guided_parallel.py  --config-name eval_H16_pusht_ps_guide 
    # LTLDoG-R
    ## with a trained regressor model configured in the config file
    python scripts/eval_H16_pusht_guided_parallel.py  --config-name eval_H16_pusht_guided 
  • There are two versions of models, H16 and H192 (for obstacle avoidance and temporal tasks, respectively). Check the scripts' source code for details.

There are also sequential execution scripts that can run multiple evaluations (with different parameters); see eval_H\d_seq.py for details.

  • Calling example:

    python scripts/eval_H16_seq.py  --gpu-id=0 --guider=rg

    The --guider argument should be one of {rg, ps, baseline}. Explore more settings in the script.

Results are recorded by default under logs/tests/.

Inference Configs

  • Configuration files are located at diffusion_policy/config/eval_*.yaml.
  • Edit the configs to adjust parameters, and make sure to pass the right config file when launching the eval scripts.
  • Parameters in the config file are the ones actually used at run time, while parameters set in the eval scripts are only used for naming. Remember to adjust both to avoid unnecessary confusion.