tshiamor/aic_gt_sfp_all_trimmed_v2
ForceVLA (Pi0-Guidance) model fine-tuned for SFP cable insertion on the AIC challenge.
v1 was trained on data with zero rotation actions, so the model could not learn the 21-degree pitch alignment needed for insertion, which caused 100% failure at inference.
v2 is trained on aic_gt_sfp_all_trimmed_v2, which includes axis-angle orientation deltas computed from consecutive end-effector quaternions. The model now outputs both translation and rotation commands.
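
The orientation deltas described above can be sketched as follows. This is a minimal illustration, not the dataset's actual preprocessing code: it assumes quaternions are stored in `(x, y, z, w)` order and that the delta is expressed in the fixed (base) frame as `q_next * q_prev^-1`. The dataset may use a different ordering or a body-frame convention, and the function names here are hypothetical. The sign flip keeps the result on the shortest rotation path, which matters because `q` and `-q` encode the same orientation.

```python
import numpy as np


def quat_conjugate(q):
    """Conjugate of a quaternion in (x, y, z, w) order (assumed ordering)."""
    x, y, z, w = q
    return np.array([-x, -y, -z, w])


def quat_multiply(a, b):
    """Hamilton product a * b for quaternions in (x, y, z, w) order."""
    ax, ay, az, aw = a
    bx, by, bz, bw = b
    return np.array([
        aw * bx + ax * bw + ay * bz - az * by,
        aw * by - ax * bz + ay * bw + az * bx,
        aw * bz + ax * by - ay * bx + az * bw,
        aw * bw - ax * bx - ay * by - az * bz,
    ])


def axis_angle_delta(q_prev, q_next):
    """Axis-angle vector (rad) rotating q_prev into q_next.

    Assumption: the delta is taken in the fixed frame, dq = q_next * q_prev^-1.
    """
    q_prev = np.asarray(q_prev, dtype=float)
    q_next = np.asarray(q_next, dtype=float)
    q_prev = q_prev / np.linalg.norm(q_prev)
    q_next = q_next / np.linalg.norm(q_next)

    dq = quat_multiply(q_next, quat_conjugate(q_prev))
    if dq[3] < 0.0:
        # q and -q are the same rotation; pick the shortest path.
        dq = -dq

    vec_norm = np.linalg.norm(dq[:3])
    if vec_norm < 1e-12:
        # No measurable rotation between the two frames.
        return np.zeros(3)

    angle = 2.0 * np.arctan2(vec_norm, dq[3])
    return dq[:3] / vec_norm * angle


# Example: a 21-degree pitch (rotation about y) from the identity orientation.
theta = np.radians(21.0)
q_start = np.array([0.0, 0.0, 0.0, 1.0])
q_end = np.array([0.0, np.sin(theta / 2), 0.0, np.cos(theta / 2)])
delta = axis_angle_delta(q_start, q_end)
print(np.degrees(delta))  # approximately [0, 21, 0]
```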
| Field | Value |
|---|---|
| Base model | Pi0-Guidance (PaliGemma 2B + Gemma 300M action expert) |
| Fine-tuning | LoRA on both VLM and action expert |
| Training steps | 50,000 |
| Batch size | 4 |
| Dataset | aic_gt_sfp_all_trimmed_v2 (475 episodes, 10 SFP tasks) |
| GPU | NVIDIA RTX 5090 32GB |
| Framework | JAX/Flax (OpenPI) |
Inputs:
Outputs:
cd ~/ForceVLA && conda activate forcevla_eval
python scripts/serve_policy.py policy:checkpoint \
  --policy.config forcevla_sfp_all_trimmed_v2 \
  --policy.dir checkpoints/forcevla_sfp_all_trimmed_v2/sfp_all_trimmed_v2/49999

pixi run ros2 run aic_model aic_model --ros-args \
  -p use_sim_time:=true \
  -p policy:=aic_example_policies.ros.RunForceVLA
| Step | Path |
|---|---|
| 10000 | sfp_all_trimmed_v2/10000/ |
| 20000 | sfp_all_trimmed_v2/20000/ |
| 30000 | sfp_all_trimmed_v2/30000/ |
| 40000 | sfp_all_trimmed_v2/40000/ |
| 49999 | sfp_all_trimmed_v2/49999/ (final) |
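
Since checkpoint directories are named by step number, the most recent one can be selected programmatically rather than hard-coded. The helper below is a hypothetical sketch, not part of the ForceVLA or OpenPI codebase; it assumes a local run directory laid out as in the table above and compares step names numerically, so that `9000` would sort before `10000`.

```python
from pathlib import Path


def latest_checkpoint(run_dir):
    """Return the step subdirectory with the highest numeric name.

    Hypothetical helper: assumes checkpoints live in directories whose
    names are plain integers (for example 10000, 20000, 49999).
    """
    step_dirs = [
        p for p in Path(run_dir).iterdir()
        if p.is_dir() and p.name.isdigit()
    ]
    if not step_dirs:
        raise FileNotFoundError(f"no step directories found in {run_dir}")
    # Compare as integers, not strings, so 9000 < 10000.
    return max(step_dirs, key=lambda p: int(p.name))
```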
| Index | Field | Scale (std) |
|---|---|---|
| 0-2 | position delta (m) | ~0.0003-0.0008 |
| 3-5 | rotation delta (rad) | ~0.001-0.004 |
| 6 | gripper | 0 (constant) |
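
The layout above can be made concrete with a short sketch that splits a 7-dimensional action into its parts and applies z-score normalization. Everything here is illustrative: the statistics are placeholder values picked from within the ranges in the table, the zero mean is an assumption, and whether the training pipeline uses exactly this normalization scheme is not stated in this card. The epsilon guard is included because the gripper channel is constant, so its standard deviation is zero.

```python
import numpy as np

# Placeholder statistics (assumptions, not the dataset's real values):
# position std ~5e-4 m, rotation std ~2e-3 rad, gripper std 0 (constant).
ACTION_MEAN = np.zeros(7)
ACTION_STD = np.array([5e-4, 5e-4, 5e-4, 2e-3, 2e-3, 2e-3, 0.0])
EPS = 1e-6  # avoids division by zero for the constant gripper channel


def normalize(action):
    """Z-score normalize a raw 7-D action."""
    return (np.asarray(action, dtype=float) - ACTION_MEAN) / (ACTION_STD + EPS)


def unnormalize(z):
    """Invert normalize(), mapping a normalized action back to raw units."""
    return np.asarray(z, dtype=float) * (ACTION_STD + EPS) + ACTION_MEAN


def split_action(action):
    """Split a 7-D action into (position delta, rotation delta, gripper)."""
    action = np.asarray(action, dtype=float)
    if action.shape[-1] != 7:
        raise ValueError(f"expected 7 action dimensions, got {action.shape[-1]}")
    return action[..., 0:3], action[..., 3:6], action[..., 6]


# Example raw action: small translation, small pitch delta, gripper fixed at 0.
raw = np.array([3e-4, -2e-4, 5e-4, 0.0, 2e-3, 0.0, 0.0])
position_delta, rotation_delta, gripper = split_action(unnormalize(normalize(raw)))
```

Because the per-step deltas are on the order of 1e-4 to 1e-3, unnormalized targets would be tiny compared with typical network output scales, which is one reason a normalization step of this kind is commonly applied.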