FlagOS-Robo User Guide

FlagOS-Robo User Guide#

Overview#

FlagOS-Robo is an integrated training and inference framework for AI models used in robots (Embodied Intelligence), built upon FlagOS, the unified and open-source AI system software stack for various AI chips.

FlagOS-Robo can be deployed across diverse scenarios, ranging from edge to cloud. Being portable across various chip models, it enables efficient training, inference, and deployment for both Vision Language Models (VLMs) and Vision Language Action (VLA) models. VLMs usually act as the brain for task planning, while VLA models act as the cerebellum to output actions for robot control.

Key Features#

FlagScale as the user entrypoint, supporting robot-related AI model training and inference
Full Model Lifecycle: Data loading, supervised fine-tuning (SFT), inference deployment, and evaluation
Multi-Format Data Loading: Supports webdataset, Megatron-Energon, and lerobot dataset formats
RoboOS Integration: Cross-embodiment collaboration with different data formats and edge-cloud coordination
RoboXStudio Integration: One-stop services including data collection, annotation, model training, simulation evaluation, and deployment

Getting Started#

Supported Models#

Model	Type	Checkpoint	Train	Inference	Serve	Evaluate
PI0	VLA	HuggingFace	Yes	Yes	Yes	-
PI0.5	VLA	HuggingFace	Yes	Yes	Yes	-
RoboBrain-2.0	VLM	HuggingFace	Yes	Yes	Yes	Yes
RoboBrain-2.5	VLM	HuggingFace	Yes	Yes	Yes	Yes
RoboBrain-X0	VLA	HuggingFace	Yes	-	Yes	-
Qwen-GR00T	VLA	HuggingFace	Yes	Yes	Yes	Yes
GR00T-N1.5	VLA	HuggingFace	Yes	-	Yes	-

For detailed guides on each model, refer to the FlagScale examples.

Evaluation#

FlagEval-Robo Platform#

FlagOS-Robo integrates with the FlagEval-Robo platform for testing and evaluation of embodied intelligence models.

Auto-Evaluation Tool#

Auto-Evaluation is a multi-chip adaptive model automatic evaluation tool built on the FlagEval platform. It supports Embodied VLM models and is bound with ten classic datasets. Currently, it supports online evaluation only.

Tool API Endpoint: 120.92.17.239:5050

Available APIs#

API	Method	Description
`/evaluation`	POST	Start evaluation
`/evaldiffs`	GET	Query evaluation results
`/stop_evaluation`	POST	Stop evaluation
`/resume_evaluation`	POST	Resume evaluation from breakpoint
`/evaluation_progress`	POST	View evaluation progress
`/evaluation_diffs`	POST	Compare results between multiple models

Start an Evaluation#

curl -X POST http://120.92.17.239:5050/evaluation \
  -H "Content-Type: application/json" \
  -d '{
    "eval_infos": [{
      "eval_model": "my-model-eval",
      "model": "Qwen/Qwen3-8B",
      "eval_url": "http://<your-host>:9010/v1/chat/completions",
      "tokenizer": "Qwen/Qwen3-8B"
    }],
    "domain": "MM",
    "mode": "EmbodiedVerse"
  }'

Key parameters:

eval_model: Unique name for the evaluation task
model: The deployed model name
eval_url: Your model’s evaluation endpoint
tokenizer: Vendor and model information (e.g., Qwen/Qwen3-8B)
chip: Chip used for evaluation (default: Nvidia-H100)

RoboXStudio Platform#

The RoboXStudio platform provides a SaaS environment for the full embodied intelligence pipeline:

Data Collection: Real-robot data acquisition across diverse robotic embodiments
Data Annotation: Labeling and data augmentation tools
Model Training: SFT built on FlagOS with multi-chip adaptation
Simulation Evaluation: Integrated testing and evaluation
Model Deployment: End-to-end deployment pipeline

RoboXStudio has completed the full training and inference workflow based on FlagOS + BAAI RoboBrain-X0 model, achieving an end-to-end pipeline from model development and real-robot validation to task execution.

Links:

Official Website: https://ei2data.baai.ac.cn
Product Manual: Feishu Wiki
Getting Started Guide: Feishu Wiki