worldflux is a Typer application. Run any command with --help to see the same flag list inline.
Global flags
Global flags appear before the subcommand:worldflux --json runs list.
Emit machine-readable JSON output where supported.
Increase diagnostic output. Repeat for more detail (
-vv).Suppress non-essential output.
Disable colored terminal output.
Print the WorldFlux version and exit.
Top-level commands
| Command | Purpose |
|---|---|
worldflux init <project> | Create a new project. |
worldflux run | Run the project’s configured recipe/runtime and write artifacts. |
worldflux verify | Verify manifest invariants. |
worldflux report | Render report.md at the project root. |
worldflux visualize | Render visual artifacts the recipe produced. |
worldflux compare | Diff two project directories. Writes comparison.md. |
worldflux compare-runs | Diff two run ids inside the same project. |
worldflux action-server | Record local action traces. |
worldflux import | Import an external model or dataset. |
worldflux export | Export an adapter for an external eval harness. |
worldflux add | Add an external recipe to the local catalog. |
worldflux doctor | Run the local environment health check. |
worldflux identify | Identify the world model attached to a project. |
worldflux ppi-augment | Compute a PPI estimate from sim and real audit inputs. |
worldflux login / logout / status / sync | Cloud session and upload. |
worldflux use <adapter> | Install a curated adapter into ~/.worldflux/curated/<id>/. |
worldflux uninstall <adapter> | Remove an installed curated adapter. |
worldflux claim | Create, inspect, draft, and falsify claim packages. |
worldflux evidence | Inspect, diff, package, and cost-annotate evidence. |
worldflux audit | Import, run, sign, verify, and publish audit packages. |
worldflux share | Generate public-share approval templates for hosted evidence links. |
worldflux eval-profile | Create and inspect local-private robotics/VLA eval profiles. |
worldflux eval-portfolio | Plan and render local-private protocol plans, run sheets, eval packets, and reviewer briefs. |
worldflux vla-matrix | Plan apple-to-apple VLA matrices and bind imported evidence. |
worldflux world-model-benchmark | Inspect and score world-model benchmark protocols without launching benchmarks. |
worldflux lab | Inspect lab prerequisites, EC2 orphans, and GPU sanity plans. |
worldflux schema / paper / museum / dd / positioning / billing | Generate schema, paper, counterexample, diligence, standards, and billing artifacts. |
worldflux init
Project directory to create.
Built-in recipe id. List options with
worldflux recipes list.Dataset id for recipes that support dataset switching.
Overwrite generated WorldFlux files.
worldflux run
Project directory.
Override the runtime declared in
worldflux.yaml. Examples: local, modal, aws-ec2, replicate.Plan remote execution without starting it. Prints the resolved instance type, region, and cost guard.
Local MP4 to stage under
inputs/ and send to remote video adapters.Local JSONL trace recorded by
worldflux action-server. Used by action-conditioned adapters.--instance-type, --max-cost, --timeout, --region, --key-name, --key-path, --security-group-id, --subnet-id, --spot/--on-demand, --max-artifact-mb, --collect-large-artifacts/--skip-large-artifacts.worldflux import
<scheme>:<id> prefix. Schemes today: lerobot:, lerobot-dataset:.
worldflux export
--target vla-eval is wired.
worldflux identify
worldflux ppi-augment
ppi_estimate.json from paired simulation and real-world audit inputs.
worldflux doctor
Health check for Python, uv, runtime plugins, and (when logged in) the cloud session. The command also runs worldflux runtime doctor under the hood.
Cloud session
worldflux login
Inline API key. Stored in shell history; prefer
--token-env for CI.Read the API key from this env var, then drop it from memory.
Cloud control plane base URL.
Default workspace UUID to remember for
sync.Default project UUID to remember for
sync.API key expiration time. Used for the warning prompt as the key approaches expiry.
Fall back to
~/.worldflux/credentials.toml when the OS keyring is unavailable.worldflux logout
Deletes the API key from the OS keyring (or credentials.toml) and clears the cached cloud config.
worldflux status
Prints workspace, project, plan, plan_status, run quota, storage quota.
worldflux sync
POSTs the current project’s manifest, metrics, logs, and committed artifact metadata. Idempotent on client_run_id. Artifact bytes go through signed URLs the cloud returns; the manifest itself is small.
Subcommand trees
worldflux recipes
inspect prints the full recipe record, including known issues and the dataset id (if any).
worldflux runs
worldflux datasets
worldflux curated
The curated tree wraps the catalog adapters that ship with WorldFlux.
worldflux installed
Local installs of curated adapters.
worldflux env
Build and manage the per-adapter Python environments.
worldflux runtime
Inspect runtime plugins.
worldflux cloud
Use the dashboard to list/create workspaces and projects. The CLI stores the
UUID selections used by sync and publish commands.
worldflux claim
claim create writes both claim.json and protocol.json; from-paper-url creates a local draft without calling an external LLM.
worldflux audit
lerobot, openpi, lbm_eval, vla-eval, gr00t-n1.7, pi-0.7, embodied-gov-bench, and cosmos-predict. cosmos-predict import expects a bridge-generated audit_input_cosmos_predict.json file or a directory containing that file. audit run can emit compliance mappings, ML-BOM sidecars, SAVI/PPI stopping metadata, and completeness scores for supported sources.
run-folder is a read-only triage importer for already-extracted Physical AI
run folders. It writes a private operator report and can also write
--customer-report import_report_public.md, a public-safe Markdown summary with
tier counts, claim-safe candidates, report-only signals, inventory-only artifact
counts, rejection reasons, missing evidence labels, and next action. It only
emits audit_input.json when a single selected candidate is claim-safe;
audit run run-folder is intentionally unsupported in the MVP.
audit publish --share requires Cloud login, a signed and verified evidence
package, and either --cloud-run-id <cloud-run-uuid> so the package can be
uploaded to that Cloud run with --confirm-public-share-upload, or
--evidence-package-artifact-id <artifact-uuid> to reuse an already uploaded
evidence package artifact. Hosted public shares also require --approval-file
and a reviewer access code. Prefer --password-env WORLDFLUX_SHARE_ACCESS_CODE;
--password is kept for backward compatibility but can expose values in process
listings.
worldflux share
share approval-template writes the customer approval file required by hosted
public evidence shares. It does not upload artifacts, publish a share, or read
the reviewer access code. The operator must review and sign the template before
passing it to the hosted share publish command.
worldflux evidence
worldflux eval-profile and worldflux eval-portfolio
eval-profile and eval-portfolio create local-private protocol plans and measured eval packets for robotics/VLA evidence review. The commands do not run benchmarks, upload artifacts, or decide deployment acceptability.
LOCAL_PRIVATE and not public-share-ready. Reviewer briefs keep redaction and customer-consent limitations visible. WorldFlux records declared consent markers but does not verify legal consent validity. eval-profile create requires at least one training-exposure disclosure plus --training-exposure-caveat. eval-portfolio packet accepts repeatable --audit-input <probe_id>=<path> values and checks each episode against the frozen protocol plan before rendering a packet. Use --strict-missing-evidence when missing or underpowered evidence should fail packet generation instead of rendering as warnings.
worldflux vla-matrix
worldflux world-model-benchmark
worldflux lab
cleanup-orphans --apply is used.
Publishing and business artifacts
Error envelope
When the CLI fails, it writes a JSON envelope to stdout and exits non-zero:code is stable (e.g. init_failed, login_token_env_missing, credentials_unavailable). The message is human-facing and may change between versions.