Curated catalog

The catalog is two artifacts that stay in sync:

src/worldflux/curated.py — the Python source the CLI reads at worldflux init --recipe <id> and worldflux curated * time.
web/lib/catalog-data.ts — the TypeScript copy the dashboard’s Catalog tab renders.

Both files describe the same adapter set. The TS file is regenerated from the Python source; do not hand-edit it.

Entry shape

type CatalogAdapter = {
  id: string;
  name: string;
  description: string;
  provider: string;
  providerSlug: string;
  runtime: string;                 // "local" | "replicate"
  tier: "active" | "historical";
  modelFamily: string;
  endpointTypes: string[];         // "rollout" | "train" | "eval" | "embed" | "predict" | "serve" | "generate"
  minimumVramGb: number | null;
  minimumCuda: string | null;
  upstreamLicense: string;
  version: string;
  smokePrompt: string;
  metricKeys: string[];
  outputArtifacts: string[];
  evalBenchId: string | null;
  knownIssues: string[];
};

Field notes:

Field	Purpose
`id`	What you pass to `--recipe`. Matches the directory under `src/worldflux/templates/` for adapters that ship a template.
`runtime`	The runtime plugin to use. Today the catalog only points at `local` or `replicate`; `aws-ec2` and `modal` come from the adapter’s recipe overrides.
`tier`	`active` if CI runs against it, `historical` if it loads but no longer gets fixes. The dashboard pill is green / red.
`endpointTypes`	What the adapter can do. Used by the dashboard’s filter + the recipe’s task picker.
`minimumVramGb`	Observed floor on a clean install. Lower works for tiny modes; do not assume.
`minimumCuda`	Required CUDA toolkit version, if any. `null` for CPU-friendly adapters.
`evalBenchId`	The eval bridge wired in for this adapter (`libero`, `robocasa`, `vjepa_embed`, `minecraft_offline`).
`metricKeys`	The metric names the adapter writes into `manifest.metrics`. Used by `worldflux compare` to know what to diff.
`outputArtifacts`	Filenames the adapter promises to commit. The manifest writer rejects runs that miss them.
`smokePrompt`	The prompt or seed used by the smoke test.
`knownIssues`	Short notes the dashboard surfaces in a yellow callout.

What ships today

`id`	Tier	Runtime	Bench
`cosmos_predict25`	active	`replicate`	`robocasa`
`dreamer4`	active	`local`	`minecraft_offline`
`internvla_m1`	active	`local`	`libero`
`openpi`	active	`local`	`libero`
`vjepa2`	active	`local`	`vjepa_embed`
`diamond`	historical	`local`	(not wired)
`dreamerv3`	historical	`local`	(not wired)
`storm`	historical	`local`	(not wired)
`tdmpc2`	historical	`local`	(not wired)
`vjepa2_vitl`	historical	`local`	`vjepa_embed`

Templates

Three adapters ship with a starter run.py under src/worldflux/templates/:

cosmos_predict25/
openpi/
vjepa2_vitl/

worldflux init <project> --recipe <id> copies the template into the project root and rewrites the manifest paths.

Browsing

worldflux curated list
worldflux curated inspect cosmos_predict25

inspect prints the full record, including known issues. The dashboard’s Catalog tab renders the same data with search and runtime/tier filters.

Adding an adapter

Add to curated.py

Append a CuratedAdapter(...) entry to src/worldflux/curated.py. Fill every field; the dashboard treats null as “unknown” and renders accordingly.

Regenerate the TS copy

cd web && npm run generate:api re-emits lib/catalog-data.ts from the Python source. Commit both files.

Run it once

The cutoff for tier: active is one passing CI run plus a documented benchmark id. Until then, set tier: "historical" and evalBenchId: null.

Drop a template (optional)

If the adapter has a non-trivial run.py, add it under src/worldflux/templates/<id>/. The init command picks it up automatically.

Get started

Concepts

Reference

Entry shape

What ships today

Templates

Browsing

Adding an adapter

Get started

Concepts

Reference

Documentation Index

​Entry shape

​What ships today

​Templates

​Browsing

​Adding an adapter

Entry shape

What ships today

Templates

Browsing

Adding an adapter