Use this path when you already have a task, dataset, or existing agent code and want an AI coding agent to help turn it into a runnable Osmosis rollout. If you are new to Osmosis and want the shortest copy-paste path, start with Run the Multiply Example instead.Documentation Index
Fetch the complete documentation index at: https://docs.osmosis.ai/llms.txt
Use this file to discover all available pages before exploring further.
This guide assumes you have completed Onboarding: your workspace repository is cloned, the CLI is installed and authenticated, and your AI coding environment is open in the workspace directory.
What Workspace Skills Do
Platform-created workspace repositories include project-local Agent Skills under.agents/skills/. Agents that support the open Agent Skills format can use those skills to move through the same local loop an experienced Osmosis user would follow:
When to Use This Path
| Use this path when | Run the multiply example when |
|---|---|
| You already know the task you want to train on | You want proof that the platform works end to end |
| You have sample data or a platform dataset | You do not want to design a dataset yet |
| You want the agent to create or adapt rollout code | You want to copy commands without making product choices |
| You are comfortable inspecting generated code and eval output | You are still learning the Osmosis workflow |
Use the Skills in Your Workspace Repository
Open your platform-created workspace repository in your AI coding environment. The repository includes the workspace contract and Agent Skills alongside rollout code, configs, and data:AGENTS.md contains the always-loaded workspace contract. .agents/skills/ contains the canonical workflow skills, and .claude/skills/<skill-name> exposes the same skills to Claude Code through symlinks back to .agents/skills/.
Start in a Workspace Repository
The skills assume this repository layout for source files:Ask the Agent to Plan from the Dataset
Start by describing your task domain and asking the agent to design the first dataset and eval loop. A useful first prompt is:Plan training
Inspect
data/, existing rollouts, and workspace config. The agent should settle the dataset schema before writing rollout code.Create or adapt a rollout
Write the smallest
AgentWorkflow and Grader that can load, run, and score samples. Generated files should stay under rollouts/, configs/eval/, configs/training/, and data/.Debug until the eval is clean
Fix loading, dataset, grader, dependency, and reward issues before training. A passing local eval is the handoff point from creation to training readiness.
Workspace Skills
The workspace skills are organized around rollout creation stages:| Skill | Purpose |
|---|---|
plan-training | Turn a task idea or dataset into a concrete local experiment plan |
create-rollouts | Create or adapt rollout code, graders, entrypoints, and baseline eval configs |
evaluate-rollouts | Run local evals, compare baselines, and inspect failures |
debug-rollouts | Diagnose eval, config, dataset, dependency, or preflight failures |
submit-training | Prepare a training config, submit training, and check training status |
Next Steps
Rollout Overview
Understand the
AgentWorkflow and Grader contract behind generated rollout code.Local Evaluation
Validate rollouts before submitting training.
Git Sync
Push rollout changes and let the platform sync the code version used for training.
Training Runs
Submit and monitor training after local eval passes.