A repeatable delivery process for enterprise AI agents and AI applications.
Run a four-stage cycle that keeps agent design, supervision, evaluation evidence, and release readiness aligned. Delivery breaks down when planning, execution, evidence, and release readiness are split across separate products and teams.
Four stages, each with clear activities and evidence.
Define activities, outcomes, and delivery evidence that travel with the work.
Key activities
- Define Business Vision and Success Metrics
- Stakeholder Onboarding and Kickoff
- Identify User Personas and Scenarios
- Define Solution Requirements and Constraints
Artifacts & evidence
17 artifact templates
Key activities
- Develop User Stories and Draft Flows
- Define Data Requirements and Sources
- Evaluate and Select Tools/Integrations
Artifacts & evidence
11 artifact templates
User guide
Key activities
- Define Agent Persona, Purpose, and Scope
- Architect Agent Components and Integrations
- Develop and Refine Prompts and Instructions
- Test and Refine Agent Behavior
- Design Conversation UI/UX
- Test End-to-End Conversation Experience
Artifacts & evidence
23 artifact templates
User guide
Key activities
- Define Evaluation Criteria and Test Plan
- Run Functional and Scenario-Based Evaluations
- Analyze System and Model Metrics
- Conduct Governance and Safety Review
- Approve for Production and Monitor Rollout
Artifacts & evidence
16 artifact templates
User guide
The delivery playbook, operationalized inside Gaia.
The Delivery Management module turns the framework into cycles, stage tracking, and evidence capture so teams stay aligned from kickoff through evaluation and release.
- Create cycles in minutes
- Track active stage & status
- Archive cycles for history
User guide
- Playbook-driven activities
- Stage status and reopen flow
- Role assignments at a glance
User guide
- Template-driven evidence
- Markdown notes & attachments
- Export-ready documentation
User guide
- Assign tasks to owners
- Track delivery status and risk actions
- Keep delivery work accountable
User guide
Bring the delivery playbook into every project.
Use the process module to keep roles, activities, and evidence aligned from kickoff through ModelOps-style evaluation and release decisions.