Specifications by Theme

Specifications are grouped by architectural concern to show how the different parts of the system work together.

🎬 Core Agent System

The seven agents that implement the multi-agent video production pipeline.

| Agent | Role | Key Innovation |
| --- | --- | --- |
| ProducerAgent | Budget planning, pilot strategies | Competitive pilot system with dynamic budget allocation |
| ScriptWriterAgent | Scene breakdown from concept | Provider-aware prompt optimization using learnings |
| VideoGeneratorAgent | Video generation | Pluggable providers with unified interface |
| AudioGeneratorAgent | TTS narration | Multi-provider TTS (ElevenLabs, OpenAI, Google) |
| QAVerifierAgent | Vision-based verification | Claude Vision analysis of video frames |
| CriticAgent | Quality evaluation | Gap analysis, learning extraction |
| EditorAgent | EDL creation | Selects best scenes from multiple pilots |
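
As an illustration of the last row, the sketch below shows one way an editor step could pick the best take of each scene across competing pilots. It is a minimal sketch, not the project's implementation: `SceneTake`, `select_best_takes`, and the numeric `score` field are hypothetical names, and it assumes that an upstream step has already assigned each take a single comparable quality score.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SceneTake:
    """One rendered scene from one pilot (all fields are illustrative)."""
    pilot_id: str
    scene_index: int
    score: float  # assumed quality score, higher is better


def select_best_takes(takes: list[SceneTake]) -> list[SceneTake]:
    """Keep the highest-scoring take for each scene, ordered by scene index."""
    best: dict[int, SceneTake] = {}
    for take in takes:
        current = best.get(take.scene_index)
        if current is None or take.score > current.score:
            best[take.scene_index] = take
    return [best[i] for i in sorted(best)]


takes = [
    SceneTake("pilot_a", 0, 0.72),
    SceneTake("pilot_b", 0, 0.81),
    SceneTake("pilot_a", 1, 0.90),
    SceneTake("pilot_b", 1, 0.65),
]
edl = select_best_takes(takes)
print([(t.scene_index, t.pilot_id) for t in edl])
# [(0, 'pilot_b'), (1, 'pilot_a')]
```

Ties keep the first take seen, so the input order acts as a tie-breaker in this sketch.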

Design Philosophy: Each agent has a single, well-defined responsibility. They communicate through structured data, enabling parallel execution and easy testing.
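
A rough sketch of what "communicating through structured data" can look like in practice is given below. The `Scene` and `Asset` types and the two stand-in agent functions are assumptions made for illustration and are not taken from the specifications. Because each function depends only on its input record, the two steps can run in parallel and be tested in isolation.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Scene:
    """Hypothetical record passed from the script step to the generators."""
    index: int
    description: str
    narration: str


@dataclass(frozen=True)
class Asset:
    """Hypothetical record returned by a generator step."""
    scene_index: int
    kind: str  # "video" or "audio"
    path: str


def generate_video(scene: Scene) -> Asset:
    # Stand-in for a video agent; a real one would call a provider here.
    return Asset(scene.index, "video", f"scene_{scene.index}.mp4")


def generate_audio(scene: Scene) -> Asset:
    # Stand-in for a TTS agent working from the narration text.
    return Asset(scene.index, "audio", f"scene_{scene.index}.mp3")


def produce(scenes: list[Scene]) -> list[Asset]:
    """Run both generator steps concurrently over all scenes."""
    with ThreadPoolExecutor() as pool:
        videos = pool.map(generate_video, scenes)
        audios = pool.map(generate_audio, scenes)
        # Consume the iterators inside the block so results are collected
        # before the pool shuts down.
        return [*videos, *audios]


assets = produce([Scene(0, "sunrise", "Morning."), Scene(1, "city", "Noon.")])
print(len(assets))
```

Frozen dataclasses keep the records immutable, which makes it safe to share them between threads.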

🏗️ System Architecture

Foundation and infrastructure specifications.

🔌 Provider System

Pluggable providers for video, audio, image, music, and storage.

Key Innovation: Providers are hot-swappable. The same script can be produced with Luma, Runway, or Pika by changing a single CLI flag.
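
One common way to get this kind of swap is a registry keyed by provider name, with every provider implementing the same interface. The sketch below assumes that design; the `--provider` flag, the `VideoProvider` protocol, and the stub class are hypothetical and make no real API calls, so the actual CLI and provider classes may look different.

```python
import argparse
from typing import Callable, Protocol


class VideoProvider(Protocol):
    """Assumed unified interface that every video provider implements."""
    name: str

    def generate(self, prompt: str) -> str: ...


class StubProvider:
    """Placeholder provider; a real one would wrap the vendor's API."""

    def __init__(self, name: str) -> None:
        self.name = name

    def generate(self, prompt: str) -> str:
        return f"[{self.name}] {prompt}"


# Registry mapping the flag value to a provider factory.
PROVIDERS: dict[str, Callable[[], VideoProvider]] = {
    "luma": lambda: StubProvider("luma"),
    "runway": lambda: StubProvider("runway"),
    "pika": lambda: StubProvider("pika"),
}


def main(argv: list[str]) -> str:
    parser = argparse.ArgumentParser(description="Provider selection sketch")
    parser.add_argument("prompt")
    # Hypothetical flag name; only the registry lookup depends on it.
    parser.add_argument("--provider", choices=sorted(PROVIDERS), default="luma")
    args = parser.parse_args(argv)
    provider = PROVIDERS[args.provider]()
    return provider.generate(args.prompt)


print(main(["--provider", "runway", "a sunrise over the sea"]))
```

The calling code never names a specific vendor, so adding a provider means adding one registry entry.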

🧠 Memory & Learning System

Continuous improvement through learning and knowledge integration.

Evolution: The system started with simple prompt optimization and evolved into full knowledge management and an enterprise memory system.

🛠️ Developer Experience

Tools and infrastructure for building and testing.

Focus: Make it easy to develop without incurring API costs. Mock mode first, live mode when ready.
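
The mock-first idea can be sketched as a factory that returns a mock provider unless live mode is requested explicitly. Everything here is an assumption for illustration: `MockVideoProvider`, `LiveVideoProvider`, `make_provider`, and the shape of the result dictionary are hypothetical names, not the project's API.

```python
from dataclasses import dataclass


@dataclass
class MockVideoProvider:
    """Returns placeholder results without making any network calls."""
    calls: int = 0

    def generate(self, prompt: str) -> dict:
        self.calls += 1
        return {
            "path": f"mock_{self.calls:03d}.mp4",
            "cost_usd": 0.0,  # mock runs never incur API costs
            "prompt": prompt,
        }


class LiveVideoProvider:
    """Placeholder for a provider that would call a paid API."""

    def generate(self, prompt: str) -> dict:
        raise NotImplementedError("wire up a real provider API here")


def make_provider(live: bool = False):
    """Default to mock so tests and local runs do not spend money."""
    return LiveVideoProvider() if live else MockVideoProvider()


provider = make_provider()
result = provider.generate("a sunrise over the sea")
print(result["path"], result["cost_usd"])
# mock_001.mp4 0.0
```

Making mock the default means that forgetting to pass a setting fails safe: the worst case is a placeholder file rather than an unexpected bill.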


View By Timeline

Timeline View → Evolution Story →
