Prepared from the Product Assembly Session, May 24, 2026 — Naseeruddin Mohammad & Mcqalam X.
The software industry has operated under a deeply flawed assumption for decades: that software is the primary artifact of product development. Business requirements get translated into technical specifications, handed to engineers, built in isolation, and returned to business owners who often find it does not match their original vision.
Assembly challenges this at the root. Software becomes invisible infrastructure. The product — the outcome, the user experience, the business value — is all that matters.
A group of humans — business owners, PMs, designers, customers — enter a live Assembly session and simply talk about what they want to build. AI agents listen, probe, orchestrate, and construct the product in real time. The software writes itself. No IDE, no git commands, no deployment scripts visible to any non-technical participant.
A world where anyone with a business idea can bring a working product to life through natural conversation — where the distance between imagination and a live, deployed application collapses to a single Assembly session.
To eliminate the gap between business imagination and working software by creating the world's first AI-powered collaborative Assembly platform — where multi-human, multi-agent live sessions replace the entire traditional software development lifecycle.
The following seven journeys represent the complete lifecycle of a product built on Assembly. Each CUJ maps to one or more technical modules and product phases.
Who: Business Owner, Product Manager, invited collaborators. Trigger: a business owner wants to build a new product or feature.
Success: all participants in session, agents listening, Phase Barometer visible.
Humans converse freely about their product idea. AI agents listen passively and extract structured product requirements in real time.
Unlike any existing tool, multiple AI agents participate simultaneously as distinct expert personas — each with a different lens on the product — coordinated by the Agentic Orchestration Engine without interrupting each other or talking over humans.
When the Phase Barometer crosses the prototype-ready threshold, the product materializes in front of the assembly while the conversation is still happening.
Participants react to the prototype verbally. The session becomes a feedback-driven iteration loop.
When the assembly is satisfied, the product advances through deployment phases. Environment Agent personas activate.
| Environment Persona | Interaction Focus |
|---|---|
| Dev Agent | New feature requests, bug investigation, code changes in development environment |
| UAT Agent | Quality testing scenarios, user acceptance criteria, staging validation, edge cases |
| Prod Agent | Daily active users, geographic analytics, error rates, support tickets, SLA monitoring |
Participants naturally direct their conversation to the appropriate environment agent. The system routes queries automatically based on intent — no participant needs to know which agent to address.
The session link is permanent. Any authorized participant can join at any time throughout the product's entire life.
The Assembly session link is not a meeting link. It is the product itself. Every state the product has ever been in is accessible through the Timeline. The product's entire life — from first idea to decommission — lives in one place.
Stakeholders who were absent from a session can rejoin at any time and see the current state. No manual catch-up documents, no review meetings.
The Assembly platform is organized into six architectural layers. The Agentic Orchestration Engine is the core IP. All other layers either feed into it or execute its instructions.
Voice and screen input from participants flows into the Session Engine, which converts it to structured context (speaker-attributed text with role metadata). The Agentic Orchestration Engine receives this context stream and routes it to the appropriate agent personas. Agents process their domain slice, collaborate with peer agents, and instruct the Frontier Model layer to generate or modify code. The Prototype Renderer reflects changes in the live browser. All state changes are persisted to the Data Foundation with immutable Timeline snapshots.
The loop is continuous: agent outputs return to the session (via voice/text response or prototype update), participants react, new context flows in. No step is manual. No deployment script is visible.
This is what Assembly will patent, protect, and continuously improve. It is not a chatbot or a copilot. It is a multi-agent coordination system with the following responsibilities:
The following 15 modules constitute the complete technical implementation of Assembly. Each module is annotated with the development milestone in which it is first introduced.
| # | Module | Description & Responsibility | Milestone |
|---|---|---|---|
| M1 | Assembly Session Engine | WebRTC/WebSocket-based real-time session management. Multi-participant audio/video/chat streams, session lifecycle, evergreen link infrastructure. | M-1 |
| M2 | Multi-modal Input Processor | Captures and normalizes voice (Whisper/Deepgram), screen share, and text chat into a unified, structured context object per utterance or event. | M-1 |
| M3 | Speaker Diarizer & Role Mapper | Identifies who is speaking via voice fingerprinting and maps each speaker to their assigned role. Enables agents to weight input by persona. | M-2 |
| M4 | Agentic Orchestration Engine | Core IP. Context routing, phase management, turn-taking arbitration, agent dispatching and output merging, Timeline custody. The central nervous system. | M-2 |
| M5 | Phase Barometer & Transition Mgr | Computes readiness score across 6 product phases using a weighted signal model. Triggers environment agent activation at deployment threshold. | M-2 |
| M6 | AI Agent Persona Framework | Plugin-style framework for defining, instantiating, and managing AI agent personas — domain scope, prompt strategy, probing question library, output schema. | M-2 |
| M7 | Environment Agent Personas | Dev Agent (features/bugs), UAT Agent (quality/testing), Prod Agent (metrics/incidents). Context routing directs queries to the right environment automatically. | M-3 |
| M8 | Code Generation Engine | Interfaces with frontier code models (Claude API, OpenAI) to generate and iteratively refine web application code from structured specs. | M-2 |
| M9 | UI/UX Generation Engine | Interfaces with design-capable frontier models to generate UI components and design-system-compliant output. Web (DOM) only. | M-2 |
| M10 | Voice Model Integration | STT (Whisper/Deepgram) for real-time transcription and intent extraction; TTS (ElevenLabs/OpenAI) for agent voice responses. | M-1 |
| M11 | Prototype Renderer & App Agent | Embeds a live browser in the session (Playwright/cloud browser). App Agent navigates the prototype on behalf of participants, streaming to all members. | M-2 |
| M12 | Product Development Data Model | Core entity graph: Product, Session, Phase, Feature, UserJourney, Component, AgentDecision, TimelineEvent, Participant. The schema everything reads and writes. | M-1 |
| M13 | Timeline Manager | Append-only event log of every state transition. Point-in-time restore ("show me February 2025"). Single timeline, immutable history. | M-2 |
| M14 | IAM & Session Control | Role-based access control for participants and agents. Evergreen link permissions, invitations, agent configuration per session. | M-1 |
| M15 | Analytics & Observability | Production metrics ingestion (DAU, errors, geo, tickets) for the Prod Agent; platform observability (agent latency, model costs, session health). | M-3 |
When a participant shares their screen, this sub-module captures the stream, processes highlighted or pointed-at regions using computer vision, and generates structured visual context (e.g., "region: top navigation bar, instruction: change background to navy"). This feeds the UI Agent without requiring participants to type descriptions.
When the Agentic Engine completes a significant background task (prototype generated, vulnerability patched, UAT completed), it triggers the Notification Coordinator. This module sends calendar invites to the assembly group, emails, and in-app notifications — automatically scheduling the next review session so no human has to manage the iteration cycle.
The product will be built iteratively. No throwaway work — each milestone is a foundation for the next. The goal is a delightful, demo-ready product for the first 5 beta customers (VC adjacents) by Milestone 4.
No custom infrastructure. Humans meet on Google Meet, record, export the transcript, and manually upload to a frontier model with a structured product-extraction prompt. Model generates product type, CUJs, modules, and an initial HTML/JS prototype. Success: at least one prototype generated from a real conversation in under 2 hours.
A Read.ai-equivalent meeting plugin listens live and auto-triggers the prototype pipeline at meeting end; the generated prototype deploys to a URL and a calendar invite is auto-sent. Modules: M1 (partial), M2, M10, M12 (schema), M14 (basic). Success: session → prototype → invite in under 15 minutes post-meeting.
Assembly's own session interface with a single orchestrator agent: listens, extracts, generates the prototype in real time. Phase Barometer UI visible; Timeline activated; App Agent demonstrates in-session. Modules: M1, M2, M4, M5, M8, M9, M11, M12, M13, M14. Success: 5 internal test sessions produce working prototypes without any human-written code.
PM, UX, UI, and Tech Architect agents activate as distinct session participants. Speaker Diarizer maps voices to roles; Turn-Taking prevents agent pile-ons; agents collaborate behind the scenes. Modules: M3, M6 (full), M7 (partial), Screen Annotator. Success: a beta customer says "this is unbelievably different from anything I have seen."
Dev / UAT / Prod personas fully active with an automated deployment pipeline. Prod Agent surfaces real production metrics; vulnerability management and enhancement planning run through the same session; Notification Coordinator fully operational. Modules: M7 (full), M15, Notification Coordinator. Success: the first customer's product goes live and is monitored entirely through Assembly.
| Open Question | Research Action |
|---|---|
| Multi-human to single/multi-agent voice management — who has the floor, how does the system arbitrate? | Research OpenAI Realtime API, Google Project Astra, production apps using multi-party voice AI. Identify underlying architecture. |
| Shared browser interaction — can multiple users interact with the same live DOM simultaneously? | Decided: App Agent mediates all clicks. Participants direct the agent by voice. Direct multi-user DOM interaction is not in scope for v1. |
| Agent-to-agent collaboration model — how do PM Agent and UX Agent coordinate without user-visible noise? | Design a structured inter-agent message protocol within the Orchestration Engine. Agent-to-agent communication is invisible to participants. |
| Model cost management — frontier model token costs at scale could be prohibitive. | Design smart context compression. Cache model outputs. Use tiered models (cheaper for simple tasks, frontier for complex generation). |