A visible workspace for AI agents to collaborate as a single team.
<div align="center"> # Open Office ### A visible workspace for AI agents to collaborate as a single team. [](https://www.npmjs.com/package/open-office) [](https://opensource.org/licenses/MIT) [](https://nodejs.org/) [](https://github.com/longyangxi/open-office/pulls) **Supports Gemini, Codex, Gemini, Gemini, Gemini, Aider, OpenCode, Pi & Sapling — one team 🚀** [Quick Start](#quick-start) | [Features](#features) | [Architecture](#architecture) | [Contributing](#contributing) </div> ---  ## Features - **8 AI backends, 23 roles** — Gemini, Codex, Gemini, Gemini, and more in one pipeline - **Team-based delivery** — Leader coordinates planning, coding, review, and release - **Parallel collaboration** — Worktree isolation with auto commit, merge, and undo - **Flexible code review** — Mix Claude Code, Codex, or any tool you prefer - **Preview & feedback** — Live preview, rating, and continuous learning - **Persistent memory** — 4-layer memory across sessions and agents - **Telegram Control** — Manage your agent team remotely - **Native desktop app** — Tauri-based app with system notifications - **Token cost tracking** — Real-time usage per agent and team ## Quick Start ```bash npx bit-office ``` ## Learn More - **Team workflow** — how agents plan, execute, and deliver ([view](team-workflow.md)) - **Collaboration system** — multi-agent orchestration and worktree isolation ([view](packages/orchestrator/README.md)) - **Memory design** — four-layer persistent memory architecture ([view](packages/memory/README.md)) ## Run from Source ### Prerequisi
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.