An md file as a chat interface and editable history in one.
# chat.md: The Hacker's AI Chat Interface [Experimental]

**Finally, a fully editable chat interface with MCP support on any LLM.**

chat.md is a Visual Studio Code extension that reimagines AI interaction through plain text files. Unlike ephemeral web interfaces or proprietary chat windows, chat.md embraces a file-first approach where your conversations with AI are just markdown files with a `.chat.md` extension. Edit them, version control them, share them - they're your files.

The AI writes its response directly in the file. Any `*.chat.md` file is now an AI agent hackable by you. Go crazy with non-linear AI conversation.

<img width="1005" alt="image" src="https://github.com/user-attachments/assets/67983da2-6046-4ba8-bc8c-16944a5476fd" />

[Usage video](https://www.youtube.com/watch?v=DyYoZLmpzc0)

[Here's the chat I used to publish this vscode extension using gemini-2.5-pro and wcgw mcp](samples/publishing-help/chat.chat.md)

NOTE ⚠️: chat.md is 100% AI coded and should be treated as a feature-rich POC.

## Why chat.md?

| Other AI Tools | chat.md |
|----------------|---------|
| ❌ Linear conversations or limited editing | ✅ Non-linear editing - rewrite history, branch conversations |
| ❌ Tool execution tied to proprietary implementations | ✅ Any LLM model can do tool calling |
| ❌ Can't manually edit AI responses | ✅ Put words in LLM's mouth - edit and have it continue from there |
| ❌ MCP not supported in many LLMs | ✅ Any LLM model can use MCP servers |
| ❌ Responses cut off at the max token limit can't be resumed | ✅ Resume incomplete AI responses at any point |
| ❌ Conversations live in the cloud or are inaccessible | ✅ Files stored locally alongside your code in human-readable format |
| ❌ Context kept separate from your workspace | ✅ Attach files directly from your project |

## Features

### 🗣️ File-Based Conversations

Unlike Gemini's inline suggestions, Gemini's web interface, or Gemini's side panel, chat.md treats conversations as *first-class files* in your
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.