A versatile multi-modal chat application that enables users to develop custom agents, create images, leverage visual recognition, and engage in voice interactions. It integrates seamlessly with local LLMs and commercial models like OpenAI, Gemini, Perplexity, and Claude, and lets users converse with uploaded documents and websites.
<div align="center">
<img src="./docs/assets/logo-large.jpg" alt="logo-large" width="400" height="400">
<h1>Stellar Chat</h1>
A powerful multi-modal chat application that empowers users to create custom agents, generate images, utilize visual recognition, and engage in voice conversations. It seamlessly integrates with local LLMs and commercial models like OpenAI, Gemini, Perplexity, and Claude, while also offering the capability to converse with uploaded documents and websites.
<p align="center">
<a href="https://docs.stellar-chat.com/"><strong>Documentation</strong></a>
|
<a href="https://github.com/ktutak1337/Stellar-Chat/issues/new?assignees=&labels=%F0%9F%90%9B+Bug&projects=&template=bug_report.yml&title=%5BBug%5D+"><strong>Report Bug</strong></a>
|
<a href="https://github.com/ktutak1337/Stellar-Chat/issues/new?assignees=&labels=%F0%9F%A4%A9+Feature+Request&projects=&template=feature_request.yml&title=%5BRequest%5D+"><strong>Request Feature</strong></a>
</p>
[Build status](https://github.com/ktutak1337/Stellar-Chat/actions/workflows/github-actions.yaml)
[.NET 8.0](https://dotnet.microsoft.com/en-us/download/dotnet/8.0)
[License](https://github.com/ktutak1337/Stellar-Chat/blob/main/LICENSE)
[100 Commits](https://100commitow.pl)
<h3>⭐️ Your star motivates me greatly! ⭐️</h3>
> [!NOTE]
>
> This project is part of the ["100 Commits"](https://100commitow.pl/) competition, which challenges participants to commit to their projects by making at least one meaningful commit every day for 100 consecutive days.
>
</div>
<details>
<summary><kbd>Table of Contents</kbd></summary>
1. [🎥 Demo](#-demo)
2. [✨ Features](#-features)