# 🩺 AI Healthcare Voice Assistant

A full-stack healthcare application that uses AI-powered voice interactions for patient triage, symptom analysis, specialist mapping, and appointment booking. It is built with FastAPI, PostgreSQL, and React, and integrates with AI models such as Gemini and GPT-4.

---
## System Architecture & User Flow
```
┌────────────────────────────────────────────────────────────────────────────────────┐
│                          🎤 VOICE-ENABLED USER INTERFACE                           │
│                                  (React Frontend)                                  │
└──────────────────────┬─────────────────────────────────────────────────────────────┘
                       │
                       ▼
┌────────────────────────────────────────────────────────────────────────────────────┐
│                                USER AUTHENTICATION                                 │
│ ┌─────────────────┐    ┌──────────────────┐    ┌─────────────────────────────────┐ │
│ │ User Login      │───▶│ FastAPI Backend  │───▶│ PostgreSQL Database             │ │
│ │ (Email/Password)│    │ sp_login_user    │    │ (users table)                   │ │
│ └─────────────────┘    └──────────────────┘    └─────────────────────────────────┘ │
└──────────────────────┬─────────────────────────────────────────────────────────────┘
                       │
                       ▼
┌────────────────────────────────────────────────────────────────────────────────────┐
│                            🎙️ VOICE SYMPTOM COLLECTION                             │
│ ┌─────────────────┐    ┌──────────────────┐    ┌─────────────────────────────────┐ │
│ │ Web Speech API  │───▶│ Speech-to-Text   │───▶│ Symptom Phrases Array           │ │
│ │ (Microphone)    │    │ Conversion       │    │ ["headache", "fever", ...]      │ │
│ └─────────────────┘    └──────────────────┘    └─────────────────────────────────┘ │
└──────────────────────┬─────────────────────────────────────────────────────────────┘
```
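The voice symptom collection stage turns recognized speech into a symptom phrases array such as `["headache", "fever", ...]`. The following is a minimal Python sketch of that last step, assuming the transcript arrives as plain text from the speech-to-text conversion. The function name `extract_symptom_phrases` and the filler-word list are hypothetical and are not taken from the project, which may handle this differently (for example, in the React frontend).

```python
import re

# Hypothetical filler prefixes to strip; not taken from the project.
FILLER_PREFIXES = ("i have a ", "i have ", "i feel ", "i am ", "i'm ", "a ", "an ", "some ")


def extract_symptom_phrases(transcript: str) -> list[str]:
    """Split a speech-to-text transcript into unique, lowercase symptom phrases."""
    # Split on commas, semicolons, full stops, and the standalone word "and".
    parts = re.split(r"[,.;]|\band\b", transcript.lower())
    phrases: list[str] = []
    for part in parts:
        phrase = part.strip()
        # Repeatedly strip leading filler such as "i have a".
        stripped = True
        while stripped:
            stripped = False
            for prefix in FILLER_PREFIXES:
                if phrase.startswith(prefix):
                    phrase = phrase[len(prefix):].strip()
                    stripped = True
        # Keep non-empty phrases, preserving first-seen order without duplicates.
        if phrase and phrase not in phrases:
            phrases.append(phrase)
    return phrases


print(extract_symptom_phrases("I have a headache, fever and a sore throat."))
# ['headache', 'fever', 'sore throat']
```

Keeping this step as a pure function makes it easy to unit-test separately from the microphone and speech recognition layers.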