Free Podwise: AI Podcast & Youtube Transcription & Understanding Agent | 播客+youtube转文字/学习/可视化AI工具
# 🎧 PodLens - Free Podwise: Podcast & Youtube Transcription & Summary AI Agent 🧠 For knowledge-seekers who want to learn from audio content more effectively. 🤖 Now with 24x7 automation service & 📧 smart email digest & 📒 sync to Notion! A fast & cost-free & AI-powered tool that: - 🎙️ transcribes audio content from Apple Podcast and YouTube platforms - 📝 summarizes - 📊 visualizes - 🌏 features bilingual Chinese/English interface [中文版 README](README_zh.md) | **English README**  ## ✨ Key Features - 🤖 **24x7 Intelligent Automation**: Set-and-forget service monitors your favorite podcasts and YouTube channels, automatically processing new episodes hourly - **autopodlens** - 📧 **Smart Email Digest**: Daily automated email summaries with AI-generated insights and processed content overview - 📝 **Sync to Notion**: Automatically sync processed content to Notion with your own Notion page and token - 🎯 **Interactive Manual Mode**: On-demand processing with intuitive command-line interface for immediate transcription and analysis of specific episodes - **podlens** - ⚡ **Ultra-Fast Smart Transcription**: Multiple AI-powered methods (Groq API for speed, MLX Whisper for large files) with intelligent fallback chain - 🍎 **Apple Podcast & YouTube Integration**: Seamless content extraction from both major platforms with smart episode detection - 🧠 **AI-Powered Analysis**: Generate intelligent summaries and insights using Google Gemini AI with structured topic analysis - 🎨 **Interactive Visual Stories**: Transform content into beautiful, responsive HTML visualizations with data charts and modern UI - 🌍 **Bilingual Support**: Full Chinese/English interface with smart language detection and switching - 🗂️ **Smart Organization**: Episode-based folder structure with automatic file management and duplicate detection ## 📦 Installation ```bash pip install podlens ``` ## 🔧 Configuration ### 1. Create .env Configuration File Cr
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.