A next-generation, AI-powered infinite canvas workspace built for creators and developers. Experience the future of generative AI with a drag-and-drop node interface that combines Google Gemini 3 Pro, Veo 3.1, and LangGraph agents into a seamless creative workflow.
<div align="center">
  <img src="public/TwitCanva-logo.png" alt="TwitCanva Logo" width="120" />
  <h1>TwitCanva</h1>
</div>

A modern, AI-powered canvas application for generating and manipulating images and videos using OpenAI GPT Image, Google Gemini, Kling AI, Hailuo AI (MiniMax), and Fal.ai. Built with React, TypeScript, and Vite.

## Star History

[Star History chart](https://www.star-history.com/#SankaiAI/TwitCanva-Video-Workflow&type=date&legend=top-left)

## ✨ Features

- **🎨 Visual Canvas Interface** - Drag-and-drop node-based workflow
- **🤖 Multi-Model AI Generation** - GPT Image 1.5, Gemini Pro, Kling V1-V2.5 for images
- **🎬 Multi-Model Video Generation** - Veo 3.1, Kling V1-V2.6, Hailuo 2.3/O2 for videos
- **🎥 Camera Angle Control** - Transform any image by adjusting camera rotation and tilt angles (Qwen-Image-Edit)
- **📋 Storyboard** - Create video storyboards with consistent characters and layouts
- **💃 Motion Control** - Transfer motion from reference videos to character images (Kling V2.6 via Fal.ai)
- **📥 TikTok Import** - Download TikTok videos without watermark for use as motion references
- **📤 Post to X** - Share generated images/videos directly to Twitter/X with one click
- **📤 Post to TikTok** - Share generated videos directly to TikTok with one click
- **🖼️ Image-to-Image** - Use reference images for generation
- **📽️ Frame-to-Frame Video** - Animate between start and end frames
- **🔗 Smart Node Connections** - Type-aware validation (IMAGE→VIDEO, TEXT→IMAGE, etc.)
- **💬 AI Chat Assistant** - Built-in chat with LangGraph agent
- **📚 Asset Library** - Save and reuse generated assets
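
To make the type-aware validation behind Smart Node Connections more concrete, here is a minimal sketch of how such a check could look. It is not TwitCanva's actual implementation: the names `NodeKind`, `ALLOWED_CONNECTIONS`, and `canConnect` are hypothetical, and the table contains only the two pairs named in the feature list (IMAGE→VIDEO and TEXT→IMAGE).

```typescript
// Hypothetical node kinds, used for illustration only.
type NodeKind = "TEXT" | "IMAGE" | "VIDEO";

// Allowed target kinds for each source kind.
// Assumption: only the two pairs listed in the README are included here;
// the real application may permit more connections.
const ALLOWED_CONNECTIONS: Record<NodeKind, readonly NodeKind[]> = {
  TEXT: ["IMAGE"],
  IMAGE: ["VIDEO"],
  VIDEO: [],
};

// Returns true when an edge from `source` to `target` is permitted.
function canConnect(source: NodeKind, target: NodeKind): boolean {
  return ALLOWED_CONNECTIONS[source].some((kind) => kind === target);
}
```

A canvas could call `canConnect(sourceKind, targetKind)` while the user drags an edge and reject the drop when it returns `false`. Keeping the rules in one lookup table means a new pairing is a one-line change.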
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.