Agent Factory Recap: AI Agents for Data Engineering and Data Science — DeepSeek Blog | Neura Market

    Lucía Subatin January 12, 2026

Welcome to another exciting episode of [The Agent Factory](https://www.youtube.com/playlist?list=PLIivdWyY5sqLXR1eSkiM5bE6pFlXC-OSs), the podcast that goes beyond the hype to build production-ready AI agents! In this episode, we were thrilled to host Lucia Subatin, who guided us through the world of data agents and their transformative power for data engineers and scientists. She also showcased some truly innovative applications of graph databases and AI for better access to knowledge.

This post guides you through the key ideas from our conversation. Use it to quickly recap topics or dive deeper into specific segments with links and timestamps.

## The Agent Industry Pulse

Timestamp: [[01:45](https://www.youtube.com/watch?v=ATgIU47V1yI&list=PLIivdWyY5sqLXR1eSkiM5bE6pFlXC-OSs&index=1&t=105s)]

This week, the agent industry is buzzing with some groundbreaking releases:

- **[Gemini API's Computer Use Model](https://ai.google.dev/gemini-api/docs/computer-use)**: A new model that grants AI agents the ability to "see" and interact with your computer screen. It takes screenshots, decides on UI actions (click, scroll, type, open webpage), and executes them, allowing agents to automate real-world browser tasks like filling forms or testing user flows. The model is built with safety layers: every action undergoes a safety check, and risky operations require human confirmation. We even saw a [demo](https://youtu.be/ATgIU47V1yI?si=f3NihlFVN6jRYx_y&t=216) of it looking up pricing on a documentation page!

  ![](https://storage.googleapis.com/gweb-cloudblog-publish/images/computer-use-model-demo.max-2200x2200.png)

- **[CodeMender - AI Agent for Code Security](https://deepmind.google/discover/blog/introducing-codemender-an-ai-agent-for-code-security/)**: This AI agent is designed to autonomously patch new vulnerabilities as they arise (reactive) and rewrite existing code to secure entire classes of flaws (proactive).
Leveraging the reasoning power of Gemini Deep Think and equipped with self-correction tools like static analysis and fuzzing, CodeMender automates the creation and validation of high-quality security patches at scale. It has already upstreamed 72 security fixes to open-source projects, marking a significant breakthrough for software security.

## The Factory Floor

The Factory Floor is our segment for getting hands-on. Here, we moved from high-level concepts to practical code with live demos.

## [The BigQuery Data Engineering Agent](https://www.youtube.com/redirect?event=video_description&redir_token=QUFFLUhqbVViNWZFcXF3Y1Z1eHplckhVLUpsdHBfaGJCd3xBQ3Jtc0trRHRCNHYtQVBlTFV0UUlBNVk4aEtEQktkcDV0TFhEdTdSSU5YQmtwcUdtVGdnMHBxaVYxekVrSWp2SjJGWTNTVFZucndDeVZwMXktWG1LeGhieU9rbWpucFQ2S2V6UVZkNDBtMGRoenZxaEk2SUtEaw&q=https%3A%2F%2Fgoo.gle%2Fbq-dea-pre-ga-form&v=ATgIU47V1yI)

![](https://storage.googleapis.com/gweb-cloudblog-publish/images/data-engineering-agent.max-2200x2200.png)

Timestamp: [[06:44](https://www.youtube.com/watch?v=ATgIU47V1yI&list=PLIivdWyY5sqLXR1eSkiM5bE6pFlXC-OSs&index=1&t=404s)]

We dove into the BigQuery Data Engineering Agent, a powerful tool for automating data pipeline creation and management directly within BigQuery.

- **Generating Sales Regions:** Lucia demonstrated how to use the agent to add a new `sales_region` field to an `accounts` table based on the `billing_country`, leveraging BigQuery's `AI_GENERATE` function to call Gemini 2.5 Flash from a SQL statement.
- **Creating a Time Dimension Table:** The agent was then prompted to generate a comprehensive `time_dimension` table. Such a table helps natural-language-to-SQL queries because it exposes ready-made date components (year, quarter, month name) for easier analysis.
- **Automating Data Quality Assertions:** Finally, Lucia showed how the agent can automatically generate data quality assertions for all tables, such as ensuring non-null IDs and unique account names, to maintain data cleanliness and reliability for agent applications.

## [The Data Science Agent](https://www.youtube.com/redirect?event=video_description&redir_token=QUFFLUhqbm5zX2JVMzlwcUR5dFJqVklvZENJX2d0OGtOd3xBQ3Jtc0tsVjBtRGtaTXdhQWZxUEhNV1ZhUDVKVHM3eG1Lb1Bzb3BaRVZUQ0tsdWVXSDBncDlqR0RQQkxCa3JNS0tsZVV1dFZMNVBUOF9UVTNvX0dWN3RnRGFoWEk4WUtacWlSWlZJeThkRHJ6YW12S3RXTDEyUQ&q=https%3A%2F%2Fgoo.gle%2F4n8ceUV&v=ATgIU47V1yI)

![](https://storage.googleapis.com/gweb-cloudblog-publish/images/data-science-agent.max-2200x2200.png)

Timestamp: [[07:24](https://youtu.be/ATgIU47V1yI?si=NAWeNZEmC3dp_Ol-&t=444)]

Next, we explored the Data Science Agent, operating within Colab Enterprise, to extract insights and prepare data for agent applications.

- **Anomaly Detection:** Lucia tasked the agent with detecting anomalies in a Case table. The agent formulated a plan to load and describe data, preprocess it for anomaly detection, train an isolation forest model, and provide visualizations.
- **Identifying Anomalous Records:** After executing its plan, the agent successfully identified anomalous records, provided a summary of its findings, and even presented a visual confirmation of the separation between normal and anomalous data points. It also offered insights and next steps to understand the root causes of these anomalies, proving invaluable for improving data collection processes.

### Creating Comics from Spanner Concepts using an ADK

Timestamp: [[26:01](https://www.youtube.com/watch?v=ATgIU47V1yI&list=PLIivdWyY5sqLXR1eSkiM5bE6pFlXC-OSs&index=1&t=1561s)]

In a truly unique demonstration, we saw how to combine a graph database with AI for creative content generation.
- **Spanner Graph Database:** Lucia explained Spanner as a globally distributed, strongly consistent database with graph capabilities. She showcased a graph database built from Spanner's documentation, traversable via GQL.
- **Knowledge Traversal and Comic Generation:** Using an ADK application, a knowledge agent traversed this Spanner graph database to answer "What are regions?" Based on the retrieved information, another agent generated a detailed prompt for Nano Banana, an image generation model, to create a six-panel comic strip explaining Spanner regions in a vibrant tech illustration style. The comic visually explained regional, dual-region, and multi-region configurations.

The following is an example of another comic generated by the agent, responding to the question “What is interleaving?”

![](https://storage.googleapis.com/gweb-cloudblog-publish/images/blog_post_image.max-600x600.png)

It was incredible to see how agents could not only retrieve precise information but also transform it into engaging visual content, even with multiple iterations to refine text clarity in the generated images.

## Developer Q&A

Timestamp: [[38:49](https://www.youtube.com/watch?v=ATgIU47V1yI&t=2329s)]

We wrapped up with some great questions from our developer community:

### On the Availability of Data Science and Data Engineering Agents

Timestamp: [[38:53](https://youtu.be/ATgIU47V1yI?si=3o3RxVVo4p7fz6-a&t=2333)]

Both the Data Science Agent and the Data Engineering Agent are currently in preview. The Data Science Agent is in public preview, while access to the Data Engineering Agent requires following a specific link, which we'll provide in the description. This means developers can start experimenting with these powerful tools today!

### On the Scalability and Deployment of the Data Engineering Agent

Timestamp: [[39:33](https://youtu.be/ATgIU47V1yI?si=tWIKbWRZgEtWITAw&t=2373)]

The Data Engineering Agent leverages highly scalable platforms: BigQuery and Dataform.
It can perform analysis across multiple tables, datasets, and projects, provided the executing pipeline has the necessary permissions. For deployment to higher environments (staging, production), Dataform excels in assisting the data pipeline lifecycle by generating declarative artifacts that can be released and configured for deployment across various project and dataset combinations, ensuring a robust software delivery lifecycle for your data pipelines.

What an incredible journey through the world of data agents and creative AI! We hope this episode inspired you to explore the possibilities of augmenting your data workflows and even generating engaging content with these innovative tools. The power to build cleaner data pipelines, derive deeper insights, and bring complex concepts to life through AI is truly at your fingertips.

## Your turn to build

Ready to get hands-on? Dive into the resources linked below and start building your own data agents and AI-powered applications today! Don't forget to [watch the full episode](https://www.youtube.com/watch?v=ATgIU47V1yI&list=PLIivdWyY5sqLXR1eSkiM5bE6pFlXC-OSs&index=2) for all the practical demonstrations.

## Connect with us

- Smitha Kolan → [LinkedIn](https://www.linkedin.com/in/smithakolan/), [YouTube](https://www.youtube.com/@smithakolan), [X](https://x.com/smithakolan), [Instagram](https://www.instagram.com/girlknowsai/)
- Lucia Subatin → [LinkedIn](https://www.linkedin.com/in/lsubatin/), [YouTube](https://www.youtube.com/@LuciaSubatin), [TikTok](https://www.tiktok.com/@unicorn.on.the.cl), [Instagram](https://www.instagram.com/a.cloudy.unicorn/)
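
If you want a feel for the time dimension idea from the Data Engineering Agent segment before trying the agent itself, here is a minimal Python sketch of what such a table contains. It is not the agent's output (in the episode the agent generated the table inside BigQuery); the function name `build_time_dimension` and the column names other than year, quarter, and month name are our own assumptions, chosen only for illustration.

```python
import calendar
from datetime import date, timedelta


def build_time_dimension(start: date, end: date) -> list[dict]:
    """Return one row per calendar day from start to end (inclusive).

    Hypothetical helper: the column names are illustrative and do not
    come from the episode, apart from year, quarter, and month name.
    """
    rows = []
    current = start
    while current <= end:
        rows.append(
            {
                "date_key": current.isoformat(),
                "year": current.year,
                # Months 1-3 map to quarter 1, 4-6 to quarter 2, and so on.
                "quarter": (current.month - 1) // 3 + 1,
                "month": current.month,
                "month_name": calendar.month_name[current.month],
                "day_of_week": calendar.day_name[current.weekday()],
            }
        )
        current += timedelta(days=1)
    return rows


if __name__ == "__main__":
    # Print a few rows to inspect the precomputed date components.
    for row in build_time_dimension(date(2026, 1, 30), date(2026, 2, 2)):
        print(row)
```

Precomputing these components means a question like "sales by quarter" can be answered with a plain join and a `GROUP BY` on `quarter`, rather than asking a natural-language-to-SQL layer to get date arithmetic right every time.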
