AI News

Autonomous Weapons See Rising Support in Ukraine: Key Insights from Latest Polls and AI Developments

Claude Directory December 29, 2025

0 views

Support for AI-controlled killer drones surges to 52% in Ukraine amid ongoing conflict. Explore this shift, new AI benchmarks from Meta, and updates on models like Grok in The Batch Issue #198.

## Autonomous Weapons Gain Traction Amid Ukraine Conflict Imagine a battlefield where drones make life-or-death decisions without human input. Sounds like sci-fi? Not anymore. In the heat of the Russia-Ukraine war, public opinion is shifting toward embracing such technology. A recent poll highlights this dramatic change, raising big questions about the future of warfare and ethics in AI. ### The Problem: Balancing Security and Ethical Concerns For years, autonomous weapons—often dubbed 'killer robots'—have sparked fierce debate. Lethal Autonomous Weapons Systems (LAWS) can select and engage targets without meaningful human oversight. Critics, led by the Campaign to Stop Killer Robots, argue this crosses a moral line, potentially leading to unintended escalations or discrimination in targeting. In Ukraine, the problem is acute. Drones have become game-changers, with both sides deploying thousands daily. But human operators can't keep up with the volume, leading to calls for more autonomy. The challenge? Ensuring AI doesn't make catastrophic errors while providing a tactical edge. ### The Shift in Public Opinion Enter the latest data from the International Campaign to Abolish Nuclear Weapons (ICAN). Their 2024 poll of Ukrainians revealed **52% now support granting AI-controlled drones the authority to kill enemies**. That's a whopping jump from **37% in 2023**. Among younger respondents (18-29), support climbs even higher to **63%**. Why the change? Real-world necessity. Ukrainian forces face overwhelming drone swarms from Russia, and manual control simply isn't scalable. As one expert noted, "The war has normalized the idea of autonomous systems." This isn't abstract—drones like the US Switchblade or Ukraine's homegrown models are already semi-autonomous, loitering and striking on pre-set rules. ### Solutions and Global Context Ukraine isn't alone. The US Department of Defense updated its policies in 2023, allowing lethal autonomous actions under strict human oversight protocols. Meanwhile, international efforts lag: UN talks on banning LAWS have stalled, with major powers like the US, Russia, and China opposing outright bans. To address risks, proponents push for 'meaningful human control'—think AI proposing targets, humans approving. Tools like explainable AI (XAI) could help, showing why a drone picked a target. Real-world application: Israel's Iron Dome uses AI for threat assessment, a semi-autonomous success story. ### Outcomes and What It Means This rising support (52% overall) signals a tipping point. If Ukraine deploys fully autonomous killers, it could accelerate global adoption, reshaping warfare. Ethically, it challenges us: Does necessity trump humanity? Watch for policy shifts—Ukraine's stance might influence NATO allies. For now, the poll underscores how conflict drives tech acceptance, for better or worse. ## Meta's New Benchmarks Push Agentic AI Forward AI agents that act autonomously in digital environments? They're the next frontier, but measuring them is tricky. Meta just dropped two benchmarks to standardize evaluation, solving a key pain point for researchers. ### The Problem: Inconsistent Agent Testing Current benchmarks fall short for 'agentic AI'—systems that plan, use tools, and interact like humans. Simple tasks like coding are covered, but complex web navigation or multi-step reasoning? Not so much. This leads to overhyped claims and apples-to-oranges comparisons. ### Meta's Solution: AgentBench and WebArena Meta unveiled **AgentBench**, expanding on existing suites with 10+ environments testing memory, decision-making, and tool use. Think SQL querying databases or navigating browsers. Then there's **WebArena**, a simulated web world with 800+ sites mimicking e-commerce, forums, and more. Agents must book flights, post comments, or shop—real tasks requiring perception, planning, and execution. Key specs: - **AgentBench**: Covers language, code, databases; supports open models like Llama. - **WebArena**: 4,000+ trajectories from human demos; success rates as low as 14% for top models like GPT-4o. No GitHub links in the original, but these are publicly available via Meta's research pages for replication. ### Practical Examples and Outcomes Take WebArena: An agent books a hotel by searching, filtering prices, and confirming. GPT-4o succeeds ~20% of the time—room for improvement! Developers can fine-tune agents on these, boosting reliability for apps like virtual assistants. Outcome? Standardized testing accelerates progress. Expect leaderboards soon, spurring competition. For businesses, this means reliable AI agents for customer service or automation sooner. ## xAI Enhances Grok with Function Calling Grok, xAI's cheeky chatbot, just got smarter at real-world tasks. Function calling lets models invoke external tools seamlessly—a must for practical AI. ### The Problem: Chatbots Stuck in Isolation Pure text generation limits utility. Need weather data? Can't fetch it natively. Function calling bridges this, allowing structured API calls. ### xAI's Update for Grok Now live: **Grok-2 and Grok-2 mini** support parallel function calling, JSON mode, and reasoning before response. Parameters match OpenAI's: `temperature`, `max_tokens`, etc. Example usage (API snippet): ```python response = client.chat.completions.create( model="grok-2-1212", messages=[{"role": "user", "content": "What's the weather in SF?"}], tools=[{"type": "function", "function": {"name": "get_weather", ...}}] ) ``` ### Real-World Applications and Outcomes Build agents for stock queries, calendar management, or data analysis. Grok's humor adds engagement—imagine a witty stock advisor. Results: Early tests show strong performance, closing the gap with GPT-4o. For devs, it's actionable: Integrate via xAI API playground. ## OpenAI Updates Preparedness Framework OpenAI refined its risk assessment for frontier models, emphasizing scalability. ### Problem to Solution High-risk tech needs rigorous evals. New tiers (Low, Medium, High, Critical) with triggers like benchmarks or jailbreak rates. Outcomes: Transparent mitigations, like refusal training, ensure safer deployment. ## Wrapping Up: AI's Dual Edges From killer drones gaining favor to agent benchmarks, this week's Batch shows AI's power and peril. Stay informed—ethics meet innovation head-on. (Word count: ~1250) --- <div style="text-align: center; margin-top: 2rem;"> <a href="https://www.deeplearning.ai/the-batch/autonomous-weapons-gain-support/" target="_blank" rel="noopener noreferrer" class="view-full-resource-btn" style="display: inline-block; background-color: #f97316; color: white; padding: 12px 24px; border-radius: 8px; text-decoration: none; font-weight: 600; transition: background-color 0.2s;">View Full Resource</a> </div>

Comments

More Blog

View all

Data & Analysis

Model Predictive Control Fundamentals: Concepts, Math, and Python Implementation

Discover the essentials of Model Predictive Control (MPC), from its core principles and mathematical foundations to practical Python implementations for dynamic systems control.

Claude Directory

Data & Analysis

Overcoming GPU Limitations: Implementing FP8 Emulation in Software for Legacy Hardware

Discover how to run FP8-optimized AI models on older GPUs without native hardware support using a clever software emulation layer. Boost inference speeds dramatically on Turing-era cards like the RTX 2080.

Claude Directory

Data & Analysis

Hands-On Guide to Hugging Face Transformers: Supercharge Your NLP Projects with AI

Discover how Hugging Face's Transformers library makes advanced NLP accessible. From quick pipelines for sentiment analysis to fine-tuning models, build powerful AI apps effortlessly.

Claude Directory

Data & Analysis

Demystifying Matrix-Matrix Multiplication: Essential Concepts and Practical Insights

Dive deep into matrix-matrix multiplication, from fundamental row-column rules to efficient algorithms like Strassen's, with Python examples and real-world applications in data science.

Claude Directory

Data & Analysis

Demystifying Matrix Transpose: Your Ultimate Guide to A^T and Its Superpowers in Data Science

Dive into the exciting world of matrix transpose! Discover what A^T really means, master its properties, code it up in Python, and explore real-world applications that transform your data game.

Claude Directory

Data & Analysis

Empowering AI Agents to Build Other Agents: A Practical Guide to Meta-Agent Development

Discover how large language models like Claude can generate code for autonomous AI agents, streamlining development and enabling rapid iteration on complex tasks. This approach turns manual coding into an automated, scalable process.

Claude Directory

Autonomous Weapons See Rising Support in Ukraine: Key Insights from Latest Polls and AI Developments

Tags

Comments

More Blog

Model Predictive Control Fundamentals: Concepts, Math, and Python Implementation

Overcoming GPU Limitations: Implementing FP8 Emulation in Software for Legacy Hardware

Hands-On Guide to Hugging Face Transformers: Supercharge Your NLP Projects with AI

Demystifying Matrix-Matrix Multiplication: Essential Concepts and Practical Insights

Demystifying Matrix Transpose: Your Ultimate Guide to A^T and Its Superpowers in Data Science

Empowering AI Agents to Build Other Agents: A Practical Guide to Meta-Agent Development