Aient: The Awakening of Agent.
# aient
[English](./README.md) | [Chinese](./README_CN.md)
aient is a library that simplifies and unifies access to different large language models, including gpt-4.1/5, o3, DALL-E 3, claude4, gemini-2.5-pro/flash, Vertex AI (Gemini), and Groq. It supports GPT-format function calls and ships with built-in Google search and URL summarization, so models can work with live web content.
## ✨ Features
- **Multi-model support**: Integrates a range of current large language models.
- **Real-time interaction**: Supports streaming queries and retrieves model responses in real time.
- **Function expansion**: Built-in function calling makes it easy to extend what the model can do. Current plugins include DuckDuckGo and Google search, content summarization, DALL-E 3 drawing, arXiv paper summaries, current time, code interpreter, and more.
- **Simple interface**: Provides a concise, unified API that makes the models easy to call and manage.
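
For readers unfamiliar with GPT-format function calls, the sketch below shows what a tool definition in that format generally looks like and how a returned call can be dispatched to a Python function. It is a minimal illustration under stated assumptions, not aient's plugin API: `get_current_time`, `TOOL_SCHEMA`, and `dispatch` are hypothetical names introduced here.

```python
import json
from datetime import datetime, timezone


def get_current_time() -> str:
    """Hypothetical plugin: return the current UTC time as an ISO 8601 string."""
    return datetime.now(timezone.utc).isoformat()


# Tool description in the GPT (OpenAI-style) function-calling format.
# The schema tells the model the function's name, purpose, and parameters.
TOOL_SCHEMA = {
    "type": "function",
    "function": {
        "name": "get_current_time",
        "description": "Return the current UTC time in ISO 8601 format.",
        "parameters": {"type": "object", "properties": {}, "required": []},
    },
}


def dispatch(name: str, arguments: str, registry: dict):
    """Call the registered function `name` with JSON-encoded `arguments`."""
    kwargs = json.loads(arguments) if arguments else {}
    return registry[name](**kwargs)


# Hypothetical registry mapping tool names to Python callables.
registry = {"get_current_time": get_current_time}

# A model response would supply the name and a JSON argument string.
result = dispatch("get_current_time", "{}", registry)
print(result)
```

The model only ever sees the schema; the calling code is responsible for running the function and returning its result to the model.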
## Quick Start
The following is a guide on how to quickly integrate and use aient in your Python project.
### Install
Install aient with pip:
```bash
pip install aient
```
### Usage example
The following example uses aient to query the gpt-4o model, first as a single response and then as a real-time stream:
```python
from aient import chatgpt

# Initialize the model with an API key and the selected engine
bot = chatgpt(api_key="{YOUR_API_KEY}", engine="gpt-4o")

# Get the complete response in a single call
result = bot.ask("python list use")

# Send the request and print the streaming response as it arrives
for text in bot.ask_stream("python list use"):
    print(text, end="")

# Disable all plugins
bot = chatgpt(api_key="{YOUR_API_KEY}", engine="gpt-4o", use_plugins=False)
```
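
When streaming, it is often useful to both display the chunks as they arrive and keep the full text afterwards. The sketch below shows one way to do that. To stay runnable without an API key it uses a stand-in generator, `fake_ask_stream`, in place of `bot.ask_stream`; that generator and the `collect_stream` helper are hypothetical and not part of aient.

```python
from typing import Iterable, Iterator


def collect_stream(chunks: Iterable[str]) -> str:
    """Print each chunk as it arrives and return the concatenated text."""
    parts = []
    for chunk in chunks:
        print(chunk, end="", flush=True)
        parts.append(chunk)
    print()  # finish the line once the stream ends
    return "".join(parts)


def fake_ask_stream(prompt: str) -> Iterator[str]:
    """Stand-in for bot.ask_stream(prompt): yields text chunks one at a time."""
    yield from ["Lists ", "are ", "mutable."]


full_text = collect_stream(fake_ask_stream("python list use"))
```

With a real client, passing `bot.ask_stream("python list use")` to `collect_stream` should behave the same way, assuming the stream yields plain strings as in the example above.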
## 🍃 Environment Variables
The following is a list of environment variables related to plugin settings:
| Variable Name | Description | Required? |