Loading...
Loading...

Edgee is an AI Gateway designed to optimize and manage LLM traffic at the edge. Its standout feature is edge-native token compression, which reduces the size of prompts before they reach providers like OpenAI or Anthropic, cutting costs by up to 50% without losing the user's intent. It provides a single, OpenAI-compatible API to access over 200 models, offering intelligent routing, fallbacks, real-time observability, and cost governance. Developers can use it to track spending by feature or team, host private models, and invoke edge tools to lower latency and improve reliability in production AI applications.
## How to Use
Developers can integrate Edgee by using its OpenAI-compatible API or SDKs available for TypeScript, Python, Go, and Rust. Simply replace your existing LLM provider endpoint with Edgee's gateway, use your API key, and call models via the edgee.send method. You can also configure routing policies and tags for cost tracking directly in the request metadata.
## Key Features
- Token Compression (reduces prompt size up to 50%) - OpenAI-compatible API for 200+ models - Intelligent Routing, Fallbacks, and Retries - Cost Governance with custom tags and spend alerts - Edge-native Observability (latency, usage, and error tracking) - Private Model Hosting and Edge Tools
## Use Cases
- Reducing token costs for long-context RAG pipelines - Implementing multi-provider redundancy to prevent downtime - Tracking AI expenditure across different teams or features using metadata tags - Running small models at the edge for request classification or redaction
Twilio
Cloud communications APIs
Weaviate
Open-source vector database
gpt-researcher
An autonomous agent that conducts deep research on any data using any LLM providers
Modal
Serverless cloud for AI
Cohere
Enterprise NLP and RAG APIs
trigger.dev
Trigger.dev – build and deploy fully‑managed AI agents and workflows