[https://gist.github.com/jasonkneen/4c065df2d7a95610e4fd30c3e3398b17](https://gist.github.com/jasonkneen/4c065df2d7a95610e4fd30c3e3398b17)
Anyone else here doing full-stack Next.js in Cursor and watching the Claude quota evaporate before lunch? I used to be in the same boat: massive context windows from all the components, pages, and DB logic would smoke the default limits fast. Not anymore. I’ve been on this setup for weeks and basically never hit a wall while still getting top-tier answers. Here’s exactly what I do:

**1. .cursorrules is non-negotiable**

I keep one in the root of every project. The key line I added: “Never explain the code to me. Just output the code blocks.” That single rule saves me thousands of output tokens a day. No more walls of “here’s what I changed and why”, just the goods.

**2. Stopped using Cursor’s built-in Claude quota**

I killed the default Cursor Pro subscription for the heavy stuff. Instead I use my own API keys and point Cursor’s “OpenAI Compatible” base URL at LLM Router Gateway. In the [llmrouter](https://llmrouter.app/) routing settings I set up simple tag-based routing like this:

* **UI & CSS tweaks**: gemini-3.1-flash → gpt-5.4-mini
* **Deep backend / complex logic**: claude-opus-4.6 → deepseek-v3.2
* **General / quick questions**: llama-4-scout

I sorted the fallback chains by speed vs intelligence. The router auto-detects the query type, so 90% of my UI polish and small fixes go to Gemini (basically free + huge context). I only actually hit Claude Opus 4.6 when I’m doing nasty database refactors or tricky architecture stuff. My Anthropic bill dropped \~70% overnight.

**3. Cmd+K for everything small**

Don’t open the full chat sidebar just to rename a variable or extract a component. Highlight the code, hit Cmd+K, and let a fast model handle the inline edit. Saves a ton of tokens and feels way snappier.

That’s it. Super simple, but it completely changed how much I can actually use Cursor in a day.

How are you all managing the limits? Using a Cursor Team? Or did you build your own router hacks too? Drop your setups, always looking to steal better ideas.
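
For anyone who wants the routing idea from step 2 spelled out, here is a rough Python sketch of tag-based routing with fallback chains. It is not how LLM Router actually works internally (the post doesn't say). The model names and chains are copied from the list above, while `KEYWORDS`, `classify`, `route`, and the `call_model` callback are hypothetical names made up for illustration, and the keyword matching is a deliberately naive stand-in for whatever query detection the real router does.

```python
from typing import Callable

# Fallback chains from the post: first entry is the primary model, the rest are fallbacks.
ROUTES: dict[str, list[str]] = {
    "ui": ["gemini-3.1-flash", "gpt-5.4-mini"],
    "backend": ["claude-opus-4.6", "deepseek-v3.2"],
    "general": ["llama-4-scout"],
}

# Hypothetical keyword lists (an assumption, not the real router's detection logic).
KEYWORDS: dict[str, tuple[str, ...]] = {
    "ui": ("css", "tailwind", "padding", "button", "layout", "color"),
    "backend": ("database", "migration", "schema", "sql", "refactor", "architecture"),
}


def classify(prompt: str) -> str:
    """Return the first tag whose keywords appear in the prompt, else 'general'."""
    text = prompt.lower()
    for tag, words in KEYWORDS.items():
        if any(word in text for word in words):
            return tag
    return "general"


def route(prompt: str, call_model: Callable[[str, str], str]) -> tuple[str, str]:
    """Try each model in the tag's chain in order and return (model, reply)
    from the first call that does not raise."""
    errors = []
    for model in ROUTES[classify(prompt)]:
        try:
            return model, call_model(model, prompt)
        except Exception as exc:  # e.g. rate limit or exhausted quota
            errors.append(f"{model}: {exc}")
    raise RuntimeError("all models failed: " + "; ".join(errors))
```

`call_model(model, prompt)` stands in for whatever client you use to hit an OpenAI-compatible endpoint. Keeping it as a callback means the routing logic can be tried out with a fake function before wiring in real API keys.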
We’re introducing Cursor 3. It is simpler, more powerful, and built for a world where all code is written by agents, while keeping the depth of a development environment. With the new Cursor, you can run as many agents as you want, everywhere you want: locally, in a worktree, over remote SSH, and in the cloud. And it has the best parts of the editor available when you need them. The new interface is available as a separate window that complements the IDE. Update Cursor to try it. We recently launched Composer 2, a frontier model with high limits. Then, with cloud, we gave agents their own computers so they can work truly autonomously. And now with Cursor 3, we’re releasing a new interface to collaborate with agents on software.
Cursor 3 out now
They kinda cute
Anyone got any workarounds to this? Got the meme from [ijustvibecodedthis.com](http://ijustvibecodedthis.com) (the AI coding newsletter thingy)