[https://gist.github.com/jasonkneen/4c065df2d7a95610e4fd30c3e3398b17](https://gist.github.com/jasonkneen/4c065df2d7a95610e4fd30c3e3398b17)
Anyone else here doing full-stack Next.js in Cursor and watching the Claude quota evaporate before lunch? I used to be in the same boat — massive context windows from all the components, pages, and DB logic would smoke the default limits fast. Not anymore. I’ve been on this setup for weeks and basically never hit a wall while still getting top-tier answers. Here’s exactly what I do: **1. .cursorrules is non-negotiable** I keep one in the root of every project. The key line I added: “Never explain the code to me. Just output the code blocks.” That single rule saves me thousands of output tokens a day. No more walls of “here’s what I changed and why” — just the goods. **2. Stopped using Cursor’s built-in Claude quota** I killed the default Cursor Pro subscription for the heavy stuff. Instead I use my own API keys and point Cursor’s “OpenAI Compatible” base URL at LLM Router Gateway. Inside [llmrouter](https://llmrouter.app/) routing settings I set up simple tags routing like this: * **UI & CSS tweaks**: gemini-3.1-flash → gpt-5.4-mini * **Deep backend / complex logic**: claude-opus-4.6 → deepseek-v3.2 * **General / quick questions**: llama-4-scout I sorted the fallback chains by speed vs intelligence. The router auto-detects the query type, so 90% of my UI polish and small fixes go to Gemini (basically free + huge context). I only actually hit Claude Opus 4.6 when I’m doing nasty database refactors or tricky architecture stuff. My Anthropic bill dropped \~70% overnight. **3. Cmd+K for everything small** Don’t open the full chat sidebar just to rename a variable or extract a component. Highlight the code, hit Cmd+K, let a fast model handle the inline edit. Saves a ton of tokens and feels way snappier. That’s it. Super simple but it completely changed how much I can actually use Cursor in a day. How are you all managing the limits? Using a Cursor Team? Or did you build your own router hacks too? Drop your setups — always looking to steal better ideas.
So I used to be a big cursor fan - striked the right balance between trusting the AI and being able to check the work. The new version however, sucks. Worktrees are now essentially AI initiated magic git commands that require manual approval, which is slow and annoying, and means I have to read through a bunch of git guff, and I have to wait like a minute before it does any real work for it to essentially do something that can be duplicated with a script, and it doesn't work well half the time - it had to retry twice to apply worktree changes! The new agent screen is pretty useless, because now you need to go and select a bunch of information that previously would be entirely obvious from context i.e. alt-tabbing into a specific repo's IDE. I don't like this direction and if they continue, I might start looking at competitors. Cursor is clearly aimed at and used by developers who can read code and want to read the code produced by AI. I use Claude Code if I want to just trust the AI.
We’re introducing Cursor 3. It is simpler, more powerful, and built for a world where all code is written by agents, while keeping the depth of a development environment. With the new Cursor, you can run as many agents as you want, everywhere you want: locally, in a worktree, on remote ssh, and in the cloud. And it has the best parts of the editor available when you need them. The new interface is available as a separate window that complements the IDE. Update Cursor to try it. We recently launched Composer 2, a frontier model with high limits. Then, with cloud, we gave agents their own computers so they can work truly autonomously. And now with Cursor 3, we’re releasing a new interface to collaborate with agents on software.
They kinda cute
Anyone got any workarounds to this? Got the meme from [ijustvibecodedthis.com](http://ijustvibecodedthis.com) (the AI coding newsletter thingy)