The AI agent used to be the star of every demo.
Now it's on the shutdown list. Not because the model got worse.
The most valuable asset in your AI program is in none of the quotes you ever signed.
A demo is a showroom. Good light, everything polis...
Build your AI skill once with your best model. Then run it on a model that costs a tenth as much until the next flagship ships. The output will not drop.
That sounds like a downgrade. It is not. It fixes the two things that make AI agents painful ri...
Some of the rules in your CLAUDE.md should not be rules at all.
Not because they are wrong. Because you have written the things you cannot afford to lose into a file the model reads once and then slowly forgets. A preference survives that. A constra...
I changed one model string in ten cron jobs last night. 4.7 to 4.8. Then I went to bed.
The benchmark threads can wait until morning. My agents can't. They fire whether I'm awake or not: a briefing at 6:30, follow-up drafts at 11:45, a sync at 4 AM ...
You added a skill last Tuesday. The agent hasn't called it once. Each new skill silently weakens the discovery odds of the ones you already have.
You assume it's a description problem. It isn't.
Everyone's pushing past 50 skills now. Vercel ships a...
TL;DR — why browserground, not the other 2B grounding models
You already know the hybrid-AI argument: don't pay frontier-vision rates for "where is the button?" There are three good 2B specialists for that job — UI-TARS, ShowUI, browserground. Here...
The first stage of AI work is prompting.
The last stage is removing the model from most of the workflow.
That sounds backwards.
It is not.
When a workflow is new, the LLM is useful because the work is still ambiguous. You are discovering what goo...
Forrestchang's andrej-karpathy-skillshttps://github.com/forrestchang/andrej-karpathy-skills CLAUDE.md is four rules aimed at the moment Claude is writing code. They work. What they don't cover is the moment Claude is running. Once a Claude-driven pip...
Everyone's sharing their skill libraries right now. "Here are my 20 custom slash commands." "Check out my prompt template collection." "This skill saves me 2 hours a day."
I use skills too. I have about a dozen. They handle cover letters, content pi...