I think the emerging best way is to do "agentic search" over files. If you think...

nl · 2026-01-15T04:45:10 1768452310

Gemini 3 Flash is very good at the search task (it benchmarks quite close to 3 Pro in coding tasks but is much faster). I believe Amp switch to Gemini Flash for their search agent because it is better.

esperent · 2026-01-15T09:03:03 1768467783

I very much doubt this. I've been using gemini whenever I get hit by codex limits over the past week. 3 pro is very good - a bit behind codex but still very useful. I've tried 3 flash several times and each time what I got back was complete garbage. After the third or fourth attempt I stopped trying.

theshrike79 · 2026-01-15T08:26:28 1768465588

Gemini the model is good, Gemini the framework around the model is a steaming pile of crap.

woggy · 2026-01-15T08:53:36 1768467216

Yup. Gemini CLI needs a lot of work.

esperent · 2026-01-15T09:05:08 1768467908

I actually defended Gemini CLI when someone said this a few days ago. Then Murphy's law hit me with bug after bug, all of which have been reported many times already going back over year or more. I keep having to totally clear out the ~/.gemini folder and then it works again for a while.

theshrike79 · 2026-01-15T13:07:23 1768482443

I'm pretty sure Anthropic is the only company that actually dogfoods their CLI tool.

Gemini seems like an afterthought an intern did, Codex has cool features but none of them align with real-world need.

theshrike79 · 2026-01-15T08:25:51 1768465551

You can also create per-project agents with specific "expertise" in different parts of the code.

Basically they're just few kilobytes of text that's given as extra context to "explore" agents when looking at specific parts of the code.

atombender · 2026-01-16T16:04:25 1768579465

How does one do this?

I have trouble making Claude even follow CLAUDE.md. Like, it just ignores it, and when it needs to do a task (like testing or running a certain command native to the code base), it does a lot of "find" and "ls" an so on to understand what to run, and frequently runs commands with the wrong flags, badly escaped input etc.

I think Claude could work a lot better (and waste vastly fewer tokens) if it had small bite-size instructions for various tasks ("to run the integration test suite, run "cargo run integration_tests..."), but so far I've been unable to control it.