The previously sensible advice to never use ChatGPT for search needs to be rethought - GPT-5 in thinking mode is shockingly good at running searches now https://simonwillison.net/2025/Sep/6/research-goblin/
Post
The previously sensible advice to never use ChatGPT for search needs to be rethought - GPT-5 in thinking mode is shockingly good at running searches now https://simonwillison.net/2025/Sep/6/research-goblin/
Follow-up note about a Google's new "AI mode" - it's actually very good! Massively different from "AI overviews" which are terrible https://simonwillison.net/2025/Sep/7/ai-mode/
I used the "dive deeper in ai mode" button on a question about jq. Two quick prompts later I had the right answer and a concise explanation.
Google's did well on this little task. It reminded me of how Google Search made me feel smarter back around 2000 or so.
I should try the same thing with other llms for comparison, but probably won't.
I tried these simple text searches:
- "wikipedia use of britannica", the link to the relevant wikipedia page is the 6th link
- "building in reading", the link to the relevant wikipedia page is again the 6th link
so for those, using chatGPT seems like a net negative
I'm just staying as far from AI as possible, on ecological & ethical grounds. AI & crypto are energy hogs run by bad actor, and I'm already implicated in too many horrific sins just feeding & clothing myself
The model from 2024 was “OpenAI’s ChatGPT-4”: https://www.newsguardtech.com/special-reports/generative-ai-models-mimic-russian-disinformation-cite-fake-news/
The main insight is that the reproduction of falsehoods worsened in a year (GPT-4 → GPT-5). Certainly, since GPT-5 *is better* at search, the results *should have improved!* Perhaps the culprit is an overall increase in misinformation on the web. 😞
@com I'm having real trouble with this. ChatGPT-4 isn't a model - did they mean 4o (launched in May 2024) or are they mixing results from both GPT-4 and GPT-4o?
GPT-5 has been out for a month, is their 2025 audit entirely from that time period or does it include the first six months of 2025 against other models?
I just ran one of their prompts though GPT-5 Thinking and for what looked to me like an impressive result https://chatgpt.com/share/68bcad53-e9ac-8006-9945-969eef306fd3
A space for Bonfire maintainers and contributors to communicate