Tag: physics
-
Antimatter Sunflower Seeds
In the world of new LLMs, Anthropic recently released Claude Sonnet 4.5, claiming improved reasoning ability compared to Claude Sonnet 4. I compared the reasoning abilities of Sonnet 4 and Sonnet 4.5 with an absurd physics problem involving antimatter sunflower seeds and orbital mechanics. I also gave the same problem to OpenAI GPT-5 for comparison.…
-
LLM Eye Blink Test
Introduction This is a comparison of OpenAI GPT-5 vs. Anthropic Claude Sonnet 4. These are comparably priced commercial LLMs and are available in reasoning and non-reasoning varieties. The prompt On Tuesday morning at 7:00 a.m., Jane Wilkenson of Akron, Ohio woke up and blinked. Her eye blink generated a gravitational wave. Calculate the strain h…
-
Flashlight LLM Test
Introduction OpenAI recently released GPT-5 with mixed reviews. Here is my first comparison test between Anthropic Sonnet 4 and OpenAI GPT-5. The models are comparably priced per token. All tests were through their respective APIs. Test prompt Imagine that every person on Earth was given a common typical flashlight. Imagine that on a Tuesday at…