Tag: physics

  • Antimatter Sunflower Seeds

    Antimatter Sunflower Seeds

    by

    in ,

    In the world of new LLMs, Anthropic recently released Claude Sonnet 4.5, claiming improved reasoning ability compared to Claude Sonnet 4. I compared the reasoning abilities of Sonnet 4 and Sonnet 4.5 with an absurd physics problem involving antimatter sunflower seeds and orbital mechanics. I also gave the same problem to OpenAI GPT-5 for comparison.…

  • LLM Eye Blink Test

    LLM Eye Blink Test

    Introduction This is a comparison of OpenAI GPT-5 vs. Anthropic Claude Sonnet 4. These are comparably priced commercial LLMs and are available in reasoning and non-reasoning varieties. The prompt On Tuesday morning at 7:00 a.m., Jane Wilkenson of Akron, Ohio woke up and blinked. Her eye blink generated a gravitational wave. Calculate the strain h…

  • Flashlight LLM Test

    Flashlight LLM Test

    Introduction OpenAI recently released GPT-5 with mixed reviews. Here is my first comparison test between Anthropic Sonnet 4 and OpenAI GPT-5. The models are comparably priced per token. All tests were through their respective APIs. Test prompt Imagine that every person on Earth was given a common typical flashlight. Imagine that on a Tuesday at…