Flashlight LLM Test

Introduction

OpenAI recently released GPT-5 with mixed reviews. Here is my first comparison test between Anthropic Sonnet 4 and OpenAI GPT-5. The models are comparably priced per token. All tests were through their respective APIs.

Test prompt

Imagine that every person on Earth was given a common typical flashlight. Imagine that on a Tuesday at 9 a.m., every person aimed their flashlight east, parallel to the ground, turned on their flashlight, and left it on for 24 hours. How much would that change the rotational period of the Earth after those 24 hours? Show your calculations.

Click here to see the full PDF report.


Comments

Leave a Reply

Your email address will not be published. Required fields are marked *