How to Benchmark GPT-4 vs. Claude 3 in Under 5 Minutes with a Single API Call