Is Model Speed Killing Your User Experience? A Guide to Latency Benchmarking