Integrating AI Benchmarks into Your CI/CD Pipeline for Robust Deployments