Flawed AI Benchmarks: A Risk for Enterprises