The nonprofit Center for AI Safety and Scale AI have released a challenging new benchmark for frontier AI systems.