016_helm_capabilities
We released HELM Capabilities, the latest flagship benchmark in the HELM suite that evaluates models capability-by-capability. 🔍
Enjoy Reading This Article?
Here are some more articles you might like to read next:
We released HELM Capabilities, the latest flagship benchmark in the HELM suite that evaluates models capability-by-capability. 🔍
Here are some more articles you might like to read next: