016_helm_capabilities

We released HELM Capabilities, the latest flagship benchmark in the HELM suite. It evaluates models capability by capability. 🔍



