MATH-500

Math benchmark of 500 competition-level problems that stress symbolic reasoning and multi-step problem solving.

MATH-500 measures a model's ability to solve difficult mathematics problems in a controlled exam format. [1]

  References

  1. vals.ai