Command Palette

Search for a command to run...

HumanEval

Standard code generation test suite for Python functions with unit tests assessing functional correctness.

HumanEval measures whether generated Python functions satisfy given docstrings and pass rigorous unit tests.1

  References

  References

  1. arxiv.org