LiveCodeBench

Real-time coding benchmark capturing interactive programming workflows with continuous evaluation.

LiveCodeBench scores models on dynamic code tasks that require incremental updates and debugging in a live environment. [1]

  References

  1. sky.cs.berkeley.edu