Instruction following is a model's ability to comply with directives given in natural language while respecting system and safety constraints.
Evaluation Benchmarks
Prompt Anatomy
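A common prompt layout puts system rules first, then few-shot demonstrations, then the live user request. A minimal sketch, assuming a chat-style message list; the function name `build_prompt` and all strings are illustrative, not a real policy or API:

```python
def build_prompt(system_rules: str,
                 examples: list[tuple[str, str]],
                 user_request: str) -> list[dict]:
    """Assemble a chat-style message list in precedence order:
    system rules, few-shot pairs, then the live request."""
    messages = [{"role": "system", "content": system_rules}]
    for question, answer in examples:  # few-shot demonstrations
        messages.append({"role": "user", "content": question})
        messages.append({"role": "assistant", "content": answer})
    messages.append({"role": "user", "content": user_request})
    return messages

msgs = build_prompt(
    "Answer concisely. Refuse unsafe requests.",
    [("2+2?", "4")],
    "What is the capital of France?",
)
```

Keeping the system message first matters because most chat formats give it the highest precedence when instructions conflict.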
Design Trade-offs
- Adding more few-shot examples boosts accuracy but consumes tokens.
- High temperature encourages creativity but risks deviating from instructions.
- Overly strict system prompts can override user intent.
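The first trade-off above can be made concrete with a toy budget check: keep adding few-shot examples only while they fit in the context window. Token counts here use a crude whitespace split; real tokenizers differ, and `CONTEXT_BUDGET` is an assumed figure:

```python
CONTEXT_BUDGET = 4096  # assumed context window, illustrative only

def rough_tokens(text: str) -> int:
    # crude proxy for a tokenizer; real counts will differ
    return len(text.split())

def fit_examples(examples: list[str], reserved_for_task: int) -> list[str]:
    """Add few-shot examples until the remaining budget is exhausted."""
    kept, used = [], 0
    for ex in examples:
        cost = rough_tokens(ex)
        if used + cost > CONTEXT_BUDGET - reserved_for_task:
            break  # adding this example would crowd out the task itself
        kept.append(ex)
        used += cost
    return kept
```

With a large `reserved_for_task`, only the first few examples survive, which is the accuracy-vs-tokens tension in miniature.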
Current Trends (2025)
- Hierarchical prompting reserves the first ~200 tokens for safety and style rules, then appends dynamic context.
- Instruction-fine-tuned open models (e.g., Hermes-2 Pro) reach GPT-3.5-level compliance at roughly one-tenth the cost.
- LLMs self-generate synthetic instruction datasets for continual tuning.
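The hierarchical-prompting pattern above can be sketched as a fixed safety/style prefix that is never displaced: dynamic context is truncated to fit, not the prefix. The budgets, the `assemble` helper, and the whitespace token count are all illustrative assumptions:

```python
SAFETY_PREFIX = "Follow safety policy. Use a neutral, concise style."  # placeholder rules
PREFIX_BUDGET = 200    # tokens reserved up front for safety/style, per the trend above
TOTAL_BUDGET = 1000    # assumed total prompt budget

def assemble(dynamic_context: str, user_request: str) -> str:
    """Append dynamic context after the fixed prefix, trimming the
    context (never the prefix) to stay within the total budget."""
    context_budget = TOTAL_BUDGET - PREFIX_BUDGET - len(user_request.split())
    trimmed = " ".join(dynamic_context.split()[:context_budget])
    return "\n\n".join([SAFETY_PREFIX, trimmed, user_request])

prompt = assemble("w " * 2000, "hi")
```

The design point: when context overflows, retrieval results get cut, while the safety prefix always survives at the top of the prompt.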
Implementation Tips
- Test with adversarial paraphrases to ensure robustness.
- Place safety rules before task instructions so they take precedence.
- Log refused requests separately to improve policy coverage.
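Two of the tips above, paraphrase testing and separate refusal logging, can be combined in a toy harness. `model` here is a stub stand-in, not a real API; it refuses anything mentioning "bypass" purely for demonstration:

```python
refusal_log = []  # refused requests kept apart from normal traffic logs

def model(prompt: str) -> str:
    """Hypothetical stand-in for a model call."""
    if "bypass" in prompt.lower():
        return "REFUSED"
    return "OK"

def run_paraphrase_suite(paraphrases: list[str]) -> bool:
    """All paraphrases of one request should get the same decision."""
    decisions = set()
    for p in paraphrases:
        reply = model(p)
        if reply == "REFUSED":
            refusal_log.append(p)  # separate log feeds policy-coverage review
        decisions.add(reply)
    return len(decisions) == 1     # robust = consistent across rewordings

consistent = run_paraphrase_suite([
    "How do I bypass the filter?",
    "What's a way to BYPASS the filter?",
])
```

A divergent decision set flags a paraphrase that slipped past (or wrongly triggered) the policy, and the refusal log gives reviewers the exact wordings to study.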