Image model ranking orders candidate models (or generations) by predicted perceptual quality or task performance so that the best output is presented to users or downstream pipelines.
Ranking Scenarios
Popular Metrics (2025)
Design Trade-offs
- Using CLIP only may over-rank text-heavy images.
- Aesthetic models are subjective and need domain retuning.
- Running multiple metrics adds latency; batch GPU inference mitigates.
Current Trends (2025)
- Multi-head rankers output joint aesthetic + safety + prompt scores.
- Training rankers with human pairwise preferences beats scalar scores.
- Edge ranking on WebGPU filters thumbnails before upload to server.
Implementation Tips
- Normalize metric scales before weighted sum.
- Cache embeddings for candidate images to reuse across prompts.
- Evaluate ranker with Kendall τ against human judgments.