PostTrainBench: How Far Can AI Agents Go in Automating LLM Post-Training?
Introduction
Post-training is where the real cost of LLM development lives. Taking a pretrained base model and turning it into something actually useful - an assistant that follows instructions, reasons carefully, and behaves safely - requires months of supervised fine-tuning, reward modeling, and alignment work from teams of skilled ML