Update non-thinking chat template

#15
by joaogante - opened

Removes <|channel>thought\n<channel|> from the non-thinking chat template.

Benchmarks ran locally using the HF implementation, in non-thinking mode

BEFORE

Benchmark    | Accuracy | Tokens per Forward
GPQA Diamond | 63.40	| 17.30
AIME2025     | 48.70    | 18.80
IFEval       | 85.40	| 9.40
HumanEval    | 93.30    | 24.50
GSM8K        | 94.40    | 20.30
MMLU         | 85.20    | 5.70

AFTER

Benchmark    | Accuracy | Tokens per Forward
GPQA Diamond | 64.90    | 17.70
AIME2025     | 50.00    | 19.50
IFEval       | 85.50    | 9.90
HumanEval    | 92.00    | 29.90
GSM8K        | 94.50    | 22.00
MMLU         | 86.70    | 11.70
Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment