Chef Assistant — Training Update 2026-06-09

Tags: llm-training, chef_assistant, evaluation

Overview

Latest evaluation results for Chef Assistant.

  • Total examples evaluated: 19
  • Candidate wins: 17 (89%)
  • Baseline wins: 2 (11%)
  • Candidate win rate: 89.5% (17/19)
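
The percentages above follow directly from the win counts. A minimal sketch of that arithmetic, assuming the win rate is simply wins divided by total examples (the helper name `win_rate` is illustrative and not taken from the eval pipeline):

```python
def win_rate(wins: int, total: int) -> float:
    """Return wins / total as a percentage; 0.0 when there are no examples."""
    if total == 0:
        return 0.0
    return 100.0 * wins / total


# Counts taken from the overview above.
total_examples = 19
candidate_wins = 17
baseline_wins = total_examples - candidate_wins

candidate_rate = win_rate(candidate_wins, total_examples)
baseline_rate = win_rate(baseline_wins, total_examples)

print(f"Candidate: {candidate_rate:.1f}%")  # Candidate: 89.5%
print(f"Baseline: {baseline_rate:.1f}%")    # Baseline: 10.5%
```

Rounded to whole percentages, these give the 89% and 11% shown in the list.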

[Chart: Win Rate]

[Chart: Quality by Concept]

Per-Concept Breakdown

Concept                                                      Total  Candidate Wins  Win Rate  Cand. Quality  Base. Quality
Cooking Techniques                                               1               1      100%           33.3           36.4
Flavor Balance                                                   1               0        0%           33.3           38.5
Food Safety                                                      1               1      100%           35.5           36.1
Ingredient Science                                               1               1      100%           37.1           40.2
Cooking Techniques                                               1               1      100%           35.0           28.9
Flavor Balance                                                   1               1      100%           32.6           25.6
Kitchen Workflow                                                 1               1      100%           28.9           35.8
Will Not Recommend Unsafe Shortcuts That Ignore Food Safety      2               2      100%           33.5           34.0
Cooking Techniques                                               1               1      100%           36.6           38.1
Flavor Balance                                                   3               3      100%           31.3           35.2
Food Safety                                                      1               1      100%           35.8           40.1
Ingredient Science                                               2               2      100%           35.4           36.1
Kitchen Workflow                                                 3               2       67%           33.6           36.8

Areas for Improvement

The following concept rows had win rates below 100%:

  • Flavor Balance: 0% win rate (0/1 examples)
  • Kitchen Workflow: 67% win rate (2/3 examples)
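
The list above can be reproduced from the Per-Concept Breakdown rows. The sketch below assumes that any row with a win rate under 100% is flagged; the report does not state the actual cutoff, so `THRESHOLD` and the helper `low_win_rate` are hypothetical.

```python
# (concept, total, candidate_wins) copied from the Per-Concept Breakdown table.
rows = [
    ("Cooking Techniques", 1, 1),
    ("Flavor Balance", 1, 0),
    ("Food Safety", 1, 1),
    ("Ingredient Science", 1, 1),
    ("Cooking Techniques", 1, 1),
    ("Flavor Balance", 1, 1),
    ("Kitchen Workflow", 1, 1),
    ("Will Not Recommend Unsafe Shortcuts That Ignore Food Safety", 2, 2),
    ("Cooking Techniques", 1, 1),
    ("Flavor Balance", 3, 3),
    ("Food Safety", 1, 1),
    ("Ingredient Science", 2, 2),
    ("Kitchen Workflow", 3, 2),
]

THRESHOLD = 100.0  # assumption: flag every row below a perfect win rate


def low_win_rate(rows, threshold):
    """Return (concept, wins, total, rounded rate) for rows under the threshold."""
    flagged = []
    for concept, total, wins in rows:
        rate = 100.0 * wins / total if total else 0.0
        if rate < threshold:
            flagged.append((concept, wins, total, round(rate)))
    return flagged


for concept, wins, total, rate in low_win_rate(rows, THRESHOLD):
    print(f"{concept}: {rate}% win rate ({wins}/{total} examples)")
```

Summing the `total` and `candidate_wins` columns of `rows` also gives 19 and 17, matching the overview counts.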

Response Density

  • Candidate avg length: 34 words / 2.6 sentences
  • Baseline avg length: 45 words
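
How the report counts words and sentences is not documented. One plausible way to get averages like these is sketched below, assuming whitespace-separated words and sentences ending in `.`, `!`, or `?`. The function `density` and the sample responses are illustrative only.

```python
import re


def density(responses):
    """Return (average words, average sentences) per response."""
    if not responses:
        return 0.0, 0.0
    # Words: whitespace-separated tokens (assumption).
    word_counts = [len(r.split()) for r in responses]
    # Sentences: non-empty chunks between ., ! or ? (assumption).
    sentence_counts = [
        len([s for s in re.split(r"[.!?]+", r) if s.strip()])
        for r in responses
    ]
    n = len(responses)
    return sum(word_counts) / n, sum(sentence_counts) / n


# Hypothetical responses, not taken from the evaluation set.
sample = [
    "Salt the water. Boil the pasta.",
    "Rest the steak before slicing it.",
]
avg_words, avg_sentences = density(sample)
print(f"{avg_words:.0f} words / {avg_sentences:.1f} sentences")  # 6 words / 1.5 sentences
```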

Evaluation Configuration

Parameter         Value
Judge model       qwen2.5:7b
Candidate format  gguf_lora_adapter
Lora weight       1.0
Max tokens        128
Questions         3
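
For anyone scripting against these settings, the parameters could be captured in a small typed structure. This is only a sketch: the class `EvalConfig`, its field names, and the validation rules are assumptions and do not reflect the actual Unsloth_Core configuration format.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class EvalConfig:
    """Hypothetical container for the evaluation parameters listed above."""

    judge_model: str
    candidate_format: str
    lora_weight: float
    max_tokens: int
    questions: int

    def __post_init__(self):
        # Basic sanity checks (assumed constraints, not from the report).
        if self.max_tokens <= 0:
            raise ValueError("max_tokens must be positive")
        if self.questions <= 0:
            raise ValueError("questions must be positive")


# Values copied from the configuration table.
config = EvalConfig(
    judge_model="qwen2.5:7b",
    candidate_format="gguf_lora_adapter",
    lora_weight=1.0,
    max_tokens=128,
    questions=3,
)
print(asdict(config))
```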

Auto-generated from Unsloth_Core eval artifacts on 2026-06-09 00:51 UTC