Posts by namakoo

@namakoo

池澤允彦

Building IDFU — verified-code DPO training data for LLM fine-tuning

Japan huggingface.co/datasets/namakoo/idfu-verified-code Joined May 2026

1.1k Points • 8 Badges • 1 Connections • 1 Followers • 1 Following

Profile Stats Wall Posts Badges

Posts by namakoo

Posts Comments Reviews

All Articles Discussions Launches Tutorials Videos

namakoo • May 7 • in Articles • 11 min read

From -9.15pp to +0.61pp: An engineering journey through four DPO iteration failures

Over 36 hours we ran four DPO training iterations against Qwen2.5-Coder-7B-Instruct, trying to push HumanEval pass@1 above the base model's 87.20%. The first three iterations failed in different ways -9.15pp, -1.22pp, two NO-GO calls. The fourth rec...

0 Reactions 0 Comments

namakoo • May 2 • in Articles • 6 min read

What 500 curated failure pairs actually fix: a breakdown across 3 seeds

In the previous posthttps://dev.to/namakoo/-curating-python-failures-for-dpo-notes-from-the-rejected-side-2ff0, I described the curation philosophy for IDFU's rejected-side dataset — why I avoid synthetic bug generation, why stub detection matters, w...

5 Reactions 2 Comments

Welcome to Coder Legion

Connect with 4,608 amazing developers

Don't have an account? Sign up

OR

Posts by namakoo

池澤允彦

Posts by namakoo

Latest Jobs