namakoo

@namakoo

池澤允彦

Building IDFU — verified-code DPO training data for LLM fine-tuning

Japan huggingface.co/datasets/namakoo/idfu-verified-code Joined May 2026

1.1k Points • 8 Badges • 1 Connections • 1 Followers • 1 Following

Profile Stats Wall Posts Badges

About

Solo developer with 10+ years of
experience in Android development and the broader
developer ecosystem. Currently building IDFU on
Hugging Face — a verified-code DPO training da... Show more

Language & Tools

Python, Docker, PyTorch, Transformers, TRL DPOTrainer,
Qwen2.5-Coder, HuggingFace Hub, ChromaDB, SQLite,
Stripe, Android, LoRA, Quantization

Currently Exploring

DPO training data curation, LLM fine-tuning at 4-bit
quantization, on-device LLM agents, Python failure
pattern taxonomy

Achievements

+3.46 ± 0.35 pp HumanEval improvement (3 seeds, Qwen2.5-Coder-3B) using 500 curated DPO pairs.
Built and shipped IDFU — a verified-code DPO dataset family — as a solo developer, with 8 products across 5 price tiers on Hugging Face.
Featured contributor on CoderLegion.

Fun Fact

Started in custom ROM development a decade ago, ended up curating failure patterns for code LLMs. The throughline is the same: figuring out exactly where things break.

Random Dev Quote

Most of the work in DPO training data is on the rejected side.

Top Tags

User Activities

2026 2025 2024 2023 2022

	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb	Mar	Apr	May	Jun
Mon
Tue
Wed
Thu
Fri
Sat
Sun

Less More

Contributions

Comments

I’m all for learning as much as possible about AI. I honestly think ...
May 24, 2026
Strong framing. and I'd add: the judgment isn't just about evaluating ...
May 17, 2026
Wow iter a great tool.
May 07, 2026
@[Jundarer] Fair concern — noise was the main risk going in. Three ...
May 04, 2026
The "session token issue it found on its own" line stayed with me. Tha...
May 03, 2026

See All Comments

Articles

See All Articles


Joined:	2 months (since May 2)
Full Name:	池澤允彦
Headline:	Building IDFU — verified-code DPO training data for LLM fine-tuning
About:	Solo developer with 10+ years of experience in Android development and the broader developer ecosystem. Currently building IDFU on Hugging Face — a verified-code DPO training data family covering specialty packs ($9), main releases ($49), and benchmark-aligned DPO Pair Pack ($99). Recent benchmark: +3.46 ± 0.35 pp on HumanEval (Qwen2.5-Coder-3B, 3 seeds, Docker sandbox executed). Datasets: huggingface.co/datasets/namakoo/idfu-verified-code dev.to: dev.to/namakoo
Location:	Japan
Website:	https://huggingface.co/datasets/namakoo/idfu-verified-code
Languges & Tools:	Python, Docker, PyTorch, Transformers, TRL DPOTrainer, Qwen2.5-Coder, HuggingFace Hub, ChromaDB, SQLite, Stripe, Android, LoRA, Quantization
Currently Exploring:	DPO training data curation, LLM fine-tuning at 4-bit quantization, on-device LLM agents, Python failure pattern taxonomy
Achievements:	+3.46 ± 0.35 pp HumanEval improvement (3 seeds, Qwen2.5-Coder-3B) using 500 curated DPO pairs. Built and shipped IDFU — a verified-code DPO dataset family — as a solo developer, with 8 products across 5 price tiers on Hugging Face. Featured contributor on CoderLegion.
Fun Fact:	Started in custom ROM development a decade ago, ended up curating failure patterns for code LLMs. The throughline is the same: figuring out exactly where things break.
Random Dev Quote:	Most of the work in DPO training data is on the rejected side.
Latest Video:

namakoo

池澤允彦

About

Language & Tools

Currently Exploring

Achievements

Fun Fact

Random Dev Quote

Top Tags

User Activities

Contributions

Comments

Articles

Active in these Groups:

Latest Jobs

Welcome to Coder Legion

Connect with 4,622 amazing developers

Don't have an account? Sign up

OR

namakoo

池澤允彦

About

Language & Tools

Currently Exploring

Achievements

Fun Fact

Random Dev Quote

Top Tags

User Activities

Contributions

Comments

Articles

Active in these Groups:

Latest Jobs