namakoo

@namakoo

池澤允彦

Building IDFU — verified-code DPO training data for LLM fine-tuning
640 Points · 6 Badges · 1 Connection · 1 Follower · 1 Following

About

Solo developer with 10+ years of experience in Android development and the broader developer ecosystem. Currently building IDFU on Hugging Face — a verified-code DPO training data family covering specialty packs ($9), main releases ($49), and a benchmark-aligned DPO Pair Pack ($99).

Recent benchmark: +3.46 ± 0.35 pp on HumanEval (Qwen2.5-Coder-3B, 3 seeds, executed in a Docker sandbox).

Language & Tools

Python, Docker, PyTorch, Transformers, TRL DPOTrainer,
Qwen2.5-Coder, HuggingFace Hub, ChromaDB, SQLite,
Stripe, Android, LoRA, Quantization
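TRL's DPOTrainer, listed above, consumes preference data as records with `prompt`, `chosen`, and `rejected` fields, typically shipped as JSON Lines. A minimal sketch of one such record; the prompt and code snippets are illustrative, not taken from the IDFU dataset:

```python
import json

# One DPO preference pair in the column layout TRL's DPOTrainer expects:
# a shared prompt, a preferred ("chosen") completion, and a dispreferred
# ("rejected") one. Content here is illustrative only.
pair = {
    "prompt": "Write a Python function that returns the nth Fibonacci number.",
    "chosen": (
        "def fib(n):\n"
        "    a, b = 0, 1\n"
        "    for _ in range(n):\n"
        "        a, b = b, a + b\n"
        "    return a\n"
    ),
    "rejected": (
        "def fib(n):\n"
        "    # no base case: recurses forever\n"
        "    return fib(n - 1) + fib(n - 2)\n"
    ),
}

# Serialize one pair per line (JSON Lines), then read it back.
line = json.dumps(pair)
record = json.loads(line)
print(sorted(record))  # → ['chosen', 'prompt', 'rejected']
```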

Currently Exploring

DPO training data curation, LLM fine-tuning at 4-bit
quantization, on-device LLM agents, Python failure
pattern taxonomy
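Of the topics above, 4-bit quantization is the easiest to sketch: weights are mapped onto 16 integer levels with a shared scale and mapped back, trading precision for memory. This toy symmetric quantizer is a conceptual sketch only; production 4-bit schemes (e.g. NF4 in bitsandbytes) use block-wise scales and non-uniform levels:

```python
def quantize_4bit(weights):
    """Toy symmetric 4-bit quantization: floats -> integers in [-8, 7]."""
    scale = max(abs(w) for w in weights) / 7 or 1.0  # avoid a zero scale
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    return [v * scale for v in q]

weights = [0.42, -1.3, 0.07, 0.9]
q, scale = quantize_4bit(weights)
restored = dequantize(q, scale)

print(q)  # → [2, -7, 0, 5]
# Reconstruction error is bounded by half a quantization step.
assert all(abs(a - b) <= scale / 2 + 1e-9 for a, b in zip(weights, restored))
```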

Achievements

+3.46 ± 0.35 pp HumanEval improvement (3 seeds, Qwen2.5-Coder-3B) using 500 curated DPO pairs.
Built and shipped IDFU — a verified-code DPO dataset family — as a solo developer, with 8 products across 5 price tiers on Hugging Face.
Featured contributor on CoderLegion.
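The "+3.46 ± 0.35 pp over 3 seeds" figure is an aggregate of per-seed benchmark deltas. A sketch of how such a number is computed, using hypothetical per-seed values (the real per-seed results are not published here) and the sample standard deviation for the ± term (the profile does not state whether ± denotes stdev or standard error):

```python
from statistics import mean, stdev

# Hypothetical per-seed HumanEval deltas in percentage points; only the
# aggregate is published for IDFU, not the per-seed numbers.
deltas = [3.1, 3.5, 3.8]

mu = mean(deltas)
sigma = stdev(deltas)  # sample standard deviation across the 3 seeds
print(f"+{mu:.2f} ± {sigma:.2f} pp ({len(deltas)} seeds)")  # → +3.47 ± 0.35 pp (3 seeds)
```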

Fun Fact

Started in custom ROM development a decade ago, ended up curating failure patterns for code LLMs. The throughline is the same: figuring out exactly where things break.

Random Dev Quote

Most of the work in DPO training data is on the rejected side.
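That quote reflects how verified-code pairs are built: candidate completions are executed against tests, a verified pass becomes the chosen side and a verified failure the rejected side, so curation effort concentrates on producing plausible failures. A minimal in-process sketch; the actual IDFU pipeline runs candidates in a Docker sandbox, whereas this version uses `exec` directly, which is unsafe for untrusted code:

```python
def verify(candidate_src, test_src):
    """Execute a candidate solution against a test; True if it passes."""
    ns = {}
    try:
        exec(candidate_src, ns)  # define the candidate function
        exec(test_src, ns)       # run the assertions against it
        return True
    except Exception:
        return False

TEST = "assert add(2, 3) == 5 and add(-1, 1) == 0"

candidates = [
    "def add(a, b):\n    return a + b",  # correct
    "def add(a, b):\n    return a - b",  # wrong operator
]

chosen = [c for c in candidates if verify(c, TEST)]
rejected = [c for c in candidates if not verify(c, TEST)]
print(len(chosen), len(rejected))  # → 1 1
# Pairing each verified pass with a verified failure on the same prompt
# yields one (chosen, rejected) DPO pair.
```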
Joined: May 2 (2 days ago)
Extra privileges: Editing any comment
Datasets: huggingface.co/datasets/namakoo/idfu-verified-code
dev.to: dev.to/namakoo
Location: Japan
Website: https://huggingface.co/datasets/namakoo/idfu-verified-code
Activity

(User Activities contribution calendar)

Active in these Groups:

AI (856 members)
Open Source (815 members)
Python Dev (439 members)