Cool project. Slicing mixes into 30s chunks is smart, but do you worry about label noise? Like parts of a mix not fully matching the genre?
Expanding the GTZAN Dataset: A Journey from YouTube to Mel Spectrograms
alejandrotg-codeLeader
— Originally published at dev.to
2 Comments
wanderer
alejandrotg-code
@[wanderer] That’s a very valid concern! To address it, I implemented logic that skips the beginning and the end of each track, specifically to avoid silences, long intros, and fade-outs that don't represent the genre's core features. That way the 30s chunks are pulled from the most information-dense part of the song. Do you think a fixed offset is enough, or would you go for dynamic detection of audio activity?
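To make the two options in this reply concrete, here is a minimal sketch in plain Python. It is not the code from the project: the function names, the 60-second offsets, and the 0.02 RMS threshold are all assumptions chosen for illustration. `chunk_bounds` shows the fixed-offset idea (trim a set amount from both ends, then cut full 30s chunks), while `active_region` shows one simple form of dynamic detection, given per-frame RMS energies computed elsewhere.

```python
from typing import List, Optional, Sequence, Tuple


def chunk_bounds(
    duration_s: float,
    chunk_s: float = 30.0,
    skip_start_s: float = 60.0,  # assumed offset, not taken from the post
    skip_end_s: float = 60.0,    # assumed offset, not taken from the post
) -> List[Tuple[float, float]]:
    """Fixed-offset variant: return (start, end) times of full chunks
    that fit between the skipped intro and the skipped outro."""
    usable_end = duration_s - skip_end_s
    bounds = []
    start = skip_start_s
    # Keep only complete chunks; a trailing partial chunk is dropped.
    while start + chunk_s <= usable_end:
        bounds.append((start, start + chunk_s))
        start += chunk_s
    return bounds


def active_region(
    frame_rms: Sequence[float],
    threshold: float = 0.02,  # assumed energy threshold, tune per dataset
) -> Optional[Tuple[int, int]]:
    """Dynamic variant: return (first, last + 1) indices of the frames
    whose RMS energy reaches the threshold, or None if none do."""
    active = [i for i, rms in enumerate(frame_rms) if rms >= threshold]
    if not active:
        return None
    return active[0], active[-1] + 1


if __name__ == "__main__":
    # A 200 s track with 60 s trimmed from each end leaves 80 s of audio.
    print(chunk_bounds(200.0))
    # Quiet frames at both ends are excluded from the active region.
    print(active_region([0.0, 0.01, 0.3, 0.4, 0.01]))
```

The two can be combined: detect the active region first, convert the frame indices to seconds, and run the chunking only inside that span. Neither approach addresses genre drift in the middle of a mix, so label noise of that kind would still need a separate check.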