Cool project. Slicing mixes into 30s chunks is smart, but do you worry about label noise? Like parts of a mix not fully matching the genre?
Expanding the GTZAN Dataset: A Journey from YouTube to Mel Spectrograms
alejandrotg-code
Originally published at dev.to
2 Comments
wanderer
alejandrotg-code
@wanderer That's a very valid concern! To address it, I added logic that skips the beginning and the end of each track, specifically to avoid silences, long intros, and fade-outs that don't represent the genre's core features. That way the 30s chunks come from the most information-dense part of the song. Do you think a fixed offset is enough, or would you go for dynamic detection of audio activity?
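To make the two options in that question concrete, here is a minimal sketch of both, not the code from the project. The function names, the 60-second skip values, and the relative RMS threshold are all assumptions chosen for illustration. The first function computes chunk boundaries with a fixed offset; the second finds the active region by comparing per-frame RMS energy against a fraction of the loudest frame.

```python
import math
from typing import List, Sequence, Tuple


def fixed_offset_chunks(
    duration_s: float,
    chunk_s: float = 30.0,
    skip_start_s: float = 60.0,  # assumed value, not taken from the post
    skip_end_s: float = 60.0,    # assumed value, not taken from the post
) -> List[Tuple[float, float]]:
    """Return (start, end) times of full-length chunks inside the trimmed region."""
    usable_end = duration_s - skip_end_s
    chunks = []
    start = skip_start_s
    # Only keep chunks that fit entirely before the skipped tail.
    while start + chunk_s <= usable_end:
        chunks.append((start, start + chunk_s))
        start += chunk_s
    return chunks


def active_region(
    samples: Sequence[float],
    frame_len: int,
    rel_threshold: float = 0.1,  # assumed: fraction of the loudest frame's RMS
) -> Tuple[int, int]:
    """Return (first, last_exclusive) sample indices spanning the active frames."""
    n_frames = len(samples) // frame_len
    rms = []
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        rms.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    if not rms or max(rms) == 0.0:
        return (0, 0)  # empty or fully silent input
    cutoff = rel_threshold * max(rms)
    active = [i for i, r in enumerate(rms) if r >= cutoff]
    return (active[0] * frame_len, (active[-1] + 1) * frame_len)


# A 5-minute track with 60 s skipped at each end leaves 180 s, i.e. six chunks.
chunks = fixed_offset_chunks(300.0)

# Toy signal: silence, then activity, then silence again.
region = active_region([0.0] * 4 + [1.0] * 8 + [0.0] * 4, frame_len=4)
```

Both approaches only trim silence and quiet fades. Neither checks that a loud section actually matches the genre label, so they reduce the label-noise problem raised above rather than remove it.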