Introducing AudioLoop
I’ve been working for a while on AudioLoop, so I figure it needs a post. AudioLoop is a tool to help researchers create labels for large unlabeled audio datasets. It’s meant to indirectly solve a problem I see in bioacoustics: the lack of many large labeled datasets. Hopefully AudioLoop can help others create more of them. The problem is that labeling can be a long, drawn-out, draining manual process. Having to listen to a million clips and click “yes” or “no” for each one is challenging even for the most determined, not to mention expensive. And many bioacoustics datasets are highly imbalanced: events of interest can be quite rare. Listening to 100 clips of noise for every one clip of interest isn’t a great way to spend time or resources. ...