Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You mean something like http://www.voxforge.org/

Seriously there is no massive CC-licensed source of audio data out there. Most of the fancy algorithms for doing speech recognition are on github. What isn't is a massive and diverse dataset. I encourage others to reply if they have seen otherwise.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: