Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It would be nice if you could report some accuracy metrics of this approach on well known text datasets, after hiding the label


Great suggestion, I'll look into that. My expectation is that this library would not be state-of-the-art compared to training on labeled data (the intended purpose is building models where labels aren't available, if you have labels, it's obviously good to use the labels, ha). But it would be interesting to see how much of the performance is retained relative to training on the gold labels.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: