It would be nice if you could report some accuracy metrics of this approach on w...

andersonbcdefg · on Sept 21, 2024

Great suggestion, I'll look into that. My expectation is that this library would not be state-of-the-art compared to training on labeled data (the intended purpose is building models where labels aren't available, if you have labels, it's obviously good to use the labels, ha). But it would be interesting to see how much of the performance is retained relative to training on the gold labels.