I had considered this task infeasible, due to a relative lack of training data. After all, isn't the received wisdom that you must shove every scrap of Common Crawl into your pre-training or you're doing it wrong? ;)
But reading the outputs here, it would appear that quality has won out over quantity after all!
But reading the outputs here, it would appear that quality has won out over quantity after all!