I assume they all crib from the same training sets, but surely one of the billion-dollar companies behind them can make its own?
This is due to the training sets. One of them is Common Crawl, which is disgusting. Chinese LLMs like DeepSeek R1 and Qwen 3 use a different set of training materials that is actually good, despite being censored too.
What’s Common Crawl?


