Undertrained Tokens in DeepSeek R1

(tokencontributions.substack.com)

3 points | by pr337h4m 224 days ago

0 comments