Producer: Ben Ellman
Пострадавшие. Кадр: Telegram-канал Mash
。黑料对此有专业解读
I was doing something different. I wasn’t changing what the model knew. I was changing how it thought. Layer duplication gives the model more iterations through its internal reasoning space without adding any new information. The difference between giving someone a bigger library and giving them more time to think. I was genuinely shocked when I took top spot on the leaderboard; but I think it’s proof that the method probably works.,更多细节参见手游
The same is true for research papers.,推荐阅读移动版官网获取更多信息