Benchmarks are not a useful way to decide the "size class". A bigger model holds more knowledge. Also, Qwen 3.5 27B is a dense model with 27B active parameters, while StepFun 3.5 Flash has 11B active parameters.
No, because smart people realize they are playing an iterated game and that behaving in a way that people identify as Machiavellian is actually suboptimal in the long run.
So they're smart enough to be calculated and stupid enough not to be so calculated that they look untrustworthy.
> No, because smart people realize they are playing an iterated game and that behaving in a way that people identify as Machiavellian is actually suboptimal in the long run.
Even if you happen to be right (which I wouldn't be so sure about), that's still a poor argument, assuming you realize that your belief about the optimal strategy is just that: an educated guess.