AI companies use model specifications to define target behaviors during training and evaluation. Do current specs state the intended behaviors with enough precision, and do frontier models exhibit ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results