Raw LLM Responses

Inspect the exact model output for any coded comment.

Comment
Minor nuance on METR: the chart you referenced measures the success rate of tasks measured by the duration of how long it takes a human expert to perform it, not the AI model itself. So on the latest 50% chart, the ML bug fix example would take a human expert on average 15 hours to complete, and Opus 4.6 is able to complete that task successfully 50% of the time. It doesn't list how long it took the model to do the task though, for all we know it might have been a few minutes. If you pair that with the fact that these things can work for hours and hours independently, it does seem like something big is happening.
youtube AI Jobs 2026-02-24T15:1… ♥ 10
Coding Result
Dimension       Value
Responsibility  none
Reasoning       unclear
Policy          unclear
Emotion         indifference
Coded at        2026-04-27T06:24:53.388235
Raw LLM Response
[
{"id":"ytc_UgxlAyaeyzywBfGW78R4AaABAg","responsibility":"none","reasoning":"unclear","policy":"unclear","emotion":"approval"},
{"id":"ytc_UgysL2W6nOgnDlCxQwR4AaABAg","responsibility":"none","reasoning":"unclear","policy":"unclear","emotion":"indifference"},
{"id":"ytc_Ugybr59IUXJC-KwimG94AaABAg","responsibility":"user","reasoning":"virtue","policy":"none","emotion":"outrage"},
{"id":"ytc_Ugw-XLDZjc6mQtMwk7h4AaABAg","responsibility":"company","reasoning":"consequentialist","policy":"none","emotion":"fear"},
{"id":"ytc_Ugw7-W4sgH64d0bno354AaABAg","responsibility":"company","reasoning":"deontological","policy":"regulate","emotion":"outrage"},
{"id":"ytc_UgxgMXrCqcIvF_6iBFN4AaABAg","responsibility":"company","reasoning":"consequentialist","policy":"unclear","emotion":"mixed"},
{"id":"ytc_UgzLa69DYQQGj8yLkDR4AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"resignation"},
{"id":"ytc_UgwDxGuUMqYesUx3tk54AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},
{"id":"ytc_UgzEVmaYxOIo5Gpip3V4AaABAg","responsibility":"none","reasoning":"unclear","policy":"unclear","emotion":"indifference"},
{"id":"ytc_UgyAJVJSypPpyAinoBJ4AaABAg","responsibility":"none","reasoning":"unclear","policy":"unclear","emotion":"approval"}
]
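
The raw response is a JSON array that codes a batch of comments at once, so checking one comment means finding its record by id. The following is a minimal sketch of that lookup, not the tool's own code: the function name `index_codings` and the validation rules are assumptions for illustration, and only the field names and the two sample records are taken from the response above. The page does not state which id belongs to the comment shown, so the id used in the lookup is simply one of the records.

```python
import json

# Field names as they appear in the raw response above.
DIMENSIONS = ("responsibility", "reasoning", "policy", "emotion")


def index_codings(raw: str) -> dict[str, dict[str, str]]:
    """Parse a raw batch response and index the codings by comment id.

    Hypothetical helper: raises ValueError if the payload is not a JSON
    array or if any record lacks an expected field.
    """
    records = json.loads(raw)
    if not isinstance(records, list):
        raise ValueError("expected a JSON array of coding records")
    indexed: dict[str, dict[str, str]] = {}
    for record in records:
        missing = [key for key in ("id", *DIMENSIONS) if key not in record]
        if missing:
            raise ValueError(f"record is missing fields: {missing}")
        indexed[record["id"]] = {dim: record[dim] for dim in DIMENSIONS}
    return indexed


# Two records copied from the raw response above (the batch is abbreviated).
raw_response = """[
 {"id":"ytc_UgxlAyaeyzywBfGW78R4AaABAg","responsibility":"none","reasoning":"unclear","policy":"unclear","emotion":"approval"},
 {"id":"ytc_UgysL2W6nOgnDlCxQwR4AaABAg","responsibility":"none","reasoning":"unclear","policy":"unclear","emotion":"indifference"}
]"""

codings = index_codings(raw_response)
print(codings["ytc_UgysL2W6nOgnDlCxQwR4AaABAg"])
```

Indexing by id rather than by position avoids relying on the model returning records in the same order the comments were submitted.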