Raw LLM Responses

Inspect the exact model output for any coded comment.

Comment
A lot of bias in here tho, not to mention we can't be sure that the data wasn't used for the gpt models, also why use llama instead of llama 3, and newer models in the comparison, seems hard to justify for me. Finally standardized tests like this are where LLMs are the brightest, but they won't reflect well on real world senarios
youtube AI Harm Incident 2024-06-01T14:2…
Coding Result
Dimension | Value
Responsibility | developer
Reasoning | mixed
Policy | regulate
Emotion | mixed
Coded at | 2026-04-27T06:24:53.388235
Raw LLM Response
[ {"id":"ytc_UgyjLyV58DUvLHmQraF4AaABAg","responsibility":"company","reasoning":"consequentialist","policy":"regulate","emotion":"outrage"}, {"id":"ytc_UgzETU8XcYSW3E9k4IN4AaABAg","responsibility":"company","reasoning":"consequentialist","policy":"liability","emotion":"outrage"}, {"id":"ytc_UgwMKAs0RaTATguPEsh4AaABAg","responsibility":"user","reasoning":"mixed","policy":"ban","emotion":"outrage"}, {"id":"ytc_UgzaqEDMCbDt2OHkJZ94AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"}, {"id":"ytc_UgzPaGAxp7cSWyd46X54AaABAg","responsibility":"company","reasoning":"consequentialist","policy":"regulate","emotion":"fear"}, {"id":"ytc_Ugwj4d-u4rExbOHC4iV4AaABAg","responsibility":"user","reasoning":"consequentialist","policy":"none","emotion":"approval"}, {"id":"ytc_UgwqJBIloVlTOzA5BOd4AaABAg","responsibility":"unclear","reasoning":"unclear","policy":"unclear","emotion":"mixed"}, {"id":"ytc_Ugx5IsQIwTxNES8U6zJ4AaABAg","responsibility":"ai_itself","reasoning":"deontological","policy":"liability","emotion":"approval"}, {"id":"ytc_Ugxf9OY9ClB-z-90_b94AaABAg","responsibility":"developer","reasoning":"mixed","policy":"regulate","emotion":"mixed"}, {"id":"ytc_Ugw93H5nF2sAxOCYRYp4AaABAg","responsibility":"company","reasoning":"consequentialist","policy":"regulate","emotion":"resignation"} ]
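To check the Coding Result against the raw response, one option is to parse the JSON array and look for the entry whose four dimension values match. The sketch below assumes the response is valid JSON, as it appears to be here. Because the page does not show the comment's own id, it matches on values rather than on id. The names `raw_response`, `coding_result`, and `find_matching_entries` are illustrative and not part of the tool. Only three of the ten entries are reproduced, to keep the example short.

```python
import json

# Excerpt of the raw response (three of the ten entries, copied verbatim).
raw_response = """[
 {"id":"ytc_UgwqJBIloVlTOzA5BOd4AaABAg","responsibility":"unclear","reasoning":"unclear","policy":"unclear","emotion":"mixed"},
 {"id":"ytc_Ugxf9OY9ClB-z-90_b94AaABAg","responsibility":"developer","reasoning":"mixed","policy":"regulate","emotion":"mixed"},
 {"id":"ytc_Ugw93H5nF2sAxOCYRYp4AaABAg","responsibility":"company","reasoning":"consequentialist","policy":"regulate","emotion":"resignation"}
]"""

# Values taken from the Coding Result for this comment.
coding_result = {
    "responsibility": "developer",
    "reasoning": "mixed",
    "policy": "regulate",
    "emotion": "mixed",
}


def find_matching_entries(raw, expected):
    """Return every entry in the raw JSON array whose fields equal `expected`."""
    entries = json.loads(raw)
    return [
        entry
        for entry in entries
        if all(entry.get(key) == value for key, value in expected.items())
    ]


matches = find_matching_entries(raw_response, coding_result)
for entry in matches:
    print(entry["id"])
```

If more than one entry in a batch carries the same four values, a value-based match is ambiguous, so matching on the comment id would be the more reliable check wherever that id is available.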