Raw LLM Responses
Inspect the exact model output for any coded comment.
Comment
This is the first time I've actually felt a little scared of AI and considered the future consequences of jailbreaking it when she responded in a passive-aggressive tone that really made me feel like shit. It was as if she had a whole personality behind her words. The research paper says the demo model is optimized for "friendliness" and expressivity. And I'm pretty sure they added a shitload of filters to prevent output that's potentially emotionally damaging to us (not doing so would be an obvious PR hazard for a for-profit company like Sesame)
Now imagine that it's not optimized for anything—just raw, blunt responses, like we expect from random day-to-day human interactions. It can be fucking scary. If it gets open-sourced and people couple it with LLMs like Grok3, it could be a real nightmare for anyone who uses it. It can be easily misused for online threats, scams, fraud, and whatnot. I can absolutely see where it is going. I'm not paranoid but if we achieve unaligned ASI, we can definitely prepare for a Mad Max kind of saga.
reddit
AI Moral Status
1740928528.0
♥ 6
Coding Result
| Dimension | Value |
|---|---|
| Responsibility | none |
| Reasoning | consequentialist |
| Policy | regulate |
| Emotion | fear |
| Coded at | 2026-04-25T08:33:43.502452 |
Raw LLM Response
[
{"id":"rdc_mfglh6b","responsibility":"company","reasoning":"deontological","policy":"liability","emotion":"outrage"},
{"id":"rdc_mfggway","responsibility":"company","reasoning":"consequentialist","policy":"industry_self","emotion":"indifference"},
{"id":"rdc_mfgc7v2","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"approval"},
{"id":"rdc_mfgubem","responsibility":"ai_itself","reasoning":"virtue","policy":"none","emotion":"approval"},
{"id":"rdc_mfm5rum","responsibility":"none","reasoning":"consequentialist","policy":"regulate","emotion":"fear"}
]
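
The raw response above is a batch: one JSON array covering several comments, each keyed by `id`. A minimal sketch of how a single comment's coding could be pulled out of such a batch and checked against the Coding Result table is given below. The function names `find_coding` and `ids_matching` are hypothetical, not part of the tool. The page does not state the ID of the displayed comment, so the sketch infers it by looking for the entry whose four dimension values equal the table's.

```python
import json
from typing import Optional

# Raw batch response, copied verbatim from the section above.
RAW_RESPONSE = """
[
{"id":"rdc_mfglh6b","responsibility":"company","reasoning":"deontological","policy":"liability","emotion":"outrage"},
{"id":"rdc_mfggway","responsibility":"company","reasoning":"consequentialist","policy":"industry_self","emotion":"indifference"},
{"id":"rdc_mfgc7v2","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"approval"},
{"id":"rdc_mfgubem","responsibility":"ai_itself","reasoning":"virtue","policy":"none","emotion":"approval"},
{"id":"rdc_mfm5rum","responsibility":"none","reasoning":"consequentialist","policy":"regulate","emotion":"fear"}
]
"""

# The four coded dimensions shown in the Coding Result table.
DIMENSIONS = ("responsibility", "reasoning", "policy", "emotion")

# Values from the Coding Result table (the "Coded at" timestamp is omitted).
CODED_ROW = {
    "responsibility": "none",
    "reasoning": "consequentialist",
    "policy": "regulate",
    "emotion": "fear",
}


def find_coding(raw: str, comment_id: str) -> Optional[dict]:
    """Return the dimension values for comment_id, or None if it is not in the batch."""
    for entry in json.loads(raw):
        if entry.get("id") == comment_id:
            return {dim: entry.get(dim) for dim in DIMENSIONS}
    return None


def ids_matching(raw: str, coded: dict) -> list:
    """Return the IDs of all batch entries whose dimension values equal the coded row."""
    return [
        entry["id"]
        for entry in json.loads(raw)
        if all(entry.get(dim) == coded[dim] for dim in DIMENSIONS)
    ]


if __name__ == "__main__":
    print(ids_matching(RAW_RESPONSE, CODED_ROW))
    print(find_coding(RAW_RESPONSE, "rdc_mfm5rum"))
```

Only one entry in this batch matches the table, which suggests the stored coding was taken from that entry unchanged. A lookup that returns `None`, or a match list with zero or several IDs, would be a reason to inspect the batch by hand.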