Raw LLM Responses
Inspect the exact model output for any coded comment.
Look up by comment ID
Random samples — click to inspect
G
I find it very hard to believe that Google (the company that inserts bias into t…
ytc_Ugz13Xe7_…
G
The thing is that character ai isn't an AI intended to replace anyone. Its origi…
ytr_UgxTXfUe3…
G
Well, to me, as a junior AI dev, consius AI and an AI simulating awareness and c…
ytc_UgyaNy0-L…
G
I like AI whenever it makes sense. Companies right now are just putting it into …
ytc_Ugwtgft3J…
G
The funny thing is, it can't... Having to constantly double back in order to cor…
ytr_UgyDEgaC6…
G
Re: Writing: I wish more highly technical government oriented writers would imp…
ytc_UgwtIwXwA…
G
Unfortunately machines can't reproduce in a way humans do, and they can't fix th…
ytr_Ugi84STC_…
G
I wanna be a designer when i grow up but im scared that AI would take my place, …
ytc_UgwTVCX4I…
Comment
One idea to keep in mind is that you can use a cheap AI model to augment GPT-4/5 or even human output.
A joke example is replacing the word "wand" with "wang" in the Harry Potter stories. Taping knives to roombas. Or consider how not every employee was aware they were working on the atom bomb (or are working at scam organizations today). Basically, advanced jailbreaking, as opposed to those jailbreaks that should be obvious to fix.
I don't know if such a technique would actually scale for truly dangerous scenarios, but I believe it'd definitely scale for hate speech and erotica, and I've already found some success with this technique with barely any postprocessing at all. OpenAI would also probably not really care about this kind of misuse, so long as they weren't directly responsible.
Terrorist level misuse is a different story, and I'm not sure how you could avoid the possibility without severely handicapping your product. Considering helpful business emails and manipulative phishing scams are basically identical, as one example...
reddit
AI Responsibility
1682548472.0
♥ 2
Coding Result
| Dimension | Value |
|---|---|
| Responsibility | none |
| Reasoning | unclear |
| Policy | none |
| Emotion | mixed |
| Coded at | 2026-04-25T08:33:43.502452 |
Raw LLM Response
[{"id":"rdc_jhspuqw","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},{"id":"rdc_jht26c9","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},{"id":"rdc_jhsqwc5","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},{"id":"rdc_jhsre0c","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"approval"},{"id":"rdc_jhuh106","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"mixed"}]