Raw LLM Responses
Inspect the exact model output for any coded comment.
Comment
Marx talks about this in Capital. Machinery in the Industrial Revolution wasn't used to automate jobs. It was mainly used to lower the skill needed for the work so that women and children could do the jobs. It also resulted in a lengthening of the working day. So instead of needing to carry things all day, you now had machines that did that for you. So now you didn't have to deal with physical exhaustion, so you could work longer hours.
This has *some* similarity to what will happen with AI in software engineering, but not a ton. There's not a lot that AI can completely automate for you with coding, primarily because you just can't trust it.
LLMs are an untestable black box. Sure, the LLM can quote specific parts of a PDF or search the web. But it can misinterpret those results, randomly bork unexpectedly on certain inputs, or just hallucinate completely. We have lots of wonderful constructed benchmarks that evaluate various metrics. But these metrics are hyper-specific to those specific benchmarks, because performance varies widely depending on input, temperature, prompt, and seed. These things are *huge* and you objectively cannot reason about them the same way you can a complex software system. The testing space is infinitely higher. So smart companies will use this to make coding just a little bit faster, but won't be replacing engineers anytime soon. You need someone to blame / fire when things go wrong.
reddit
AI Jobs
1712790751.0
♥ 4
Coding Result
| Dimension | Value |
|---|---|
| Responsibility | none |
| Reasoning | unclear |
| Policy | unclear |
| Emotion | indifference |
| Coded at | 2026-04-25T08:33:43.502452 |
Raw LLM Response
[
{"id":"rdc_kyzvk74","responsibility":"none","reasoning":"unclear","policy":"unclear","emotion":"indifference"},
{"id":"rdc_kyz7a6m","responsibility":"company","reasoning":"mixed","policy":"industry_self","emotion":"approval"},
{"id":"rdc_kyzat7q","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"mixed"},
{"id":"rdc_kyz7vbe","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"indifference"},
{"id":"rdc_kz0agjt","responsibility":"company","reasoning":"consequentialist","policy":"none","emotion":"approval"}
]
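
The raw response above is a JSON array with one object per comment in the batch, so the Coding Result for a single comment can be recovered by matching on `id`. The sketch below is a minimal illustration under stated assumptions, not the tool's actual implementation: the helper name `find_coding` is hypothetical, and the ID `rdc_kyzvk74` is assumed to belong to the comment shown because it is the only entry whose four values match the Coding Result table.

```python
import json
from typing import Optional

# Raw LLM response copied verbatim from the page above.
RAW_RESPONSE = """[
{"id":"rdc_kyzvk74","responsibility":"none","reasoning":"unclear","policy":"unclear","emotion":"indifference"},
{"id":"rdc_kyz7a6m","responsibility":"company","reasoning":"mixed","policy":"industry_self","emotion":"approval"},
{"id":"rdc_kyzat7q","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"mixed"},
{"id":"rdc_kyz7vbe","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"indifference"},
{"id":"rdc_kz0agjt","responsibility":"company","reasoning":"consequentialist","policy":"none","emotion":"approval"}
]"""


def find_coding(raw: str, comment_id: str) -> Optional[dict]:
    """Hypothetical helper: return the coding entry for comment_id, or None."""
    for entry in json.loads(raw):
        if entry.get("id") == comment_id:
            return entry
    return None


# Assumed ID of the comment displayed above (inferred from matching values).
coding = find_coding(RAW_RESPONSE, "rdc_kyzvk74")
if coding is not None:
    for dimension in ("responsibility", "reasoning", "policy", "emotion"):
        print(f"{dimension}: {coding[dimension]}")
```

Returning `None` for an unknown ID, rather than raising, keeps the lookup usable when a comment was coded in a different batch than the one being inspected.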