Raw LLM Responses
Inspect the exact model output for any coded comment.
Look up by comment ID
Random samples — click to inspect
G
I want to see them automate securing all types of loads.
Can a fuel truck be 100…
ytc_Ugwrbf9pY…
G
Ill admit, I was almost caught off gaurd at how realistic it was. I almost suspe…
ytc_UgxHQjlmL…
G
@florianschneider3982 it's easier to use ai then to actually draw an artwork. A…
ytr_Ugy-81kVJ…
G
if ECHO OR AI is so intelligent, then it should be able to DISARM RUSSIAS nuclea…
ytc_UgyL9gS-n…
G
it gov: ban chat gpt.
Open ai: sure thing. Is there anything else I could assist…
ytr_UgzuBNcMk…
G
What would be the solution?
To start post carriages again, stop truck drivers an…
ytc_UgxeJ-TCR…
G
Maybe the AIs that get close to a consciousness, maybe the first thing it would …
ytc_UgwrLmycN…
G
If AI will ever become really intelligent, especially more intelligent than the …
ytc_UgxNhUWYQ…
Comment
That is an incredibly long output, wow, but after stopping to read a couple of random sections, it doesn't seem super impressive.
Literally the first thing I saw is outdated info:
>Emerging Capabilities in the Next 6–12 Months
>
> ...
>Google’s Gemini is expected to be multimodal from the outset
>
> ...
>Context window lengths will likely continue to expand. After models like Claude demonstrated handling 100K-token inputs, we might see mainstream LLMs capable of reading and analyzing entire books or large codebases at once.
Gemini 1.5 Pro was released a year ago and was both multimodal and able to handle up to 10m tokens. Searching the text for "1.5" and "Gemini" shows it did not mention Gemini 1.5 Pro at all, and basically said nothing about Google’s Gemini models at all.
Here's another random snippet I found that is outdated or poorly phrased at best:
>For formal reasoning or coding, some systems have “analysis” modes (e.g. GPT-4 has a mode where it can output thought in brackets that the user might not see). If you have access to such features (often via an API or specific interface), leveraging them can be powerful. But in a typical chat, an effective approach is to explicitly ask for structured output.
Analysis modes? I don't think anyone has ever used that phrase, and this doesn't seem to be referencing the 'reasoning process' by o1, o3, R1, etc. or even the analysis that Deep Search itself is supposed to do.
It mentions "Reasoning-Focused Models or Modes" without mentioning any of the actual known "reasoning focused models" like o1 and R1. This specific section of the report seems to be mainly hallucinations that are vaguely based on a few outdated ideas.
For someone who hasn't been paying any significant attention to the AI space in years (and who isn't looking for the most specific and up-to-date info), this report is probably pretty good. But for me, it seems lacking.
I'm guessing a different prompt, one more focused on
reddit
AI Moral Status
1739023836.0
♥ 14
Coding Result
| Dimension | Value |
|---|---|
| Responsibility | none |
| Reasoning | unclear |
| Policy | none |
| Emotion | indifference |
| Coded at | 2026-04-25T08:33:43.502452 |
Raw LLM Response
[
{"id":"rdc_mbnry5r","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},
{"id":"rdc_mc7lhim","responsibility":"ai_itself","reasoning":"mixed","policy":"unclear","emotion":"fear"},
{"id":"rdc_mbq9i6x","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"mixed"},
{"id":"rdc_mbqkv5a","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"approval"},
{"id":"rdc_mcf1us8","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"approval"}
]