Raw LLM Responses
Inspect the exact model output for any coded comment.
Look up by comment ID
Random samples — click to inspect
in
Demis. Stochastic weights won't solve AGI alignment or the megawatt energy wall.…
7463323625832…
in
Assessment shapes behaviour. Change the assessment, change the behaviour. When g…
7464220733544…
in
I’ve sat in alignment meetings where an engineering team flagged a subtle vector…
7464854627189…
in
This has to be managed but there is always a missing part to this dicusssion - t…
7463182832509…
in
Yes, organizations purchased AI licenses across teams without fully evaluating w…
7465617770982…
in
The release of Magnifica Humanitas proves that the AI revolution has officially …
7466136344793…
in
A really important perspective on how AI is changing the value of work rather th…
7466149909025…
in
Respecting your struggle over the last many years Matthew, it’s good to know fin…
7465141383905…
Comment
Great to see the focus on transparency and security, especially with the SynthID adoption. However, as we approach the 'foothills of the singularity' and empower these agentic systems, the challenge isn't just transparency—it's the robustness of the agents themselves against sophisticated prompt injection and environment manipulation. Ready to see these models pushed to their limits in real-world red teaming.
LinkedIn
AI Safety & Risk
AI Evaluation & Safety | LLM Behavior Analysis …
2026-05-27T05:2…
Coding Result
| Dimension | Value |
|---|---|
| Primary value | safety |
| Secondary value | transparency |
| Alignment target | humanity |
| Stance | demanding |
| Emotion | approval |
| Value justification | The speaker emphasizes the need for robustness of agentic systems against sophisticated attacks, indicating a primary concern for safety. |
| Target justification | The mention of approaching the 'foothills of the singularity' and the impact of agentic systems suggests a focus on the well-being of humanity as a whole. |
| Coded at | 2026-06-11T08:21:01Z |
Raw LLM Response
```
{
"value_primary": "safety",
"value_secondary": "transparency",
"target": "humanity",
"stance": "demanding",
"emotion": "approval",
"value_justification": "The speaker emphasizes the need for robustness of agentic systems against sophisticated attacks, indicating a primary concern for safety.",
"target_justification": "The mention of approaching the 'foothills of the singularity' and the impact of agentic systems suggests a focus on the well-being of humanity as a whole."
}
```