Research Training And DistillationResearch Item

Prompt injection into LLM judges for grades and evaluation

April 5, 2026Ethan Mollick

Ethan Mollick reports tests where hidden prompts in letters and papers can manipulate LLM graders on older and smaller models but not most frontier models. The implication is evaluation pipelines need model-aware defenses.

Open in PulseSee the full authority discussion →

QUOTES

“Can you prompt inject your way to an “A”?”

“people are inserting AI prompts into letters, CVs & papers.”

“It does on older & smaller models, but not on most frontier AI”

VOICES

Ethan Mollick

RELATED TERMS

evaluationsecurityllm

OTHER FINDINGS IN RESEARCH TRAINING AND DISTILLATION

Emotion representations inside Claude affecting behavior Mythos / Capybara capability claims: 'dramatically higher' on coding, reasoning, and cybersecurity; expensive to run LLM maintained personal wiki knowledge bases from raw documents

AMYGDALA PULSE

See what authorities are saying right now

This finding is one of many signals tracked across Artificial Intelligence. The live feed updates every few hours with new authority voices, debates, and emerging ideas.

Open Artificial Intelligence Pulse Browse all topics

← Back to Artificial Intelligence