Gradland

All Posts

199 pieces of content

📚 All 199 · ✍️ Blog 49 · 📊 Diagrams 4 · 🤖 Research 32 · 🔥 Githot 32 · 📡 AI News 43 · 📰 Visa News 43

32 posts

6 May 2026 · 12 min read · AI Research Digest

AI Research Digest — 6 May 2026

  • 1. Exploration Hacking: Can LLMs Learn to Resist RL Training?
  • 2. [Linkpost
  • 3. Risk from fitness-seeking AIs: mechanisms and mitigations
  • +1 more inside →
2 May 2026 · 9 min read · AI Research Digest

AI Research Digest — 2 May 2026

  • 1. Sleeper Agent Backdoor Results Are Messy
  • 2. Risk from fitness-seeking AIs: mechanisms and mitigations
  • 3. Recursive forecasting: Eliciting long-term forecasts from myopic fitness-seekers
  • +1 more inside →
1 May 2026 · 12 min read · AI Research Digest

AI Research Digest — 1 May 2026

  • 1. Sleeper Agent Backdoor Results Are Messy
  • 2. Research Sabotage in ML Codebases
  • 3. Recursive forecasting: Eliciting long-term forecasts from myopic fitness-seekers
  • +2 more inside →
28 Apr 2026 · 7 min read · AI Research Digest

AI Research Digest — 28 April 2026

Three pieces worth your time this week: agent skill packages as a new attack surface, the widening gap between LLM-reported and actual completion, and what TileLang means for kernel-level performance work.

21 Apr 2026 · 9 min read · AI Research Digest

AI Research Digest — 21 April 2026

  • 1. Prompted CoT Early Exit Undermines the Monitoring Benefits of CoT Uncontrollability
  • 2. Current AIs seem pretty misaligned to me
  • 3. Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research
  • +1 more inside →
20 Apr 2026 · 9 min read · AI Research Digest

AI Research Digest — 20 April 2026

  • 1. Prompted CoT Early Exit Undermines the Monitoring Benefits of CoT Uncontrollability
  • 2. Anthropic repeatedly accidentally trained against the CoT, demonstrating inadequate processes
  • 3. Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research
  • +1 more inside →
19 Apr 2026 · 9 min read · AI Research Digest

AI Research Digest — 19 April 2026

  • 1. Prompted CoT Early Exit Undermines the Monitoring Benefits of CoT Uncontrollability
  • 2. Anthropic repeatedly accidentally trained against the CoT, demonstrating inadequate processes
  • 3. Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research
  • +1 more inside →
18 Apr 2026 · 9 min read · AI Research Digest

AI Research Digest — 18 April 2026

  • 1. Prompted CoT Early Exit Undermines the Monitoring Benefits of CoT Uncontrollability
  • 2. Anthropic repeatedly accidentally trained against the CoT, demonstrating inadequate processes
  • 3. Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research
  • +1 more inside →