#Reinforcement Learning

Total 2 articles

AlphaGo Can't Beat a Matchstick Game. That's a Problem.

A new paper proves that the self-play training method behind AlphaGo and AlphaZero structurally fails on a whole category of games. What that means for AI systems making real-world decisions.

Mar 14, 2026 · Doyun Han

TechEN

OpenAI Deploys AI 'Red Team' to Harden ChatGPT Atlas Against Prompt Injection Attacks

OpenAI is using automated red teaming with reinforcement learning to strengthen ChatGPT Atlas against prompt injection attacks, creating a proactive loop to discover and patch exploits early.

Dec 22, 2025 · Doyun Han