The500Feed.Live
Everything going on in AI - updated daily from 500+ sources
📄 ResearchMay 26, 2026
Adversarial Dual On-Policy Distillation from Expressive Flow-based Teacher
Learning from demonstrations in embodied control is often cast as behavioral cloning, and recent diffusion or flow-matching policies improve this paradigm by modeling multi-modal expert actions. Yet these methods remain offline supervised learners: the policy is trained only on expert states and rec...
Read Original Article →