The500Feed.Live

Everything going on in AI - updated daily from 500+ sources

← Back to The 500 Feed
📄 ResearchMay 20, 2026

Linear-DPO: Linear Direct Preference Optimization for Diffusion and Flow-Matching Generative Models

Direct Preference Optimization (DPO) is successful for alignment in LLMs but still faces challenges in text-to-image generation. Existing studies are confined to denoising diffusion models while overlooking flow-matching, and suffer from an objective mismatch when applying discrete NLP-based DPO to ...

Read Original Article →

Source

http://arxiv.org/abs/2605.21123v1