The500Feed.Live

Everything going on in AI - updated daily from 500+ sources

← Back to The 500 Feed
📄 ResearchMay 13, 2026

Trajectory-Level Data Augmentation for Offline Reinforcement Learning

We propose a data augmentation method for offline reinforcement learning, motivated by active positioning problems. Particularly, our approach enables the training of off-policy models from a limited number of suboptimal trajectories. We introduce a trajectory-based augmentation technique that explo...

Read Original Article →

Source

http://arxiv.org/abs/2605.13401v1