i-SRT: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective Judgment Paper • 2406.11280 • Published Jun 17, 2024
Story Visualization by Online Text Augmentation with Context Memory Paper • 2308.07575 • Published Aug 15, 2023
Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback Paper • 2402.03746 • Published Feb 6, 2024