A 7B LVLM with a 128K context window and generalization to 512K through long-context continued pre-training
Zhaowei Wang
ZhaoweiWang
AI & ML interests
NLP
Recent Activity
commented on a paper 4 days ago
Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context