Teacher Demonstrations in a BabyLM's Zone of Proximal Development for Contingent Multi-Turn Interaction Paper • 2510.20411 • Published Oct 23, 2025 • 2
ByteSpan: Information-Driven Subword Tokenisation Paper • 2506.18639 • Published Jun 23, 2025 • 3
view article Article Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm Mar 19, 2025 • 8
Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies Paper • 2410.22886 • Published Oct 30, 2024 • 1