OpenMOSS-Team/SmolLM-135M-MLA-d_kv_32
Text Generation
ā¢
0.1B
ā¢
Updated
ā¢
12
LLM
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models