IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property Paper β’ 2504.15524 β’ Published Apr 22 β’ 4
VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning Paper β’ 2504.19627 β’ Published Apr 28
CLaSp: In-Context Layer Skip for Self-Speculative Decoding Paper β’ 2505.24196 β’ Published May 30 β’ 13
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving Paper β’ 2507.23726 β’ Published 14 days ago β’ 106
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper β’ 2508.02193 β’ Published 11 days ago β’ 120
Efficient Agents: Building Effective Agents While Reducing Cost Paper β’ 2508.02694 β’ Published 21 days ago β’ 79
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper β’ 2508.06471 β’ Published 6 days ago β’ 134
WideSearch: Benchmarking Agentic Broad Info-Seeking Paper β’ 2508.07999 β’ Published 3 days ago β’ 93
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning Paper β’ 2507.12841 β’ Published 29 days ago β’ 40
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning Paper β’ 2507.22607 β’ Published 15 days ago β’ 44
Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation Paper β’ 2406.03151 β’ Published Jun 5, 2024
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper β’ 2407.14933 β’ Published Jul 20, 2024 β’ 12
MMRA: A Benchmark for Multi-granularity Multi-image Relational Association Paper β’ 2407.17379 β’ Published Jul 24, 2024 β’ 3