EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines Paper • 2601.09465 • Published Jan 2026 • 38
FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models Paper • 2308.09975 • Published Aug 19, 2023
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning Paper • 2503.16252 • Published Mar 20, 2025 • 29
FinGAIA: A Chinese Benchmark for AI Agents in Real-World Financial Domain Paper • 2507.17186 • Published Jul 23, 2025 • 1
LightAgent: Production-level Open-source Agentic AI Framework Paper • 2509.09292 • Published Sep 11, 2025
FinTeam: A Multi-Agent Collaborative Intelligence System for Comprehensive Financial Scenarios Paper • 2507.10448 • Published Jul 5, 2025
BizFinBench.v2: A Unified Dual-Mode Bilingual Benchmark for Expert-Level Financial Capability Alignment Paper • 2601.06401 • Published Jan 2026 • 9
FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments Paper • 2601.07853 • Published Jan 2026 • 7
FinEval-KR: A Financial Domain Evaluation Framework for Large Language Models' Knowledge and Reasoning Paper • 2506.21591 • Published Jun 18, 2025
VisFinEval: A Scenario-Driven Chinese Multimodal Benchmark for Holistic Financial Understanding Paper • 2508.09641 • Published Aug 13, 2025