FSMBench

university

Activity Feed

AI & ML interests

Evaluating and Benchmarking Large Multimodal Models

Recent Activity

taesiri submitted a paper about 16 hours ago

Streaming Video Instruction Tuning

taesiri submitted a paper about 16 hours ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

taesiri submitted a paper about 16 hours ago

NVIDIA Nemotron 3: Efficient and Open Intelligence

View all activity

taesiri

submitted 4 papers to Daily Papers about 16 hours ago

Streaming Video Instruction Tuning

Paper • 2512.21334 • Published about 23 hours ago • 4

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published 2 days ago • 14

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published 1 day ago • 11

LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics

Paper • 2512.21010 • Published 1 day ago

taesiri

submitted 4 papers to Daily Papers 1 day ago

SpatialTree: How Spatial Abilities Branch Out in MLLMs

Paper • 2512.20617 • Published 2 days ago • 39

QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models

Paper • 2512.19526 • Published 3 days ago • 6

SAM Audio: Segment Anything in Audio

Paper • 2512.18099 • Published 6 days ago • 12

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published 2 days ago • 48

taesiri

submitted 5 papers to Daily Papers 3 days ago

StoryMem: Multi-shot Long Video Storytelling with Memory

Paper • 2512.19539 • Published 3 days ago • 15

Name That Part: 3D Part Segmentation and Naming

Paper • 2512.18003 • Published 6 days ago • 3

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 3 days ago • 60

MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments

Paper • 2512.19432 • Published 3 days ago • 10

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

Paper • 2512.19682 • Published 3 days ago • 14

taesiri

submitted 5 papers to Daily Papers 4 days ago

Animate Any Character in Any World

Paper • 2512.17796 • Published 7 days ago • 10

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Paper • 2512.16969 • Published 7 days ago • 103

SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories

Paper • 2512.17419 • Published 6 days ago • 9

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published 6 days ago • 47

When Reasoning Meets Its Laws

Paper • 2512.17901 • Published 6 days ago • 53

taesiri

submitted 2 papers to Daily Papers 7 days ago

TabReX : Tabular Referenceless eXplainable Evaluation

Paper • 2512.15907 • Published 8 days ago • 1

Adaptation of Agentic AI

Paper • 2512.16301 • Published 7 days ago • 92

AI & ML interests

Recent Activity

Team members 5

FSMBench's activity