Felix Tuma's picture

95 58

Felix Tuma

floom

·

AI & ML interests

NLP

Recent Activity

updated a collection about 24 hours ago

PotentialApplication

upvoted a paper 4 days ago

LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

upvoted a paper 4 days ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

View all activity

Organizations

None yet

updated a collection about 24 hours ago

PotentialApplication

32 items • Updated about 24 hours ago

upvoted 2 papers 4 days ago

LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

Paper • 2508.01780 • Published 8 days ago • 12

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published 10 days ago • 191

upvoted a paper 14 days ago

Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published 21 days ago • 57

upvoted a paper 17 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 19 days ago • 279

upvoted a paper 19 days ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published 20 days ago • 116

upvoted a paper 21 days ago

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR

Paper • 2507.15778 • Published 21 days ago • 19

updated a collection 21 days ago

PotentialApplication

32 items • Updated about 24 hours ago

upvoted a paper 25 days ago

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Paper • 2507.11412 • Published 27 days ago • 25

upvoted 3 papers 29 days ago

Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Paper • 2507.07095 • Published Jul 9 • 54

MIRIX: Multi-Agent Memory System for LLM-Based Agents

Paper • 2507.07957 • Published Jul 10 • 66

Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models

Paper • 2507.07484 • Published Jul 10 • 17

upvoted a paper about 1 month ago

ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention

Paper • 2507.01004 • Published Jul 1 • 10

updated a collection about 1 month ago

PotentialApplication

32 items • Updated about 24 hours ago

upvoted a collection about 1 month ago

Skywork-Reward-V2

Scaling preference data curation to the extreme • 9 items • Updated Jul 4 • 23

upvoted 3 papers about 1 month ago

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Paper • 2506.19807 • Published Jun 24 • 7

Orthogonal Finetuning Made Scalable

Paper • 2506.19847 • Published Jun 24 • 7

Can Large Language Models Capture Human Annotator Disagreements?

Paper • 2506.19467 • Published Jun 24 • 18

upvoted a paper about 2 months ago

EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction

Paper • 2506.12015 • Published Jun 13 • 4

updated a collection about 2 months ago

ShowAndTell

66 items • Updated Jun 25