arxiv:2602.10210
Junhong Lin
junhongmit
AI & ML interests
None yet
Recent Activity
authored
a paper
about 21 hours ago
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
authored
a paper
about 21 hours ago
Temporal Reasoning with Large Language Models Augmented by Evolving Knowledge Graphs
authored
a paper
about 21 hours ago
How Much Reasoning Do Retrieval-Augmented Models Add beyond LLMs? A Benchmarking Framework for Multi-Hop Inference over Hybrid Knowledge
Organizations
None yet