DateTimeReasoning

community

AI & ML interests

None defined yet.

Recent Activity

pminervini authored a paper about 2 months ago

OpenSIR: Open-Ended Self-Improving Reasoner

pminervini authored a paper 2 months ago

Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

pminervini authored a paper 2 months ago

PosterSum: A Multimodal Benchmark for Scientific Poster Summarization

View all activity

pminervini

authored a paper about 2 months ago

OpenSIR: Open-Ended Self-Improving Reasoner

Paper • 2511.00602 • Published Nov 1 • 20

pminervini

authored 9 papers 2 months ago

Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

Paper • 2410.15438 • Published Oct 20, 2024

PosterSum: A Multimodal Benchmark for Scientific Poster Summarization

Paper • 2502.17540 • Published Feb 24 • 3

Self-Training Large Language Models for Tool-Use Without Demonstrations

Paper • 2502.05867 • Published Feb 9

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression

Paper • 2503.02812 • Published Mar 4 • 10

Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain

Paper • 2307.03042 • Published Jul 6, 2023

An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering

Paper • 2503.23415 • Published Mar 30 • 1

MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction

Paper • 2204.04779 • Published Apr 10, 2022

PiCSAR: Probabilistic Confidence Selection And Ranking

Paper • 2508.21787 • Published Aug 29 • 4

Learning GUI Grounding with Spatial Reasoning from Visual Feedback

Paper • 2509.21552 • Published Sep 25 • 11

aryopg

authored a paper 4 months ago

PiCSAR: Probabilistic Confidence Selection And Ranking

Paper • 2508.21787 • Published Aug 29 • 4

aryopg

authored 4 papers 5 months ago

Self-Training Large Language Models for Tool-Use Without Demonstrations

Paper • 2502.05867 • Published Feb 9

Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain

Paper • 2307.03042 • Published Jul 6, 2023

Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them

Paper • 2507.10616 • Published Jul 13 • 1

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27

pminervini

authored a paper 5 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27

pminervini

authored 2 papers 7 months ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15 • 54

Neurosymbolic Diffusion Models

Paper • 2505.13138 • Published May 19 • 36

rohitsaxena

authored 2 papers 8 months ago

What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations

Paper • 2502.08279 • Published Feb 12 • 1

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15 • 54