UC Berkeley

university

Verified

https://www.berkeley.edu/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Xuandong submitted a paper 2 days ago

InfoSynth: Information-Guided Benchmark Synthesis for LLMs

sgoel9 authored a paper 6 days ago

SAGE: A Realistic Benchmark for Semantic Understanding

chuyishang submitted a paper 12 days ago

Latent Implicit Visual Reasoning

View all activity

Papers

InfoSynth: Information-Guided Benchmark Synthesis for LLMs

Latent Implicit Visual Reasoning

View all Papers

Xuandong

submitted a paper to Daily Papers 2 days ago

InfoSynth: Information-Guided Benchmark Synthesis for LLMs

Paper • 2601.00575 • Published 5 days ago • 1

chuyishang

submitted a paper to Daily Papers 12 days ago

Latent Implicit Visual Reasoning

Paper • 2512.21218 • Published 14 days ago • 66

cheryyunl

submitted a paper to Daily Papers 19 days ago

MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning

Paper • 2512.16909 • Published 19 days ago • 1

davidchan

authored a paper 10 months ago

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published Mar 19, 2025 • 49

wisepigeon

authored a paper over 1 year ago

Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D Reconstruction

Paper • 2409.18121 • Published Sep 26, 2024 • 8

davidchan

authored 4 papers over 1 year ago

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Paper • 2403.19822 • Published Mar 28, 2024

ALOHa: A New Measure for Hallucination in Captioning Models

Paper • 2404.02904 • Published Apr 3, 2024

Virtual Personas for Language Models via an Anthology of Backstories

Paper • 2407.06576 • Published Jul 9, 2024 • 1

Visual Haystacks: Answering Harder Questions About Sets of Images

Paper • 2407.13766 • Published Jul 18, 2024 • 2

davidchan

posted an update over 1 year ago

Post

621

🚨 Launching The Visual Haystacks (VHs) Benchmark: the first "visual-centric" Needle-In-A-Haystack (NIAH) benchmark to assess LMMs' capability in long-context visual retrieval and reasoning.

Check it out!
tsunghanwu/visual_haystacks
https://visual-haystacks.github.io/
https://arxiv.org/abs/2407.13766
https://github.com/visual-haystacks/vhs_benchmark

wisepigeon

authored a paper almost 2 years ago

GARField: Group Anything with Radiance Fields

Paper • 2401.09419 • Published Jan 17, 2024 • 21

davidchan

authored 6 papers almost 2 years ago

Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition

Paper • 2401.02417 • Published Jan 4, 2024 • 1

Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Paper • 2312.14378 • Published Dec 22, 2023

See, Say, and Segment: Teaching LMMs to Overcome False Premises

Paper • 2312.08366 • Published Dec 13, 2023

CLAIR: Evaluating Image Captions with Large Language Models

Paper • 2310.12971 • Published Oct 19, 2023

Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition

Paper • 2301.02736 • Published Jan 6, 2023

ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video

Paper • 2401.05314 • Published Jan 10, 2024 • 12

marwaa

authored a paper about 2 years ago

Moral Foundations of Large Language Models

Paper • 2310.15337 • Published Oct 23, 2023 • 1

wisepigeon

authored a paper over 2 years ago

Language Embedded Radiance Fields for Zero-Shot Task-Oriented Grasping

Paper • 2309.07970 • Published Sep 14, 2023 • 8