xAI

company

Verified

xai

xai-org

Activity Feed Request to join this org

AI & ML interests

Understand the universe

ethanhe

authored a paper 4 months ago

Training Video Foundation Models with NVIDIA NeMo

Paper • 2503.12964 • Published Mar 17 • 7

liuhaotian

authored a paper 4 months ago

AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference

Paper • 2504.10326 • Published Apr 14 • 26

yuchenlin

authored a paper 4 months ago

CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation

Paper • 2504.00043 • Published Mar 30 • 10

yuchenlin

authored 2 papers 6 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 39

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published Feb 3 • 18

ethanhe

authored a paper 7 months ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7 • 81

yuchenlin

authored a paper 9 months ago

On Memorization of Large Language Models in Logical Reasoning

Paper • 2410.23123 • Published Oct 30, 2024 • 18

ethanhe

authored 6 papers 10 months ago

Epipolar Transformers

Paper • 2005.04551 • Published May 10, 2020 • 1

Feature Selective Anchor-Free Module for Single-Shot Object Detection

Paper • 1903.00621 • Published Mar 2, 2019

AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Paper • 1802.03494 • Published Feb 10, 2018

Channel Pruning for Accelerating Very Deep Neural Networks

Paper • 1707.06168 • Published Jul 19, 2017

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 53

Upcycling Large Language Models into Mixture of Experts

Paper • 2410.07524 • Published Oct 10, 2024 • 4

YikangS

authored 4 papers 12 months ago

The infrastructure powering IBM's Gen AI model development

Paper • 2407.05467 • Published Jul 7, 2024 • 2

Scaling Granite Code Models to 128K Context

Paper • 2407.13739 • Published Jul 18, 2024 • 20

FlexAttention for Efficient High-Resolution Vision-Language Models

Paper • 2407.20228 • Published Jul 29, 2024 • 1

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

Paper • 2408.13359 • Published Aug 23, 2024 • 25

YikangS

authored a paper about 1 year ago

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26, 2024 • 49

liuhaotian

authored a paper about 1 year ago

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Paper • 2406.18521 • Published Jun 26, 2024 • 30

yuchenlin

authored a paper about 1 year ago

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26, 2024 • 13