Kye Gomez

kye

https://discord.gg/qUtxnK2NMf

kyegomezb

AI & ML interests

Neuroscience, Behavior Science, Anti-Matter, Anti-Gravity propulsion

kye's activity

upvoted 5 papers about 7 hours ago

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published 4 days ago • 26

Unified Continuous Generative Models

Paper • 2505.07447 • Published 1 day ago • 31

Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published 1 day ago • 34

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published 1 day ago • 53

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published 2 days ago • 84

upvoted 3 papers 1 day ago

Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

Paper • 2505.06046 • Published 4 days ago • 11

Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models

Paper • 2505.02686 • Published 8 days ago • 12

UniVLA: Learning to Act Anywhere with Task-centric Latent Actions

Paper • 2505.06111 • Published 4 days ago • 19

upvoted 4 papers 4 days ago

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published 6 days ago • 127

upvoted 8 papers 5 days ago

X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

Paper • 2505.03981 • Published 7 days ago • 14

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published 5 days ago • 22

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Paper • 2505.02847 • Published 12 days ago • 24

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published 6 days ago • 71

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published 6 days ago • 18

Benchmarking LLMs' Swarm intelligence

Paper • 2505.04364 • Published 7 days ago • 18

R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training

Paper • 2505.00358 • Published 13 days ago • 20

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published 6 days ago • 55