Changxin Tian

ChangxinTian
AI & ML interests

None yet

Organizations

None yet

authored 2 papers 2 months ago

MergeMix: Optimizing Mid-Training Data Mixtures via Learnable Model Merging

Paper • 2601.17858 • Published Jan 25 • 1

MaP: A Unified Framework for Reliable Evaluation of Pre-training Dynamics

Paper • 2510.09295 • Published Oct 10, 2025

authored a paper 5 months ago

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published Oct 25, 2025 • 85

authored 5 papers 6 months ago

Arrows of Math Reasoning Data Synthesis for Large Language Models: Diversity, Complexity and Correctness

Paper • 2508.18824 • Published Aug 26, 2025 • 1

Toward Stable and Consistent Evaluation Results: A New Methodology for Base Model Evaluation

Paper • 2503.00812 • Published Mar 2, 2025

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Paper • 2503.05139 • Published Mar 7, 2025 • 6

Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models

Paper • 2507.17702 • Published Jul 23, 2025 • 6

WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training

Paper • 2507.17634 • Published Jul 23, 2025 • 2