Ye Zhiling

yzlnew

https://yzlnew.com

AI & ML interests

Data → Pre-train → Post-train

Recent Activity

updated a model 6 days ago

AQ-MedAI/Kimi-K2-Instruct-eagle3

liked a Space about 1 month ago

lvwerra/distill-blog-template

authored a paper 2 months ago

Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended Reasoning

authored a paper 2 months ago

Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended Reasoning

Paper • 2509.25534 • Published Sep 19 • 2

authored a paper 3 months ago

MedResearcher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework

Paper • 2508.14880 • Published Aug 20 • 15

authored a paper 5 months ago

Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment

Paper • 2508.07750 • Published Aug 11 • 19