Eric Lan

Eric-Lan

https://ericglan.github.io/

AI & ML interests

Reinforcement Fine-Tuning, Reinforcement Learning, RLHF/VR, LLM Alignment, Reasoning, Diffusion Model, Speculative Decoding, Federated Learning

Recent Activity

liked a model 18 days ago

huseyinatahaninan/Qwen2.5-7B-Instruct-CI

liked a dataset about 1 month ago

Eric-Lan/healthbench_axe

updated a dataset about 1 month ago

Eric-Lan/healthbench_axe

View all activity

Organizations

liked a model 18 days ago

huseyinatahaninan/Qwen2.5-7B-Instruct-CI

8B • Updated 24 days ago • 37 • 1

liked a dataset about 1 month ago

Eric-Lan/healthbench_axe

Viewer • Updated Nov 15 • 16.7k • 27 • 1

updated a dataset about 1 month ago

Eric-Lan/healthbench_axe

Viewer • Updated Nov 15 • 16.7k • 27 • 1

published a dataset about 1 month ago

Eric-Lan/healthbench_axe

Viewer • Updated Nov 15 • 16.7k • 27 • 1

updated a dataset about 1 month ago

Eric-Lan/healthbench

Viewer • Updated Nov 14 • 5k • 6 • 1

liked a dataset about 1 month ago

huseyinatahaninan/ContextualIntegritySyntheticDataset

Viewer • Updated Nov 14 • 729 • 52 • 1

liked a dataset 2 months ago

Eric-Lan/healthbench

Viewer • Updated Nov 14 • 5k • 6 • 1

published a dataset 2 months ago

Eric-Lan/healthbench

Viewer • Updated Nov 14 • 5k • 6 • 1

authored a paper 5 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27 • 14

upvoted a paper 5 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27 • 14

commented a paper 5 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27 • 14 •

upvoted a paper 7 months ago

Contextual Integrity in LLMs via Reasoning and Reinforcement Learning

Paper • 2506.04245 • Published May 29 • 4

commented a paper 7 months ago

Contextual Integrity in LLMs via Reasoning and Reinforcement Learning

Paper • 2506.04245 • Published May 29 • 4 •

New activity in Proactive-LMM/train 10 months ago

[bot] Conversion to Parquet

#1 opened 10 months ago by

parquet-converter

liked a model about 1 year ago

Eric-Lan/stack-llama-2

Text Generation • 7B • Updated Apr 28, 2024 • 8 • 1

upvoted a paper about 1 year ago

SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

Paper • 2410.05255 • Published Oct 7, 2024 • 5

authored a paper about 1 year ago

SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

Paper • 2410.05255 • Published Oct 7, 2024 • 5

liked a model about 1 year ago

DwanZhang/SePPO

Text-to-Image • Updated Oct 15, 2024 • 31 • 4

upvoted a paper over 1 year ago

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Paper • 2407.00617 • Published Jun 30, 2024 • 7

updated a model over 1 year ago

Eric-Lan/stack-llama-2

Text Generation • 7B • Updated Apr 28, 2024 • 8 • 1

Eric Lan

AI & ML interests

Recent Activity

Organizations

Eric-Lan's activity

[bot] Conversion to Parquet