Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
hlzhang109's picture
3 2

hlzhang109

hlzhang109
21world's profile picture cyberpunk1636's profile picture
·

AI & ML interests

None yet

Recent Activity

commented on a paper about 2 months ago
Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning
liked a Space 6 months ago
nanotron/ultrascale-playbook
authored a paper 8 months ago
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark
View all activity

Organizations

Harvard University's profile picture DataComp 's profile picture ZhentingNLP's profile picture rlsamplingJF's profile picture

authored 5 papers 8 months ago

Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark

Paper • 2304.03279 • Published Apr 6, 2023 • 2

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

Paper • 2406.10670 • Published Jun 15, 2024 • 4

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 54

Eliminating Position Bias of Language Models: A Mechanistic Approach

Paper • 2407.01100 • Published Jul 1, 2024 • 9

Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

Paper • 2412.02674 • Published Dec 3, 2024
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs