Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
40
10
taicheng guo
taicheng
Follow
esselte974's profile picture
lx865712528's profile picture
Theartplug's profile picture
10 followers
·
56 following
AI & ML interests
None yet
Recent Activity
liked
a model
11 days ago
Qwen/Qwen3-1.7B
upvoted
a
paper
12 days ago
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
upvoted
a
paper
13 days ago
Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models
View all activity
Organizations
Papers
2
arxiv:
2402.01680
arxiv:
2305.18365
models
46
Sort: Recently updated
taicheng/zephyr-7b-align-scan-0.0-0.0-linear-1
Text Generation
•
Updated
Sep 28, 2024
•
5
taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-1
Text Generation
•
Updated
Sep 28, 2024
•
4
taicheng/zephyr-7b-align-scan-0.0-0.0-cosine-2
Text Generation
•
Updated
Sep 28, 2024
•
4
taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-2
Text Generation
•
Updated
Sep 28, 2024
•
3
taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-3
Text Generation
•
Updated
Sep 28, 2024
•
3
taicheng/zephyr-7b-align-scan-0.0-0.0-linear-3
Text Generation
•
Updated
Sep 28, 2024
•
4
taicheng/zephyr-7b-align-scan
Text Generation
•
Updated
Sep 28, 2024
•
5
taicheng/zephyr-7b-align-scan-1e-07-0.27-polynomial-1.0
Updated
Sep 28, 2024
taicheng/zephyr-7b-align-scan-7e-07-0.45-cosine-3.0
Text Generation
•
Updated
Sep 28, 2024
•
3
taicheng/zephyr-7b-align-scan-6e-07-0.53-polynomial-2.0
Text Generation
•
Updated
Sep 28, 2024
•
6
Expand 46 models
datasets
0
None public yet