Efficient Process Reward Model Training via Active Learning.
Sea AI Lab
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
-
Understanding R1-Zero-Like Training: A Critical Perspective
Paper β’ 2503.20783 β’ Published β’ 56 -
sail/Qwen2.5-Math-7B-Oat-Zero
Text Generation β’ 8B β’ Updated β’ 1.4k β’ β’ 6 -
sail/Qwen2.5-Math-1.5B-Oat-Zero
Text Generation β’ 2B β’ Updated β’ 943 β’ β’ 4 -
sail/Llama-3.2-3B-Oat-Zero
Text Generation β’ 3B β’ Updated β’ 28 β’ 1
Efficient Process Reward Model Training via Active Learning.
-
Understanding R1-Zero-Like Training: A Critical Perspective
Paper β’ 2503.20783 β’ Published β’ 56 -
sail/Qwen2.5-Math-7B-Oat-Zero
Text Generation β’ 8B β’ Updated β’ 1.4k β’ β’ 6 -
sail/Qwen2.5-Math-1.5B-Oat-Zero
Text Generation β’ 2B β’ Updated β’ 943 β’ β’ 4 -
sail/Llama-3.2-3B-Oat-Zero
Text Generation β’ 3B β’ Updated β’ 28 β’ 1
spaces
7
Running
on
Zero
26
Sailor2 20B Chat
π±
Chat with Sailor2 for detailed answers in multiple languages
Running
11
Scaling With Vocab Demo
π
Predict optimal vocabulary size for models
Running
4
Pipeline Parallellism with Controllable Memory
π
Calculate and visualize pipeline schedules
Running
21
Zero Bubble Pipeline Parallellism
π
Calculate and visualize pipeline schedules
Running
6
RegMix
π
Generate predictions and visualize regression results from CSV data
Runtime error
6
Sailor 14B Chat
β
Generate responses to text questions in multiple languages
models
79

sail/longspec-Llama-3-8B-Instruct-262k
Text Generation
β’
0.3B
β’
Updated
β’
9

sail/longspec-QwQ-32B-Preview
Text Generation
β’
0.6B
β’
Updated
β’
5

sail/longspec-vicuna-13b-v1.5-16k
Text Generation
β’
0.4B
β’
Updated
β’
5

sail/longspec-longchat-13b-16k
Text Generation
β’
0.4B
β’
Updated
β’
7

sail/longspec-vicuna-7b-v1.5-16k
Text Generation
β’
0.3B
β’
Updated
β’
4

sail/longspec-longchat-7b-v1.5-32k
Text Generation
β’
0.3B
β’
Updated
β’
3

sail/Qwen2.5-Math-7B-Oat-Zero
Text Generation
β’
8B
β’
Updated
β’
1.4k
β’
β’
6

sail/Qwen2.5-Math-1.5B-Oat-Zero
Text Generation
β’
2B
β’
Updated
β’
943
β’
β’
4

sail/Llama-3.2-3B-Oat-Zero
Text Generation
β’
3B
β’
Updated
β’
28
β’
1

sail/Sailor2-20B
Text Generation
β’
19B
β’
Updated
β’
121
β’
10
datasets
7
sail/longspec-data
Preview
β’
Updated
β’
81
β’
1
sail/ActPRMData
Viewer
β’
Updated
β’
663k
β’
79
β’
1
sail/regmix-data
Viewer
β’
Updated
β’
13.7M
β’
975
β’
4
sail/regmix-data-sample
Viewer
β’
Updated
β’
698k
β’
180
β’
2
sail/Sailcompass_data
Preview
β’
Updated
β’
57
sail/sailcraft_lm_resource
Updated
β’
296
β’
1
sail/symbolic-instruction-tuning
Viewer
β’
Updated
β’
875k
β’
370
β’
15