Spaces:
Running
Running
wujun
commited on
Commit
·
18d999c
1
Parent(s):
8e64547
update link of gg_init in README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,7 @@ short_description: The code of gg prior.
|
|
12 |
# Introduction
|
13 |
- Reference: It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs
|
14 |
- Authors: Jun Wu, Yirong Xiong, Jiangtao Wen, Yuxing Han
|
15 |
-
- Paper Link: [https://
|
16 |
|
17 |
This repository provides a complete implementation of the methods described in the corresponding paper. Specifically, we implement the Generalized Gaussian Initialization, DeepShape, and the RF8 floating-point format as proposed in the paper. Furthermore, we adapt and reproduce the BackSlash training algorithm, and incorporate it seamlessly into our framework based on generalized Gaussian priors.
|
18 |
|
|
|
12 |
# Introduction
|
13 |
- Reference: It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs
|
14 |
- Authors: Jun Wu, Yirong Xiong, Jiangtao Wen, Yuxing Han
|
15 |
+
- Paper Link: [https://arxiv.org/abs/2506.00486](https://arxiv.org/abs/2506.00486)
|
16 |
|
17 |
This repository provides a complete implementation of the methods described in the corresponding paper. Specifically, we implement the Generalized Gaussian Initialization, DeepShape, and the RF8 floating-point format as proposed in the paper. Furthermore, we adapt and reproduce the BackSlash training algorithm, and incorporate it seamlessly into our framework based on generalized Gaussian priors.
|
18 |
|