kevin009 committed on
Commit
0f271d5
·
verified ·
1 Parent(s): e639790

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +17 -18
README.md CHANGED
@@ -1,22 +1,21 @@
- ---
- base_model: kevin009/llama322
- tags:
- - text-generation-inference
- - transformers
- - unsloth
- - llama
- - trl
- license: apache-2.0
- language:
- - en
- ---

- # Uploaded model

- - **Developed by:** kevin009
- - **License:** apache-2.0
- - **Finetuned from model :** kevin009/llama322

- This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

+ # kevin009/llama322

+ ## Model Description
+ This is a LoRA-tuned version of kevin009/llama322, trained with KTO (Kahneman-Tversky Optimization).

+ ## Training Parameters
+ - Learning Rate: 5e-06
+ - Batch Size: 1
+ - Training Steps: 2043
+ - LoRA Rank: 16
+ - Training Date: 2024-12-29

+ ## Usage
+ ```python
+ from peft import AutoPeftModelForCausalLM
+ from transformers import AutoTokenizer
+
+ model = AutoPeftModelForCausalLM.from_pretrained("kevin009/llama322", token="YOUR_TOKEN")
+ tokenizer = AutoTokenizer.from_pretrained("kevin009/llama322")
+ ```
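
The usage snippet in the new README stops after loading the model and tokenizer. A minimal sketch of how generation might follow is shown here. The helper name `generate_reply`, its parameters, and the default of 64 new tokens are illustrative assumptions, not part of the commit. It also assumes the repository holds a PEFT adapter that `AutoPeftModelForCausalLM` can resolve, and that the same access token works for the tokenizer.

```python
def generate_reply(prompt, repo_id="kevin009/llama322", token=None, max_new_tokens=64):
    """Load the adapter and tokenizer, then generate a completion for `prompt`.

    Hypothetical helper: the name, arguments, and defaults are assumptions
    made for illustration only.
    """
    # Imported lazily so that defining this helper does not require the
    # heavy dependencies to be installed.
    from peft import AutoPeftModelForCausalLM
    from transformers import AutoTokenizer

    # Same loading calls as in the README; `token` is only needed for
    # private or gated repositories.
    model = AutoPeftModelForCausalLM.from_pretrained(repo_id, token=token)
    tokenizer = AutoTokenizer.from_pretrained(repo_id, token=token)

    # Tokenize the prompt into PyTorch tensors and generate a continuation.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Decode the first (and only) sequence, dropping special tokens.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_reply("Hello", token="YOUR_TOKEN")` would download the weights on first use, so it needs network access and enough memory for the base model.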