kevin009
/

llama322

kevin009 commited on Dec 29, 2024

Commit

0f271d5

verified ·

1 Parent(s): e639790

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,22 +1,21 @@
----
-base_model: kevin009/llama322
-tags:
-- text-generation-inference
-- transformers
-- unsloth
-- llama
-- trl
-license: apache-2.0
-language:
-- en
----
-# Uploaded  model
-- **Developed by:** kevin009
-- **License:** apache-2.0
-- **Finetuned from model :** kevin009/llama322
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

+# kevin009/llama322
+## Model Description
+This is a LoRA-tuned version of kevin009/llama322 using KTO (Knowledge Transfer Optimization).
+## Training Parameters
+- Learning Rate: 5e-06
+- Batch Size: 1
+- Training Steps: 2043
+- LoRA Rank: 16
+- Training Date: 2024-12-29
+## Usage
+```python
+from peft import AutoPeftModelForCausalLM
+from transformers import AutoTokenizer
+model = AutoPeftModelForCausalLM.from_pretrained("kevin009/llama322", token="YOUR_TOKEN")
+tokenizer = AutoTokenizer.from_pretrained("kevin009/llama322")
+```