Add QLoRA + FSDP Docs (#1403)
* pre commit
* Update fsdp_qlora.md
- README.md +4 -0
- docs/fsdp_qlora.md +37 -0
README.md
CHANGED
````diff
@@ -1080,6 +1080,10 @@ fsdp_config:
   fsdp_transformer_layer_cls_to_wrap: LlamaDecoderLayer
 ```
 
+##### FSDP + QLoRA
+
+Axolotl supports training with FSDP and QLoRA; see [these docs](docs/fsdp_qlora.md) for more information.
+
 ##### Weights & Biases Logging
 
 Make sure your `WANDB_API_KEY` environment variable is set (recommended) or you login to wandb with `wandb login`.
````
docs/fsdp_qlora.md
ADDED
@@ -0,0 +1,37 @@
# FSDP + QLoRA

## Background

Using FSDP with QLoRA is essential for **fine-tuning larger (70b+ parameter) LLMs on consumer GPUs.** For example, you can use FSDP + QLoRA to train a 70b model on two 24GB GPUs[^1].

Below, we describe how to use this feature in Axolotl.

## Usage

To enable `QLoRA` with `FSDP`, you need to perform the following steps:

> [!TIP]
> See the [example config](#example-config) file in addition to reading these instructions.

1. Set `adapter: qlora` in your axolotl config file.
2. Enable FSDP in your axolotl config, as [described here](https://github.com/OpenAccess-AI-Collective/axolotl?tab=readme-ov-file#fsdp) and sketched below.
3. Use one of the supported model types: `llama`, `mistral` or `mixtral`.

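
As a rough illustration of steps 1–3, the combined settings might look like the sketch below. Only `adapter: qlora` and the `fsdp_config` / `fsdp_transformer_layer_cls_to_wrap` keys come from this PR and the README section above; `base_model`, `load_in_4bit`, and the `fsdp` sharding options are assumptions based on typical axolotl QLoRA configs, so defer to the example config linked below.

```yaml
# Minimal sketch of a QLoRA + FSDP axolotl config (llama model type).
# `base_model`, `load_in_4bit`, and the `fsdp` list are illustrative assumptions;
# the authoritative version is examples/llama-2/qlora-fsdp.yml.
base_model: NousResearch/Llama-2-7b-hf  # any supported llama/mistral/mixtral checkpoint

adapter: qlora        # step 1: train QLoRA adapters
load_in_4bit: true    # QLoRA keeps the base weights quantized to 4-bit

fsdp:                 # step 2: shard the model with FSDP
  - full_shard
  - auto_wrap
fsdp_config:
  fsdp_transformer_layer_cls_to_wrap: LlamaDecoderLayer  # wrap each decoder layer (see README)
```
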
## Example Config

[examples/llama-2/qlora-fsdp.yml](../examples/llama-2/qlora-fsdp.yml) contains an example of how to enable QLoRA + FSDP in axolotl.

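Assuming the usual axolotl entry point at the time of this PR, a run with that config would be launched through `accelerate`; treat the exact command as an assumption and check the project README's quickstart for the current invocation.

```bash
# Assumed launch command: accelerate sets up the multi-GPU processes,
# and axolotl.cli.train reads the YAML config above.
accelerate launch -m axolotl.cli.train examples/llama-2/qlora-fsdp.yml
```
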
## References

- [PR #1378](https://github.com/OpenAccess-AI-Collective/axolotl/pull/1378) enabling QLoRA with FSDP in Axolotl.
- [Blog Post](https://www.answer.ai/posts/2024-03-06-fsdp-qlora.html) from the [Answer.AI](https://www.answer.ai/) team describing the work that enabled QLoRA with FSDP.
- Related HuggingFace PRs enabling FSDP + QLoRA:
  - Accelerate [PR#2544](https://github.com/huggingface/accelerate/pull/2544)
  - Transformers [PR#29587](https://github.com/huggingface/transformers/pull/29587)
  - TRL [PR#1416](https://github.com/huggingface/trl/pull/1416)
  - PEFT [PR#1550](https://github.com/huggingface/peft/pull/1550)

[^1]: This was enabled by [this work](https://www.answer.ai/posts/2024-03-06-fsdp-qlora.html) from the Answer.AI team.