Add README
README.md
ADDED
@@ -0,0 +1,13 @@
+---
+license: bsd-3-clause
+tags:
+- kernel
+---
+
+# Flash Attention 3
+
+Flash Attention is a fast and memory-efficient implementation of the
+attention mechanism, designed to work with large models and long sequences.
+This is a Hugging Face compliant kernel build of Flash Attention.
+
+The original code is available here: [https://github.com/Dao-AILab/flash-attention](https://github.com/Dao-AILab/flash-attention).
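
For context: kernel builds published in this layout are typically loaded at runtime with the Hugging Face `kernels` library rather than compiled locally. Below is a minimal sketch of that flow; the repo id `kernels-community/flash-attn3` and the `flash_attn_func` entry point (with its upstream-style signature) are assumptions, not confirmed by this commit.

```python
# Minimal sketch: loading a Hugging Face compliant kernel build with the
# `kernels` library. The repo id and the flash_attn_func entry point
# (signature borrowed from upstream flash-attention) are assumptions.
import torch
from kernels import get_kernel

# Downloads the prebuilt binary matching the local torch/CUDA setup.
flash_attn3 = get_kernel("kernels-community/flash-attn3")

# Dummy inputs: (batch, seqlen, num_heads, head_dim), half precision on GPU.
q = torch.randn(1, 1024, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn(1, 1024, 8, 64, device="cuda", dtype=torch.float16)
v = torch.randn(1, 1024, 8, 64, device="cuda", dtype=torch.float16)

# Some builds return (out, softmax_lse); adjust the unpacking accordingly.
out = flash_attn3.flash_attn_func(q, k, v, causal=True)
```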