---
license: bsd-3-clause
tags:
- kernel
---
# Flash Attention 3
Flash Attention is a fast and memory-efficient implementation of the
attention mechanism, designed to work with large models and long sequences.
This is a Hugging Face-compliant kernel build of Flash Attention.
The original code is available at [https://github.com/Dao-AILab/flash-attention](https://github.com/Dao-AILab/flash-attention).
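
Below is a minimal usage sketch. It assumes the kernel is loaded with the `kernels` library's `get_kernel` entry point and exposes the upstream `flash_attn_func` interface; the Hub repo id shown is an illustrative placeholder, not confirmed by this README.

```python
import torch
from kernels import get_kernel

# Download and load the compiled kernel from the Hugging Face Hub.
# The repo id below is an assumed example for illustration.
flash_attn = get_kernel("kernels-community/flash-attn3")

# Tensors are laid out as (batch, seqlen, num_heads, head_dim),
# in half precision on a CUDA device, matching the upstream API.
q = torch.randn(1, 4096, 16, 128, dtype=torch.bfloat16, device="cuda")
k = torch.randn(1, 4096, 16, 128, dtype=torch.bfloat16, device="cuda")
v = torch.randn(1, 4096, 16, 128, dtype=torch.bfloat16, device="cuda")

# The upstream flash_attn_func returns the attention output (some
# versions also return the log-sum-exp), so unpack defensively.
out = flash_attn.flash_attn_func(q, k, v, causal=True)
if isinstance(out, tuple):
    out = out[0]
print(out.shape)  # torch.Size([1, 4096, 16, 128])
```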