An example script showing how to use FlashAttention for inference is coming soon.