4 3 2

Rémi Ouazan Reboul

ror

AI & ML interests

None yet

Recent Activity

upvoted an article 1 day ago

Welcome GPT OSS, the new open-source model family from OpenAI!

commented on their article 10 days ago

Creating custom kernels for the AMD MI300

liked a Space 20 days ago

eustlb/transformers-audio-ci

View all activity

Organizations

upvoted an article 1 day ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

and 11 others •

2 days ago

• 387

commented on Creating custom kernels for the AMD MI300 10 days ago

I do not, but you could change the code to dispatch a 8 rows GEMM to the dense MFMA with 8 rows of padding and check the numbers then!
Your understanding of the dispatch logic is correct.

liked a Space 20 days ago

Transformers Audio Ci

🐢

Retrieve and visualize audio model CI test results

updated a dataset 28 days ago

huggingface/documentation-images

Viewer • Updated 15 minutes ago • 55 • 2.68M • 77

New activity in huggingface/documentation-images about 1 month ago

mi300kernels-image

#517 opened about 1 month ago by

ror

published an article about 1 month ago

Article

Creating custom kernels for the AMD MI300

and 1 other •

29 days ago

• 43

upvoted 2 articles about 2 months ago

Article

Sensitivity Aware Mixed Precision Quantization V1

and 1 other •

Jun 13

• 19

Article

Hugging Face and AMD partner on accelerating state-of-the-art models for CPU and GPU platforms

•

Jun 13, 2023

• 4

New activity in kernels-community/triton-layer-norm 3 months ago

Update autotune configuration to avoid crash on AMD devices

#2 opened 3 months ago by

ror

liked a Space 5 months ago

2.97k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

Rémi Ouazan Reboul

AI & ML interests

Recent Activity

Organizations

ror's activity

Welcome GPT OSS, the new open-source model family from OpenAI!

Transformers Audio Ci

mi300kernels-image

Creating custom kernels for the AMD MI300

Sensitivity Aware Mixed Precision Quantization V1

Hugging Face and AMD partner on accelerating state-of-the-art models for CPU and GPU platforms

Update autotune configuration to avoid crash on AMD devices

The Ultra-Scale Playbook