Ilyas Moutawwakil

IlyasMoutawwakil

AI & ML interests

Optimization, LLMs, Hardware, Backends, ...

Recent Activity

updated a model 3 days ago

optimum-internal-testing/tiny-random-TrocrForCausalLM

published a model 3 days ago

optimum-internal-testing/tiny-random-TrocrForCausalLM

updated a model 3 days ago

optimum-internal-testing/tiny-random-VisionEncoderDecoderModel-trocr

upvoted an article 5 days ago

Article

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

By

and 4 others •

7 days ago

• 44

upvoted an article 16 days ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

By

and 8 others •

Apr 29

• 39

upvoted a collection 22 days ago

Tiny dummy models

Randomly initialized tiny models for debugging/testing purpose • 112 items • Updated 3 days ago • 6

upvoted 2 articles about 1 month ago

Article

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

By

and 6 others •

Jun 12

• 125

Article

Creating custom kernels for the AMD MI300

By

and 1 other •

Jul 9

• 43

upvoted an article about 2 months ago

Article

Nano-vLLM meets Inference Endpoints

By

•

Jun 25

• 9

upvoted a collection 3 months ago

ColPali v1.3 ONNX

6 items • Updated May 22 • 1

upvoted 2 articles 4 months ago

Article

Introducing HELMET

By

and 6 others •

Apr 16

• 35

Article

Comparing sub 50GB Llama 4 Scout quants (KLD/Top P)

By

•

Apr 9

• 42

upvoted 5 articles 5 months ago

Article

Accelerating LLM Inference with TGI on Intel Gaudi

By

and 4 others •

Mar 28

• 14

Article

Open R1: Update #4

By

and 3 others •

Mar 26

• 48

Article

Universal Assisted Generation: Faster Decoding with Any Assistant Model

By

and 7 others •

Oct 29, 2024

• 57

Article

Introducing Gradio's new Dataframe!

By

and 1 other •

Mar 24

• 28

Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

By

and 8 others •

Mar 24

• 19

upvoted 3 articles 7 months ago

Article

Welcome to Inference Providers on the Hub 🔥

By

and 6 others •

Jan 28

• 487

Article

Timm ❤️ Transformers: Use any timm model with transformers

By

and 4 others •

Jan 16

• 51

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

By

and 1 other •

Jan 16

• 75

upvoted an article 12 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

By

•

Aug 22, 2024

• 90

upvoted a paper about 1 year ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 72

upvoted an article about 1 year ago

Article

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

By

and 5 others •

Mar 15, 2024

• 10