Pouya Esmaeili's picture

8 4

Pouya Esmaeili

Pouyae

·

https://pouyae.xyz

AI & ML interests

RAG/LLM/Agents

Organizations

None yet

upvoted a collection 4 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 1 day ago • 100

upvoted 3 articles 7 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

Dec 9, 2022

•

384

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30

•

201

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16

•

56

upvoted 2 collections 7 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 649

Phi-4

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10 • 192

upvoted a paper almost 2 years ago

Grandmaster-Level Chess Without Search

Paper • 2402.04494 • Published Feb 7, 2024 • 69

upvoted a paper about 2 years ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 260