Collections including paper arxiv:2402.14905

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 133
facebook/MobileLLM-125M

Text Generation • Updated May 5 • 2.53k • 120
facebook/MobileLLM-350M

Text Generation • Updated May 5 • 219 • 35
facebook/MobileLLM-600M

Text Generation • Updated May 5 • 1.27k • 29

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 133

on-Device (phone)

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 257
MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 85
SlimLM: An Efficient Small Language Model for On-Device Document Assistance

Paper • 2411.09944 • Published Nov 15, 2024 • 12
MobileQuant: Mobile-friendly Quantization for On-device Language Models

Paper • 2408.13933 • Published Aug 25, 2024 • 16

on device use case

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 133

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 133

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 77
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 19
RoBERTa: A Robustly Optimized BERT Pretraining Approach

Paper • 1907.11692 • Published Jul 26, 2019 • 9
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

Paper • 1910.01108 • Published Oct 2, 2019 • 17

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 133

Anycoder 🏢: Generate HTML/CSS/JS code for web applications

Space • Running • 2.55k
Qwen2.5 Coder Artifacts 🐢: Generate application code with Qwen2.5-Coder-32B

Space • Runtime error • 274
QwQ-32B-Preview 🔍

Space • Running • 922
Open LLM Leaderboard 🏆: Track, rank and evaluate open LLMs and chatbots

Space • Running on CPU Upgrade • 13.4k

SLM - small language models

A Survey of Small Language Models

Paper • 2410.20011 • Published Oct 25, 2024 • 45
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 133
HuggingFaceTB/SmolLM2-1.7B-Instruct-GGUF

Text Generation • 2B • Updated Nov 5, 2024 • 3.58k • 42
OpenGVLab/Mini-InternVL-Chat-2B-V1-5

Image-Text-to-Text • 2B • Updated Mar 25 • 13.7k • 72

Head-wise Shareable Attention for Large Language Models

Paper • 2402.11819 • Published Feb 19, 2024 • 1
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 133

