🏗️ Building on HF
Ashish Soni
ashish-soni08
AI & ML interests
None yet
Recent Activity
liked a model 7 days ago
nvidia/nemotron-3.5-asr-streaming-0.6b liked a model 13 days ago
empero-ai/Qwythos-9B-Claude-Mythos-5-1M liked a model 16 days ago
unsloth/GLM-5.2-GGUFOrganizations
How_LLMS_Think _and_Reason_Papers
Embedding_Models
MoE_Models
Pre-Training-Data-for-LLMs
Open-Source Datasets that have been employed for pre-training Large Language Models
Reasoning Datasets
Microsoft Models
Leaderboards
Meta AI
Models released by Meta
-
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 9.29M • • 6.22k -
meta-llama/Llama-3.1-405B
Text Generation • 406B • Updated • 244k • • 978 -
meta-llama/Llama-3.1-405B-Instruct
Text Generation • 406B • Updated • 209k • 596 -
meta-llama/Llama-3.1-8B
Text Generation • 8B • Updated • 1.53M • • 2.3k
Privacy_Masking_for_LLMs
Function Calling
Reasoning Datasets
How_LLMS_Think _and_Reason_Papers
Microsoft Models
Embedding_Models
Leaderboards
MoE_Models
Meta AI
Models released by Meta
-
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 9.29M • • 6.22k -
meta-llama/Llama-3.1-405B
Text Generation • 406B • Updated • 244k • • 978 -
meta-llama/Llama-3.1-405B-Instruct
Text Generation • 406B • Updated • 209k • 596 -
meta-llama/Llama-3.1-8B
Text Generation • 8B • Updated • 1.53M • • 2.3k
Pre-Training-Data-for-LLMs
Open-Source Datasets that have been employed for pre-training Large Language Models
Privacy_Masking_for_LLMs