3 15 22

AlphaSue

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

upvoted an article 14 days ago

Open-source DeepResearch – Freeing our search agents

new activity 15 days ago

tokyotech-llm/swallow-math:Why the data only has answers without questions?

View all activity

Organizations

None yet

liked 3 models 4 months ago

liked a Space 6 months ago

116

TxT360: Trillion Extracted Text

📖

Create a large-scale deduplicated text dataset for LLM training

liked a model 6 months ago

jinaai/ReaderLM-v2

Text Generation • 2B • Updated Mar 4 • 17.6k • • 682

liked a Space 6 months ago

3.05k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 8 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 17 • 34

liked a model 8 months ago

open-web-math/filtering-models

Updated Nov 2, 2023 • 9

liked a dataset 8 months ago

m-a-p/FineFineWeb

Viewer • Updated Dec 19, 2024 • 4.89B • 109k • 73

liked 2 models 12 months ago

nvidia/quality-classifier-deberta

0.2B • Updated Jan 31 • 1.07k • 65

oliverguhr/fullstop-punctuation-multilang-large

Token Classification • 0.6B • Updated Nov 16, 2023 • 932k • • 168

liked a dataset about 1 year ago

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 2.48k • 745

liked a model about 1 year ago

Snowflake/snowflake-arctic-embed-m

liked a Space about 1 year ago

1.03k

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a dataset about 1 year ago

liwu/MNBVC

Updated 2 days ago • 16.1k • 557

liked 3 datasets over 1 year ago

togethercomputer/RedPajama-Data-1T

Viewer • Updated Jun 17, 2024 • 1.73M • 3.43k • 1.1k

allenai/dolma

Updated Apr 17, 2024 • 684 • 929

HuggingFaceFW/fineweb

Viewer • Updated Jul 11 • 52.5B • 411k • 2.3k

liked a Space almost 2 years ago

1.15k

ControlNet V1.1

📉

Transform images using various styles and effects

liked a model about 2 years ago

TheBloke/Llama-2-7B-Chat-GGML

Text Generation • Updated Sep 27, 2023 • 687 • 872