daje (daje kang)

Collections 1

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Paper • 2312.12742 • Published Dec 20, 2023 • 14

ProTIP: Progressive Tool Retrieval Improves Planning

Paper • 2312.10332 • Published Dec 16, 2023 • 8

Paloma: A Benchmark for Evaluating Language Model Fit

Paper • 2312.10523 • Published Dec 16, 2023 • 13

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 98

datasets 17

daje/synthetic-ko-sql-hard-add-llm-result

Viewer • Updated Apr 11 • 1.68k • 33

daje/synthetic-ko-sql-hard

Viewer • Updated Apr 10 • 1.68k • 53 • 1

daje/kotext-to-sql-v1-hard

Viewer • Updated Apr 8 • 2k • 24

daje/kaggle-image-datasets

Viewer • Updated Mar 10 • 44.4k • 14

daje/de-identify-chat-ko

Viewer • Updated Mar 6 • 9.92k • 16

daje/ko-hatefulmemes_train_8500

Viewer • Updated Jan 14 • 8.2k • 22

daje/ko-hatefulmemes_train_8500_kmhas

Viewer • Updated Jan 14 • 95.3k • 30

daje/ko-hatefulmemes_train_2000

Viewer • Updated Jan 13 • 1.91k • 23

daje/Ko-SciecneQA

Viewer • Updated Nov 8, 2024 • 12.7k • 16

daje/keyword_summary

Viewer • Updated Aug 8, 2024 • 1k • 45

models 39

daje/Meta-Llama-3.1-8B-Instruct-de-identification

daje/Qwen2.5-14B-Instruct-tools

daje/model_0.0002_alpha-32_r-64

daje/model_0.0002_alpha-8_r-16

daje/model_5e-05_alpha-128_r-256

daje/model_2e-4_alpha-8_r-16

daje/model_Lora

daje/model_2e-4

daje/model

daje/Qwen2-7B-Instruct-harmful_detector_2000-H100_1
