TheMrCodes
TheMrCodes
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
8 days ago
Recurrence-Complete Frame-based Action Models
upvoted
a
paper
8 months ago
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large
Language Models
Organizations
None yet
AI Safety
Interesting Datasets
LM Research
-
TinyLlama: An Open-Source Small Language Model
Paper • 2401.02385 • Published • 94 -
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper • 2401.01335 • Published • 68 -
Asynchronous Local-SGD Training for Language Modeling
Paper • 2401.09135 • Published • 12 -
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Paper • 2404.07143 • Published • 111
Interesting for LLM Products
Tiny MMLM
Knowledge Graph
Cool Papers
Image Gen
Milestomes
Read later list
Waiting for model weights
-
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 189 -
Multilingual E5 Text Embeddings: A Technical Report
Paper • 2402.05672 • Published • 22 -
Accelerating High-Fidelity Waveform Generation via Adversarial Flow Matching Optimization
Paper • 2408.08019 • Published • 11
Fundamental Research
Bio ML
Point Tracking Models
Cool Papers
AI Safety
Image Gen
Interesting Datasets
Milestomes
LM Research
-
TinyLlama: An Open-Source Small Language Model
Paper • 2401.02385 • Published • 94 -
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper • 2401.01335 • Published • 68 -
Asynchronous Local-SGD Training for Language Modeling
Paper • 2401.09135 • Published • 12 -
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Paper • 2404.07143 • Published • 111
Read later list
Interesting for LLM Products
Waiting for model weights
-
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 189 -
Multilingual E5 Text Embeddings: A Technical Report
Paper • 2402.05672 • Published • 22 -
Accelerating High-Fidelity Waveform Generation via Adversarial Flow Matching Optimization
Paper • 2408.08019 • Published • 11
Tiny MMLM
Fundamental Research
Knowledge Graph
Bio ML