Manuel Romero's picture

In a Training Loop 🔄

Manuel Romero PRO

mrm8488

·

https://mrm8488.github.io

AI & ML interests

#AI Research and Democratization. NLP/NLG 🤗

Recent Activity

liked a dataset 6 days ago

nvidia/Nemotron-Terminal-Synthetic-Tasks

liked a dataset 6 days ago

nvidia/Nemotron-Terminal-Corpus

upvoted a paper 6 days ago

Diffusion-Pretrained Dense and Contextual Embeddings

View all activity

Organizations

upvoted a paper 6 days ago

Diffusion-Pretrained Dense and Contextual Embeddings

Paper • 2602.11151 • Published 22 days ago • 21

upvoted a collection 12 days ago

GPT 5 Codex

Distilled models and datasets for GPT 5 Codex • 7 items • Updated Dec 20, 2025 • 4

upvoted an article 20 days ago

Article

Forge: Scalable Agent RL Framework and Algorithm

21 days ago

•

133

upvoted 2 collections 2 months ago

🧮functiongemma ft mobile-actions

A collection of functiongemma-270m-it models fine-tuned on mobile actions dataset for Spanish, French and Italian • 3 items • Updated Jan 5 • 3

JustRL

2 items • Updated Nov 1, 2025 • 5

upvoted 2 articles 2 months ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

•

43

Article

Encoding the World's Medical Knowledge into 970K

Dec 22, 2025

•

15

upvoted a paper 3 months ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 240

upvoted an article 3 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

607

upvoted a collection 3 months ago

Nemotron RAG

Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 9 items • Updated 2 days ago • 78

upvoted a paper 3 months ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published Nov 25, 2025 • 123

upvoted a collection 3 months ago

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated Dec 23, 2025 • 50

upvoted a paper 4 months ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 52

upvoted an article 4 months ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

•

60

upvoted a paper 4 months ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 129

upvoted a collection 4 months ago

Luth Datasets

2 items • Updated 4 days ago • 5

upvoted an article 4 months ago

Article

Luth: Efficient French Specialization for Small Language Models

Aug 11, 2025

•

18

upvoted a collection 4 months ago

Luth x Qwen3

4 items • Updated Sep 24, 2025 • 7

upvoted an article 5 months ago

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21, 2025

•

306

upvoted a paper 5 months ago

Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report

Paper • 2510.14880 • Published Oct 16, 2025 • 19