John-Paul K.'s picture

John-Paul K.

johnpaulbin

·

johnpaulbin

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago

johnpaulbin/pt8

published a model 2 days ago

johnpaulbin/pt8

updated a model 3 days ago

johnpaulbin/pt7

View all activity

Organizations

johnpaulbin's activity

upvoted a collection 15 days ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 1 day ago • 140

upvoted a paper about 2 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 126

upvoted 2 papers 4 months ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 85

Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains

Paper • 2402.05140 • Published Feb 6, 2024 • 24

upvoted a collection 10 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 665

upvoted a paper 11 months ago

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6, 2024 • 40

upvoted 2 papers almost 2 years ago

STEVE-1: A Generative Model for Text-to-Behavior in Minecraft

Paper • 2306.00937 • Published Jun 1, 2023 • 9

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 86