On Vacation 🏝️

20 69 227

NB

Skier8402

https://nyab.notion.site

Shuyib

AI & ML interests

Explainable Computer Vision w/ Mech Interpretability, Optimization, NLP and multimodal system implementation.

Recent Activity

upvoted a collection 7 days ago

Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability

updated a collection 7 days ago

Interpretability tools

liked a Space 7 days ago

dlouapre/eiffel-tower-llama

View all activity

Organizations

upvoted a collection 7 days ago

Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability

Collection

A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated 17 days ago • 18

updated a collection 7 days ago

Interpretability tools

Collection

Opening the hood of Computer Vision model for example ResNets, ConvNext & DETR, multimodal models and NLP models:BERT & GPTs. • 7 items • Updated 7 days ago • 2

liked a Space 7 days ago

The Eiffel Tower Llama

📝

Explore the Eiffel Tower Llama experiment with open-source models

updated a collection 9 days ago

biomedical

Collection

6 items • Updated 9 days ago

liked a model 9 days ago

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17, 2025 • 28.8k • • 390

updated a collection 9 days ago

biomedical

Collection

6 items • Updated 9 days ago

liked a dataset 9 days ago

nsk7153/MedCalc-Bench-Verified

Viewer • Updated 6 days ago • 11.7k • 162 • 3

liked 2 datasets 16 days ago

mistralai/mmlu_speech

Viewer • Updated Jul 15, 2025 • 14.3k • 524 • 14

mistralai/gsm8k_speech

Viewer • Updated Jul 15, 2025 • 1.32k • 94 • 6

upvoted a collection 16 days ago

Speech Evals

Collection

Synthesized speech evals generated by MistralAI from popular text evaluation datasets to evaluate spoken-language reasoning capabilities of Audio LLMs • 3 items • Updated Nov 28, 2025 • 12

upvoted a paper 20 days ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 174

updated 2 collections 20 days ago

Interpretability tools

Collection

Opening the hood of Computer Vision model for example ResNets, ConvNext & DETR, multimodal models and NLP models:BERT & GPTs. • 7 items • Updated 7 days ago • 2

Diffusion model tools

Collection

a couple of controlnets to improve various aspects of an images • 8 items • Updated 20 days ago

updated a collection 21 days ago

Datasets

Collection

Interesting datasets to help train LLMs and beyond • 45 items • Updated 21 days ago

liked a dataset 21 days ago

OpenMed/Medical-Reasoning-SFT-GPT-OSS-120B

Viewer • Updated 21 days ago • 200k • 3.63k • 234

updated a collection 21 days ago

Datasets

Collection

Interesting datasets to help train LLMs and beyond • 45 items • Updated 21 days ago

liked a dataset 21 days ago

Nadhari/Swahili-Thinking

Viewer • Updated Nov 23, 2025 • 166 • 73 • 8

updated a collection 21 days ago

Swahili models

Collection

5 items • Updated 21 days ago

liked a model 21 days ago

Nadhari/swa-csm-1b

Text-to-Speech • Updated 21 days ago • 94 • 3

upvoted an article 27 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

29 days ago

•

553

NB

AI & ML interests

Recent Activity

Organizations

Skier8402's activity

The Eiffel Tower Llama

We Got Claude to Fine-Tune an Open Source LLM