25 15 56

Jordan Legg PRO

takarajordan

https://takara.ai

AI & ML interests

Chief AI Officer @takara.ai. Diffusion, Inference optimisation and all things MultiModal.

Recent Activity

reacted to tomaarsen's post with ❤️ 7 days ago

😎 I just published Sentence Transformers v5.1.0, and it's a big one. 2x-3x speedups of SparseEncoder models via ONNX and/or OpenVINO backends, easier distillation data preparation with hard negatives mining, and more: 1️⃣ Faster ONNX and OpenVINO backends for SparseEncoder models Usage is as simple as `backend="onnx"` or `backend="openvino"` when initializing a SparseEncoder to get started, but I also included utility functions for optimization, dynamic quantization, and static quantization, plus benchmarks. 2️⃣ New `n-tuple-scores` output format from `mine_hard_negatives` This new output format is immediately compatible with the MarginMSELoss and SparseMarginMSELoss for training SentenceTransformer, CrossEncoder, and SparseEncoder losses. 3️⃣ Gathering across devices When doing multi-GPU training using a loss that has in-batch negatives (e.g. MultipleNegativesRankingLoss), you can now use `gather_across_devices=True` to load in-batch negatives from the other devices too! Essentially a free lunch, pretty big impact potential in my evals. 4️⃣ Trackio support If you also upgrade `transformers`, and you install `trackio` with `pip install trackio`, then your experiments will also automatically be tracked locally with trackio. Just open up localhost and have a look at your losses/evals, no logins, no metric uploading. 5️⃣ MTEB Documentation We've added some documentation on evaluating SentenceTransformer models properly with MTEB. It's rudimentary as the documentation on the MTEB side is already great, but it should get you started. Plus many more smaller features & fixes (crash fixes, compatibility with datasets v4, FIPS compatibility, etc.). See the full release notes here: https://github.com/UKPLab/sentence-transformers/releases/tag/v5.1.0 Big thanks to all of the contributors for helping with the release, many of the features from this release were proposed by others. I have a big list of future potential features that I'd love to add, but I'm

reacted to openfree's post with 🔥 8 days ago

🚀 GPT-OSS 120B & 20B - Use Both Models in One Space! https://huggingface.co/spaces/openfree/OpenAI-gpt-oss https://huggingface.co/spaces/VIDraft/gpt-oss-RAG 🎯 Two Models, One Space! GPT-OSS hit #1 on HF just 2 hours after release! 🏆 Now you can use both models conveniently in a single space. 📋 Model Selection Made Easy! Just pick from the dropdown ✅ ├── GPT-OSS-120B (Complex tasks) └── GPT-OSS-20B (Quick chats) 💫 How to Use (Takes 30 seconds!) Sign in → With your HF account 🔐 Select model → Choose what you need 📌 Apply → Click! ⚡ Start chatting → That's it! 💬 🌈 Perfect For: 120B → Deep analysis, professional work 🧠 20B → Fast responses, casual conversations ⚡ No installation needed - just use it in your browser! 🌐 ✨ Special Features 🎨 Beautiful gradient UI 🌙 Dark mode support 🔄 Real-time model switching 🆓 Completely free! 👉 Try it now! It's really that simple! #GPT-OSS #HuggingFace #FreeAI #EasyToUse

posted an update 8 days ago

What do you all actually think about the open source OpenAI models? Are they legitimately any good or are they hype?

View all activity

Organizations

upvoted a paper 4 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 197

upvoted a paper 5 months ago

ABC: Achieving Better Control of Multimodal Embeddings using VLMs

Paper • 2503.00329 • Published Mar 1 • 19

upvoted a collection 7 months ago

SwarmFormer

Collection

Our collection of our frontier SwarmFormer architecture models. • 2 items • Updated Jan 24 • 3

upvoted a paper 8 months ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 95

upvoted 2 papers 9 months ago

3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes

Paper • 2411.14974 • Published Nov 22, 2024 • 16

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

Paper • 2411.18613 • Published Nov 27, 2024 • 59

upvoted 2 papers 10 months ago

Stealing User Prompts from Mixture of Experts

Paper • 2410.22884 • Published Oct 30, 2024 • 14

Teach Multimodal LLMs to Comprehend Electrocardiographic Images

Paper • 2410.19008 • Published Oct 21, 2024 • 24

upvoted a paper 11 months ago

MuCodec: Ultra Low-Bitrate Music Codec

Paper • 2409.13216 • Published Sep 20, 2024 • 24

upvoted a paper 12 months ago

T3M: Text Guided 3D Human Motion Synthesis from Speech

Paper • 2408.12885 • Published Aug 23, 2024 • 13

upvoted a paper about 1 year ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 127

upvoted 2 articles about 1 year ago

Article

Local AI with Docker's Testcontainers

•

Aug 3, 2024

• 8

Article

WWDC 24: Running Mistral 7B with Core ML

and 3 others •

Jul 22, 2024

• 61

upvoted 2 papers about 1 year ago

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Paper • 2407.02855 • Published Jul 3, 2024 • 13

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5, 2024 • 35