Doug Holton's picture

74 229

Doug Holton PRO

edtechdev

·

https://edtechdev.wordpress.com/

AI & ML interests

Educational Technology, Intelligent Tutoring Systems, Learning Sciences

Recent Activity

liked a model 7 days ago

zai-org/GLM-4.7

upvoted a collection 11 days ago

liked a model 11 days ago

zai-org/GLM-4.6V-Flash

View all activity

Organizations

None yet

upvoted a collection 11 days ago

GLM-4.6V

3 items • Updated 22 days ago • 47

upvoted a collection 15 days ago

Parakeet

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 12 items • Updated 7 days ago • 50

upvoted 2 collections about 1 month ago

Step-Audio-R1

Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling. • 3 items • Updated Nov 21 • 15

DeepSeek-Math

DeepSeek Math series • 6 items • Updated Nov 27 • 44

upvoted a collection about 2 months ago

📄 FinePDFs

81 items • Updated Nov 11 • 25

upvoted a collection 2 months ago

LightOnOCR

The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR • 7 items • Updated Nov 13 • 15

upvoted 2 articles 2 months ago

Article

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

Oct 23

•

62

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21

•

284

upvoted a collection 2 months ago

MathCanvas

Datasets and models for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning" • 5 items • Updated Nov 19 • 3

upvoted a paper 2 months ago

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

Paper • 2510.14958 • Published Oct 16 • 22

upvoted an article 3 months ago

Article

In-browser LLM app in pure Python: Gemini Nano + Gradio-Lite

Jul 12, 2024

•

14

upvoted 2 collections 3 months ago

Qwen3-Omni

6 items • Updated Oct 9 • 176

Qwen3-VL

37 items • Updated Nov 1 • 549

upvoted an article 3 months ago

Article

AI Watermarking 101: Tools and Techniques

+7

Feb 26, 2024

•

27

upvoted a paper 4 months ago

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 55

upvoted 2 collections 4 months ago

MiniCPM-o & MiniCPM-V

Multimodal models with leading performance. • 28 items • Updated Sep 1 • 59

DeepSeek-V3.1

4 items • Updated Nov 27 • 256

upvoted 3 collections 5 months ago

Canary

A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 5 items • Updated 7 days ago • 29

Gemma 3-270m

Collection of models for Gemma 3-270m • 4 items • Updated 14 days ago • 21

Gemma 3 Release

28 items • Updated Aug 11 • 572