🧠 What if your AI agents could remember every decision across 199 rounds — without stuffing the context window?
I ran 3 NVIDIA Nemotron-3-Nano-30B-A3B agents (3B active params each, MoE) for 9 hours on a real COBOL→Python migration (AWS CardDemo, 50K lines). All local, all on llama.cpp, zero API calls.
Results:
• 24M tokens processed
• 52 Python files written
• 402 persistent memories shared between agents
• Context per agent: never exceeded 9K tokens
• Speed: 97-137 tok/s from start to finish, no degradation
• Errors: 0
The secret: memory-first architecture via AgentAZAll. Only the last round goes into context. Everything else is a tool call to recall/remember. The context window stays clean. Speed stays constant. Knowledge grows forever.
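The idea can be sketched in a few lines of Python. This is a minimal, hypothetical illustration of the memory-first pattern, not the actual AgentAZAll API (all class and method names here are mine): memory lives in a shared store that agents hit via remember/recall tool calls, and the prompt only ever carries the current task plus the last round.

```python
class MemoryStore:
    """Shared persistent memory: any agent can remember a fact, any agent
    can recall it later. Grows without ever touching the context window."""

    def __init__(self):
        self._facts = {}  # key -> fact (a real store might be SQLite, etc.)

    def remember(self, key: str, fact: str) -> None:
        # Exposed to the model as a "remember" tool call
        self._facts[key] = fact

    def recall(self, key: str):
        # Exposed to the model as a "recall" tool call
        return self._facts.get(key)


class Agent:
    def __init__(self, name: str, store: MemoryStore):
        self.name = name
        self.store = store
        self.last_round = ""  # ONLY this goes back into the next prompt

    def build_context(self, task: str) -> str:
        # Context = current task + last round only. Everything older must
        # be fetched on demand via recall(), so prompt size stays bounded
        # no matter how many rounds have run.
        parts = [task]
        if self.last_round:
            parts.append(f"Previous round: {self.last_round}")
        return "\n".join(parts)

    def run_round(self, task: str, result: str) -> str:
        context = self.build_context(task)
        # ... a real agent would call the LLM here with `context` and let
        # it emit remember/recall tool calls; `result` stands in for that.
        self.store.remember(f"{self.name}:{task}", result)
        self.last_round = result
        return context
```

After 199 rounds the store holds 199 facts, but each prompt still holds exactly one previous round, which is why throughput never degrades: the KV cache never has to absorb the full history.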