Robin Williams PRO
bfuzzy1
AI & ML interests
None yet
Recent Activity
upvoted
an
article
8 days ago
SmolLM3: smol, multilingual, long-context reasoner
upvoted
a
collection
8 days ago
Encoders vs Decoders: the Ettin Suite
commented on
a paper
21 days ago
FLEXITOKENS: Flexible Tokenization for Evolving Language Models
Organizations
None yet
llambses-1
llambses-1 models
Gunny
fine tuned models focused on veteran support
Agents
Collection of resources related to Agents.
-
Communicative Agents for Software Development
Paper • 2307.07924 • Published • 6 -
Self-Refine: Iterative Refinement with Self-Feedback
Paper • 2303.17651 • Published • 2 -
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper • 2312.10003 • Published • 44 -
ReAct: Synergizing Reasoning and Acting in Language Models
Paper • 2210.03629 • Published • 27
Attentive
Generation Nation
RL
-
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
Paper • 2412.14922 • Published • 90 -
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Paper • 2412.17256 • Published • 48 -
Deliberation in Latent Space via Differentiable Cache Augmentation
Paper • 2412.17747 • Published • 33 -
Outcome-Refining Process Supervision for Code Generation
Paper • 2412.15118 • Published • 19
acheron
acheron slms
AI for Good
AI for Good.
Agentic-ly agentic
-
Automated Design of Agentic Systems
Paper • 2408.08435 • Published • 41 -
On the limits of agency in agent-based models
Paper • 2409.10568 • Published • 14 -
On the Diagram of Thought
Paper • 2409.10038 • Published • 14 -
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?
Paper • 2409.07703 • Published • 68
Don't hate - evaluate
Nifty
-
LLMs + Persona-Plug = Personalized LLMs
Paper • 2409.11901 • Published • 34 -
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper • 2409.12183 • Published • 40 -
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper • 2402.12875 • Published • 13 -
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper • 2410.00531 • Published • 35
acheron-m
RL
-
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
Paper • 2412.14922 • Published • 90 -
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Paper • 2412.17256 • Published • 48 -
Deliberation in Latent Space via Differentiable Cache Augmentation
Paper • 2412.17747 • Published • 33 -
Outcome-Refining Process Supervision for Code Generation
Paper • 2412.15118 • Published • 19
llambses-1
llambses-1 models
acheron
acheron slms
Gunny
fine tuned models focused on veteran support
AI for Good
AI for Good.
Agents
Collection of resources related to Agents.
-
Communicative Agents for Software Development
Paper • 2307.07924 • Published • 6 -
Self-Refine: Iterative Refinement with Self-Feedback
Paper • 2303.17651 • Published • 2 -
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper • 2312.10003 • Published • 44 -
ReAct: Synergizing Reasoning and Acting in Language Models
Paper • 2210.03629 • Published • 27
Agentic-ly agentic
-
Automated Design of Agentic Systems
Paper • 2408.08435 • Published • 41 -
On the limits of agency in agent-based models
Paper • 2409.10568 • Published • 14 -
On the Diagram of Thought
Paper • 2409.10038 • Published • 14 -
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?
Paper • 2409.07703 • Published • 68
Attentive
Don't hate - evaluate
Generation Nation
Nifty
-
LLMs + Persona-Plug = Personalized LLMs
Paper • 2409.11901 • Published • 34 -
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper • 2409.12183 • Published • 40 -
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper • 2402.12875 • Published • 13 -
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper • 2410.00531 • Published • 35