Agents-MCP-Hackathon

community

https://www.gradio.app/

gradio

gradio-app

Recent Activity

jayavibhav new activity 15 days ago

Agents-MCP-Hackathon/gradio_workflowbuilder:INLAGA TILL FALU TINGSRÄTT – MÅL T 3098-23

NohTow authored a paper about 2 months ago

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models

Amasylla new activity 2 months ago

Agents-MCP-Hackathon/credit-card-database-mcp-server:Build error when starting the app

rajkumarrawal

posted an update 10 days ago

Post

1557

I submitted the paper "Context-Value-Action Architecture for Value-Driven Large Language Model Agents" by TianZe Zhang, Sirui Sun, Yuhang Xie, Xin Zhang, Zhiqiang Wu, and Guojie Song, from PekingUniversity, to Daily Papers on huggingface.

Large language models exhibit behavioral rigidity that worsens with intensified reasoning, prompting the development of a Context-Value-Action architecture that decouples action generation from cognitive reasoning using a Value Verifier trained on human data.

Context-Value-Action Architecture for Value-Driven Large Language Model Agents (2604.05939)

rajkumarrawal

submitted a paper to Daily Papers 10 days ago

Context-Value-Action Architecture for Value-Driven Large Language Model Agents

Paper • 2604.05939 • Published 11 days ago • 9

jayavibhav

in Agents-MCP-Hackathon/gradio_workflowbuilder 15 days ago

INLAGA TILL FALU TINGSRÄTT – MÅL T 3098-23

#1 opened 9 months ago by

Ali-86

Ironman-3000

authored a paper about 1 month ago

Entity Augmentation for Efficient Classification of Vertically Partitioned Data with Limited Overlap

Paper • 2406.17899 • Published Jun 25, 2024

Amasylla

in Agents-MCP-Hackathon/credit-card-database-mcp-server 2 months ago

Build error when starting the app

#2 opened 2 months ago by

Amasylla

daniel-was-taken

updated a Space 2 months ago

AutoML - MCP Hackathon

📈

Automated ML model comparison with LazyPredict

rajkumarrawal

posted an update 2 months ago

Post

225

I submitted the paper "Continual GUI Agents" by Ziwei Liu, Borul Kang, Hangjie Yuan, Zixiang Zhao, Wei Li, Yifan Zhu, and Tao Feng, from Tsinghua, ZhejiangUniversity, ethz, and BUPT2023213296, to Daily Papers on huggingface.

The Continual GUI Agents framework addresses performance degradation in dynamic digital environments through reinforcement fine-tuning with novel anchoring rewards that stabilize learning across shifting UI domains and resolutions.

Continual GUI Agents (2601.20732)

Amasylla

in Agents-MCP-Hackathon/credit-card-database-mcp-server 2 months ago

Request about the server restart

#1 opened 3 months ago by

Amasylla

Chris4K

in Agents-MCP-Hackathon/credit-card-database-mcp-server 2 months ago

Request about the server restart

#1 opened 3 months ago by

Amasylla

rajkumarrawal

posted an update 3 months ago

Post

3686

I submitted the paper "FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning" by Tanyu Chen, Tairan Chen, Kai Shen, Zhenghua Bao, Zhihui Zhang, Man Yuan, and Yi Shi, from FlashLabs, to Daily Papers on huggingface.

Chroma 1.0 enables real-time spoken dialogue with personalized voice cloning through discrete speech representations and interleaved text-audio token scheduling.

Chroma 1.0: the world's first open-source, real-time speech-to-speech model with voice cloning.

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning (2601.11141)

rajkumarrawal

submitted a paper to Daily Papers 3 months ago

Continual GUI Agents

Paper • 2601.20732 • Published Jan 28 • 5

ovi054

posted an update 3 months ago

Post

3461

ovi054/LTX-2-19b-Squish-LoRA ⚡

I trained a Squish LoRA for LTX-2. Upload an image and give the prompt "squish it" to get the squish video.

Demo output videos are attached.

👉Try it now:
ovi054/LTX-2-19b-Squish-LoRA
ovi054/ltx-2-Audio-to-Video

rajkumarrawal

submitted 3 papers to Daily Papers 3 months ago

LLM Prompt Evaluation for Educational Applications

Paper • 2601.16134 • Published Jan 22 • 1

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Paper • 2601.11141 • Published Jan 16 • 23

The Responsibility Vacuum: Organizational Failure in Scaled Agent Systems

Paper • 2601.15059 • Published Jan 21 • 4

rajkumarrawal

posted an update 3 months ago

Post

863

I submitted the paper "AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts" by Keyu Li (@weizhihao1), Junhao Shi, Dequan Wang (@dqwang), Yang Xiao (@YangXiao-nlp), Mohan Jiang, Jie Sun (@Sunshine279), Yunze Wu, Shijie Xia, Xiaojie Cai, Tianze Xu, Weiye Si, Wenjie Li, and Pengfei Liu, from SJTU (Shanghai Jiao Tong University), PolyUHK (The Hong Kong Polytechnic University), and GAIR (SII-GAIR), to Daily Papers on huggingface (Hugging Face).

Potentially another direction for benchmarking the frontiers of autonomous agents in 2026.

Some of the observations are:

-- Long-horizon tasks remain challenging:
Even frontier models struggle with sustained reasoning over real-world tasks that require 1M tokens and 90 tool calls, indicating limits in long-context autonomy.

-- Proprietary models outperform open-source models:
Closed-source models achieve a higher average score (48.4%) than their open-source counterparts (32.1%), revealing a persistent performance gap on complex agentic tasks.

-- Feedback-driven self-correction varies widely:
Models like GPT 5.2 and Claude show strong gains from iterative feedback, while others (e.g. DeepSeek V3.2) exhibit minimal or no improvement after feedback.

-- Efficiency trade-offs are significant:
High-performing models often consume far more tokens and time, while some models (e.g. Grok 4.1 Fast) are more token-efficient despite lower absolute scores.

-- Agentic scaffolds strongly influence performance:
Models tend to perform best within their native or optimized ecosystems, highlighting that agent performance depends on tight coupling between the model and its scaffold, not on the model alone.

... and many more.

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts (2601.11044)

1 reply

rajkumarrawal

submitted 2 papers to Daily Papers 3 months ago

What Matters in Data Curation for Multimodal Reasoning? Insights from the DCVLR Challenge

Paper • 2601.10922 • Published Jan 16 • 3

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Paper • 2601.11044 • Published Jan 16 • 34

ovi054

posted an update 3 months ago

Post

2223

My project, Anim-Lab-AI, won the Community Choice Award at the MCP-1st-Birthday hackathon by @HuggingFace and @Gradio! 🏆

It turns any idea or complex concept into a clear, engaging explainer animation video. 🎥

I want to thank everyone in the Hugging Face community for supporting my project!

MCP-1st-Birthday/anim-lab-ai