Shang Hong Sim

shanghong

https://shanghongsim.github.io/

AI & ML interests

Neural decoding, neuroengineering, signal processing

Organizations

Collections 3

models 13

datasets 6

shanghong/oumi-web-agent

Viewer • Updated Aug 19, 2025 • 9.28k • 104

shanghong/oumi_rag_grpo_data

Viewer • Updated Aug 12, 2025 • 5.12k • 75

shanghong/llama_index_integration_data

Viewer • Updated May 15, 2025 • 21.1M • 13

shanghong/PRM800K_phase2_balanced

Viewer • Updated Oct 18, 2024 • 1.38M • 11

shanghong/PRM800K_train2_base_sft

Viewer • Updated Oct 12, 2024 • 97.8k • 4

shanghong/PRM800K_train2

Viewer • Updated Oct 12, 2024 • 966k • 6

Collections 3

Towards General Agentic Intelligence via Environment Scaling

Establishing Best Practices for Building Rigorous Agentic Benchmarks

LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

UCSC-VLAA/MedReason

interstellarninja/hermes_reasoning_tool_use

HuggingFaceM4/DoclingMatix

Amod/mental_health_counseling_conversations

models 13

shanghong/qwen3_8b_tatqa

shanghong/oumi_rag_grpo

shanghong/llama3.1_8b_stage1

shanghong/qwen3_8b_stage1

shanghong/qwen3_4b_stage1

shanghong/stage1

shanghong/q-FrozenLake-4x4-custom

shanghong/q-FrozenLake-4x4-test

shanghong/q-FrozenLake-custommap-v2

shanghong/q-FrozenLake-custommap
