Hao Jiang

TechxGenus

https://techxgenus.github.io/

AI & ML interests

Code Intelligence; Large Language Model; AI Alignment; Efficient Inference

Recent Activity

liked a model about 2 hours ago

zai-org/GLM-4.7

updated a dataset 2 days ago

TechxGenus/t2i-i2t

published a dataset 2 days ago

TechxGenus/t2i-i2t

Organizations

None yet

upvoted a collection about 1 month ago

Step-Audio-R1

Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling. • 3 items • Updated Nov 21 • 15

upvoted a paper about 1 month ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12 • 117

upvoted a paper about 2 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4 • 101

upvoted a collection 3 months ago

DeepSeek-V3.2

4 items • Updated 24 days ago • 510

upvoted 2 papers 4 months ago

Matrix-Game: Interactive World Foundation Model

Paper • 2506.18701 • Published Jun 23 • 72

Thyme: Think Beyond Images

Paper • 2508.11630 • Published Aug 15 • 81

upvoted a paper 5 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 266

upvoted a paper 7 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263

upvoted a collection 8 months ago

OpenMathReasoning

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 1 day ago • 46

upvoted 3 papers 8 months ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 182

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 63

Sleep-time Compute: Beyond Inference Scaling at Test-time

Paper • 2504.13171 • Published Apr 17 • 15

upvoted a paper 9 months ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published Apr 3 • 57

upvoted a collection 9 months ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 7 items • Updated Jul 21 • 160

upvoted a collection 10 months ago

Gemma 3 Release

28 items • Updated Aug 11 • 571

upvoted a paper 10 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

upvoted a collection 10 months ago

The Ultimate Collection of Code Classifiers

🔥 15 classifiers, 124M parameters, one per programming language, for assessing the educational value of GitHub code • 15 items • Updated May 5 • 15

upvoted 2 papers 11 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 151

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 429

upvoted a paper 12 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108