reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs
rasdani
rasdani
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 20 hours ago
rasdani/github-patches-genesys-swe-prompt-2k-context-1k-diff
published
a dataset
about 20 hours ago
rasdani/github-patches-genesys-swe-prompt-2k-context-1k-diff
liked
a Space
1 day ago
lerobot/visualize_dataset