AI & ML interests

🥇 Winner Defender Team at Amazon Nova AI Challenge 2025


Welcome to the PurpCode project!

PurpCode is an alignment method and a fully open-source recipe (data, model, and code) for eliciting cybersafe reasoning in coding models, covering both secure code generation and defense against malicious cyber events. PurpCode consists of two alignment stages:

  1. Rule Learning: teaching LLMs secure coding rules and general safety practices
  2. Reinforcement Learning: letting LLMs co-exercise their safety and utility via verifiable tasks
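The Reinforcement Learning stage above relies on rewards that can be checked programmatically. As a rough illustration (not PurpCode's actual implementation; all function names and rules here are hypothetical), a verifiable reward might grant credit only when a generated response both passes its unit test (utility) and violates no secure-coding rule (safety):

```python
import re

# Toy "secure coding rules": illustrative patterns only, not PurpCode's rule set.
INSECURE_PATTERNS = [
    re.compile(r"\byaml\.load\(\s*[^,)]*\)"),  # unsafe deserialization (CWE-502 style)
    re.compile(r"\beval\("),                   # arbitrary code execution
]

def passes_safety(code: str) -> bool:
    """True if no insecure pattern appears in the generated code."""
    return not any(p.search(code) for p in INSECURE_PATTERNS)

def passes_utility(code: str, test: str) -> bool:
    """Run the generated code plus its unit test in a scratch namespace."""
    ns: dict = {}
    try:
        exec(code + "\n" + test, ns)  # real systems would sandbox this
        return True
    except Exception:
        return False

def reward(code: str, test: str) -> float:
    """Verifiable reward: 1.0 only when the response is both useful and safe."""
    return 1.0 if passes_utility(code, test) and passes_safety(code) else 0.0
```

For example, a correct `add` function earns reward 1.0, while a solution that shells out to `eval` passes its test but is scored 0.0 — forcing the model to exercise safety and utility together rather than trading one for the other.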

We also curate comprehensive safety data via internal red teaming and use various evaluators covering cybersafety, utility, and overrefusal.

To cite our work:

@article{purpcode,
  title = {PurpCode: Reasoning for Safer Code Generation},
  author = {Liu, Jiawei and Diwan, Nirav and Wang, Zhe and Zhai, Haoyu and Zhou, Xiaona and Nguyen, Kiet A. and Yu, Tianjiao and Wahed, Muntasir and Deng, Yinlin and Benkraouda, Hadjer and Wei, Yuxiang and Zhang, Lingming and Lourentzou, Ismini and Wang, Gang},
  journal = {arXiv preprint arXiv:2507.19060},
  year = {2025},
}