NicholasCorrado
AI & ML interests
Reinforcement learning
Organizations
None yet
NicholasCorrado/mistral-7b-ift
Text Generation
•
7B
•
Updated
•
20
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.1
Text Generation
•
7B
•
Updated
•
4
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01
Text Generation
•
7B
•
Updated
•
4
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01-1e
Text Generation
•
7B
•
Updated
•
3
NicholasCorrado/zephyr-7b-uf-rc-small-dpo
Text Generation
•
7B
•
Updated
•
5
NicholasCorrado/test
Updated
NicholasCorrado/zephyr-7b-uf-dpo-2e
Text Generation
•
7B
•
Updated
•
2
NicholasCorrado/rlced-conifer-zephyr-7b-dpo-2e
Text Generation
•
7B
•
Updated
•
2
NicholasCorrado/zephyr-7b-uf-rlced-conifer-1e2e-group-dpo-2e
Text Generation
•
7B
•
Updated
•
2
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e
Text Generation
•
7B
•
Updated
•
3
NicholasCorrado/zephyr-7b-uf-rlced-conifer-dpo-2e
Text Generation
•
7B
•
Updated
•
2
NicholasCorrado/rlced-conifer-zephyr-7b-dpo
Updated
NicholasCorrado/tulu-2-7b-rlced-conifer-dpo
Text Generation
•
7B
•
Updated
•
2
NicholasCorrado/uf-tulu-2-7b-dpo
Text Generation
•
7B
•
Updated
•
2
NicholasCorrado/tinyllama-1.1b-chat-v1.0-uf-dpo
Updated
NicholasCorrado/tinyllama-1.1b-chat-v1.0-rlced-conifer-3-1-dpo
Text Generation
•
1B
•
Updated
•
3
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-dpo-2
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-logic-dpo-2
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-math-coding-dpo-2
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-math-coding-group-dpo
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-math-dpo-2
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-coding-dpo-2
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-logic-dpo
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-math-coding-dpo
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/uf-rlced-conifer_tulu-2-7b-group-dpo-no-clip
Text Generation
•
7B
•
Updated
•
5
NicholasCorrado/uf-rlced-conifer_tulu-2-7b-group-dpo
Text Generation
•
7B
•
Updated
•
4
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-coding-dpo
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/tinyllama-1.1b-chat-v1.0-arena-hh-dpo
Text Generation
•
1B
•
Updated
•
4
NicholasCorrado/tinyllama-1.1b-chat-v1.0-ui-math-dpo
Text Generation
•
1B
•
Updated
•
2
NicholasCorrado/uf-rlced-conifer-zephyr-7b-group-dpo-no-clip-no-excess
Text Generation
•
7B
•
Updated
•
2