33 14 8

Hamish Ivison

hamishivi

https://ivison.id.au

AI & ML interests

NLP :)

Recent Activity

updated a dataset about 24 hours ago

hamishivi/simple_qa_rlvr

published a dataset about 24 hours ago

hamishivi/simple_qa_rlvr

updated a dataset about 24 hours ago

hamishivi/2wiki_rlvr

View all activity

Organizations

Collections 7

Inference demo for TESS 2 model

models 34

hamishivi/s1k_seq_orig_hyper421740446762

Updated Mar 13

hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt

Updated Mar 11 • 3

hamishivi/tulu-2-wildchat-326k-sft

Updated Mar 4

hamishivi/tulu-2-arena-hard-326k-sft

Updated Mar 4 • 1

hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft

Updated Mar 4 • 1

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft

Updated Mar 4

hamishivi/tulu-2-multitask-rrmax-326k-sft

Updated Mar 4

hamishivi/qwen2_math_tokenizer_tweaked

Updated Mar 3

hamishivi/0224_jupiter_hamish_grpo_tulu3_s1k_orz_30350

Updated Feb 25

hamishivi/0224_jupiter_hamish_grpo_s1k_only_orz_24021

Updated Feb 25 • 2

datasets 63

hamishivi/simple_qa_rlvr

Viewer • Updated about 24 hours ago • 4.33k • 15

hamishivi/2wiki_rlvr

Viewer • Updated about 24 hours ago • 15.3k • 204

hamishivi/tqa_rlvr

Viewer • Updated about 24 hours ago • 156k • 213

hamishivi/nq_rlvr

Viewer • Updated about 24 hours ago • 91.5k • 263

hamishivi/hotpotqa_rlvr

Viewer • Updated about 24 hours ago • 97.9k • 235

hamishivi/simple_qa_rlvr_no_prompt

Viewer • Updated about 24 hours ago • 4.33k • 14

hamishivi/2wiki_rlvr_no_prompt

Viewer • Updated about 24 hours ago • 15.3k • 15

hamishivi/tqa_rlvr_no_prompt

Viewer • Updated about 24 hours ago • 156k • 14

hamishivi/nq_rlvr_no_prompt

Viewer • Updated about 24 hours ago • 91.5k • 14

hamishivi/hotpotqa_rlvr_no_prompt

Viewer • Updated about 24 hours ago • 97.9k • 15

Hamish Ivison

AI & ML interests

Recent Activity

Organizations

Collections 7

Large-Scale Data Selection for Instruction Tuning

hamishivi/tulu-2-multitask-rrmax-326k-sft

hamishivi/rds-sels-multitask-rrmax-top326k

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft

TESS 2: A Large-Scale Generalist Diffusion Language Model

Tess 2 Demo

hamishivi/tess2-v0.3

hamishivi/tess2-v0.1

Papers 11

spaces 1

Tess 2 Demo

models 34

hamishivi/s1k_seq_orig_hyper421740446762

hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt

hamishivi/tulu-2-wildchat-326k-sft

hamishivi/tulu-2-arena-hard-326k-sft

hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft

hamishivi/tulu-2-multitask-rrmax-326k-sft

hamishivi/qwen2_math_tokenizer_tweaked

hamishivi/0224_jupiter_hamish_grpo_tulu3_s1k_orz_30350

hamishivi/0224_jupiter_hamish_grpo_s1k_only_orz_24021

datasets 63

hamishivi/simple_qa_rlvr

hamishivi/2wiki_rlvr

hamishivi/tqa_rlvr

hamishivi/nq_rlvr

hamishivi/hotpotqa_rlvr

hamishivi/simple_qa_rlvr_no_prompt

hamishivi/2wiki_rlvr_no_prompt

hamishivi/tqa_rlvr_no_prompt

hamishivi/nq_rlvr_no_prompt

hamishivi/hotpotqa_rlvr_no_prompt

Hamish Ivison

AI & ML interests

Recent Activity

Organizations

Collections 7

Tess 2 Demo

Papers 11

spaces 1

Tess 2 Demo

models 34 Sort: Recently updated

datasets 63 Sort: Recently updated

models 34

datasets 63