Hamish Ivison's picture

Hamish Ivison

hamishivi

·

https://ivison.id.au

AI & ML interests

NLP :)

Recent Activity

updated a dataset about 15 hours ago

hamishivi/appworld_env_train_fixed

published a dataset about 15 hours ago

hamishivi/appworld_env_train_fixed

updated a model about 15 hours ago

hamishivi/1412_rl_rag_open_judge_citation_1237_step1500

View all activity

Organizations

Collections 8

View 8 collections

Papers 14

arxiv:2512.13961

arxiv:2511.19399

arxiv:2511.07317

arxiv:2503.01807

models 231

hamishivi/1412_rl_rag_open_judge_citation_1237_step1500

8B • Updated about 15 hours ago • 10

hamishivi/1412_rl_rag_open_judge_citation_123711768961599_step1000

8B • Updated 7 days ago • 114

hamishivi/2912_rl_rag_wapaptive_step650abl_3228711768460967_step2500

8B • Updated 10 days ago • 26

hamishivi/2912_rl_rag_napaptive_step650abl_step2500

8B • Updated 11 days ago • 33

hamishivi/1412_rl_rag_open_judge_citation_step_650

8B • Updated 14 days ago • 48

hamishivi/2911_rl_rag_NAR8_gpt5sft_noapaptive_27343_step_500

8B • Updated 14 days ago • 42

hamishivi/2912_rl_rag_wadaptive_step650abl_step500

Updated 14 days ago

hamishivi/2912_rl_rag_nadaptive_step650abl_step_500

8B • Updated 14 days ago • 7

hamishivi/rl_rag_wapaptive_step650abl_3228711767513354_checkpoints_step_1350

8B • Updated 17 days ago • 31

hamishivi/2912_rl_rag_napaptive_step650abl_721111767260092_checkpoints_step_1350

8B • Updated 17 days ago • 44

View 231 models

datasets 186

hamishivi/appworld_env_train_fixed

Viewer • Updated about 15 hours ago • 50 • 18

hamishivi/wiki_search_env_train

Viewer • Updated 1 day ago • 100 • 32

hamishivi/wordle_expert_train

Viewer • Updated 1 day ago • 1k • 10

hamishivi/wordle_env_train

Viewer • Updated 1 day ago • 100 • 107

hamishivi/appworld_env_train

Viewer • Updated 4 days ago • 50 • 28

hamishivi/tulu_3_rewritten_tools_test

Viewer • Updated 13 days ago • 1k • 85

hamishivi/rl_rag_shortformqa

Viewer • Updated 18 days ago • 2.58k • 108

hamishivi/wots_the_weather

Viewer • Updated 20 days ago • 32 • 100

hamishivi/IF_multi_constraints_upto5_filtered_dpo_0625_filter-keyword-filtered

Viewer • Updated Oct 29, 2025 • 57.8k • 29

hamishivi/olmo_msgs_thinker

Viewer • Updated Oct 28, 2025 • 60 • 85

View 186 datasets