edbeeching
·
AI & ML interests
None yet
Organizations
edbeeching/mujoco_pusher_1111
Reinforcement Learning
•
Updated
•
1
edbeeching/llama-7b-ift-ds-save-test5
Text Generation
•
Updated
•
4
edbeeching/llama-7b-ift-ds-save-test4
Text Generation
•
Updated
•
4
edbeeching/llama-7b-ift-ds-save-test3
Text Generation
•
Updated
•
4
edbeeching/llama-65b-ift-ds-v03
Text Generation
•
Updated
•
2
edbeeching/llama-65b-ift-ds-v02
Text Generation
•
Updated
•
4
edbeeching/llama-7b-se-rl-tokenizer
Updated
edbeeching/llama-se-rl-adapter
Text Generation
•
Updated
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5step_1000-adapter-merged
Updated
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5_adamstep_800-adapter-merged
Text Generation
•
Updated
•
3
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5_adamstep_1100-adapter-merged
Text Generation
•
Updated
•
2
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5_adamstep_1000-adapter-merged
Text Generation
•
Updated
•
2
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5_adamstep_600-adapter-merged
Text Generation
•
Updated
•
2
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5step_1200-adapter-merged
Text Generation
•
Updated
•
2
edbeeching/llama-7b_stack-exchange_RM_peft-adapter-merged
edbeeching/gpt2_stack-exchange-paired_rmts__10000_2e-05_hub
Text Classification
•
Updated
•
1
edbeeching/gpt2_stack-exchange-paired_rmts__1000_2e-05_hub
Updated
edbeeching/gpt2-xl-stackexchange_stack-exchange-paired_rmts__240000_2e-05_hub
Text Classification
•
Updated
•
2
edbeeching/gpt2-xl-stackexchange_stack-exchange-paired_rmts__240000_1e-05_hub
Text Classification
•
Updated
•
2
edbeeching/gpt2-xl-stackexchange_stack-exchange-paired_rmts__240000_8e-05_hub
Text Classification
•
Updated
•
2
edbeeching/gpt2-xl-stackexchange_stack-exchange-paired_rmts__240000_4e-05_hub
Text Classification
•
Updated
•
2
edbeeching/GPT-JT-6B-v1_stack-exchange-paired_rmts__200_2e-05_hub
Updated
edbeeching/gpt2-xl-stackexchange_stack-exchange-paired_rmts__200_2e-05_hub
Updated
edbeeching/gpt2-xl-stackexchange_stack-exchange-paired_rmts_1000_hub
Text Classification
•
Updated
•
2
edbeeching/gpt2_stack-exchange-paired_rmts_1000
Text Classification
•
Updated
•
1
edbeeching/gpt2_stack-exchange-paired_rmts_1000_hub
Text Classification
•
Updated
•
2
edbeeching/gpt-neox-20b-imdb-lora-lr5e-5-adapter-merged-ppo-sentiment
Updated
edbeeching/gpt2-xl-stackexchange_stack-exchange-paired_rmts_240000_bup
Text Classification
•
Updated
•
2
edbeeching/gpt2-xl-stackexchange_stack-exchange-paired_rmts_240000
Text Classification
•
Updated
•
1
edbeeching/gpt2-xl-stackexchange_stack-exchange-paired_rmts_1000
Updated