AdversarialRLHF
/

rloo_pythia410m_tldr6.9b_rm410mdata_allprefixsft_prefix

Model card Files Files and versions Community

rloo_pythia410m_tldr6.9b_rm410mdata_allprefixsft_prefix / checkpoint-24 /tokenizer.json

Muqeeth's picture

Training in progress, step 24, checkpoint

bff773e verified 3 months ago

history contribute delete

3.56 MB

File too large to display, you can check the raw version instead.