kanwal-mehreen18/Llama3.1-8B-GRPO

text-generation-inference

353 MB

1 contributor

History: 4 commits

kanwal-mehreen18: Trained with Unsloth (c14fb78, verified, 9 months ago)