Qwen3-8B-GRPO / README.md
jadohu's picture
Update README.md
6a983e9 verified
metadata
license: apache-2.0
datasets:
  - agentica-org/DeepScaleR-Preview-Dataset
language:
  - en
base_model:
  - Qwen/Qwen3-8B-Base
pipeline_tag: reinforcement-learning