billmatrix nielsr HF Staff committed on
Commit 1d2cc9c · verified · 1 Parent(s): fed844c

Add pipeline tag, library name, and improve project page link (#1)

- Add pipeline tag, library name, and improve project page link (ac98c7dc85105a065f585bb5889c45d5473d8bb3)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
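
The fields this commit touches live in the YAML front matter at the top of README.md, between the two `---` lines. As a rough illustration of what the new `pipeline_tag` and `library_name` entries look like to a consumer, here is a minimal sketch that parses the post-commit front matter. The `parse_front_matter` helper and the `README_AFTER` constant are hypothetical names introduced for this example, and the parser only handles the flat `key: value` and `key:` plus `- item` shapes seen in this diff; a real tool would use a full YAML parser.

```python
# Minimal sketch (assumption: front matter uses only the flat shapes in this diff).
# README_AFTER reproduces the post-commit header lines shown in the diff below.
README_AFTER = """\
---
base_model:
- Qwen/Qwen2.5-7B-Instruct
datasets:
- miromind-ai/MiroRL-GenQA
language:
- en
license: apache-2.0
tags:
- agent
- deepresearch
- llm
- rl
- reinforcementlearning
pipeline_tag: text-generation
library_name: transformers
---

# Model Card for PokeeResearch
"""


def parse_front_matter(text: str) -> dict:
    """Return the front-matter block as a dict of str or list[str] values."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter present
    meta: dict = {}
    current_key = None  # key whose list items are currently being collected
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter ends the block
            break
        if not stripped:
            continue
        if stripped.startswith("- ") and current_key is not None:
            meta[current_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key, value = key.strip(), value.strip()
            if value:
                meta[key] = value  # scalar entry, e.g. license
                current_key = None
            else:
                meta[key] = []  # list entry, items follow on later lines
                current_key = key
    return meta


meta = parse_front_matter(README_AFTER)
print(meta["pipeline_tag"])
print(meta["library_name"])
print(meta["base_model"])
```

The reordering of `base_model`, `datasets`, and `license` in the diff does not change the parsed result, since the keys are read into a mapping; only the two new keys add information.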

Files changed (1)
  1. README.md +9 -7
README.md CHANGED
@@ -1,17 +1,19 @@
 ---
-license: apache-2.0
+base_model:
+- Qwen/Qwen2.5-7B-Instruct
+datasets:
+- miromind-ai/MiroRL-GenQA
 language:
 - en
+license: apache-2.0
 tags:
 - agent
 - deepresearch
 - llm
 - rl
 - reinforcementlearning
-datasets:
-- miromind-ai/MiroRL-GenQA
-base_model:
-- Qwen/Qwen2.5-7B-Instruct
+pipeline_tag: text-generation
+library_name: transformers
 ---
 
 # Model Card for PokeeResearch
@@ -33,7 +35,7 @@ The model integrates **Reinforcement Learning from AI Feedback (RLAIF)** with a
 
 - **Repository:** [https://github.com/Pokee-AI/PokeeResearchOSS](https://github.com/Pokee-AI/PokeeResearchOSS)
 - **Paper:** [*PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold*](https://arxiv.org/pdf/2510.15862), Pokee AI, October 2025
-- **API Access:** [https://pokee.ai/deepresearch-preview](https://pokee.ai/deepresearch-preview)
+- **Project Page:** [https://pokee.ai/deepresearch-preview](https://pokee.ai/deepresearch-preview)
 
 ---
 
@@ -203,4 +205,4 @@ For inquiries, contact: hello@pokee.ai
 **Yi Wan**, **Jiuqi Wang**, Liam Li, Jinsong Liu, Ruihao Zhu, and Zheqing Zhu — Pokee AI Research Team
 
 ## Model Card Contact
-Pokee AI Team — hello@pokee.ai
+Pokee AI Team — hello@pokee.ai