Reformulating the RL of reasoning LLMs through the Markovian Thinking paradigm.
AI & ML interests
computational linguistics, natural language processing
spaces
7
AfroBench 🥇 • pinned • Running • 5 • Comprehensive benchmark of LLMs on African Languages
mSTEB Leaderboard 🥇 • pinned • Running • 1 • Leaderboard for mSTEB benchmark
WebLINX Explorer 😻 • pinned • Running • 17 • Visualize web interaction recordings
Agent Reward Bench Leaderboard 🥇 • Running • 2 • Leaderboard for AgentRewardBench
Agent Reward Bench Demo 💻 • Running • 4 • Explore agent trajectories and judgments in web benchmarks
Safearena Leaderboard 🏃 • Running • 3 • SafeArena Leaderboard
models
73
McGill-NLP/longcot-8k-1.5b • 2B • Updated • 19
McGill-NLP/delethink-24k-1.5b • 2B • Updated • 498 • 5
McGill-NLP/delethink-96k-1.5b • 2B • Updated • 36 • 3
McGill-NLP/longcot-24k-1.5b • 2B • Updated • 27 • 1
McGill-NLP/delethink-96k-base-1.5b • 2B • Updated • 13 • 1
McGill-NLP/ssa-comet-mtl-final • Updated
McGill-NLP/ssa-comet-stl-final • Updated
McGill-NLP/ssa-comet-qe-final • Updated
McGill-NLP/gemma-2-9b-it-Injongo-intent • Text Generation • 9B • Updated • 1
McGill-NLP/gemma-2-9b-it-Injongo-slot • Text Generation • 9B • Updated • 1
datasets
36
McGill-NLP/SSA-MT • Viewer • Updated • 23.3k • 124
McGill-NLP/SSA-MTE • Viewer • Updated • 92.9k • 171 • 2
McGill-NLP/openmath-filtered • Viewer • Updated • 200k • 100
McGill-NLP/WebLINX-full • Updated • 31.6k • 6
McGill-NLP/knowledge-intensive-vqa-in-the-wild • Viewer • Updated • 380 • 27
McGill-NLP/msteb_requests • Updated • 159
McGill-NLP/msteb_results • Updated • 117
McGill-NLP/GlobalNLI • Viewer • Updated • 37.2k • 44
McGill-NLP/WebMMU • Viewer • Updated • 4.24k • 84 • 1
McGill-NLP/AdvBench-IR-Small-Wiki-100 • Viewer • Updated • 50.9k • 36