1 212 159

Gibran Iqbal PRO

Jibbscript

AI & ML interests

None yet

Recent Activity

liked a model about 2 hours ago

fangwu97/DeepSearch-1.5B

liked a model 1 day ago

microsoft/UserLM-8b

liked a model 6 days ago

ibm-granite/granite-timeseries-ttm-r2

View all activity

Organizations

liked a model about 2 hours ago

fangwu97/DeepSearch-1.5B

Text Generation • 2B • Updated 10 days ago • 409 • 8

liked a model 1 day ago

microsoft/UserLM-8b

Text Generation • 8B • Updated 21 days ago • 4.59k • 336

liked a model 6 days ago

ibm-granite/granite-timeseries-ttm-r2

Time Series Forecasting • 805k • Updated Feb 26 • 147k • 129

upvoted an article 6 days ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

8 days ago

• 103

upvoted 5 papers 7 days ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published 9 days ago • 80

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published 9 days ago • 100

upvoted a paper 8 days ago

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper • 2510.18866 • Published 9 days ago • 105

liked 2 models 9 days ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated 6 days ago • 1.34M • 2.26k

nanonets/Nanonets-OCR2-3B

Image-Text-to-Text • 4B • Updated 15 days ago • 61.2k • 425

upvoted a paper 10 days ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published 20 days ago • 49

upvoted a paper 11 days ago

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 17 days ago • 31

upvoted a paper 12 days ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published 15 days ago • 101

upvoted 5 papers 14 days ago

Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs

Paper • 2510.13795 • Published 15 days ago • 50

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 113

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

Paper • 2510.11062 • Published 18 days ago • 25

Generative Universal Verifier as Multimodal Meta-Reasoner

Paper • 2510.13804 • Published 15 days ago • 24

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published 15 days ago • 55

Gibran Iqbal PRO

AI & ML interests

Recent Activity

Organizations

Jibbscript's activity

Building the Open Agent Ecosystem Together: Introducing OpenEnv