Jinrui Zhang's picture

6 6

Jinrui Zhang

zjr2000

·

AI & ML interests

None yet

Recent Activity

liked a model 15 days ago

kr-cen/BLIP3o-Next-MICo

upvoted a paper about 1 month ago

ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

upvoted a collection about 2 months ago

Nemotron-Pre-Training-Datasets

View all activity

Organizations

None yet

liked a model 15 days ago

kr-cen/BLIP3o-Next-MICo

Any-to-Any • 5B • Updated 22 days ago • 24 • 2

upvoted a paper about 1 month ago

ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

Paper • 2511.14349 • Published Nov 18, 2025 • 17

upvoted a collection about 2 months ago

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 10 days ago • 85

upvoted a paper 2 months ago

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22, 2025 • 29

upvoted a paper 4 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27, 2025 • 21

updated a dataset 5 months ago

zjr2000/spes-debug

Updated Aug 4, 2025 • 8

published a dataset 5 months ago

zjr2000/spes-debug

Updated Aug 4, 2025 • 8

authored 4 papers 7 months ago

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

Paper • 2305.02677 • Published May 4, 2023

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

Paper • 2307.16525 • Published Jul 31, 2023

LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos

Paper • 2411.19772 • Published Nov 29, 2024

TIIF-Bench: How Does Your T2I Model Follow Your Instructions?

Paper • 2506.02161 • Published Jun 2, 2025 • 13

upvoted a paper 7 months ago

TIIF-Bench: How Does Your T2I Model Follow Your Instructions?

Paper • 2506.02161 • Published Jun 2, 2025 • 13

liked a dataset 7 months ago

A113NW3I/TIIF-Bench-Data

Viewer • Updated Jun 4, 2025 • 1.7k • 1.77k • 5

upvoted a paper 8 months ago

VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank

Paper • 2505.14460 • Published May 20, 2025 • 32

liked a model about 1 year ago

THUdyh/Oryx-ViT

Image Feature Extraction • Updated Mar 1, 2025 • 8

updated a dataset over 1 year ago

zjr2000/REVERIE

Viewer • Updated Jul 6, 2024 • 254k • 23 • 2

reacted to ArthurZ's post with ❤️🤝 almost 2 years ago

Post

mamba is now available in transformers. Thanks to @tridao and @albertgu for this brilliant model! 🚀 and the amazing mamba-ssm kernels powering this!
Checkout the collection here:
state-spaces/transformers-compatible-mamba-65e7b40ab87e5297e45ae406

5 replies

·

liked a Space about 2 years ago

MM-Vet Evaluator

Evaluate AI model predictions with correctness scores

liked a model about 2 years ago

liuhaotian/llava-v1.5-7b

Image-Text-to-Text • Updated May 8, 2024 • 134k • 525