-
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 34 -
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 100 -
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
Paper • 2511.12884 • Published • 17 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 127
MN
ma1664
·
AI & ML interests
None yet
Recent Activity
updated
a collection
about 17 hours ago
Papers
updated
a collection
about 17 hours ago
Papers
updated
a collection
about 17 hours ago
Papers
Organizations
None yet
Models
Spaces
-
RunningFeatured428
FastVLM WebGPU
🍎428Real-time video captioning powered by FastVLM
-
Running on ZeroMCPFeatured1.73k
Qwen Image Edit Camera Control
🎬1.73kFast 4 step inference with Qwen Image Edit 2509
-
Running on ZeroFeatured337
Depth Anything 3
🏢337Create detailed depth maps from images using Depth Anything 3
Datasets
Papers
-
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 34 -
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 100 -
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
Paper • 2511.12884 • Published • 17 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 127
Spaces
-
RunningFeatured428
FastVLM WebGPU
🍎428Real-time video captioning powered by FastVLM
-
Running on ZeroMCPFeatured1.73k
Qwen Image Edit Camera Control
🎬1.73kFast 4 step inference with Qwen Image Edit 2509
-
Running on ZeroFeatured337
Depth Anything 3
🏢337Create detailed depth maps from images using Depth Anything 3
Models
Datasets