CLIPGuys

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

taesiri submitted a paper about 5 hours ago

Rethinking Video Generation Model for the Embodied World

taesiri submitted a paper about 5 hours ago

FARE: Fast-Slow Agentic Robotic Exploration

taesiri submitted a paper about 5 hours ago

RoboBrain 2.5: Depth in Sight, Time in Mind

View all activity

taesiri

submitted 5 papers to Daily Papers about 5 hours ago

Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek

Paper • 2601.15100 • Published about 16 hours ago

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

Paper • 2601.14750 • Published about 24 hours ago • 9

taesiri

submitted a paper to Daily Papers 2 days ago

The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Paper • 2601.10387 • Published 7 days ago • 9

taesiri

submitted 5 papers to Daily Papers 3 days ago

FrankenMotion: Part-level Human Motion Generation and Composition

Paper • 2601.10909 • Published 6 days ago • 17

Building Production-Ready Probes For Gemini

Paper • 2601.11516 • Published 6 days ago • 6

Reasoning Models Generate Societies of Thought

Paper • 2601.10825 • Published 7 days ago • 10

AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems

Paper • 2601.11354 • Published 6 days ago • 3

BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search

Paper • 2601.11037 • Published 6 days ago • 15

taesiri

submitted 5 papers to Daily Papers 6 days ago

Action100M: A Large-scale Video Action Dataset

Paper • 2601.10592 • Published 7 days ago • 25

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published 7 days ago • 31

FlowAct-R1: Towards Interactive Humanoid Video Generation

Paper • 2601.10103 • Published 7 days ago • 29

Inference-time Physics Alignment of Video Generative Models with Latent World Models

Paper • 2601.10553 • Published 7 days ago • 11

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published 7 days ago • 25

taesiri

submitted 4 papers to Daily Papers 7 days ago

TranslateGemma Technical Report

Paper • 2601.09012 • Published 8 days ago • 18

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Paper • 2601.09575 • Published 8 days ago • 24

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Paper • 2601.09465 • Published 8 days ago • 39

The AI Hippocampus: How Far are We From Human Memory?

Paper • 2601.09113 • Published 8 days ago • 4

AI & ML interests

Recent Activity

Team members 1

CLIPGuys's activity