tingting gao's picture

2

tingting gao

TinaGao

·

AI & ML interests

MLLMs|Diffusion Models|Computer Vision

Organizations

None yet

authored 14 papers 4 months ago

DragAnything: Motion Control for Anything using Entity Representation

Paper • 2403.07420 • Published Mar 12, 2024 • 14

Learning Multi-dimensional Human Preference for Text-to-Image Generation

Paper • 2405.14705 • Published May 23, 2024

CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation

Paper • 2406.10462 • Published Jun 15, 2024

Decouple Content and Motion for Conditional Image-to-Video Generation

Paper • 2311.14294 • Published Nov 24, 2023

Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization

Paper • 2502.01051 • Published Feb 3, 2025 • 1

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14, 2025 • 34

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

Paper • 2505.21067 • Published May 27, 2025 • 3

InstructEngine: Instruction-driven Text-to-Image Alignment

Paper • 2504.10329 • Published Apr 14, 2025

OneRec Technical Report

Paper • 2506.13695 • Published Jun 16, 2025 • 17

TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types

Paper • 2502.09925 • Published Feb 14, 2025

Thyme: Think Beyond Images

Paper • 2508.11630 • Published Aug 15, 2025 • 81

Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models

Paper • 2504.08809 • Published Apr 9, 2025 • 1

OneRec-V2 Technical Report

Paper • 2508.20900 • Published Aug 28, 2025 • 21

Kwai Keye-VL 1.5 Technical Report

Paper • 2509.01563 • Published Sep 1, 2025 • 37

authored a paper 6 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 130