Yong Jae Lee

yjlee0222

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago

Relational Visual Similarity

upvoted a paper 7 months ago

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

commented on a paper 7 months ago

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

Organizations

None yet

authored 2 papers 8 months ago

X-Fusion: Introducing New Modality to Frozen Large Language Models

Paper • 2504.20996 • Published Apr 29, 2025 • 13

YoChameleon: Personalized Vision and Language Generation

Paper • 2504.20998 • Published Apr 29, 2025 • 12

authored a paper 9 months ago

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Paper • 2504.00557 • Published Apr 1, 2025 • 15

authored 2 papers about 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7

authored 2 papers over 1 year ago

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Paper • 2406.20095 • Published Jun 28, 2024 • 18

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 34

authored 2 papers about 2 years ago

Interfacing Foundation Models' Embeddings

Paper • 2312.07532 • Published Dec 12, 2023 • 12

Improved Baselines with Visual Instruction Tuning

Paper • 2310.03744 • Published Oct 5, 2023 • 39

authored a paper over 2 years ago

Generate Anything Anywhere in Any Scene

Paper • 2306.17154 • Published Jun 29, 2023 • 22