Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Long Zhao's picture
3 4 5

Long Zhao

garyzhao9012
tingianliu's profile picture kmnp's profile picture Ksgk-fy's profile picture
·
https://garyzhao.github.io/
  • garyzhao9012
  • garyzhao

AI & ML interests

Computer Vision, Machine Perception, Machine Learning

Organizations

Google's profile picture ICCV2023's profile picture

upvoted a collection 3 months ago

VideoPrism

Collection
VideoPrism is a foundational video encoder that enables state-of-the-art performance on a large variety of video understanding tasks. • 5 items • Updated Jul 16 • 12
upvoted a paper 4 months ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 37
upvoted a paper 9 months ago

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

Paper • 2502.03628 • Published Feb 5 • 12
upvoted a paper 11 months ago

Video Creation by Demonstration

Paper • 2412.09551 • Published Dec 12, 2024 • 9
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs