Daniel Serrano

dnlserrano · https://dnlserrano.dev

AI & ML interests

computer vision, biometrics, face, facial recognition, deepfakes, pad, mad, age, bias

Recent Activity

liked a Space 2 days ago

Qwen/Qwen-Image-2512

liked a Space 4 months ago

apple/fastvlm-webgpu

Organizations

None yet

upvoted 2 collections 4 months ago

FastVLM

Efficient Vision Encoding for Vision Language Models • 9 items • Updated Sep 2, 2025 • 106

MobileCLIP2

MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 37 items • Updated Sep 18, 2025 • 57

upvoted a collection 8 months ago

Nomic Embed Vision

Vision Encoders aligned to Nomic Embed Text making Nomic Embed multimodal! • 2 items • Updated Jun 5, 2024 • 10

upvoted an article 11 months ago

Introduction to ggml

Article • Aug 13, 2024 • 256

upvoted a paper about 1 year ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147