5 9 4

An Yan

zzxslp

zzxslp

AI & ML interests

Vision and Language, text generation

Recent Activity

upvoted a paper 28 days ago

BLIP3o-NEXT: Next Frontier of Native Image Generation

upvoted a paper 28 days ago

CaptionQA: Is Your Caption as Useful as the Image Itself?

upvoted a paper 8 months ago

Trust but Verify: Programmatic VLM Evaluation in the Wild

View all activity

Organizations

upvoted 2 papers 28 days ago

BLIP3o-NEXT: Next Frontier of Native Image Generation

Paper • 2510.15857 • Published Oct 17 • 24

CaptionQA: Is Your Caption as Useful as the Image Itself?

Paper • 2511.21025 • Published Nov 26 • 27

upvoted 4 papers 8 months ago

Trust but Verify: Programmatic VLM Evaluation in the Wild

Paper • 2410.13121 • Published Oct 17, 2024 • 3

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Paper • 2311.07562 • Published Nov 13, 2023 • 15

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14 • 73

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 79

liked a dataset 12 months ago

Salesforce/PROVE

Viewer • Updated Feb 3 • 4.36k • 161 • 5

authored 2 papers about 1 year ago

Trust but Verify: Programmatic VLM Evaluation in the Wild

Paper • 2410.13121 • Published Oct 17, 2024 • 3

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published Nov 12, 2024 • 23

upvoted a paper about 1 year ago

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published Nov 12, 2024 • 23

updated 2 models over 1 year ago

zzxslp/vicuna-7b-v1.5-rm-P1B

7B • Updated Aug 23, 2024 • 5

zzxslp/vicuna-7b-v1.5-rm-P1A

7B • Updated Aug 21, 2024 • 7

authored a paper over 1 year ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 101

upvoted a paper over 1 year ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 101

updated 2 models over 1 year ago

zzxslp/som-llava-v1.5-13b

Text Generation • 13B • Updated Jun 5, 2024 • 49 • 5

zzxslp/som-llava-v1.5-7b

Text Generation • 7B • Updated Jun 5, 2024 • 106

New activity in HuggingFaceM4/idefics2-8b over 1 year ago

OSError: HuggingFaceM4/idefics2-8b does not appear to have a file named config.json.

#60 opened over 1 year ago by

zzxslp

liked a model over 1 year ago

Salesforce/xgen-mm-phi3-mini-instruct-r-v1

Image-Text-to-Text • 5B • Updated Feb 3 • 443 • 186

updated a model over 1 year ago

zzxslp/som-llava-v1.5-13b-hf

Image-to-Text • 13B • Updated May 7, 2024 • 18 • 1

New activity in zzxslp/som-llava-v1.5-13b over 1 year ago

How to load the som-llava model using the transformers library?

#1 opened over 1 year ago by

dyliu

An Yan

AI & ML interests

Recent Activity

Organizations

zzxslp's activity

OSError: HuggingFaceM4/idefics2-8b does not appear to have a file named config.json.

How to load the som-llava model using the transformers library?