AI & ML interests
Large Multimodal Models
Organizations
None yet
Zhang199/TinyLLaVA-Qwen2-0.5B-SigLIP
Image-Text-to-Text
•
1B
•
Updated
•
127
•
5
Zhang199/EDGE-GRPO-Qwen-1.5B
Text Generation
•
2B
•
Updated
•
3
Zhang199/EDGE-GRPO-Qwen-7B
Text Generation
•
8B
•
Updated
•
10
•
1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-16-512
Video-Text-to-Text
•
4B
•
Updated
•
12
•
1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Naive-16-512
Video-Text-to-Text
•
4B
•
Updated
•
14
Zhang199/TinyLLaVA-Video-Phi2-Naive-16-512
Video-Text-to-Text
•
3B
•
Updated
•
15
Zhang199/TinyLLaVA-Qwen2.5-3B-SigLIP
Image-Text-to-Text
•
4B
•
Updated
•
11
Zhang199/TinyLLaVA-Video-R1
Video-Text-to-Text
•
4B
•
Updated
•
59
•
4
Zhang199/TinyLLaVA-Video-Coldstart_NextQA_16
Video-Text-to-Text
•
4B
•
Updated
•
15
•
1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-1fps-512
Video-Text-to-Text
•
4B
•
Updated
•
23
Zhang199/subject_bert_mmmu
Text Classification
•
0.1B
•
Updated
•
8