Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ziyang Zhang's picture
In a Training Loop 🔄
1 2 8

Ziyang Zhang

zenosai
Mulugetaabrham's profile picture
·
  • zenosai

AI & ML interests

Multi-modal Learning, OCR

Recent Activity

authored a paper 25 days ago
MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns
liked a Space about 2 months ago
HuggingFaceTB/smol-training-playbook
authored a paper 2 months ago
Intern-S1: A Scientific Multimodal Foundation Model
View all activity

Organizations

VLRLab-OCR's profile picture

authored a paper 25 days ago

MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns

Paper • 2511.10390 • Published Nov 13
authored a paper 2 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 259
authored 2 papers 5 months ago

SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting

Paper • 2504.09966 • Published Apr 14

MonkeyOCR: Document Parsing with a Structure-Recognition-Relation Triplet Paradigm

Paper • 2506.05218 • Published Jun 5 • 2
authored a paper 12 months ago

R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models

Paper • 2410.17885 • Published Oct 23, 2024
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs