arxiv:2511.10390
Ziyang Zhang
zenosai
AI & ML interests
Multi-modal Learning, OCR
Recent Activity
Authored a paper (about 4 hours ago): MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns
Liked a Space (29 days ago): HuggingFaceTB/smol-training-playbook
Authored a paper (about 1 month ago): Intern-S1: A Scientific Multimodal Foundation Model