Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ZhenyuLiu's picture
3 13 1

ZhenyuLiu

foggyforest

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago
HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v2
upvoted a paper about 2 months ago
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
upvoted a paper about 2 months ago
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development
View all activity

Organizations

None yet

Collections 2

vllm data
  • PiTe: Pixel-Temporal Alignment for Large Video-Language Model

    Paper • 2409.07239 • Published Sep 11, 2024 • 15
speech
  • LLaMA-Omni: Seamless Speech Interaction with Large Language Models

    Paper • 2409.06666 • Published Sep 10, 2024 • 59
vllm data
  • PiTe: Pixel-Temporal Alignment for Large Video-Language Model

    Paper • 2409.07239 • Published Sep 11, 2024 • 15
speech
  • LLaMA-Omni: Seamless Speech Interaction with Large Language Models

    Paper • 2409.06666 • Published Sep 10, 2024 • 59

Papers 8

arxiv:2505.04921
arxiv:2502.19917
arxiv:2501.01028
arxiv:2410.10293

models 2

foggyforest/Qwen2-VL-2B-ViSA-80K

Image-to-Text • 2B • Updated Apr 11 • 3

foggyforest/Qwen2-VL-2B-Instruction-ViSA-700K

Image-to-Text • 2B • Updated Apr 1 • 3

datasets 2

foggyforest/ViSA_LlavaOV_80K

Viewer • Updated Apr 7 • 86.4k • 8

foggyforest/ViSA_LlavaOV_700K

Viewer • Updated Apr 7 • 694k • 11
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs