Bokai Xu

bokesyo

https://bokaixu.site

AI & ML interests

None yet

Recent Activity

replied to mitkox's post about 21 hours ago

Training a model to reason in the continuous latent space based on Meta's Coconut. If it all works will apply it on the MiniCPM-o SVD-LR. Endgame is a multimodal, adaptive, and efficient foundational on device AI model.

reacted to mitkox's post with 👀 about 21 hours ago

reacted to mitkox's post with 🚀 about 21 hours ago

bokesyo's activity

upvoted a paper 3 months ago

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Paper • 2410.10594 • Published Oct 14, 2024 • 24

upvoted 3 papers 5 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 156

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Paper • 2408.03615 • Published Aug 7, 2024 • 31

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 79

upvoted a paper 6 months ago

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Paper • 2405.00233 • Published Apr 30, 2024 • 16

upvoted a collection 6 months ago

Awesome Visual Embedding

9 items • Updated Jul 23, 2024 • 4