Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2391.9
TFLOPS
57
15
112
chansung park
PRO
chansung
Follow
raincandy-u's profile picture
aumsathwara's profile picture
ManeAI31416's profile picture
3001 followers
Ā·
11 following
algo_diver
deep-diver
AI & ML interests
None yet
Recent Activity
published
a Space
about 2 hours ago
chansung/adaptsum
reacted
to
their
post
with š
1 day ago
Simple summary on DeepSeek AI's Janus-Pro: A fresh take on multimodal AI! It builds on its predecessor, Janus, by tweaking the training methodology rather than the model architecture. The result? Improved performance in understanding and generating multimodal data. Janus-Pro uses a three-stage training strategy, similar to Janus, but with key modifications: ā¦ Stage 1 & 2: Focus on separate training for specific objectives, rather than mixing data. ā¦ Stage 3: Fine-tuning with a careful balance of multimodal data. Benchmarks show Janus-Pro holds its own against specialized models like TokenFlow XL and MetaMorph, and other multimodal models like SD3 Medium and DALL-E 3. The main limitation? Low image resolution (384x384). However, this seems like a strategic choice to focus on establishing a solid "recipe" for multimodal models. Future work will likely leverage this recipe and increased computing power to achieve higher resolutions.
posted
an
update
1 day ago
Simple summary on DeepSeek AI's Janus-Pro: A fresh take on multimodal AI! It builds on its predecessor, Janus, by tweaking the training methodology rather than the model architecture. The result? Improved performance in understanding and generating multimodal data. Janus-Pro uses a three-stage training strategy, similar to Janus, but with key modifications: ā¦ Stage 1 & 2: Focus on separate training for specific objectives, rather than mixing data. ā¦ Stage 3: Fine-tuning with a careful balance of multimodal data. Benchmarks show Janus-Pro holds its own against specialized models like TokenFlow XL and MetaMorph, and other multimodal models like SD3 Medium and DALL-E 3. The main limitation? Low image resolution (384x384). However, this seems like a strategic choice to focus on establishing a solid "recipe" for multimodal models. Future work will likely leverage this recipe and increased computing power to achieve higher resolutions.
View all activity
Articles
dstack to manage clusters of on-prem servers for AI workloads with ease
Oct 10, 2024
ā¢
7
dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified
Aug 22, 2024
ā¢
12
Deploying š¤ ViT on Vertex AI
Aug 19, 2022
ā¢
2
Deploying š¤ ViT on Kubernetes with TF Serving
Aug 11, 2022
Organizations
chansung
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a Space
5 months ago
Running
5
š
Mesop Duo Chat
liked
a Space
6 months ago
Running
on
Zero
6.63k
š„ļø
FLUX.1 [dev]
liked
a model
6 months ago
meta-llama/Llama-3.1-405B
Text Generation
ā¢
Updated
Sep 25, 2024
ā¢
8.49k
ā¢
917
liked
2 models
7 months ago
google/gemma-2-9b
Text Generation
ā¢
Updated
Aug 7, 2024
ā¢
72.5k
ā¢
634
wave-on-discord/gemini-nano
Updated
Jun 24, 2024
ā¢
102
liked
a model
8 months ago
nvidia/Nemotron-4-340B-Base
Updated
Jun 28, 2024
ā¢
178
ā¢
145
liked
a dataset
9 months ago
llama-duo/synth_summarize_dataset
Viewer
ā¢
Updated
May 31, 2024
ā¢
903k
ā¢
241
ā¢
5
liked
a Space
9 months ago
Running
on
T4
311
š¤²
PaliGemma Demo
liked
a dataset
9 months ago
llama-duo/coverage_dataset
Viewer
ā¢
Updated
May 11, 2024
ā¢
10k
ā¢
135
ā¢
1
liked
2 models
9 months ago
chansung/llamaduo_synth_ds_v0.1
Updated
Apr 30, 2024
ā¢
5
ā¢
1
meta-llama/Meta-Llama-3-8B
Text Generation
ā¢
Updated
Sep 27, 2024
ā¢
660k
ā¢
5.99k
liked
2 datasets
9 months ago
HuggingFaceH4/no_robots
Viewer
ā¢
Updated
Apr 18, 2024
ā¢
10k
ā¢
1.29k
ā¢
464
chansung/merged_ds_coding
Viewer
ā¢
Updated
Apr 23, 2024
ā¢
60.6k
ā¢
46
ā¢
16
liked
a model
10 months ago
ai21labs/Jamba-v0.1
Text Generation
ā¢
Updated
Sep 11, 2024
ā¢
10.9k
ā¢
1.18k
liked
a Space
10 months ago
Running
on
Zero
147
š„
Llava Next
liked
2 models
11 months ago
ehristoforu/dalle-3-xl-v2
Text-to-Image
ā¢
Updated
Mar 9, 2024
ā¢
464
ā¢
121
google/metricx-23-qe-xxl-v2p0
Updated
22 days ago
ā¢
1.11k
ā¢
6
liked
a Space
11 months ago
Runtime error
16
š„
Gradio š¤ TGI
Gradio and TGI packed in the same machine
liked
2 models
11 months ago
xai-org/grok-1
Text Generation
ā¢
Updated
Mar 28, 2024
ā¢
392
ā¢
2.23k
Crystalcareai/GemMoE-Beta-1
Text Generation
ā¢
Updated
Mar 20, 2024
ā¢
66
ā¢
79
Load more