SIGGRAPHAsia2022 (SIGGRAPH Asia 2022 Demos)

AtAndDev

posted an update 4 days ago

Post

190

Qwen 3 Coder is a personal attack to k2, and I love it.
It achieves near SOTA on LCB while not having reasoning.
Finally people are understanding that reasoning isnt necessary for high benches...

Qwen ftw!

DECENTRALIZE DECENTRALIZE DECENTRALIZE

AtAndDev

posted an update about 2 months ago

Post

2885

deepseek-ai/DeepSeek-R1-0528

This is the end

1 reply

·

AtAndDev

posted an update 4 months ago

Post

3120

Llama 4 is out...

3 replies

·

menghanxia

authored a paper 4 months ago

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 142

AtAndDev

posted an update 4 months ago

Post

4345

There seems to multiple paid apps shared here that are based on models on hf, but some ppl sell their wrappers as "products" and promote them here. For a long time, hf was the best and only platform to do oss model stuff but with the recent AI website builders anyone can create a product (really crappy ones btw) and try to sell it with no contribution to oss stuff. Please dont do this, or try finetuning the models you use...
Sorry for filling yall feed with this bs but yk...

6 replies

·

AtAndDev

posted an update 5 months ago

Post

1660

Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.

AtAndDev

posted an update 5 months ago

Post

2494

@nroggendorff is that you sama?

2 replies

·

AtAndDev

posted an update 6 months ago

Post

1942

everywhere i go i see his face

AtAndDev

posted an update 6 months ago

Post

577

Deepseek gang on fire fr fr

AtAndDev

posted an update 6 months ago

Post

1655

R1 is out! And with a lot of other R1 releated models...

akhaliq

posted an update 7 months ago

Post

24523

Google drops Gemini 2.0 Flash Thinking

a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more

now available in anychat, try it out: https://huggingface.co/spaces/akhaliq/anychat

4 replies

·

AtAndDev

posted an update 7 months ago

Post

495

@s3nh Hey man check your discord! Got some news.

4 replies

·

menghanxia

authored 2 papers 8 months ago

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Paper • 2412.07760 • Published Dec 10, 2024 • 56

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Paper • 2412.07759 • Published Dec 10, 2024 • 18

PKUWilliamYang

authored a paper 8 months ago

Trajectory Attention for Fine-grained Video Motion Control

Paper • 2411.19324 • Published Nov 28, 2024 • 12

akhaliq

posted an update 8 months ago

Post

24120

QwQ-32B-Preview is now available in anychat

A reasoning model that is competitive with OpenAI o1-mini and o1-preview

try it out: https://huggingface.co/spaces/akhaliq/anychat

1 reply

·

akhaliq

posted an update 8 months ago

Post

4747

New model drop in anychat

allenai/Llama-3.1-Tulu-3-8B is now available

try it here: https://huggingface.co/spaces/akhaliq/anychat

akhaliq

posted an update 8 months ago

Post

3568

anychat

supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app

try it out there: https://huggingface.co/spaces/akhaliq/anychat

menghanxia

authored a paper about 1 year ago

FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models

Paper • 2406.16863 • Published Jun 24, 2024 • 11

akhaliq

posted an update about 1 year ago

Post

21061

Phased Consistency Model

Phased Consistency Model (2405.18407)

The consistency model (CM) has recently made significant progress in accelerating the generation of diffusion models. However, its application to high-resolution, text-conditioned image generation in the latent space (a.k.a., LCM) remains unsatisfactory. In this paper, we identify three key flaws in the current design of LCM. We investigate the reasons behind these limitations and propose the Phased Consistency Model (PCM), which generalizes the design space and addresses all identified limitations. Our evaluations demonstrate that PCM significantly outperforms LCM across 1--16 step generation settings. While PCM is specifically designed for multi-step refinement, it achieves even superior or comparable 1-step generation results to previously state-of-the-art specifically designed 1-step methods. Furthermore, we show that PCM's methodology is versatile and applicable to video generation, enabling us to train the state-of-the-art few-step text-to-video generator.

AI & ML interests

Team members 10

SIGGRAPHAsia2022's activity