AI & ML interests

None defined yet.

AtAndDevย 
posted an update 4 days ago
view post
Post
190
Qwen 3 Coder is a personal attack to k2, and I love it.
It achieves near SOTA on LCB while not having reasoning.
Finally people are understanding that reasoning isnt necessary for high benches...

Qwen ftw!

DECENTRALIZE DECENTRALIZE DECENTRALIZE
AtAndDevย 
posted an update about 2 months ago
view post
Post
2885
deepseek-ai/DeepSeek-R1-0528

This is the end
  • 1 reply
ยท
AtAndDevย 
posted an update 4 months ago
view post
Post
3120
Llama 4 is out...
ยท
AtAndDevย 
posted an update 4 months ago
view post
Post
4345
There seems to multiple paid apps shared here that are based on models on hf, but some ppl sell their wrappers as "products" and promote them here. For a long time, hf was the best and only platform to do oss model stuff but with the recent AI website builders anyone can create a product (really crappy ones btw) and try to sell it with no contribution to oss stuff. Please dont do this, or try finetuning the models you use...
Sorry for filling yall feed with this bs but yk...
  • 6 replies
ยท
AtAndDevย 
posted an update 5 months ago
view post
Post
1660
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
AtAndDevย 
posted an update 5 months ago
view post
Post
2494
@nroggendorff is that you sama?
  • 2 replies
ยท
AtAndDevย 
posted an update 6 months ago
view post
Post
1942
everywhere i go i see his face
AtAndDevย 
posted an update 6 months ago
view post
Post
577
Deepseek gang on fire fr fr
AtAndDevย 
posted an update 6 months ago
view post
Post
1655
R1 is out! And with a lot of other R1 releated models...
akhaliqย 
posted an update 7 months ago
view post
Post
24523
Google drops Gemini 2.0 Flash Thinking

a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more

now available in anychat, try it out: https://huggingface.co/spaces/akhaliq/anychat
ยท
AtAndDevย 
posted an update 7 months ago
view post
Post
495
@s3nh Hey man check your discord! Got some news.
  • 4 replies
ยท
akhaliqย 
posted an update 8 months ago
akhaliqย 
posted an update 8 months ago
akhaliqย 
posted an update 8 months ago
akhaliqย 
posted an update about 1 year ago
view post
Post
21061
Phased Consistency Model

Phased Consistency Model (2405.18407)

The consistency model (CM) has recently made significant progress in accelerating the generation of diffusion models. However, its application to high-resolution, text-conditioned image generation in the latent space (a.k.a., LCM) remains unsatisfactory. In this paper, we identify three key flaws in the current design of LCM. We investigate the reasons behind these limitations and propose the Phased Consistency Model (PCM), which generalizes the design space and addresses all identified limitations. Our evaluations demonstrate that PCM significantly outperforms LCM across 1--16 step generation settings. While PCM is specifically designed for multi-step refinement, it achieves even superior or comparable 1-step generation results to previously state-of-the-art specifically designed 1-step methods. Furthermore, we show that PCM's methodology is versatile and applicable to video generation, enabling us to train the state-of-the-art few-step text-to-video generator.