AI & ML interests

Enhance and upgrade SD-models

Recent Activity

NymboΒ 
posted an update 10 days ago
view post
Post
1595
Anyone know how to reset Claude web's MCP config? I connected mine when the HF MCP first released with just the default example spaces added. I added lots of other MCP spaces but Claude.ai doesn't update the available tools... "Disconnecting" the HF integration does nothing, deleting it and adding it again does nothing.

Refreshing tools works fine in VS Code because I can manually restart it in mcp.json, but claude.ai has no such option. Anyone got any ideas?
Β·
KingNishΒ 
posted an update 29 days ago
view post
Post
777
What's currently the biggest gap in Open Source Datasets ??
Β·
AtAndDevΒ 
posted an update about 1 month ago
view post
Post
2843
deepseek-ai/DeepSeek-R1-0528

This is the end
  • 1 reply
Β·
NymboΒ 
posted an update 2 months ago
view post
Post
3210
Haven't seen this posted anywhere - Llama-3.3-8B-Instruct is available on the new Llama API. Is this a new model or did someone mislabel Llama-3.1-8B?
  • 1 reply
Β·
NymboΒ 
posted an update 2 months ago
view post
Post
2725
PSA for anyone using Nymbo/Nymbo_Theme or Nymbo/Nymbo_Theme_5 in a Gradio space ~

Both of these themes have been updated to fix some of the long-standing inconsistencies ever since the transition to Gradio v5. Textboxes are no longer bright green and in-line code is readable now! Both themes are now visually identical across versions.

If your space is already using one of these themes, you just need to restart your space to get the latest version. No code changes needed.
AtAndDevΒ 
posted an update 3 months ago
view post
Post
3096
Llama 4 is out...
Β·
AtAndDevΒ 
posted an update 4 months ago
view post
Post
4341
There seems to multiple paid apps shared here that are based on models on hf, but some ppl sell their wrappers as "products" and promote them here. For a long time, hf was the best and only platform to do oss model stuff but with the recent AI website builders anyone can create a product (really crappy ones btw) and try to sell it with no contribution to oss stuff. Please dont do this, or try finetuning the models you use...
Sorry for filling yall feed with this bs but yk...
  • 6 replies
Β·
AtAndDevΒ 
posted an update 4 months ago
view post
Post
1659
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
not-lainΒ 
posted an update 4 months ago
ehristoforuΒ 
posted an update 5 months ago
view post
Post
3500
Introducing our first standalone model – FluentlyLM Prinum

Introducing the first standalone model from Project Fluently LM! We worked on it for several months, used different approaches and eventually found the optimal one.

General characteristics:
- Model type: Causal language models (QwenForCausalLM, LM Transformer)
- Number of parameters: 32.5B
- Number of parameters (not embedded): 31.0B
- Number of layers: 64
- Context: 131,072 tokens
- Language(s) (NLP): English, French, Spanish, Russian, Chinese, Japanese, Persian (officially supported)
- License: MIT

Creation strategy:
The basis of the strategy is shown in Pic. 2.
We used Axolotl & Unsloth for SFT-finetuning with PEFT LoRA (rank=64, alpha=64) and Mergekit for SLERP and TIES mergers.

Evolution:
πŸ† 12th place in the Open LLM Leaderboard ( open-llm-leaderboard/open_llm_leaderboard) (21.02.2025)

Detailed results and comparisons are presented in Pic. 3.

Links:
- Model: fluently-lm/FluentlyLM-Prinum
- GGUF version: mradermacher/FluentlyLM-Prinum-GGUF
- Demo on ZeroGPU: ehristoforu/FluentlyLM-Prinum-demo
  • 7 replies
Β·
AtAndDevΒ 
posted an update 5 months ago
view post
Post
2493
@nroggendorff is that you sama?
  • 2 replies
Β·
ameerazam08Β 
posted an update 5 months ago
not-lainΒ 
posted an update 5 months ago
AtAndDevΒ 
posted an update 5 months ago
view post
Post
1941
everywhere i go i see his face
AtAndDevΒ 
posted an update 6 months ago
view post
Post
576
Deepseek gang on fire fr fr
AtAndDevΒ 
posted an update 6 months ago
view post
Post
1654
R1 is out! And with a lot of other R1 releated models...