Alfaxad Eyembe

Alfaxad

AI & ML interests

AI, Robotics

Recent Activity

Organizations

None yet

Alfaxad's activity

upvoted an article 24 days ago
view article
Article

I trained a Language Model to schedule events with GRPO!

By anakin87 β€’
β€’ 76
upvoted an article 26 days ago
view article
Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

By andito and 2 others β€’
β€’ 180
upvoted an article 27 days ago
view article
Article

Everything You Need to Know about Knowledge Distillation

By Kseniase and 1 other β€’
β€’ 26
upvoted an article about 1 month ago
view article
Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By ariG23498 and 3 others β€’
β€’ 427
upvoted an article about 1 month ago
view article
Article

SmolLM - blazingly fast and remarkably powerful

By loubnabnl and 2 others β€’
β€’ 375
New activity in yuntian-deng/ChatGPT about 1 month ago

Update app.py

#21 opened about 1 month ago by
Alfaxad
upvoted an article about 1 month ago
reacted to AdinaY's post with πŸ”₯ about 1 month ago
view post
Post
5127
Kimi-Audio πŸš€πŸŽ§ an OPEN audio foundation model released by Moonshot AI
moonshotai/Kimi-Audio-7B-Instruct
✨ 7B
✨ 13M+ hours of pretraining data
✨ Novel hybrid input architecture
✨ Universal audio capabilities (ASR, AQA, AAC, SER, SEC/ASC, end-to-end conversation)
reacted to etemiz's post with πŸ‘€ about 2 months ago
view post
Post
2187
It looks like Llama 4 team gamed the LMArena benchmarks by making their Maverick model output emojis, longer responses and ultra high enthusiasm! Is that ethical or not? They could certainly do a better job by working with teams like llama.cpp, just like Qwen team did with Qwen 3 before releasing the model.

In 2024 I started playing with LLMs just before the release of Llama 3. I think Meta contributed a lot to this field and still contributing. Most LLM fine tuning tools are based on their models and also the inference tool llama.cpp has their name on it. The Llama 4 is fast and maybe not the greatest in real performance but still deserves respect. But my enthusiasm towards Llama models is probably because they rank highest on my AHA Leaderboard:

https://sheet.zoho.com/sheet/open/mz41j09cc640a29ba47729fed784a263c1d08

Looks like they did a worse job compared to Llama 3.1 this time. Llama 3.1 has been on top for a while.

Ranking high on my leaderboard is not correlated to technological progress or parameter size. In fact if LLM training is getting away from human alignment thanks to synthetic datasets or something else (?), it could be easily inversely correlated to technological progress. It seems there is a correlation regarding the location of the builders (in the West or East). Western models are ranking higher. This has become more visible as the leaderboard progressed, in the past there was less correlation. And Europeans seem to be in the middle!

Whether you like positive vibes from AI or not, maybe the times are getting closer where humans may be susceptible to being gamed by an AI? What do you think?
Β·
reacted to BrigitteTousi's post with πŸš€ 2 months ago
view post
Post
3171
AI agents are transforming how we interact with technology, but how sustainable are they? 🌍

Design choices β€” like model size and structure β€” can massively impact energy use and cost. βš‘πŸ’° The key takeaway: smaller, task-specific models can be far more efficient than large, general-purpose ones.

πŸ”‘ Open-source models offer greater transparency, allowing us to track energy consumption and make more informed decisions on deployment. 🌱 Open-source = more efficient, eco-friendly, and accountable AI.

Read our latest, led by @sasha with assists from myself + @yjernite πŸ€—
https://huggingface.co/blog/sasha/ai-agent-sustainability
  • 1 reply
Β·
reacted to Jaward's post with πŸš€πŸ”₯ 3 months ago
view post
Post
1939
This is the most exciting of this week’s release for me: Gemini Robotics - A SOTA generalist Vision-Language-Action model that brings intelligence to the physical world. It comes with a verifiable real-world knowledge Embodied Reasoning QA benchmark. Cool part is that the model can be specialized with fast adaptation to new tasks and have such adaptations transferred to new robot embodiment like humanoids. Looking forward to the model and data on hf, it’s about time I go full physical:)
Technical Report: https://storage.googleapis.com/deepmind-media/gemini-robotics/gemini_robotics_report.pdf
upvoted an article 3 months ago
view article
Article

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

By medmekk and 1 other β€’
β€’ 60