Dhruv's picture

Dhruv PRO

dhruv3006

AI & ML interests

None yet

Recent Activity

Organizations

MLX Community's profile picture Hugging Face Discord Community's profile picture

dhruv3006's activity

reacted to their post with โค๏ธ๐Ÿš€ about 4 hours ago
posted an update about 4 hours ago
reacted to evijit's post with ๐Ÿค— 4 days ago
replied to ariG23498's post 4 days ago
reacted to ariG23498's post with ๐Ÿ‘ 4 days ago
view post
Post
1202
๐Ÿšจ Implement KV Cache from scratch in pure PyTorch. ๐Ÿšจ

We have documented all of our learning while implementing KV Cache to nanoVLM. Joint work with @kashif @lusxvr @andito @pcuenq

Blog: hf.co/blog/kv-cache
  • 1 reply
ยท
posted an update 4 days ago
view post
Post
1060
Browser agents use computers the same way humans do, unlocking powerful use cases for personal assistants, browsers, and enterprise workflows.

Includes Manus AI,Athena Intelligence,c/ua,Stagehand .

Here is a definitive guide.

Courtesy of Theta.
reacted to azettl's post with ๐Ÿ‘€๐Ÿค— 4 days ago
view post
Post
1628
Agents & MCP Hackathon Day 1

Before starting with the second night of the Agents & MCP Hackathon, I briefly wanted to share my progress from last night. Not much sleep, but lots of progress!

So I managed to build the first MVP version of my custom Gradio component, https://huggingface.co/spaces/azettl/gradio_consilium_roundtable (https://pypi.org/project/gradio-consilium-roundtable/). This creates a visual roundtable component for AI consensus discussions. Displays AI participants as avatars positioned around an oval table (poker style!) with animated speech bubbles, thinking states, and real-time discussion updates.

Also, I managed to get a rough draft of the Gradio app + MCP server done, but not so much yet that I can share the space with you. You will be able to define your question the AI participants should discuss, decide on the protocol, do role assignments like having a devil's advocate on the table, and define the communication pattern. Lastly, you can decide which AI should be the moderator and how many rounds of discussions there should be. You can see my progress in the attached image.

Most of the options are just placeholders right now, and I will work on their implementation tonight. Hopefully, I can add an MVP tomorrow evening to the following space: Agents-MCP-Hackathon/consilium_mcp.

I am also very interested in the cool stuff you all are building; please let me know in the comments. :)
  • 1 reply
ยท
reacted to awni's post with ๐Ÿค๐Ÿค—๐Ÿค—๐Ÿ‘๐Ÿคฏโค๏ธ 6 days ago
view post
Post
First HF social post:

pip install -U mlx
  • 2 replies
ยท
reacted to their post with โค๏ธ๐Ÿš€๐Ÿ”ฅ 6 days ago
view post
Post
2533
App-Use : Create virtual desktops for AI agents to focus on specific apps.

App-Use lets you scope agents to just the apps they need. Instead of full desktop access, say "only work with Safari and Notes" or "just control iPhone Mirroring" - visual isolation without new processes for perfectly focused automation.

Running computer-use on the entire desktop often causes agent hallucinations and loss of focus when they see irrelevant windows and UI elements. App-Use solves this by creating composited views where agents only see what matters, dramatically improving task completion accuracy

What you can build: Research agents working in Safari while writing agents draft in Notes, iPhone automation for messages and reminders, parallel testing across isolated app sessions, or teams of specialized agents working simultaneously without interference.

Currently macOS-only (Quartz compositing engine).

Read the full guide: https://trycua.com/blog/app-use

Github : https://github.com/trycua/cua
posted an update 6 days ago
view post
Post
2533
App-Use : Create virtual desktops for AI agents to focus on specific apps.

App-Use lets you scope agents to just the apps they need. Instead of full desktop access, say "only work with Safari and Notes" or "just control iPhone Mirroring" - visual isolation without new processes for perfectly focused automation.

Running computer-use on the entire desktop often causes agent hallucinations and loss of focus when they see irrelevant windows and UI elements. App-Use solves this by creating composited views where agents only see what matters, dramatically improving task completion accuracy

What you can build: Research agents working in Safari while writing agents draft in Notes, iPhone automation for messages and reminders, parallel testing across isolated app sessions, or teams of specialized agents working simultaneously without interference.

Currently macOS-only (Quartz compositing engine).

Read the full guide: https://trycua.com/blog/app-use

Github : https://github.com/trycua/cua
reacted to their post with โค๏ธ 9 days ago
view post
Post
2214
C/ua Cloud Containers : Computer Use Agents in the Cloud

First cloud platform built for Computer-Use Agents. Open-source backbone. Linux/Windows/macOS desktops in your browser. Works with OpenAI, Anthropic, or any LLM. Pay only for compute time.

Our beta users have deployed 1000s of agents over the past month. Available now in 3 tiers: Small (1 vCPU/4GB), Medium (2 vCPU/8GB), Large (8 vCPU/32GB). Windows & macOS coming soon.

Github : https://github.com/trycua/cua ( We are open source !)

Cloud Platform : https://www.trycua.com/blog/introducing-cua-cloud-containers