1 20 7

Dhruv PRO

dhruv3006

AI & ML interests

None yet

Recent Activity

reacted to Jaward's post with 🚀 6 days ago

Awesome intro to LLM course "Language Modeling from Scratch" by stanford. love the aesthetics behind the lecture notes, notes-in-code genius idea👍 Course site: https://stanford-cs336.github.io/spring2025/ Repo: https://github.com/stanford-cs336/spring2025-lectures Videos: https://www.youtube.com/playlist?list=PLoROMvodv4rOY23Y0BoGoBGgQ1zmU_MT_

posted an update 6 days ago

WebBench: A real-world benchmark for Browser Agents WebBench is an open, task-oriented benchmark designed to measure how effectively browser agents handle complex, realistic web workflows. It includes 2,454 tasks across 452 live websites selected from the global top-1000 by traffic. Github : https://github.com/Halluminate/WebBench

reacted to their post with 👀 11 days ago

Introducing Windows Sandbox support - run computer-use agents on Windows business apps without VMs or cloud costs. Your enterprise software runs on Windows, but testing agents required expensive cloud instances. Windows Sandbox changes this - it's Microsoft's built-in lightweight virtualization sitting on every Windows 10/11 machine, ready for instant agent development. Enterprise customers kept asking for AutoCAD automation, SAP integration, and legacy Windows software support. Traditional VM testing was slow and resource-heavy. Windows Sandbox solves this with disposable, seconds-to-boot Windows environments for safe agent testing. What you can build: AutoCAD drawing automation, SAP workflow processing, Bloomberg terminal trading bots, manufacturing execution system integration, or any Windows-only enterprise software automation - all tested safely in disposable sandbox environments. Free with Windows 10/11, boots in seconds, completely disposable. Perfect for development and testing before deploying to Windows cloud instances (coming later this month). Check out the github here : https://github.com/trycua/cua

View all activity

Organizations

Posts 11

Post

219

WebBench: A real-world benchmark for Browser Agents

WebBench is an open, task-oriented benchmark designed to measure how effectively browser agents handle complex, realistic web workflows. It includes 2,454 tasks across 452 live websites selected from the global top-1000 by traffic.

Github : https://github.com/Halluminate/WebBench

View all Posts

Articles 1

Article

The Era of Local Computer-Use AI Agents is here.

View all Articles

models 0

None public yet

datasets 0

None public yet