Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Mateusz Dziemian's picture
52 8 41

Mateusz Dziemian

mattmdjaga
mrshabangunhlanhla's profile picture alirezakatani's profile picture alex4gg's profile picture
ยท
  • mattmdjaga
  • mattmdjaga

AI & ML interests

Interested in AI safety.

Recent Activity

authored a paper 21 days ago
Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition
authored a paper 21 days ago
Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems
upvoted a paper 21 days ago
Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition
View all activity

Organizations

Hugging Face for Computer Vision's profile picture Sure Here, Marv's profile picture Social Post Explorers's profile picture Hugging Face Discord Community's profile picture

authored 2 papers 21 days ago

Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition

Paper โ€ข 2507.20526 โ€ข Published Jul 28 โ€ข 1

Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems

Paper โ€ข 2504.07831 โ€ข Published Apr 10
authored 2 papers 10 months ago

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Paper โ€ข 2410.09024 โ€ข Published Oct 11, 2024 โ€ข 1

Applying Refusal-Vector Ablation to Llama 3.1 70B Agents

Paper โ€ข 2410.10871 โ€ข Published Oct 8, 2024 โ€ข 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs