Mateusz Dziemian

mattmdjaga

AI & ML interests

Interested in AI safety.

Recent Activity

new activity 19 days ago
mattmdjaga/segformer_b2_clothes:demo
new activity about 1 month ago
mattmdjaga/segformer_b2_clothes:License clarification
updated a model about 1 month ago
mattmdjaga/segformer_b2_clothes

Organizations

  • Hugging Face for Computer Vision
  • Sure Here, Marv
  • Social Post Explorers
  • Hugging Face Discord Community

authored 2 papers 2 months ago

Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition

Paper • 2507.20526 • Published Jul 28 • 1

Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems

Paper • 2504.07831 • Published Apr 10
authored 2 papers about 1 year ago

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Paper • 2410.09024 • Published Oct 11, 2024 • 1

Applying Refusal-Vector Ablation to Llama 3.1 70B Agents

Paper • 2410.10871 • Published Oct 8, 2024 • 1