Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis Paper • 2209.08891 • Published Sep 19, 2022 • 2
Revision Transformers: Instructing Language Models to Change their Values Paper • 2210.10332 • Published Oct 19, 2022 • 1
The Stable Artist: Steering Semantics in Diffusion Latent Space Paper • 2212.06013 • Published Dec 12, 2022 • 1
LLavaGuard: VLM-based Safeguards for Vision Dataset Curation and Safety Assessment Paper • 2406.05113 • Published Jun 7, 2024 • 3
SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs Paper • 2411.07122 • Published Nov 11, 2024 • 2
AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons Paper • 2503.05731 • Published Feb 19 • 2
Evaluating LLMs Robustness in Less Resourced Languages with Proxy Models Paper • 2506.07645 • Published Jun 9 • 3
ConECT Dataset: Overcoming Data Scarcity in Context-Aware E-Commerce MT Paper • 2506.04929 • Published Jun 5 • 1
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing Paper • 2206.15076 • Published Jun 30, 2022 • 5
CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews Paper • 2311.12474 • Published Nov 21, 2023 • 1
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models Paper • 2406.02061 • Published Jun 4, 2024 • 2
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 54
Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets Paper • 2506.04598 • Published Jun 5 • 6
Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models Paper • 2503.23714 • Published Mar 31 • 1
Mitigating Inappropriateness in Image Generation: Can there be Value in Reflecting the World's Ugliness? Paper • 2305.18398 • Published May 28, 2023 • 2
Interactively Providing Explanations for Transformer Language Models Paper • 2110.02058 • Published Sep 2, 2021 • 1
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You Paper • 2401.16092 • Published Jan 29, 2024 • 1
A Typology for Exploring the Mitigation of Shortcut Behavior Paper • 2203.03668 • Published Mar 4, 2022 • 1
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation Paper • 2305.15296 • Published May 24, 2023 • 1