Tom Goldstein's Lab at University of Maryland, College Park

university

http://www.cs.umd.edu/~tomg/

tomgoldsteincs

Activity Feed

AI & ML interests

AI security & privacy, algorithmic bias, foundations of ML

Recent Activity

montehoover updated a dataset about 18 hours ago

tomg-group-umd/DynaBench

kaiyuyue new activity 5 days ago

tomg-group-umd/pixelprose:Can you provide the id mapping from pixelprose-cc12m-annos to the original cc12m-wds?

montehoover updated a Space 5 days ago

tomg-group-umd/DynaGuard

View all activity

tomg-group-umd 's collections 12

DynaGuard

https://arxiv.org/abs/2509.02563

tomg-group-umd/DynaGuard-8B

Text Generation • 8B • Updated 6 days ago • 273 • 11
tomg-group-umd/DynaGuard-4B

Text Generation • 4B • Updated 6 days ago • 38 • 2
tomg-group-umd/DynaGuard-1.7B

Text Generation • 2B • Updated 6 days ago • 89 • 2
tomg-group-umd/DynaBench

Viewer • Updated about 18 hours ago • 140k • 70 • 2

FictionalQA

tomg-group-umd/fictionalqa

Viewer • Updated Jun 9 • 31.7k • 93 • 2
tomg-group-umd/fictionalqa_training_splits

Viewer • Updated Jun 9 • 107k • 111
tomg-group-umd/fictionalqa_reformatted_triviaqa

Viewer • Updated Jun 9 • 16.4k • 130

Gemstone Models

Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80.

tomg-group-umd/Gemstone-768x45

Text Generation • 0.5B • Updated Feb 9 • 5
tomg-group-umd/Gemstone-1280x15

Text Generation • 0.5B • Updated Feb 6 • 6
tomg-group-umd/Gemstone-512x13

Text Generation • 0.1B • Updated Feb 6 • 5
tomg-group-umd/Gemstone-1536x50

Text Generation • 2B • Updated Feb 7 • 271

Style Descriptors

How to extract style from images? Model, dataset, and the paper

Measuring Style Similarity in Diffusion Models

Paper • 2404.01292 • Published Apr 1, 2024 • 17
tomg-group-umd/CSD-ViT-L

Image Feature Extraction • Updated Sep 4, 2024 • 27 • 4
tomg-group-umd/ContraStyles

Viewer • Updated Jul 31, 2024 • 498k • 69 • 4

CLRS-Text

Hugging Face collection for all things CLRS-Text

The CLRS-Text Algorithmic Reasoning Language Benchmark

Paper • 2406.04229 • Published Jun 6, 2024 • 4
tomg-group-umd/CLRS-Text-train

Viewer • Updated Jul 14, 2024 • 2.15M • 36 • 2
tomg-group-umd/CLRS-Text-test

Viewer • Updated Jul 10, 2024 • 503k • 427

Goldfish Loss: Mitigating Memorization in LLMs

This collection contains artifacts from our paper titled: "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs."

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Paper • 2406.10209 • Published Jun 14, 2024 • 8
tomg-group-umd/3-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 6
tomg-group-umd/4-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 5
tomg-group-umd/8-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 5

Refusal Token Models

This collection contains models described in the refusal token paper published in COLM 2025.

tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast

8B • Updated Jul 22 • 22
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-multiple-tokens

8B • Updated Jul 22 • 2.47k
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-single-token

8B • Updated Jul 22 • 26 • 1
tomg-group-umd/zephyr-llama3-8b-sft-no-refusal-messages

8B • Updated Jul 22 • 13

LoRI Adapters

LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B.

tomg-group-umd/LoRI-S_safety_mistral7b_rank_64

Text Generation • Updated Apr 14 • 5 • 1
tomg-group-umd/LoRI-S_safety_mistral7b_rank_32

Text Generation • Updated Apr 14 • 6
tomg-group-umd/LoRI-S_safety_llama3_rank_64

Text Generation • Updated 28 days ago • 11
tomg-group-umd/LoRI-S_safety_llama3_rank_32

Text Generation • Updated Apr 14 • 4

Recurrent Models

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space.

tomg-group-umd/huginn-0125

Text Generation • 4B • Updated Jul 29 • 3.44k • 283
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 150
tomg-group-umd/huginn_swa_100_10_avg_0.9_merge

Text Generation • 4B • Updated Jul 17 • 3
tomg-group-umd/step-00010752-recurrence_full_512_0

Text Generation • 4B • Updated Jul 17 • 3

GenQA

tomg-group-umd/GenQA

Viewer • Updated Jun 21, 2024 • 11.1M • 1.22k • 54
tomg-group-umd/GenQA_raw

Viewer • Updated Jun 13, 2024 • 11.1M • 274
tomg-group-umd/GenQA_rebalanced

Viewer • Updated Jun 13, 2024 • 6.47M • 1 • 3
tomg-group-umd/GenQA-Subset-llama-3

Text Generation • 8B • Updated Jun 17, 2024 • 5

PixelProse

From Pixels to Prose: A Large Dataset of Dense Image Captions

Paper • 2406.10328 • Published Jun 14, 2024 • 18
tomg-group-umd/pixelprose

Viewer • Updated Jun 23, 2024 • 15.6M • 1.3k • 158
tomg-group-umd/pixelprose-jsons

Preview • Updated Jul 3 • 28

Zero-Shot Grafting

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Paper • 2505.22664 • Published May 28 • 7
tomg-group-umd/zero-model-checkpoints

Image-Text-to-Text • Updated Aug 5 • 2