chenguangwang
cgraywang
AI & ML interests
NLP and machine learning
Recent Activity
upvoted a paper 4 days ago
RepIt: Representing Isolated Targets to Steer Language Models
upvoted a paper 4 days ago
SteeringControl: Holistic Evaluation of Alignment Steering in LLMs
upvoted a collection 6 days ago
Verification