Wei Shen
Swtheking
AI & ML interests
None yet
Recent Activity
commented on a paper about 2 months ago
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation
authored a paper about 2 months ago
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning
upvoted a paper about 2 months ago
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning