Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

NTU Miulab

university
https://www.csie.ntu.edu.tw/~miulab/
MiuLab
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

wlchen  authored a paper about 2 months ago
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning
yentinglin  authored a paper 6 months ago
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
morrischang  updated a model 7 months ago
miulab/SalesBot2_CoT_lora_w_neg_wo_dup_chitchat_e10
View all activity

Yen-Ting Lin's profile picture Chang-Sheng Kao's profile picture Wei-Lin Chen's profile picture Chen Kang-Chieh's profile picture Foo Jia Yin's profile picture SI-JIA CHENG's profile picture Po-Heng, Chen's profile picture Wen Yu Chang's profile picture Tzu-Han Lin's profile picture

miulab 's collections 1

DogeRM
Models trained/used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging ( https://arxiv.org/abs/2407.01470)
  • miulab/llama2-7b-oss-instruct

    Text Generation • 7B • Updated Oct 3, 2024 • 4
  • miulab/llama2-7b-alpaca-sft-10k

    Text Generation • 7B • Updated Oct 3, 2024 • 5
  • miulab/llama2-7b-magicoder-evol-instruct

    Text Generation • 7B • Updated Oct 3, 2024 • 4
  • miulab/llama2-7b-ultrafeedback-rm

    Text Classification • 7B • Updated Oct 3, 2024 • 250
DogeRM
Models trained/used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging ( https://arxiv.org/abs/2407.01470)
  • miulab/llama2-7b-oss-instruct

    Text Generation • 7B • Updated Oct 3, 2024 • 4
  • miulab/llama2-7b-alpaca-sft-10k

    Text Generation • 7B • Updated Oct 3, 2024 • 5
  • miulab/llama2-7b-magicoder-evol-instruct

    Text Generation • 7B • Updated Oct 3, 2024 • 4
  • miulab/llama2-7b-ultrafeedback-rm

    Text Classification • 7B • Updated Oct 3, 2024 • 250
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs