Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nbagent

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

YaoTang23  authored a paper 25 days ago
QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
YaoTang23  authored a paper 25 days ago
Reinforcement Pre-Training
sxyao  authored a paper 3 months ago
Kimi-VL Technical Report
View all activity

lin's profile picture Yao Tang's profile picture Yao Tang's profile picture Xingcheng Yao's profile picture

models 11

nbagent/qagent-yxc-Llama-2-7b-chat-hf-alfworld-sft

7B • Updated Jan 30 • 16

nbagent/llama-3.2-1B-Instruct-webshop-sft

1B • Updated Jan 4 • 6

nbagent/llama-3.2-1B-Instruct-alfworld-sft

1B • Updated Jan 4 • 31

nbagent/llama-3.2-1B-Instruct-sciworld-sft

1B • Updated Jan 4 • 13

nbagent/sciworld-qnet

Updated Sep 29, 2024

nbagent/alfworld-sft

7B • Updated Sep 29, 2024 • 5

nbagent/sciworld-sft

7B • Updated Sep 29, 2024 • 7

nbagent/alfworld-qnet

Updated Sep 29, 2024

nbagent/webshop_dpo_ckpt_fromselftrain_e1_1e-7-0.5

7B • Updated Sep 24, 2024 • 24

nbagent/webshop_dpo_ckpt_fromselftrain_e1

7B • Updated Sep 24, 2024 • 33
View 11 models

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs