Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nbagent
Activity Feed
Follow
4
AI & ML interests
None defined yet.
Recent Activity
YaoTang23
authored
a paper
25 days ago
QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
YaoTang23
authored
a paper
25 days ago
Reinforcement Pre-Training
sxyao
authored
a paper
3 months ago
Kimi-VL Technical Report
View all activity
Team members
4
models
11
Sort: Recently updated
nbagent/qagent-yxc-Llama-2-7b-chat-hf-alfworld-sft
7B
•
Updated
Jan 30
•
16
nbagent/llama-3.2-1B-Instruct-webshop-sft
1B
•
Updated
Jan 4
•
6
nbagent/llama-3.2-1B-Instruct-alfworld-sft
1B
•
Updated
Jan 4
•
31
nbagent/llama-3.2-1B-Instruct-sciworld-sft
1B
•
Updated
Jan 4
•
13
nbagent/sciworld-qnet
Updated
Sep 29, 2024
nbagent/alfworld-sft
7B
•
Updated
Sep 29, 2024
•
5
nbagent/sciworld-sft
7B
•
Updated
Sep 29, 2024
•
7
nbagent/alfworld-qnet
Updated
Sep 29, 2024
nbagent/webshop_dpo_ckpt_fromselftrain_e1_1e-7-0.5
7B
•
Updated
Sep 24, 2024
•
24
nbagent/webshop_dpo_ckpt_fromselftrain_e1
7B
•
Updated
Sep 24, 2024
•
33
View 11 models
datasets
0
None public yet