What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
updated
a dataset
3 days ago
JackBAI/eval_data
published
a dataset
3 days ago
JackBAI/eval_data
liked
a model
about 1 month ago
google/gemma-3-27b-it
Organizations
Collections
2
models
18

JackBAI/aitw-general-digiq-agent
Updated

JackBAI/aitw-webshop-digiq-agent
Updated

JackBAI/llava-v1.5-7b-sfted-pad-inputtext
Updated

JackBAI/CRATE-GPT-12L-Pile-600000steps
Updated

JackBAI/webshop-off2on-filteredbc
Updated

JackBAI/general-off2on-filteredbc
Updated

JackBAI/general-off2on-digirl
Updated

JackBAI/webshop-off2on-digirl
Updated

JackBAI/crate-3l-l0-sae-1x
Updated

JackBAI/crate-1l-l0-sae-1x
Updated
datasets
7
JackBAI/eval_data
Viewer
•
Updated
•
9.64k
•
11
JackBAI/autoui-zeroshot-trajectories
Preview
•
Updated
•
76
JackBAI/pile_uncopyrighted_bin
Updated
•
11
JackBAI/bert_pretrain_datasets
Viewer
•
Updated
•
80.5M
•
2k
•
1
JackBAI/redbajama-sampled
Viewer
•
Updated
•
24.3M
•
6.28k
JackBAI/merged_roberta_dataset
Updated
•
20
JackBAI/chatgpt-woi-finetune
Preview
•
Updated
•
32
•
3