yl's picture

1

yl

renyulin

·

AI & ML interests

None yet

Organizations

renyulin 's models 32

renyulin/my-new-shiny-tokenizer

renyulin/llama2-13b-sql-merged

Text Generation • Updated Sep 16, 2023 • 5

renyulin/gptneo125m-detoxify-ppo

Reinforcement Learning • Updated Aug 17, 2023 • 2

renyulin/baichuan-7b-sft-merged

Text Generation • Updated Aug 15, 2023 • 16

renyulin/baichuan-7b-sft-peft

Updated Aug 15, 2023 • 1

renyulin/gpt2-movie-review-ctrl-ppo

Text Generation • Updated Aug 15, 2023 • 16

renyulin/gpt-neo-1.3b-es-rlhf-step2500-peft

Reinforcement Learning • Updated Jul 3, 2023

renyulin/gpt-neo-1.3B-se-sft

Text Generation • Updated Jul 3, 2023 • 1

renyulin/llama-7b-es-ppo-adpater

Reinforcement Learning • Updated Jul 3, 2023

renyulin/llama-7b-se-rm-merged

Text Classification • Updated Jul 2, 2023 • 1

renyulin/llama-7b-se-sft-merged

Text Generation • Updated Jul 2, 2023 • 13

renyulin/gpt-neo-2.7B-qlora-merged

Text Generation • Updated Jul 1, 2023 • 1

renyulin/gpt-neo-2.7B-qlora-adapter

Updated Jul 1, 2023 • 1

renyulin/llama7b_es_rm

Updated Jun 30, 2023 • 1

renyulin/gpt2_es_rm

Updated Jun 27, 2023 • 1

renyulin/gptneo125M-es-sft-lora8bit

Updated Jun 27, 2023 • 1

renyulin/gptneo125m-detoxify-ppo-0.05

Reinforcement Learning • Updated Jun 26, 2023 • 1

renyulin/opt125m-imdb-ppo

Text Generation • Updated Jun 25, 2023 • 14

renyulin/opt125m-imdb-sft-lora8bit-adapter-merged

Text Generation • Updated Jun 25, 2023 • 13

renyulin/opt125m-imdb-sft-lora8bit

Updated Jun 25, 2023 • 1

renyulin/clm_finetune_peft_imdb_opt_125m_qlora

Updated Jun 21, 2023 • 1

renyulin/finetune_fp4_opt_bnb_peft_opt-350m-lora

Updated Jun 19, 2023 • 1

renyulin/roberta-large-peft-lora

Updated Jun 18, 2023

renyulin/Reinforce-CartPole-v1

Reinforcement Learning • Updated Jun 13, 2023

renyulin/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Jun 12, 2023 • 2

renyulin/q-Taxi-v3

Reinforcement Learning • Updated Jun 12, 2023

renyulin/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Jun 12, 2023

renyulin/ppo-LunarLander-v2

Reinforcement Learning • Updated Apr 28, 2023 • 1

renyulin/distilbert-base-uncased-finetuned-imdb

Fill-Mask • Updated Mar 10, 2023 • 2

renyulin/bert-finetuned-ner

Updated Mar 9, 2023