renyulin
AI & ML interests: None yet
renyulin/my-new-shiny-tokenizer
renyulin/llama2-13b-sql-merged • Text Generation • 5
renyulin/gptneo125m-detoxify-ppo • Reinforcement Learning • 2
renyulin/baichuan-7b-sft-merged • Text Generation • 16
renyulin/baichuan-7b-sft-peft
renyulin/gpt2-movie-review-ctrl-ppo • Text Generation • 16
renyulin/gpt-neo-1.3b-es-rlhf-step2500-peft • Reinforcement Learning
renyulin/gpt-neo-1.3B-se-sft • Text Generation • 1
renyulin/llama-7b-es-ppo-adpater • Reinforcement Learning
renyulin/llama-7b-se-rm-merged • Text Classification • 1
renyulin/llama-7b-se-sft-merged • Text Generation • 13
renyulin/gpt-neo-2.7B-qlora-merged • Text Generation • 1
renyulin/gpt-neo-2.7B-qlora-adapter
renyulin/llama7b_es_rm
renyulin/gpt2_es_rm
renyulin/gptneo125M-es-sft-lora8bit
renyulin/gptneo125m-detoxify-ppo-0.05 • Reinforcement Learning • 1
renyulin/opt125m-imdb-ppo • Text Generation • 14
renyulin/opt125m-imdb-sft-lora8bit-adapter-merged • Text Generation • 13
renyulin/opt125m-imdb-sft-lora8bit
renyulin/clm_finetune_peft_imdb_opt_125m_qlora
renyulin/finetune_fp4_opt_bnb_peft_opt-350m-lora
renyulin/roberta-large-peft-lora
renyulin/Reinforce-CartPole-v1 • Reinforcement Learning
renyulin/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • 2
renyulin/q-Taxi-v3 • Reinforcement Learning
renyulin/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning
renyulin/ppo-LunarLander-v2 • Reinforcement Learning • 1
renyulin/distilbert-base-uncased-finetuned-imdb • Fill-Mask • 2
renyulin/bert-finetuned-ner