SebastianS
AI & ML interests
Everything.
Organizations
SebastianS/llama-7-chat-instruction-int4-fc-op_glaive-sft
Updated
SebastianS/llama-7-chat-instruction-int4-fc-op_glaive-sft_test
Updated
SebastianS/llama-7-chat-instruction-int4-glaive-fc-testing
Updated
SebastianS/llama-7-chat-instruction-int4-glaive-fc-sft
Updated
SebastianS/llama-7-chat-instruction-int4-fc-dpo-_5_beta
Updated
SebastianS/llama-7-chat-instruction-int4-fc-dpo-_9_beta
Updated
SebastianS/llama-7-chat-instruction-int4-fc-dpo-_1_beta
Updated
SebastianS/test-llama-7-chat-instruction-int4-fc-dpo
Updated
SebastianS/llama-7-chat-instruction-int4-fc-dpo
Updated
SebastianS/llama-7-chat-instruction-int4-fc-sft_fix-dpo
Updated
SebastianS/llama-7-chat-instruction-int4-fc-sft_fix
Updated
SebastianS/llama-7-chat-instruction-int4-fc-sft
Updated
SebastianS/llama-7-chat-instruction-int4-fc-pipeline
Updated
SebastianS/function_calling-llama_7b-nat-fc_only
Updated
SebastianS/function_calling-llama_7b
Updated
SebastianS/ppo-LunarLander-v2_v2
Reinforcement Learning
•
Updated
•
4
SebastianS/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
SebastianS/LunarLander-v2
Reinforcement Learning
•
Updated
SebastianS/poca-SoccerTwos-v2
Reinforcement Learning
•
Updated
•
37
SebastianS/poca-SoccerTwos
Updated
SebastianS/poca-SoccerTwos_light
Reinforcement Learning
•
Updated
•
10
SebastianS/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated
•
4
SebastianS/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
•
4
SebastianS/ppo-PyramidsRND-1
Reinforcement Learning
•
Updated
•
18
SebastianS/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
30
SebastianS/Reinforce-Pixelcopter-PLE-v0-3
Reinforcement Learning
•
Updated
SebastianS/Reinforce-Pixelcopter-PLE-v0-2
Reinforcement Learning
•
Updated
SebastianS/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
SebastianS/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
SebastianS/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
10