zhuqihao's picture

18 7

zhuqihao

zqh11

·

AI & ML interests

None yet

Recent Activity

updated a Space 14 days ago

zqh11/loveluer

published a Space 14 days ago

zqh11/loveluer

updated a Space 14 days ago

zqh11/zqh111

View all activity

Organizations

zqh11's activity

updated a Space 14 days ago

loveluer

published a Space 14 days ago

loveluer

updated a Space 14 days ago

zqh111

published a Space 14 days ago

zqh111

authored a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 381

liked a model 3 months ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 20 days ago • 6.7k • 897

authored a paper 8 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 58

authored a paper 10 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 64

authored a paper 11 months ago

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 41

updated a model about 1 year ago

deepseek-ai/deepseek-coder-33b-instruct

Text Generation • Updated Mar 7, 2024 • 7.93k • 507

New activity in deepseek-ai/deepseek-coder-33b-instruct about 1 year ago

Adding `safetensors` variant of this model

#24 opened about 1 year ago by

authored a paper about 1 year ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 115

updated a model about 1 year ago

deepseek-ai/deepseek-math-7b-instruct

Text Generation • Updated Feb 6, 2024 • 75.3k • 123

New activity in deepseek-ai/deepseek-coder-6.7b-instruct about 1 year ago

inference_params

#12 opened over 1 year ago by

updated a model about 1 year ago

deepseek-ai/deepseek-coder-7b-instruct-v1.5

Text Generation • Updated Feb 5, 2024 • 6.42k • 132

New activity in deepseek-ai/deepseek-coder-33b-instruct about 1 year ago

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 196.00 MiB. GPU 0 has a total capacty of 79.11 GiB of which 29.56 MiB is free

#21 opened about 1 year ago by

Set global data for future chats

#17 opened over 1 year ago by

[AUTOMATED] Model Memory Requirements

#18 opened over 1 year ago by

model-sizer-bot

Fine tune the model with part of layers on GPU and rest on CPU

#11 opened over 1 year ago by

New activity in deepseek-ai/deepseek-coder-7b-base-v1.5 about 1 year ago

Update to deepseek-coder-7b-base-v1.5 in code

#1 opened about 1 year ago by