DeepSeek Trading Assistant

This is a fine-tuned version of DeepSeek-R1-Distill-Qwen-32B specialized for generating trading strategies and market analysis.

Model Details

Model Description

Uses

Direct Use

This model is designed to:

  1. Analyze market conditions using technical indicators
  2. Generate trading strategies based on market analysis
  3. Implement risk management rules
  4. Create Python code for strategy implementation

Training Data

The model is trained on a custom dataset containing:

  • Market analysis using technical indicators (RSI, MACD, Moving Averages)
  • Trading strategy implementations
  • Risk management rules
  • Python code examples using QuantConnect framework

Training Procedure

Training Hyperparameters

  • Number of epochs: 3
  • Batch size: 2
  • Learning rate: 1e-5
  • Gradient accumulation steps: 8
  • Warmup steps: 100
  • Training regime: fp16 mixed precision with gradient checkpointing
  • Temperature: 0.6 (recommended for DeepSeek-R1 series)

Technical Specifications

Compute Infrastructure

  • Required Hardware: 2x NVIDIA A10G GPUs or 1x A100 GPU
  • Training Time (estimated): 2-4 hours

Model Card Contact

For questions or issues, please open an issue in the repository.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no pipeline_tag.