Update README.md
README.md
CHANGED
@@ -22,7 +22,6 @@ metrics:
 ## Training Results
 - **Training Samples**: 19,480 (SQL analytics + technical conversations)
 - **Hardware**: NVIDIA 4x A10G GPU (96GB VRAM)
-- **Framework**: Unsloth optimization (2x speedup)
 
 ## Model Description
 This is a high-performance fine-tuned version of **Meta-Llama-3.1-8B-Instruct**, specifically optimized for:
@@ -86,8 +85,6 @@ print(response)
 | **Final Loss** | 0.0508 |
 | **Loss Reduction** | **96.7%** |
 | **Training Time** | 8.9 hours |
-| **GPU Utilization** | ~90% (A10G) |
-| **Memory Usage** | 18-22GB VRAM |
 
 ## 🎯 Use Cases
 - **SQL Generation**: Create complex queries from natural language
@@ -113,9 +110,7 @@ The model was trained on a curated dataset of **19,480 high-quality examples** i
 This model inherits the **Llama 3.1 license** from the base model. Please review the [official license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE) for usage terms.
 
 ## Acknowledgments
-- Built with [Unsloth](https://github.com/unslothai/unsloth) for optimized training
 - Based on Meta's Llama 3.1 8B Instruct model
-- Trained on NVIDIA A10G GPU infrastructure
 
 ## Model Card Contact
 For questions about this model, please open an issue in the repository or contact the model author.
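
The figures that remain in the README after this change can be cross-checked with a little arithmetic. The sketch below is only a sanity check, not part of the commit. It assumes that "Loss Reduction" means `(initial - final) / initial` (the README does not define it) and that each A10G carries 24 GB of VRAM. The variable names are illustrative.

```python
# Values quoted in the README's metrics table and hardware line.
final_loss = 0.0508
loss_reduction = 0.967  # 96.7%
gpu_count = 4

# Assumption: reduction = (initial - final) / initial,
# which rearranges to initial = final / (1 - reduction).
implied_initial_loss = final_loss / (1.0 - loss_reduction)

# Assumption: 24 GB of VRAM per A10G.
vram_per_gpu_gb = 24
total_vram_gb = gpu_count * vram_per_gpu_gb

print(f"implied initial loss: {implied_initial_loss:.2f}")
print(f"total VRAM across GPUs: {total_vram_gb} GB")
```

Under those assumptions the implied starting loss is about 1.54, and the total of 96 GB agrees with the "4x A10G (96GB VRAM)" line kept in the README.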