saishshinde15 committed
Commit 9492a8c · verified · 1 Parent(s): 0e85df1

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -1,6 +1,6 @@
  ---
  base_model:
- - saishshinde15/TethysAI_Base_Reasoning
+ - saishshinde15/Clyrai_Base_Reasoning
  tags:
  - text-generation-inference
  - transformers
@@ -19,12 +19,12 @@ pipeline_tag: text-generation

  - **Developed by:** clyrai
  - **License:** apache-2.0
- - **Fine-tuned from:** [saishshinde15/TBH.AI_Base_Reasoning](https://huggingface.co/saishshinde15/TethysAI_Base_Reasoning)
+ - **Fine-tuned from:** [saishshinde15/Clyrai_Base_Reasoning](https://huggingface.co/saishshinde15/TethysAI_Base_Reasoning)
  - **Category:** Experimental, Research

  ## **Introduction**

- TethysAI Vortex Reasoning is an **experimental model** that advances the structured reasoning capabilities pioneered by [Clyrai Base Reasoning](https://huggingface.co/saishshinde15/TethysAI_Base_Reasoning). While the Base Reasoning model utilized **Generalized Reinforced Policy Optimization (GRPO)** to enhance step-by-step logical thought processes similar to **DeepSeek-R1**, this model takes a different approach—**eliminating GRPO and instead relying on high-end Supervised Fine-Tuning (SFT) techniques**.
+ TethysAI Vortex Reasoning is an **experimental model** that advances the structured reasoning capabilities pioneered by [Clyrai_Base Reasoning](https://huggingface.co/saishshinde15/TethysAI_Base_Reasoning). While the Base Reasoning model utilized **Generalized Reinforced Policy Optimization (GRPO)** to enhance step-by-step logical thought processes similar to **DeepSeek-R1**, this model takes a different approach—**eliminating GRPO and instead relying on high-end Supervised Fine-Tuning (SFT) techniques**.

  The core objective was to investigate whether **deep reasoning and self-questioning behavior could emerge purely through SFT on high-quality datasets**. The results were highly promising: the model successfully **questions itself internally**, improves reasoning depth, and consistently generates structured, logical responses.
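
The introduction in this README attributes the model's reasoning behaviour purely to supervised fine-tuning rather than GRPO. As a rough illustration of that setup, here is a minimal SFT sketch using the TRL library, assuming a recent TRL release; the dataset file, output directory, and use of the base model as the starting checkpoint are placeholders, since the commit does not disclose the actual training data or pipeline.

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Hypothetical local file with a "text" column containing full prompt + response
# strings; the actual high-quality SFT data is not named in this commit.
dataset = load_dataset("json", data_files="reasoning_sft.jsonl", split="train")

trainer = SFTTrainer(
    model="saishshinde15/Clyrai_Base_Reasoning",  # base model listed in the updated front matter
    train_dataset=dataset,
    args=SFTConfig(output_dir="vortex-sft-checkpoints"),
)
trainer.train()
```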
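Since the card's `pipeline_tag` is `text-generation`, the resulting model can be queried with the standard `transformers` generation API. A minimal sketch follows; the repository ID of the Vortex Reasoning model itself does not appear in this diff, so the base-model ID from the front matter is used as a stand-in, and the prompt is purely illustrative.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Stand-in ID taken from the diff's base_model entry; replace it with the
# actual Vortex Reasoning repository ID, which is not shown on this page.
model_id = "saishshinde15/Clyrai_Base_Reasoning"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Illustrative prompt targeting the step-by-step, self-questioning behaviour
# the card describes.
prompt = "Explain, step by step, why the sum of two odd numbers is always even."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```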