fast field test with your models

#1
by kalle07 - opened

Same test on each model: one question, with the same 20 snippets (1024 characters each) retrieved by the embedding model.
16k context size
600-page PDF embedded
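The setup above can be sketched roughly like this (a minimal, self-contained illustration only; the scoring function here is a plain bag-of-words cosine standing in for a real embedding model, and all names are placeholders, not the actual tooling used in the test):

```python
from collections import Counter
import math

def chunk_text(text, size=1024):
    # Split the document into fixed-size character snippets (1024 chars each).
    return [text[i:i + size] for i in range(0, len(text), size)]

def score(query, snippet):
    # Toy relevance score: cosine similarity over word counts.
    # A real setup would use embedding-model vectors instead.
    q, s = Counter(query.lower().split()), Counter(snippet.lower().split())
    dot = sum(q[w] * s[w] for w in q)
    norm = (math.sqrt(sum(v * v for v in q.values()))
            * math.sqrt(sum(v * v for v in s.values())))
    return dot / norm if norm else 0.0

def retrieve(query, text, k=20, size=1024):
    # Return the top-k snippets - i.e. what gets packed into the 16k context.
    chunks = chunk_text(text, size)
    return sorted(chunks, key=lambda c: score(query, c), reverse=True)[:k]
```

In the real test, each retrieved set would then be prepended to the same question and sent to every model under comparison, so only the generation step differs.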

grag-nemo-12b-sft-hessian-ai - 80%
grag-llama-3.1-8b-sft-hessian-ai - 70%
grag-phi-3.5-mini-4b-sft-hessian-ai - unusable

Answers in comparison to:
SauerkrautLM-Nemo-12b-Instruct - 90%
SauerkrautLM-Llama-3.1-8b-Instruct - 90%

No model can really handle a full 16k-token context .. try it!
But keep working - the German community is very quiet ;)

Hi @kalle07 ,
these models are meant for synthetic data generation, and the SFT models in particular are strictly bound to the prompt templates they were trained on in that stage.
If you would like to test a more generalized model, you could try the merged Llama model, or even look into the example where we merged with Hermes:
https://huggingface.co/avemio/German-RAG-LLAMA-3.1-8B-MERGED-HESSIAN-AI
https://huggingface.co/avemio/German-RAG-x-HERMES-LLAMA-MERGED-EXP

The decision to work with base models ultimately resulted in models that are too narrow and only accept the instruction tasks trained in the different stages. That was better for comparison, but not better for performance in the end.

Hope that helps until we prepare the next round!
