you should mention this model use deepseek architecture
#3
by
CHNtentes
- opened
add a special thanks or something
Thanks a lot for your suggestion, we have added THIRD_PARTY_NOTICES.md to clarify this. Please see https://huggingface.co/moonshotai/Kimi-K2-Instruct/blob/main/THIRD_PARTY_NOTICES.md
lsw825
changed discussion status to
closed