is this multimodal / VLM?

by ququwowo - opened 14 days ago

14 days ago

Hi! I saw "multimodal" mentioned in model card -- is this a vision-language model that can read images? or is this a pure text based LLM?

Thanks.

Xinhe123456

11 days ago

Hi! I saw "multimodal" mentioned in model card -- is this a vision-language model that can read images? or is this a pure text based LLM?

Thanks.

Seems like it's only text modality, based on their config file

ScienceOne-AI

Owner 11 days ago

Thank you for your attention. The "multimodal" in S1-Base refers to "scientific modalities" (such as spectra, fields, etc.). This repository is for the large language model in the S1-Base series, which is a text modality model.

ScienceOne-AI changed discussion status to closed 11 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment