Input model directly with embeddings

#40

by SergioLimone - opened Dec 21, 2023

Dec 21, 2023

Hey,
do you know if there is a way to input phi-2 directly with the tokens embeddings (rather than token ids).
In contrast to other HF models, the forward method does not seem to handle the 'inputs_embeds' argument.

Thanks.

susnato

Dec 22, 2023

•

edited Dec 22, 2023

Hi @SergioLimone , you can use

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("susnato/phi-2")
tokenizer = AutoTokenizer.from_pretrained("susnato/phi-2")

to load the model and then pass inputs_embeds directly.

This will load the phi model from the transformers library and you will be able to use any features that you can use with the other models loaded from the library.

Also, make sure you have the latest transformers installed.

pip install -U transformers

SergioLimone

Dec 23, 2023

Great, thanks. However, it seems that "susnato/phi-2" does not support 'device_map="auto"'. Is there an easy fix for that?
Thanks.

susnato

Dec 23, 2023

Hmm for that I think you can manually push that model either to CPU or GPU

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"
model = AutoModelForCausalLM.from_pretrained("susnato/phi-2").to(device)
tokenizer = AutoTokenizer.from_pretrained("susnato/phi-2")

gugarosa

Microsoft org Jan 9, 2024

Hello @SergioLimone !

We will be updating the model's files as soon as our ongoing PR is merged. It will fix any problems related to input_embeds not being able to be passed.

Regards,
Gustavo.

gugarosa changed discussion status to closed Jan 9, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment