How to use the AFWhisper to extract audio features

by GuoGuoBan - opened 25 days ago

Discussion

GuoGuoBan

25 days ago

Thank you for your contribution. But how can I use the AFWhisper to extract audio features?

SreyanG-NVIDIA

NVIDIA org 23 days ago

You can load the weights using Whisper and do exactly the same steps you do with Whisper.

GuoGuoBan

18 days ago

You can load the weights using Whisper and do exactly the same steps you do with Whisper.

I am using transformer package to load Whisper model weights as the link https://huggingface.co/openai/whisper-large-v3. Could you provide an example to how to load your model weights, please! Thank you very much!

gijs

18 days ago

I believe it is something like this:

import librosa
from transformers import WhisperProcessor
from transformers.models.whisper.modeling_whisper import WhisperEncoder

encoder = WhisperEncoder.from_pretrained("nvidia/audio-flamingo-3", subfolder="sound_tower")
processor = WhisperProcessor.from_pretrained("openai/whisper-large-v3")

audio_array, sampling_rate = librosa.load("test.mp3", sr=16000)
inputs = processor(audio_array, sampling_rate=16000, return_tensors="pt")

encoder_outputs = encoder(inputs.input_features)
    
print(f"Encoder output shape: {encoder_outputs.last_hidden_state.shape}")

GuoGuoBan

17 days ago

I believe it is something like this:

import librosa
from transformers import WhisperProcessor
from transformers.models.whisper.modeling_whisper import WhisperEncoder

encoder = WhisperEncoder.from_pretrained("nvidia/audio-flamingo-3", subfolder="sound_tower")
processor = WhisperProcessor.from_pretrained("openai/whisper-large-v3")

audio_array, sampling_rate = librosa.load("test.mp3", sr=16000)
inputs = processor(audio_array, sampling_rate=16000, return_tensors="pt")

encoder_outputs = encoder(inputs.input_features)
    
print(f"Encoder output shape: {encoder_outputs.last_hidden_state.shape}")

It works for me. Thank you!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment