torch>=2.0.0
torchvision
transformers==4.46.2
gradio==5.34.2
spaces
imageio
imageio[ffmpeg]
safetensors
einops
sentencepiece
protobuf
librosa
numpy
pillow
tqdm