How to use this quantized model file in music generation?
#1 opened by LikeGiver
As far as I know, the input of the InspireMusic model is an embedding vector rather than text or token ids, which effectively ties the model to the transformers dependency. How should I actually deploy it in production using inference libraries like llama.cpp? Do you have any suggestions?
To be honest, I have no clue, but llama.cpp is primarily a library, so if none of the examples that ship with llama.cpp fit your use case, you'd have to write your own program on top of the llama.cpp libraries.
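For what it's worth, the distinction the question raises can be sketched in a few lines. In a standard text LLM, token ids are looked up in an embedding table before entering the transformer; an embedding-input model like the one described skips the lookup and consumes the vectors directly, which is why a token-id-oriented pipeline doesn't fit out of the box. (llama.cpp's `llama_batch` does, as far as I know, have an `embd` field for passing raw embeddings instead of tokens, so a custom program along those lines may be possible, but I haven't tried it.) This is a toy illustration only; the "model" below is a stand-in, not InspireMusic's real API:

```python
import random

DIM = 4    # toy embedding dimension
VOCAB = 8  # toy vocabulary size

random.seed(0)
# toy embedding table: one DIM-sized vector per token id
embed_table = [[random.random() for _ in range(DIM)] for _ in range(VOCAB)]

def decode_from_embeddings(embeddings):
    # embedding-input path: vectors go straight into the network;
    # summing each vector stands in for the real transformer forward pass
    return [sum(vec) for vec in embeddings]

def decode_from_token_ids(token_ids):
    # standard LLM path: ids are first looked up in the embedding table,
    # then handed to the same network
    return decode_from_embeddings([embed_table[t] for t in token_ids])

# the two paths agree when the embeddings come from the same table
ids = [1, 3, 5]
out_a = decode_from_token_ids(ids)
out_b = decode_from_embeddings([embed_table[t] for t in ids])
assert out_a == out_b
```

The practical upshot: a deployment path for this model has to accept precomputed embedding vectors at the model boundary, not just a tokenizer-plus-ids interface.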