Did you try Elsewhere, like other platforms?
Inference platform focused on cold starts and GPU utilization