Has anyone achieved a speed-up with this model?
#3 opened 10 months ago
by
RonanMcGovern
Add text-generation pipeline tag and MIT license
#2 opened 11 months ago
by
nielsr
Is this MTP head just for predicting one token ahead?
#1 opened 11 months ago
by
RonanMcGovern