Spanish startup Multiverse Computing has released a new version of its HyperNova 60B model on Hugging Face that, it says, ...
With reported 3x speed gains and limited degradation in output quality, the method targets one of the biggest pain points in production AI systems: latency at scale.