In vision-language models (VLMs), visual tokens usually consume a significant amount of computational overhead, despite their sparser information density compared to text tokens. To address this, ...
Abstract: Spectral super-resolution (SSR) can reconstruct hyperspectral images (HSIs) from images with fewer spectral bands, such as RGB images, to furnish critical spectral information for var-ious ...