Abstract: Large Vision-Language Models (LVLMs) mainly rely on template-generated textual descriptions to understand defects. This reliance impairs the performance of these models for Industrial Defect ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results