Abstract: Large vision models (LVMs), particularly vision transformers (ViTs), stand at the forefront of computer vision ad-vancements, demonstrating exceptional capabilities in processing and ...
Abstract: In recent years, prompt tuning has proven effective in adapting pre-trained vision-language models to downstream tasks. These methods aim to adapt the pre-trained models by introducing ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results