News
However, although Transformer-based backbones have achieved much progress on ImageNet classification, it is still unclear whether the learned representations are as transferable as or even more ...
ABSTRACT: As morphemes are the smallest phonetic and semantic word formation units in Chinese, the study of morphemes has always been an important part of Chinese language acquisition research. Taking ...
Abstract: In this paper, we consider the problem of classifying a real world image to the corresponding object class based on its visual content via sparse representation, which is originally used as ...
Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering
Multimodal Large Language Models (MLLMs) have significantly advanced visual tasks by integrating visual representations into large language models (LLMs). The textual modality, inherited from LLMs, ...
In this paper, we aim to contribute to all these controversial questions by complementing Burge in demarcating the minimal case of perceptual representation. We will do so by analyzing (1) what is the ...
The left panel depicts the Audio-Visual Feature Representation framework and the Contrastive-Generative Synchronization Training methodology. For generative synchronization, we design a Feature ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results