News
Convolutional neural networks (CNNs) and vision transformers (ViTs) are widely adopted but have limitations. To address these challenges, we propose a frequency-enhanced lightweight vision Mamba ...
To fully exploit this additional attribute information, we introduce an Attribute Vision Transformer (A-ViT). This model integrates attribute tokens with image tokens, thereby enhancing the ReID ...
Run 🤗 Transformers directly in your browser, with no need for a server! Transformers.js is designed to be functionally equivalent to Hugging Face's transformers python library, meaning you can run ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results