Use Case Diagram Audio Super Resolution with Masked Autoencoder

About 685,000 results

Open links in new tab

Any time

ieee.org
https://ieeexplore.ieee.org › document
Audio Super-Resolution With Robust Speech Representation …
Recently, masked autoencoders have been found to be beneficial in learning robust representations of audio for speech classification tasks. Following these studies, we leverage …
github.com
https://github.com › facebookresearch › AudioMAE
facebookresearch/AudioMAE - GitHub
This repo hosts the code and models of "Masked Autoencoders that Listen". Resources
sciencedirect.com
https://www.sciencedirect.com › science › article › pii
A deep learning framework for audio restoration using …
Nov 15, 2023 · A bubble chart has been provided in Fig. 6 to show which combination of use case and audio length has a greater number of equal values when comparing their waveforms. As …
github.com
https://github.com › sh-lee-prml › sh-lee-prml
sh-lee-prml/sh-lee-prml - GitHub
Audio Super-resolution with Robust Speech Representation Learning of Masked Autoencoder, S.-B. Kim, S.-H. Lee, H.-Y. Choi, S.-W. Lee, IEEE Trans. on Audio, Speech and Language …
thecvf.com
https://openaccess.thecvf.com › content › papers › Georgescu...
[PDF]
Audiovisual Masked Autoencoders - CVF Open Access
Our work also uses the masked autoencod-ing framework, but jointly models both audio and video, and is demonstrated on both unimodal (i.e. video-only and audio-only) and audiovisual …
researchgate.net
https://www.researchgate.net › publication
(PDF) Audio Super-Resolution With Robust Speech
Jan 1, 2024 · In this paper, we propose an upper-band masking strategy with the initialization of the mask token, which is simple but efficient for audio super-resolution. Furthermore, we …
researchgate.net
https://www.researchgate.net › figure › Overall-framework-of-Fre...
Overall framework of Fre-Painter. Initially, we pre-train the masked ...
Initially, we pre-train the masked autoencoder using a random masking strategy. Subsequently, the generator is jointly trained with the pre-trained encoder of masked autoencoder. For audio...
audioldm.github.io
https://audioldm.github.io › audiosr
AudioSR: Versatile Audio Super-resolution at Scale - GitHub Pages
We introduce a diffusion-based generative model, AudioSR, that is capable of performing robust audio super-resolution on versatile audio types, including sound effects, music, and speech. …
researchgate.net
https://www.researchgate.net › publication
[PDF]
Audio Super-Resolution With Robust Speech Representation …
In this paper, we propose Fre-Painter, a robust neural audio super-resolution system that utilizes robust speech represen-tation learning using MAE and several masking strategies. We utilize a...
arxiv.org
https://arxiv.org › abs
[2207.06405] Masked Autoencoders that Listen - arXiv.org
Jul 13, 2022 · This paper studies a simple extension of image-based Masked Autoencoders (MAE) to self-supervised representation learning from audio spectrograms. Following the …

Some results have been removed
Pagination
- 1
- 2
- 3
- 4
- 5
- Next

Audio Super-Resolution With Robust Speech Representation …

facebookresearch/AudioMAE - GitHub

A deep learning framework for audio restoration using …

sh-lee-prml/sh-lee-prml - GitHub

Audiovisual Masked Autoencoders - CVF Open Access

(PDF) Audio Super-Resolution With Robust Speech

Overall framework of Fre-Painter. Initially, we pre-train the masked ...

AudioSR: Versatile Audio Super-resolution at Scale - GitHub Pages

Audio Super-Resolution With Robust Speech Representation …

[2207.06405] Masked Autoencoders that Listen - arXiv.org