MAGNeT: Masked Audio Generation using a Single Non-Autoregressive Transformer
Want to make some of these yourself?