Speech at 12.0 kbps

Audio Reconstruction from Discrete Tokens using different models and configurations.

Ground Truth
EnCodec
RFWave