This webpage provides audio demos for the test set of the WSJ0CAM-DEREVERB dataset.

| Utterance ID=000000, T60=1.1564 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.29 | 0.271 | -8.5 |
| WPE (row 1a of Table 1) | | | 1.52 | 0.462 | -0.6 |
| ARTT StageI (row 4a of Table 1) | | | 1.79 | 0.631 | -0.5 |
| ARTT StageII (row 4b of Table 1) | | | 2.31 | 0.790 | 5.5 |
| Target | | | - | - | - |

| Utterance ID=000001, T60=1.2623 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.27 | 0.300 | -7.5 |
| WPE (row 1a of Table 1) | | | 1.48 | 0.175 | -15.1 |
| ARTT StageI (row 4a of Table 1) | | | 1.79 | 0.662 | -0.3 |
| ARTT StageII (row 4b of Table 1) | | | 2.23 | 0.767 | 4.3 |
| Target | | | - | - | - |

| Utterance ID=000002, T60=1.1686 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.41 | 0.336 | -6.3 |
| WPE (row 1a of Table 1) | | | 1.44 | 0.082 | -10.1 |
| ARTT StageI (row 4a of Table 1) | | | 1.79 | 0.661 | -0.2 |
| ARTT StageII (row 4b of Table 1) | | | 2.24 | 0.794 | 2.6 |
| Target | | | - | - | - |

| Utterance ID=000003, T60=1.279 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.32 | 0.343 | -7.7 |
| WPE (row 1a of Table 1) | | | 1.44 | 0.260 | -10.3 |
| ARTT StageI (row 4a of Table 1) | | | 1.77 | 0.619 | -0.7 |
| ARTT StageII (row 4b of Table 1) | | | 2.36 | 0.760 | 6.7 |
| Target | | | - | - | - |

| Utterance ID=000004, T60=1.2509 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.33 | 0.284 | -6.9 |
| WPE (row 1a of Table 1) | | | 1.43 | 0.258 | -9.3 |
| ARTT StageI (row 4a of Table 1) | | | 1.70 | 0.635 | -1.0 |
| ARTT StageII (row 4b of Table 1) | | | 2.27 | 0.800 | 5.1 |
| Target | | | - | - | - |

| Utterance ID=000005, T60=1.037 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.33 | 0.384 | -4.6 |
| WPE (row 1a of Table 1) | | | 1.51 | 0.137 | -16.4 |
| ARTT StageI (row 4a of Table 1) | | | 1.73 | 0.651 | 1.1 |
| ARTT StageII (row 4b of Table 1) | | | 2.76 | 0.795 | 7.8 |
| Target | | | - | - | - |

| Utterance ID=000006, T60=0.829 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.44 | 0.386 | -7.0 |
| WPE (row 1a of Table 1) | | | 1.46 | 0.369 | -2.9 |
| ARTT StageI (row 4a of Table 1) | | | 2.30 | 0.754 | 2.7 |
| ARTT StageII (row 4b of Table 1) | | | 2.77 | 0.845 | 8.4 |
| Target | | | - | - | - |

| Utterance ID=000007, T60=0.2845 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 2.51 | 0.881 | 6.5 |
| WPE (row 1a of Table 1) | | | 1.49 | 0.498 | 0.4 |
| ARTT StageI (row 4a of Table 1) | | | 2.37 | 0.844 | 6.5 |
| ARTT StageII (row 4b of Table 1) | | | 2.79 | 0.922 | 9.2 |
| Target | | | - | - | - |

| Utterance ID=000008, T60=0.258 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 2.25 | 0.762 | 0.6 |
| WPE (row 1a of Table 1) | | | 2.39 | 0.754 | 3.8 |
| ARTT StageI (row 4a of Table 1) | | | 2.61 | 0.819 | 5.3 |
| ARTT StageII (row 4b of Table 1) | | | 2.95 | 0.879 | 9.3 |
| Target | | | - | - | - |

| Utterance ID=000009, T60=0.7434 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.47 | 0.434 | -5.2 |
| WPE (row 1a of Table 1) | | | 1.53 | 0.453 | -3.2 |
| ARTT StageI (row 4a of Table 1) | | | 2.00 | 0.720 | 0.2 |
| ARTT StageII (row 4b of Table 1) | | | 2.60 | 0.820 | 6.7 |
| Target | | | - | - | - |

| Utterance ID=000010, T60=1.147 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.31 | 0.402 | -4.0 |
| WPE (row 1a of Table 1) | | | 1.52 | 0.513 | -2.2 |
| ARTT StageI (row 4a of Table 1) | | | 1.85 | 0.715 | 4.1 |
| ARTT StageII (row 4b of Table 1) | | | 2.09 | 0.791 | 6.6 |
| Target | | | - | - | - |

| Utterance ID=000011, T60=0.7335 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.32 | 0.445 | -6.1 |
| WPE (row 1a of Table 1) | | | 1.84 | 0.654 | 0.0 |
| ARTT StageI (row 4a of Table 1) | | | 2.11 | 0.763 | 0.8 |
| ARTT StageII (row 4b of Table 1) | | | 2.77 | 0.882 | 8.8 |
| Target | | | - | - | - |

| Utterance ID=000012, T60=0.7096 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.49 | 0.354 | -4.3 |
| WPE (row 1a of Table 1) | | | 1.44 | 0.181 | -16.3 |
| ARTT StageI (row 4a of Table 1) | | | 2.05 | 0.681 | 1.0 |
| ARTT StageII (row 4b of Table 1) | | | 2.35 | 0.770 | 4.4 |
| Target | | | - | - | - |

| Utterance ID=000013, T60=0.4019 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.57 | 0.632 | -2.4 |
| WPE (row 1a of Table 1) | | | 1.45 | 0.439 | -3.0 |
| ARTT StageI (row 4a of Table 1) | | | 2.65 | 0.860 | 5.8 |
| ARTT StageII (row 4b of Table 1) | | | 3.50 | 0.913 | 9.4 |
| Target | | | - | - | - |

| Utterance ID=000014, T60=0.7831 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.38 | 0.418 | -5.2 |
| WPE (row 1a of Table 1) | | | 1.86 | 0.684 | 3.0 |
| ARTT StageI (row 4a of Table 1) | | | 2.11 | 0.748 | 3.2 |
| ARTT StageII (row 4b of Table 1) | | | 2.60 | 0.824 | 6.6 |
| Target | | | - | - | - |

| Utterance ID=000015, T60=1.0404 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.30 | 0.296 | -9.3 |
| WPE (row 1a of Table 1) | | | 1.54 | 0.530 | -2.8 |
| ARTT StageI (row 4a of Table 1) | | | 1.87 | 0.695 | -1.0 |
| ARTT StageII (row 4b of Table 1) | | | 2.66 | 0.854 | 7.5 |
| Target | | | - | - | - |

| Utterance ID=000016, T60=0.4121 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.70 | 0.674 | 0.1 |
| WPE (row 1a of Table 1) | | | 1.92 | 0.673 | 2.5 |
| ARTT StageI (row 4a of Table 1) | | | 2.68 | 0.863 | 6.1 |
| ARTT StageII (row 4b of Table 1) | | | 3.00 | 0.918 | 7.4 |
| Target | | | - | - | - |

| Utterance ID=000017, T60=1.0309 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.38 | 0.344 | -5.8 |
| WPE (row 1a of Table 1) | | | 1.48 | 0.457 | -4.5 |
| ARTT StageI (row 4a of Table 1) | | | 1.91 | 0.731 | 0.9 |
| ARTT StageII (row 4b of Table 1) | | | 2.76 | 0.895 | 7.9 |
| Target | | | - | - | - |

| Utterance ID=000018, T60=1.2955 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.29 | 0.273 | -10.2 |
| WPE (row 1a of Table 1) | | | 1.66 | 0.574 | -2.3 |
| ARTT StageI (row 4a of Table 1) | | | 1.75 | 0.685 | 0.2 |
| ARTT StageII (row 4b of Table 1) | | | 2.68 | 0.850 | 7.0 |
| Target | | | - | - | - |

| Utterance ID=000019, T60=1.161 s | Target Speaker | Log-compressed Magnitude Spectrogram | PESQ | eSTOI | SI-SDR (dB) |
| --- | --- | --- | --- | --- | --- |
| Mixture | | | 1.34 | 0.533 | -4.8 |
| WPE (row 1a of Table 1) | | | 1.62 | 0.522 | -4.4 |
| ARTT StageI (row 4a of Table 1) | | | 2.20 | 0.777 | 4.1 |
| ARTT StageII (row 4b of Table 1) | | | 2.46 | 0.852 | 6.5 |
| Target | | | - | - | - |