Spaces:

AIvry
/

MAPSS-measures

Running on Zero

App Files Files Community

AIvry commited on Sep 16

Commit

a41124a

verified ·

1 Parent(s): 9ae59a2

Update app.py

Browse files

Files changed (1) hide show

app.py +10 -11

app.py CHANGED Viewed

@@ -215,9 +215,9 @@ def create_interface():
         - `manifest_canonical.json`: File mapping and processing details
         ### Score Interpretation
-        - **NaN values**: Appear in frames where fewer than 2 speakers are active
         - **Valid scores**: Only computed when at least 2 speakers are active in a frame
-        - **Time resolution**: 20ms frames (configurable in code)
         ## Available Models
@@ -226,10 +226,10 @@ def create_interface():
         | `raw` | Raw waveform features | N/A | Baseline comparison |
         | `wavlm` | WavLM Large | 24 | Strong performance |
         | `wav2vec2` | Wav2Vec2 Large | 24 | Best overall performance |
-        | `hubert` | HuBERT Large | 24 | Good for speech |
-        | `wavlm_base` | WavLM Base | 12 | Faster processing |
         | `wav2vec2_base` | Wav2Vec2 Base | 12 | Faster, good quality |
-        | `hubert_base` | HuBERT Base | 12 | Faster processing |
         | `wav2vec2_xlsr` | Wav2Vec2 XLSR-53 | 24 | Multilingual |
         ## Parameters
@@ -253,12 +253,11 @@ def create_interface():
         If you use MAPSS, please cite:
         ```bibtex
-        @article{Ivry2025MAPSS,
-          title     = {MAPSS: Manifold-based Assessment of Perceptual Source Separation},
-          author    = {Ivry, Amir and Cornell, Samuele and Watanabe, Shinji},
-          journal   = {arXiv preprint arXiv:2509.09212},
-          year      = {2025},
-          url       = {https://arxiv.org/abs/2509.09212}
         }
         ```

         - `manifest_canonical.json`: File mapping and processing details
         ### Score Interpretation
         - **Valid scores**: Only computed when at least 2 speakers are active in a frame
+        - **NaN values**: Appear for non-active speakers, or when fewer than 2 speakers are active in the frame.
+        - **Time resolution**: 20ms frames
         ## Available Models
         | `raw` | Raw waveform features | N/A | Baseline comparison |
         | `wavlm` | WavLM Large | 24 | Strong performance |
         | `wav2vec2` | Wav2Vec2 Large | 24 | Best overall performance |
+        | `hubert` | HuBERT Large | 24 |  |
+        | `wavlm_base` | WavLM Base | 12 |  |
         | `wav2vec2_base` | Wav2Vec2 Base | 12 | Faster, good quality |
+        | `hubert_base` | HuBERT Base | 12 |  |
         | `wav2vec2_xlsr` | Wav2Vec2 XLSR-53 | 24 | Multilingual |
         ## Parameters
         If you use MAPSS, please cite:
         ```bibtex
+        @article{ivry2025mapss,
+        title={MAPSS: Manifold-based Assessment of Perceptual Source Separation},
+        author={Ivry, Amir and Cornell, Samuele and Watanabe, Shinji},
+        journal={arXiv preprint arXiv:2509.09212},
+        year={2025}
         }
         ```