1. 26 Apr, 2024 3 commits
    • * Adding temperature scaling on Joiner logits:
      
      - T hard-coded to 2.0
      - best result so far: NCE 0.122 (still not very high)
          - the BPE scores were rescaled with 0.2 (but then incorrect words also
            get high confidence; histograms look visually reasonable with scale 0.5)
          - BPE->WORD score merging is done with the min(.) function
            (also tried prob-product and the arithmetic, geometric, and harmonic means)
      
      - without temperature scaling (i.e. scale 1.0), the best NCE was 0.032 (here product merging was best)
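
A minimal sketch of the two steps described above, not the code from this commit: a softmax over logits divided by a temperature T, followed by min(.) merging of BPE confidences into a word confidence. The function names `softmax_with_temperature` and `merge_bpe_to_word` are hypothetical, and the extra 0.2 rescaling of BPE scores is left out because the message does not say exactly how it is applied.

```python
import math


def softmax_with_temperature(logits, temperature=2.0):
    """Softmax over logits / T. T > 1 flattens the distribution, T = 1 is a plain softmax."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def merge_bpe_to_word(bpe_confidences):
    """Word confidence as the minimum of its BPE token confidences."""
    if not bpe_confidences:
        raise ValueError("a word needs at least one BPE confidence")
    return min(bpe_confidences)


# Toy logits for one decoding step (assumed values, for illustration only).
logits = [2.0, 0.0]
plain = softmax_with_temperature(logits, temperature=1.0)
tempered = softmax_with_temperature(logits, temperature=2.0)

# Confidence of the emitted token = probability of the arg-max symbol.
token_conf_plain = max(plain)
token_conf_tempered = max(tempered)

# A word made of three BPE tokens (assumed confidences).
word_conf = merge_bpe_to_word([0.9, 0.6, 0.8])
```

With T = 2.0 the top-token probability is lower than with T = 1.0, which is the intended effect: over-confident posteriors are softened before they are used as confidences.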
      
      Results seem consistent with: https://arxiv.org/abs/2110.15222
      
      Everything was tuned on a very small set of 100 sentences (813 words, 10.2% WER) with a Czech model.
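
For reference, a sketch of how NCE (normalized cross entropy) is commonly computed from per-word confidences and correctness labels. The commit does not show its evaluation code, so the function name, the clipping constant `eps`, and the toy data below are assumptions for illustration.

```python
import math


def normalized_cross_entropy(confidences, is_correct, eps=1e-12):
    """NCE = (H_max - H_conf) / H_max.

    H_max is the cross entropy obtained when every word gets the same
    confidence p_c = (fraction of correct words); H_conf is the cross
    entropy of the actual confidences. NCE is 0 for that constant
    baseline and approaches 1 for perfect confidences.
    """
    n = len(confidences)
    if n == 0 or n != len(is_correct):
        raise ValueError("need equally long, non-empty inputs")
    n_correct = sum(1 for ok in is_correct if ok)
    if n_correct == 0 or n_correct == n:
        raise ValueError("NCE is undefined when all words are correct or all are wrong")

    p_c = n_correct / n
    h_max = -(n_correct * math.log2(p_c) + (n - n_correct) * math.log2(1.0 - p_c))

    h_conf = 0.0
    for conf, ok in zip(confidences, is_correct):
        conf = min(max(conf, eps), 1.0 - eps)  # keep the logarithm finite
        h_conf -= math.log2(conf) if ok else math.log2(1.0 - conf)

    return (h_max - h_conf) / h_max


# Toy example: 3 correct words, 1 incorrect word (assumed data).
labels = [True, True, True, False]
nce_baseline = normalized_cross_entropy([0.75, 0.75, 0.75, 0.75], labels)
nce_perfect = normalized_cross_entropy([1.0, 1.0, 1.0, 0.0], labels)
nce_overconfident = normalized_cross_entropy([0.99, 0.99, 0.99, 0.99], labels)
```

Over-confident scores on incorrect words can push NCE below zero, which is why softening the posteriors can help this metric.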
      
      I also experimented with mixing blank posteriors into the BPE confidences,
      but found no NCE improvement, so I am not pushing that.
      
      Temperature scaling was also added to the greedy search confidences.
      
      * making `temperature_scale` configurable from outside
      Karel Vesely authored
    • Fangjun Kuang authored
    • Daniel Doña authored
  2. 25 Apr, 2024 2 commits
  3. 24 Apr, 2024 4 commits
  4. 22 Apr, 2024 4 commits
  5. 21 Apr, 2024 1 commit
  6. 19 Apr, 2024 4 commits
  7. 18 Apr, 2024 1 commit
  8. 17 Apr, 2024 2 commits
  9. 16 Apr, 2024 5 commits
  10. 15 Apr, 2024 1 commit
  11. 14 Apr, 2024 1 commit
  12. 13 Apr, 2024 5 commits
  13. 12 Apr, 2024 1 commit
  14. 11 Apr, 2024 6 commits