1. 15 Jul, 2025 1 commit
  2. 14 Jul, 2025 1 commit
  3. 12 Jul, 2025 12 commits
  4. 11 Jul, 2025 2 commits
    • This PR adds support for the TEN VAD model alongside the existing Silero VAD in both C++ and Python interfaces.
      
      - Introduces TenVadModelConfig with Python bindings and integrates it into VadModelConfig.
      - Implements TenVadModel in C++ and extends the factory (VadModel::Create) and detector logic to choose between Silero and TEN VAD.
      - Updates build files (CMake), fixes a spelling typo, and extends the Python example script to demonstrate --ten-vad-model.
      Fangjun Kuang authored
    • Fangjun Kuang authored
  5. 10 Jul, 2025 2 commits
    • Fangjun Kuang authored
    • Add support for the new NeMo Canary ASR model across multiple language bindings by introducing a Canary model configuration and setter method on the offline recognizer.
      
      - Define Canary model config in Pascal, Go, C#, Dart and update converter functions
      - Add SetConfig API for offline recognizer (Pascal, Go, C#, Dart)
      - Extend CI/workflows and example scripts to test non-streaming Canary decoding
      Fangjun Kuang authored
  6. 09 Jul, 2025 3 commits
    • # New Features
      - Added new example programs demonstrating streaming speech recognition from a microphone using Parakeet-TDT CTC and Zipformer Transducer models with voice activity detection.
      - These examples support microphone input via PortAudio and display recognized text incrementally.
      
      # Bug Fixes
      - Improved error handling and logic when opening microphone devices in several example programs for more reliable device initialization.
      
      # Chores
      - Updated build configuration to include new executable examples when PortAudio support is enabled.
      Fangjun Kuang authored
    • This PR integrates LODR (Level-Ordered Deterministic Rescoring) support from Icefall into both online and offline recognizers, enabling LODR for LM shallow fusion and LM rescore.
      
      - Extended OnlineLMConfig and OfflineLMConfig to include lodr_fst, lodr_scale, and lodr_backoff_id.
      - Implemented LodrFst and LodrStateCost classes and wired them into RNN LM scoring in both online and offline code paths.
      - Updated Python bindings, CLI entry points, examples, and CI test scripts to accept and exercise the new LODR options.
      Askars Salimbajevs authored
    • Refactors and extends model export support to include new NeMo Parakeet TDT int8 variants for English and Japanese, updating the Kotlin API, export scripts, test runners, and CI workflows.
      
      - Added support for two new int8 model types in OfflineRecognizer.kt.
      - Enhanced Python export scripts to perform dynamic quantization and metadata injection.
      - Updated shell scripts and GitHub workflows to package, test, and publish int8 model artifacts.
      Fangjun Kuang authored
  7. 08 Jul, 2025 3 commits
  8. 07 Jul, 2025 3 commits
  9. 06 Jul, 2025 4 commits
  10. 04 Jul, 2025 5 commits
  11. 03 Jul, 2025 1 commit
  12. 02 Jul, 2025 1 commit
  13. 30 Jun, 2025 2 commits