LocalVocal: Local Live Captions & Translation On-the-Go

LocalVocal: Local Live Captions & Translation On-the-Go v0.3.6

Much rejoicing! Introducing AMD ROCm / hipBLAS acceleration, as well as CoreML and Metal on Apple!
This was tested to give tremendous performance boosts on Apple and AMD GPUs.

Also in this release:
  • bumping to the latest whisper.cpp
  • fixing the captions stream issue on Twitch and YouTube
  • fixing the translation captions disappearance.
Enjoy and let me know of any problems!

Download:
Big release! Many improvements, solving a lot of bugs and introducing far better performance.
In points:
  • VAD based segmentation - no more "3 seconds segments"! when speech is detected the transcription kicks in
  • Incremental output: The captions will appear continuously as you speak, a live transcription effect
  • Bumping the version of whisper.cpp
  • Many more options for real-time translation
  • A lot of bug fixing...
Download
  • Like
Reactions: IDLT
A big release. Fixing things and adding evaluation tools
  • Fixing Linux problems with missing dependency
  • Testing tools and scripts for evaluating the algorithm
  • Add Whisper-based translation
  • Add M2M100 1.2B model for translation
  • Optimizing overlap regions
  • Sentence suppression fixup
Download
  • Like
Reactions: IDLT
Top