streaming transcription or push-to-talk #49

Open
2 tasks done
khimaros opened this issue Jul 2, 2024 · 6 comments
Labels
enhancement New feature or request

Comments

@khimaros

khimaros commented Jul 2, 2024

Thank you for wanting to share an idea! But before starting, ensure to check if this feature request respects the following requirements:

  • It is written in English (I can translate what you say, but issues written in English are easier to read for the other users).
  • There is not already a similar feature request among the open or closed issues.

Is your feature request related to a problem? Please describe.
transcription in walkie talkie mode does not work reliably in noisy environments. even with adjustments to microphone sensitivity, it never stops listening for input, which means translation never begins.

Describe the solution you'd like
either offer a user control for when to start translating the buffer, or switch to a streaming mechanism so that input doesn't need to end before translation starts.
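
To illustrate the first option in the request (a user control that decides when the buffered audio is handed off), here is a minimal sketch. It is not RTranslator's implementation: the class `PushToTalkBuffer`, its methods, and the `on_utterance` callback are hypothetical names chosen for illustration, and the callback stands in for whatever transcription and translation step the app would run.

```python
from typing import Callable, List


class PushToTalkBuffer:
    """Hypothetical helper: buffers audio only while the talk button is held."""

    def __init__(self, on_utterance: Callable[[bytes], None]) -> None:
        # on_utterance is an assumed callback that would start
        # transcription/translation of the finished recording.
        self._on_utterance = on_utterance
        self._chunks: List[bytes] = []
        self._listening = False

    def press(self) -> None:
        """User starts talking: discard stale audio and begin buffering."""
        self._chunks.clear()
        self._listening = True

    def feed(self, chunk: bytes) -> None:
        """Called for every microphone chunk; ignored unless listening."""
        if self._listening:
            self._chunks.append(chunk)

    def release(self) -> None:
        """User stops talking: hand the whole buffer to the callback."""
        if not self._listening:
            return
        self._listening = False
        audio = b"".join(self._chunks)
        self._chunks.clear()
        if audio:  # skip empty recordings
            self._on_utterance(audio)
```

Because the end of input is signalled by the user rather than by silence detection, background noise cannot keep the recorder open indefinitely.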

@khimaros khimaros added the enhancement New feature or request label Jul 2, 2024
@niedev
Owner

niedev commented Jul 2, 2024

Hi, a streaming mechanism would be nearly impossible with the current models, but I was already planning a setting for choosing between automatic and manual listening, to ship together with the new GUI in RTranslator 2.1. However, it won't be very soon (probably 2 or 3 months). In the meantime, I could make sure that muting the microphone stops listening but still produces a transcription and a translation.

@khimaros
Author

khimaros commented Jul 2, 2024

utilizing the mute button as you describe would solve the problem well enough for now!

@niedev
Owner

niedev commented Jul 4, 2024

> utilizing the mute button as you describe would solve the problem well enough for now!

The new release with this change is out! 🚀 Let me know how it works.

@khimaros
Author

it is working well, thank you!

i wonder if you're aware of this: https://k2-fsa.github.io/sherpa/onnx/android/apk.html

@niedev
Owner

niedev commented Aug 13, 2024

@khimaros I didn't know about it, I'll take a look, thanks!

@niedev
Owner

niedev commented Sep 5, 2024

@khimaros the new release now has the option to use push to talk 🚀

2 participants