Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feature request] Local speech recognition #84

Open
josuah opened this issue Feb 23, 2024 · 2 comments
Open

[feature request] Local speech recognition #84

josuah opened this issue Feb 23, 2024 · 2 comments

Comments

@josuah
Copy link
Member

josuah commented Feb 23, 2024

Token and data plan saver: perform the transcription of audio locally, which seems to have very good results, as it is part of Apple Siri and Google Assistant products, as well as voice input as used to speak text messages.

IMG_0524

https://discord.com/channels/963222352534048818/984966420603482182/1210382075363065906

@josuah josuah changed the title [feature request] Local speech recognition. [feature request] Local speech recognition Feb 23, 2024
@lukeswitz
Copy link

Ran into all sorts of problems because monocle couldn’t handle any sort of bandwidth. Using the phone across the room was more accurate also. It’s possible if it’s chunked but the lag makes for a slow experience. Maybe frame has more throughput I’ll have to try it out when they land.

@josuah
Copy link
Member Author

josuah commented May 6, 2024

The Frame device does have a better Bluetooth bandwidth.

It is possible to choose a trade-off between low/high-resolution and low/high-bitrate audio for a compromise between bandwidth and speed.

There was not yet anyone to experiment with audio compression using StreamLogic, and audio compression for Frame was suggested here: brilliantlabsAR/frame-codebase#134 (comment)

But it seems like the FPGA is already full with JPEG encoding, so a trade-off would be needed to fit FPGA-based compression.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants