NoGoolag
This gives Google LESS access to your data!
FUTO Voice Input
Android application that lets you do speech-to-text on Android, integrating with third party keyboards or apps that use the generic speech-to-text APIs.
π FUTO Voice website: https://voiceinput.futo.org
π F-Droid repo:
https://app.futo.org/fdroid/repo/
π FUTO Voice play store: https://play.google.com/store/apps/details?id=org.futo.voiceinput
π FUTO Voice source code: https://gitlab.futo.org/alex/voiceinput
β οΈ License https://gitlab.futo.org/alex/voiceinput/-/blob/master/FTL_LICENSE.md
π΅ Linux Live Captions interview: https://www.youtube.com/watch?v=r09Hm2zd2lY
π Live Captions github: https://github.com/abb128/LiveCaptions
π΅ Louis' interview w/ FUTO's founder:
Q&A w/ billionaire alt-tech investor/philanthropist Eron Wolf
https://www.youtube.com/watch?v=OJPmbcU-Vzo
β οΈ Google's breaches of privacy have gone TOO FAR!
https://www.youtube.com/watch?v=_vWAF13KigI
00:00 - Intro
00:24 - The problem with voice to text
00:37 - How Google's voice to text stored data
01:24 - EULA roofying
02:21 - Why is this called paid promotion?
03:37 - Introducing our voice to text keyboard
04:00 - The issue with Google's voice to text
04:30 - Demoing Google's voice to text
04:58 - Installing FUTO Voice Input
05:26 - Configuring FUTO Voice Input
05:52 - A note on compatible keyboards
06:19 - Configuring FUTO Voice Input
06:42 - Demoing FUTO Voice Input - SLOW MODEL
07:54 - Demoing FUTO Voice Input - FAST MODEL
08:28 - Demoing Google Voice input - same text
08:52 - FUTO keyboard wins out - better accuracy & punctuation
09:34 - NO DATA COLLECTION, NO ADS, NO TRACKING
09:58 - Open Source, but not free
10:18 - Our DRM/antipiracy measure is the honor system
10:48 - Why isn't this free?
12:00 - We will be making a keyboard w/ autocorrect soon as well
#Android #speech #recognition #stt #voice
Android application that lets you do speech-to-text on Android, integrating with third party keyboards or apps that use the generic speech-to-text APIs.
π FUTO Voice website: https://voiceinput.futo.org
π F-Droid repo:
https://app.futo.org/fdroid/repo/
π FUTO Voice play store: https://play.google.com/store/apps/details?id=org.futo.voiceinput
π FUTO Voice source code: https://gitlab.futo.org/alex/voiceinput
β οΈ License https://gitlab.futo.org/alex/voiceinput/-/blob/master/FTL_LICENSE.md
π΅ Linux Live Captions interview: https://www.youtube.com/watch?v=r09Hm2zd2lY
π Live Captions github: https://github.com/abb128/LiveCaptions
π΅ Louis' interview w/ FUTO's founder:
Q&A w/ billionaire alt-tech investor/philanthropist Eron Wolf
https://www.youtube.com/watch?v=OJPmbcU-Vzo
β οΈ Google's breaches of privacy have gone TOO FAR!
https://www.youtube.com/watch?v=_vWAF13KigI
00:00 - Intro
00:24 - The problem with voice to text
00:37 - How Google's voice to text stored data
01:24 - EULA roofying
02:21 - Why is this called paid promotion?
03:37 - Introducing our voice to text keyboard
04:00 - The issue with Google's voice to text
04:30 - Demoing Google's voice to text
04:58 - Installing FUTO Voice Input
05:26 - Configuring FUTO Voice Input
05:52 - A note on compatible keyboards
06:19 - Configuring FUTO Voice Input
06:42 - Demoing FUTO Voice Input - SLOW MODEL
07:54 - Demoing FUTO Voice Input - FAST MODEL
08:28 - Demoing Google Voice input - same text
08:52 - FUTO keyboard wins out - better accuracy & punctuation
09:34 - NO DATA COLLECTION, NO ADS, NO TRACKING
09:58 - Open Source, but not free
10:18 - Our DRM/antipiracy measure is the honor system
10:48 - Why isn't this free?
12:00 - We will be making a keyboard w/ autocorrect soon as well
#Android #speech #recognition #stt #voice
Google Play
FUTO Voice Input - Apps on Google Play
Privacy-friendly Voice Input
Media is too big
VIEW IN TELEGRAM
#Linux Desktop application that provides live #captioning
FUTO Fellowship program interview; linux captions software
π Live Captions github: https://github.com/abb128/LiveCaptions
π΅ Q&A w/ billionaire alt-tech investor/philanthropist Eron Wolf
https://www.youtube.com/watch?v=OJPmbcU-Vzo
π΅ FUTO Fellows program: https://futo.org/fellows/
π΅ FUTO Youtube channel - @futotech
β οΈ Google's breaches of privacy have gone TOO FAR!
https://www.youtube.com/watch?v=_vWAF13KigI
#speech #recognition #stt #voice
FUTO Fellowship program interview; linux captions software
π Live Captions github: https://github.com/abb128/LiveCaptions
π΅ Q&A w/ billionaire alt-tech investor/philanthropist Eron Wolf
https://www.youtube.com/watch?v=OJPmbcU-Vzo
π΅ FUTO Fellows program: https://futo.org/fellows/
π΅ FUTO Youtube channel - @futotech
β οΈ Google's breaches of privacy have gone TOO FAR!
https://www.youtube.com/watch?v=_vWAF13KigI
#speech #recognition #stt #voice
WhisperTux
Simple #voice #dictation application for #Linux. Uses whisper.cpp for offline speech-to-text transcription. No fancy GPUs are required although whisper.cpp is capable of using them if available. Once your speech is transcribed, it is sent to a ydotool daemon that will write the text into the focused application.
Features
Local speech-to-text processing via whisper.cpp (no cloud dependencies)
No expensive hardware required (works well on a plain x86 laptop with AVX instructions)
Global keyboard shortcuts for system-wide operation
Automatic text injection into focused applications
Configurable whisper models and shortcuts
https://github.com/cjams/whispertux
#assistant #speech #stt
Simple #voice #dictation application for #Linux. Uses whisper.cpp for offline speech-to-text transcription. No fancy GPUs are required although whisper.cpp is capable of using them if available. Once your speech is transcribed, it is sent to a ydotool daemon that will write the text into the focused application.
Features
Local speech-to-text processing via whisper.cpp (no cloud dependencies)
No expensive hardware required (works well on a plain x86 laptop with AVX instructions)
Global keyboard shortcuts for system-wide operation
Automatic text injection into focused applications
Configurable whisper models and shortcuts
https://github.com/cjams/whispertux
#assistant #speech #stt
GitHub
GitHub - cjams/whispertux: Simple GUI around whisper.cpp for voice-to-text on Linux
Simple GUI around whisper.cpp for voice-to-text on Linux - cjams/whispertux
Dicio assistant
Dicio is a free and open source #voice #assistant running on #Android. It supports many different skills and input/output methods, and it provides both speech and graphical feedback to a question. It interprets user input and (when possible) generates user output entirely on-device, providing privacy by design. It has multilanguage support, and is currently available in these languages: Czech, English, French, German, Greek, Italian, Polish, Russian, Slovenian, Spanish, Swedish and Ukrainian. Open to contributions :-D
https://github.com/Stypox/dicio-android
Download
https://f-droid.org/packages/org.stypox.dicio
https://github.com/Stypox/dicio-android/releases
https://play.google.com/store/apps/details?id=org.stypox.dicio
Skills
Currently Dicio answers questions about:
search: looks up information on DuckDuckGo (and in the future more engines) - Search for Dicio
weather: collects weather information from OpenWeatherMap - What's the weather like?
lyrics: shows Genius lyrics for songs - What's the song that goes we will we will rock you?
open: opens an app on your device - Open NewPipe
calculator: evaluates basic calculations - What is four thousand and two times three minus a million divided by three hundred?
telephone: view and call contacts - Call Tom
timer: set, query and cancel timers - Set a timer for five minutes
current time: query current time - What time is it?
navigation: opens the navigation app at the requested position - Take me to New York, fifteenth avenue
media: play, pause, previous, next song
Speech to text
Dicio uses Vosk as its speech to text (#STT) engine. In order to be able to run on every phone small models are employed, weighing ~50MB. The download from here starts automatically whenever needed, so the app language can be changed seamlessly.
Dicio is a free and open source #voice #assistant running on #Android. It supports many different skills and input/output methods, and it provides both speech and graphical feedback to a question. It interprets user input and (when possible) generates user output entirely on-device, providing privacy by design. It has multilanguage support, and is currently available in these languages: Czech, English, French, German, Greek, Italian, Polish, Russian, Slovenian, Spanish, Swedish and Ukrainian. Open to contributions :-D
https://github.com/Stypox/dicio-android
Download
https://f-droid.org/packages/org.stypox.dicio
https://github.com/Stypox/dicio-android/releases
https://play.google.com/store/apps/details?id=org.stypox.dicio
Skills
Currently Dicio answers questions about:
search: looks up information on DuckDuckGo (and in the future more engines) - Search for Dicio
weather: collects weather information from OpenWeatherMap - What's the weather like?
lyrics: shows Genius lyrics for songs - What's the song that goes we will we will rock you?
open: opens an app on your device - Open NewPipe
calculator: evaluates basic calculations - What is four thousand and two times three minus a million divided by three hundred?
telephone: view and call contacts - Call Tom
timer: set, query and cancel timers - Set a timer for five minutes
current time: query current time - What time is it?
navigation: opens the navigation app at the requested position - Take me to New York, fifteenth avenue
media: play, pause, previous, next song
Speech to text
Dicio uses Vosk as its speech to text (#STT) engine. In order to be able to run on every phone small models are employed, weighing ~50MB. The download from here starts automatically whenever needed, so the app language can be changed seamlessly.
GitHub
GitHub - DicioTeam/dicio-android: Dicio assistant app for Android
Dicio assistant app for Android. Contribute to DicioTeam/dicio-android development by creating an account on GitHub.
Speed of Sound
#Voice #typing for the #Linux desktop:
Features
Offline, on-device transcription powered by Whisper, Parakeet, Canary, and more. No data leaves your machine.
Multiple activation options: click the in-app button or use a global keyboard shortcut.
Types the result directly into any focused application using Portals for wide desktop support (X11, Wayland).
Multi-language support with switchable primary and secondary languages on the fly.
Works out of the box with a built-in multilingual Whisper model. Download additional models from within the app to improve accuracy and language coverage.
Optional text polishing with LLMs (Anthropic, Google, OpenAI), with support for a custom context and vocabulary.
Supports self-hosted services like vLLM, Ollama, and llama.cpp (cloud services supported but not required).
Getting Started
The easiest and recommended way to install Speed of Sound is from Flathub or from the Snap Store. Alternatively, AppImage, Deb, and RPM packages are also available from the releases page.
For initial configuration, troubleshooting, and other resources, visit speedofsound.io
#stt
#Voice #typing for the #Linux desktop:
Features
Offline, on-device transcription powered by Whisper, Parakeet, Canary, and more. No data leaves your machine.
Multiple activation options: click the in-app button or use a global keyboard shortcut.
Types the result directly into any focused application using Portals for wide desktop support (X11, Wayland).
Multi-language support with switchable primary and secondary languages on the fly.
Works out of the box with a built-in multilingual Whisper model. Download additional models from within the app to improve accuracy and language coverage.
Optional text polishing with LLMs (Anthropic, Google, OpenAI), with support for a custom context and vocabulary.
Supports self-hosted services like vLLM, Ollama, and llama.cpp (cloud services supported but not required).
Getting Started
The easiest and recommended way to install Speed of Sound is from Flathub or from the Snap Store. Alternatively, AppImage, Deb, and RPM packages are also available from the releases page.
For initial configuration, troubleshooting, and other resources, visit speedofsound.io
#stt
flathub.org
Install Speed of Sound on Linux | Flathub
Voice typing for the Linux desktop
Speech Note
#Linux desktop and #Sailfish OS app for note taking, reading and translating with offline #Speech to Text #stt, Text to Speech #tts and Machine #Translation
https://github.com/mkiol/dsnote
MPL-2.0 license
https://github.com/mkiol/dsnote#how-to-install
Speech Note let you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet.
Speech Note uses many different processing engines to do its job. Currently these are used:
Speech to Text (STT)
Coqui STT (a fork of Mozilla DeepSpeech)
Vosk
whisper.cpp
Faster Whisper
april-asr
Text to Speech (TTS)
espeak-ng
MBROLA
Piper
RHVoice
Coqui TTS
Mimic 3
WhisperSpeech
Kokoro
Parler-TTS
F5-TTS
S.A.M.
Machine Translation (MT)
Bergamot Translator
#Linux desktop and #Sailfish OS app for note taking, reading and translating with offline #Speech to Text #stt, Text to Speech #tts and Machine #Translation
https://github.com/mkiol/dsnote
MPL-2.0 license
https://github.com/mkiol/dsnote#how-to-install
Speech Note let you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet.
Speech Note uses many different processing engines to do its job. Currently these are used:
Speech to Text (STT)
Coqui STT (a fork of Mozilla DeepSpeech)
Vosk
whisper.cpp
Faster Whisper
april-asr
Text to Speech (TTS)
espeak-ng
MBROLA
Piper
RHVoice
Coqui TTS
Mimic 3
WhisperSpeech
Kokoro
Parler-TTS
F5-TTS
S.A.M.
Machine Translation (MT)
Bergamot Translator
GitHub
GitHub - mkiol/dsnote: Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speechβ¦
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation. - mkiol/dsnote