NoGoolag

This gives Google LESS access to your data!

FUTO Voice Input

Android application that lets you do speech-to-text on Android, integrating with third party keyboards or apps that use the generic speech-to-text APIs.

👉 FUTO Voice website: https://voiceinput.futo.org
👉 F-Droid repo:
https://app.futo.org/fdroid/repo/
👉 FUTO Voice play store: https://play.google.com/store/apps/details?id=org.futo.voiceinput
👉 FUTO Voice source code: https://gitlab.futo.org/alex/voiceinput
⚠️ License https://gitlab.futo.org/alex/voiceinput/-/blob/master/FTL_LICENSE.md

🔵 Linux Live Captions interview: https://www.youtube.com/watch?v=r09Hm2zd2lY
👉 Live Captions github: https://github.com/abb128/LiveCaptions
🔵 Louis' interview w/ FUTO's founder:
Q&A w/ billionaire alt-tech investor/philanthropist Eron Wolf
https://www.youtube.com/watch?v=OJPmbcU-Vzo
⚠️ Google's breaches of privacy have gone TOO FAR!
https://www.youtube.com/watch?v=_vWAF13KigI

00:00 - Intro
00:24 - The problem with voice to text
00:37 - How Google's voice to text stored data
01:24 - EULA roofying
02:21 - Why is this called paid promotion?
03:37 - Introducing our voice to text keyboard
04:00 - The issue with Google's voice to text
04:30 - Demoing Google's voice to text
04:58 - Installing FUTO Voice Input
05:26 - Configuring FUTO Voice Input
05:52 - A note on compatible keyboards
06:19 - Configuring FUTO Voice Input
06:42 - Demoing FUTO Voice Input - SLOW MODEL
07:54 - Demoing FUTO Voice Input - FAST MODEL
08:28 - Demoing Google Voice input - same text
08:52 - FUTO keyboard wins out - better accuracy & punctuation
09:34 - NO DATA COLLECTION, NO ADS, NO TRACKING
09:58 - Open Source, but not free
10:18 - Our DRM/antipiracy measure is the honor system
10:48 - Why isn't this free?
12:00 - We will be making a keyboard w/ autocorrect soon as well

#Android #speech #recognition #stt #voice

Google Play

FUTO Voice Input - Apps on Google Play

Privacy-friendly Voice Input

527 viewsedited 11:17

#Linux Desktop application that provides live #captioning

FUTO Fellowship program interview; linux captions software

👉 Live Captions github: https://github.com/abb128/LiveCaptions

🔵 Q&A w/ billionaire alt-tech investor/philanthropist Eron Wolf
https://www.youtube.com/watch?v=OJPmbcU-Vzo
🔵 FUTO Fellows program: https://futo.org/fellows/
🔵 FUTO Youtube channel - @futotech
⚠️ Google's breaches of privacy have gone TOO FAR!
https://www.youtube.com/watch?v=_vWAF13KigI

#speech #recognition #stt #voice

520 viewsedited 11:17

NoGoolag

WhisperTux

Simple #voice #dictation application for #Linux. Uses whisper.cpp for offline speech-to-text transcription. No fancy GPUs are required although whisper.cpp is capable of using them if available. Once your speech is transcribed, it is sent to a ydotool daemon that will write the text into the focused application.

Features

Local speech-to-text processing via whisper.cpp (no cloud dependencies)
No expensive hardware required (works well on a plain x86 laptop with AVX instructions)
Global keyboard shortcuts for system-wide operation
Automatic text injection into focused applications
Configurable whisper models and shortcuts

https://github.com/cjams/whispertux

#assistant #speech #stt

GitHub

GitHub - cjams/whispertux: Simple GUI around whisper.cpp for voice-to-text on Linux

Simple GUI around whisper.cpp for voice-to-text on Linux - cjams/whispertux

358 views03:49

NoGoolag

Dicio assistant

Dicio is a free and open source #voice #assistant running on #Android. It supports many different skills and input/output methods, and it provides both speech and graphical feedback to a question. It interprets user input and (when possible) generates user output entirely on-device, providing privacy by design. It has multilanguage support, and is currently available in these languages: Czech, English, French, German, Greek, Italian, Polish, Russian, Slovenian, Spanish, Swedish and Ukrainian. Open to contributions :-D

https://github.com/Stypox/dicio-android

Download
https://f-droid.org/packages/org.stypox.dicio
https://github.com/Stypox/dicio-android/releases
https://play.google.com/store/apps/details?id=org.stypox.dicio

Skills
Currently Dicio answers questions about:
search: looks up information on DuckDuckGo (and in the future more engines) - Search for Dicio
weather: collects weather information from OpenWeatherMap - What's the weather like?
lyrics: shows Genius lyrics for songs - What's the song that goes we will we will rock you?
open: opens an app on your device - Open NewPipe
calculator: evaluates basic calculations - What is four thousand and two times three minus a million divided by three hundred?
telephone: view and call contacts - Call Tom
timer: set, query and cancel timers - Set a timer for five minutes
current time: query current time - What time is it?
navigation: opens the navigation app at the requested position - Take me to New York, fifteenth avenue
media: play, pause, previous, next song

Speech to text
Dicio uses Vosk as its speech to text (#STT) engine. In order to be able to run on every phone small models are employed, weighing ~50MB. The download from here starts automatically whenever needed, so the app language can be changed seamlessly.

GitHub

GitHub - DicioTeam/dicio-android: Dicio assistant app for Android

Dicio assistant app for Android. Contribute to DicioTeam/dicio-android development by creating an account on GitHub.

387 views10:39

NoGoolag

Speed of Sound

#Voice #typing for the #Linux desktop:

Features
Offline, on-device transcription powered by Whisper, Parakeet, Canary, and more. No data leaves your machine.
Multiple activation options: click the in-app button or use a global keyboard shortcut.
Types the result directly into any focused application using Portals for wide desktop support (X11, Wayland).
Multi-language support with switchable primary and secondary languages on the fly.
Works out of the box with a built-in multilingual Whisper model. Download additional models from within the app to improve accuracy and language coverage.
Optional text polishing with LLMs (Anthropic, Google, OpenAI), with support for a custom context and vocabulary.
Supports self-hosted services like vLLM, Ollama, and llama.cpp (cloud services supported but not required).

Getting Started
The easiest and recommended way to install Speed of Sound is from Flathub or from the Snap Store. Alternatively, AppImage, Deb, and RPM packages are also available from the releases page.
For initial configuration, troubleshooting, and other resources, visit speedofsound.io

#stt

flathub.org

Install Speed of Sound on Linux | Flathub

Voice typing for the Linux desktop

405 views16:20

NoGoolag

Speech Note

#Linux desktop and #Sailfish OS app for note taking, reading and translating with offline #Speech to Text #stt, Text to Speech #tts and Machine #Translation

https://github.com/mkiol/dsnote
MPL-2.0 license
https://github.com/mkiol/dsnote#how-to-install

Speech Note let you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet.

Speech Note uses many different processing engines to do its job. Currently these are used:

Speech to Text (STT)
Coqui STT (a fork of Mozilla DeepSpeech)
Vosk
whisper.cpp
Faster Whisper
april-asr

Text to Speech (TTS)
espeak-ng
MBROLA
Piper
RHVoice
Coqui TTS
Mimic 3
WhisperSpeech
Kokoro
Parler-TTS
F5-TTS
S.A.M.

Machine Translation (MT)
Bergamot Translator

GitHub

GitHub - mkiol/dsnote: Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech…

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation. - mkiol/dsnote

442 views21:14

About

Blog

Apps

Platform