Forum: d0p3 BBS

Re: Tutorial: Windows/Android privacy de-googled STT optimized for spee

From Alan Peeling@3:633/10 to All on Thu Apr 30 12:15:27 2026

Subject: Re: Tutorial: Windows/Android privacy de-googled STT optimized for speed

On 30/04/2026 09:53, Maria Sophia wrote:

The entire setup & testing was done using scrcpy & adb
on Windows over USB with Developer Options USB Debugging turned on...
if the phone was on
Wi-Fi, it could have been anywhere in the house on the LAN.

Does this mean SCRCPY can work over Wi-Fi? If so how is it done?

--
Alan

--- PyGate Linux v1.5.14
* Origin: Dragon's Lair, PyGate NNTP<>Fido Gate (3:633/10)

From Alan Peeling@3:633/10 to All on Thu Apr 30 21:50:16 2026

Subject: Re: Tutorial: Windows/Android privacy de-googled STT optimized for speed

On 30/04/2026 12:39, Maria Sophia wrote:

Here are some screenshots in series using both wired and wireless scrcpy.

Thanks for your very comprehensive post. My experience so far has been
that SCRCPY on USB works some of the time And SCRCPY on Wi-Fi won't
connect at all. As Wi-Fi connection seems to be a lot more faff than USB connection I think I'll forget about it.

--
Alan

--- PyGate Linux v1.5.14
* Origin: Dragon's Lair, PyGate NNTP<>Fido Gate (3:633/10)

From Paul@3:633/10 to All on Fri May 8 02:31:26 2026

Subject: Re: Tutorial: Windows/Android privacy de-googled STT optimized for speed

On Wed, 5/6/2026 10:37 PM, Maria Sophia wrote:

Maria Sophia wrote:

a. Low end Android => use HeliBoard + WhisperIME STT
b. High end Android => try the all-in-one Futo Keyboard

Testing takes time...

I'm having trouble with the tiny models in a noisy environment,
with the transcription taking too long or not working at all.

It seems the AGC on the mic is allowing too much noise to filter through.

First, I confirmed the small models are running by running adb logcat.
"testing testing 123"

Since, at home, you never need to touch the phone itself, from Windows:
adb shell logcat -c
adb shell "logcat -d -v tag WhisperEngineJava:D *:S"
--------- beginning of main
D/WhisperEngineJava: Model is
loaded.../storage/emulated/0/Android/data/org.woheller69.whisper/files/whisper-tiny.en.tflite
D/WhisperEngineJava: Filters and Vocab are loaded.../storage/emulated/0/Android/data/org.woheller69.whisper/files/filters_vocab_en.bin
D/WhisperEngineJava: Model is loaded.../storage/emulated/0/Android/data/org.woheller69.whisper/files/whisper-tiny.en.tflite
D/WhisperEngineJava: Filters and Vocab are loaded.../storage/emulated/0/Android/data/org.woheller69.whisper/files/filters_vocab_en.bin

Where the specgtrogram was too big for such as small sentence:
D/WhisperEngineJava: Calculating Mel spectrogram...
D/WhisperEngineJava: Mel spectrogram is calculated...!
D/WhisperEngineJava: output_len: 449

So to lower the mic sensitivity on the Samsung A32-5G, I ran:
adb shell settings put global call_noise_reduction 1
adb reboot

Re-run "testing, testing, 123"
adb shell logcat -c
adb shell "logcat -d -v tag WhisperEngineJava:D *:S"
--------- beginning of main
D/WhisperEngineJava: Model is loaded.../storage/emulated/0/Android/data/org.woheller69.whisper/files/whisper-tiny.en.tflite
D/WhisperEngineJava: Filters and Vocab are loaded.../storage/emulated/0/Android/data/org.woheller69.whisper/files/filters_vocab_en.bin
D/WhisperEngineJava: Calculating Mel spectrogram...
D/WhisperEngineJava: Mel spectrogram is calculated...!
D/WhisperEngineJava: output_len: 449
D/WhisperEngineJava: Skipping token: 50257, word: [_SOT_] D/WhisperEngineJava: Detected language code: en
D/WhisperEngineJava: Skipping token: 50259, word: [_extra_token_50259] D/WhisperEngineJava: It is Transcription...
D/WhisperEngineJava: Skipping token: 50359, word: [_extra_token_50359] D/WhisperEngineJava: Skipping token: 50363, word: [_BEG_] D/WhisperEngineJava: Skipping token: 50413, word: [_TT_50] D/WhisperEngineJava: Skipping token: 50513, word: [_TT_150] D/WhisperEngineJava: Inference is executed...!

Drat. It's still 449.

If that doesn't work in noisy environments, then I'll have to bump up
to the next-sized model, which I think is the base model.

adb push whisper-base.en.tflite /storage/emulated/0/Android/data/org.woheller69.whisper/files/
adb shell "cp /storage/emulated/0/Android/data/org.woheller69.whisper/files/whisper-base.en.tflite /storage/emulated/0/Android/data/org.woheller69.whisper/files/whisper.tflite"
adb shell "cp /storage/emulated/0/Android/data/org.woheller69.whisper/files/whisper-base.en.tflite /storage/emulated/0/Android/data/org.woheller69.whisper/files/whisper-tiny.en.tflite"

It missed the word "It's" in the picture.

[Picture] dsnote-ubu2504.gif

https://imgur.com/a/9VxuCCa

https://postimg.cc/CRrHVQXP

That's "dsnote" in Ubuntu using a Whisper model.
I read the text of the lines above, and the model
missed the "It's" on the recorded attempt. A
previous attempt was OK.

Microphone was a Blue Yeti. Which doesn't have AGC.
And the level wasn't all that high either, maybe
-24dBm or so. I recorded the microphone first in
Audacity, to see I had to hold the mike two inches
from my face to get a signal.

While the spec for the microphone claims a 20-20000Hz
response (which would be 3dB down at the ends),
it is clearly a "voice" microphone and it
cuts off the high frequencies. That's one of the reasons
the fans in the room didn't get picked up. So as far as
being a "live" mic, it's a bit of a "dull potato" as
mics go. But it does seem to give a decent result.

And when you "blast" the four lines above at the model,
then stop and wait for the conversion, it must have taken
at least 10-15 seconds to do the amount of text in the picture.
It "feels" slightly better, if you feed it a sentence at a time.
Feed it just a few words. It seems happier that way. Dragon
Naturally Speaking has nothing to worry about :-)

Paul

--- PyGate Linux v1.5.14
* Origin: Dragon's Lair, PyGate NNTP<>Fido Gate (3:633/10)

Who's Online
Recent Visitors
- jas hud
  Sun May 3 19:24:32 2026
  from bbs.eob-bbs.com via Telnet
- Guest
  Tue May 5 14:18:38 2026
  from Melbourne via Telnet
- RufusT
  Tue May 19 12:56:22 2026
  from Dallas, TX via RLogin
- Guest
  Tue May 26 22:56:58 2026
  from Melbourne via Telnet

System Info

Sysop:	Tetrazocine
Location:	Melbourne, VIC, Australia
Users:	15
Nodes:	8 (0 / 8)
Uptime:	128:50:31
Calls:	218
Files:	21,503
Messages:	84,351

Re: Tutorial: Windows/Android privacy de-googled STT optimized for spee

Who's Online

Recent Visitors

System Info