Forums before death by AOL, social media and spammers... "We can't have nice things"
|    alt.os.linux.mint    |    Looks pretty on the outside, thats it!    |    30,566 messages    |
[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]
|    Message 29,746 of 30,566    |
|    Paul to pinnerite    |
|    Re: I tried Speech Note    |
|    19 Nov 25 22:24:14    |
      From: nospam@needed.invalid              On Sun, 11/16/2025 4:09 PM, pinnerite wrote:       > This is a speech to text program.       > Example:       > Dictated: Fake Admiral at Remembrance Day Parade       > Printed: They have more. Pat Remembrance.       >       > I couldn't see any feed back buttons for corrections like I used to have       with Dragon Dictate.       > I may give up.       >       > Alan       >              I got it working.              A balloon help from the OS, overlaying the "Settings" item at the top of       the menu, is why I missed the Settings at first.              I guess I was expecting to see an input selector on       the interface. As you can have more than one microphone       on a computer. And you may not want to be altering the       machine setting, just for one application cycle, then       setting it back. Once my microphone was selected in the system       audio settings, things improved after that.              Once I did that part, I got some results.               [Picture]               https://i.postimg.cc/LXrN3d48/Call-Me-Ishmael-Simple-Note.jpg              I set it for "continuous listening", but it still runs one sentence       at a time. There is a setting in the settings dialog that will       "sound a tone" when it is processing, and that would have allowed       me to read the first paragraph of the book "Moby Dick" into it.       But I didn't need to do that.              The phase               "If this is testing"              was repeated twice. On the first instance, the "If" got       clipped off. So to some extent it suffers from the       usual squelch effects. This could be solved by the software       winding back the input trace and starting conversion before       the first detect-able blurb occurs.              I could probably dictate into it, but it auto-punctuates, so I       doubt making an exact copy of Moby Dick is possible.              *******              Speech recognition was always a garbage-in garbage-out thing.       I tried years ago to do this, only to be thwarted by a microphone       that was for all practical purposes, as insensitive as a stone.       I have many duff microphones, I have one microphone used       for video conferences (a nuisance to setup), and I have a new one.              About a year ago, as an impulse purchase, I bought this big-assed       microphone. I brought it home, was not getting much signal, and       people were saying it was kind of a dud. One person successfully       described the tuning procedure (requires setting the gain a lot       higher than you might expect, which is why other people were dissing       it). I put it back in the box after tune-up. It's just been       sitting there.              This was the first opportunity to test it for real. And you       can see in my trace, that with the exception of the missing "If",       it is getting most of the input.              I used a heavy-weight model (from Whisper). The settings box DOES have       a hardware acceleration tick box... which is unticked, and requires the       user to tick it. The evidence is, it was working. Maybe. I don't       know if there is a graphical version of "nvidia-smi" I could use       to watch the video card utilization. My video card idles at       14W out of 180W.               Paul              --- SoupGate-Win32 v1.05        * Origin: you cannot sedate... all the things you hate (1:229/2)    |
[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]
(c) 1994, bbs@darkrealms.ca