veganism.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Veganism Social is a welcoming space on the internet for vegans to connect and engage with the broader decentralized social media community.

Administered by:

Server stats:

296
active users

#speechtotext

1 post1 participant0 posts today
Tech Chilli<p>Voxtral: Open Source AI Audio Model—Capabilities, Features, and How to Access.</p><p>See here - <a href="https://techchilli.com/artificial-intelligence/mistral-voxtral-open-source-ai-audio-model/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">techchilli.com/artificial-inte</span><span class="invisible">lligence/mistral-voxtral-open-source-ai-audio-model/</span></a></p><p><a href="https://mastodon.social/tags/OpenSourceAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSourceAI</span></a> <a href="https://mastodon.social/tags/AudioAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AudioAI</span></a> <a href="https://mastodon.social/tags/Voxtral" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Voxtral</span></a> <a href="https://mastodon.social/tags/MistralAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MistralAI</span></a> <a href="https://mastodon.social/tags/Transcription" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Transcription</span></a> <a href="https://mastodon.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://mastodon.social/tags/AIModel" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AIModel</span></a> <a href="https://mastodon.social/tags/AI2025" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI2025</span></a> <a href="https://mastodon.social/tags/TechForAll" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TechForAll</span></a> <a href="https://mastodon.social/tags/FutureOfAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FutureOfAI</span></a> <a href="https://mastodon.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://mastodon.social/tags/Apache2" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Apache2</span></a></p>
Mide mike<p>Manual transcription is out — AI is in. Discover how transcription tools powered by AI are saving time, boosting accuracy, and transforming audio into text in seconds.<br>Full post here: <br><a href="https://mastodon.social/tags/aimartz" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aimartz</span></a> <a href="https://mastodon.social/tags/aimartz" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aimartz</span></a>.com <a href="https://mastodon.social/tags/AITranscription" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITranscription</span></a> <a href="https://mastodon.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://mastodon.social/tags/ProductivityTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ProductivityTech</span></a> </p><p><a href="https://aimartz.com/blog/transcription-ai/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">aimartz.com/blog/transcription</span><span class="invisible">-ai/</span></a></p>
kurtsh<p>One my favorite Windows 10/11 keyboard shortcuts remains Win-H for "voice typing" or voice-to-text. Easy to access, accurate &amp; built-in the OS.</p><p>Here's the list of special phrases to say for punctuation &amp; voice commands:</p><p><a href="https://support.microsoft.com/en-us/windows/use-voice-typing-to-talk-instead-of-type-on-your-pc-fec94565-c4bd-329d-e59a-af033fa5689f" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">support.microsoft.com/en-us/wi</span><span class="invisible">ndows/use-voice-typing-to-talk-instead-of-type-on-your-pc-fec94565-c4bd-329d-e59a-af033fa5689f</span></a><br><a href="https://mastodon.social/tags/windows11" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>windows11</span></a> <a href="https://mastodon.social/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechtotext</span></a> <a href="https://mastodon.social/tags/Microsoft" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Microsoft</span></a></p>
Debby<p><span class="h-card" translate="no"><a href="https://mastodon.social/@thelinuxEXP" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thelinuxEXP</span></a></span> I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by <span class="h-card" translate="no"><a href="https://mastodon.social/@mkiol" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>mkiol</span></a></span> </p><p>It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.</p><p>I primarily use <a href="https://hear-me.social/tags/WhisperAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WhisperAI</span></a> for transcription and Piper for voice, but many other models are available as well. </p><p>It is available as flatpak and <a href="https://github.com/mkiol/dsnote" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/mkiol/dsnote</span><span class="invisible"></span></a> </p><p><a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/transcription" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transcription</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/translator" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>translator</span></a> translation <a href="https://hear-me.social/tags/offline" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>offline</span></a> <a href="https://hear-me.social/tags/machinetranslation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinetranslation</span></a> <a href="https://hear-me.social/tags/sailfishos" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sailfishos</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechtotext</span></a> <a href="https://hear-me.social/tags/nmt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nmt</span></a> <a href="https://hear-me.social/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a>-desktop <a href="https://hear-me.social/tags/stt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>stt</span></a> <a href="https://hear-me.social/tags/asr" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>asr</span></a> <a href="https://hear-me.social/tags/flatpak" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>flatpak</span></a>-applications <a href="https://hear-me.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a></p>
Fossery Tech :debian: :gnome:<p>(Linux news in original post)</p><p>FOSS NEWS</p><p>Proton Mail gets Newsletter view to manage all email subscriptions in one place:<br><a href="https://proton.me/blog/proton-mail-newsletters" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">proton.me/blog/proton-mail-new</span><span class="invisible">sletters</span></a><br>(That's really cool. Now we can tell normies that Proton Mail has this feature and Gmail doesn't lol)</p><p>Proton Pass adds 14 new entry types, option to create custom types:<br><a href="https://alternativeto.net/news/2025/6/proton-pass-goes-beyond-passwords-and-credit-cards-with-customizable-item-storage/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">alternativeto.net/news/2025/6/</span><span class="invisible">proton-pass-goes-beyond-passwords-and-credit-cards-with-customizable-item-storage/</span></a><br>(Really tempting feature, but personally I would advise against storing every piece of sensitive data in one central database in the cloud. Proton can get hacked any time, like any other company, and also the new Swiss law can force them to hand over all that personal data in plain text, so you can mess up your privacy really badly. I'm not pointing fingers at Proton, but I think this update wasn't quite a good idea, it puts too much responsibility on them.)</p><p>Firefox 140 ESR released with unload tab feature, support for adding custom search engines in Search settings, support for keeping more or fewer pinned vertical tabs in view, "Select All" option for bookmarks on Android:<br><a href="https://9to5linux.com/firefox-140-esr-web-browser-is-now-available-for-download-this-is-whats-new" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">9to5linux.com/firefox-140-esr-</span><span class="invisible">web-browser-is-now-available-for-download-this-is-whats-new</span></a></p><p>Firefox 141 beta is available with less memory usage on Linux, ability to drag a tab to the pinned tabs tray and drag it out to unpin it, etc.:<br><a href="https://9to5linux.com/firefox-141-promises-to-use-less-memory-on-linux-systems-beta-out-now" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">9to5linux.com/firefox-141-prom</span><span class="invisible">ises-to-use-less-memory-on-linux-systems-beta-out-now</span></a></p><p>Mozilla discontinues DeepSpeech, an embedded/offline speech-to-text engine:<br><a href="https://www.phoronix.com/news/Mozilla-DeepSpeech-Discontinued" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">phoronix.com/news/Mozilla-Deep</span><span class="invisible">Speech-Discontinued</span></a><br>(GNOME: *drops a feature every few releases*<br>Mozilla: Hold my beer. *drops a service each week*)</p><p>(more FOSS news in comment)</p><p><a href="https://social.linux.pizza/tags/WeeklyNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WeeklyNews</span></a> <a href="https://social.linux.pizza/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://social.linux.pizza/tags/FOSSNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSSNews</span></a> <a href="https://social.linux.pizza/tags/OpenSourceNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSourceNews</span></a> <a href="https://social.linux.pizza/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSS</span></a> <a href="https://social.linux.pizza/tags/Proton" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Proton</span></a> <a href="https://social.linux.pizza/tags/ProtonMail" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ProtonMail</span></a> <a href="https://social.linux.pizza/tags/ProtonPass" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ProtonPass</span></a> <a href="https://social.linux.pizza/tags/Firefox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Firefox</span></a> <a href="https://social.linux.pizza/tags/Firefox140" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Firefox140</span></a> <a href="https://social.linux.pizza/tags/Firefox140ESR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Firefox140ESR</span></a> <a href="https://social.linux.pizza/tags/FirefoxBeta" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FirefoxBeta</span></a> <a href="https://social.linux.pizza/tags/Mozilla" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Mozilla</span></a> <a href="https://social.linux.pizza/tags/DeepSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DeepSpeech</span></a> <a href="https://social.linux.pizza/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://social.linux.pizza/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://social.linux.pizza/tags/Browser" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Browser</span></a> <a href="https://social.linux.pizza/tags/WebBrowser" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebBrowser</span></a> <a href="https://social.linux.pizza/tags/Email" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Email</span></a> <a href="https://social.linux.pizza/tags/EmailService" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EmailService</span></a> <a href="https://social.linux.pizza/tags/EmailProvider" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EmailProvider</span></a> <a href="https://social.linux.pizza/tags/PasswordManager" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PasswordManager</span></a> <a href="https://social.linux.pizza/tags/Privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Privacy</span></a> <a href="https://social.linux.pizza/tags/Security" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Security</span></a> <a href="https://social.linux.pizza/tags/FosseryTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FosseryTech</span></a></p>
Benjamin Carr, Ph.D. 👨🏻‍💻🧬<p><a href="https://hachyderm.io/tags/Mozilla" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Mozilla</span></a> Formally Discontinues Its <a href="https://hachyderm.io/tags/DeepSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DeepSpeech</span></a> Project<br><a href="https://hachyderm.io/tags/MozillaDeepSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MozillaDeepSpeech</span></a> was a <a href="https://hachyderm.io/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechtotext</span></a> engine with great performance for real-time communication even when running on <a href="https://hachyderm.io/tags/RaspberryPi" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RaspberryPi</span></a> and other low-power systems.<br>Mozilla discontinuing DeepSpeech sadly doesn't as surprise. Last tagged release was 0.9.3 back in December 2020 and there hadn't been any Git activity since 2021.<br>Even in 2020 DeepSpeech was considered at risk of ceasing development following Mozilla layoffs.<br><a href="https://www.phoronix.com/news/Mozilla-DeepSpeech-Discontinued" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">phoronix.com/news/Mozilla-Deep</span><span class="invisible">Speech-Discontinued</span></a></p>
unfa🇺🇦<p>If you're using an android phone you need this:</p><p><a href="https://keyboard.futo.org/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">keyboard.futo.org/</span><span class="invisible"></span></a><br> <a href="https://www.youtube.com/watch?v=cFP5bp3JvaU" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="">youtube.com/watch?v=cFP5bp3JvaU</span><span class="invisible"></span></a></p><p>I have been on the lookout for a sensible Gboard replacement that wasn't making my (voice) typing experience painful, and so far only FUTO Keyboard managed to provide that.</p><p>It has really good offline voice typing as well, which is something I use a lot.</p><p>I can not recommend this enough!</p><p><a href="https://mastodon.social/tags/FUTO" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FUTO</span></a> <a href="https://mastodon.social/tags/Android" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Android</span></a> <a href="https://mastodon.social/tags/Privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Privacy</span></a> <a href="https://mastodon.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://mastodon.social/tags/VoiceTyping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTyping</span></a> <a href="https://mastodon.social/tags/Swype" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Swype</span></a> <a href="https://mastodon.social/tags/Gboard" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Gboard</span></a> <a href="https://mastodon.social/tags/Heliboard" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Heliboard</span></a> <a href="https://mastodon.social/tags/Florisboard" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Florisboard</span></a></p>
athmane mokraoui [BoF] ⏚ꝃ⌁⁂<p>IBus Speech To Text : Ajoutez facilement le support du <a href="https://mstdn.fr/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> sur <a href="https://mstdn.fr/tags/Linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Linux</span></a>.</p><p><a href="https://www.renardudezert.com/2025/03/31/ibus-speech-to-text-ajoutez-facilement-le-support-du-stt-sur-linux.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">renardudezert.com/2025/03/31/i</span><span class="invisible">bus-speech-to-text-ajoutez-facilement-le-support-du-stt-sur-linux.html</span></a></p><p><a href="https://mstdn.fr/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a></p>
athmane mokraoui [BoF] ⏚ꝃ⌁⁂<p>ibus-speech-to-text will provide voice dictation capabilities to any application supporting IBus input methods in <a href="https://mstdn.fr/tags/Fedora" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Fedora</span></a> Linux 42, using VOSK for local voice recognition.</p><p>🔗 <a href="https://fedoraproject.org/wiki/Changes/ibus-speech-to-text" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">fedoraproject.org/wiki/Changes</span><span class="invisible">/ibus-speech-to-text</span></a></p><p><a href="https://mstdn.fr/tags/ibus" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ibus</span></a> <a href="https://mstdn.fr/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://mstdn.fr/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://mstdn.fr/tags/VOSK" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VOSK</span></a></p>
Omega_Scribet<p>Goed nieuws!</p><p>========<br>'AI-startup Juvoly plaatst supercomputers'</p><p>"Bij NorthC Datacenters in Rotterdam zijn op 10 maart de eerste NVIDIA DGX B200 supercomputers van Nederland onthuld"</p><p>"Juvoly, een Nederlandse AI-startup in de zorg, gebruikt de supercomputers voor het trainen van geavanceerde spraakherkenningsmodellen."</p><p>"Nederlandse organisaties zijn voor AI-training en hosting nog grotendeels afhankelijk van buitenlandse cloudproviders. Supercomputers, zoals de NVIDIA DGX B200, bieden echter de rekenkracht om grootschalige AI-modellen lokaal te ontwikkelen en te hosten, waardoor data binnen Nederland blijft. "</p><p>"Juvoly ontwikkelt en host inclusieve speech-to-tekstmodellen, geoptimaliseerd voor taalachterstanden, accenten en medische terminologie. Het resultaat: een model dat beter presteert, met een lager energieverbruik, dan de grote internationale spelers en al meer dan 5000 huisartsconsulten per dag verwerkt. Ook is Juvoly de enige aanbieder van Friese spraakherkenning. Momenteel werkt het bedrijf samen met Erasmus MC en de TU Delft aan de ontwikkeling van een Medisch Nederlands Large Language Model (LLM), dat in de toekomst open-source beschikbaar moet komen."</p><p><a href="https://www.skipr.nl/nieuws/ai-startup-juvoly-plaatst-supercomputers/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">skipr.nl/nieuws/ai-startup-juv</span><span class="invisible">oly-plaatst-supercomputers/</span></a></p><p><span class="h-card" translate="no"><a href="https://fosstodon.org/@bert_hubert" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>bert_hubert</span></a></span> </p><p><a href="https://mementomori.social/tags/zorg" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>zorg</span></a> <a href="https://mementomori.social/tags/Juvoly" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Juvoly</span></a> <a href="https://mementomori.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mementomori.social/tags/supercomputer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>supercomputer</span></a> <a href="https://mementomori.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://mementomori.social/tags/huisarts" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>huisarts</span></a> <a href="https://mementomori.social/tags/Fries" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Fries</span></a> <a href="https://mementomori.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a></p>
Farooq | فاروق<p>For learning languages, do you think it's a good idea to practice with an AI Speech Recognition and an AI Speech Synthesis engine?</p><p>I'm specifically interesting in British English and German.</p><p><a href="https://cr8r.gg/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://cr8r.gg/tags/ML" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ML</span></a> <a href="https://cr8r.gg/tags/LanguageLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LanguageLearning</span></a> <a href="https://cr8r.gg/tags/Learning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Learning</span></a> <a href="https://cr8r.gg/tags/SprachenLernen" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SprachenLernen</span></a> <a href="https://cr8r.gg/tags/British" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>British</span></a> <a href="https://cr8r.gg/tags/English" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>English</span></a> <a href="https://cr8r.gg/tags/DeutchLernen" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DeutchLernen</span></a> <a href="https://cr8r.gg/tags/EnglishLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EnglishLearning</span></a> <a href="https://cr8r.gg/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a> <a href="https://cr8r.gg/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechtotext</span></a> <a href="https://cr8r.gg/tags/speechrecognitionsoftware" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognitionsoftware</span></a> <a href="https://cr8r.gg/tags/speechsynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechsynthesis</span></a> <a href="https://cr8r.gg/tags/SpeechSynthesizer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesizer</span></a></p>
spmatich vk3spm :blobcoffee:<p>Some folks at work have been playing with using Gemini to take notes in meetings. I noticed that the speech-to-text engine is heavily biased towards English as a first language speakers, and does not handle accents very well. <br>But this does not reflect reality. A quick search shows us that only about one quarter (25%) of the English speaking world has English as a first language. The translation engine definitely needs to do better with the majority of English speakers.<br><a href="https://ioc.exchange/tags/Gemini" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Gemini</span></a> <a href="https://ioc.exchange/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://ioc.exchange/tags/Google" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Google</span></a> <a href="https://ioc.exchange/tags/EnglishAsASecondLanguage" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EnglishAsASecondLanguage</span></a><br><a href="https://www.babbel.com/en/magazine/how-many-people-speak-english-and-where-is-it-spoken#:~:text=How%20Many%20People%20In%20The,English%20as%20their%20first%20language" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">babbel.com/en/magazine/how-man</span><span class="invisible">y-people-speak-english-and-where-is-it-spoken#:~:text=How%20Many%20People%20In%20The,English%20as%20their%20first%20language</span></a>.</p>
Andresimous<p>Wie versprochen schiebe ich mal ein kleines <a href="https://oslo.town/tags/Tutorial" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Tutorial</span></a> zu <a href="https://oslo.town/tags/Vosk" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Vosk</span></a> rein. Mit Vosk könnt Ihr <a href="https://oslo.town/tags/Untertitel" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Untertitel</span></a> zu Videos erzeugen &amp; Audio-Dateien transkribieren. Vosk ist also ein <a href="https://oslo.town/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> Programm..<br>Auf der offiziellen Vosk-Webseite steht als Installationsanleitung:<br>- Installiere die Pakete Python3, pip3 und ffmpeg<br>- Installiere Vosk mit dem Befehl: pip3 install vosk</p><p>Doch das funktionierte bei mir auf Linux Mint nicht, denn nach dem pip3-Befehl konnte Vosk nicht gestartet werden.<br>1/x</p>