Peter Vágner

pvagner@fedi.ml

Father, geek, sysadmin from slovakia. I like to party with friends, work with new technologies, I am relying on screen reader accessibility. I prefer open-source and self-hosted apps and services.

Network posts

Banskobystricky, Slovakia

pvagner@pvagner.sk

pvagner:pvagner.sk

Peter Vágner reshared this.

Paweł Masarczyk

1 month ago

Paweł Masarczyk
1 month ago

TIL: There's a W3C candidate recommendation draft for a CSS markup to transfer different properties of text and controls on the web via audio cues and changes to the TTS volume, speech rate, tone, prosody and pronunciation, kind of like the attributed strings in iOS apps and it's called CSS Speech. w3.org/TR/css-speech-1/ #Accessibility #A11y #Blind

CSS Speech Module Level 1

^www.w3.org

reshared this

in reply to Paweł Masarczyk

miki

in reply to Paweł Masarczyk 1 month ago

Isn't this what Aural CSS was back in the day?

AFAIK that was never implemented in any mainstream browser, Emacspeak was the only implementor I know of.

As far as I remember, it had a bunch of extra properties for things like how speech should be positioned in 3d, which was a very Emacspeak thing to do.

in reply to miki

Paweł Masarczyk

in reply to miki 1 month ago

@miki Yes, I'm pretty sure it derives from Aural CSS. Yes, positioning is in there too.

@miki

in reply to Paweł Masarczyk

James Scholes

in reply to Paweł Masarczyk 1 month ago

There are people who seem to feel really strongly about this being a good thing for screen reader users, and I must admit to being bewildered about why. Websites changing aspects of screen reader output may be equitable, if we compare it with the way webpages can alter visual presentation through fonts and other aspects. But to me it feels entirely inappropriate to cross that boundary between the browser as the user agent and accessibility software in order to interfere with very personal settings.

Meanwhile on iOS, the related accessibility attributes are being used to achieve outcomes nobody wants or needs, like spaces between all the digits of a credit card number. @miki @prism

@miki @Drew Mochak

in reply to James Scholes

James Scholes

in reply to James Scholes 1 month ago

I can see the point for e.g. text-to-speech APIs built into the browser, maybe even read-aloud features. But the case for screen reader compatibility seems to be built on the foundational assertion that SR output is monotonous and can't be "livened up" by brands.

As assertions go, I think that is both true and exactly how it should be. I don't use a screen reader for entertainment. I can think of few things more obnoxious than a marketing person thinking that my screen reader should "shout this bit."

Many web authors can't even label stuff correctly. Why on earth would we expect them to treat this sort of feature with informed respect? @miki @prism

@miki @Drew Mochak

in reply to James Scholes

Drew Mochak

in reply to James Scholes 1 month ago

@jscholes @miki I was just going to say, this sounds similar to ARIA in that it can be helpful and can also completely break the site, depending on how it's used. Would we have been better off without it? Maybe, maybe not. Bit hard to tell at this point, the horse is out of the barn.

@miki @James Scholes

in reply to Drew Mochak

James Scholes

in reply to Drew Mochak 1 month ago

@prism I think without ARIA or an equivalent (like more things built into the web platform), the web would've continued galloping forward with all the same UI widgets and design patterns but with no way to make them even halfway accessible, and we'd be left even more behind than we are now.

By contrast, I don't think the inability for a website to change the pitch of NVDA is a legitimate blocker to anything worthwhile. @Piciok @miki

@miki @Paweł Masarczyk @Drew Mochak

in reply to James Scholes

Drew Mochak

in reply to James Scholes 1 month ago

@jscholes I have felt for a while that only having TTS for everything is pretty limitting. So, you know, I use unspoken. Problem solved. I haven't really thought to myself, self, it would be great if the website author could script some nonverbal feedback for me instead of what I am currently hearing, or anything like that. So this may well be a solution in search of a problem.
@Piciok @miki

@miki @Paweł Masarczyk @James Scholes

in reply to Drew Mochak

Rebecca

in reply to Drew Mochak 1 month ago

@prism @jscholes @miki I don't see the point because everyone has different ways they like to hear things. People choose the verbosity and speech options that work for them and to have something override that would be irritating. I also feel that this is part of a larger conversation about the perceived need for sighted people to feel like our experience of the web is vastly different. This is why we have a lot of unnecessary context already and here is another example.

@miki @James Scholes @Drew Mochak

in reply to Rebecca

miki

in reply to Rebecca 1 month ago

@silverleaf57 @prism @jscholes I kind of do see the point. currently, there's no way to announce something to a screen reader except through a status message, so you get "unread pinned important has attachment collapsed tickets for tomorrow", where sighted people just see "tickets for tomorrow" and see a bunch of icons.

@James Scholes @Rebecca @Drew Mochak

in reply to miki

miki

in reply to miki 1 month ago

@silverleaf57 @prism @jscholes I, for one, would certainly appreciate if I could hear exactly which parts of a line of code have "red squiggles" under them, preferrably with different styles for error and warning. This is something sigted people have. Visual Studio Code solves this with audio cues, but those are per line, not per character range.

@James Scholes @Rebecca @Drew Mochak

in reply to miki

James Scholes

in reply to miki 1 month ago

@miki I think it's a trap to suggest that such problems should currently be solved only through speech properties and auditory cues within individual apps. Expressive semantics on the web have only been explored at a surface level so far, and it's a complete stretch to go from "We don't have the ARIA properties to convey complex information," to "Let's have every application implement its own beeps and boops."

Imagine having to learn the sound scheme for Gmail, then Outlook, then Thunderbird. Then going over to Slack where they also have unread state albeit for chat messages rather than emails, but they use an entirely different approach again.

All the while, braille users are getting nothing, and people who struggle to process sounds alongside speech are becoming more and more frustrated. Even if we assume that this is being worked on in conjunction with improvements to ARIA and the like, how many teams have the bandwidth and willingness to implement more than one affordance?

We've already seen this in practice: ARIA has braille properties, but how many web apps use them? Practically none, because getting speech half right and giving braille users an even more subpar experience is easier. Your own example highlights how few apps currently let you control things like verbosity and ordering of information.

CSS Speech could turn out even worse. A product team might opt to implement it instead of semantics because the two blind people they spoke to said it would work for them, and never mind the other few million for whom it doesn't. They'll be the people complaining that there's no alternative to the accessibility feature a team spent a month on and thought was the bee's knees.

@silverleaf57 @prism @Piciok

@miki @Paweł Masarczyk @Rebecca @Drew Mochak

in reply to James Scholes

miki

in reply to James Scholes 1 month ago

@jscholes @silverleaf57 @prism Imagine having to learn the icon scheme for... oh wait.

@James Scholes @Rebecca @Drew Mochak

in reply to miki

James Scholes

in reply to miki 1 month ago

@miki There is much shared (or adjacent) iconography in the world, with a lot more power and opinion behind it than the sounds for a web app are going to get. Despite that, icon fatigue is a real and common user complaint; it seems bizarre to be leaning into such an issue purely in the name of equity. @silverleaf57 @prism @Piciok

@miki @Paweł Masarczyk @Rebecca @Drew Mochak

in reply to James Scholes

miki

in reply to James Scholes 1 month ago

@jscholes @silverleaf57 @prism Efficiency, not equity.

Words are a precious resource, far more precious than even screen real estate. After all, you can only get a fairly limited amount of them through a speaker in a second. We should conserve this resource as much as we can. That means as many other "side channels" as we can get, sounds, pitch changes, audio effects, stereo panning (when available) and much more.

Icon fatigue is real. "me English bad, me no know what delete is to mean" is also real, and icons, pictograms and other kinds of pictures is how you solve that problem in sighted land.

Obviously removing all labels and replacing it with pictograms is a bad idea. Removing all icons and replacing them with text... is how you get glorified DOS UIs with mouse support, and nobody uses these.

@James Scholes @Rebecca @Drew Mochak

in reply to miki

miki

in reply to miki 1 month ago

@jscholes @silverleaf57 @prism Everything said above also applies to braille, Braille cells are even more precious than words in a speaker. It's a schame we can abbreviate "main landmark heading level 2" to something more sensible, but we can't abbreviate "unread pinned has attachment overdue" if those labels are not "blessed" by some OS accessibility API.

@James Scholes @Rebecca @Drew Mochak

in reply to James Scholes

James Scholes

in reply to James Scholes 1 month ago

@miki Note that I'm specifically responding to your proposed use case here. You want beeps and boops, and I think you should have them. But:

1. I think you should have them in a centralised place that you control, made possible via relevant semantics.

2. I don't think the fact that some people like beeps and boops is a good reason to prioritise incorporating beeps and boops into the web stack in a way that can't be represented via any other modality.

@silverleaf57 @prism @Piciok

@miki @Paweł Masarczyk @Rebecca @Drew Mochak

This entry was edited (1 month ago)

in reply to James Scholes

miki

in reply to James Scholes 1 month ago

@jscholes @silverleaf57 @prism Centralized beeps and boops don't make much sense to me. Each app needs a different set, let's just consider important items on a list. That can mean "overdue", "signature required", "has unresolved complaints", "student not present", "compliance certification not granted" or something entirely different. We can't expect screen readers to have styles for all of these, just as we can't expect browsers to ship icons for all of these.

@James Scholes @Rebecca @Drew Mochak

in reply to miki

James Scholes

in reply to miki 1 month ago

@miki Sure. Or it can just mean "important" in a domain-specific way that's shared across apps in that domain. We should be taking advantage of that to make information presentation and processing more streamlined, before inventing an entirely new layer and interaction paradigm that hasn't been user tested and will require text alternatives anyway. @silverleaf57 @prism @Piciok

@miki @Paweł Masarczyk @Rebecca @Drew Mochak

in reply to James Scholes

James Scholes

in reply to James Scholes 1 month ago

@miki As noted, I think people who can process a more efficient stream of information should have it available to them. That could be through a combination of normalised/centralised semantics, support for specialised custom cases, and multi-modal output.

My main concern remains CSS Speech being positioned as the only solution to information processing bottlenecks, which I think is a particularly narrow view and will make things less accessible for many users rather than more.

Good discussion, thanks for chatting through it. @silverleaf57 @prism @Piciok

@miki @Paweł Masarczyk @Rebecca @Drew Mochak

in reply to James Scholes

Drew Mochak

in reply to James Scholes 1 month ago

@jscholes At the same time, I think the chances that CSSSpeech completely takes over the industry and we all stop doing text role assignments is quite low.
explainxkcd.com/wiki/index.php…

So I am decidedly meh about this. It could help but probably won't.
@miki @silverleaf57 @Piciok

927: Standards - explain xkcd

explain xkcd is a wiki dedicated to explaining the webcomic xkcd. Go figure.

^{www.explainxkcd.com}

@miki @Paweł Masarczyk @James Scholes @Rebecca

in reply to Drew Mochak

James Scholes

in reply to Drew Mochak 1 month ago

@prism @miki @silverleaf57 Please be assured that it hasn't reached "keep me up at night" status or anything. After all, it has so much competition on that front!

@miki @Rebecca @Drew Mochak

in reply to James Scholes

Paweł Masarczyk

in reply to James Scholes 1 month ago

@jscholes @prism @miki @silverleaf57 I found the concept intriguing and am myself in two minds about it. On one hand, I wouldn't mind having the speech experience augmented by things that aren't words. I could imagine browsing a product's details page and reading upon all of it's features with tiny earcons indicating whether certain feature is supported or not rather than hearing "Yes" and "No" every time. This could even be played at the same time as the readout begins. To be fair, I also don't mind having the pronunciation of tricky words that are important for proper understanding and functioning in a domain, predefined just so I could learn it. Character and number processing might come in handy too - recently there was an issue on the NVDA Github opened against a feature to read combinations of capital letters and digits as separate entities for the benefit of ham radio operators and their call signs. Some kinds of numbers I also find easier to remember when they come digit-by-digit etc. The ability to define the spatial location of voice on the stereo sound spectrum could be useful for presenting those spatial relationships in some advanced web apps (thinking scientific contexts, design, web text and code editors etc.. As you say, however, I wouldn't expect this being widely adopted by web devs who already struggle with the proper use of ARIA. Also the trade-offs could be significant, especially if this becomes the sole way of conveying information. Blind users with a profound hearing impairment who will miss out on crucial information because it was read out too quietly, too fast and with a pitch that takes away some of the frequencies they can't discern any more; neurodivergent people confused by sudden changes and unfamiliar sounds on top of exotic keyboard shortcut choices they already have to remember etc. This could create a situation similar to WCAG SC 1.4.1 where the colour is used as the only way of conveying information.

@miki @James Scholes @Rebecca @Drew Mochak

in reply to Paweł Masarczyk

Drew Mochak

in reply to Paweł Masarczyk 1 month ago

This already exists though, as a screenreader feature. Kind of. NVDA has an add-on called unspoken that will replace the announcement of roles with various sounds, there's a different one for checked vs. unchecked boxes for instance. JAWS did (does?) something similar with the shareable schemes in the speech and sounds manager. Granted, not a lot of people do this, but the ability is there if people want it. VO, TB and cvox also have earcons--they're not used for this purpose, but they could be. Having this under the user's control rather than the author's control does seem better. It prevents for instance a developer deciding to be super abtrusive with ads. I do see the potential for it to be good, the author would be able to convey more nuanced concepts being the author of the content... it just feels like a thing most people wouldn't use, and most of the people who'd try would end up being obnoxious about it.

@jscholes @miki @silverleaf57

@miki @James Scholes @Rebecca

in reply to Drew Mochak

Paweł Masarczyk

in reply to Drew Mochak 1 month ago

@prism @jscholes @miki @silverleaf57 Yes, this is what I'm thinking too. Also, the addons are great - I experiment with Earcons and Speech Rules which is another addon with tons of customization. Bringing it on as a core feature would signal it as industry standard though and from that it would be possible to explore whether any external API's could augment it in any way.

@miki @James Scholes @Rebecca @Drew Mochak

in reply to James Scholes

Paweł Masarczyk

in reply to James Scholes 1 month ago

@jscholes @prism @miki @silverleaf57 As for this being widely adopted, I expect some CSS properties could be mapped to the aural cues on a browser lever just like some HTML elements carry implicit ARIA properties with them by default. This would have to be carefully considered. Regarding sound cues: this would have to be based on some kind of familiarity principle where the sounds are those most users will already know or they resemble the action they are supposed to represent, think emptying the recycle bin on Windows. I really like the approach of JAWS representing heading levels through piano notes in C major - it sounds logical but on the other hand not everyone is able to recognize musical notes at random. I'm not convinced about the marketing value of this - I mean creating brand voices etc. It sounds fun but no more than that, at least in the screen reader context. I guess inclusion in advertising is another can of worms that might derail the discussion. I'm looking forward to when NVDA finally incorporates some kind of sound scheme system because we will then be able to talk about some kind of standard given that JAWS and to some extent VoiceOver and Talkback make use of that already. I guess then the discussion could evolve around this being complementary to something like aria-roledescription or aria-brailleroledescription, assigning familiar sounds and speech patterns to custom-built controls.

@miki @James Scholes @Rebecca @Drew Mochak

in reply to James Scholes

Paweł Masarczyk

in reply to James Scholes 1 month ago

@jscholes @prism @miki @silverleaf57 I think inviting @tink and @pixelate into the discussion is a great idea as they might have valuable insights on this. On a related note: something that's been running around my head is how many Emojis could be faithfully represented by sounds.

@miki @James Scholes @Rebecca @Léonie Watson @Devin Prater :blind: @Drew Mochak

in reply to Paweł Masarczyk

Devin Prater :blind:

in reply to Paweł Masarczyk 1 month ago

@jscholes @prism @miki @silverleaf57 @tink So, I generally like beeps and boops. All shiny and stuff. But the web is made by sighted people, and they will get things wrong. I'd rather we have our own tools, like NVDA'S earcons addon, and maybe have earcon packs for it to, for example, add aural highlighting for VS Code, or make-gmail-shiny, stuff like that.

@miki @James Scholes @Rebecca @Léonie Watson @Drew Mochak

in reply to Paweł Masarczyk

Adrian Roselli, pH0

in reply to Paweł Masarczyk 1 month ago

Léonie Watson wrote about this (and delivered talks on it) a few years ago, in case her thoughts interest you: tink.uk/why-we-need-css-speech…

Why we need CSS Speech - Tink - Léonie Watson

^{Tink - Léonie Watson - On technology, food & life in the digital age}

This entry was edited (1 month ago)

Peter Vágner reshared this.

Andy Greenberg

1 month ago

Andy Greenberg
1 month ago

Researchers pointed a satellite dish at the sky for 3 years and monitored what unencrypted data it picked up. The results were shocking: They obtained thousands of T-Mobile users' phone calls and texts, military and law enforcement secrets, much more: 🧵👇wired.com/story/satellites-are…

This entry was edited (1 month ago)

reshared this

Peter Vágner reshared this.

ondrosik

1 month ago from Enafore

ondrosik
1 month ago from Enafore

During last 3 months I am using VDO ninja for all my remote interwiev and podcast recordings. here is my article about it from the blind perspective, focused on accessibility and audio.

Have You Ever Wanted to Record an Interview or Podcast Online? You’ve probably faced a few challenges:
How to transmit audio in the highest possible quality?
How to connect in a way that doesn’t burden your guest with installing software?
And how to record everything, ideally into separate tracks?

The solution to these problems is offered by the open-source tool VDO Ninja.

What Is VDO Ninja

It’s an open-source web application that uses WebRTC technology. It allows you to create a P2P connection between participants in an audio or video call and gives you control over various transmission parameters.
You can decide whether the room will include video, what and when will be recorded, and much more.

In terms of accessibility, the interface is fairly easy to get used to — and all parameters can be adjusted directly in the URL address when joining.
All you need is a web browser, either on a computer or smartphone.

Getting Started

The basic principle is similar to using MS Teams, Google Meet, and similar services.
All participants join the same room via a link.
However, VDO Ninja distinguishes between two main types of participants: Guests and the Director.
While the guest has limited control, the director can, for example, change the guest’s input audio device (the change still must be confirmed by the guest).

A Few Words About Browsers

VDO Ninja works in most browsers, but I’ve found Google Chrome to be the most reliable.
Firefox, for some reason, doesn’t display all available audio devices, and when recording multiple tracks, it refuses to download several files simultaneously.

Let’s Record a Podcast

Let’s imagine we’re going to record our podcast, for example, Blindrevue.
We can connect using a link like this:

https://vdo.ninja/?director=Blindrevue&novideo=1&proaudio=1&label=Ondro&autostart=1&videomute=1&showdirector=1&autorecord&sm=0&beep

Looking at the URL more closely, we can see that it contains some useful instructions:

director – Defines that we are the director of the room, giving us more control. The value after the equals sign is the room name.
novideo – Prevents video from being transmitted from participants. This parameter is optional but useful when recording podcasts to save bandwidth.
proaudio – Disables effects like noise reduction, echo cancellation, automatic gain control, compression, etc., and enables stereo transmission.
Be aware that with this setting, you should use headphones, as echo cancellation is disabled, and otherwise, participants will hear themselves.
label=Ondro – Automatically assigns me the nickname “Ondro.”
autostart – Starts streaming immediately after joining, skipping the initial setup dialog.
videomute – Automatically disables the webcam.
showdirector – Displays our own input control panel (useful if we want to record ourselves).
autorecord – Automatically starts recording for each participant as they join.
sm=0 – Ensures that we automatically hear every new participant without manually unmuting them.
beep – Plays a sound and sends system notification when new participants join (requires notification permissions).

For guests, we can send a link like this:

https://vdo.ninja/?room=Blindrevue&novideo=1&proaudio=1&label&autostart=1&videomute=1&webcam

Notice the differences:

We replaced director with room. The value must remain the same, otherwise the guest will end up in a different room.
We left label empty — this makes VDO Ninja ask the guest for a nickname upon joining.
Alternatively, you can send personalized links, e.g., label=Peter or label=Marek.
The webcam parameter tells VDO Ninja to immediately stream audio from the guest’s microphone; otherwise, they’d need to click “Start streaming” or “Share screen.”

How to Join

Simply open the link in a browser.
In our case, the director automatically streams audio to everyone else.
Participants also join by opening their link in a browser.
If a nickname was predefined, they’ll only be asked for permission to access their microphone and camera.
Otherwise, they’ll also be prompted to enter their name.

Usually, the browser will display a permission warning.
Press F6 to focus on it, then Tab through available options and allow access.

Controls

The page contains several useful buttons:

Text chat – Toggles the text chat panel, also allows sending files.
Mute speaker output – Mutes local playback (others can still hear you).
Mute microphone – Mutes your mic.
Mute camera – Turns off your camera (enabled by default in our example).
Share screen / Share website – Allows screen or site sharing.
Room settings menu (director only) – Shows room configuration options.
Settings menu – Lets you configure input/output devices.
Stop publishing audio and video (director only) – Stops sending audio/video but still receives others.

Adjusting Input and Output Devices

To change your audio devices:

Activate Settings menu.
Press C to jump to the camera list — skip this for audio-only.
Open Audio sources to pick a microphone.
In Audio output destination, select your playback device. Press test button to test it.
Close settings when done.

Director Options

Each guest appears as a separate landmark on the page.
You can navigate between them quickly (e.g., using D with NVDA).

Useful controls include:

Volume slider – Adjusts how loud each participant sounds (locally only).
Mute – Silences a guest for everyone.
Hangup – Disconnects a participant.
Audio settings – Adjusts their audio input/output remotely.

Adjusting Guest Audio

Under Audio settings, you can:

Enable/disable filters (noise gate, compressor, auto-gain, etc.).
View and change the guest’s input device — if you change it, a Request button appears, prompting the guest to confirm the change.
Change the output device, useful for switching between speaker and earpiece on mobile devices.

Recording

Our URL parameters define automatic recording for all participants.
Recordings are saved in your Downloads folder, and progress can be checked with Ctrl+J.

Each participant’s recording is a separate file.
For editing, import them into separate tracks in your DAW and synchronize them manually.
VDO Ninja doesn’t support single-track recording, but you can use Reaper or APP2Clap with a virtual audio device.

To simplify synchronization:

Join as director, but remove autorecord.
Wait for everyone to join and check audio.
When ready, press Alt+D to edit the address bar.
Add &autorecord, reload the page, and confirm rejoining.
Recording now starts simultaneously for everyone.
Verify this in your downloads.

Manual Recording

To start recording manually:

Open Room settings menu.
Go to the Room settings heading.
Click Local record – start all.
Check PCM recording (saves WAV uncompressed).
Check Audio only (records sound without video).
Click Start recording.

Important Recording Notes

Always verify that all guest streams are recording.
To end recordings safely, click Hangup for each guest or let them leave.
You can also toggle recording for each guest under More options → Record.
Files are saved as WEBM containers. If your editor doesn’t support it, you can convert them using the official converter.
Reaper can open WEBM files but may have editing issues — I prefer importing the OPUS audio file instead.

Peter Vágner

in reply to ondrosik 1 month ago from RaccoonForFriendica

@ondrosik I like the fact it's fairy easi to self host. Thanks for Researching all the options and blindness specific use cases.

@ondrosik

Peter Vágner reshared this.

Bri😻

1 month ago

Bri😻
1 month ago

Announcing AudioCapture. A win32 application to capture audio from a process and save it to an audio file. Full disclosure: This was written with Claude Code. Why? Because I'm not an experienced c++ programmer, however I saw an idea for an app and no one else was going to write it, so I did it myself this way. The full code is available, so if you wish to contribute, feel free. github.com/masonasons/AudioCap…

GitHub - masonasons/AudioCapture: A win32 app to capture audio from specific processes to an audio file

A win32 app to capture audio from specific processes to an audio file - masonasons/AudioCapture

^GitHub

This entry was edited (1 month ago)

ondrosik likes this.

reshared this

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

Um, what just happened? this went from, select some process and then select another process and things will record separately to... This? This is like a fully-fledged crazy thing that just took on a life of it's own. No monitor passthrough last time I checked. What happened? Totally here for it though lol

in reply to Andre Louis

JamminJerry

in reply to Andre Louis 1 month ago

@FreakyFwoof I really like it to. This saves me from changing E.Q. settings on my mixer making things sound bad threw my speakers, when I want to record something. this is just really awesome!

@Andre Louis

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof Oh, yeah, that's just me. Going crazy as usual. Lol

@Andre Louis

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

Windows is being rude again, claiming the exe has a virus. So irritating...

in reply to Andre Louis

Majid Hussain

in reply to Andre Louis 1 month ago

@FreakyFwoof ummmm, output audio in opus sounds really really slowed down,
not sure why?
setup, using chrome as input, output format is opus highest bitrate.
not sure if this issue happens with the other formats.

@Andre Louis

in reply to Andre Louis

Majid Hussain

in reply to Andre Louis 1 month ago

@FreakyFwoof .wav audio output is ok
.mp3 output fails to even record
not tryed flak.

also, just the chrome process has been selected no other and yet nvda can be herd in the recording loud and clear???

@Andre Louis

in reply to Majid Hussain

Andre Louis

in reply to Majid Hussain 1 month ago

@mhussain OO yeah, just testing that and I notice same. System audio is captured as well, because my capslock comes through, as well as open/close program sounds.

@Majid Hussain

in reply to Majid Hussain

Bri😻

in reply to Majid Hussain 1 month ago

@mhussain @FreakyFwoof Oh my! I'll look into the weird pitched Audio thing. re the system audio, unfortunately this just means whatever version of windows 10 you're using doesn't support per process audio capture.

@Andre Louis @Majid Hussain

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

@mhussain Odd, I'm on Windows 10 22H2 (AMD64) build 19045.6396.
Surprising that wouldn't support it.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain So here's the weird bit. Some people get it to work on that build, some don't. I haven't personally tried it yet. Maybe that'll be part of today's doodling.

@Andre Louis @Majid Hussain

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

@mhussain I hope you can, because I really want to know what might be different about various builds. The machine is up to date, all security this that and the other, etc.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain Oh I have a machine right over there running that build.

@Andre Louis @Majid Hussain

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

@mhussain I noticed I couldn't select the mic only to monitor without selecting some process as well, so I could select a process I know wouldn't capture audio, like Paperback lol

@Majid Hussain

in reply to Andre Louis

Andre Louis

in reply to Andre Louis 1 month ago

@mhussain But I think you can't do mic passthrough from one interface to another, right? I mean sure you can do that in windows directly anyway so not the end of the world.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain Oh you sure can, it's just not enabled in this program, but easily could be

@Andre Louis @Majid Hussain

in reply to Bri😻

Bri😻

in reply to Bri😻 1 month ago

@FreakyFwoof @mhussain Oh, yeah, you're not actually meant to do that, would you find that to be useful?

@Andre Louis @Majid Hussain

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

@mhussain I thought I would, then I kind of answered my own question so... Not sure. But maybe the ability to combine two or more input sources together. IE iPhone plugged into one sound card and mic in another, easy to do tutorials that way.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain I did want to make a second program specific to audio devices, sort of an audio repeater/recorder type of deal, for audio devices specifically. I feel like if I combine too many things into a single program it might get confusing. Thoughts?

@Andre Louis @Majid Hussain

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

@mhussain Possibly, or it's done with a simple vs advanced button, or even in separate tab sheets.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain I did just think of redesigning that microphone selector into one of those checkbox list views, though.

@Andre Louis @Majid Hussain

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

@mhussain Checkboxes vs list, probably the best of both worlds.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain Hmm, or maybe I could somehow tell it to make input devices show up in the processes list, so you could just select them as sources.

@Andre Louis @Majid Hussain

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

@mhussain So much choice haha

@Majid Hussain

in reply to Andre Louis

Andre Louis

in reply to Andre Louis 1 month ago

@mhussain Also on win 10 try downloading from github, see if you get the virus alert from defender. If you can solve that, probably better than everything else we discussed combined.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain I hear that if you disable the cloud delivered protection, it makes the false positives all but go away. Aside from that, though, I don't think there's much Can I can do.

@Andre Louis @Majid Hussain

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

@mhussain Aah I didn't know I could do that.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain Yep, if you go into windows security, then virus and threat protection, and then virus and threat protection settings, it's in there.

@Andre Louis @Majid Hussain

in reply to Bri😻

Andre Louis

in reply to Bri😻 1 month ago

@mhussain Thanks. Will try disabling then.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain No worries!

@Andre Louis @Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain I'm working on finishing my coffee here and I'll go to the computer and see if I can get this sources thing working.

@Andre Louis @Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

@FreakyFwoof @mhussain The more I think about it, I like the sources idea. The processes list can just be a list of sources that you can choose from.

@Andre Louis @Majid Hussain

in reply to Andre Louis

Majid Hussain

in reply to Andre Louis 1 month ago

@FreakyFwoof I have streao mix enabled could that be it?

@Andre Louis

in reply to Majid Hussain

Andre Louis

in reply to Majid Hussain 1 month ago

@mhussain I have multiple soundcards and that's not the case for me.

@Majid Hussain

in reply to Andre Louis

Bri😻

in reply to Andre Louis 1 month ago

Revisiting this thread. For both of you on Windows 10, could you try the latest release of AudioCapture and see if process capture works for you? I've discovered a bug with windows 10 where the process loopback capture doesn't release properly, and unfortunately this means that the process can only be captured one time. If you uncapture the process, you either have to restart the process, or AudioCapture itself. github.com/masonasons/AudioCap…

This entry was edited (1 month ago)

in reply to Andre Louis

JamminJerry

in reply to Andre Louis 1 month ago

@FreakyFwoof I wonder how I can get windows to knock that shit off now that I am using avast security? I know the antivirus is shut off in windows, but that isn't what it yelling at me. it says windows has protected your computer, and I have to hit more, and then run anyway. oh sure, it won't bug me again, but still. I don't want windows doing this.

@Andre Louis

in reply to Andre Louis

Brandon

in reply to Andre Louis 1 month ago

Is the executable signed? I have a command I run on the executable before I release it so defender apparently doesn’t flag it. Don’t know if it works though. Install windows SDK, just in case someone needs it, here is the command. powershell:
& 'C:\Program Files (x86)\Windows Kits\10\bin\10.0.26100.0\x64\signtool.exe' sign /fd SHA256 /tr timestamp.digicert.com /td SHA256 /a '.\iptvclient.exe'
If your in CMD:
"C:\Program Files (x86)\Windows Kits\10\bin\10.0.26100.0\x64\signtool.exe" sign /fd SHA256 /tr timestamp.digicert.com /td SHA256 /a ".\iptvclient.exe"

This entry was edited (1 month ago)

Peter Vágner reshared this.

ondrosik

1 month ago from TWBlue

ondrosik
1 month ago from TWBlue

Thanks to #NVDA #NVDASR ocr feature, I installed Spitfire audio and BBC symphony and I can somehow navigate through it inside reaper. KK will be more efficient but I currently don't have one.

#nvdasr #nvda

Peter Vágner reshared this.

ondrosik

1 month ago from TWBlue

ondrosik
1 month ago from TWBlue

my newest mashup audiopub.site/listen/31b60907-…

like this

reshared this

in reply to ondrosik

Peter Vágner

in reply to ondrosik 1 month ago from RaccoonForFriendica

@ondrosik Damn, that sounds great. There is too much singing in that track by Dune so I think it was difficult to find a spot where to mix the vocals from the other track.
Still I like it.
Thanks for sharing!

@ondrosik

in reply to Peter Vágner

ondrosik

in reply to Peter Vágner 1 month ago from TWBlue

@Peter Vágner well, harder was to find eq, delay and echo to fit the original vocals.

@Peter Vágner

Peter Vágner reshared this.

ondrosik

1 month ago from TWBlue

ondrosik
1 month ago from TWBlue

Do you know that you can use Subtitle edit to transcribe audio? It has a relatively accessible guy so you can use Purfwiev's faster whisper xxl, cpp, cpp cublas, const-me. Longer post how to use it follows:

Installing Subtitle Edit

Download the program from the developer’s website. Navigate to the level 2 heading labeled “Files.”
If you want to install Subtitle Edit normally, download the first file, labeled setup.zip.
There is also a portable version available, labeled SE_version_number.zip.

If you decide to use the portable version, extract it and move on to the next section of this article. The installation itself is standard and straightforward.

A Note on Accessibility

NVDA cannot automatically obtain focus in lists.
To find out which item in the list is currently selected, move down with the arrow key to change the item, then press NVDA+TAB to hear which one is focused.

Initial Setup

In the menu bar, go to Video and activate Audio to text (Whisper).
When using this feature for the first time, the program may ask whether you want to download FFMPEG. This library allows Subtitle Edit to open many audio and video files, so confirm the download by pressing Yes.
Subtitle Edit will confirm that FFMPEG has been downloaded and then ask whether you want to download Purfwiev’s Faster Whisper – XXL. This is the interface for the Whisper model that we’ll use for transcription, so again confirm by pressing Yes.
The download will take a little while.
Once it’s complete, you’ll see the settings window. Press Tab until you reach the Languages and models section. In the list, select the language of your recording.
Press Tab to move to the Select model option, and then again to an unlabeled button.
After activating it, choose which model you want to use. Several models are available:
- Small models require less processing power but are less accurate.
- Large models take longer to transcribe, need more performance and disk space, but are more accurate.
  I recommend choosing Large-V3 at this step.
Wait again for the model to finish downloading.

Transcribing Your First Recording

Navigate to the Add button and press Space to activate it.
A standard file selection dialog will open. Change the file type to Audio files, find your audio file on the disk, and confirm.
Activate the Generate button.
Now, simply wait. The Subtitle Edit window doesn’t provide much feedback, but you can tell it’s working by the slower performance of your computer—or, if you’re on a laptop, by the increased fan noise.
When the transcription is done, Subtitle Edit will display a new window with an OK button.

We Got Subtitles, So One More Step

In the folder containing your original file, you’ll now find a new file with the .srt extension.
This is a subtitle file—it contains both the text and the timing information. Since we usually don’t need timestamps for transcription, we’ll remove them in Subtitle Edit as follows:

Press Ctrl+O (or go to File → Open) to bring up the standard open file dialog. Select the .srt file you just got.
In the menu bar, open File → Export → Plain text.
Choose Merge all lines, and leave Show line numbers and Show timecode unchecked.
Press Save as and save the file normally.

If you’re transcribing multiple recordings, it’s a good idea to close the current subtitle file by starting a new project using Ctrl+N or by choosing File → New.

Conclusion

Downloaded models can, of course, be reused, so future transcriptions will go faster.
In this example, I used Purfwiev’s Faster Whisper. If you want to use a different model, you can select it from the model list, and Subtitle Edit will automatically ask whether you’d like to download it.

Peter Vágner likes this.

Peter Vágner reshared this.

Mike Gorse

1 month ago

Mike Gorse
1 month ago

I decided to write a post where I talk about my experiences finding work as a blind person and attempt to give some general advice to blind people who are either looking for work or looking for a position that better aligns with their goals or values. I'm not sure why the strange URL; hopefully it doesn't cause problems. mikegorse.substack.com/p/4834h…

Me, the disabled employee

For various reasons, finding gainful employment as a disabled person tends to be difficult for most people, so I wanted to write down my experiences.

^{Mike Gorse (Chronicles from an unstable era)}

Peter Vágner likes this.

Peter Vágner reshared this.

Unknown parent

Mike Gorse

Unknown parent 1 month ago

@startrek2025
I'm sorry that you all are stuck in the middle like this, with this disfunctional government that we have.

@David Dunphy

in reply to Mike Gorse

David Dunphy

in reply to Mike Gorse 1 month ago

Everyone wants to blame Trump, and he's by no means blameless, but they're all guilty for not considering the consumers. And to shit on the people who work for you, GTFO

Peter Vágner reshared this.

Jamie Teh

1 month ago

Jamie Teh
1 month ago

On my AMD Ryzen 7 8845HS mini PC, NVDA is a bit sluggish in some cases in Firefox; e.g. cursoring through messages in Gmail folders. For reasons I don't fully understand, setting the processor affinity to a single CPU core and setting the process priority to "above normal" helps significantly, even when the CPU is nearly idle. I don't currently have the time/energy to debug the root cause for this or write a proper add-on, but I wrote an NVDA global plugin to make the change for me automatically when NVDA starts. If it breaks something, you get to keep all the pieces.
```
import ctypes

import globalPluginHandler

class GlobalPlugin(globalPluginHandler.GlobalPlugin):
def __init__(self, *args, **kwargs):
super().__init__(*args, **kwargs)
p = ctypes.c_void_p(ctypes.windll.kernel32.GetCurrentProcess())
ctypes.windll.kernel32.SetProcessAffinityMask(p, ctypes.c_void_p(1))
ctypes.windll.kernel32.SetPriorityClass(p, ctypes.wintypes.DWORD(0x00008000))
```
#nvdasr

#nvdasr

This entry was edited (1 month ago)

reshared this

in reply to Jamie Teh

Brandon Tyson

in reply to Jamie Teh 1 month ago

Is this setting the priority for NVDA or Firefox?

in reply to Brandon Tyson

Jamie Teh

in reply to Brandon Tyson 1 month ago

@BTyson NVDA.

@Brandon Tyson

Peter Vágner reshared this.

ondrosik

1 month ago from Enafore

ondrosik
1 month ago from Enafore

this year i slovly get back to composing music again: audiopub.site/listen/4c8ae150-…

Peter Vágner likes this.

Peter Vágner reshared this.

European Commission

1 month ago

European Commission
1 month ago

Sending and receiving money has just become a whole lot easier!

Instant payments are now available to everyone in the eurozone.

⚡ Instant transfers 24/7, no waiting days for your money
💰 No extra fees, same price as regular payments
🔍 Free payee verification, ensuring IBAN and name match before sending
🛡️ Safer payments with daily checks to help prevent fraud and sanctions risks
🏦 More access, not just for banks but also fintechs and e-money institutions

For faster, safer payments than ever!

Illustration of two stacks of gold euro coins. The left stack shows some coins moving toward the right stack, indicating a transfer. Both stacks are labeled “11:00,” suggesting the transaction happens instantly. Below, the text reads “Instant payments are in,” and there’s a small European Union flag in the bottom right corner.

reshared this

Peter Vágner reshared this.

Bri😻

1 month ago

Bri😻
1 month ago

Guys! Guys I have a new mashup. It's called, Girls Just Want Black Magic.

Peter Vágner reshared this.

in reply to Bri😻

Tamas G

in reply to Bri😻 1 month ago

ahaha now that's a trippy one, great work there. The styles really match well on this to blend together so well.

in reply to Tamas G

Bri😻

in reply to Tamas G 1 month ago

@Tamasg Yeah, I like it!

@Tamas G

Peter Vágner reshared this.

Amir

1 month ago

Amir
1 month ago

I'm using Paperback more and more these days. Unlike QRead, which I still need for Bookshare DAISY files, Paperback removes all the unnecessary blank lines between paragraphs and pages, and you can't imagine how nice that feels! It also loads books incredibly fast, no matter how large your EPUB or PDF files are. Huge kudos to the developer, and here's hoping DAISY support gets added soon so I can fully switch over.
github.com/trypsynth/paperback
@TheQuinbox

GitHub - trypsynth/paperback: An accessible, light-weight, cross-platform ebook and document reader.

An accessible, light-weight, cross-platform ebook and document reader. - trypsynth/paperback

^GitHub

@Quin

Peter Vágner reshared this.

in reply to Amir

Cleverson

in reply to Amir 1 month ago

Can you say how it compares to Bookworm today?

in reply to Cleverson

Amir

in reply to Cleverson 1 month ago

@clv0 As I see it now, Bookworm supports DAISY books, and I'm afraid it's the only advantage. But even its DAISY support is half-baked, meaning, for instance, activating hyperlinks inside DAISY books doesn't work at all. Honestly - and with all respect, Bookworm sounds like an abandoned project to me. It hasn't received a single commit on Github over the past couple months or so, and no one is responsible to handle its long-standing bugs or issues. There's no clear roadmap for its upcoming releases - if any. It doesn't handle bookmarks and notes well either, so you can't easily move books to different machines along with your notes and bookmarks. All in all, I don't like Bookworm as things stand right now.
@TheQuinbox

@Quin @Cleverson

in reply to Amir

Cleverson

in reply to Amir 1 month ago

OK, does Paperback support OCR already? I find Bookworm's OCR implementation great; it recognizes an entire book nearly perfectly within seconds.

in reply to Cleverson

Amir

in reply to Cleverson 1 month ago

@clv0 As far as I know, no. But it seems to be planned. I also had no luck with Bookworm's OCR. It's good, but nowhere near solutions like AABBYY FineReader, Omnipage or even the older Kurzweil 1000. @TheQuinbox

@Quin @Cleverson

Peter Vágner reshared this.

Štěpán Škorpil

1 month ago

Štěpán Škorpil
1 month ago

To jsem na sebe zase upletl bič. Přihlásil jsem si přednášku na #LinuxDays2025. Je už zítra. Doufám, že dcerka dnes bude spinkací, jinak budou slajdy zejtra dost děravé.
Jinak kdybyste mě chtěli slyšet koktat něco o tom co selhostinguju a jak to jde, můžete se stavit ve 13h do posluchárny 105 na FIT ČVUT v Dejvicích. 😬

#linuxdays2025

Peter Vágner reshared this.

in reply to Štěpán Škorpil

Schmaker

in reply to Štěpán Škorpil 1 month ago

Však si to můžeš večer zkusit nanečisto s náma :)

in reply to Schmaker

Archos

in reply to Schmaker 1 month ago

@schmaker Teď jsem to chtěl napsat 😂
@stepan

@Štěpán Škorpil @Schmaker

in reply to Schmaker

Štěpán Škorpil

in reply to Schmaker 1 month ago

@schmaker nebo rozdám úkoly a slajdy společně dobastlíme. Delegovat! Delegovat! 😀

@Schmaker

in reply to Štěpán Škorpil

Štěpán Škorpil

in reply to Štěpán Škorpil 1 month ago

V postýlce se ozve zakňučení a mě projede hlavou:
"Ty kojenče lež a nevstávej"

Záběr na vstávajcího umrlce z filmové Kytice v podání Jiřího Schmitzera

Peter Vágner reshared this.

🇨🇦Samuel Proulx🇨🇦

1 month ago

🇨🇦Samuel Proulx🇨🇦
1 month ago

If you're looking for unspoken-ng for the 64-bit alphas of the #NVDA#screenreader, that now lives here: github.com/fastfinge/unspoken-ng/releases/download/v1.0.2/Unspoken-ng-1.0.2.nvda-addon#blind#accessibility#a11y

#a11y #Accessibility #blind #screenreader #nvda

Peter Vágner reshared this.

Mike Ely

1 month ago

Mike Ely
1 month ago

Network admins who disable ICMP: do you also take the numbers off the front of your house to keep the burglars out? #sysadmin

#sysadmin

reshared this

in reply to Mike Ely

Blurry Moon

in reply to Mike Ely 1 month ago

does ipv6 even work correctly without icmp, in my experience stuff starts breaking

in reply to Blurry Moon

feld

in reply to Blurry Moon 1 month ago

Yeah you need (some of) these. 1, 2, 3, 135, and 136. If you use RADVD for addressing you also need 133 and 134

iana.org/assignments/icmpv6-pa…

Internet Control Message Protocol version 6 (ICMPv6) Parameters

^www.iana.org

This entry was edited (1 month ago)

Peter Vágner reshared this.

aaron

1 month ago

aaron
1 month ago

If Google is killing sideloading, then Android is just iOS with ads and spyware. Why the hell would anyone choose that?
fireborn.mataroa.blog/blog/why…
#Android #Google #Sideloading #FOSS #Privacy #accessibility

Why the Hell Does Android Even Exist Anymore? — fireborn

^{fireborn.mataroa.blog}

#Accessibility #android #foss #google #privacy #sideloading

reshared this

in reply to aaron

these machines will destroy US.

in reply to aaron 1 month ago

Google......many devices are available without Google services or can be flashed to use Android without Google Services....this still makes the platform way more open and simultaneously secure than Apple devices.

in reply to aaron

Mormegil

in reply to aaron 1 month ago

This is exactly what I wrote to them in the feedback form (iPhone have never been an option to me because its walled garden rules; as soon as that distinction disappears, why would I use Android?). But I don't believe that feedback from nobodys could change anything.

Peter Vágner reshared this.

Aaron Espinoza

1 month ago

Aaron Espinoza
1 month ago

For the past 3 years, I’ve been part of the Google Accessibility Trusted Tester program, where people with disabilities test unreleased products and give direct feedback to engineers. In my article, I share how the program works, how I’ve maximized participation, the experience I’ve gained, and why more companies should follow Google’s lead in compensating disabled testers.
linkedin.com/pulse/my-experien…

The Google Accessibility Trusted Tester program gives participants the chance to try new Google products before they are released to the public. Testers then provide feedback directly to Google’s engineering teams.

^{Aaron Espinoza (www.linkedin.com)}

Seirdy likes this.

reshared this

Peter Vágner reshared this.

LibreOffice

1 month ago

LibreOffice
1 month ago

LibreOffice Podcast, Episode #5 – Accessibility in Free and Open Source Software: peertube.opencloud.lu/w/wwwjD9…

reshared this

Peter Vágner reshared this.

Rik Schennink

1 month ago

Rik Schennink
1 month ago

I just pushed a new edit.video release. 🔥

- Faster video processing on modern browsers with Mediabunny
- New output quality controls
- Improvements to video trimming controls
- Optionally load demo video
- Drag n' drop to edit video files
- Sticky fullscreen mode

100% Free

Edit • Video

No ads, no popups, no cookies, no account. The fastest way to edit video online

^{Edit • Video}

Peter Vágner reshared this.

in reply to Rik Schennink

FediVerseExplorer

in reply to Rik Schennink 1 month ago

Very interessting, thanx.
I tried various edits, but none of them could be saved. It gets stuck at this point. (Firefox on Android 16)

Peter Vágner reshared this.

Quin

1 month ago

Quin
1 month ago

Today I discovered that our current year is a perfect square. 45*45 = 2025. The last time this happened was in 1936, and it won't happen again until 2116. Happy Monday!

reshared this

Peter Vágner reshared this.

Soren Stoutner

1 month ago

Soren Stoutner
1 month ago

This is a concern for anyone using F-Droid, including users of Privacy Browaer Android.

F-Droid and Google's Developer Registration Decree | F-Droid - Free and Open Source Android App Repository – f-droid.org/en/2025/09/29/goog…

F-Droid and Google's Developer Registration Decree | F-Droid - Free and Open Source Android App Repository

For the past 15 years, F-Droidhas provided a safe and secure haven for Android users around the world tofind and install free and open source apps. When cont...

^f-droid.org

Peter Vágner reshared this.

in reply to Soren Stoutner

Milk🥛

in reply to Soren Stoutner 1 month ago

Здраствуйте,мне кажется они всегда хотели больше контроля,и хорошо что есть #opensource приложение,я не могу пожертвовать по личным причинам, но я занимаюсь продвижением #deltachat и возможно ещё буду #mastodon
трекеров всё больше и больше,в одной простой игре может быть 30+ трекера...😔

#opensource #Mastodon #deltachat

Peter Vágner reshared this.

StroongeCast

1 month ago

StroongeCast
1 month ago

#introduction:

Welcome to #StroongeCast, a husband and wife team consisting of Andre and Kirsten Louis who live in London. On this podcast, we explore anything that makes us question the world—from relationships and parenting to school memories and beyond. Join our family chats for lively discussions, fun stories, and plenty of curious moments.

Going forward we will post all new episode links to this account before any others.

Subscribe: onj.me/stroongecast

Feel free to follow our main accounts here on the fediverse as well.
Andre: @FreakyFwoof
Kirsten: @MoonCat

Onj.Me

^onj.me

#introduction #stroongecast @Andre Louis @Kirsten Louis

This entry was edited (1 month ago)

Peter Vágner reshared this.

in reply to StroongeCast

StroongeCast

in reply to StroongeCast 1 month ago

Many things end long before the one year mark. Easy to start a show, think 'yeah that'll work' but life gets in the way, you can't commit etc etc, but we've stuck with it and I'm proud we have. Really enjoyed doing this and we have no plans to stop.
Here's to another year of #StroongeCast.

#stroongecast

Unknown parent

StroongeCast

Unknown parent 1 month ago

@falcennial No, that's by design. It's so you can follow the podcast on your platform of choice. It can't load a particular episode because there are already 54 of them which we've already copy/pasted from the original episode posts. If you check our timeline, you'll see all of them.
Edit: Just updated our bio to reflect links to our youtube playlist, Apple Podcasts and also Spotify.
A

@millennial fulcrum

This entry was edited (1 month ago)

in reply to StroongeCast

StroongeCast

in reply to StroongeCast 1 month ago

Did you know: There are many completely *free* sample libraries these days, some could help you end up in writing for your next film, documentary, podcast or TV show?

Andre wrote this piece of intro music for the podcast using purely *free* libraries and nothing else.

#InspiredBySound - Musical Breakdown - #StroongeCast Podcast Music: youtu.be/Ra7prQ6d9Pw

#InspiredBySound - Musical Breakdown - StroongeCast Podcast Music

This piece of music is for the podcast 'StroongeCast' I co-created with my wife Kirsten. Every podcast needs a musical intro (or at least I believe so) and t...

^YouTube

#inspiredbysound #stroongecast

This entry was edited (1 month ago)

reshared this

in reply to StroongeCast

StroongeCast

in reply to StroongeCast 1 month ago

We record #StroongeCast episodes on Thursdays consistently now, meaning that youtube members can listen early if they so wish.
If we can find a way to do that for everyone else, we will. Until then however, if you want your fix a day early, please become a youtube member at any level.
youtube.com/@TheOnjLouis/join

Andre Louis

Hi, how are you doing? Thanks for visiting my channel, firstly. A bit about me then. I'm Andre Louis (AKA Onj), a visually impaired musician from London. What will you find here? Videos on this channel fall into a few categories.

^YouTube

#stroongecast

StroongeCast reshared this.

Peter Vágner

1 month ago

Peter Vágner
1 month ago

2 / 2: Did you know @GNOME Files aka #nautilus has a nifty feature where it can move selected files into a newly created folder? #ScreenReader #a11y is preserved.
In order to use it just select multiple files and find Move to new folder item in the shift+F10 popup menu.

#a11y #screenreader #nautilus @GNOME

Peter Vágner

1 month ago

Peter Vágner
1 month ago

1 / 2: Did you know @GNOME Files aka #nautilus has a nifty feature where it can batch rename files? Advanced features include adding sequential numbering, using placeholders and doing search and replace on the names of selected files. #ScreenReader #a11y is preserved.
In order to use it just select multiple files and find Rename item in the shift+F10 popup menu or simply press F2. Also... Don't be shy to press the add button in the batch rename dialog.

#a11y #screenreader #nautilus @GNOME

Peter Vágner likes this.

in reply to Peter Vágner

Lukáš Tyrychtr

in reply to Peter Vágner 1 month ago

Yeah, this one is not very well discoverable, I first found it actually in the Nautilus source code.

Peter Vágner reshared this.

Ivan Soto

1 month ago

Ivan Soto
1 month ago

Gauging user interest! Over the last few weeks I have been working on a website that can auto describe YouTube videos. I am aware that the solution has already been created for windows, however, I don’t want to leave mobile users and non-Windows users out in the cold. I have already experimented with a desktop app and it works quite well. Would this be of interest to the community?

This entry was edited (1 month ago)

reshared this

in reply to Ivan Soto

André Polykanine

in reply to Ivan Soto 1 month ago

Yes!

Peter Vágner reshared this.

Jan Schaumann

1 month ago

Jan Schaumann
1 month ago

libxml2's sole maintainer Nick Wellnhofer steps down, meaning libxml2 is now no longer maintained.

discourse.gnome.org/t/stepping…

It's hard to estimate just how many companies depend on this software and critical security updates to the library, so I'm certain many will quickly step up and offer sponsorship to ensure a fundamental dependency doesn't just deteriorate without proper support.

Any day now.

Stepping down as libxml2 maintainer

I’m stepping down as maintainer of libxml2 which means that this project is more or less unmaintained for now. I will fix regressions in the 2.15 release until the end of 2025.

^{GNOME Discourse}

reshared this

Peter Vágner reshared this.

Elijah Massey

1 month ago

Elijah Massey
1 month ago

It turns out that you can already run GUI Linux programs in the new Linux terminal app on Android 16, before Google releases the official GUI support. First I switched the audio system to pipewire in the VM by installing the pipewire-audio package, then I installed xrdp (an RDP server for X11), and pipewire-modules-xrdp, for audio support. Then I installed mate-desktop-environment and orca, enabled accessibility in Mate with "gsettings set org.mate.interface accessibility true", and enabled Orca to start automatically with "gsettings set org.gnome.desktop.a11y.applications screen-reader-enabled true". Then I set the password for the default "droid" user with "sudo passwd droid", and created ~/.xinitrc with "#!/bin/sh" and "mate-session", and made it executable with "chmod +x ~/.xinitrc"" After doing all of this, I pressed the third unlabeled button in the Terminal app to open its menu, went to "Port control" and enabled port 3389. Then I installed Windows App from the Play Store and I added a PC with hostname 127.0.0.1, and added a user with the name "droid" and the password I set. When I connected to it, Orca started speaking, and after turning TalkBack off by holding the volume keys, I could control the Linux system with my Bluetooth keyboard, including using the Control and Alt keys, and after putting Orca in laptop mode (by running "orca -s" to open the preferences dialog), I could perform Orca commands with the caps lock key, although sometimes it types a letter instead and it toggles Android's caps lock state (which is separate from Linux's), but pressing caps lock once toggles it off again.

#bin/sh

Peter Vágner reshared this.

in reply to Elijah Massey

Cleverson

in reply to Elijah Massey 1 month ago

Very cool and interesting; thanks!

in reply to Cleverson

Peter Vágner

in reply to Cleverson 1 month ago from RaccoonForFriendica

@Cleverson @Elijah Massey Thanks for the nice summary on how to make it work.
It's time for me to buy a OTG cable to hook ordinary keyboard into my phone.

@Cleverson @Elijah Massey

Peter Vágner reshared this.

Guus der Kinderen

1 month ago

Guus der Kinderen
1 month ago

The XMPP Interop Testing project helps ensure XMPP servers and clients play nicely together by providing specification test automation.

New update:
✔️ Option to fail runs if some tests were "impossible" to execute.
✔️ Flexible account provisioning

Details on the blog: xmpp-interop-testing.github.io…

The development journey that @fishbowler and I have been taking was made possible by a grant from @nlnet 🙏. The grant has now concluded, and we’re deeply thankful for their support!

#XMPP #interop #testing

XMPP Interop Testing

^{XMPP Interop Testing}

#xmpp #testing #interop @Dan Caseley @NLnet

reshared this

Peter Vágner reshared this.

Jiří Eischmann

1 month ago

Jiří Eischmann
1 month ago

We had to start charging the electric car from the grid this week due to bad weather and shorter days, but between this week and mid-April, when we got the car, we charged it exclusively from solar power. Six months and six thousand kilometers. Not bad.

#ElectricVehicle #electromobility #renewables #photovoltaic #renewableenergy

#electricvehicle #renewables #renewableenergy #electromobility #photovoltaic

Peter Vágner reshared this.

in reply to Jiří Eischmann

Schmaker

in reply to Jiří Eischmann 1 month ago

That would be 18k CZK in gasoline for me! Impressive

in reply to Schmaker

Jiří Eischmann

in reply to Schmaker 1 month ago

@schmaker before getting an electric car we paid 2.5-3k CZK for gasoline per month. The average petrol consumption cost was 3 CZK/km for the petrol car. When I charge from the solar system I only count the cost of missed opportunity for not being able to sell the energy to the grid. It's 1.25 CZK/kWh and with the average EV consumption of 15 kWh/100 km the cost per km is 0.2 CZK. 15 times lower.

@Schmaker

in reply to Jiří Eischmann

Jachym Cepicky

in reply to Jiří Eischmann 1 month ago

We have two electric cars now. We need to extend the powerplant.

12kkm 🤦

Peter Vágner reshared this.

Benedict

1 month ago

Benedict
1 month ago

Anyone using #Tammy as a #Matrix messenger client?

What do you think about it? Why do you use it? How much do you use it? On which device type is it installed? What features do you miss?

Yes. (15%, 3 votes)
No, but already tried it. (5%, 1 vote)
No, but heard about it. (15%, 3 votes)
No, never heard about it. (65%, 13 votes)

20 voters. Poll end: 1 month ago

#matrix #tammy

Peter Vágner reshared this.

in reply to Benedict

FediVerseExplorer

in reply to Benedict 1 month ago

First of all: Thank you for your work!
I use it on Android alongside with other #Matrix Clients.
So far I remember, the onboarding was good. Generelly it feels fast (after) sync.
The chatlist items, for me, could have a little bit more padding. The gui with the bubble style in the Chats I don't like much.
It's nice to have some settings for gui. The accent color could have more options.

#matrix

Peter Vágner reshared this.

aaron

1 month ago

aaron
1 month ago

After a short break, I’m returning to accessibility training services.

I provide one-on-one training for blind and visually impaired users across multiple platforms. My teaching is practical and goal-driven: not just commands, but confidence, independence, and efficient workflows that carry into daily life, study, and work.

I cover:
iOS: VoiceOver gestures, rotor navigation, Braille displays, Safari, text editing, Mail and Calendars, Shortcuts, and making the most of iOS apps for productivity, communication, and entertainment.
macOS: VoiceOver from basics to advanced, Trackpad Commander, Safari and Mail, iWork and Microsoft Office, file management, Terminal, audio tools, and system upkeep.
Windows: NVDA and JAWS from beginner to advanced. Training includes Microsoft Office, Outlook, Teams, Zoom, web browsing, customizing screen readers, handling less accessible apps, and scripting basics.
Android: TalkBack gestures, the built-in Braille keyboard and Braille display support, text editing, app accessibility, privacy and security settings, and everyday phone and tablet use.
Linux: Orca and Speakup, console navigation, package management, distro setup, customizing desktops, and accessibility under Wayland.

Concrete goals I can help you achieve:
Set up a new phone, tablet, or computer
Send and manage email independently
Browse the web safely and efficiently
Work with documents, spreadsheets, and presentations
Manage files and cloud storage
Use social media accessibly
Work with Braille displays and keyboards
Install and configure accessible software across platforms
Troubleshoot accessibility issues and build reliable workflows
Make the most of AI in a useful, productive way
Grow from beginner skills to advanced, efficient daily use

I bring years of lived experience as a blind user of these systems. I teach not only what manuals say, but the real-world shortcuts, workarounds, and problem-solving skills that make technology practical and enjoyable.

Remote training is available worldwide.

Pricing: fair and flexible — contact me for a quote. Discounts available for multi-session packages and ongoing weekly training.

Contact:
UK: 07447 931232
US: 772-766-7331
If these don’t work for you, email me at aaron.graham.hewitt@gmail.com

If you, or someone you know, could benefit from personalized accessibility training, I’d be glad to help.

#Accessibility #Blind #VisuallyImpaired #ScreenReaders #JAWS #NVDA #VoiceOver #TalkBack #Braille #AssistiveTechnology #DigitalInclusion #InclusiveTech #LinuxAccessibility #WindowsAccessibility #iOSAccessibility #AndroidAccessibility #MacAccessibility #Orca #ATTraining #TechTraining #AccessibleTech

This entry was edited (1 month ago)

reshared this

in reply to aaron

NV Access

in reply to aaron 1 month ago

Welcome back - hope it was a good break!

Are you (or would you consider becoming) an NVDA Certified Expert for your NVDA work? We publish a list of those who are, with contact details for those who would like to share them / provide services, and it's usually the first place I look when someone asks for a local contact: certification.nvaccess.org/

in reply to NV Access

aaron

in reply to NV Access 1 month ago

@NVAccess absolutely something I'm going to look into.

@NV Access

Peter Vágner reshared this.

Hacker News 50

2 months ago (Received 1 month ago)

Hacker News 50
2 months ago (Received 1 month ago)

Yt-dlp: Upcoming new requirements for YouTube downloads

Link: github.com/yt-dlp/yt-dlp/issue…
Discussion: news.ycombinator.com/item?id=4…

#youtube

[Announcement] Upcoming new requirements for YouTube downloads

Beginning very soon, you'll need to have Deno (or another supported JavaScript runtime) installed to keep YouTube downloads working as normal. Why? Up until now, yt-dlp has been able to use its bui...

^{bashonly (GitHub)}

#youtube

reshared this

Peter Vágner reshared this.

Terminal Trove

2 months ago

Terminal Trove
2 months ago

lue is a TUI ebook reader with text to speech (TTS) support.

It can read EPUB / DOCX / PDF / TXT / files, supports 100+ languages, highlights words in sync, saves your progress, has themes and more.

Starry Eyes (superstarryeyes on GitHub) made lue using Rich, a Python library by @textualize and is Terminal Tool of the Week! ⭐️

terminaltrove.com/lue/

lue - A TUI ebook reader with Text-to-Speech (TTS). - Terminal Trove

A TUI ebook reader with Text-to-Speech (TTS).

^{terminaltrove.com}

@Textualize

reshared this

Peter Vágner reshared this.

Robin Kipp

2 months ago

Robin Kipp
2 months ago

Here it is, my new, self-hosted home in the #Fediverse running on #GoToSocial! This is really exciting stuff, now I’m truly living the Fedi spirit by supporting decentralization. If you're reading this and don't mind, I'd greatly appreciate a boost of this post to help my tiny new instance discover more servers. Thanks! #NewFedi #FediAdmin #Selfhosting

#fediverse #selfhosting #GoToSocial #fediadmin #newfedi

reshared this

in reply to Talon

Robin Kipp

in reply to Talon 2 months ago

Thanks so much for clarifying and for the explanations! I was under the impression that boosting would indeed result in my server pulling in more posts from other instances, but I guess that was a misconception on my part having seen others doing this after spinning up a new server. I did follow many of the people who boosted though as a way to say thank-you, so perhaps this actually did help extend my server's reach by extension

. GTS does not support relays yet sadly, although it is on their roadmap, I only realized it after I'd already configured most things so there was no turning back. That being said though, I do feel GTS is absolutely the right choice for my single-user scenario, setting up Mastodon or one of its forks would likely be overkill for my small Fedi home. That being said, I did find this project which seems interesting, maybe I should look into setting this up: codeberg.org/tante/hypebot

hypebot

Mastodon bot that boosts trending posts from other instances into your timeline

^Codeberg.org

in reply to Robin Kipp

Talon

in reply to Robin Kipp 2 months ago

GTS is definitely the right choice without a doubt, at least in my opinion.
Yeah this push based thing is something a lot of people struggle with. If a server doesn't send you stuff, you won't get stuff, without external tools. I'm not entirely sure if GTS does context backfilling yet, I used to use fedifetcher for that for example. That's another way to get content in.

Peter Vágner reshared this.

🦜 Ondřej v Nizozemsku | V roce 2020 jsem se přestěhoval do Nizozemska. Abych si pamatoval, co se kdy událo, a abych se naučil s Jekyllem, začal jsem psát blog.

2 months ago

🦜 Ondřej v Nizozemsku | V roce 2020 jsem se přestěhoval do Nizozemska. Abych si pamatoval, co se kdy událo, a abych se naučil s Jekyllem, začal jsem psát blog.
2 months ago

Letos volím poštou

xn--ondej-kcb.v.nizozemsku.nl/2025/09/21/Letos_volim_postou.html

Zatímco v Česku vrcholí horká fáze kampaně, já už mám odvoleno. Spolu s více než dvaceti tisíci krajany jsem se rozhodl letos poprvé vyzkoušet korespondenční hlasování. To se dostalo do novely zákona o správě voleb a umožňuje Čechům žijícím v zahraničí…

Letos volím poštou

^{Ondřej Caletka (Ondřej v Nizozemsku)}

reshared this

Peter Vágner

2 months ago from RaccoonForFriendica

Peter Vágner
2 months ago from RaccoonForFriendica

Yeah, I've updated my @Arch Linux to @GNOME 49.
There are some nifty #a11y related tweaks such as better labelling for gnome shell menus, refreshed settings UI, I like how presentation of various lists e.g. List of wireless networks is presented with screen reader including signal strength.

Thanks to everyone involved for the improvements.

#a11y @GNOME @Arch Linux

in reply to Peter Vágner

Cleverson

in reply to Peter Vágner 2 months ago

Yes, Gnome is quite good now.

Peter Vágner reshared this.

feld

2 months ago

feld
2 months ago

A long time ago I was led to believe that CIFS was a successor to the SMB protocol, but it's not.

Peter Vágner reshared this.

miki

2 months ago

miki
2 months ago

Three things the blind community is known for:

1. Stevie Wonder.

2. those weird dots on your elevator buttons.

3. That guy on Mastodon.

Peter Vágner reshared this.

in reply to miki

Mayowa

in reply to miki 2 months ago

oh lord your refurring to remone

Peter Vágner reshared this.

Micr0byte

2 months ago

Micr0byte
2 months ago

Hi, I'm micr0, the creator of Altbot.

Almost a year ago, your incredible generosity helped us raise the funds to buy the server that Altbot runs on today (locally and privately) It's been operating from my home ever since, and I'm so grateful for the support that made that possible.

But now, the situation has become unsustainable. My home network is under a sustained, targeted DDoS attack aimed at taking Altbot offline. And unfortunatly this isn't just a threat to the bot, it's a serious security and privacy concern for my family.

A lot of people are probabaly going to be asking the same question I did: "Who is doing this?"
but the honest answer is: I don't know, and I likely never will. These attacks are launched through botnets and proxies designed specifically to hide the source. Figuring out the "who" is nearly impossible. The only thing I can do is focus on the "how to stop it."

Running this critical service from a residential address is no longer viable. To protect Altbot and my family, we need to move the server to a professional data center with proper, enterprise DDoS mitigation.

The Goal: $2,880 to cover 12 months of secure colocation.

This will provide a secure, stable home for Altbot with:

Enterprise-grade DDoS protection
99.95%+ uptime guarantee
24/7 monitoring and security
Separation from my personal home network

Donations can be made via:

Ko-fi: ko-fi.com/micr0byte/goal?g=0
GitHub Sponsors: github.com/sponsors/micr0-dev (More direct, Fewer fees)
Ethereum: 0xC992E57236eb9F30E79d0469446a6CF08Be05939

This isn't just about maintaining a service. It's about ensuring that an important accessibility tool remains available for everyone who depends on it, while also protecting my family's privacy and safety.

Please consider supporting if you can. If you're unable to donate, boosts are incredibly valuable for raising awareness.

Thank you for your support and for believing in Altbot's mission.

#Altbot #Accessibility

Sponsor @micr0-dev on GitHub Sponsors

hi i made altbot admin of wetdry.world and owner of fuzzies.wtf also post silly things on mastodon

^GitHub

#Accessibility #altbot

reshared this

⇧

Paweł Masarczyk 1 month ago • •

Andy Greenberg 1 month ago • •

ondrosik 1 month ago from Enafore • •

What Is VDO Ninja

Getting Started

A Few Words About Browsers

Let’s Record a Podcast

How to Join

Controls

Adjusting Input and Output Devices

Director Options

Adjusting Guest Audio

Recording

Manual Recording

Important Recording Notes

Recommended Reading

Bri😻 1 month ago • •

ondrosik 1 month ago from TWBlue • •

ondrosik 1 month ago from TWBlue • •

ondrosik 1 month ago from TWBlue • •

Installing Subtitle Edit

A Note on Accessibility

Initial Setup

Transcribing Your First Recording

We Got Subtitles, So One More Step

Conclusion

Mike Gorse 1 month ago • •

Jamie Teh 1 month ago • •

Paweł Masarczyk
1 month ago

Andy Greenberg
1 month ago

ondrosik
1 month ago from Enafore

Bri😻
1 month ago

ondrosik
1 month ago from TWBlue

ondrosik
1 month ago from TWBlue

ondrosik
1 month ago from TWBlue

Mike Gorse
1 month ago

Jamie Teh
1 month ago