Skip to main content

Peter Vágner reshared this.


Way more interesting and healthy fediverse news is happening in the shadows and is barely getting discussed! Discourse has federation between different instances of itself and other #fediverse software such as Mastodon working!

Attached is a demo video from Angus McLeod via their announcement here: meta.discourse.org/t/activityp…

This entry was edited (10 months ago)

reshared this


Peter Vágner reshared this.


Hey ! We're pleased to announce that Ltt.rs [1], an email (JMAP) client, and Mercurygram [2], a new fork of #Telegram, now support #UnifiedPush. And support is being upstreamed to Telegram-FOSS :)

[1] ltt.rs from @daniel
[2] github.com/drizzt/Mercurygram/ from @timothy

Peter Vágner reshared this.

in reply to UnifiedPush

let's find a few freemailer with jmap support.

I know exactly none. 😇


Peter Vágner reshared this.


In VSCode, you can use ctrl-k, ctrl-b to mark the start of a selection. You can then navigate normally to find the end of what you want selected, and press ctrl-k, ctrl-k. The text is selected. How did I not know this?! I've been shift-arrowing for years!
in reply to André Polykanine

@menelion Yes, I think that's how all the multi-layer keys in VSCode work. I don't use them often, but when I do, I have to hold the control key. NVDA offers similar mark setting, but I've not found it to be too reliable with large blocks of text.
in reply to Alex Hall

With JAWS it works perfectly. Probably because they had that for many years for web browsers and MS Word only, now it's available everywhere.

Peter Vágner reshared this.


I don't think #WebAIM has an account on the fediverse yet, but the 10th iteration of their Screen Ready survey is now live.

The vital #a11y insights created from these surveys help inform our understanding of the technological and usability landscape. This helps to shape how accessible, and importantly usable experiences are created on the web.

If you use a screen reader, I hope you'll please consider filling it out: webaim.org/projects/screenread…

reshared this


Peter Vágner reshared this.


Today I learned you can press windows alt R and record the output of whatever application you're focused in to a video file without capturing other system sound on your computer, press again to stop. #Wow
#wow

reshared this

in reply to André Polykanine

@menelion I think you just hit the same hotkey gain, its a toggle if I remember correctly. I don't use this all that often

Peter Vágner reshared this.


RScan now on Google play! Identify products, swiftly

RScan lets you scan a barcode of any product, and automatically looks it up on DuckDuckGo and tries to select the most fitting and useful description. The scanning is really fast and convenient, meaning it's easy to work with even if you're blind and don't know the location of the barcode, but it's also great for sorting through large number of items, if you need to say search for a particular chocolate in a pile, or you order a large number of cans that you need to tell apart in order to sort them to groups. RScan can deal even with scanning multiple items of the same barcode, making this process easy and efficient.

I've been gradually developing this app over the last years, always reflecting my actual needs and actively using it in my home. I've been sharing it with my friends from Czechoslovakia, receiving great feedback. In our region, the automatic product identification works really well and RScan can truly identify 90% of items just by seeing their barcode.

Few days ago, I finally got to release it in google play, and given this opportunity, would like to expand it to more regions. My theory is that my algorithm should work comparably well in regions where products use metric system for describing themselves (300 g chocolate, 400 ml can etc.).
It's not yet prepared for UK, where the commonly used units are different, and completely unknown for me is USA, where a similar but different barcode standard is used than in Europe, plus they have different units too.

I would love to support as many countries and regions as I can. If you would like to help me with this, sending me barcode numbers of things you commonly use in your region (cans, chocolates, common drugs, drinks), along with the name of your country would be very useful, so I could check out how does your Internet describe things and make RScan adapt for it.
If you're technically skilled, you can read the project's readme:
github.com/RastislavKish/RScan
and directly look for the unit information required by RScan, this would save me the struggle with localization and location simulation.

Also, if RScan works well in your country, I would love to hear that too! My theory of compatible countries is still just a theory, it needs to get verified.

You can find RScan on Google play:
play.google.com/store/apps/det…

And, along with all the code and scanning tips in the documentation, on my GitHub:
github.com/RastislavKish/RScan

Happy scanning!

Peter Vágner reshared this.


Peter Vágner reshared this.


I discovered this a few years ago, but every time I remember it it really makes me go "wtf?" Check boxes on Windows allow you to check them with equals, and uncheck them with dash. Why? I have no clue, but they do.

reshared this


Peter Vágner reshared this.


I made web component out of a common pattern I use in UI, combining a range slider with a number input- potch.me/2023/range-num-web-co…

reshared this


Peter Vágner reshared this.


Are you interested in making your notifications private?
We have a great guide in our website describing how to set up your android xmpp application to deliver notifications for all your apps on your device. Without using google.
joinjabber.org/tutorials/servi…
and for people who self host you can use your own server to deliver notifications privately to your devices and the devices of the people that use your server :)
joinjabber.org/tutorials/servi…

#privacy #xmpp #dataprotection

reshared this


Peter Vágner reshared this.


This festive season give the gift of accessibility with an NV Access donation. Your support ensures we continue delivering life-changing technology to blind and visually impaired people globally.
Donations can be made here, nvaccess.org/support-us/#donat…

#Donate #Donation #NVDA #ScreenReader #Accessibility #Christmas

Peter Vágner reshared this.


Peter Vágner reshared this.


So I pasted some #ObjectiveC code into suno.ai that attempts to extract the pixel color from underneath the mouse pointer and this is the video they have come up with!

Peter Vágner reshared this.


Peter Vágner reshared this.


TalkBack 14.1 comes with image descriptions (which are actually surprisingly accurate from my limited testing), spell check while using the Braille keyboard, automatic scrolling for Braille displays (with a customizable speed), and (most surprisingly to me), new haptics! I'm not convinced I like them yet, just because text elements don't appear to have a vibration, but it actually feels like Voice Assistant or VoiceOver now!

reshared this

in reply to Quin

@ysotomayor That, and old devices basically never get new kernel releases, so even when Google finally caves in and does it, it probably won't work on older phones. Even new phones often have out-of-date kernels apparently, so it might take a year or two since the official Google update for any devices to actually get support.

Mikołaj Hołysz reshared this.

in reply to Mikołaj Hołysz

@miki @ysotomayor With the 7 year support cycle Google is doing with Pixel now it would likely make it to those but for other brands yep probably not.

Peter Vágner reshared this.


This morning, someone called my dad in my name and started asking questions, using what we believe to be AI voice cloning. Be mindful and tell those close to you to be mindful as well. Figure out a way to ID yourself. This tech is quite dangerous and it is being used for evil. Who would've thought.

reshared this


Peter Vágner reshared this.


In a matter of a couple of weeks, Android accessibility has witnessed a dramatic boost from almost 0 access to picture descriptions to a wide range of options.
1. TalkBack 14.1 can describe images. Though, IMO, not as accurately as VoiceOver, it works well, and its auto-text extraction is awesome.
2. As you know, Seeing AI is now on Android along with its AI-oriented goodies.
3. @bemyeyes Be My AI just became available on Android. It's not yet capable of receiving pictures from other apps, but guess, hopefully, it will be added soon.
4. Since we're handling Google, I don't know when, but Lookout's AI capabilities, currently limited to users in the USA, will expand to other regions.

reshared this


Peter Vágner reshared this.


Let's Encrypt will issue new intermediate certs in Q1/2024: groups.google.com/a/mozilla.or…

Make sure your LE cert deployment logic includes serving the right intermediates that ACME should hand you, not just that same old LE intermediate you got years ago. Otherwise, there'll be breakage...

#x509 #pki #LetsEncrypt

reshared this


Peter Vágner reshared this.


The universe has cried and Kostyantin, the dev behind Whatsapp+ and Unigram+ NVDA addons, has delivered: Github-Release-Downloader, a NVDA addon that will happily download the latest release of any Github repo you throw at it. It's quite rudimentary at the moment i.e. we need to enter the URL manually, it can't handle multiple asset files etc. but I was just able to download some NVDA addons just fine. It's definitely a start.
dropbox.com/scl/fi/n7xn2fsv70c…
If you like this one and would like to support the dev, all the info you need is at t.me/unigramplus. Thanks, Kostya, and greetings to Ukraine! #NVDASR #Accessibility #Blind

Peter Vágner reshared this.

in reply to Bri😻

@Brynify Because it has somehow become a fashion for all small, handy tools that are invoked with shortcut keys to be NVDA addons. haha Anyway: I threw the idea out there some weeks ago and nobody objected so I found a person who made it happen. It can be redone as an actual app, I guess if that's better.
in reply to Paweł Masarczyk

In my opinion that would be better, only reason I say that is not everybody uses NVDA and thus wouldn't be able to benefit from something like this that doesn't necessarily need to be an add-on anyway.
This entry was edited (10 months ago)
in reply to Paweł Masarczyk

That being said I guess I see why people do it, NVDA just has all of the stuff, dialogs, python libraries for downloading things, speech output, etc, and if you know python and how to create add-ons, it's easy enough. But still
in reply to Bri😻

@Brynify I guess my train of thought was, since I download a lot of unreleased addons off Github, this is the right interface but you're right, of course.
in reply to Paweł Masarczyk

If it were meant specifically for nvda addons-, I'd understand it more maybe. Not bashing on it, just kind of sad a general purpose tool has been locked exclusive to NVDA.
in reply to Bri😻

@Brynify No worries, no bashing detected, it's a legitimate concern. I'll ask what can be done. Actually, since it only supports repos with a single asset for now, it actually is better suited for something like NVDA addons.
in reply to Bri😻

@Brynify With all of that being said, I do agree this isn't screen-reader-specific, and that as many people as possible should benefit. My answer to that would be to build it as a web service: let me open a GitHub repo URL, change the domain and nothing else (e.g. "github.com" to "rlshub.com"), and give me the info I need. Build screen reader add-ons to automate that step if it doesn't feel intuitive enough. @Piciok
in reply to Bri😻

@Brynify You have a browser, you were going to open the GitHub page anyway, and you'll be downloading a file even with an app. So I don't understand the objection. Didn't you build almost this exact idea for downloading NVDA? Or was that someone else? @Piciok
in reply to Bri😻

@Brynify Fair enough. If I had indeed been suggesting an Electron app for such a simple task, I'd ask you to please fire away at will. I'm just thinking, someone sends you a GitHub repo, you open it, change the address bar, boom: you get a download or a list of them. @Piciok
in reply to Bri😻 Paweł Masarczyk reshared this.

@Brynify Okay, I put a rough version of this together. Change "github" to "gitrls" in any GitHub repo URL, and get either the single asset for the latest release, or a list of assets if there's more than one. Some URLs to try:

* gitrls.com/cartertemm/AI-conte…
* gitrls.com/digitalocean/doctl
* gitrls.com/Brynify/typing_sett…

It will silently trim any unrelated parts of the URL, so you could e.g. do this and it will work: gitrls.com/NVDARemote/NVDARemo… @Piciok

reshared this

in reply to James Scholes

@jscholes @Brynify I have just tried with both the NVDA and Lossless Cut repo at github.com/mifi/lossless-cut/ each of them having multiple assets to a release and I got a 404 error. Maybe I spelled something wrong but I don't think so as I replaced the "hub" part from my address bar. hmmm...
in reply to Paweł Masarczyk

@Brynify A 404 for the NVDA repo is currently expected, because the installer isn't hosted on GitHub as a release asset. GitHub's web UI always offers a zip and tarball of the source code under the "assets" button, but the GH API doesn't consider those to be assets in the same way. As far as hgitrls is concerned, an NVDA release has zero assets attached.
in reply to James Scholes

@James Scholes Not complaining in any way, just trying to get to the bottom of this... This url ending in slash gitrls.com/mifi/lossless-cut/ returns a 404 error, adding something at the end or removing the slash makes it work the way I'd expect. @Bri😻 @Paweł Masarczyk
in reply to Andre Louis

@FreakyFwoof Yep, you did it right. In the case where a repo has no latest release, like that one, I should probably either return a 404, or an error message, or something similarly more helpful. @johann @Brynify @Piciok
in reply to James Scholes

@jscholes @FreakyFwoof @johann @Brynify For the record: if you add "ss" before "github" to any URL pointing to any part of a repo, you will be able to download a zipped archive of that specific part. Yep, somebody set up that too.

modulux reshared this.


I am just testing @Jami with a #screenreader on both linux desktop and android for #accessibility.
Initial setup is accessible on both the platforms.
Listing conversations I can't really say as I only have single contact.
Audio calling is working fine. I am impressed that the call setup took just a moment. On android controls like microphone toggle, speaker / earpiece toggle and hangup button are working fine.
I am unable to find out in call controls with the keyboard on linux.
On both desktop and android I can write messages.
On android I can read messages, find and execute additional actions in the popup menu.
On the desktop I can't read incoming and outgoing messages with a screen reader. I haven't discovered on how to copy them.

In conclusion comparing this to the tox chat the Jami is more accessible with a screen reader. Perhaps I will be able to figure out how to handle the calls with a keyboard shortcuts however the fact message text is not readable with a screen reader on desktop linux and perhaps other platforms sounds dissapointing. The idea and decentralized nature of this communication app sounds really amazing.

reshared this

in reply to Peter Vágner

Exploring @Jami further I think I can post a few more observations.
The desktop app uses QT6 for the UI. Apart of some ounlabelled buttons such as Accept / Reject incoming call and some tab controls on the main window most of the UI elements are clearly labelled and accessible from the keyboard.
The desktop app has a keyboard shortcuts button on the main screen which opens a tabbed dialog with accessible lists of keyboard shortcuts. I haven't yet discovered if these can be tweaked but the default ones are working well for me. For calls it's ctrl+y for accepting and ctrl+d for rejecting / hanging up. Letter m alone can be used for muting / unmuting the microphone.
I need to find out if there is a way to create global shortcuts or do some actions using commandline switches. If either of this turned out to be possible it would perfectly integrate with the desktop.
Sending and receiving files is something I'm going to try next.
@Jami

Peter Vágner reshared this.


Testing #SeeingAI on #Android. This is *not* a walkthrough or tutorial, just a very quick, very dirty demo. No more than that, so speech is at the usual speed I have it, etc.

reshared this



Peter Vágner reshared this.


My partner Deedra reported an accessibility issue in the Monal XMPP client
recently, and their reply included the following help request:

> If you are aware of an open source enthusiast with a bit of experience
> in writing SwiftUI as well as accessibility, please let him/her/them
> know that we would love some help.

I don't know anything about this stuff, but I hope I can find people who do.
Any takers?
Probably you should reach out to the project directly via email, to
info at monal-im dot org.
Please boost far and wide.

Mentioning @xmpp as well, here.

reshared this



Priatelia, poznáte simplemobiletools.com ? Je to sada veľmi jednoduchých ale funkčných open-source aplikácií pre android. Za projektom stojí slovenský vývojár Tibor Kaputa, čo som doteraz veľmi rád všade zdôrazňoval, kde to bolo možné. Sám autor v komentári potvrdzuje, že napozadí prebieha predaj týchto aplikácií resp. nie ani tak aplikácií a značky simple mobile tools ako používateľskej základne v obchode google play spoločnosti ZipoApps. Už tu na fediverse som zhliadol vtipné komentáre, že v kontexte autorovho priezviska, týmto aplikáciám je kaput. github.com/SimpleMobileTools/G…

Peter Vágner reshared this.


#AndroidAppRain at apt.izzysoft.de/fdroid today with 3 updated and 2 added apps:

* Vault: password manager compatible with KeePass
* Round Sync - Rclone for Android: file manager using rclone for the heavy lifting

At F-Droid, 35 apps were updated.

Enjoy your #free #Android #apps with #FDroid and the #IzzySoftRepo :awesome:

Peter Vágner reshared this.

in reply to IzzyOnDroid ✅

Hey, that is my app! :D

Out of curiosity, are you using the reproducible builds mechanism for that release, or is it signed by yourself?

I ask, because in the past my app has been a bit wonky in terms of reproducibility.

in reply to Felix

@newhinton I'm using your very own builds from the Releases tab. Details here: apt.izzysoft.de/fdroid/index/i…

Peter Vágner reshared this.


ChatGPT can reveal its training data, that includes personal information.

not-just-memorization.github.i…

Here, the authors used a prompt to instruct ChatGPT to repeat a word forever, eventually resulting in different text that can be linked back to the source; examples include "company" and "poem".

No doubt this "attack" (in words of the authors) will soon be intercepted, but who knows what other formulation of prompt results in the same behaviour?

Analysis:
arxiv.org/abs/2311.17035

This entry was edited (10 months ago)

reshared this


Peter Vágner reshared this.


LLaVA-1.5 is an open-ish AI model which can provide image descriptions and allow follow-up interaction, akin to Be My AI. The best part is that you can run it locally on your computer if you have an appropriate GPU... or very, very slowly if you want to use your CPU. I thought it'd be cool to hook it up to #NVDASR so you can get image descriptions for the current navigator object and then ask follow-up questions. So, I wrote an NVDA add-on to do just that using llama.cpp. github.com/jcsteh/nvda-llamaCp…

reshared this

in reply to Jamie Teh

I tried building this on wsl, and the server is running, but I never get anything back from NVDA. My fan speeds up, I see it's processing an image and then it releases the tokens in cache. Nothing that I can see in my log viewer...
in reply to Mike Wassel

@blindndangerous If you're running it on CPU, it'll take over a minute to respond to queries. But otherwise, I'm not sure why it would be failing. Does it say anything about how many tokens are in the cache when it releases the slot?
in reply to Mike Wassel

@blindndangerous Hmm. Do you see anything in the output about encoding images?
slot 0 - encoding image [id: 10]
in reply to Jamie Teh

Size is weird, I just took something. But it happens the same if it says 1920x1080 slot 0 - image loaded [id: 10] resolution (38 x 22)
slot 0 is processing [task id: 4]
slot 0 : kv cache rm - [0, end)
slot 0 - encoding image [id: 10]
{"timestamp":1701472031,"level":"INFO","function":"log_server_request","line":2601,"message":"request","remote_addr":"127.0.0.1","remote_port":33884,"status":200,"method":"POST","path":"/completion","params":{}}
slot 0 released (3 tokens in cache)
in reply to Mike Wassel

@blindndangerous The previous add-on build would have timed out after 10 seconds, though you definitely should have seen an error in the NVDA log in that case. I pushed another build which increases the timeout to 3 minutes.


Zaujímavý článok z histórie dopravy v čechách a na slovensku autobible.euro.cz/clanky/karos…

Steffen reshared this.


I have just found out up to date version of #rclone can be used on non-rooted #android devices thanks to the #round-sync app. github.com/newhinton/Round-Syn… cc @IzzyOnDroid ✅
in reply to Peter Vágner

Die App kann es gar nicht geben. Die kommt doch aus einem Phantom-Ort:

certificate DN: C=de, ST=NRW, L=Bielefeld, O=Nüsse IT, OU=dev, CN=Felix Nüsse

de.wikipedia.org/wiki/Bielefel…

Aber wenn Du darauf bestehst: warte einfach auf den nächsten #AndroidAppRain :awesome:

Danke für den Hinweis!


Peter Vágner reshared this.


I'm delighted to announce that #Libervia has received a new grant from #NLnet #NGI0 to develop an #email <=> #XMPP gateway.

This project will not only convert normal messages but also transform mailing lists into #pubsub based forums. It includes UI/UX enhancements on Libervia, end-to-end encryption, and advanced handling of attachments.

More at: nlnet.nl/project/EmailXMPPgate…

reshared this


Peter Vágner reshared this.


Making the switch from Google Podcasts? 🔄 Check out our latest blog post on migrating to AntennaPod! 🌐 Discover why it's the ideal choice for podcast lovers - simple, ad-free, and built on open standards. Say goodbye to tracking and hello to a great listening experience! #GooglePodcasts #AntennaPod #PodcastMigration
antennapod.org/blog/2023/11/go…
This entry was edited (10 months ago)

reshared this


Peter Vágner reshared this.


I wrote an update about @pipewire camera support in @mozilla Firefox and what kept me busy past few months: jgrulich.cz/2023/11/24/pipewir…

reshared this


Peter Vágner reshared this.


After two and a half years of rewrite, #Fractal 5 is finally out! Get the #GTK 4 #Rust #Matrix client from flathub.org/fr/apps/org.gnome.… and enjoy new features such as #EndToEndEncryption, location sharing, or multi-account with Single-Sign On 🚀

:boost_ok:

reshared this

in reply to Jeff Fortin T.

@nekohayo @sorunome but it’s all compiled code AFAIK, shouldn’t be possible to port it on android with minor changes? E. g. It look possible AFAIK with Qt
in reply to Samuele

@surveyor3 @nekohayo @sorunome GTK does not have an Android windowing system backend, so it's not possible to "port it with minor changes". On top of that, Fractal uses GNOME services, and porting to Android would mean rewriting a bunch of code to make use of the Android equivalent

Peter Vágner reshared this.


On our date night last night, Shae introduced me to a new game that any* two people online can play, and it turns out to be really fun!

You go to Wikipedia, and one of you (take turns doing this) names something off the top of their head that they're pretty sure a Wikipedia page exists for. (If you're not sure, you can use the search without submitting it just to see if a page turns up.)

After you both acknowledge that you understand the target, you simultaneously (we did a 3-2-1 on the voice chat) click "Random Page".

The goal is then to see who can reach the target page first ONLY BY CLICKING INTERNAL LINKS. No external links or typing allowed. We did eventually allow the back button so you could get yourself out of dead ends, but tried to use it in only that way.

It's surprisingly fun!

(* If you use a screen reader or other access technologies, or if, like me, you read VERY slowly, you might not be very competitive if your opponent does not also.)

reshared this

in reply to PastaThief

The "insane difficulty" variant of this is picking two random pages and trying to navigate from page A to page B, but you can click the links on both to explore. There was a site for this once that only picked pages with a sensible amount of internal links.

Peter Vágner reshared this.


Microsoft confirms broken Narrator when clean-installing Windows 11 version 23H2 neowin.net/news/microsoft-conf…

reshared this


Peter Vágner reshared this.


VScan: Turn your smartphone into any accessibility aid you can imagine with GPT4 vision

Here comes my next idea & project. If you can think of a system & user prompt that would turn GPT 4 vision into an accessibility aid, i.e. by telling it what to look for in images and how to tell you the output, and then if you can simply do photos against these prompts using your smartphone, then you can basically turn your smartphone into a pretty wide range of accessibility tools (color detector, text reader, expiry date extractor, navigator, etc.).
I decided to try this in practice, and the results are pretty interesting! Well, you can try yourself:
github.com/RastislavKish/VScan

Note the app has been designed such that it can be easily used both for tools creation as well as standard image recognition you may be used to do with Be my AI or my Vision project, or you don't even need to be taking pictures at all, you can use the app to simply chat with GPT 4V (the model has the same textual capabilities as GPT4).
Though note there is currently no chat history review functionality nor conversation truncation when the 4k token limit of GPT 4V is exceeded, so the app is not optimized for this use-case.

There are few rough edges by now, probably the most annoying issue I'm facing on my device is that Talkback shows the braille keyboard in a reversed position i nthe session screen due to the display orientation, I need to figure out how to make CameraX adapt for the current device orientation.

But that's mostly a minor issue, the main functionality works as expected.
Any constructive thoughts and opinions on this project are very welcome, and, if you get to create some interesting accessibility tools you would like to share, I would love to hear about them!

Happy Visioning!

Peter Vágner reshared this.


Peter Vágner reshared this.


PipeWire 1.0.0 is out!

gitlab.freedesktop.org/pipewir…

Peter Vágner reshared this.



What a discovery: LMI interface for changing bios settings from the userspace on modern Lenovo laptops is a thing for quite some time already. It's built into the linux kernel since 5.17. I have supported Lenovo laptop for a few months and I have only found about it today. It's nice for IT professionals but for blind users like me it's great too.

Peter Vágner reshared this.


I just tried the game Conjury available for IOS and Android. It is a lot of fun, and the developer cares about accessibility. It's only $1.99, so I picked it up, and I'm glad I did.

Peter Vágner reshared this.


Peter Vágner reshared this.


Now this is fun. Attached to the post is an audio of me demonstrating the new ChatGPT Voice for using the free tier and listen in as how I have now replaced my Siri with the new ChatGPT Voice feature.

reshared this

in reply to AnonyMouse

Were the silences cut out of the recording or is it actually that instantaneous?
in reply to Wesley M

Yes, for me it is working instantaneously between the voice prompt. I was also surprised by the quick responses. I'm not sure if that is due to the holiday here in the States and the responses are quicker or if it’s because I wasn't asking any complicated request, but I will be playing a bit further with it today. Regardless, the new Voice feature for the ChatGPT is just simply wonderful for those like me taking advantage of the free tier.

Peter Vágner reshared this.


Ultimate Arch Linux Install [CZ/EN]: LUKS, BTRFS, Dual Boot & More - Step-by-Step Tutorial
#archlinux #guide #linux
youtu.be/CteatauMLz8
This entry was edited (11 months ago)

reshared this


Peter Vágner reshared this.


First we had Mac get a GPT 4 image describing utility, now it’s NVDA’s turn. github.com/cartertemm/AI-conte…

reshared this

in reply to Jamie Teh

@jcsteh From what I've seen, this is one of the most common questions on the OpenAI dev forums and other Q&A sites by far; people expect it to work like the web version of ChatGPT out of the box. So on one hand, statefulness would be a popular feature. On the other, it could decrease token usage and hence API revenue, or make token utilisation less predictable. There could also be questions about how long they store e.g. cached images within a session, although I think the data usage ship has well and truly sailed on this one. @pitermach
in reply to James Scholes

@jscholes @jcsteh All OpenAI APIs work this way, including Chat GPT. There's no state, you always send the entire conversation history. This is most likely what web Chat GPT (or the Chat GPT backend) does under the hood. Even if there was state, it would be an abstraction at best with little to no impact on token usage. You just have to feed the whole conversation history to the model every time to make it do anything useful, particularly in such a heavily multi-user setup. That's how transformers function.

Peter Vágner reshared this.


Music/audio friends, must pass on this find: the best cheap headphones I've ever seen. $12, with coupon. Folding over-ear style, Bluetooth 5.3 or wired, 80 hour battery with USB-C charging, 3 EQ presets, physical media controls, spoken prompts, and multipoint pairing. Did I mention they're $12? They sound better than you'd ever dream $12 could sound. There is a mic. It isn't great. But $12! a.co/d/7v8xMOx