Lots of things happening in the AI/LLM space that could have implications for #accessibility
Ferret-UI from Apple:
https://arxiv.org/abs/2404.05719
ScreenAI from Google:
https://research.google/blog/screenai-a-visual-language-model-for-ui-and-visually-situated-language-understanding/
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Recent advancements in multimodal large language models (MLLMs) have been noteworthy, yet, these general-domain MLLMs often fall short in their ability to comprehend and interact effectively with user interface (UI) screens.