Lots of things happening in the AI/LLM space that could have implications for #accessibility
Ferret-UI from Apple:
arxiv.org/abs/2404.05719
ScreenAI from Google
research.google/blog/screenai-…
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Recent advancements in multimodal large language models (MLLMs) have been noteworthy, yet, these general-domain MLLMs often fall short in their ability to comprehend and interact effectively with user interface (UI) screens.arXiv.org