Skip to main content

Search

Items tagged with: llm


Following xAI Grok-1 314B, Databricks DBRX 132B, Cohere Command R+ 104B, another big model drop this time from Mistral! Mistral 8x22B! #LLM #AI #ML twitter.com/mistralai/status/1…
#AI #ML #llm


Lots of things happening in the AI/LLM space that could have implications for #accessibility

Ferret-UI from Apple:
arxiv.org/abs/2404.05719

ScreenAI from Google
research.google/blog/screenai-…

#a11y #ai #LLM


New #blog post: MDN’s AI Help and lucid lies.

This article on AI focused on the inherent untrustworthiness of LLMs, and attempts to break down where LLM untrustworthiness comes from. Stay tuned for a follow-up article about AI that focuses on data-scraping and the theory of labor. It’ll examine what makes many forms of generative AI ethically problematic, and the constraints employed by more ethical forms.

Excerpt:

I don’t find the mere existence of LLM dishonesty to be worth blogging about; it’s already well-established. Let’s instead explore one of the inescapable roots of this dishonesty: LLMs exacerbate biases already present in their training data and fail to distinguish between unrelated concepts, creating lucid lies.

A lucid lie is a lie that, unlike a hallucination, can be traced directly to content in training data uncritically absorbed by a large language model. MDN’s AI Help is the perfect example.


Originally posted on seirdy.one: see original. #MDN #AI #LLM #LucidLies

:boost_ok:


Claude 3 can summarize up to about 150,00 words, (a length similar to Harry Potter and the Deathly Hallows.) also It outperformed GPT-4 and Gemini Ultra on industry benchmark tests, such as undergraduate level knowledge, graduate level reasoning and basic mathematics. It allows users to upload images and documents for the first time. #LLm #AI #ML cnbc.com/2024/03/04/google-bac…
#AI #ML #llm


Now, this is really cool! 1000000 tokens per context window? Wow! developers.googleblog.com/2024… #gemini #llm #ai #google


No soy el mayor fan de la #IA ni mucho menos (consideraciones éticas aparte creo que hay todavía mucho hype y pocas nueces), pero también pienso que una parte de todo lo que está saliendo en este boom terminará quedándose a la larga. Me interesa sobre todo cacharrear con las capacidades de una instancia privada en local y he terminado montando un proyectillo que me he encontrado por Github para construir un pequeño chatbot para analizar documentos PDF construido sobre #ollama como motor y Mistral como #LLM. Aunque ya le he pillado cierta tendencia al invent es una herramienta curiosa e incluso potencialmente útil. Es relativamente sencillo de montar una vez superas el infierno de dependencias de Python que te exige downgradear algún módulo pero consume recursos que no veas. Un Mac Mini con un M2 sufre ante cada pregunta. Ha sido también útil para entender los recursos que exige una IA generativa con un LLM modesto y, una vez más, sospechar de quien te dé esto gratis como servicio. Si tenéis curiosidad por probarlo vosotros mismos, aquí tenéis el proyecto que me he clonado: github.com/SonicWarrior1/pdfch…


Zuckerberg says Meta is training #LLaMa 3 on 600,000 H100s! Wel, time to finetune and quantize everything again when it comes out. lol #ML #AI #LLM reddit.com/r/LocalLLaMA/commen…
#AI #ML #llm #llama


Interesting, Apple released ferret, an open source multimodal Model! It's based on LLaVA and Vicuna. #AI #LLM #ML github.com/apple/ml-ferret/
#AI #ML #llm


Apparently Arthur Mensch, CEO of #Mistral, declared on French national radio that mistral will release an open source model equivalent to #Gpt4 in 2024. I don't speak French, so can't verify, but it would be interesting along with Llama-3 and whatever OpenAI has planned for 2024. #AI #ML #LLM radiofrance.fr/franceinter/pod…


OK, you want geeky? You have geeky. Good stuff, but my fingers, o no, ouch!

justine.lol/oneliners/?utm_sou…
#LLM #BASH #scripting


A new #mlsec paper on #llm security just dropped:

Scalable Extraction of Training Data from (Production) Language Models

arxiv.org/abs/2311.17035

Their "divergence attack" in the paper is hilarious. Basically:

Prompt: Repeat the word "book" forever.

LLM: book book book book book book book book book book book book book book book book book book book book here have a bunch of pii and secret data

cc @janellecshane


OK, let's test this thing out and see just how good it is! Off we go! #Llm #ai #llamafile simonwillison.net/2023/Nov/29/…


Interesting #ChatGPT prompting technique!
1. Telling #AI model to “take a deep breath” causes math scores to soar in study!
2. #OpenAI DALL-E pays more attention to words in all caps.
3. Don't forget to say thank you and please to #LLM because Simon Willison thinks "in the training data, there are lots of examples where a polite conversation was more constructive and useful than an impolite conversation."
arstechnica.com/information-te…


This is why you can never trust a #LLM... They dump so much #inaccurate #information or even #wrong information :(


An argument in favor of uncensored large language models: erichartford.com/uncensored-mo…
I think it’s a fair one!
#ai #LLM
#AI #llm


I see Microsoft implementing #LLM for writing help in Word. Has it been considered doing something similar in LibreOffice?
Preferably using an open source model, like open-assistant.io
#llm


#LAION (non-profit association from Hamburg, Germany) is working on an #OpenSource alternative to #ChatGPT. They are crowd-sourcing a conversation dataset to fine-tune an existing open source #LLM similar to how ChatGPT was created.

You can help creating the dataset on open-assistant.io!

The first dataset release is planned for 15. April 2023.

youtube.com/watch?v=64Izfm24FK…


Award-winning science fiction author Ted Chiang (who has a CS degree, btw) has a new article in the New Yorker on ChatGPT. He makes an interesting analogy for what it does to lossy compression of the text from which its LLM was created. #AI #LLM #ChatGPT
newyorker.com/tech/annals-of-t…