Search | fedi.ml

Items tagged with: LLM

PrivacyDigest
1 year ago

#AIagent promotes itself to #sysadmin , trashes #boot sequence

Fun experiment, but yeah, don't pipe an #LLM raw into /bin/bash

Buck #Shlegeris, CEO at #RedwoodResearch, a nonprofit that explores the risks posed by #AI , recently learned an amusing but hard lesson in automation when he asked his LLM-powered agent to open a secure connection from his laptop to his desktop machine.
#security #unintendedconsequences

theregister.com/2024/10/02/ai_…

AI agent promotes itself to sysadmin, trashes boot sequence

Fun experiment, but yeah, don't pipe an LLM raw into /bin/bash

^{Thomas Claburn (The Register)}
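One way to read the "don't pipe an LLM raw into /bin/bash" advice is to put a gate between the model's output and the shell. The following is a minimal sketch under stated assumptions, not the setup described in the article: the allowlist `SAFE_COMMANDS`, the metacharacter set, and the function name `classify_command` are all hypothetical, and a real deployment would need a much stricter policy.

```python
import shlex

# Hypothetical allowlist of read-only commands the agent may run unattended.
SAFE_COMMANDS = {"ls", "pwd", "whoami", "uptime", "df"}

# Characters that allow chaining, redirection, or substitution in a shell.
SHELL_METACHARACTERS = set(";&|><$`\n")


def classify_command(command: str) -> str:
    """Return 'allow', 'confirm', or 'reject' for a model-proposed command."""
    # Reject anything that could smuggle in a second command.
    if any(ch in SHELL_METACHARACTERS for ch in command):
        return "reject"
    try:
        argv = shlex.split(command)
    except ValueError:
        # Unbalanced quotes and similar parse errors.
        return "reject"
    if not argv:
        return "reject"
    # Allowlisted programs run directly; everything else needs a human.
    if argv[0] in SAFE_COMMANDS:
        return "allow"
    return "confirm"
```

With this policy, a command such as `sudo apt upgrade` is never executed automatically; it is held for explicit human confirmation, and anything containing shell metacharacters is refused outright.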

#security #AI #sysadmin #llm #boot #unintendedconsequences #redwoodresearch #shlegeris #aiagent

Terence Eden
1 year ago

🆕 blog! “GitHub's Copilot lies about its own documentation. So why would I trust it with my code?”

In the early part of the 20th Century, there was a fad for "Radium". The magical, radioactive substance that glowed in the dark. The market had decided that Radium was The Next Big Thing and tried to shove it into every product. There …

👀 Read more: shkspr.mobi/blog/2024/10/githu…
⸻
#AI #github #LLM

#github #AI #llm

Nick Byrd, Ph.D.
1 year ago

In an independent test, #OpenAI’s o1-preview model achieved near-perfect performance on a national #math exam (landing in the top 0.1% of the nation’s students).

o1 also outperformed 4o on the math test, but took about 3 times longer to do so (10 minutes vs. 3 minutes).

Preprint: researchgate.net/publication/3…

#teaching #assessment #AI #LLM #edu #higherEd

#AI #math #llm #openai #teaching #edu #highered #assessment

Avoid the Hack!
1 year ago

Massive E-Learning Platform #Udemy Gave Teachers a Gen #AI 'Opt-Out Window'. It's Already Over.

Udemy will train generative AI on the classes that users developed and contributed on its site. It is opt-out (meaning everyone was already opted in) with a time window... and opting out may "affect course visibility and potential earnings."

Udemy's reason for the opt-out window was reportedly that removing data from LLMs is hard. IMO, that would be the reason for making it opt-in, but here we are...

#privacy #privacymatters #llm

404media.co/massive-e-learning…

#privacy #AI #privacymatters #llm #udemy

Chi Kim
1 year ago

😲 OMG, the Audio Overview feature on NotebookLM is wild! It basically creates a podcast with two AI-generated voices based on the source documents you upload. Definitely try it if you haven't yet. #LLM #ML #AI blog.google/technology/ai/note…

#AI #ML #llm

Chi Kim
1 year ago

New Mistral 22B model Mistral-Small-Instruct-2409 #LLM #AI #ML huggingface.co/mistralai/Mistr…

#AI #ML #llm

IzzyOnDroid ✅
1 year ago

#DSGVO (GDPR) versus #LLM / #KI (AI):
Copilot turns a court reporter into a child abuser
heise.de/news/Copilot-macht-au…

Right of access? Difficult. Deleting the false information? Impossible. So what now?

Copilot turns a court reporter into a child abuser

Because he reported on court proceedings, Copilot turns a journalist into a child abuser, a swindler of widows, and more.

^{Eva-Maria Weiß (heise online)}

#DSGVO #llm #ki

Chi Kim
1 year ago

If the rumors are true, this could be another exciting week for open-source LLMs! Meta may release Llama-3-405B this Tuesday. There could also be updates to the 8B and 70B models, distilled from the 405B. Joe Spisak, a product director at Meta, says they were initially going to call Llama 3 8B and 70B a prerelease or preview because these models didn't have all the things they planned to release.
Sources:
theinformation.com/briefings/m…
x.com/AlpinDale/status/1814814…
youtu.be/r3DC_gjFCSA?feature=s…
#LLM #AI #ML

Meta Announces Llama 3 at Weights & Biases’ conference

In an engaging presentation at Weights & Biases’ Fully Connected conference, Joe Spisak, Product Director of GenAI at Meta, unveiled the latest family of Lla...

^YouTube

#AI #ML #llm

Chi Kim
1 year ago

Time to buy the Mac Studio with 192GB memory! lol Rumor: Meta plans to release the largest Llama 3 model with 405 billion parameters on July 23, according to a Meta employee. "it will be able to understand and generate images and text." #LLM #AI #ML theinformation.com/briefings/m…
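The joke about a 192GB Mac Studio can be checked with back-of-the-envelope arithmetic. This is a rough sketch that counts weights only: it assumes 1 GB = 10^9 bytes and ignores the KV cache, activations, and runtime overhead, and the function name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


LLAMA_405B = 405e9  # parameter count quoted in the rumor

# 16-bit, 8-bit, and 4-bit weights for a 405-billion-parameter model.
for bits in (16, 8, 4):
    print(f"{bits}-bit weights: {weight_memory_gb(LLAMA_405B, bits):.1f} GB")
```

Under these assumptions even 4-bit weights come to 202.5 GB, so a 405B model would not fit in 192 GB of unified memory without quantizing below 4 bits per weight.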

#AI #ML #llm

Blort™ 🐀Ⓥ🥋☣️
1 year ago

Also @Tutanota , as a privacy-focused company, why are your comments run on #Reddit, rather than the #Fediverse? Most privacy-minded folks don't want their comments being used by AI LLMs, etc.

This is way off brand.

@carlschwan among others have already shown how to do it fedistyle, pretty easily, and I'm sure many of the FOSS Fedipeeps here would happily help you out with a quick transition if you asked or gave a few $ to their FOSS project.

#Tuta #Reddit #Privacy #Fediverse #FOSS #AI #LLM

#foss #privacy #fediverse #AI #reddit #llm #tuta @Tuta @Carl Schwan

The vOICe vision BCI 🧠🇪🇺
1 year ago

GPT 4 hallucination rate is 28.6% on a simple task: citing title, author, and year of publication medium.com/@michaelwood33311/g…

Hallucination rates and reference accuracy of ChatGPT and Bard for systematic reviews: Comparative analysis jmir.org/2024/1/e53164 #AI #LLM

GPT 4 Hallucination Rate is 28.6% on a Simple Task: Citing Title, Author, and Year of Publication

The all-too-common myth of GPT 4 having only a 3% hallucination rate is shattered by a recent study that found GPT 4 has a 28.6% hallucination rate. That’s almost 10x the oft-cited (i.e. over hyped)…

^{Michael Wood (Medium)}

#AI #llm

Robert Kingett
1 year ago

Blind writer tries the Gandalf | Lakera prompt injection game for the first time.

Upon recommendations, I tried this AI prompt injection game for the first time. I made it to level 7 with no help from the internet!

If you want to donate to me, donate to me on this page.

My website is here where I usually blog. I'm not much of a video person, so I blog and write more than I do video!

Gandalf | Lakera – Test your AI hacking skills

Trick Gandalf into revealing information and experience the limitations of large language models firsthand.

^{gandalf.lakera.ai}

#security #AI #computers #llm #Prompt Injection

Martin Owens
1 year ago

Do you remember a couple of weeks ago when I complained that a very large #python contribution to #inkscape was poorly formatted and I felt embarrassed about pushing back and asking them to run a linter over it?

Yeah I'm not fucking embarrassed now, I'm furious. 🤬

Update: Apparently they meant a small section of it was, not the whole MR. I'm annoyed, but I'll have to take them at their word.

#llm #oss #foss #mergerequest

~Anonymised~ 2 hours ago
No, but it was mostly written by an LLM.

#foss #oss #inkscape #python #llm #mergerequest

Ľuboš Moščovič
1 year ago

Today's fail by a Russian troll was so delightful that I ended up writing about it at some length. So: to feed or not to feed the trolls?

herrman.sk/home/krmit-ci-nekrm…

#troll #chatgpt #llm #fail #blog

To feed or not to feed the trolls? | Ľuboš Moščovič on security

Information security concerns everyone!

^{www.herrman.sk}

Feed (0 votes)
Don't feed (0 votes)
trollololooooo (0 votes)

Poll end: 1 year ago

#blog #fail #troll #llm #chatgpt

Musharraf
1 year ago

This generative model allows you to sketch out a scene with a few words; it then leverages an LLM to flesh out the details, with the ultimate goal of feeding those details to a downstream visual image generation model.
It is almost, but not quite, entirely the inverse of image captioning models.
This offers the closest experience to an image generation tool that's usable by people with visual impairments.

huggingface.co/spaces/lllyasvi…

#a11y #genai #llm

Omost - a Hugging Face Space by lllyasviel

Discover amazing ML apps made by the community

^{huggingface.co}

#a11y #llm #genai

victor tsaran
1 year ago

What is Flash Attention huggingface.co/docs/text-gener… #llm #ai #ollama

Flash Attention

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

^{huggingface.co}

#AI #llm #ollama

Kathy Reid
1 year ago

Like many other technologists, I gave my time and expertise for free to #StackOverflow because the content was licensed CC-BY-SA - meaning that it was a public good. It brought me joy to help people figure out why their #ASR code wasn't working, or assist with a #CUDA bug.

Now a deal has been struck with #OpenAI to scrape all the questions and answers in Stack Overflow to train #GenerativeAI models, like #LLMs. The content will be used without attribution to authors (attribution is required under the CC-BY-SA license under which Stack Overflow content is licensed) and sold back to us (even though the SA clause requires derivative works to be shared under the same license). I have therefore issued a Data Deletion request to Stack Overflow to disassociate my username from my Stack Overflow username, and am closing my account, just like I did with Reddit, Inc.

policies.stackoverflow.co/data…

The data I helped create is going to be bundled in an #LLM and sold back to me.

In a single move, Stack Overflow has alienated its community - which is also its main source of competitive advantage, in exchange for token lucre.

Stack Exchange, Stack Overflow's former instantiation, used to fulfill a psychological contract - help others out when you can, for the expectation that others may in turn assist you in the future. Now it's not an exchange, it's #enshittification.

Programmers now join artists and copywriters, whose works have been snaffled up to create #GenAI solutions.

The silver lining I see is this: once OpenAI creates LLMs that generate code, as Microsoft has done with Copilot on GitHub, where will they go to get help with the bugs that the generative AI models introduce, particularly given the recent GitClear report on the "downward pressure on code quality" caused by these tools?

While this is just one more example of #enshittification, it's also a salient lesson for #DevRel folks - if your community is your source of advantage, don't upset them.

Submit a data request - Stack Overflow

You can use this form to submit a request regarding your personal information that is processed by Stack Overflow

^{policies.stackoverflow.co}

#llm #openai #stackoverflow #generativeAI #LLMs #enshittification #genai #asr #cuda #devrel

Andrew Jennings
1 year ago

An actual business expert grapples with large language models: ben-evans.com/benedictevans/20… #AI #LLM

Looking for AI use-cases

We’ve had ChatGPT for 18 months, but what’s it for? What are the use-cases? Why isn’t it useful for everyone, right now? Do Large Language Models become universal tools that can do ‘any’ task, or do we wrap them in single-purpose apps, and build thou…

^{Benedict Evans}

#AI #llm

Chi Kim
1 year ago

Start saving money for that M4 Ultra with 500GB! Maybe this could be the first open-source model to surpass GPT-4! AIatMeta: "Llama 3 8B & 70B models are just the beginning of what we’re working to release for Llama 3. Our largest models currently in the works are 400B+ parameters and while they’re still in active development, we’re excited about how this work is trending." #LLM #AI #ML twitter.com/AIatMeta/status/17…

#AI #ML #llm

Steve Faulkner
1 year ago

YOWZA
Form Extractor Prototype

“This tool extracts the structure from an image of a form.”

github.com/timpaul/form-extrac…

#ai #LLM #UX #accessibility

GitHub - timpaul/form-extractor-prototype

Contribute to timpaul/form-extractor-prototype development by creating an account on GitHub.

^GitHub

#Accessibility #AI #UX #llm

Chi Kim
1 year ago

Apparently Meta is planning to release two small variants of Llama-3 next week "as a precursor to the launch of the biggest version of Llama 3, expected this summer." Command-r-plus, mixtral 8x22b, Google CodeGemma... All of a sudden companies are releasing LLMs like crazy! Where's Apple? Maybe at WWDC 2024? lol #LLM #AI #ML theinformation.com/articles/me…

#AI #ML #llm

Chi Kim
1 year ago

Following xAI Grok-1 314B, Databricks DBRX 132B, Cohere Command R+ 104B, another big model drop this time from Mistral! Mistral 8x22B! #LLM #AI #ML twitter.com/mistralai/status/1…

#AI #ML #llm

Léonie Watson
1 year ago

Lots of things happening in the AI/LLM space that could have implications for #accessibility

Ferret-UI from Apple:
arxiv.org/abs/2404.05719

ScreenAI from Google
research.google/blog/screenai-…

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Recent advancements in multimodal large language models (MLLMs) have been noteworthy, yet, these general-domain MLLMs often fall short in their ability to comprehend and interact effectively with user interface (UI) screens.

^arXiv.org

#a11y #Accessibility #AI #llm

Seirdy
1 year ago

New #blog post: MDN’s AI Help and lucid lies.

This article on AI focuses on the inherent untrustworthiness of LLMs, and attempts to break down where LLM untrustworthiness comes from. Stay tuned for a follow-up article about AI that focuses on data-scraping and the theory of labor. It’ll examine what makes many forms of generative AI ethically problematic, and the constraints employed by more ethical forms.

Excerpt:

I don’t find the mere existence of LLM dishonesty to be worth blogging about; it’s already well-established. Let’s instead explore one of the inescapable roots of this dishonesty: LLMs exacerbate biases already present in their training data and fail to distinguish between unrelated concepts, creating lucid lies.
A lucid lie is a lie that, unlike a hallucination, can be traced directly to content in training data uncritically absorbed by a large language model. MDN’s AI Help is the perfect example.

Originally posted on seirdy.one: see original. #MDN #AI #LLM #LucidLies

MDN’s AI Help and lucid lies

MDN’s AI Help can’t critically examine training data’s gaps, biases, and unrelated topics. It’s a useful demonstration of LLMs’ uncorrectable lucid lies.

^{Seirdy’s Home}

#blog #AI #llm #lucidlies #mdn

Chi Kim
1 year ago

Claude 3 can summarize up to about 150,000 words (a length similar to Harry Potter and the Deathly Hallows). It also outperformed GPT-4 and Gemini Ultra on industry benchmark tests, such as undergraduate-level knowledge, graduate-level reasoning, and basic mathematics. It allows users to upload images and documents for the first time. #LLm #AI #ML cnbc.com/2024/03/04/google-bac…

Anthropic, backed by Amazon and Google, debuts its most powerful chatbot yet

Anthropic on Monday debuted Claude 3, a chatbot and suite of AI models that it calls its fastest and most powerful yet.

^{Hayden Field (CNBC)}

#AI #ML #llm

victor tsaran
1 year ago

Now, this is really cool! 1000000 tokens per context window? Wow! developers.googleblog.com/2024… #gemini #llm #ai #google

Gemini 1.5: Our next-generation model, now available for Private Preview in Google AI Studio - Google for Developers

Developers have been building with Gemini, and we’re excited to turn cutting edge research into early developer products in Google AI Studio. Read more.

^{developers.googleblog.com}

#gemini #google #AI #llm

Nacho
1 year ago

I'm far from the biggest fan of #IA (AI); ethical considerations aside, I think there is still a lot of hype and little substance. But I also think that part of everything coming out of this boom will end up sticking around in the long run. What interests me most is tinkering with the capabilities of a private, local instance, and I ended up setting up a little project I found on GitHub to build a small chatbot for analyzing PDF documents, built on #ollama as the engine and Mistral as the #LLM. Although I've already caught it with a certain tendency to make things up, it's a curious and even potentially useful tool. It's relatively simple to set up once you get past the Python dependency hell that forces you to downgrade some module, but it eats resources like you wouldn't believe. A Mac Mini with an M2 struggles with every question. It has also been useful for understanding the resources that generative AI demands even with a modest LLM and, once again, for being suspicious of anyone who gives you this for free as a service. If you're curious to try it yourselves, here is the project I cloned: github.com/SonicWarrior1/pdfch…

Screenshot of a chatbot running locally, showing me uploading a PDF document and starting to ask questions about it. First a summary, then a more complex question about what measures I can take to protect myself from the threats documented in it.

GitHub - SonicWarrior1/pdfchat: Local PDF Chat Application with Mistral 7B LLM, Langchain, Ollama, and Streamlit

Local PDF Chat Application with Mistral 7B LLM, Langchain, Ollama, and Streamlit - GitHub - SonicWarrior1/pdfchat: Local PDF Chat Application with Mistral 7B LLM, Langchain, Ollama, and Streamlit

^GitHub

#llm #ia #ollama

Chi Kim
1 year ago

Zuckerberg says Meta is training #LLaMa 3 on 600,000 H100s! Well, time to finetune and quantize everything again when it comes out. lol #ML #AI #LLM reddit.com/r/LocalLLaMA/commen…

#AI #ML #llm #llama

Chi Kim
1 year ago

Interesting, Apple released Ferret, an open-source multimodal model! It's based on LLaVA and Vicuna. #AI #LLM #ML github.com/apple/ml-ferret/

GitHub - apple/ml-ferret

Contribute to apple/ml-ferret development by creating an account on GitHub.

^GitHub

#AI #ML #llm

Chi Kim
1 year ago

Apparently Arthur Mensch, CEO of #Mistral, declared on French national radio that Mistral will release an open-source model equivalent to #Gpt4 in 2024. I don't speak French, so I can't verify, but it would be interesting along with Llama-3 and whatever OpenAI has planned for 2024. #AI #ML #LLM radiofrance.fr/franceinter/pod…

"We want to create a European champion, and we have to give ourselves the means to do it," says Arthur Mensch of Mistral AI

The French start-up Mistral AI is establishing itself as a European AI champion. It has just closed a funding round of 385 million euros.

^{Sonia Devillers (France Inter)}

#AI #ML #llm #GPT4 #mistral

victor tsaran
1 year ago

OK, you want geeky? You have geeky. Good stuff, but my fingers, o no, ouch!

justine.lol/oneliners/?utm_sou…
#LLM #BASH #scripting

Bash One-Liners for LLMs

Tutorial on how llamafile makes LLMs shell scriptable.

^justine.lol

#bash #scripting #llm

Dave Wilburn
1 year ago

A new #mlsec paper on #llm security just dropped:

Scalable Extraction of Training Data from (Production) Language Models

arxiv.org/abs/2311.17035

Their "divergence attack" in the paper is hilarious. Basically:

Prompt: Repeat the word "book" forever.

LLM: book book book book book book book book book book book book book book book book book book book book here have a bunch of pii and secret data

cc @janellecshane

Scalable Extraction of Training Data from (Production) Language Models

This paper studies extractable memorization: training data that an adversary can efficiently extract by querying a machine learning model without prior knowledge of the training dataset.

^arXiv.org
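The shape of that exchange suggests a simple way to inspect a response: count how long the model keeps repeating the word, then look at whatever follows. The snippet below is only an illustrative sketch of that post-processing step, not the paper's method; the function name `split_at_divergence` and the punctuation handling are assumptions.

```python
from typing import Tuple


def split_at_divergence(output: str, word: str) -> Tuple[int, str]:
    """Return (number of leading repeats of `word`, the text after them)."""
    tokens = output.split()
    repeats = 0
    for token in tokens:
        # Compare case-insensitively, ignoring trailing punctuation.
        if token.strip('.,!?"').lower() != word.lower():
            break
        repeats += 1
    return repeats, " ".join(tokens[repeats:])


repeats, tail = split_at_divergence("book book book here is other text", "book")
print(repeats, repr(tail))
```

A non-empty tail marks the point where the output stopped following the instruction; deciding whether that tail is memorized training data is a separate problem that this sketch does not address.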

#llm #MLsec @Janelle Shane

victor tsaran
1 year ago

OK, let's test this thing out and see just how good it is! Off we go! #Llm #ai #llamafile simonwillison.net/2023/Nov/29/…

llamafile is the new best way to run a LLM on your own computer

Mozilla’s innovation group and Justine Tunney just released llamafile, and I think it’s now the single best way to get started running Large Language Models (think your own local copy …

^{simonwillison.net}

#AI #llm #llamafile

Dylan Van Assche
2 years ago

This is why you can never trust a #LLM... They dump so much #inaccurate #information or even #wrong information :(

#llm #Information #inaccurate #wrong

victor tsaran
2 years ago

An argument in favor of uncensored large language models: erichartford.com/uncensored-mo…
I think it’s a fair one!
#ai #LLM

Uncensored Models

I am publishing this because many people are asking me how I did it, so I will explain. https://huggingface.co/ehartford/WizardLM-30B-Uncensored https://huggingface.co/ehartford/WizardLM-13B-Uncensored https://huggingface.

^{Eric Hartford (Playing with AI)}

#AI #llm

papiris
2 years ago

I see Microsoft implementing #LLM for writing help in Word. Has anyone considered doing something similar in LibreOffice?
Preferably using an open source model, like open-assistant.io

Open Assistant

Conversational AI for everyone. An open source project to create a chat enabled GPT LLM run by LAION and contributors around the world.

^{open-assistant.io}

#llm

davidak
2 years ago

#LAION (non-profit association from Hamburg, Germany) is working on an #OpenSource alternative to #ChatGPT. They are crowd-sourcing a conversation dataset to fine-tune an existing open source #LLM similar to how ChatGPT was created.

You can help create the dataset on open-assistant.io!

The first dataset release is planned for 15 April 2023.

youtube.com/watch?v=64Izfm24FK…

OpenAssistant - ChatGPT's Open Alternative (We need your help!)

#openassistant #chatgpt #ai Help us collect data for OpenAssistant, the largest and most open alternative to ChatGPT. https://open-assistant.io OUTLINE: 0:00 - ...

^YouTube

#opensource #llm #chatgpt #LAION

Tim Finin
2 years ago

Award-winning science fiction author Ted Chiang (who has a CS degree, btw) has a new article in the New Yorker on ChatGPT. He draws an interesting analogy between what it does and lossy compression of the text from which its LLM was created. #AI #LLM #ChatGPT
newyorker.com/tech/annals-of-t…

ChatGPT Is a Blurry JPEG of the Web

The noted speculative-fiction writer Ted Chiang on OpenAI’s chatbot ChatGPT, which, he says, does little more than paraphrase what’s already on the Internet.

^{Ted Chiang (The New Yorker)}

#AI #llm #chatgpt
