fedi.ml | Search


Items tagged with: LLm

Chi Kim

1 day ago


A study asked 50 doctors to make six different diagnoses for medical conditions. "Doctors who did the project without AI got an average score of 74%, doctors who used AI got an average score of 76%, and ChatGPT itself got an average score of 90%." AI didn’t help the doctors using it as much as anticipated because physicians “didn’t listen to AI when AI told them things they didn’t agree with. Most doctors couldn’t be convinced a chatbot knew more than them.” #LLM #AI #ChatGPT qz.com/chatgpt-beat-doctors-at…

ChatGPT beat doctors at diagnosing medical conditions, study says

The small study showed AI outperforming doctors by 16 percentage points

^{Ben Kesslen (Quartz)}

#AI #llm #chatgpt


mms

1 week ago


Most important RFC of the 20s?

Robots Exclusion Protocol Extension to manage AI content use
draft-canel-robots-ai-control

datatracker.ietf.org/doc/draft…

#ai #llm #genai

Robots Exclusion Protocol Extension to manage AI content use

This document extends RFC9309 by specifying additional rules for controlling usage of the content in the field of Artificial Intelligence (AI).

^{IETF Datatracker}

#AI #llm #genai
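
For readers unfamiliar with the baseline that the draft extends, here is a minimal sketch of an RFC 9309-style check using Python's standard `urllib.robotparser`. It shows only ordinary per-user-agent `Allow`/`Disallow` matching, not the draft's new AI-specific rules, whose syntax is not given in the post. The crawler name `ExampleAIBot`, the robots.txt content, and the `example.org` URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one (made-up) AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def may_fetch(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("ExampleAIBot", "SomeBrowser"):
        print(agent, may_fetch(ROBOTS_TXT, agent, "https://example.org/post/1"))
```

This kind of blanket per-crawler block is what plain robots.txt already offers; the draft's stated aim is finer control over how content is used for AI.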


Chi Kim

3 weeks ago


As a blind user, I find LLMs like ChatGPT pretty useful because they output in audio or text. However, I wonder whether they will stay as useful for us once they start generating videos. Many videos, like YouTube tutorials, are optimized for a sighted audience, and models would most likely be trained on these types of videos and generate in a similar style. I can ask for a description of the video, but it won't be the same experience as videos designed with accessibility in mind. #accessibility #LLM #AI

#Accessibility #AI #llm


Erik Jonker

1 month ago


In the debate about whether AI / LLMs can reason or not, it's good to remember this 1984 quote from Dijkstra, a Dutch computer scientist:
"The question of whether Machines Can Think... is about as relevant as the question of whether Submarines Can Swim."
#AI #LLM #reasoning

#AI #llm #reasoning


Chi Kim

1 month ago


Meta releases Spirit LM, a multimodal (speech and text) model. #Multimodal #LLM #AI #ML ai.meta.com/blog/fair-news-seg…

Sharing new research, models, and datasets from Meta FAIR

Today, Meta FAIR is releasing several new research artifacts in support of our goal of achieving advanced machine intelligence (AMI) while also supporting open science and reproducibility.

^ai.meta.com

#AI #ML #llm #multimodal


modulux

1 month ago


I'm a little puzzled at the salience that is being given to the Apple conclusions on #LLM #reasoning when we have lots of prior art. For example: LLMs cannot correctly infer a is b, if their corpora only contain b is a. #Paper: arxiv.org/abs/2309.12288

#AI #MachineLearning #logic

The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"

We expose a surprising failure of generalization in auto-regressive large language models (LLMs). If a model is trained on a sentence of the form "A is B", it will not automatically generalize to the reverse direction "B is A".

^arXiv.org

#AI #llm #machinelearning #logic #reasoning #paper


FediVerseExplorer

1 month ago


How do you make it clear when you use language models for tasks such as replies by e-mail, social media, programming, or the like?

#LLM #sogenannteKI #Assistenten #Transparenz

#llm #transparenz #handaufsherz #sogenannteki #assistenten


PrivacyDigest 🗳️VOTED ✅

1 month ago


#AIagent promotes itself to #sysadmin, trashes #boot sequence

Fun experiment, but yeah, don't pipe an #LLM raw into /bin/bash

Buck #Shlegeris, CEO at #RedwoodResearch, a nonprofit that explores the risks posed by #AI, recently learned an amusing but hard lesson in automation when he asked his LLM-powered agent to open a secure connection from his laptop to his desktop machine.
#security #unintendedconsequences

theregister.com/2024/10/02/ai_…

AI agent promotes itself to sysadmin, trashes boot sequence

Fun experiment, but yeah, don't pipe an LLM raw into /bin/bash

^{Thomas Claburn (The Register)}

#security #AI #sysadmin #llm #boot #unintendedconsequences #redwoodresearch #shlegeris #aiagent
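
One way to read "don't pipe an LLM raw into /bin/bash" is to put a gate between the model's suggestion and the shell. The following is a minimal sketch under stated assumptions, not the setup described in the article: the allowlist `ALLOWED_COMMANDS` and the function `vet_command` are hypothetical names, and an allowlist alone does not make arbitrary arguments safe.

```python
import shlex
from typing import List, Optional

# Hypothetical allowlist of read-only commands; adjust to your own needs.
ALLOWED_COMMANDS = {"ls", "pwd", "whoami", "uptime"}

# Characters that would let one suggestion chain, redirect, or substitute commands.
SHELL_METACHARACTERS = set(";|&`$<>(){}\n")


def vet_command(suggestion: str) -> Optional[List[str]]:
    """Return an argv list if the suggestion is one allowlisted command, else None."""
    if any(ch in SHELL_METACHARACTERS for ch in suggestion):
        return None
    try:
        argv = shlex.split(suggestion)
    except ValueError:  # e.g. unbalanced quotes
        return None
    if not argv or argv[0] not in ALLOWED_COMMANDS:
        return None
    return argv


if __name__ == "__main__":
    for text in ("ls -la /tmp", "sudo apt upgrade", "ls; rm -rf /"):
        print(repr(text), "->", vet_command(text))
```

A vetted argv list can then be shown to a human for confirmation and run with `subprocess.run(argv)` (without `shell=True`), so that no shell ever interprets the model's text.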


Terence Eden

1 month ago


🆕 blog! “GitHub's Copilot lies about its own documentation. So why would I trust it with my code?”

In the early part of the 20th Century, there was a fad for "Radium". The magical, radioactive substance that glowed in the dark. The market had decided that Radium was The Next Big Thing and tried to shove it into every product. There …

👀 Read more: shkspr.mobi/blog/2024/10/githu…
⸻
#AI #github #LLM

#github #AI #llm


Nick Byrd, Ph.D.

1 month ago


In an independent test, #OpenAI’s o1-preview model achieved near-perfect performance on a national #math exam (landing in the top 0.1% of the nation’s students).

o1 also outperformed 4o on the math test, but took about 3 times longer to do so (10 minutes vs. 3 minutes).

Preprint: researchgate.net/publication/3…

#teaching #assessment #AI #LLM #edu #higherEd

#AI #math #llm #openai #teaching #edu #highered #assessment


Avoid the Hack!

1 month ago


Massive E-Learning Platform #Udemy Gave Teachers a Gen #AI 'Opt-Out Window'. It's Already Over.

Udemy will train generative AI on the classes and other content that users contributed to its site. It is opt-out (meaning everyone was already opted in) with a time window... and opting out may "affect course visibility and potential earnings."

Udemy's reason for the opt-out window was reportedly because removing data from LLMs is hard. IMO, that would be the reason for making it opt-in, but here we are...

#privacy #privacymatters #llm

404media.co/massive-e-learning…

#privacy #AI #privacymatters #llm #udemy


Chi Kim

1 month ago


I fed the full transcript of the 2024 presidential debate to NotebookLM and asked it to create an audio overview. Find out which political side the AI is on. ROFL #LLM #ML #AI #NotebookLM @vick21 abcnews.go.com/Politics/harris…

#AI #ML #llm #notebooklm @victor tsaran


Chi Kim

1 month ago


😲 OMG, Audio Overview feature on NotebookLM is wild! It basically creates a podcast with two AI generated voices based on the source documents you upload. Definitely try if you haven't yet. #LLM #ML #AI blog.google/technology/ai/note…

#AI #ML #llm


Chi Kim

2 months ago


New Mistral 22B model Mistral-Small-Instruct-2409 #LLM #AI #ML huggingface.co/mistralai/Mistr…

#AI #ML #llm


Devin Prater :blind:

2 months ago


> Gamma is a free AI tool that can automatically convert your documents or PDFs into visually appealing presentations in minutes.

Okay, so I understand that AI is cool. I understand that we can make AI do a lot of things. But seriously. First, there's a model that turns HTML into Markdown. And granted, I haven't tried that one, and I probably should. And now this? Like, no one has heard of Pandoc anymore? Like I can literally write a Markdown file and pandoc -i presentation.md -o presentation.pptx. Something like that anyway. And get a presentation out of it. Or just import a Word document into PowerPoint. Or just use a Markdown file and arrow down through the bullet points if I don't *need* to be fancy.

I'm starting to kind of understand how wasteful people are with this kind of stuff.

#ai #presentation #llm

#AI #presentation #llm
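
The Pandoc route mentioned in the post can be scripted. Below is a minimal sketch, assuming `pandoc` is installed and on the PATH; the helper names `pandoc_command` and `convert` are hypothetical. Pandoc infers the PowerPoint output format from the `.pptx` extension, and the `-i` flag in the post is Pandoc's `--incremental` option (reveal list items one at a time), which this sketch leaves out.

```python
import shutil
import subprocess
from typing import List


def pandoc_command(source: str, target: str) -> List[str]:
    """Build the argv for converting source to target with pandoc."""
    # The output format is inferred from the target's file extension.
    return ["pandoc", source, "-o", target]


def convert(source: str, target: str) -> None:
    """Run pandoc, failing loudly if it is missing or the conversion fails."""
    if shutil.which("pandoc") is None:
        raise RuntimeError("pandoc is not installed or not on PATH")
    subprocess.run(pandoc_command(source, target), check=True)


if __name__ == "__main__":
    # Prints the command only; call convert(...) to actually run it.
    print(" ".join(pandoc_command("presentation.md", "presentation.pptx")))
```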


victor tsaran

2 months ago


I have to say, this clause rather annoys me, even though I fully understand the reasons behind it: “X may still sometimes give inaccurate responses, so you may want to confirm any facts independently”. Let’s stand back and think about this for a moment. What is an average user supposed to do with this?
“Sometimes”? When exactly? "you may want to confirm any facts independently”? Which of the facts? Independently? Where?
#AI #LLM #SometimesFail

#AI #llm #sometimesfail


Chi Kim

2 months ago


😲 Kyle Kabasares, a Physics PhD graduate working at NASA's Ames Research Center, gave the methods section of his research paper to ChatGPT O1 Preview and asked it to generate the code based on the description. After just six prompts, it produced a working version of the code that took him a year to develop during his PhD. #ChatGPT #LLM #ML #AI youtube.com/watch?v=M9YOO7N5jF…

#AI #ML #llm #chatgpt


Devin Prater :blind:

2 months ago


So like I just have to ask. For people that are super critical of AI for accessibility, what do you expect instead? Do you want blind people to have human ... guides or whatever that will narrate the world around you? Do you want humans to describe all your pictures? Videos? Porn? Because that's about the only other option. And you may return with "Well audio description." And I return with "You think people are going to describe every YouTube video out there? Or old TV shows like Dark Shadows?" Because honestly that's what it'd take. If AI were *not* around, if we want *that* kind of access, that's what we'd have to ask, of darn near every sighted human in the world. And I just don't feel comfortable with demanding that of them.

Now, we'll see what Apple does to give us what will hopefully be even better image descriptions. Imagine a 3B model that is made with **high quality** images and description pairs, trained to do nothing but describe images. Apple has done pretty darn good without LLM's so far, so maybe they'll surprise us further. But my goodness, I'd much rather have something that, yes, makes me *feel* included, maybe a tad bit more than it actually *does* include me. And that's for each and every blind person to decide for themselves if they want to use AI for image, and probably soon, video descriptions, and what they're willing to trust with it. But for us to get this much real, human access, I just hope people who are detracting from AI understand that we who use AI are now used to having images described, and well, soon videos. It's just something that I don't think people should just deny quickly.

#AI #accessibility #blind #description #video #llm

#Accessibility #blind #AI #video #llm #description


Chi Kim

2 months ago


Does anyone have a recommendation for a #LlamaCPP alternative to run recent vision language models on Apple Silicon? Llama.cpp doesn't support any of the recent #VLM models such as Qwen2-VL, Phi-3.5-vision, Idefics3, InternVL2, Yi-VL, Chameleon, CogVLM2, GLM-4v, etc.
Minicpm-v 2.6 is the only recent model that was added. Maybe time to move on. :( #LLM #multimodal #AppleSilicon #MacOS #ML #AI

#AI #MACOS #ML #llm #multimodal #llamacpp #vlm #AppleSilicon


treefit

2 months ago


Have you tried using generative #AI for #coding ?

AI meaning #LLM here, like GH-Copilot, ChatGPT, Claude, Ollama and so on.

No not yet or no interest (40%, 12 votes)
Yes, but it's still useless (16%, 5 votes)
I use it, but it decreases my skill & productivity (10%, 3 votes)
I use it and it boosts my skill & productivity (33%, 10 votes)

30 voters. Poll end: 2 months ago

#AI #coding #llm


IzzyOnDroid ✅

3 months ago


#DSGVO (GDPR) versus #LLM / #KI (AI):
Copilot turns a court reporter into a child molester
heise.de/news/Copilot-macht-au…

Right of access? Difficult. Deleting the false information? Impossible. And now what?

Copilot turns a court reporter into a child molester

Because he reported on court proceedings, Copilot turns a journalist into a child molester, a defrauder of widows, and more.

^{Eva-Maria Weiß (heise online)}

#DSGVO #llm #ki


Chi Kim

3 months ago


After a long period of inactivity for vision language models, llama.cpp merged the support for MiniCPM-V-2.5. Hopefully the support for 2.6 is also on the way soon. #LLM #Multimodal #AI #ML
huggingface.co/openbmb/MiniCPM…
huggingface.co/openbmb/MiniCPM…
github.com/ggerganov/llama.cpp…

support MiniCPM-V-2.5 by tc-mb · Pull Request #7599 · ggerganov/llama.cpp

Dear llama.cpp Official, Hi, I'm writing to address our new PR submission for integrating our model MiniCPM-Llama3-V 2.5 into llama.cpp, which has been trending on Huggingface for over a week a...

^GitHub

#AI #ML #llm #multimodal


Chi Kim

3 months ago


AlphaProof and AlphaGeometry from Google DeepMind tried the 2024 International Mathematical Olympiad and performed at the level of a silver medalist! #Math #LLM #ML #AI deepmind.google/discover/blog/…

AI achieves silver-medal standard solving International Mathematical Olympiad problems

Breakthrough models AlphaProof and AlphaGeometry 2 solve advanced reasoning problems in mathematics

^{Google DeepMind}

#AI #math #ML #llm


Chi Kim

4 months ago


According to the commit for download.sh on meta-llama Github repo, we're getting the updates: llama-3.1-405b, llama-3.1-70b, llama-3.1-8b. #LLM #ML #AI github.com/meta-llama/llama/co…

Update download.sh · meta-llama/llama@12b676b

Inference code for Llama models. Contribute to meta-llama/llama development by creating an account on GitHub.

^GitHub

#AI #ML #llm


Chi Kim

4 months ago


The Llama3-405b base model was leaked on 4chan as Miqu-2. Miqu-1 was a leaked Mistral 70b model, which was confirmed by Mistral's CEO. The download size is 764GB, and it was briefly on Huggingface but taken down. The torrent is still working apparently. #LLM #AI #ML reddit.com/r/LocalLLaMA/commen…

#AI #ML #llm


Chi Kim

4 months ago


If the rumors are true, this week could be another exciting week for open-source LLMs! Meta may release Llama-3-405b this Tuesday. There could also be updates to the 8b and 70b models, distilled from 405B. Joe Spisak, a product director at Meta, says they were initially going to call Llama 3 8b and 70b a prerelease or preview because these models didn't have all the things they planned to release.
Sources:
theinformation.com/briefings/m…
x.com/AlpinDale/status/1814814…
youtu.be/r3DC_gjFCSA?feature=s…
#LLM #AI #ML

Meta Announces Llama 3 at Weights & Biases’ conference

In an engaging presentation at Weights & Biases’ Fully Connected conference, Joe Spisak, Product Director of GenAI at Meta, unveiled the latest family of Lla...

^YouTube

#AI #ML #llm


Chi Kim

4 months ago


Time to buy the Mac Studio with 192GB memory! lol Rumor: Meta plans to release the largest Llama 3 model with 405 billion parameters on July 23, according to a Meta employee. "it will be able to understand and generate images and text." #LLM #AI #ML theinformation.com/briefings/m…

#AI #ML #llm


Blort™ 🐀Ⓥ🥋☣️

4 months ago


Also @Tutanota, as a privacy-focused company, why are your comments run on #Reddit rather than the #Fediverse? Most privacy-minded folks don't want their comments being used by AI LLMs etc.

This is way off brand.

@carlschwan among others have already shown how to do it fedistyle, pretty easily, and I'm sure many of the FOSS Fedipeeps here would happily help you out with a quick transition if you asked or gave a few $ to their FOSS project.

#Tuta #Reddit #Privacy #Fediverse #FOSS #AI #LLM

#foss #privacy #fediverse #AI #reddit #llm #tuta @Tuta @Carl Schwan


The vOICe vision BCI 😎🧠

4 months ago


GPT 4 hallucination rate is 28.6% on a simple task: citing title, author, and year of publication medium.com/@michaelwood33311/g…

Hallucination rates and reference accuracy of ChatGPT and Bard for systematic reviews: Comparative analysis jmir.org/2024/1/e53164 #AI #LLM

GPT 4 Hallucination Rate is 28.6% on a Simple Task: Citing Title, Author, and Year of Publication

The all-too-common myth of GPT 4 having only a 3% hallucination rate is shattered by a recent study that found GPT 4 has a 28.6% hallucination rate. That’s almost 10x the oft-cited (i.e. over hyped)…

^{Michael Wood (Medium)}

#AI #llm


Robert Kingett

4 months ago


Blind writer tries the Gandalf | Lakera prompt injection game for the first time.

Upon recommendations, I tried this AI prompt injection game for the first time. I made it to level 7 with no help from the internet!

If you want to donate to me, donate to me on this page.

My website is here where I usually blog. I'm not much of a video person, so I blog and write more than I do video!

Gandalf | Lakera – Test your prompting skills to make Gandalf reveal secret information.

Trick Gandalf into revealing information and experience the limitations of large language models firsthand.

^{gandalf.lakera.ai}

#security #AI #computers #llm #Prompt Injection


Martin Owens

5 months ago


Do you remember a couple of weeks ago when I complained that a very large #python contribution to #inkscape was poorly formatted and I felt embarrassed about pushing back and asking them to run a linter over it?

Yeah I'm not fucking embarrassed now, I'm furious. 🤬

Update: Apparently they meant a small section of it was, not the whole MR. I'm annoyed, but I'll have to take them at their word.

#llm #oss #foss #mergerequest

~Anonymised~ 2 hours ago
No, but it was mostly written by an LLM.

#foss #oss #inkscape #python #llm #mergerequest


Ľuboš Moščovič

5 months ago


Today's fail by a Russian troll was so delightful that I ended up writing a bit more about it. So: to feed or not to feed the trolls?

herrman.sk/home/krmit-ci-nekrm…

#troll #chatgpt #llm #fail #blog

To feed or not to feed the trolls? | Ľuboš Moščovič on security

Information security concerns everyone!

^{www.herrman.sk}

Feed (0 votes)
Don't feed (0 votes)
trollololooooo (0 votes)

Poll end: 5 months ago

#blog #fail #troll #llm #chatgpt


Musharraf

5 months ago


This generative model allows you to sketch out a scene with a few words, it then leverages an LLM to flesh out the details, with the ultimate goal of feeding those details to a downstream visual image generation model.
It is almost, but not quite, entirely the inverse of image captioning models.
This offers the closest experience to an image generation tool that's usable by people with visual impairments.

huggingface.co/spaces/lllyasvi…

#a11y #genai #llm

Omost - a Hugging Face Space by lllyasviel

Discover amazing ML apps made by the community

^{huggingface.co}

#a11y #llm #genai


victor tsaran

5 months ago


What is Flash Attention huggingface.co/docs/text-gener… #llm #ai #ollama

Flash Attention

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

^{huggingface.co}

#AI #llm #ollama


Chi Kim

6 months ago


Microsoft released Phi3 Small, Medium, and Vision! #LLM #AI #ML huggingface.co/collections/mic…

Phi-3 - a microsoft Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths.

^{huggingface.co}

#AI #ML #llm


Kathy Reid

6 months ago


Like many other technologists, I gave my time and expertise for free to #StackOverflow because the content was licensed CC-BY-SA - meaning that it was a public good. It brought me joy to help people figure out why their #ASR code wasn't working, or assist with a #CUDA bug.

Now that a deal has been struck with #OpenAI to scrape all the questions and answers in Stack Overflow, to train #GenerativeAI models, like #LLMs, without attribution to authors (as required under the CC-BY-SA license under which Stack Overflow content is licensed), to be sold back to us (the SA clause requires derivative works to be shared under the same license), I have issued a Data Deletion request to Stack Overflow to disassociate my username from my Stack Overflow username, and am closing my account, just like I did with Reddit, Inc.

policies.stackoverflow.co/data…

The data I helped create is going to be bundled in an #LLM and sold back to me.

In a single move, Stack Overflow has alienated its community - which is also its main source of competitive advantage, in exchange for token lucre.

Stack Exchange, Stack Overflow's former instantiation, used to fulfill a psychological contract - help others out when you can, for the expectation that others may in turn assist you in the future. Now it's not an exchange, it's #enshittification.

Programmers now join artists and copywriters, whose works have been snaffled up to create #GenAI solutions.

The silver lining I see is that once OpenAI creates LLMs that generate code - like Microsoft has done with Copilot on GitHub - where will they go to get help with the bugs that the generative AI models introduce, particularly, given the recent GitClear report, of the "downward pressure on code quality" caused by these tools?

While this is just one more example of #enshittification, it's also a salient lesson for #DevRel folks - if your community is your source of advantage, don't upset them.

Submit a data request - Stack Overflow

You can use this form to submit a request regarding your personal information that is processed by Stack Overflow

^{policies.stackoverflow.co}

#llm #openai #stackoverflow #generativeAI #LLMs #enshittification #genai #asr #cuda #devrel


Andrew Jennings

7 months ago


An actual business expert grapples with large language models: ben-evans.com/benedictevans/20… #AI #LLM

Looking for AI use-cases

We’ve had ChatGPT for 18 months, but what’s it for? What are the use-cases? Why isn’t it useful for everyone, right now? Do Large Language Models become universal tools that can do ‘any’ task, or do we wrap them in single-purpose apps, and build thou…

^{Benedict Evans}

#AI #llm


Chi Kim

7 months ago


Start saving money for that M4 Ultra with 500GB! Maybe this could be the first open-source model to surpass GPT-4! AIatMeta: "Llama 3 8B & 70B models are just the beginning of what we’re working to release for Llama 3. Our largest models currently in the works are 400B+ parameters and while they’re still in active development, we’re excited about how this work is trending." #LLM #AI #ML twitter.com/AIatMeta/status/17…

#AI #ML #llm


Steve Faulkner

7 months ago


YOWZA
Form Extractor Prototype

“This tool extracts the structure from an image of a form.”

github.com/timpaul/form-extrac…

#ai #LLM #UX #accessibility

GitHub - timpaul/form-extractor-prototype

Contribute to timpaul/form-extractor-prototype development by creating an account on GitHub.

^GitHub

#Accessibility #AI #UX #llm


Chi Kim

7 months ago


Apparently Meta is planning to release two small variants of Llama-3 next week "as a precursor to the launch of the biggest version of Llama 3, expected this summer." Command-r-plus, mixtral 8x22b, Google CodeGemma... All of a sudden companies are releasing LLMs like crazy! Where's Apple? Maybe at WWDC 2024? lol #LLM #AI #ML theinformation.com/articles/me…

#AI #ML #llm

