Skip to main content

Search

Items tagged with: llm


A study asked 50 doctors to make six different diagnoses for medical conditions. "Doctors who did the project without AI got an average score of 74%, doctors who used AI got an average score of 76%, and ChatGPT itself got an average score of 90%." "AI didn’t help doctors using it as much as anticipated because physicians “didn’t listen to AI when AI told them things they didn’t agree. Most doctors couldn’t be convinced a chatbot knew more than them." #LLM #AI #ChatGPT qz.com/chatgpt-beat-doctors-at…


Most important RFC of the 20s?

Robots Exclusion Protocol Extension to manage AI content use
draft-canel-robots-ai-controll

datatracker.ietf.org/doc/draft…

#ai #llm #genai

#AI #llm #genai


As a blind user, I find LLMs like ChatGPT pretty useful because they output in audio or text. However, I wonder if they start generating videos, it might not be useful for us. Many videos, like YouTube tutorials, are often optimized for sighted audience, and most likely models would be trained on these types of videos to generate in similar style. I can ask to describe the video, but it won't be the same experience as videos designed with accessibility in mind. #accessibility #LLM #AI


In the debate about whether AI / LLMs can reason or not it's good to remember this quote from 1984 from Dijkstra , a dutch computer scientist,
""The question of whether Machines Can Think... is about as relevant as the question of whether Submarines Can Swim."
#AI #LLM #reasoning


Meta releases Spirit LM, a multimodal (speech text) model. #Multimodal #LLM #AI #ML ai.meta.com/blog/fair-news-seg…


I'm a little puzzled at the salience that is being given to the Apple conclusions on #LLM #reasoning when we have lots of prior art. For example: LLMs cannot correctly infer a is b, if their corpora only contain b is a. #Paper: arxiv.org/abs/2309.12288

#AI #MachineLearning #logic


#HandAufsHerz

Wie macht ihr deutlich, wenn ihr Sprachmodelle für Aufgaben nutzt, wie z.B. Antworten per E-Mail, SocialMedia, Programmierung oder ähnliches?

#LLM #sogenannteKI #Assistenten #Transparenz


#AIagent promotes itself to #sysadmin , trashes #boot sequence

Fun experiment, but yeah, don't pipe an #LLM raw into /bin/bash

Buck #Shlegeris, CEO at #RedwoodResearch, a nonprofit that explores the risks posed by #AI , recently learned an amusing but hard lesson in automation when he asked his LLM-powered agent to open a secure connection from his laptop to his desktop machine.
#security #unintendedconsequences

theregister.com/2024/10/02/ai_…


🆕 blog! “GitHub's Copilot lies about its own documentation. So why would I trust it with my code?”

In the early part of the 20th Century, there was a fad for "Radium". The magical, radioactive substance that glowed in the dark. The market had decided that Radium was The Next Big Thing and tried to shove it into every product. There …

👀 Read more: shkspr.mobi/blog/2024/10/githu…

#AI #github #LLM


Independent test of #OpenAI’s o1-preview model achieved near-perfect performance on a national #math exam (landing in the top .1% of the nation’s students).

o1 also outperformed 4o on the math test, but took about 3 times longer to do so (10 minutes vs. 3 minutes).

Preprint: researchgate.net/publication/3…

#teaching #assessment #AI #LLM #edu #higherEd


Massive E-Learning Platform #Udemy Gave Teachers a Gen #AI 'Opt-Out Window'. It's Already Over.

Udemy will train generative AI on classes developed/users contributed on its site. It is opt-out (meaning, everyone was already opted in) with a time window... and opting out may "affect course visibility and potential earnings."

Udemy's reason for the opt-out window was reportedly because removing data from LLMs is hard. IMO, that would be the reason for making it opt-in, but here we are...

#privacy #privacymatters #llm

404media.co/massive-e-learning…


I fed the full transcript of the 2024 presidential debate and asked NotebookLM to create an audio overview. Find out which political side the AI is on. ROFL #LLM #ML #AI #NotebookLM @vick21 abcnews.go.com/Politics/harris…


😲 OMG, Audio Overview feature on NotebookLM is wild! It basically creates a podcast with two AI generated voices based on the source documents you upload. Definitely try if you haven't yet. #LLM #ML #AI blog.google/technology/ai/note…
#AI #ML #llm


New Mistral 22B model Mistral-Small-Instruct-2409 #LLM #AI #ML huggingface.co/mistralai/Mistr…
#AI #ML #llm


> Gamma is a free AI tool that can automatically convert your documents or PDFs into visually appealing presentations in minutes.

Okay, so I understand that AI is cool. I understand that we can make AI do a lot of things. But seriously. First, there's a model that turns HTML into Markdown. And granted, I haven't tried that one, and I probably should. And now this? Like, no one has heard of Pandoc anymore? Like I can literally write a Markdown file and pandoc -i presentation.md -o presentation.pptx. Something like that anyway. And get a presentation out of it. Or just import a Word document into PowerPoint. Or just use a Markdown file and arrow down through the bullet points if I don't *need* to be fansy.

I'm starting to kind of understand how wasteful people are with this kind of stuff.

#ai #presentation #llm


I have to say, this clause rather annoys me, even though I fully understand the reasons behind it: “X may still sometimes give inaccurate responses, so you may want to confirm any facts independently”. Let’s stand back and think about this for a moment. What is an average user supposed to do with this?
“Sometimes”? When exactly? "you may want to confirm any facts independently”? Which of the facts? Independently? Where?
#AI #LLM #SometimesFail


😲 Kyle Kabasares, a Physics PhD graduate working at NASA's Ames Research Center, gave the methods section of his research paper to ChatGPT O1 Preview and asked it to generate the code based on the description. After just six prompts, it produced a working version of the code that took him a year to develop during his PhD. #ChatGPT #LLM #ML #AI youtube.com/watch?v=M9YOO7N5jF…


So like I just have to ask. For people that are super critical of AI for accessibility, what do you expect instead? Do you want blind people to have human ... guides or whatever that will narrate the world around you? Do you want humans to describe all your pictures? Videos? Porn? Because that's about the only other option. And you may return with "Well audio description." And I return with "You think people are going to describe every YouTube video out there? Or old TV shows like Dark Shadows?" Because honestly that's what it'd take. If AI were *not* around, if we want *that* kind of access, that's what we'd have to ask, of darn near every sighted human in the world. And I just don't feel comfortable with demanding that of them.

Now, we'll see what Apple does to give us what will hopefully be even better image descriptions. Imagine a 3B model that is made with **high quality** images and description pairs, trained to do nothing but describe images. Apple has done pretty darn good without LLM's so far, so maybe they'll surprise us further. But my goodness, I'd much rather have something that, yes, makes me *feel* included, maybe a tad bit more than it actually *does* include me. And that's for each and every blind person to decide for themselves if they want to use AI for image, and probably soon, video descriptions, and what they're willing to trust with it. But for us to get this much real, human access, I just hope people who are detracting from AI understand that we who use AI are now used to having images described, and well, soon videos. It's just something that I don't think people should just deny quickly.

#AI #accessibility #blind #description #video #llm


Does anyone have a recommendation for #LlamaCPP alternative to run recent vision language models on Apple Silicon? Llama.cpp doesn't support any of the recent #VLM such as Qwen2-VL, Phi-3.5-vision, Idefics3, InternVL2, Yi-VL, Chameleon, CogVLM2, GLM-4v, etc.
Minicpm-v 2.6 is the only recent model that was added. Maybe time to move on. :( #LLM #multimodal #AppleSilicon #MacOS #ML #AI


Have you tried using generative #AI for #coding ?

AI meaning #LLM here, like GH-Copilot, ChatGPT, Claude, Ollama and so on.

  • No not yet or no interest (40%, 12 votes)
  • Yes, but its still useless (16%, 5 votes)
  • I use it, but it decreases my skill & productivity (10%, 3 votes)
  • I use it and it boosts my skill & productivity (33%, 10 votes)
30 voters. Poll end: 2 months ago


#DSGVO versus #LLM / #KI :
Copilot macht aus einem Gerichtsreporter einen Kinderschänder
heise.de/news/Copilot-macht-au…

Recht auf Auskunft? Schwierig. Löschen der Falschinformationen? Unmöglich. Und nun?

#DSGVO #llm #ki


After a long period of inactivity for vision language models, llama.cpp merged the support for MiniCPM-V-2.5. Hopefully the support for 2.6 is also on the way soon. #LLM #Multimodal #AI #ML
huggingface.co/openbmb/MiniCPM…
huggingface.co/openbmb/MiniCPM…
github.com/ggerganov/llama.cpp…


AlphaProof and AlphaGeometry from Google DeepMind tried the 2024 International Mathematical Olympiad and performed at the level of a silver medalist! #Math #LLM #ML #AI deepmind.google/discover/blog/…
#AI #math #ML #llm


According to the commit for download.sh on meta-llama Github repo, we're getting the updates: llama-3.1-405b, llama-3.1-70b, llama-3.1-8b. #LLM #ML #AI github.com/meta-llama/llama/co…
#AI #ML #llm


Llama3-405b base model is leaked on 4chan as Miqu-2. Miqu-1 was leaked Mistral 70b model which was confirmed by Mistral CEO. The download size is 764GB, and it was briefly on Huggingface but taken down. The torrent is still working apparently. #LLM #AI #ML reddit.com/r/LocalLLaMA/commen…
#AI #ML #llm


If the rumors are true, this week could be another exciting week for opensource LLMS! Meta may release Llama-3-405b on this Tuesday. Also there could be updates to 8b and 70b models distilled from 405B. Joe Spisak, a product director at Meta says they were initially going to call Llama 3 8b and 70b a prerelease or preview because these models didn't have all the things they planned to release.
Sources:
theinformation.com/briefings/m…
x.com/AlpinDale/status/1814814…
youtu.be/r3DC_gjFCSA?feature=s…
#LLM #AI #ML
#AI #ML #llm


Time to buy the Mac Studio with 192GB memory! lol Rumor: Meta plans to release the largest Llama 3 model with 405 billion parameters on July 23, according to a Meta employee. "it will be able to understand and generate images and text." #LLM #AI #ML theinformation.com/briefings/m…
#AI #ML #llm


Also @Tutanota , as a privacy focused company, why are your comments run on #Reddit, rather than the #Fediverse? Most privacy minded folks don't want their comments being used by AI LLM's etc.

This is way off brand.

@carlschwan among others have already shown how to do it fedistyle, pretty easily, and I'm sure many of the FOSS Fedipeeps here would happily help you out with a quick transition if you asked or gave a few $ to their FOSS project.

#Tuta #Reddit #Privacy #Fediverse #FOSS #AI #LLM


GPT 4 hallucination rate is 28.6% on a simple task: citing title, author, and year of publication medium.com/@michaelwood33311/g…

Hallucination rates and reference accuracy of ChatGPT and Bard for systematic reviews: Comparative analysis jmir.org/2024/1/e53164 #AI #LLM

#AI #llm


Blind writer tries the Gandalf | Lakera prompt injection game for the first time.


Upon recommendations, I tried this AI prompt injection game for the first time. I made it to level 7 with no help from the internet!

If you want to donate to me, donate to me on this page.

My website is here where I usually blog. I'm not much of a video person, so I blog and write more than I do video!


Do you remember a couple of weeks ago when I complained that a very large #python contribution to #inkscape was poorly formatted and I felt embarrassed about pushing back and asking them to run a linter over it?

Yeah I'm not fucking embarrassed now, I'm furious. 🤬

Update: Apparently they meant a small section of it was, not the whole MR. I'm annoyed, but I'll have to take them at their word.

#llm #oss #foss #mergerequest


Dnešný fail ruzzkého trolla bol tak rozkošný, že som sa trošku rozpísal - takže kŕmiť či nekŕmiť trollov?

herrman.sk/home/krmit-ci-nekrm…

#troll #chatgpt #llm #fail #blog

  • Kŕmiť (0 votes)
  • Nekŕmiť (0 votes)
  • trollololooooo (0 votes)
Poll end: 5 months ago


This generative model allows you to sketch out a scene with a few words, it then leverages an LLM to flesh out the details, with the ultimate goal of feeding those details to a downstream visual image generation model.
It is almost, but not quite, entirely the inverse of image captioning models.
This offers the closest experience to an image generation tool that's usable by people with visual impairments.

huggingface.co/spaces/lllyasvi…

#a11y #genai #llm


What is Flash Attention huggingface.co/docs/text-gener… #llm #ai #ollama


Microsoft released Phi3 Small, Medium, and Vision! #LLM #AI #ML huggingface.co/collections/mic…
#AI #ML #llm


Like many other technologists, I gave my time and expertise for free to #StackOverflow because the content was licensed CC-BY-SA - meaning that it was a public good. It brought me joy to help people figure out why their #ASR code wasn't working, or assist with a #CUDA bug.

Now that a deal has been struck with #OpenAI to scrape all the questions and answers in Stack Overflow, to train #GenerativeAI models, like #LLMs, without attribution to authors (as required under the CC-BY-SA license under which Stack Overflow content is licensed), to be sold back to us (the SA clause requires derivative works to be shared under the same license), I have issued a Data Deletion request to Stack Overflow to disassociate my username from my Stack Overflow username, and am closing my account, just like I did with Reddit, Inc.

policies.stackoverflow.co/data…

The data I helped create is going to be bundled in an #LLM and sold back to me.

In a single move, Stack Overflow has alienated its community - which is also its main source of competitive advantage, in exchange for token lucre.

Stack Exchange, Stack Overflow's former instantiation, used to fulfill a psychological contract - help others out when you can, for the expectation that others may in turn assist you in the future. Now it's not an exchange, it's #enshittification.

Programmers now join artists and copywriters, whose works have been snaffled up to create #GenAI solutions.

The silver lining I see is that once OpenAI creates LLMs that generate code - like Microsoft has done with Copilot on GitHub - where will they go to get help with the bugs that the generative AI models introduce, particularly, given the recent GitClear report, of the "downward pressure on code quality" caused by these tools?

While this is just one more example of #enshittification, it's also a salient lesson for #DevRel folks - if your community is your source of advantage, don't upset them.


An actual business expert grapples with large language models: ben-evans.com/benedictevans/20… #AI #LLM
#AI #llm


Start saving money for that M4 Ultra with 500GB! Maybe this could be the first open source that could surpass GPT-4! AIatMeta: "Llama 3 8B & 70B models are just the beginning of what we’re working to release for Llama 3. Our largest models currently in the works are 400B+ parameters and while they’re still in active development, we’re excited about how this work is trending." #LLM #AI #ML twitter.com/AIatMeta/status/17…
#AI #ML #llm


YOWZA
Form Extractor Prototype

“This tool extracts the structure from an image of a form.”

github.com/timpaul/form-extrac…

#ai #LLM #UX #accessibility


Apparently Meta is planning to release two small varients of Llama-3 next week "as a precursor to the launch of the biggest version of Llama 3, expected this summer." Command-r-plus, mixtral 8x22b, Google CodeGemma... All of sudden companies are releasing LLMS like crazy! Where's Apple? Maybe In WWDC 2024? lol #LLM #AI #ML theinformation.com/articles/me…
#AI #ML #llm