Items tagged with: LLm
ChatGPT beat doctors at diagnosing medical conditions, study says
The small study showed AI outperforming doctors by 16 percentage points
Ben Kesslen (Quartz)
Most important RFC of the 20s?
Robots Exclusion Protocol Extension to manage AI content use
draft-canel-robots-ai-controll
datatracker.ietf.org/doc/draft…
Robots Exclusion Protocol Extension to manage AI content use
This document extends RFC9309 by specifying additional rules for controlling usage of the content in the field of Artificial Intelligence (AI).
IETF Datatracker
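For context, here is a minimal sketch of what plain RFC 9309 already allows: blocking a crawler by its user-agent name. It uses Python's standard `urllib.robotparser`. The crawler name `ExampleAIBot`, the robots.txt contents, and the helper `may_fetch` are all made up for illustration, and the draft's additional AI-specific rules are not shown here.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one (made-up) AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def may_fetch(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if the given robots.txt lets this user agent fetch the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/articles/1"
    print("ExampleAIBot:", may_fetch("ExampleAIBot", url))
    print("SomeOtherBot:", may_fetch("SomeOtherBot", url))
```

Blocking like this is all-or-nothing per crawler and says nothing about what the fetched content may be used for, which appears to be the gap the draft is aiming at.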
Sharing new research, models, and datasets from Meta FAIR
Today, Meta FAIR is releasing several new research artifacts in support of our goal of achieving advanced machine intelligence (AMI) while also supporting open science and reproducibility.
ai.meta.com
I'm a little puzzled at the salience that is being given to the Apple conclusions on #LLM #reasoning when we have lots of prior art. For example: LLMs cannot correctly infer a is b, if their corpora only contain b is a. #Paper: arxiv.org/abs/2309.12288
The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
We expose a surprising failure of generalization in auto-regressive large language models (LLMs). If a model is trained on a sentence of the form "A is B", it will not automatically generalize to the reverse direction "B is A".
arXiv.org
How do you make it clear when you use language models for tasks such as replying by email, social media, programming, or the like?
#AIagent promotes itself to #sysadmin , trashes #boot sequence
Fun experiment, but yeah, don't pipe an #LLM raw into /bin/bash
Buck #Shlegeris, CEO at #RedwoodResearch, a nonprofit that explores the risks posed by #AI , recently learned an amusing but hard lesson in automation when he asked his LLM-powered agent to open a secure connection from his laptop to his desktop machine.
#security #unintendedconsequences
theregister.com/2024/10/02/ai_…
AI agent promotes itself to sysadmin, trashes boot sequence
Fun experiment, but yeah, don't pipe an LLM raw into /bin/bash
Thomas Claburn (The Register)
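One way to avoid piping model output straight into a shell is to put a gate in between. The following is a minimal sketch, not what the agent in the article used: the allowlist contents and the function names `is_allowed` and `run_model_command` are hypothetical, and a real setup would likely also want human confirmation and logging.

```python
import shlex
import subprocess

# Hypothetical allowlist: the only programs the agent may run unattended.
ALLOWED_PROGRAMS = {"ls", "uptime", "whoami"}


def is_allowed(command: str, allowed=ALLOWED_PROGRAMS) -> bool:
    """Return True only if the command parses and its program is allowlisted."""
    try:
        argv = shlex.split(command)
    except ValueError:
        # Unbalanced quotes and similar parse errors are rejected outright.
        return False
    return bool(argv) and argv[0] in allowed


def run_model_command(command: str) -> str:
    """Run a model-proposed command without a shell, or refuse it."""
    if not is_allowed(command):
        raise PermissionError(f"refusing to run: {command!r}")
    # Passing an argv list (no shell=True) means ';', '&&' and '|' are never
    # interpreted by a shell; they reach the program as plain arguments.
    result = subprocess.run(
        shlex.split(command), capture_output=True, text=True, timeout=10
    )
    return result.stdout
```

With this gate, `sudo apt upgrade` is refused because `sudo` is not on the list, and `ls; rm -rf /` is refused because its first token parses as `ls;`.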
🆕 blog! “GitHub's Copilot lies about its own documentation. So why would I trust it with my code?”
In the early part of the 20th Century, there was a fad for "Radium". The magical, radioactive substance that glowed in the dark. The market had decided that Radium was The Next Big Thing and tried to shove it into every product. There …
👀 Read more: shkspr.mobi/blog/2024/10/githu…
⸻
#AI #github #LLM
In an independent test, #OpenAI’s o1-preview model achieved near-perfect performance on a national #math exam (landing in the top 0.1% of the nation’s students).
o1 also outperformed 4o on the math test, but took about 3 times longer to do so (10 minutes vs. 3 minutes).
Preprint: researchgate.net/publication/3…
Massive E-Learning Platform #Udemy Gave Teachers a Gen #AI 'Opt-Out Window'. It's Already Over.
Udemy will train generative AI on classes that users developed or contributed on its site. It is opt-out (meaning, everyone was already opted in) with a time window... and opting out may "affect course visibility and potential earnings."
Udemy's reported reason for the opt-out window is that removing data from LLMs is hard. IMO, that would be the reason for making it opt-in, but here we are...
> Gamma is a free AI tool that can automatically convert your documents or PDFs into visually appealing presentations in minutes.
Okay, so I understand that AI is cool. I understand that we can make AI do a lot of things. But seriously. First, there's a model that turns HTML into Markdown. And granted, I haven't tried that one, and I probably should. And now this? Like, no one has heard of Pandoc anymore? Like I can literally write a Markdown file and pandoc -i presentation.md -o presentation.pptx. Something like that anyway. And get a presentation out of it. Or just import a Word document into PowerPoint. Or just use a Markdown file and arrow down through the bullet points if I don't *need* to be fancy.
I'm starting to kind of understand how wasteful people are with this kind of stuff.
#ai #presentation #llm
“Sometimes”? When exactly? "you may want to confirm any facts independently”? Which of the facts? Independently? Where?
#AI #LLM #SometimesFail
So like I just have to ask. For people that are super critical of AI for accessibility, what do you expect instead? Do you want blind people to have human ... guides or whatever that will narrate the world around you? Do you want humans to describe all your pictures? Videos? Porn? Because that's about the only other option. And you may return with "Well audio description." And I return with "You think people are going to describe every YouTube video out there? Or old TV shows like Dark Shadows?" Because honestly that's what it'd take. If AI were *not* around, if we want *that* kind of access, that's what we'd have to ask, of darn near every sighted human in the world. And I just don't feel comfortable with demanding that of them.
Now, we'll see what Apple does to give us what will hopefully be even better image descriptions. Imagine a 3B model that is made with **high quality** image and description pairs, trained to do nothing but describe images. Apple has done pretty darn well without LLMs so far, so maybe they'll surprise us further. But my goodness, I'd much rather have something that, yes, makes me *feel* included, maybe a tad bit more than it actually *does* include me. And it's for each and every blind person to decide for themselves if they want to use AI for image, and probably soon, video descriptions, and what they're willing to trust it with. But for us to get this much real, human access, I just hope people who are detracting from AI understand that we who use AI are now used to having images described, and well, soon videos. It's just something that I don't think people should just deny quickly.
#AI #accessibility #blind #description #video #llm
MiniCPM-V 2.6 is the only recent model that was added. Maybe time to move on. :( #LLM #multimodal #AppleSilicon #MacOS #ML #AI
Have you tried using generative #AI for #coding ?
AI meaning #LLM here, like GH-Copilot, ChatGPT, Claude, Ollama and so on.
- No not yet or no interest (40%, 12 votes)
- Yes, but it's still useless (16%, 5 votes)
- I use it, but it decreases my skill & productivity (10%, 3 votes)
- I use it and it boosts my skill & productivity (33%, 10 votes)
#DSGVO (GDPR) versus #LLM / #KI (AI):
Copilot turns a court reporter into a child molester
heise.de/news/Copilot-macht-au…
Right of access? Difficult. Deleting the false information? Impossible. And now what?
Copilot turns a court reporter into a child molester
Because he reported on court proceedings, Copilot turns a journalist into a child molester, a defrauder of widows, and more.
Eva-Maria Weiß (heise online)
huggingface.co/openbmb/MiniCPM…
huggingface.co/openbmb/MiniCPM…
github.com/ggerganov/llama.cpp…
support MiniCPM-V-2.5 by tc-mb · Pull Request #7599 · ggerganov/llama.cpp
Dear llama.cpp Official, Hi, I'm writing to address our new PR submission for integrating our model MiniCPM-Llama3-V 2.5 into llama.cpp, which has been trending on Huggingface for over a week a...
GitHub
AI achieves silver-medal standard solving International Mathematical Olympiad problems
Breakthrough models AlphaProof and AlphaGeometry 2 solve advanced reasoning problems in mathematics
Google DeepMind
Update download.sh · meta-llama/llama@12b676b
Inference code for Llama models. Contribute to meta-llama/llama development by creating an account on GitHub.
GitHub
Sources:
theinformation.com/briefings/m…
x.com/AlpinDale/status/1814814…
youtu.be/r3DC_gjFCSA?feature=s…
#LLM #AI #ML
Meta Announces Llama 3 at Weights & Biases’ conference
In an engaging presentation at Weights & Biases’ Fully Connected conference, Joe Spisak, Product Director of GenAI at Meta, unveiled the latest family of Lla...
YouTube
Also @Tutanota , as a privacy-focused company, why are your comments run on #Reddit, rather than the #Fediverse? Most privacy-minded folks don't want their comments being used by AI LLMs etc.
This is way off brand.
@carlschwan among others have already shown how to do it fedistyle, pretty easily, and I'm sure many of the FOSS Fedipeeps here would happily help you out with a quick transition if you asked or gave a few $ to their FOSS project.
GPT 4 hallucination rate is 28.6% on a simple task: citing title, author, and year of publication medium.com/@michaelwood33311/g…
Hallucination rates and reference accuracy of ChatGPT and Bard for systematic reviews: Comparative analysis jmir.org/2024/1/e53164 #AI #LLM
GPT 4 Hallucination Rate is 28.6% on a Simple Task: Citing Title, Author, and Year of Publication
The all-too-common myth of GPT 4 having only a 3% hallucination rate is shattered by a recent study that found GPT 4 has a 28.6% hallucination rate. That’s almost 10x the oft-cited (i.e. over hyped)…
Michael Wood (Medium)
Blind writer tries the Gandalf | Lakera prompt injection game for the first time.
Upon recommendations, I tried this AI prompt injection game for the first time. I made it to level 7 with no help from the internet!
If you want to donate to me, donate to me on this page.
My website is here where I usually blog. I'm not much of a video person, so I blog and write more than I do video!
Gandalf | Lakera – Test your prompting skills to make Gandalf reveal secret information.
Trick Gandalf into revealing information and experience the limitations of large language models firsthand.
gandalf.lakera.ai
Do you remember a couple of weeks ago when I complained that a very large #python contribution to #inkscape was poorly formatted and I felt embarrassed about pushing back and asking them to run a linter over it?
Yeah I'm not fucking embarrassed now, I'm furious. 🤬
Update: Apparently they meant a small section of it was, not the whole MR. I'm annoyed, but I'll have to take them at their word.
#llm #oss #foss #mergerequest
Today's fail by a Russian troll was so adorable that I ended up writing a bit more about it. So: to feed or not to feed the trolls?
herrman.sk/home/krmit-ci-nekrm…
#troll #chatgpt #llm #fail #blog
To feed or not to feed the trolls? | Ľuboš Moščovič on security
Information security concerns everyone!
www.herrman.sk
- Feed (0 votes)
- Don't feed (0 votes)
- trollololooooo (0 votes)
This generative model lets you sketch out a scene with a few words; it then leverages an LLM to flesh out the details, with the ultimate goal of feeding those details to a downstream image generation model.
It is almost, but not quite, entirely the inverse of image captioning models.
This offers the closest experience to an image generation tool that's usable by people with visual impairments.
huggingface.co/spaces/lllyasvi…
Omost - a Hugging Face Space by lllyasviel
Discover amazing ML apps made by the community
huggingface.co
Flash Attention
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
Phi-3 - a microsoft Collection
Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths.
huggingface.co
Like many other technologists, I gave my time and expertise for free to #StackOverflow because the content was licensed CC-BY-SA - meaning that it was a public good. It brought me joy to help people figure out why their #ASR code wasn't working, or assist with a #CUDA bug.
Now that a deal has been struck with #OpenAI to scrape all the questions and answers in Stack Overflow, to train #GenerativeAI models, like #LLMs, without attribution to authors (as required under the CC-BY-SA license under which Stack Overflow content is licensed), to be sold back to us (the SA clause requires derivative works to be shared under the same license), I have issued a Data Deletion request to Stack Overflow to disassociate my username from my Stack Overflow username, and am closing my account, just like I did with Reddit, Inc.
policies.stackoverflow.co/data…
The data I helped create is going to be bundled in an #LLM and sold back to me.
In a single move, Stack Overflow has alienated its community - which is also its main source of competitive advantage, in exchange for token lucre.
Stack Exchange, Stack Overflow's former instantiation, used to fulfill a psychological contract - help others out when you can, for the expectation that others may in turn assist you in the future. Now it's not an exchange, it's #enshittification.
Programmers now join artists and copywriters, whose works have been snaffled up to create #GenAI solutions.
The silver lining I see is that once OpenAI creates LLMs that generate code - like Microsoft has done with Copilot on GitHub - where will they go to get help with the bugs that the generative AI models introduce, particularly, given the recent GitClear report, of the "downward pressure on code quality" caused by these tools?
While this is just one more example of #enshittification, it's also a salient lesson for #DevRel folks - if your community is your source of advantage, don't upset them.
Submit a data request - Stack Overflow
You can use this form to submit a request regarding your personal information that is processed by Stack Overflow
policies.stackoverflow.co
Looking for AI use-cases
We’ve had ChatGPT for 18 months, but what’s it for? What are the use-cases? Why isn’t it useful for everyone, right now? Do Large Language Models become universal tools that can do ‘any’ task, or do we wrap them in single-purpose apps, and build thou…
Benedict Evans
YOWZA
Form Extractor Prototype
“This tool extracts the structure from an image of a form.”
github.com/timpaul/form-extrac…
#ai #LLM #UX #accessibility
GitHub - timpaul/form-extractor-prototype
Contribute to timpaul/form-extractor-prototype development by creating an account on GitHub.
GitHub