Skip to main content

Search

Items tagged with: ML


Start saving money for that M4 Ultra with 500GB! Maybe this could be the first open source that could surpass GPT-4! AIatMeta: "Llama 3 8B & 70B models are just the beginning of what we’re working to release for Llama 3. Our largest models currently in the works are 400B+ parameters and while they’re still in active development, we’re excited about how this work is trending." #LLM #AI #ML https://twitter.com/AIatMeta/status/1780997414071181370
#AI #ML #llm


Earlier today, Microsoft released new WizardLM-2 7b, 8x22b, 70b with great benchmark result, (of course, they say as good or almost same as GPT-4), but they removed weights on Huggingface, repo on Github, and their whitepaper. Someone on Reddit joked maybe they released GPT-4 by mistake! lol Quantized. weights from other people are still around on Huggingface! #ML #LLM #AI
#AI #ML #llm


Cool tip for running LLMs on Apple Silicon! By default, MacOS allows GPU to use up to 2/3 of RAM on machines with <=36GB and 3/4 on machines with >36GB. I used the command `sudo sysctl iogpu.wired_limit_mb=57344` to override and allocate 56GB/64GB for GPU. This allowed me to load all layers of larger models for a faster speed! #MacOS #LLM #AI #ML
#AI #MACOS #ML #llm


Apparently Meta is planning to release two small varients of Llama-3 next week "as a precursor to the launch of the biggest version of Llama 3, expected this summer." Command-r-plus, mixtral 8x22b, Google CodeGemma... All of sudden companies are releasing LLMS like crazy! Where's Apple? Maybe In WWDC 2024? lol #LLM #AI #ML https://www.theinformation.com/articles/meta-platforms-to-launch-small-versions-of-llama-3-next-week
#AI #ML #llm


Following xAI Grok-1 314B, Databricks DBRX 132B, Cohere Command R+ 104B, another big model drop this time from Mistral! Mistral 8x22B! #LLM #AI #ML https://twitter.com/mistralai/status/1777869263778291896
#AI #ML #llm


Claude 3 can summarize up to about 150,00 words, (a length similar to Harry Potter and the Deathly Hallows.) also It outperformed GPT-4 and Gemini Ultra on industry benchmark tests, such as undergraduate level knowledge, graduate level reasoning and basic mathematics. It allows users to upload images and documents for the first time. #LLm #AI #ML https://www.cnbc.com/2024/03/04/google-backed-anthropic-debuts-claude-3-its-most-powerful-chatbot-yet.html
#AI #ML #llm


Funny Reddit thread about Tim Cook's comment: "the Mac is the best computer for AI." Apple fanboys defend it with unified ram, and Apple haters attack with Nvidia GPU speed. lol #ML #AI #Apple #Mac https://www.reddit.com/r/LocalLLaMA/comments/1b2h5st/tim_cook_speaks_about_ai_at_the_apple_shareholder/
#apple #AI #ML #mac


Does anyone else experience the same issue I'm having with Google's Gemma-7B on Ollama? Sometimes, it even struggles with spelling simple words in English and creates nonsensical words! This is not what I expected. #LLM #ML #AI #Ollama
#AI #ML #llm #ollama


Ollama seems to support Llava V1.6 34B! Best open source multimodal vision-language model I've tried so far! #LLM #ML #AI @freakyfwoof @vick21 @tristan @KyleBorah @Bri


Zuckerberg says Meta is training #LLaMa 3 on 600,000 H100s! Wel, time to finetune and quantize everything again when it comes out. lol #ML #AI #LLM https://www.reddit.com/r/LocalLLaMA/comments/199y05e/zuckerberg_says_they_are_training_llama_3_on/
#AI #ML #llm #llama


Interesting, Apple released ferret, an open source multimodal Model! It's based on LLaVA and Vicuna. #AI #LLM #ML https://github.com/apple/ml-ferret/
#AI #ML #llm


Apparently Arthur Mensch, CEO of #Mistral, declared on French national radio that mistral will release an open source model equivalent to #Gpt4 in 2024. I don't speak French, so can't verify, but it would be interesting along with Llama-3 and whatever OpenAI has planned for 2024. #AI #ML #LLM https://radiofrance.fr/franceinter/podcasts/l-invite-de-7h50/l-invite-de-7h50-du-mardi-12-decembre-2023-3833724


Hello Fediverse,

We are looking for Text-To-Speak (TTS) expertise to help or advise us on improving the default voice of the Linux desktop. :linux: 📣

Please reach out or boost :boost_love:

Thanks!

#Linux #tts #accessibility #a11y #GNOME #KDE #FreeSoftware #freedesktop #ml


New blog post: Enhancing Accessibility with AI and ML

We discuss how to specialize #AI for understanding UI for #a11y testing, the importance of algorithm and data integrity, and how #ML can be used to simplify #a11y testing. https://www.deque.com/blog/enhancing-accessibility-with-ai-and-ml/

#a11y #AI #ML


#AI #FutureOfWork #ML

Researchers discover a more flexible approach to machine learning.

https://www.quantamagazine.org/researchers-discover-a-more-flexible-approach-to-machine-learning-20230207/


"Especially in this moment in history, it is vital that we provide our students with the critical thinking skills that will allow them to recognise misleading claims made by tech companies and understand the limits and risks of hyped and harmful technology that is made mainstream at a dazzling speed and on a frightening scale."

Excellent call to action by @Iris

https://irisvanrooijcogsci.com/2023/01/14/stop-feeding-the-hype-and-start-resisting/

#AIhype #MathyMath #AI #ML #ChatGPT


Maybe this could help FOSS developers make something like this for Linux or Android. This is an article from Apple Machine Learning, on how they made mobile apps accessible with machine learning:

https://machinelearning.apple.com/research/mobile-applications-accessible

#a11y #apple #ml