Skip to main content

Search

Items tagged with: language


What language do you toot/post in the most?

Do you write and read Toots mostly in the same language or do you stick to one language or mix it up;
If you use a different language for different topics or day time i would be curios to know

#Mastodon #Language #SocialMedia #OnlineCommunities #Multilingual #Sprache #Idioma #語言 #Lingvo #Lingua #言語 #언어 #שפּראַך #Langue #Taal #Мова #Язык

  • English 🇺🇸 🇬🇧 🇨🇦 🇦🇺 🇦🇬 🇧🇧🇮🇪 🇯🇲 🇳🇿🇱🇨 . .. (100%, 7 votes)
  • Japanese 🇯🇵 (0%, 0 votes)
  • Spanish 🇪🇸 🇲🇽 🇦🇷 🇨🇴 🇨🇱 🇵🇪 🇧🇴 🇨🇺 🇩🇴 🇸🇻 🇺🇾 . .. (0%, 0 votes)
  • Chinese 🇨🇳 🇭🇰 🇹🇼 🇲🇴 🇸🇬 (0%, 0 votes)
  • German 🇩🇪 🇨🇭 🇦🇹 🇱🇮 (28%, 2 votes)
  • French 🇫🇷 🇨🇦 🇧🇪 🇹🇫 🇼🇫 🇾🇹 🇵🇲 🇬🇵 🇧🇯 🇭🇹 . .. (14%, 1 vote)
  • Italian 🇮🇹 (0%, 0 votes)
  • Portuguese 🇧🇷 🇵🇹 🇦🇴 🇨🇻 🇬🇼 🇸🇹 🇲🇿 (0%, 0 votes)
  • Dutch 🇳🇱 🇸🇷 🇦🇼 (0%, 0 votes)
  • Esperanto 🏳⭐💚 (0%, 0 votes)
  • Other (tell me in the comments!) (0%, 0 votes)
7 voters. Poll end: in 4 days


If you're a #language nerd like I am, then you won't have missed the @mozilla #CommonVoice v19 #speech #dataset release - which now features 131 languages! Here's my #dataviz, done in @observablehq of the v19 #metadata coverage.

I've updated the visualisation this time around with human-readable language names instead of their ISO-639 or BCP-47 language codes to make it it easier to read.

There's some interesting observations:

▶ Catalan (ca) continues to be leader in terms of data - speaking volumes about the efforts to revitalise culture and language in Catalunya. It's also one of the few languages that has data for all age groups, particularly older speakers - this sort of data is missing for most other languages.

▶ Kiswahili (sw) is one of the languages where there is more data for female-identifying speakers than for male-identifying speakers ♀ - although Japanese (ja), Western Mari (mrj) and Luganda (lg) do pretty well here, too!

▶ Sentence domains can now be categorised, and although most new sentences are "general", Albanian (sq) has a lot of sentences related to law and government.

▶ Tsonga (ts), a Bantu language spoken in Southern Africa, has dethroned Icelandic (is) as the language with the highest average utterance duration. I don't know enough about Tsonga to speculate why - it's a somewhat agglutinative language, but many Tsonga works are generally short.

▶ Bengali / Bangla (bn) has a significant amount of data that is not yet validated, and therefore does not appear in training / dev / test splits. There is a similar case for many languages new to Common Voice - it takes time to validate.

▶ The language with the highest number of average contributions per speaker is Taita (dav), a Bantu language from Kenya.

What do you make of the data visualisation? Are there any other insights you can see?

Big thanks to the CV team for all their efforts - EM, Jessica Rose, Dmitrij Feller and Justin Grant.

#linguistics

observablehq.com/@kathyreid/mo…


Wow, English-only people (or Western languages, for that matter) are so naïve. In case you didn't know, the lang attribute is very important in East Asian languages.

lobste.rs/s/9ck6y9/what_progra…

jsfiddle.net/8sa8ndLj/2/

#CJK #language #EastAsian


I've been working on a thing for a few weeks, and I'm hoping there are #Language / #FileFormat / #Markdown nerds and experts on Mastodon who can provide some input and sanity-check.

I looked for an existing markdown extension file format that would allow me to write a document in multiple languages, and I couldn't find anything that fit the bill.

So, I decided to create my own. 🤷‍♂️

github.com/OmenApps/PolyglotMa…

#PolyglotMarkdown #Languages #OpenSource #LanguageTools #Translation #Multilingual

1/2


I am thinking about actually using my website and write some blog-ish stuff. Should I write in English, and add to the pile of blogspam (but with the benefit of better spell-checking), or should I go for Dutch (more original content, my native language, but spell checking is more difficult)?

#blog #language #English #Dutch

  • English (0%, 0 votes)
  • Dutch (100%, 1 vote)
1 voter. Poll end: 1 month ago


If I want to learn #Chinese and I'm #blind, what good options do I have other than in-person class? #Language #learning #a11y


Hello Jamers 😍

As you know, Transifex serves as our #collaborative translation #management platform. While the core principles remain similar, this tutorial diverges from its predecessor by focusing specifically on the translation of the Jami application. 📲

Are you #multilingual? Your skills can make a real impact! Join our translation efforts and help us break #language barriers! 💪

👀 Want to know how ?
Here's the link: jami.net/how-to-contribute-to-…

#Jami #opensource #P2P #App #PrivacyMatters


Looking for participants for my son’s #linguistics dissertation on #language online and the movement of words from the internet into #EnglishLanguage #Scots #AmericanEnglish etc... ‘The aim of this #research is to broaden our understanding of how language is spread throughout the #internet, and into the #offline world’. Please #boost for attention: form should take around 15 minutes to complete and is live until the end of July. forms.gle/F2eR86zBBSyVRWuE9


Looking for participants for my son’s #linguistics dissertation on #language online and the movement of words from the internet into #EnglishLanguage #Scots #AmericanEnglish etc... ‘The aim of this #research is to broaden our understanding of how language is spread throughout the #internet, and into the #offline world’. Please #boost for attention: form should take around 15 minutes to complete and is live until the end of July. forms.gle/F2eR86zBBSyVRWuE9


2. Language, linguistics

The Invention of Clouds: How an Amateur Meteorologist Forged the Language of the Skies (Richard Hamblyn)

The story of how cloud types were named by Luke Howard at the turn of the 19th century. The book gives great historical context starting from the 1600s, about the birth of meteorology and the difficulties of cloud classification. I finally learned how the categories work.

amazon.com/Invention-Clouds-Am…

#cloud #language #linguistics #nonfiction #history #books #bookstodon


So apparently the term "patch" in software development comes from punched paper tape.

"Small corrections to the programmed sequence could be done by patching over portions of the paper tape and re-punching the holes in that section."

chsi.harvard.edu/harvard-ibm-m…

#til #computers #development #language #history


One of the phrases that’s been popular in China this year according to this article: sixthtone.com/news/1014370

For more context, people in China have been assembling plain white bread sandwiches to try to understand how we live in this part of the world, and they are posting through it (the idea of eating anything cold or raw, especially a vegetable, is seen as especially disgusting in the Chinese world, with some exceptions)

theguardian.com/food/2023/jun/…

#Food #China #Language #Chinese #Mandarin


Recently, "Very Finnish Problems" posted a very strange list of "fun facts about the #Finnish #language". As a linguist, I wasn't amused at all, and here, finally, comes my own version: probably not that "fun", but at least with some real facts. kielioblog.wordpress.com/15-1-…


Let's get this show on the road, then. In my early days of composition, I used a #midi #programming #language called Zel, which I first found in June of 2001. In fact, you can still find it at its website! zelsoftware.org This youtube video is the first completed tune in Zel, from december of 2001. Its simple, I don't do any fancy synth tricks, except some delay on the final piano line, and it's rendered with my #Roland sc-8820. Well, i say simple, at least in comparison to what I'd pull off later. youtube.com/watch?v=lK9dg5CMWx…


Hallo !Friendica Support,

wie ich gerade festgestellt habe, taucht mein Server in der Statistik auf Fediverse Observer als englischsprachig auf. Allerdings habe ich sowohl im Admin-Panel als auch in der local.config.php die Sprache auf Deutsch gestellt. Muss ich da sonst noch irgendwo etwas angeben? So wichtig ist es mir ja nicht, das es meist kein offener Server ist, aber eigenartig finde ich es schon.

#friendica #language


This has been going around on Twitter, but I neglected my community here :) I'm sorry about this :)
Tomorrow at noon EST, I will give a #talk on the #accessibility of #language #learning and #linguistics in general for #screenReader users as part of the a11yTalks event. This will be a public event with no need to register so if this is something any of you are interested in, here's the link :) a11ytalks.com/posts/2023-MAY/ #speaker #a11y


This seems to be becoming a habit of mine now as I begin to use Mastodon more and continue to learn more about it each day.

This one covers the options available to you when posting content such as Alt Text and Content Warnings but also talks about language settings and using filters to tidy up your timelines.

Any boosts will be greatly appreciated so we can help everyone get the most out of Mastodon.
#twittermigration #MastodonTips #accessibility #language #feditips

youtu.be/Pg0rtrUOoJY


Mindblowing 🤯

#Whisper is an #openSource #speechRecognition model written in #Python by #OpenAI. I’ve just seen it in action. Extract an #mp3 from a video, run it through Whisper, and it turns every spoken word into text. It even does a very decent job in #Danish. Perfect for subtitling #TV and #video. I am very impressed.

github.com/openai/whisper

#ai #language #transcription #speechToText


cargo careful: run your Rust code with extra careful debug checking by Ralf Jung: ralfj.de/blog/2022/09/26/cargo… #Rust #language


By the way, nowadays I usually refer to servers as gardens. I believe this will make owning a garden sound a lot less technical and a lot more fun. It would also create expectations of ease of use.

As example, freedombox and yunohost are aiming to be such garden operating systems. And Pioneer Freedombox is a garden.

#language #orange #digitalissues