Discussion
Information Is Beautiful
@infobeautiful@vis.social  ·  activity timestamp 4 months ago

The expressive speed vs information 'bit rate' of spoken languages.
French and Vietnamese have the highest information density.

via The Economist
https://www.economist.com/graphic-detail

Stacked area chart comparing syllable rate and information rate across 14 languages. Two columns show distribution curves for syllables per second (4-8) and information rate in bits per second (30-60). Japanese has highest syllable rate, while Vietnamese conveys information fastest. Languages vary in speech speed vs information density.
chebra
@chebra@mstdn.io replied  ·  activity timestamp 4 months ago

@infobeautiful Vietnamese is so compressed that they often don't understand each other. Some words sound the same, some are actually identical and you have to get the meaning from context. Blue and green are the same word. Three and father are the same word. So yeah, high bitrate, but that might actually be the downside - needs more error correction.

the grugq
@thegrugq@infosec.exchange replied  ·  activity timestamp 4 months ago

@infobeautiful aaah. The article doesn’t say what the graphic implies.

The article finds that all languages send information at approximately 39 bits per second. The number of syllables is the number of unique syllables allowed per language, not syllables per sentence.

“Language is universal, but it has few indisputably universal characteristics, with cross-linguistic variation being the norm. For example, languages differ greatly in the number of syllables they allow, resulting in large variation in the Shannon information per syllable. Nevertheless, all natural languages allow their speakers to efficiently encode and transmit information. We show here, using quantitative methods on a large cross-linguistic corpus of 17 languages, that the coupling between language-level (information per syllable) and speaker-level (speech rate) properties results in languages encoding similar information rates (~39 bits/s) despite wide differences in each property individually: Languages are more similar in information rates than in Shannon information or speech rate. These findings highlight the intimate feedback loops between languages’ structural properties and their speakers’ neurocognition and biology under communicative pressures. Thus, language is the product of a multiscale communicative niche construction process at the intersection of biology, environment, and culture.”
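
A minimal sketch of the relationship the abstract describes, assuming the information rate is simply information per syllable multiplied by speech rate. The two toy languages and all of their numbers are hypothetical, chosen only to show how a dense-but-slow language and a light-but-fast one can land on the same rate; they are not values from the study. The helper names `shannon_entropy` and `information_rate` are illustrative as well, and the entropy function is the plain textbook formula, not necessarily the estimator the authors used.

```python
import math


def shannon_entropy(probabilities):
    """Shannon entropy, in bits, of a discrete probability distribution."""
    return -sum(p * math.log2(p) for p in probabilities if p > 0)


def information_rate(bits_per_syllable, syllables_per_second):
    """Information rate in bits per second: density times speech rate."""
    return bits_per_syllable * syllables_per_second


# Hypothetical toy languages (made-up numbers, NOT data from the study).
languages = {
    "dense_slow": {"bits_per_syllable": 8.0, "syllables_per_second": 5.0},
    "light_fast": {"bits_per_syllable": 5.0, "syllables_per_second": 8.0},
}

rates = {name: information_rate(**props) for name, props in languages.items()}

# A language with 8 equally likely syllables carries 3 bits per syllable.
uniform_bits = shannon_entropy([1 / 8] * 8)

if __name__ == "__main__":
    for name, rate in rates.items():
        print(f"{name}: {rate:.1f} bits/s")
    print(f"uniform 8-syllable inventory: {uniform_bits:.1f} bits/syllable")
```

Both toy languages differ in density and in speed, yet their product is identical, which is the kind of trade-off the abstract reports.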

the grugq
@thegrugq@infosec.exchange replied  ·  activity timestamp 4 months ago

@infobeautiful that seems weird. Thai is very succinct compared to English. Most words are a single syllable and the majority of context is implicit, rather than explicit.

If you try to express something as 1:1 with the English phrase it would be inefficient, but people don’t speak like that.

The English “let’s get lunch” (e.g. to coworkers) would be “ไปกินข้าว”.
4 syllables in English vs 3 in Thai.

Very complex things are slower in Thai, but the majority of conversations are much faster because they’re sparse and rely on implicit information.

R.L. LE
@herrLorenz@chaos.social replied  ·  activity timestamp 4 months ago

@infobeautiful There's a flaw: it does not contain #tlhInganHol.
#tlh #Klingon

Panic at the Trishco 🕊️
@topofmyvoice@cupoftea.social replied  ·  activity timestamp 4 months ago

@infobeautiful
Lots of elephants inside of snakes

Márton Salomváry
@mrc@mastodon.berlin replied  ·  activity timestamp 4 months ago

@infobeautiful https://xkcd.com/833/ (xkcd: "Convincing")

Justin Ashworth
@justinashworth@sciences.social replied  ·  activity timestamp 4 months ago

There is an article cited, but I'm concerned this could be "not even wrong" given its limitations, its assumptions, and its technical model of how information content is defined. (The first author is also French.)

The work is nonetheless valuable and an excellent contribution.

Would be very interested to see results with alternative definitions of information conveyance, particularly from authors more familiar with how symbolic and probabilistic meaning works differently in e.g. Chinese.

Georg Arne Spenden
@Daseinsappeal@troet.cafe replied  ·  activity timestamp 4 months ago

@infobeautiful Add to that, in conversations, the French all talk simultaneously. Unbeatable.
