veganism.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Veganism Social is a welcoming space on the internet for vegans to connect and engage with the broader decentralized social media community.

Administered by:

Server stats:

293
active users

#unicode

4 posts4 participants0 posts today

meta.wikimedia.org/w/index.php

It’s 2025 and:
- There is still no vertical text site mode for #Wikimedia in any language using vertical text.
- #Wikipedia still forces “simplified” Chinese on browsers.
- There is still no true IDS or CangJie composition matrix for characters in #Unicode.
- SignWriting still has no proper #Unicode inclusion, no IDS analogue, no inventory of signs, and is still mostly written by mouse drag and drop in a mishmash of SVG and HTML.
- There is no proper SignWriting IME, such as a Rime schema.

To say this state of affairs is cultural propaganda by mass technic inertia would be an understatement. Infotech is functional colonialism. Thats really all there is to say.

Filed under #崇洋媚外

meta.wikimedia.orgSearch results for "Vertical writing" - Meta

Got a bug report for @novelwriter from someone who uses Cuneiform text in their work. These are 4 byte Unicode symbols, and turned out to be very tricky to handle. 😅

The app is built with Python, which will switch a string to UCS-4 when it contains such characters, so the characters always have a single index in the string.

However, the Qt library uses UTF-16. That means 4-byte characters use two slots, creating a mismatch in indices between the two representations.

#Python#Qt#Code
Replied in thread

@electronjs

No Electron support for the latest Unicode version is a major hindrance for my open-source Unicopedia Plus application, which I have to keep in Beta version for a long time because of that...

codeberg.org/tonton-pixel/unic

Codeberg.orgunicopedia-plusDeveloper-oriented set of Unicode, Unihan, Unikemet & emoji utilities wrapped into one single app, built with Electron.
Replied in thread

@doctorwhom Ha! At The Mag, remember Ops got this filtering software and we decided to see if we could break it? Tox added an "@" before the <html> tag.

Browser totally rendered the page but filter wouldn't parse it because of the At 🤣🤣🤣

I think it took us three hours?

Replied in thread

@argv_minus_one there are a surprising number of moving pieces here, but as with all things #Unicode a lot of it boils down to "the UCD stores relevant properties and software interprets those"

I don't know the entire story, but my touchpoint for this are CJK numerals like 一, 二, 三

Those cannot be converted as decimals or digits, but they *do* have a numeric value

In Python:

```
>>> import unicodedata
>>> unicodedata.decimal("三")
...
ValueError: not a decimal
>>> unicodedata.digit("三")
...
ValueError: not a digit
>>> unicodedata.numeric("三")
3.0
```

For CJK specifically this is probably related to the fact that these are often combined with other glyphs multiplicatively, and there are plenty of non-decimal glyphs.