#unicopedia

Michel Marianimikaeru
2026-02-11

For the record, the icon of the new application shows the Tangut ideograph U+173C7 𗏇, meaning "written character".

Icon of the Unicopedia Tangutica application, with the Tangut ideograph U+173C7 𗏇 meaning "written character"
Michel Marianimikaeru
2026-02-11

All Tangut-related utilities and sample scripts have been moved from 'Unicopedia Sinica' to a new dedicated application: 'Unicopedia Tangutica'.

🔗 codeberg.org/tonton-pixel/unic

Screenshot of the Tangut Components utility of the Unicopedia Tangutica application
Michel Marianimikaeru
2026-01-19

The latest version of the open-source application "Unicopedia Plus" is now available, adding support for all the new characters, scripts, and blocks defined in Unicode 17.0.

🔗 codeberg.org/tonton-pixel/unic

The current version of the app is a pre-release (Beta), since full support for Unicode 17.0 is not yet available in the Electron framework. More specifically, results from the "Unicode Foldings", "Unicode Normalizer", and "Unicode Segmenter" utilities cannot be fully trusted...

Unicopedia Plus application screenshot
Michel Marianimikaeru
2026-01-11

New in Unicopedia Sinica:

- Added new Tangut Inspector utility.
- Added new Tangut Data Finder utility.

🔗 codeberg.org/tonton-pixel/unic

Unicopedia Sinica - Tangut Inspector utility screenshot
Unicopedia Sinica - Tangut Data Finder utility screenshot
Michel Marianimikaeru
2026-01-05

New in Unicopedia Sinica:

- Added new Tangut Components utility.
- Added new Tangut References utility.

🔗 codeberg.org/tonton-pixel/unic

Screenshot of Unicopedia Sinica app: Look Up IDS feature of Tangut Components utility
Michel Marianimikaeru
2025-11-20

Unicode 17.0 introduces five new CJK Unified Ideographs related to Chinese personal pronouns, four of them having been proposed by Andrew West (BabelStone):

« The other Chinese pronoun coming to Unicode v. 17.0 next year, in addition to ⿰㐅也 (3p gender-neutral), ⿰男也 (3p explicitly male), ⿱妳心 (f. equivalent of 您), ⿱我心 (Taiwanese 1p plural), is ⿱她心 (f. equivalent to 怹) »

🔗 bsky.app/profile/babelstone.co

Screenshot of CJK Related data from Unicopedia Sinica: Chinese Personal Pronouns
Michel Marianimikaeru
2025-11-17

The latest version of the open-source application "Unicopedia Sinica" is now available, adding support for all the new CJK/Unihan characters defined in Unicode 17.0.

🔗 codeberg.org/tonton-pixel/unic

Screenshot of the open-source application Unicopedia Sinica v.17.0.0
Michel Marianimikaeru
2025-11-08

New in Unicopedia Ægypta:

- Added all Unikemet-related utilities from Unicopedia Plus.

🔗 codeberg.org/tonton-pixel/unic

Screenshot of Unicopedia Ægypta app: Unikemet Inspector utility
Michel Marianimikaeru
2025-11-08

New in Unicopedia Sinica:

- Added all Unihan-related utilities from Unicopedia Plus.
- Added typeface selector between serif and sans-serif in the Pan-CJK Font Variants utility.

Planned:

- Utilities for non-Han scripts: Khitan Small Script, Nüshu, Tangut.
- Utilities for Jurchen, Small Seal (Unicode 18.0?)

🔗 codeberg.org/tonton-pixel/unic

Screenshot of Unicopedia Sinica app: Unihan Total Strokes utility
Michel Marianimikaeru
2025-11-08

New in Unicopedia Plus:

- All Unihan-related utilities have been moved to Unicopedia Sinica.
- All Unikemet-related utilities have been moved to Unicopedia Ægypta.

🔗 codeberg.org/tonton-pixel/unic
🔗 codeberg.org/tonton-pixel/unic
🔗 codeberg.org/tonton-pixel/unic

Screenshot of Unicopedia Plus app: log(😅) =💧log(😄) [Math Geekiness]
Michel Marianimikaeru
2025-11-06

[Follow-Up]

Reference links:

- UTS #18: Unicode Regular Expressions
🔗 unicode.org/reports/tr18/

- UTS #18: Unicode Regular Expressions [Proposed Update]
🔗 unicode.org/reports/tr18/propo

- Issues - tonton-pixel/unicopedia-plus - Codeberg.org
🔗 codeberg.org/tonton-pixel/unic

Michel Marianimikaeru
2025-11-06

The "official" Unicode Regular Expressions (UTS #18) document, dated February 8, 2022, has not been updated since, and the four new Unicode properties introduced in Unicode 15.1 are only listed in the Proposed Update *draft*, dated May 11, 2023...

This could explain why several environments, including the Electron framework (#Chromium), trigger an "invalid property" error for the regular expression /\p{IDS_Unary_Operator}/u in JavaScript, while /\p{IDS_Binary_Operator}/u is accepted...
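
For anyone wanting to reproduce the difference between the two properties, here is a minimal sketch of a feature check. The helper name `supportsUnicodeProperty` is hypothetical (not part of any library), and the result for `IDS_Unary_Operator` depends on the engine and version running the code:

```javascript
// Returns true when the current JavaScript engine accepts \p{<property>}
// in a regular expression compiled with the `u` flag, false otherwise.
function supportsUnicodeProperty(property) {
  try {
    // Compiling the pattern is enough: an unknown property name is
    // rejected at construction time.
    new RegExp(`\\p{${property}}`, "u");
    return true;
  } catch (error) {
    // Unsupported property escapes surface as a SyntaxError.
    if (error instanceof SyntaxError) return false;
    throw error;
  }
}

// Report the status of the two properties mentioned above.
for (const property of ["IDS_Binary_Operator", "IDS_Unary_Operator"]) {
  console.log(property, supportsUnicodeProperty(property));
}
```

Running this in a browser console or in an Electron renderer shows which of the two property escapes that particular engine accepts.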

Michel Marianimikaeru
2025-10-28

According to the "Can I Unicode‽" web page, as of today, the browser is still "stuck" in Unicode 15.1, while the latest version of Unicode is 17.0!

mathiasbynens.github.io/caniun

The fact that the Electron framework is based on Chromium probably explains why it is still lagging behind too...

Support for Unicode 16.0 would allow me to produce a final, stable version of my Unicopedia Plus app before I start working on a version for Unicode 17.0.

Michel Marianimikaeru
2025-10-28

Until now, I've been able to provide a working (though pre-release) edition of my Unicopedia Plus app targeting a Unicode version not yet supported by the Electron framework, by embedding a copy of all the up-to-date Unicode data files and using the `regexpu-core` module to emulate the most "critical" regular expressions. But this is merely a workaround, not what it was designed for in the first place...

github.com/mathiasbynens/regex
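
To give an idea of what such an emulation amounts to, here is a minimal hand-rolled sketch. It does not use the `regexpu-core` API and is not the app's actual code. The `embeddedRanges` table and the `buildPropertyRegex` helper are hypothetical, and the code point range shown for `IDS_Unary_Operator` is an assumption to be checked against the PropList.txt data file:

```javascript
// Hypothetical excerpt of embedded Unicode data: for each property,
// a list of inclusive [first, last] code point ranges.
// Assumption: IDS_Unary_Operator covers U+2FFE..U+2FFF (verify in PropList.txt).
const embeddedRanges = {
  IDS_Unary_Operator: [[0x2FFE, 0x2FFF]],
};

// Formats a code point as a \u{...} escape, valid in `u`-flag patterns.
function toEscape(codePoint) {
  return `\\u{${codePoint.toString(16).toUpperCase()}}`;
}

// Builds a character-class regular expression equivalent to \p{<property>},
// using the embedded ranges instead of the engine's own Unicode tables.
function buildPropertyRegex(property) {
  const ranges = embeddedRanges[property];
  if (!ranges) {
    throw new RangeError(`No embedded data for property: ${property}`);
  }
  const body = ranges
    .map(([first, last]) =>
      first === last ? toEscape(first) : `${toEscape(first)}-${toEscape(last)}`
    )
    .join("");
  return new RegExp(`[${body}]`, "u");
}

const idsUnaryOperator = buildPropertyRegex("IDS_Unary_Operator");
console.log(idsUnaryOperator.source);          // [\u{2FFE}-\u{2FFF}]
console.log(idsUnaryOperator.test("\u{2FFE}")); // true
console.log(idsUnaryOperator.test("\u{2FF0}")); // false
```

The resulting expression only depends on the embedded ranges, so it behaves the same whatever Unicode version the underlying engine ships with.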

Michel Marianimikaeru
2025-10-28

As you might expect, my main application Unicopedia Plus relies heavily on ...

Today, I updated the Electron framework to its latest major version, 39.0.0, hoping it would at last bring full support for Unicode 16.0, published by the UTC in September 2024, but unfortunately not: it is still stuck in Unicode 15.1, published in September 2023! Moreover, Unicode 17.0 has already been officially released...

🔗 codeberg.org/tonton-pixel/unic

Michel Marianimikaeru
2025-10-18

New utility in Unicopedia Plus:
- Unihan Total Strokes

🔗 codeberg.org/tonton-pixel/unic

Screenshot of the Unihan Total Strokes utility of the Unicopedia Plus application
Michel Marianimikaeru
2025-07-21

New in the CJK Variations utility of Unicopedia Sinica:

- Support for the latest Ideographic Variation Database (IVD 2025), adding the new CAAPH Collection.

- Support for the updated BabelStone Collection (unregistered), based on the latest BabelStone Han font (v17.0.0 BETA), by Andrew C. West (魏安), 1960-2025 RIP (安息吧).

🔗 https://codeberg.org/tonton-pixel/unic

Screenshot of the CJK Variations utility of Unicopedia Sinica for Unicode character U+3AB4
Screenshot of the CJK Variations utility of Unicopedia Sinica for Unicode character U+4E9B
Michel Marianimikaeru
2025-06-30

@electronjs

The lack of Electron support for the latest Unicode version is a major hindrance for my open-source Unicopedia Plus application, which I have had to keep in Beta for a long time because of it...

codeberg.org/tonton-pixel/unic

Michel Marianimikaeru
2025-05-21
Hieroglyph Picture Book utility screenshot
Hieroglyph Taxonomy utility screenshot
