SymbolFYI

Glossário

Termos de Unicode, codificação e tipografia explicados.

Basic Multilingual Plane (BMP)

The first 65,536 code points of Unicode (U+0000 to U+FFFF), containing the most commonly used characters.

Unicode Standard

Bidirectional Text (Bidi)

Text that mixes left-to-right and right-to-left writing directions, requiring the Unicode Bidirectional Algorithm for proper display.

Unicode Standard

CJK

Abbreviation for Chinese, Japanese, and Korean — refers to the unified set of ideographic characters shared across these writing systems.

Unicode Standard

Code Point

A numerical value in the Unicode standard that maps to a specific character, written as U+ followed by hexadecimal digits (e.g., U+0041 for 'A').

Unicode Standard

Combining Character

A Unicode character that modifies the preceding base character, such as accents and diacritical marks.

Unicode Standard

Common Locale Data Repository (CLDR)

A project providing locale-specific formatting rules for dates, currencies, and language names used worldwide.

Unicode Standard

Confusables (Homoglyphs)

Characters that look similar or identical but have different code points (e.g., Latin 'A' U+0041 vs Cyrillic 'А' U+0410).

Unicode Standard

Emoji

Pictographic symbols defined in Unicode, originally from Japanese mobile phones, now a universal visual communication system.

Unicode Standard

General Category

A Unicode property that classifies each character (e.g., Lu = Uppercase Letter, Sm = Math Symbol, So = Other Symbol).

Unicode Standard

Grapheme Cluster

A user-perceived character that may consist of multiple code points (e.g., a base character + combining marks, or a flag emoji).

Unicode Standard

Mathematical Alphanumeric Symbols

Unicode block (U+1D400–U+1D7FF) containing styled letters and digits used in mathematical notation (bold, italic, script, etc.).

Unicode Standard

Private Use Area

Ranges of Unicode code points (U+E000–U+F8FF, etc.) reserved for custom characters defined by font vendors or applications.

Unicode Standard

Script

A Unicode property indicating which writing system a character belongs to (e.g., Latin, Greek, Common, Inherited).

Unicode Standard

Unicode

A universal character encoding standard that assigns a unique number (code point) to every character across all writing systems.

Unicode Standard

Unicode Block

A contiguous range of code points defined by the Unicode standard, grouping related characters (e.g., 'Arrows' block: U+2190–U+21FF).

Unicode Standard

Unicode Consortium

The non-profit organization that develops and maintains the Unicode Standard, adding new characters in annual releases.

Unicode Standard

Unicode Plane

A group of 65,536 consecutive code points. Unicode has 17 planes (0–16), with Plane 0 being the BMP.

Unicode Standard

Unicode Version

Numbered releases of the Unicode Standard (e.g., 16.0), each adding new characters, scripts, and emoji.

Unicode Standard

Unihan Database

A comprehensive database of CJK ideographs with readings, meanings, and variant information maintained by the Unicode Consortium.

Unicode Standard

Variation Selector

Unicode characters (U+FE00–U+FE0F) that modify the appearance of the preceding character, including text vs emoji presentation.

Unicode Standard

Zero-Width Joiner (ZWJ)

An invisible character (U+200D) that joins adjacent characters, commonly used in emoji sequences to create combined emoji.

Unicode Standard