Glossary
Explanations of Unicode, encoding, and typography terms.
Basic Multilingual Plane (BMP)
The first 65,536 code points of Unicode (U+0000 to U+FFFF), containing the most commonly used characters.
Unicode Standard
Bidirectional Text (Bidi)
Text that mixes left-to-right and right-to-left writing directions, requiring the Unicode Bidirectional Algorithm for proper display.
Unicode Standard
CJK
Abbreviation for Chinese, Japanese, and Korean. In Unicode it usually refers to the Han ideographs shared across these writing systems, which are encoded once as a unified set (CJK Unified Ideographs).
Unicode Standard
Code Point
A numerical value in the Unicode codespace (U+0000 to U+10FFFF), written as U+ followed by four to six hexadecimal digits (e.g., U+0041 for 'A'). An assigned code point typically maps to a specific character; others are unassigned or reserved.
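As a minimal sketch of the U+ notation, the following Python converts between a character and its code point label. The helper names `to_u_notation` and `from_u_notation` are illustrative, not part of any library.

```python
def to_u_notation(ch: str) -> str:
    """Format one character's code point as U+XXXX (at least four hex digits)."""
    return f"U+{ord(ch):04X}"


def from_u_notation(label: str) -> str:
    """Turn a label such as 'U+0041' back into the character it names."""
    # Skip the leading 'U+' and parse the rest as hexadecimal.
    return chr(int(label[2:], 16))


print(to_u_notation("A"))         # U+0041
print(from_u_notation("U+0041"))  # A
```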
Unicode Standard
Combining Character
A Unicode character that modifies the preceding base character, such as accents and diacritical marks.
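To illustrate, here is a small Python sketch using the standard `unicodedata` module. It treats any character whose General Category starts with "M" (a mark) as combining, which is a simplifying assumption; the helper name `combining_marks` is made up for this example.

```python
import unicodedata

# 'e' followed by U+0301 COMBINING ACUTE ACCENT: two code points, one visible letter.
base_plus_mark = "e\u0301"

# NFC normalization composes the pair into the single code point U+00E9.
precomposed = unicodedata.normalize("NFC", base_plus_mark)


def combining_marks(text: str) -> list[str]:
    """Return the characters in text whose General Category is a mark (Mn, Mc, Me)."""
    return [ch for ch in text if unicodedata.category(ch).startswith("M")]


print(len(base_plus_mark), len(precomposed))  # 2 1
```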
Unicode Standard
Common Locale Data Repository (CLDR)
A Unicode Consortium project providing locale-specific data used worldwide, such as formatting rules for dates and currencies and translated language names.
Unicode Standard
Confusables (Homoglyphs)
Characters that look similar or identical but have different code points (e.g., Latin 'A' U+0041 vs Cyrillic 'А' U+0410).
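The example pair can be inspected in Python. This is only a sketch for looking at individual characters, not a confusable detector; `describe` is a hypothetical helper name.

```python
import unicodedata

latin_a = "\u0041"     # Latin 'A'
cyrillic_a = "\u0410"  # Cyrillic 'А', visually near-identical in most fonts


def describe(text: str) -> list[tuple[str, str]]:
    """List (U+XXXX, character name) for each code point in text."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>")) for ch in text]


print(latin_a == cyrillic_a)           # False
print(describe(latin_a + cyrillic_a))
```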
Unicode Standard
Emoji
Pictographic symbols defined in Unicode, originally from Japanese mobile phones, now a universal visual communication system.
Unicode Standard
General Category
A Unicode property that classifies each character (e.g., Lu = Uppercase Letter, Sm = Math Symbol, So = Other Symbol).
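Python's standard `unicodedata.category` function returns this two-letter property. A brief sketch with a few sample characters chosen for illustration:

```python
import unicodedata

# Sample characters picked for this example.
samples = ["A", "a", "+", "\u00a9", "1"]

# Map each sample to its two-letter General Category code.
categories = {ch: unicodedata.category(ch) for ch in samples}

for ch, cat in categories.items():
    print(f"U+{ord(ch):04X} {cat}")
```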
Unicode Standard
Grapheme Cluster
A user-perceived character that may consist of multiple code points (e.g., a base character + combining marks, or a flag emoji).
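The gap between code points and user-perceived characters shows up in Python, where `len` counts code points. The sketch below builds a flag from two regional indicator symbols; `flag` is an illustrative helper, and the code does not segment text into grapheme clusters, which the standard library does not do.

```python
REGIONAL_INDICATOR_A = 0x1F1E6  # REGIONAL INDICATOR SYMBOL LETTER A


def flag(country_code: str) -> str:
    """Build a flag sequence from a two-letter uppercase ASCII country code."""
    return "".join(
        chr(REGIONAL_INDICATOR_A + ord(letter) - ord("A")) for letter in country_code
    )


japan = flag("JP")
accented = "e\u0301"  # base letter plus combining acute accent

# Each string is one user-perceived character but two code points.
print(len(japan), len(accented))  # 2 2
```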
Unicode Standard
Mathematical Alphanumeric Symbols
Unicode block (U+1D400–U+1D7FF) containing styled letters and digits used in mathematical notation (bold, italic, script, etc.).
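As a sketch, uppercase ASCII letters can be mapped to their mathematical bold counterparts, which start at U+1D400 and run contiguously for A to Z. The function name `to_math_bold` is an assumption for this example, and only the bold capital range is handled; some other styles have gaps in this block.

```python
import unicodedata

MATH_BOLD_CAPITAL_A = 0x1D400  # MATHEMATICAL BOLD CAPITAL A


def to_math_bold(text: str) -> str:
    """Replace ASCII A-Z with mathematical bold capitals; leave other characters alone."""
    out = []
    for ch in text:
        if "A" <= ch <= "Z":
            out.append(chr(MATH_BOLD_CAPITAL_A + ord(ch) - ord("A")))
        else:
            out.append(ch)
    return "".join(out)


styled = to_math_bold("AB")
# NFKC compatibility normalization folds the styled letters back to plain ASCII.
plain = unicodedata.normalize("NFKC", styled)
```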
Unicode Standard
Private Use Area
Ranges of Unicode code points reserved for custom characters defined by font vendors or applications: U+E000–U+F8FF in the BMP, plus the supplementary private use areas U+F0000–U+FFFFD and U+100000–U+10FFFD.
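A range check is enough to recognize private use code points. In the sketch below, the first range comes from the definition above; the two supplementary ranges (planes 15 and 16) and the helper name `is_private_use` are additions made for this example.

```python
import unicodedata

# BMP Private Use Area plus the two supplementary private use areas.
PRIVATE_USE_RANGES = [
    (0xE000, 0xF8FF),
    (0xF0000, 0xFFFFD),
    (0x100000, 0x10FFFD),
]


def is_private_use(ch: str) -> bool:
    """Return True if the character's code point lies in a private use range."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in PRIVATE_USE_RANGES)


# Private use code points carry the General Category 'Co'.
print(is_private_use("\ue000"), unicodedata.category("\ue000"))
```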
Unicode Standard
Script
A Unicode property indicating which writing system a character belongs to (e.g., Latin, Greek, Common, Inherited).
Unicode Standard
Unicode
A universal character encoding standard that assigns a unique number (code point) to every character across all writing systems.
Unicode Standard
Unicode Block
A contiguous range of code points defined by the Unicode standard, grouping related characters (e.g., 'Arrows' block: U+2190–U+21FF).
Unicode Standard
Unicode Consortium
The non-profit organization that develops and maintains the Unicode Standard, adding new characters in annual releases.
Unicode Standard
Unicode Plane
A group of 65,536 consecutive code points. Unicode has 17 planes (0–16), with Plane 0 being the BMP.
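Because each plane holds 65,536 (2^16) code points, the plane number is the code point shifted right by 16 bits. A short sketch, with `plane_of` and `is_bmp` as illustrative names:

```python
PLANE_SIZE = 0x10000  # 65,536 code points per plane


def plane_of(ch: str) -> int:
    """Return the plane number (0-16) that contains the character's code point."""
    return ord(ch) >> 16


def is_bmp(ch: str) -> bool:
    """Return True if the character is in Plane 0, the Basic Multilingual Plane."""
    return plane_of(ch) == 0


print(plane_of("A"))           # 0
print(plane_of("\U0001F600"))  # 1
```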
Unicode Standard
Unicode Version
Numbered releases of the Unicode Standard (e.g., 16.0), each adding new characters, scripts, and emoji.
Unicode Standard
Unihan Database
A comprehensive database of CJK ideographs with readings, meanings, and variant information maintained by the Unicode Consortium.
Unicode Standard
Variation Selector
Unicode characters (U+FE00–U+FE0F) that select a particular glyph variant for the preceding character. For example, U+FE0E requests text presentation and U+FE0F requests emoji presentation.
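A minimal Python sketch of text versus emoji presentation, using U+2764 HEAVY BLACK HEART as the base character. How each sequence is drawn depends on the font and renderer; `strip_variation_selectors` is a hypothetical helper.

```python
VS15 = "\ufe0e"  # VARIATION SELECTOR-15: request text presentation
VS16 = "\ufe0f"  # VARIATION SELECTOR-16: request emoji presentation

heart = "\u2764"
text_heart = heart + VS15
emoji_heart = heart + VS16


def strip_variation_selectors(text: str) -> str:
    """Remove characters in U+FE00-U+FE0F, leaving the base characters."""
    return "".join(ch for ch in text if not ("\ufe00" <= ch <= "\ufe0f"))


# Both variants are two code points long and share the same base character.
print(len(text_heart), len(emoji_heart))  # 2 2
```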
Unicode Standard
Zero-Width Joiner (ZWJ)
An invisible character (U+200D) that joins adjacent characters, commonly used in emoji sequences to create combined emoji.
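An emoji ZWJ sequence can be assembled and taken apart with ordinary string operations. The sketch below joins man, woman, and girl into a family emoji; whether it renders as a single glyph depends on font support, and `join_emoji` is an illustrative name.

```python
ZWJ = "\u200d"  # ZERO WIDTH JOINER


def join_emoji(*parts: str) -> str:
    """Join emoji with ZWJ so that supporting renderers can show one combined glyph."""
    return ZWJ.join(parts)


# U+1F468 MAN, U+1F469 WOMAN, U+1F467 GIRL
family = join_emoji("\U0001F468", "\U0001F469", "\U0001F467")

# Three emoji plus two joiners: five code points in total.
print(len(family))        # 5
print(family.split(ZWJ))  # the three component emoji
```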
Unicode Standard