Guides
Unicode Collation: How to Sort Text Correctly Across Languages
Master Unicode collation — the Unicode Collation Algorithm, locale-aware sorting in JavaScript and Python, PostgreSQL ICU collations, and common pitfalls.
Character Encoding Detection: How to Identify Unknown Text Encoding
Detect character encodings in unknown text — BOM sniffing, statistical analysis with chardet, ICU detection, and why heuristics sometimes fail.
Web Fonts and Unicode Subsetting: Loading Only What You Need
Optimize web font loading with unicode-range subsetting — reduce font file sizes, improve load times, and handle multilingual content efficiently.
Soft Hyphen: Controlling Line Breaks in Web Typography
Master the soft hyphen (U+00AD): how it controls word breaking in HTML, the CSS hyphens property, browser behavior differences, and when to use &shy; vs CSS.
IDN Homograph Attacks: When Unicode Becomes a Security Threat
Learn how attackers use Unicode lookalike characters to create phishing domains — how IDN homograph attacks work and how browsers defend against them.
Unicode in URLs: Percent-Encoding, Punycode, and IRIs
Understand how Unicode characters appear in URLs — percent-encoding for paths, Punycode for domains, IRIs, and Python/JavaScript encoding functions.
Python and Unicode: The Complete Developer's Guide
Master Unicode in Python — str vs bytes, unicodedata module, encoding/decoding, file I/O, normalization, and the Unicode sandwich pattern.
JavaScript and Unicode: Strings, Code Points, and Grapheme Clusters
Navigate JavaScript's UTF-16 string model — surrogate pairs, codePointAt, for...of iteration, Intl.Segmenter, and common Unicode pitfalls.
Unicode-Aware Regex: Property Escapes and Multilingual Patterns
Write regex patterns that work with Unicode — property escapes, script detection, emoji matching, and common pitfalls in JavaScript and Python.
CSS Content Property: Using Unicode Symbols in Stylesheets
Learn to use Unicode characters in CSS — the content property, escape sequences, custom list markers, decorative elements, and accessibility considerations.
HTML Entities: The Complete Guide to Character References
Master HTML entities — named references, numeric codes, hex notation, when escaping is required, and a reference table of the most useful entities.