Web Development
Practical Unicode guides for developers — HTML entities, CSS content property, regex, JavaScript strings, Python, and URL encoding.
Unicode Collation: How to Sort Text Correctly Across Languages
Master Unicode collation — the Unicode Collation Algorithm, locale-aware sorting in JavaScript and Python, PostgreSQL ICU collations, and common pitfalls.
Jun 11, 2024Character Encoding Detection: How to Identify Unknown Text Encoding
Detect character encodings in unknown text — BOM sniffing, statistical analysis with chardet, ICU detection, and why heuristics sometimes fail.
Mei 28, 2024Web Fonts and Unicode Subsetting: Loading Only What You Need
Optimize web font loading with unicode-range subsetting — reduce font file sizes, improve load times, and handle multilingual content efficiently.
Mei 14, 2024Soft Hyphen: Controlling Line Breaks in Web Typography
Master the soft hyphen (U+00AD) — how it controls word breaking in HTML, CSS hyphens property, browser behavior differences, and when to use ­ vs CSS.
Mei 7, 2024IDN Homograph Attacks: When Unicode Becomes a Security Threat
Learn how attackers use Unicode lookalike characters to create phishing domains — how IDN homograph attacks work and how browsers defend against them.
Apr 30, 2024Unicode in URLs: Percent-Encoding, Punycode, and IRIs
Understand how Unicode characters appear in URLs — percent-encoding for paths, Punycode for domains, IRIs, and Python/JavaScript encoding functions.
Apr 16, 2024Python and Unicode: The Complete Developer's Guide
Master Unicode in Python — str vs bytes, unicodedata module, encoding/decoding, file I/O, normalization, and the Unicode sandwich pattern.
Apr 2, 2024JavaScript and Unicode: Strings, Code Points, and Grapheme Clusters
Navigate JavaScript's UTF-16 string model — surrogate pairs, codePointAt, for...of iteration, Intl.Segmenter, and common Unicode pitfalls.
Mar 19, 2024Unicode-Aware Regex: Property Escapes and Multilingual Patterns
Write regex patterns that work with Unicode — property escapes, script detection, emoji matching, and common pitfalls in JavaScript and Python.
Mar 5, 2024CSS Content Property: Using Unicode Symbols in Stylesheets
Learn to use Unicode characters in CSS — the content property, escape sequences, custom list markers, decorative elements, and accessibility considerations.
Feb 20, 2024HTML Entities: The Complete Guide to Character References
Master HTML entities — named references, numeric codes, hex notation, when escaping is required, and a reference table of the most useful entities.
Feb 6, 2024