OCLOperators UTF32
Created by Lars.olofsson on 2023-05-03 · Last edited by Sandra.akech on 2025-12-12.
UTF32 (Unicode Transformation Format, 32-bit) is a fixed-length encoding scheme that represents each Unicode code point as a single 32-bit integer.
How UTF32 works:
- Every Unicode code point (letter, symbol, or emoji) is stored in exactly 32 bits (4 bytes)
- This is fixed-length encoding: every code point takes the same amount of space
- Unlike UTF8 or UTF16, no multi-byte sequences or surrogate pairs are needed for any character
- Easy for computers to find and process characters, because the n-th code point always starts at byte offset 4 × n, but wastes memory for simple text (like English letters), which UTF8 stores in 8 bits per character
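The fixed-width property can be illustrated with a minimal Python sketch. It is not part of the OCL operator itself: the helper name `code_point_at` is hypothetical, and the big-endian `utf-32-be` codec is chosen here only so that no byte order mark is added to the output.

```python
# Encode a short string as UTF-32 (big-endian, no byte order mark).
text = "Hi😀"
data = text.encode("utf-32-be")

def code_point_at(buf: bytes, index: int) -> str:
    """Return the code point at position `index` (hypothetical helper).

    Because every code point occupies exactly 4 bytes, the position
    can be computed directly instead of scanning from the start.
    """
    start = index * 4
    return chr(int.from_bytes(buf[start:start + 4], "big"))

unit_count = len(data) // 4        # number of 32-bit units in the buffer
third = code_point_at(data, 2)     # direct lookup of the emoji
```

The same lookup in UTF8 or UTF16 would require walking the text from the beginning, since the emoji takes more bytes than the letters before it.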
Example:
English text:
'Hello': 5 characters × 32 bits each = 160 bits
- Stored as:
H → 32 bits
e → 32 bits
l → 32 bits
l → 32 bits
o → 32 bits
See: Encoding
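The 'Hello' arithmetic can be checked with a short Python sketch that also compares the result with UTF8 and UTF16. The variable name `sizes_in_bits` is illustrative, and the `-be` codec variants are an assumption made so that no byte order mark is counted.

```python
text = "Hello"

# Size of the same text in bits under each encoding (no byte order mark).
sizes_in_bits = {
    name: len(text.encode(name)) * 8
    for name in ("utf-8", "utf-16-be", "utf-32-be")
}

for name, bits in sizes_in_bits.items():
    print(f"{name}: {bits} bits")
```

For plain English text, UTF32 uses four times the space of UTF8, which is the memory cost of the fixed-length layout.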
