This online utility converts ASCII characters to Unicode text. Anything that you paste or enter in the input area automatically gets converted to Unicode and is printed in the output area. It supports all Unicode symbols and it works with emoji characters. Created by encoding gurus from team Browserling.
This browser-based utility reinterprets sequences of ASCII and ANSI characters as Unicode data. The ASCII encoding uses only 7 bits and supports only 95 printable characters. The most popular Unicode encoding that most browsers implement uses a variable-width encoding from 7 to 21 bits and supports 1,100,000 printable characters. As the simplest characters (a-z, A-Z, numbers, and punctuation marks) use the same amount of bits (7 bits) in both ASCII and Unicode, they are left unchanged in the encoding process. All other Unicode characters are displayed as two, three, or four broken ANSI characters because the ASCII encoding doesn't support multi-byte values. To determine how many bytes each output Unicode symbol uses, we convert the broken input text to binary base and look at the first few beginning bits. This bit pattern tells us how many bytes to consume to create a Unicode glyph. Single-byte Unicode glyphs have the high bit set to zero. What that means is they start with the 0-bit and look like "0bbbbbbb", where "b" is any bit value. So a single byte Unicode glyph matches the same ASCII glyph. Double-byte glyphs start with the pattern "110bbbbb", followed by "10bbbbbb". The total number of free bits in double-byte glyphs is eleven (five in the first byte and six in the second byte). Triple-byte glyphs start with the pattern "1110bbbb", followed by two "10bbbbbb" patterns. In this case, the number of usable bits is sixteen (four in the first byte and six in bytes two and three). Quadruple-byte glyphs start with the pattern "11110bbb" and are followed by three patterns of "01bbbbbb". The total number of bits, in this case, is twenty-one (three in the first byte and six in each of the other three bytes). When we combine these one-, two-, three-, or four-byte sequences together and let the browser decode them to its default encoding, we get valid Unicode characters.
This example fixes distorted ASCII characters and prints a readable quote by Ralph Waldo Emerson in the output. The quote is written using Unicode's bold-italic letter font that uses 4-byte sequences for each letter. In the broken ASCII text, each letter is represented by four extended ASCII symbols so to restore them, the browser reads four bytes at a time and reencodes these bytes as Unicode. The quote marks and horizontal bar symbol use 3-byte sequences (three ASCII symbols). The author’s name uses simple English letters, which are equally readable in ASCII and Unicode encodings, so the browser doesn't need to reencode them.
This example restores broken Unicode emoticons in a short phrase. The destroyed emoticons are printed as three ANSI characters. To restore them, the utility decodes them to binary, extracts the Unicode bits from them, calculates the code point values, and converts them to valid Unicode emojis. The broken sequence "â" is "☔" ("rain"), and "ð" is "🌷" ("flower"), so the phrase is "No rain. No flowers."
This example looks like Zalgo destroyed it but no, it's not Zalgo, it's just the wrong Unicode encoding. The input text is interpreted as raw ASCII characters but in reality, it's Unicode characters. Because of this encoding mismatch, the text is absolutely unreadable. To understand the input data, it's converted into bytes and encoded to Unicode characters. As a result, we get a list of scientific emojis together with their descriptions in the small-caps Unicode font.
You can pass input to this tool via ?input query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!
Quickly find code positions of all Unicode values.
Quickly decode code positions to Unicode values.
Quickly encode Unicode values to UTF-8 encoding.
Quickly encode Unicode values to UTF-16 encoding.
Quickly encode Unicode values to UTF-32 encoding.
Quickly create a picture from Unicode symbols.
Quickly generate random Unicode text in a given range.
Quickly generate all Unicode values from the given code point interval.
Quickly filter Unicode symbols that are within the given code point interval.
Quickly split Unicode data into graphemes.
Quickly sort Unicode glyphs in increasing or decreasing order.
Quickly find the length of Unicode text.
Quickly increase Unicode code point values.
Quickly decrease Unicode code point values.
Quickly reverse the order of symbols in Unicode text.
Quickly rotate Unicode characters to the left and right.
Quickly create multiple copies of Unicode text.
Quickly extract all characters from Unicode text.
Quickly split Unicode data into pieces.
Quickly split Unicode text into chunks of constant length.
Quickly merge Unicode snippets together.
Quickly shorten Unicode text to the given length.
Quickly left-pad Unicode text with any character.
Quickly right-pad Unicode text with any character.
Quickly align Unicode data to the center.
Quickly align Unicode data to the right.
Quickly convert ordinary numbers to fancy Unicode numbers.
Quickly convert Unicode numbers back to regular numbers.
Quickly convert ordinary letters to Unicode letters in various fonts.
Quickly convert Unicode letters back to regular Latin letters.
Quickly convert ordinary text to fancy Unicode text.
Quickly convert fancy Unicode text back to regular text.
Quickly combine input Unicode with diacritical marks.
Quickly delete diacritical marks from the Unicode text.
Quickly spoof regular text using Unicode homoglyphs.
Quickly unspoof homoglyphs in text with regular letters.
Quickly check if the given Unicode text is spoofed or not.
Quickly circularly rearrange Unicode symbols.
Quickly convert Unicode data to escape sequences.
Quickly encode Unicode data to HTML entities.
Quickly URL-escape Unicode symbols.
Quickly encode Unicode values to base64.
Quickly encode Unicode values to a data URI.
Quickly convert Unicode characters to raw bytes.
Quickly convert Unicode data to base-2 (binary).
Quickly convert Unicode data to base-8 (octal).
Quickly convert Unicode data to base-10 (decimal).
Quickly convert Unicode data to base-16 (hexadecimal).
Quickly convert Unicode symbols to raw ASCII bytes.
Quickly convert ASCII bytes to Unicode symbols.
Quickly convert Unicode text to a string literal.
Quickly create a picture from Unicode emojis.
Quickly release Zalgo on your Unicode text.
Quickly make Zalgo text readable again.
View and edit Unicode in a browser-based editor.
Spell out the names of Unicode characters in the input text.
URL-unescape Unicode text.
Convert base-2 data to Unicode encoding.
Convert base-8 data to Unicode encoding.
Convert base-10 data to Unicode encoding.
Convert base-16 data to Unicode encoding.
Convert Unicode text to any radix.
Convert any radix data to Unicode.
Convert Unicode text to ISO-8859-1 encoding.
Convert ISO-859-1 encoded data to Unicode.
Convert Unicode text to ISO-8859-2 encoding.
Convert ISO-8859-2 encoded data to Unicode.
Convert Unicode text to Ecoji encoding.
Convert Ecoji encoded data to Unicode.
Convert raw bytes to Unicode.
Check the Unicode version of the given Unicode characters.
Check if the given Unicode has valid encoding.
Encode Unicode text to Punycode encoding.
Decode Punycode encoding to Unicode.
Convert base64 data to Unicode text.
Convert Unicode to a valid data URL.
Convert a valid data URL to Unicode text.
Decode HTML entities to Unicode data.
Decode UTF8 encoding to Unicode.
Decode UTF16 encoding to Unicode.
Decode UTF32 encoding to Unicode.
Convert all Unicode characters to uppercase.
Convert all Unicode characters to lowercase.
Generate a list of all country flag icons.
Generate a list of all Unicode arrows.
Generate a list of all Unicode animals.
Generate a list of all Unicode flowers and plants.
Generate a list of all Unicode block elements.
Generate a list of all Egyptian hieroglyphs.
Generate a list of all currency symbols.
Use Unicode colors to generate a rainbow.
Create a smiley face from Unicode symbols.
Generate a list of random emojis.
Randomize case of all Unicode characters.
Convert all Unicode characters to lowercase.
Encode Unicode to JSON.
Decode JSON to Unicode.
Randomly rearrange the order of input graphemes.
Generate Alt codes for Unicode characters.
Generate Unicode glyphs from Alt codes.
Print statistics about Unicode data and code points.
Extract a part from Unicode data.
Generate waves with Unicode symbols.
Generate graphs using Unicode symbols.
Wrap a message in a Unicode box.
Subscribe to our updates. We'll let you know when we release new tools, features, and organize online workshops.
Enter your email here