Unicode to ascii converter
World's simplest unicode tool
This online utility converts Unicode data to ASCII characters. Anything that you paste or enter in the input area automatically gets converted to ASCII and is printed in the output area. It supports all Unicode symbols and it works with emoji characters. Created by encoding gurus from team Browserling.
announcement heads up!
We just launched a new site – Online Fractal Tools – utilities for doing fractaly things. Check it out!
Unicode to ascii converter tool What is a unicode to ascii converter?
This browser-based utility converts your Unicode data to the ASCII encoding. To do this, it first splits the Unicode data into graphemes and finds the code point values of each grapheme. A grapheme is usually a single glyph (such as a letter, number, ideogram, logogram, or an emoticon) but it can also be a combination of glyphs (such as text with combining characters). The browser's default encoding stores glyphs as sequences of one, two, three, or four bytes. Code points in the range from 0 to 127 use one byte (actually less than that – only 7 bits). Code points in the range from 128 to 2047 use two bytes. Code points in the range from 2048 to 65,535 use three bytes. Code points in the range from 65,536 to 1,114,111 use four bytes. If a symbol is encoded using just one byte, then the Unicode symbol will be exactly the same as the ASCII symbol and won't change its value when being converted to the ASCII encoding. Characters that use more than one byte are represented as two, three, or four extended ASCII characters, one for each byte. A two-byte Unicode symbol has the binary format "110xxxxx 10xxxxxx", where "x" is a usable bit, so it has 5+6=11 usable bits. A three-byte Unicode symbol has the binary format "1110xxxx 10xxxxxx 10xxxxxx" with 4+6+6=16 usable bits. A four-byte Unicode symbol has the binary format "11110xxx 10xxxxxx 10xxxxxx 10xxxxxx" with 3+6+6+6=21 usable bits. Let's analyze which extended ASCII characters are used in each multi-byte mode. If it's a two-byte encoding, then the first byte "110xxxxx" has 5 free bits and can have 25 = 32 values, and the second byte "10xxxxxx" has 6 free bits and can have 26 = 64 values. If you look at the extended ASCII table, then the possible values for the first byte "110xxxxx" are "ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖ×ØÙÚÛÜÝÞß" and for the second byte "10xxxxxx" are " ¡¢£¤¥¦§¨©ª«¬®¯°±²³´µ¶·¸¹º»¼½¾¿". If it's a three-byte encoding, then the first byte "1110xxxx" has 4 free bits and can have 24 = 16 values. The second and third bytes are the same as in the two-byte case. The 16 possible values for the first byte "1110xxxx" are "àáâãäåæçèéêëìíîï". If it's a four-byte encoding, then the first byte "11110xxx" has only 3 free bits and can have 23 = 8 values. The bytes two, three, and four are the same as in previous cases. The possible values for the first byte "11110xxx" are "ðñòóôõö÷".
Unicode to ascii converter examples Click to use
Thomas Carlyle's Quote
In this example, we convert a quote by Thomas Carlyle that's written in Unicode bold-script font to extended ASCII bytes. The characters in the bold-script font use code points from the range U+1D4D0 to U+1D503. The value 1D4D0 (in hex) is 120,016 (in decimal) and the value 1D503 is 120,067. As these values are in the code point range 65,536 – 1,114,111, each character is represented by four bytes. The first byte is always in the form "11110xxx" and the other three bytes are in the form "10xxxxxx". The first byte has only three bits available, which means it can only be printed in ASCII as one of these ASCII symbols "ðñòóôõö÷". The remaining three bytes have 64 possible values. In the output, every Unicode character is printed as four ASCII characters. The spaces, the dot, and the comma remain unchanged because they are equal in both encodings.
𝓝𝓸 𝓹𝓻𝓮𝓼𝓼𝓾𝓻𝓮, 𝓷𝓸 𝓭𝓲𝓪𝓶𝓸𝓷𝓭𝓼. 𝓣𝓱𝓸𝓶𝓪𝓼 𝓒𝓪𝓻𝓵𝔂𝓵𝓮
ðð¸ ð¹ð»ð®ð¼ð¼ð¾ð»ð®, ð·ð¸ ðð²ðªð¶ð¸ð·ðð¼. ð£ð±ð¸ð¶ðªð¼ ððªð»ðµððµð®
Sentence with Emojis
This example uses two emoticons in place of two words. All other words in the input sentence use ordinary Latin alphabet letters. The uppercase letters belong to the code positions from 65 to 90 and the lowercase letters belong to the range from 97 to 122. These letters are defined in the 7-bit ASCII table, so they don't change during the conversion. The dot (code point value 46) and apostrophe (code point value 39) symbols are also in the ASCII table and don't change. However, the emojis are not in the ASCII table so they turn into sequences of 4 bytes, and each byte is printed as an ANSI character, so you get four ANSI characters per emoji.
A 🐕 is a 🚶's best friend.
A ð is a ð¶'s best friend.
Emoji of Ball Games
This example replaces a list of ball sports emoticons with extended ASCII symbols. The output includes 1-byte, 2-byte, 3-byte, and 4-byte characters. Latin letters are regular ASCII characters and are encoded in 1 byte. The "«" and "»" characters are represented as two bytes in Unicode and are converted to two symbols from the Windows-1252 (and ISO 8859-1) character encoding. The first two balls and the en dash symbol use three bytes, so they are converted to three characters. And the last four balls are represented as four bytes, so they are converted to four characters.
⚽ – «Soccer Ball» ⚾ – «Baseball» 🏀 – «Basketball» 🏐 – «Volleyball» 🏈 – «American Football» 🏉 – «Rugby Football»
â½ â Â«Soccer BallÂ» â¾ â Â«BaseballÂ» ð â Â«BasketballÂ» ð â Â«VolleyballÂ» ð â Â«American FootballÂ» ð â Â«Rugby FootballÂ»
Pro tips Master online unicode tools
You can pass input to this tool via ?input query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!
All unicode tools
Quickly find code positions of all Unicode values.
Quickly decode code positions to Unicode values.
Quickly encode Unicode values to UTF-8 encoding.
Quickly encode Unicode values to UTF-16 encoding.
Quickly encode Unicode values to UTF-32 encoding.
Quickly create a picture from Unicode symbols.
Quickly generate random Unicode text in a given range.
Quickly generate all Unicode values from the given code point interval.
Quickly filter Unicode symbols that are within the given code point interval.
Quickly split Unicode data into graphemes.
Quickly sort Unicode glyphs in increasing or decreasing order.
Quickly find the length of Unicode text.
Quickly increase Unicode code point values.
Quickly decrease Unicode code point values.
Quickly reverse the order of symbols in Unicode text.
Quickly rotate Unicode characters to the left and right.
Quickly create multiple copies of Unicode text.
Quickly extract all characters from Unicode text.
Quickly split Unicode data into pieces.
Quickly split Unicode text into chunks of constant length.
Quickly merge Unicode snippets together.
Quickly shorten Unicode text to the given length.
Quickly left-pad Unicode text with any character.
Quickly right-pad Unicode text with any character.
Quickly align Unicode data to the center.
Quickly align Unicode data to the right.
Quickly convert ordinary numbers to Unicode numbers in various fonts.
Quickly convert ordinary letters to Unicode letters in various fonts.
Quickly convert ordinary text to fancy Unicode text.
Quickly combine input Unicode with diacritical marks.
Quickly circularly rearrange Unicode symbols.
Quickly encode Unicode data to HTML entities.
Quickly URL-escape Unicode symbols.
Quickly encode Unicode values to base64.
Quickly encode Unicode values to a data URI.
Quickly convert Unicode characters to raw bytes.
Quickly convert Unicode data to base-2 (binary).
Quickly convert Unicode data to base-8 (octal).
Quickly convert Unicode data to base-10 (decimal).
Quickly convert Unicode data to base-16 (hexadecimal).
Quickly convert Unicode symbols to raw ASCII bytes.
Quickly convert ASCII bytes to Unicode symbols.
Quickly create a picture from Unicode emojis.
Release Zalgo on your Unicode text.
Coming soon These unicode tools are on the way
Name Unicode Symbols
Spell out the names of Unicode characters in the input text.
URL-unescape Unicode text.
Convert Binary to Unicode
Convert base-2 data to Unicode encoding.
Convert Octal to Unicode
Convert base-8 data to Unicode encoding.
Convert Decimal to Unicode
Convert base-10 data to Unicode encoding.
Convert Hex to Unicode
Convert base-16 data to Unicode encoding.
Convert Unicode to Any Base
Convert Unicode text to any radix.
Convert Any Base to Unicode
Convert any radix data to Unicode.
Convert Unicode to Latin1
Convert Unicode text to Latin1 encoding.
Convert Latin1 to Unicode
Convert Latin1 encoded data to Unicode.
Convert Bytes to Unicode
Convert raw bytes to Unicode.
Remove Combining Characters
Delete diacritical marks from Unicode data
Remove Zalgo from Unicode
Make Unicode Zalgo text readable again.
Check if the given Unicode has valid encoding.
Convert Unicode to Punycode
Encode Unicode text to Punycode encoding.
Convert Punycode to Unicode
Decode Punycode encoding to Unicode.
Decode Base64 to Unicode
Convert base64 data to Unicode text.
Encode Unicode to Data URI
Convert Unicode to a valid data URL.
Decode Data URI to Unicode
Convert a valid data URL to Unicode text.
Convert HTML to Unicode
Decode HTML entities to Unicode data.
Convert UTF8 to Unicode
Decode UTF8 encoding to Unicode.
Convert UTF16 to Unicode
Decode UTF16 encoding to Unicode.
Convert UTF32 to Unicode
Decode UTF32 encoding to Unicode.
Convert Unicode to Uppercase
Convert all Unicode characters to uppercase.
Convert Unicode to Lowercase
Convert all Unicode characters to lowercase.
Convert Unicode to Randomcase
Randomize case of all Unicode characters.
Convert Unicode to Lowercase
Convert all Unicode characters to lowercase.
JSON Stringify Unicode
Encode Unicode to JSON.
JSON Parse Unicode
Decode JSON to Unicode.
Shuffle Unicode Symbols
Randomly rearrange the order of input graphemes.
Print statistics about Unicode data and code points, etc.
Subscribe! Never miss an update
Notifications We'll let you know when we add this tool