onlineunicodetools logo

Unicode to binary converter

World's simplest unicode tool

This browser-based utility converts Unicode text to base-2 binary data. Anything that you paste or enter in the text area on the left automatically gets printed as binary on the right. It supports the most popular Unicode encodings (such as UTF-8, UTF-16, UTF-32, UCS-2, and UCS-4) and it works with emoji characters. You can also customize the binary output format by enabling binary padding and spacing. Created by encoding gurus from team Browserling.
announcement heads up!
We just launched a new site โ€“ Online String Tools โ€“ a collection of string processing utilities. Check it out!
fullscreen fullscreen_exit
Binary Padding
If you enable this option, then all output bytes will be full octets (meaning 8 bits in every byte).
Binary Spacing
If you enable this option, then all output bytes will have a space character after each of them.
Output Encoding
Use this Unicode encoding in the output.
If you selected UTF16, UTF32, UCS2, or UCS4 as output encoding, then add Byte Order Mark (BOM) at the beginning.
Unicode to binary converter tool What is a unicode to binary converter?
This utility decodes your input Unicode data into individual code points and then encodes each code point to binary. Depending on which output encoding you have selected, each character can be encoded from one to four bytes in length. If the output encoding is UTF32 or UCS4, then each code point will be four bytes (or 32 bits). If the output encoding is UTF16 or UCS2, then each code point will be either two bytes (16 bits) or, if it's a more complex character, then four bytes (32 bits). If the output encoding is UTF8, then the binary will be one to four bytes in length (variable length 8 to 32 bits). When converting the Unicode data to binary, you can make it print full-length binary bytes always using 8 bits per byte, even if the byte starts with one or more zero bits. For increased readability, you can also put a space between every binary octet. We also support Little Endian and Big Endian byte orders for UTF16, UTF32, UCS2, and UCS4 encodings, as well as prepending a BOM (Byte Order Mark) to the binary output stream. To print the output in Little Endian format, select UTF16-LE, UTF32-LE, UCS2-LE, or UCS4-LE output encoding but to print output in Big Endian format, select UTF16-BE, UTF32-BE, UCS2-BE, or UCS4-BE from the output encoding dropdown option.
Unicode to binary converter examples Click to use
Emoji Flags in Binary
In this example, we convert emoji Unicode flags of the five biggest economies into binary bits in UTF8 encoding. To make the output more readable, we've enabled two output options. The first option adds padding to binary so that each UTF8 byte is 8 bits long, and the second option puts a space between every 8 bits.
๐Ÿ‡ฆ๐Ÿ‡บ - Australia ๐Ÿ‡จ๐Ÿ‡ฆ - Canada ๐Ÿ‡ฉ๐Ÿ‡ช - Germany ๐Ÿ‡ฌ๐Ÿ‡ง - UK ๐Ÿ‡บ๐Ÿ‡ธ - US
11110000 10011111 10000111 10100110 11110000 10011111 10000111 10111010 00100000 00101101 00100000 01000001 01110101 01110011 01110100 01110010 01100001 01101100 01101001 01100001 00001010 11110000 10011111 10000111 10101000 11110000 10011111 10000111 10100110 00100000 00101101 00100000 01000011 01100001 01101110 01100001 01100100 01100001 00001010 11110000 10011111 10000111 10101001 11110000 10011111 10000111 10101010 00100000 00101101 00100000 01000111 01100101 01110010 01101101 01100001 01101110 01111001 00001010 11110000 10011111 10000111 10101100 11110000 10011111 10000111 10100111 00100000 00101101 00100000 01010101 01001011 00001010 11110000 10011111 10000111 10111010 11110000 10011111 10000111 10111000 00100000 00101101 00100000 01010101 01010011
Required options
These options will be used automatically if you select this example.
If you enable this option, then all output bytes will be full octets (meaning 8 bits in every byte).
If you enable this option, then all output bytes will have a space character after each of them.
Use this Unicode encoding in the output.
Unicode Font in Binary
This example converts a fancy Unicode font to base two and Big Endian UTF16 encoding. It disables the space between octets option and produces a lengthy binary stream as output.
๐“ฆ๐“ฎ ๐“ต๐“ธ๐“ฟ๐“ฎ ๐“ค๐“ท๐“ฒ๐“ฌ๐“ธ๐“ญ๐“ฎ!
11011000001101011101110011100110110110000011010111011100111011100000000000100000110110000011010111011100111101011101100000110101110111001111100011011000001101011101110011111111110110000011010111011100111011100000000000100000110110000011010111011100111001001101100000110101110111001111011111011000001101011101110011110010110110000011010111011100111011001101100000110101110111001111100011011000001101011101110011101101110110000011010111011100111011100000000000100001
Required options
These options will be used automatically if you select this example.
If you enable this option, then all output bytes will be full octets (meaning 8 bits in every byte).
If you enable this option, then all output bytes will have a space character after each of them.
Use this Unicode encoding in the output.
Unicode to Binary UCS4LE
This example converts a quote by Albert Einstein to binary in UCS-4 Little Endian encoding with a BOM. UCS4 is the same encoding as UTF32, so it's exactly the same as encoding it to UTF32LE encoding. Both UCS4 and UTF32 use 4 bytes per Unicode code point, so each code point is 32 bits in length in the output.
I have no special talent. I am only passionately curious. โ’ถlbert โ’บinsten
11111111111111100000000000000000 01001001000000000000000000000000 00100000000000000000000000000000 01101000000000000000000000000000 01100001000000000000000000000000 01110110000000000000000000000000 01100101000000000000000000000000 00100000000000000000000000000000 01101110000000000000000000000000 01101111000000000000000000000000 00100000000000000000000000000000 01110011000000000000000000000000 01110000000000000000000000000000 01100101000000000000000000000000 01100011000000000000000000000000 01101001000000000000000000000000 01100001000000000000000000000000 01101100000000000000000000000000 00100000000000000000000000000000 01110100000000000000000000000000 01100001000000000000000000000000 01101100000000000000000000000000 01100101000000000000000000000000 01101110000000000000000000000000 01110100000000000000000000000000 00101110000000000000000000000000 00100000000000000000000000000000 01001001000000000000000000000000 00100000000000000000000000000000 01100001000000000000000000000000 01101101000000000000000000000000 00100000000000000000000000000000 01101111000000000000000000000000 01101110000000000000000000000000 01101100000000000000000000000000 01111001000000000000000000000000 00100000000000000000000000000000 01110000000000000000000000000000 01100001000000000000000000000000 01110011000000000000000000000000 01110011000000000000000000000000 01101001000000000000000000000000 01101111000000000000000000000000 01101110000000000000000000000000 01100001000000000000000000000000 01110100000000000000000000000000 01100101000000000000000000000000 01101100000000000000000000000000 01111001000000000000000000000000 00100000000000000000000000000000 01100011000000000000000000000000 01110101000000000000000000000000 01110010000000000000000000000000 01101001000000000000000000000000 01101111000000000000000000000000 01110101000000000000000000000000 01110011000000000000000000000000 00101110000000000000000000000000 00100000000000000000000000000000 10110110001001000000000000000000 01101100000000000000000000000000 01100010000000000000000000000000 01100101000000000000000000000000 01110010000000000000000000000000 01110100000000000000000000000000 00100000000000000000000000000000 10111010001001000000000000000000 01101001000000000000000000000000 01101110000000000000000000000000 01110011000000000000000000000000 01110100000000000000000000000000 01100101000000000000000000000000 01101110000000000000000000000000
Required options
These options will be used automatically if you select this example.
If you enable this option, then all output bytes will be full octets (meaning 8 bits in every byte).
If you enable this option, then all output bytes will have a space character after each of them.
Use this Unicode encoding in the output.
If you selected UTF16, UTF32, UCS2, or UCS4 as output encoding, then add Byte Order Mark (BOM) at the beginning.
Pro tips Master online unicode tools
You can pass input to this tool via ?input query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!
https://onlineunicodetools.com/convert-unicode-to-binary?input=%F0%9F%87%A6%F0%9F%87%BA%20-%20Australia%0A%F0%9F%87%A8%F0%9F%87%A6%20-%20Canada%0A%F0%9F%87%A9%F0%9F%87%AA%20-%20Germany%0A%F0%9F%87%AC%F0%9F%87%A7%20-%20UK%0A%F0%9F%87%BA%F0%9F%87%B8%20-%20US&padding=true&spacing=true&output-encoding=utf8
All unicode tools
Didn't find the tool you were looking for? Let us know what tool we are missing and we'll build it!
Quickly find code positions of all Unicode values.
Quickly decode code positions to Unicode values.
Quickly encode Unicode values to UTF-8 encoding.
Quickly encode Unicode values to UTF-16 encoding.
Quickly encode Unicode values to UTF-32 encoding.
Quickly create a picture from Unicode symbols.
Quickly generate random Unicode text in a given range.
Quickly generate all Unicode values from the given code point interval.
Quickly filter Unicode symbols that are within the given code point interval.
Quickly split Unicode data into graphemes.
Quickly sort Unicode glyphs in increasing or decreasing order.
Quickly find the length of Unicode text.
Quickly increase Unicode code point values.
Quickly decrease Unicode code point values.
Quickly reverse the order of symbols in Unicode text.
Quickly rotate Unicode characters to the left and right.
Quickly create multiple copies of Unicode text.
Quickly extract all characters from Unicode text.
Quickly split Unicode data into pieces.
Quickly split Unicode text into chunks of constant length.
Quickly merge Unicode snippets together.
Quickly shorten Unicode text to the given length.
Quickly left-pad Unicode text with any character.
Quickly right-pad Unicode text with any character.
Quickly align Unicode data to the center.
Quickly align Unicode data to the right.
Quickly convert ordinary numbers to Unicode numbers in various fonts.
Quickly convert ordinary letters to Unicode letters in various fonts.
Quickly convert ordinary text to fancy Unicode text.
Quickly combine input Unicode with diacritical marks.
Quickly circularly rearrange Unicode symbols.
Quickly URL-escape Unicode symbols.
Quickly encode Unicode data to HTML entities.
Quickly encode Unicode values to base64.
Quickly encode Unicode values to a data URI.
Quickly convert Unicode data to base-2 (binary).
Quickly convert Unicode data to base-8 (octal).
Quickly convert Unicode data to base-10 (decimal).
Quickly convert Unicode data to base-16 (hexadecimal).
Quickly create a picture from Unicode emojis.
Release Zalgo on your Unicode text.
Coming soon These unicode tools are on the way
Name Unicode Symbols
Spell out the names of Unicode characters in the input text.
URL-decode Unicode
URL-unescape Unicode text.
Convert Binary to Unicode
Convert base-2 data to Unicode encoding.
Convert Octal to Unicode
Convert base-8 data to Unicode encoding.
Convert Decimal to Unicode
Convert base-10 data to Unicode encoding.
Convert Hex to Unicode
Convert base-16 data to Unicode encoding.
Convert Unicode to Any Base
Convert Unicode text to any radix.
Convert Any Base to Unicode
Convert any radix data to Unicode.
Convert Unicode to ASCII
Convert Unicode characters to ASCII characters.
Convert ASCII to Unicode
Convert ASCII characters to Unicode characters.
Convert Unicode to Latin1
Convert Unicode text to Latin1 encoding.
Convert Latin1 to Unicode
Convert Latin1 encoded data to Unicode.
Convert Unicode to Bytes
Convert Unicode characters to raw bytes.
Convert Bytes to Unicode
Convert raw bytes to Unicode.
Remove Combining Characters
Delete diacritical marks from Unicode data
Remove Zalgo from Unicode
Make Unicode Zalgo text readable again.
Validate Unicode
Check if the given Unicode has valid encoding.
Convert Unicode to Punycode
Encode Unicode text to Punycode encoding.
Convert Punycode to Unicode
Decode Punycode encoding to Unicode.
Decode Base64 to Unicode
Convert base64 data to Unicode text.
Encode Unicode to Data URI
Convert Unicode to a valid data URL.
Decode Data URI to Unicode
Convert a valid data URL to Unicode text.
Convert HTML to Unicode
Decode HTML entities to Unicode data.
Convert UTF8 to Unicode
Decode UTF8 encoding to Unicode.
Convert UTF16 to Unicode
Decode UTF16 encoding to Unicode.
Convert UTF32 to Unicode
Decode UTF32 encoding to Unicode.
Convert Unicode to Uppercase
Convert all Unicode characters to uppercase.
Convert Unicode to Lowercase
Convert all Unicode characters to lowercase.
Convert Unicode to Randomcase
Randomize case of all Unicode characters.
Convert Unicode to Lowercase
Convert all Unicode characters to lowercase.
JSON Stringify Unicode
Encode Unicode to JSON.
JSON Parse Unicode
Decode JSON to Unicode.
Shuffle Unicode Symbols
Randomly rearrange the order of input graphemes.
Analyze Unicode
Print statistics about Unicode data and code points, etc.