
Unicode text normalizer

World's simplest unicode tool

This browser-based utility converts fancy Unicode text back to regular text. All Unicode glyphs that you paste or enter in the input text area are automatically converted to simple ASCII characters in the output. It normalizes Unicode letters, numbers, punctuation marks, and ligatures, as well as emoji that contain letters. You can configure the tool to output some characters as-is and to control the case of the output text. Created by encoding gurus from team Browserling.
Skip Text Symbols
Enter Unicode letters, numbers, and marks that you don't want to normalize.
Normalized Text Case
Preserve the input case from the input Unicode glyphs.
Reformat the output to use a proper sentence case.
Convert all letters in the output to capital letters.
Convert all letters in the output to lowercase letters.
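The four case options above can be illustrated with a small Python sketch. This is not the tool's own code: the function names and the mode names ("preserve", "sentence", "upper", "lower") are hypothetical, and the sentence-case rule used here (capitalize the first letter of the text and the first letter that follows ".", "!" or "?" plus whitespace) is only an assumption about how such a mode might behave.

```python
def sentence_case(text):
    """Lowercase everything, then capitalize the first letter of each sentence."""
    out = []
    capitalize_next = True    # the start of the text begins a sentence
    after_terminator = False  # True right after ".", "!" or "?"
    for ch in text.lower():
        if ch.isalnum():
            # Only letters are capitalized; a leading digit just ends the "start" state.
            out.append(ch.upper() if (capitalize_next and ch.isalpha()) else ch)
            capitalize_next = False
            after_terminator = False
        else:
            if ch in ".!?":
                after_terminator = True
            elif ch.isspace() and after_terminator:
                capitalize_next = True
                after_terminator = False
            elif not ch.isspace():
                after_terminator = False
            out.append(ch)
    return "".join(out)


def apply_case(text, mode="preserve"):
    """Apply one of the four (hypothetically named) case modes."""
    if mode == "upper":
        return text.upper()
    if mode == "lower":
        return text.lower()
    if mode == "sentence":
        return sentence_case(text)
    return text  # "preserve": leave the case untouched
```

Requiring whitespace after the terminator keeps strings such as "3.5" from starting a new sentence; a real implementation would likely need more rules (abbreviations, quotes, and so on).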
What is a Unicode text normalizer?
This online web application normalizes Unicode text. It converts all typographic Unicode glyphs to readable English characters from the ASCII charset. It supports over twenty different alphabets and a dozen Unicode fonts, and it can also recognize letters by their shape. The tool transforms all letter-like symbols into letters of the English alphabet. It can normalize ligatures, for example, "㎞" as "km" and "㏑" as "ln", and word glyphs, for example, "🆕" as "NEW" and "🆗" as "OK". It also works with many numeric and math fonts and converts Unicode digits into regular ASCII digits 0 to 9 (code points U+0030 to U+0039). It can also normalize multi-digit numbers, for example, "⑫" to "12" and "㉍" to "60", as well as fractions, for example, "½" to "1/2" and "⅗" to "3/5". It replaces fake punctuation marks with ASCII punctuation marks, for example, an emoji question mark "❓" (U+2753) gets converted to a regular question mark "?" (U+003F), and a sine wave "∿" (U+223F) gets normalized to an ordinary tilde "~" (U+007E). It also normalizes all Unicode spaces and removes zero-width spaces. This app can also strip the combining characters from any text, which removes the Zalgo effect and returns clean symbols. If you want to preserve certain Unicode characters, you can enter them in the "Skip Text Symbols" option field. The listed glyphs will then remain unchanged. You can also control the case of the letters in the output by letting the program automatically determine the case from the input, capitalizing only the first letter of each sentence, or turning all characters into upper or lower case.
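Part of the behavior described above can be approximated with Python's standard `unicodedata` module. The sketch below is not how this tool is implemented: it relies on Unicode compatibility normalization (NFKC), which already maps many styled letters, squared ligatures, circled numbers, and fraction glyphs to plain characters, plus a tiny hand-made lookalike table. The names `ZERO_WIDTH`, `LOOKALIKES`, and `fold_to_ascii` are hypothetical, and the table covers only the symbols mentioned in the paragraph above.

```python
import unicodedata

# Zero-width characters to drop (an illustrative, incomplete set).
ZERO_WIDTH = {"\u200b", "\u200c", "\u200d", "\u2060", "\ufeff"}

# Hand-made lookalike table (hypothetical; a real tool would need a far larger one).
LOOKALIKES = {
    "\u2753": "?",  # emoji question mark
    "\u223f": "~",  # sine wave
    "\u2044": "/",  # fraction slash left behind when a fraction glyph is decomposed
}


def fold_to_ascii(text):
    """Fold styled Unicode to plain characters where NFKC or the table allows it."""
    out = []
    for ch in text:
        if ch in ZERO_WIDTH:
            continue
        # NFKC applies compatibility mappings, e.g. U+339E -> "km", U+246B -> "12".
        for piece in unicodedata.normalize("NFKC", ch):
            if unicodedata.category(piece) == "Zs":
                piece = " "  # any remaining Unicode space separator becomes a plain space
            out.append(LOOKALIKES.get(piece, piece))
    return "".join(out)


print(fold_to_ascii("\U0001D67A\U0001D698"))  # monospace "K" and "o" -> Ko
print(fold_to_ascii("\u00bd \u339e"))         # -> 1/2 km
```

Characters with no compatibility mapping and no table entry (many emoji word glyphs, for instance) pass through unchanged here, which is where shape-based recognition like the one the tool describes would be needed.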
Unicode text normalizer examples
The Penguins of Madagascar
In this example, we normalize a dialogue from the cartoon The Penguins of Madagascar. Skipper's lines are written in a monospace font and Kowalski's replies use a bold-italic sans-serif font. In the input dialogue, there are also many weird commas, dots, dashes, and question marks. The program splits the text into individual graphemes and then for each grapheme finds the corresponding character in the ASCII table. All output characters are in the range from U+0000 to U+007F and have the same case as the input Unicode glyphs.
— 𝙺𝚘𝚠𝚊𝚕𝚜𝚔𝚒٫ 𝚘𝚙𝚝𝚒𝚘𝚗𝚜ꓸ — 𝘼 𝙨𝙩𝙧𝙖𝙩𝙚𝙜𝙞𝙘 𝙧𝙚𝙩𝙧𝙚𝙖𝙩͵ 𝙎𝙠𝙞𝙥𝙥𝙚𝙧❔ — 𝙴𝚡𝚙𝚕𝚊𝚒𝚗ꓸ — 𝙄𝙩ʼ𝙨 𝙡𝙞𝙠𝙚 𝙧𝙪𝙣𝙣𝙞𝙣𝙜 𝙖𝙬𝙖𝙮 𝙗𝙪𝙩 𝙢𝙖𝙣𝙡𝙞𝙚𝙧․
- Kowalski, options. - A strategic retreat, Skipper? - Explain. - It's like running away but manlier.
Required options
These options will be used automatically if you select this example.
Enter Unicode letters, numbers, and marks that you don't want to normalize.
Preserve the input case from the input Unicode glyphs.
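The Penguins example mentions splitting the text into graphemes and producing output that stays within U+0000 to U+007F. Python's standard library has no full grapheme segmenter, so the sketch below only approximates one by attaching combining marks to the preceding base character; the helper names `split_graphemes_approx` and `is_ascii_only` are hypothetical and this is not the tool's actual algorithm.

```python
import unicodedata


def split_graphemes_approx(text):
    """Group each base character with the combining marks that follow it.

    This is a rough approximation of grapheme clusters; it ignores emoji
    sequences, Hangul syllable rules, and other cases a full segmenter handles.
    """
    clusters = []
    for ch in text:
        if clusters and unicodedata.category(ch).startswith("M"):
            clusters[-1] += ch  # combining mark: extend the current cluster
        else:
            clusters.append(ch)
    return clusters


def is_ascii_only(text):
    """Check that every code point lies in the range U+0000 to U+007F."""
    return all(ord(ch) <= 0x7F for ch in text)


print(split_graphemes_approx("e\u0301a"))     # two clusters: "e" + acute accent, then "a"
print(is_ascii_only("- Kowalski, options."))  # True
```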
A Recipe For Happiness
In this example, we introduce a simple and useful recipe for everyday happiness. We use many bright and extraordinary Unicode characters here. Many letters contain combining marks, emoticons, and typographical ligatures. Unicode numbers use various shapes, fonts, and even fractional glyphs. Punctuation marks come in a variety of styles and colors. We turn each Unicode symbol into plain text. We use the "Sentence Case" mode to capitalize only the first letter of each sentence and convert the rest of the text to lowercase.
เผ ๐˜ข lฬคษ๏ฝ’๐š๐—ฒ hฬคฬˆ๐’†๐•๐Ÿ…ฟ๐—ถ๐—‡๏ฝ‡ ๐จ๐™› แน•osฬคฬˆ๐ข๐Ÿ‡นโ“˜๐˜ท๐˜ช๐™ฉโ“จโ โˆ— โถ ๐™ก๐šŠrฬคฬˆฯฑ๐Ÿ„ด ฦจmฬˆ๐•š๐•๐—ฒโ เผ โ‘ก ๐Ÿ‡จ๐•ฆpฬคโ’ฎ ๐—ˆ๐—ณ ฦจ๐”€๏ฝ…๏ฝ…๐“ฝ๐•Ÿวsฬค๐—Œโธต โ‹† ยพ ใŽ ๐Ÿ…พ๐Ÿ‡ซ ๐–Œ๐™คoฬคฬˆdฬ ๐Ÿ…ข๐˜ฆโ’ฉโ“ข๐Ÿ„ด ๐—ˆ๐Ÿ…ต โ‚•แต˜โ‚˜๐˜ฐ๐Ÿ†อพ เผ ยฝ แถœuฬค๏ฝ ๐Ÿ„พfฬค ฦจวlโ“•-๏ฝ…ล›แบ—โ‚‘๏ฝ…โ“œโธต โ‹† ๐Ÿ“โ“ช๐Ÿข ใŽค ๐Ÿ…ž๐“ฏ ๐Ÿ‡นสณ๐Ÿ†„๐“ฎ ๐–‹๐–บ๐’Š๐š๐—โ โˆ— ๐Ÿ™ ๐—Œpฬˆ๐˜ฐoโฟfฬคแต˜๐Ÿ„ป แต’๐Ÿ„ต ๐ โ“ž๐™ค๐“ญแบ…๐˜ช๐š•โ’งอพ เผ แ˜” ๐—‰๐ข๐˜ฏ๊œ€๐กว๐Ÿ†‚ ๏ฝ๐•— ๐Ÿ„ด๐–†๐—Œโ“จ ๐”คโ“ž๐š’๐Ÿ†–โ โ‹† โ“โ’ฉdฬคฬˆ ๐Ÿ…ฐ ๐Ÿ„ท๐–พaฬคฬˆโ’ญ๐ญ โ“•uฬคฬˆ๐”ฉ๐Ÿ…› ๐™คfฬค ๏ฝŒ๐—ˆvฬ๐˜ฆ๊“ธ ๐˜ฎรญโ‚“ ๐—โ’ชโ“–๐•–๐’•โ‚•รซษน ๐Ÿ…ฐ๐’๐—ฑ ๐˜ดสœษ’โ“ก๐”ข wฬค๐“ฒ๐”ฑ๐™ ๐”ฃ๐Ÿ‡ฆmฬค๐—‚๐š•โ“จ โ’œ๏ฝŽ๐ fฬrฬครฏ๐˜ฆโ’ฉ๐šsฬคโ—โ•
* A large helping of positivity; * 1 large smile; * 2 cups of sweetness; * 3/4 kg of good sense of humor; * 1/2 cup of self-esteem; * 500 cm^3 of true faith; * 1 spoonful of goodwill; * 2 pinches of easy going; * and a heart full of love. Mix together and share with family and friends!!
Required options
These options will be used automatically if you select this example.
Enter Unicode letters, numbers, and marks that you don't want to normalize.
Reformat the output to use a proper sentence case.
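The recipe example relies on converting numeric glyphs such as circled digits and fraction characters. One way to sketch that step in Python is to read the numeric value that the Unicode Character Database assigns to a character and print it as an integer or a fraction. This is an illustration under that assumption, not the tool's own method, and `numeric_glyph_to_ascii` is a hypothetical name.

```python
import unicodedata
from fractions import Fraction


def numeric_glyph_to_ascii(ch):
    """Return an ASCII rendering of a single numeric glyph, or the glyph itself."""
    value = unicodedata.numeric(ch, None)  # None when the character has no numeric value
    if value is None:
        return ch
    # The database value is a float; limit_denominator recovers small fractions
    # such as 3/5 from 0.6.
    return str(Fraction(value).limit_denominator(1000))


print(numeric_glyph_to_ascii("\u00be"))  # vulgar fraction three quarters -> 3/4
print(numeric_glyph_to_ascii("\u246b"))  # circled number twelve -> 12
```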
Dinosaur Language
This example translates words from the Dinosaur language into English. The Dinosaur language contains many decorative Unicode symbols and even some Zalgo. The utility outputs the symbols as clean text so that the message is easy to read. It preserves three Unicode characters, "⋗", "⋖", and "🅾", by using the "Skip Text Symbols" option. Thus, we get an easy-to-read phrase in the output, with the three decorative Unicode symbols preserved.
ᐳᐳᐳᐳᐳ⋗“ⓇⓄⒶⓇ”⋖ᐸᐸᐸᐸᐸᐸ ᐳᐳᐳᐳᐳ⋗𝔪𝔢𝔞𝔫𝔰⋖ᐸᐸᐸᐸᐸᐸᐸ ᐳ⋗“🅸 🅻🅾🆅🅴 🆈🅾🆄”⋖ᐸ ᐳᐳᐳᐳᐳᐳᐳ⋗㏌⋖ᐸᐸᐸᐸᐸᐸᐸᐸ ᐳᐳᐳᐳᐳ⋗D̺̗̮ͬ̌i̭͖͂ͅn̞͚̻᷆͒o͚̤̺᷇᷇s̜̘ͦ͜ä̦̣͔́͌ų̻᷊̓͒r̢̧̻ͨ͋⋖ᐸᐸᐸᐸ
>>>>>⋗"ROAR"⋖<<<<<< >>>>>⋗means⋖<<<<<<< >⋗"I L🅾VE Y🅾U"⋖< >>>>>>>⋗in⋖<<<<<<<< >>>>>⋗Dinosaur⋖<<<<
Required options
These options will be used automatically if you select this example.
Enter Unicode letters, numbers, and marks that you don't want to normalize.
Preserve the input case from the input Unicode glyphs.
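The Dinosaur example combines two ideas: characters listed in "Skip Text Symbols" pass through untouched, and Zalgo is removed by dropping combining marks. A minimal Python sketch of that combination follows. It assumes that Zalgo marks can be identified by their Unicode general category (any category starting with "M"), and `normalize_except` is a hypothetical helper, not the tool's implementation.

```python
import unicodedata


def is_mark(ch):
    """True for combining marks (general categories Mn, Mc, Me)."""
    return unicodedata.category(ch).startswith("M")


def normalize_except(text, skip=""):
    """Fold characters with NFKD and drop combining marks, except skipped symbols."""
    keep = set(skip)
    out = []
    for ch in text:
        if ch in keep:
            out.append(ch)  # listed in the skip field: output as-is
            continue
        if is_mark(ch):
            continue        # stand-alone combining mark (Zalgo): drop it
        folded = unicodedata.normalize("NFKD", ch)
        out.append("".join(c for c in folded if not is_mark(c)))
    return "".join(out)


print(normalize_except("\u24c7\u24c4", skip="\u24c4"))  # circled R is folded, circled O is kept
print(normalize_except("D\u036c\u030ci\u0342"))          # -> Di
```

Dropping every mark is a blunt rule: it also removes legitimate accents, which matches the tool's stated goal of ASCII output but would be too aggressive for text that should keep its diacritics.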
Pro tips Master online unicode tools
You can pass input to this tool via the ?input query argument, and it will automatically compute the output. Here's how to type it in your browser's address bar.
https://onlineunicodetools.com/normalize-unicode-text?input=%E2%80%94%20%F0%9D%99%BA%F0%9D%9A%98%F0%9D%9A%A0%F0%9D%9A%8A%F0%9D%9A%95%F0%9D%9A%9C%F0%9D%9A%94%F0%9D%9A%92%D9%AB%20%F0%9D%9A%98%F0%9D%9A%99%F0%9D%9A%9D%F0%9D%9A%92%F0%9D%9A%98%F0%9D%9A%97%F0%9D%9A%9C%EA%93%B8%0A%E2%80%94%20%F0%9D%98%BC%20%F0%9D%99%A8%F0%9D%99%A9%F0%9D%99%A7%F0%9D%99%96%F0%9D%99%A9%F0%9D%99%9A%F0%9D%99%9C%F0%9D%99%9E%F0%9D%99%98%20%F0%9D%99%A7%F0%9D%99%9A%F0%9D%99%A9%F0%9D%99%A7%F0%9D%99%9A%F0%9D%99%96%F0%9D%99%A9%CD%B5%20%F0%9D%99%8E%F0%9D%99%A0%F0%9D%99%9E%F0%9D%99%A5%F0%9D%99%A5%F0%9D%99%9A%F0%9D%99%A7%E2%9D%94%0A%E2%80%94%20%F0%9D%99%B4%F0%9D%9A%A1%F0%9D%9A%99%F0%9D%9A%95%F0%9D%9A%8A%F0%9D%9A%92%F0%9D%9A%97%EA%93%B8%0A%E2%80%94%20%F0%9D%99%84%F0%9D%99%A9%CA%BC%F0%9D%99%A8%20%F0%9D%99%A1%F0%9D%99%9E%F0%9D%99%A0%F0%9D%99%9A%20%F0%9D%99%A7%F0%9D%99%AA%F0%9D%99%A3%F0%9D%99%A3%F0%9D%99%9E%F0%9D%99%A3%F0%9D%99%9C%20%F0%9D%99%96%F0%9D%99%AC%F0%9D%99%96%F0%9D%99%AE%20%F0%9D%99%97%F0%9D%99%AA%F0%9D%99%A9%20%F0%9D%99%A2%F0%9D%99%96%F0%9D%99%A3%F0%9D%99%A1%F0%9D%99%9E%F0%9D%99%9A%F0%9D%99%A7%E2%80%A4&ignore-letters=&autocase=true
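A URL of this shape can be assembled programmatically by percent-encoding the input as UTF-8. The sketch below uses Python's `urllib.parse`; the parameter names `input`, `ignore-letters`, and `autocase` are copied from the example URL above, while their exact meaning (apart from `input`) and the helper name `build_normalizer_url` are assumptions.

```python
from urllib.parse import parse_qs, quote, urlencode, urlsplit

BASE_URL = "https://onlineunicodetools.com/normalize-unicode-text"


def build_normalizer_url(text, ignore_letters="", autocase="true"):
    """Build a tool URL with the text percent-encoded in the ?input argument."""
    query = urlencode(
        {"input": text, "ignore-letters": ignore_letters, "autocase": autocase},
        quote_via=quote,  # encode spaces as %20, as in the example URL, instead of "+"
        safe="",
    )
    return f"{BASE_URL}?{query}"


url = build_normalizer_url("\u2014 \U0001D67A")
print(url)

# Decoding the query string recovers the original text.
print(parse_qs(urlsplit(url).query, keep_blank_values=True)["input"][0])
```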
All unicode tools
Quickly find code positions of all Unicode values.
Quickly decode code positions to Unicode values.
Quickly encode Unicode values to UTF-8 encoding.
Quickly encode Unicode values to UTF-16 encoding.
Quickly encode Unicode values to UTF-32 encoding.
Quickly create a picture from Unicode symbols.
Quickly generate random Unicode text in a given range.
Quickly generate all Unicode values from the given code point interval.
Quickly filter Unicode symbols that are within the given code point interval.
Quickly split Unicode data into graphemes.
Quickly sort Unicode glyphs in increasing or decreasing order.
Quickly find the length of Unicode text.
Quickly increase Unicode code point values.
Quickly decrease Unicode code point values.
Quickly reverse the order of symbols in Unicode text.
Quickly rotate Unicode characters to the left and right.
Quickly create multiple copies of Unicode text.
Quickly extract all characters from Unicode text.
Quickly split Unicode data into pieces.
Quickly split Unicode text into chunks of constant length.
Quickly merge Unicode snippets together.
Quickly shorten Unicode text to the given length.
Quickly left-pad Unicode text with any character.
Quickly right-pad Unicode text with any character.
Quickly align Unicode data to the center.
Quickly align Unicode data to the right.
Quickly convert ordinary numbers to fancy Unicode numbers.
Quickly convert Unicode numbers back to regular numbers.
Quickly convert ordinary letters to Unicode letters in various fonts.
Quickly convert Unicode letters back to regular Latin letters.
Quickly convert ordinary text to fancy Unicode text.
Quickly convert fancy Unicode text back to regular text.
Quickly combine input Unicode with diacritical marks.
Quickly delete diacritical marks from the Unicode text.
Quickly spoof regular text using Unicode homoglyphs.
Quickly circularly rearrange Unicode symbols.
Quickly convert Unicode data to escape sequences.
Quickly encode Unicode data to HTML entities.
Quickly URL-escape Unicode symbols.
Quickly encode Unicode values to base64.
Quickly encode Unicode values to a data URI.
Quickly convert Unicode characters to raw bytes.
Quickly convert Unicode data to base-2 (binary).
Quickly convert Unicode data to base-8 (octal).
Quickly convert Unicode data to base-10 (decimal).
Quickly convert Unicode data to base-16 (hexadecimal).
Quickly convert Unicode symbols to raw ASCII bytes.
Quickly convert ASCII bytes to Unicode symbols.
Quickly convert Unicode text to a string literal.
Quickly create a picture from Unicode emojis.
Quickly release Zalgo on your Unicode text.
Quickly make Zalgo text readable again.
Coming soon These unicode tools are on the way
Name Unicode Symbols
Spell out the names of Unicode characters in the input text.
URL-decode Unicode
URL-unescape Unicode text.
Convert Binary to Unicode
Convert base-2 data to Unicode encoding.
Convert Octal to Unicode
Convert base-8 data to Unicode encoding.
Convert Decimal to Unicode
Convert base-10 data to Unicode encoding.
Convert Hex to Unicode
Convert base-16 data to Unicode encoding.
Convert Unicode to Any Base
Convert Unicode text to any radix.
Convert Any Base to Unicode
Convert any radix data to Unicode.
Convert Unicode to Latin1
Convert Unicode text to Latin1 encoding.
Convert Latin1 to Unicode
Convert Latin1 encoded data to Unicode.
Convert Bytes to Unicode
Convert raw bytes to Unicode.
Validate Unicode
Check if the given Unicode has valid encoding.
Convert Unicode to Punycode
Encode Unicode text to Punycode encoding.
Convert Punycode to Unicode
Decode Punycode encoding to Unicode.
Decode Base64 to Unicode
Convert base64 data to Unicode text.
Encode Unicode to Data URI
Convert Unicode to a valid data URL.
Decode Data URI to Unicode
Convert a valid data URL to Unicode text.
Convert HTML to Unicode
Decode HTML entities to Unicode data.
Convert UTF8 to Unicode
Decode UTF8 encoding to Unicode.
Convert UTF16 to Unicode
Decode UTF16 encoding to Unicode.
Convert UTF32 to Unicode
Decode UTF32 encoding to Unicode.
Convert Unicode to Uppercase
Convert all Unicode characters to uppercase.
Convert Unicode to Lowercase
Convert all Unicode characters to lowercase.
Convert Unicode to Randomcase
Randomize case of all Unicode characters.
JSON Stringify Unicode
Encode Unicode to JSON.
JSON Parse Unicode
Decode JSON to Unicode.
Shuffle Unicode Symbols
Randomly rearrange the order of input graphemes.
Analyze Unicode
Print statistics about Unicode data and code points, etc.
Slice Unicode
Extract a part from Unicode data.