Unicode letter normalizer

World's simplest unicode tool

This browser-based utility converts fancy Unicode letters back to regular letters. All letter-like characters that you paste or type into the input text area are automatically converted to plain English letters in the output. It normalizes several dozen Unicode fonts, as well as emoji letters and mirrored characters. You can also adjust the case of the normalized letters and exclude certain Unicode letters from conversion. Created by encoding gurus from team Browserling.
Ignore Unicode Glyphs
Specify the letters that you don't want to convert.
Output Letters Case
Detect case automatically.
Use sentence casing for letters.
Return letters in uppercase.
Return letters in lowercase.
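The four case options above can be sketched in Python as follows. This is only an illustration under simple assumptions, not the tool's actual code: the function name `apply_case` and the mode names are hypothetical, and sentence boundaries are detected naively by looking for ".", "!" or "?".

```python
def apply_case(text: str, mode: str = "auto") -> str:
    """Apply a case mode to text that has already been normalized.

    The mode names ("auto", "sentence", "upper", "lower") are hypothetical
    labels for the four options listed above.
    """
    if mode == "auto":
        # Keep whatever case each normalized letter already has.
        return text
    if mode == "upper":
        return text.upper()
    if mode == "lower":
        return text.lower()
    if mode == "sentence":
        # Lowercase everything, then capitalize the first letter
        # that follows the start of the text or a sentence terminator.
        result = []
        capitalize_next = True
        for ch in text.lower():
            if capitalize_next and ch.isalpha():
                result.append(ch.upper())
                capitalize_next = False
            else:
                result.append(ch)
            if ch in ".!?":
                capitalize_next = True
        return "".join(result)
    raise ValueError(f"unknown case mode: {mode!r}")


if __name__ == "__main__":
    sample = "T-REx has shOrt aRms. so thAt"
    for mode in ("auto", "sentence", "upper", "lower"):
        print(mode, "->", apply_case(sample, mode))
```

The real tool may treat symbols, abbreviations, and preserved glyphs differently when it decides where a sentence starts.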
What is a Unicode letter normalizer?
This online web application normalizes Unicode letters. It converts glyphs from different Unicode scripts into regular Latin letters. It works with a wide variety of fonts, such as bold (𝐚, 𝐛, 𝐜), monospace (𝚊, 𝚋, 𝚌), sans-serif (𝖺, 𝖻, 𝖼), and fullwidth (ａ, ｂ, ｃ), as well as with emoji letters, such as 🅰, 🅱, or 🅿. It can also recognize Unicode characters that are similar in shape to letters and turn them into those letters. For example, the North Indic quantity mark "꠹" is converted to the 7-bit ASCII letter "S" and the copyright sign "©" is converted to the regular letter "C".
It can also replace typographical ligatures with strings of two or more letters. A ligature is a combination of several graphemes that forms a single glyph. For example, the character "Æ", used in Danish, Norwegian, and Icelandic, decomposes into the string "AE", and the character "œ", used in French, decomposes into the string "oe". It can normalize control pictures, for example, "␀" as "NUL" or "␛" as "ESC", and also decode Unicode words, for example, "🆒" as "COOL" or "🆘" as "SOS".
Additionally, this program can strip diacritical marks and combining characters from letters, as well as remove Zalgo deformations from the text. It can also identify spoofed letters and replace them with ASCII symbols. Spoofed letters look exactly the same as regular letters but come from different Unicode blocks. For example, the Ukrainian letter Н looks the same as the English letter H, but they are different letters in different parts of the Unicode table.
Some Unicode fonts consist only of uppercase letters, some only of lowercase letters, and some of mixed case. For each Unicode character, this tool automatically determines its case and replaces it with the ASCII letter of the corresponding case. You can quickly change the case to small, large, or true (standard sentence capitalization) using simple radio button options.
You can also preserve Unicode glyphs for certain letters if you enter them in the ignore letters options field.
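Part of the behavior described above can be approximated with Python's standard `unicodedata` module. The sketch below is an assumption about one possible approach, not the tool's implementation: it applies compatibility decomposition (NFKD) and then drops every character whose general category is a mark. The function name `normalize_letters` is hypothetical.

```python
import unicodedata


def normalize_letters(text: str) -> str:
    """Approximate letter normalization with NFKD plus mark removal."""
    # NFKD splits accented letters into base letter + combining marks and
    # replaces compatibility characters (fullwidth forms, mathematical
    # alphabets, some ligatures) with their plain equivalents.
    decomposed = unicodedata.normalize("NFKD", text)
    # General categories Mn, Mc and Me cover diacritics and Zalgo-style marks.
    return "".join(
        ch for ch in decomposed
        if not unicodedata.category(ch).startswith("M")
    )


if __name__ == "__main__":
    print(normalize_letters("\uff46\uff41\uff4e\uff43\uff59"))          # fullwidth letters
    print(normalize_letters("\U0001d41a\U0001d41b\U0001d41c"))          # mathematical bold letters
    print(normalize_letters("h\u0363\u0302e\u0303llo"))                 # letters with stacked marks
    print(normalize_letters("\ufb01re"))                                # "fi" ligature
```

This covers only characters that have a Unicode decomposition. Characters such as "Æ" and "©" have no decomposition mapping, so NFKD leaves them unchanged. Converting them, along with lookalike letters from other scripts, would need an explicit lookup table.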
Unicode letter normalizer examples
Normalize Unicode Letters
In this example, we convert a fancy Unicode phrase into plain ASCII text. The input text uses more than twenty different fonts. Some letters carry various diacritics and some have even been attacked by Zalgo. The app first removes all diacritics and combining characters, and then, for each cleaned Unicode letter, it finds the corresponding letter in the Latin block. The input glyphs are of mixed case: some are uppercase and some are lowercase. Because the automatic case detection mode is selected, the application returns each letter in its original case.
Ｔ-𝗥🇪ｘ 𝕙𝕒𝗌 𝙨h̤🅾𝙧𝓽 ä🅡𝘮⒮ s̷̨̗̹̙̓ͩ̏ȯ̸̻͚̮̏̏ 𝘁⒣🄰𝘁 ɘv̤̈ë̤r̤𝘺𝚝𝐡𝒊🇳𝚐 ⒤🅃 ｈ🅾🄻𝓭🇸 𝒊𝘀 ɔ𝗹𝓸𝐬𝒆 𝗍o̤ 𝓲ｔ🆂 h̼᷿̩ͣ̂̃ȩ̻͚̃᷃̂͟𝚊ⓡẗ.
T-REx has shOrt aRms so thAt everythiNg iT hOLdS is close to itS heart.
Required options
These options will be used automatically if you select this example.
Specify the letters that you don't want to convert.
Detect case automatically.
Normalize Spoofed Text
In this example, the tool does something that can't be seen with the naked eye. The input and output look exactly the same, but they are in fact different. The input contains lookalike letters from the Cyrillic Unicode block (code points U+0400 to U+04FF). In the output, these are replaced with letters from the ASCII range (code points U+0000 to U+007F). In this way, the utility identifies and normalizes spoofed text.
ะ†'m ั•trะพng bะตัะฐuั•ะต ะ†'vะต bะตะตn wะตะฐk.
I'm strong because I've been weak.
Required options
These options will be used automatically if you select this example.
Specify the letters that you don't want to convert.
Detect case automatically.
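The spoofing example can be reproduced with a small lookup table. The sketch below is a hedged illustration, not the tool's method: the table `CYRILLIC_TO_LATIN` is a hypothetical, deliberately tiny mapping that covers only the Cyrillic letters used in this example plus the letter Н mentioned earlier, and the helper names are made up for the demonstration.

```python
# Hypothetical, incomplete homoglyph table: Cyrillic letter -> ASCII lookalike.
CYRILLIC_TO_LATIN = {
    "\u0406": "I",  # CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
    "\u041d": "H",  # CYRILLIC CAPITAL LETTER EN
    "\u0430": "a",  # CYRILLIC SMALL LETTER A
    "\u0435": "e",  # CYRILLIC SMALL LETTER IE
    "\u043e": "o",  # CYRILLIC SMALL LETTER O
    "\u0441": "c",  # CYRILLIC SMALL LETTER ES
    "\u0455": "s",  # CYRILLIC SMALL LETTER DZE
}


def is_cyrillic(ch: str) -> bool:
    """True if the character lies in the Cyrillic block U+0400..U+04FF."""
    return 0x0400 <= ord(ch) <= 0x04FF


def spoofed_positions(text: str) -> list[int]:
    """Indexes of characters that come from the Cyrillic block."""
    return [i for i, ch in enumerate(text) if is_cyrillic(ch)]


def despoof(text: str) -> str:
    """Replace known Cyrillic lookalikes with their ASCII counterparts."""
    return "".join(CYRILLIC_TO_LATIN.get(ch, ch) for ch in text)


if __name__ == "__main__":
    spoofed = "\u0406'm \u0455tr\u043eng"
    print(spoofed_positions(spoofed))
    print(despoof(spoofed))
    print(despoof(spoofed).isascii())
```

A table-driven approach like this only handles the pairs it lists. Legitimate Cyrillic text would also be rewritten, so a production tool would need a fuller confusables table and some policy for deciding when a letter is actually spoofed.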
Pablo Picasso Quote
In this example, we enter a Pablo Picasso quote that consists of many different Unicode glyphs, accents, combining characters, and ligatures. We strip the diacritical marks from all letters, split the ligatures, and turn them into regular letters. We also demonstrate the ignore option and make two exceptions: we keep the original shape of the Unicode glyph "🅿" and the ligature "🆋". This makes the author's signature more original. We also activate the true case option so that the sentence starts with a capital letter.
Ⓘ b́éǵ㏌ ｗｉｔｈ 𝐚𝐧 🆔𝖊𝖆, 𝕒𝕟𝕕 𝓽𝓱𝓮𝓷 i̤t̤ 𝔟𝔢𝔠𝔬𝔪𝔢𝔰 ˢᵒᵐᵉᵗʰⁱ🆖 ⒠ʪ⒠. 🅿🅰🅱🅻🅾 🅿🆋🅰🆂🆂🅾
I begin with an idea, and then it becomes something else. 🅿ablo 🅿🆋asso
Required options
These options will be used automatically if you select this example.
Specify the letters that you don't want to convert.
Use sentence casing for letters.
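The ignore option used in this example can be sketched as a per-character filter. This is a minimal illustration under stated assumptions, not the tool's code: the function name `normalize_except` is hypothetical, and normalization is approximated with NFKD from Python's `unicodedata` module, which handles fullwidth letters and the "fi" ligature used in the demo but not every glyph in the quote above.

```python
import unicodedata


def normalize_except(text: str, ignore: str = "") -> str:
    """Normalize letters but leave every character listed in `ignore` as is."""
    keep = set(ignore)
    out = []
    for ch in text:
        if ch in keep:
            # Preserved glyph: copy it through untouched.
            out.append(ch)
            continue
        # Decompose the character and drop any combining marks.
        for piece in unicodedata.normalize("NFKD", ch):
            if not unicodedata.category(piece).startswith("M"):
                out.append(piece)
    return "".join(out)


if __name__ == "__main__":
    fancy = "\uff57\uff49\uff54\uff48 \ufb01re"   # fullwidth "with" + "fi" ligature + "re"
    print(normalize_except(fancy))                 # everything normalized
    print(normalize_except(fancy, "\ufb01"))       # ligature preserved
```

Checking the ignore set before decomposition keeps the preserved glyphs byte-for-byte identical to the input.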
Pro tips
Master online Unicode tools
You can pass input to this tool via the ?input query argument, and it will automatically compute the output. Here's how to type it in your browser's address bar:
https://onlineunicodetools.com/normalize-unicode-letters?input=%EF%BC%B4-%F0%9D%97%A5%F0%9F%87%AA%EF%BD%98%20%F0%9D%95%99%F0%9D%95%92%F0%9D%97%8C%20%F0%9D%99%A8h%CC%A4%F0%9F%85%BE%F0%9D%99%A7%F0%9D%93%BD%20%C3%A4%F0%9F%85%A1%F0%9D%98%AE%E2%92%AE%20s%CC%93%CD%A9%CC%8F%CC%B7%CC%97%CC%B9%CC%A8%CC%99o%CC%87%CC%8F%CC%8F%CC%B8%CC%BB%CD%9A%CC%AE%20%F0%9D%98%81%E2%92%A3%F0%9F%84%B0%F0%9D%98%81%20%C9%98v%CC%A4%CC%88e%CC%A4%CC%88r%CC%A4%F0%9D%98%BA%F0%9D%9A%9D%F0%9D%90%A1%F0%9D%92%8A%F0%9F%87%B3%F0%9D%9A%90%20%E2%92%A4%F0%9F%85%83%20%EF%BD%88%F0%9F%85%BE%F0%9F%84%BB%F0%9D%93%AD%F0%9F%87%B8%20%F0%9D%92%8A%F0%9D%98%80%20%C9%94%F0%9D%97%B9%F0%9D%93%B8%F0%9D%90%AC%F0%9D%92%86%20%F0%9D%97%8Do%CC%A4%20%F0%9D%93%B2%EF%BD%94%F0%9F%86%82%20h%CD%A3%CC%82%CC%83%CC%BC%E1%B7%BF%CC%A9e%CC%83%E1%B7%83%CC%82%CD%9F%CC%A7%CC%BB%CD%9A%F0%9D%9A%8A%E2%93%A1%E1%BA%97.&ignore-letters=&autocase=true
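A link like the one above can be generated programmatically by percent-encoding the UTF-8 bytes of the input. The sketch below uses Python's standard `urllib.parse.quote`. The parameter names `input`, `ignore-letters`, and `autocase` are taken from the URL shown above. The helper name `build_url` is hypothetical, and the value `false` for `autocase` is an assumption, since only `autocase=true` appears in the example.

```python
from urllib.parse import quote

BASE_URL = "https://onlineunicodetools.com/normalize-unicode-letters"


def build_url(text: str, ignore_letters: str = "", autocase: bool = True) -> str:
    """Build a link that pre-fills the tool with the given input text."""
    params = [
        ("input", text),
        ("ignore-letters", ignore_letters),
        # "false" is an assumed value; the source only shows "true".
        ("autocase", "true" if autocase else "false"),
    ]
    # quote() percent-encodes the UTF-8 bytes; safe="" also encodes "/".
    query = "&".join(f"{key}={quote(value, safe='')}" for key, value in params)
    return f"{BASE_URL}?{query}"


if __name__ == "__main__":
    # Fullwidth T, hyphen, sans-serif bold R, space, h + combining diaeresis below.
    print(build_url("\uff34-\U0001d5e5 h\u0324"))
```

Spaces are encoded as `%20` rather than `+`, which matches the form of the example link.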
All unicode tools
Quickly find code positions of all Unicode values.
Quickly decode code positions to Unicode values.
Quickly encode Unicode values to UTF-8 encoding.
Quickly encode Unicode values to UTF-16 encoding.
Quickly encode Unicode values to UTF-32 encoding.
Quickly create a picture from Unicode symbols.
Quickly generate random Unicode text in a given range.
Quickly generate all Unicode values from the given code point interval.
Quickly filter Unicode symbols that are within the given code point interval.
Quickly split Unicode data into graphemes.
Quickly sort Unicode glyphs in increasing or decreasing order.
Quickly find the length of Unicode text.
Quickly increase Unicode code point values.
Quickly decrease Unicode code point values.
Quickly reverse the order of symbols in Unicode text.
Quickly rotate Unicode characters to the left and right.
Quickly create multiple copies of Unicode text.
Quickly extract all characters from Unicode text.
Quickly split Unicode data into pieces.
Quickly split Unicode text into chunks of constant length.
Quickly merge Unicode snippets together.
Quickly shorten Unicode text to the given length.
Quickly left-pad Unicode text with any character.
Quickly right-pad Unicode text with any character.
Quickly align Unicode data to the center.
Quickly align Unicode data to the right.
Quickly convert ordinary numbers to fancy Unicode numbers.
Quickly convert Unicode numbers back to regular numbers.
Quickly convert ordinary letters to Unicode letters in various fonts.
Quickly convert Unicode letters back to regular Latin letters.
Quickly convert ordinary text to fancy Unicode text.
Quickly convert fancy Unicode text back to regular text.
Quickly combine input Unicode with diacritical marks.
Quickly delete diacritical marks from the Unicode text.
Quickly spoof regular text using Unicode homoglyphs.
Quickly circularly rearrange Unicode symbols.
Quickly convert Unicode data to escape sequences.
Quickly encode Unicode data to HTML entities.
Quickly URL-escape Unicode symbols.
Quickly encode Unicode values to base64.
Quickly encode Unicode values to a data URI.
Quickly convert Unicode characters to raw bytes.
Quickly convert Unicode data to base-2 (binary).
Quickly convert Unicode data to base-8 (octal).
Quickly convert Unicode data to base-10 (decimal).
Quickly convert Unicode data to base-16 (hexadecimal).
Quickly convert Unicode symbols to raw ASCII bytes.
Quickly convert ASCII bytes to Unicode symbols.
Quickly convert Unicode text to a string literal.
Quickly create a picture from Unicode emojis.
Quickly release Zalgo on your Unicode text.
Quickly make Zalgo text readable again.
Coming soon
These Unicode tools are on the way.
Name Unicode Symbols
Spell out the names of Unicode characters in the input text.
URL-decode Unicode
URL-unescape Unicode text.
Convert Binary to Unicode
Convert base-2 data to Unicode encoding.
Convert Octal to Unicode
Convert base-8 data to Unicode encoding.
Convert Decimal to Unicode
Convert base-10 data to Unicode encoding.
Convert Hex to Unicode
Convert base-16 data to Unicode encoding.
Convert Unicode to Any Base
Convert Unicode text to any radix.
Convert Any Base to Unicode
Convert any radix data to Unicode.
Convert Unicode to Latin1
Convert Unicode text to Latin1 encoding.
Convert Latin1 to Unicode
Convert Latin1 encoded data to Unicode.
Convert Bytes to Unicode
Convert raw bytes to Unicode.
Validate Unicode
Check if the given Unicode has valid encoding.
Convert Unicode to Punycode
Encode Unicode text to Punycode encoding.
Convert Punycode to Unicode
Decode Punycode encoding to Unicode.
Decode Base64 to Unicode
Convert base64 data to Unicode text.
Encode Unicode to Data URI
Convert Unicode to a valid data URL.
Decode Data URI to Unicode
Convert a valid data URL to Unicode text.
Convert HTML to Unicode
Decode HTML entities to Unicode data.
Convert UTF8 to Unicode
Decode UTF8 encoding to Unicode.
Convert UTF16 to Unicode
Decode UTF16 encoding to Unicode.
Convert UTF32 to Unicode
Decode UTF32 encoding to Unicode.
Convert Unicode to Uppercase
Convert all Unicode characters to uppercase.
Convert Unicode to Lowercase
Convert all Unicode characters to lowercase.
Convert Unicode to Randomcase
Randomize case of all Unicode characters.
JSON Stringify Unicode
Encode Unicode to JSON.
JSON Parse Unicode
Decode JSON to Unicode.
Shuffle Unicode Symbols
Randomly rearrange the order of input graphemes.
Analyze Unicode
Print statistics about Unicode data and code points, etc.
Slice Unicode
Extract a part from Unicode data.