
Grapheme extractor

World's simplest unicode tool

This browser-based utility extracts graphemes from Unicode text. Anything that you paste or type in the text area on the left is automatically split into graphemes on the right. It supports the most popular Unicode encodings (such as UTF-8, UTF-16, UTF-32, and UCS-2) and it works with emoji characters. You can also choose the separator that is placed between the extracted graphemes and add a counter to enumerate them. Created by encoding gurus from team Browserling.
Grapheme Separator
Place this character between extracted graphemes.
Grapheme Enumeration
Put numbers in front of graphemes.
Select a predefined numbering format, or a custom format.
If you choose a custom numbering format, then specify it here.
Grapheme Whitespaces
Spaces, tabs, and newlines remain untouched in extracted graphemes.
Spaces, tabs, and newlines are replaced by symbols "⎵", "⇥", and "↵" respectively.
Remove all spaces, tabs, and newlines.
What is a grapheme extractor?
This utility extracts individual graphemes from your Unicode text. A grapheme is the smallest unit of a writing system of any given language. It can be a character (such as the letter "a" of the English alphabet or the letter "ö" of the Finnish alphabet), a symbol (such as a left arrow "🡄" or a right arrow "🡆"), an emoji (such as a smiling face with sunglasses "😎" or a face with stuck-out tongue "😜"), a hieroglyph (such as an Egyptian hieroglyph "𓁑" or "𓄿"), or even Zalgo text (such as "c̷̘͔͌̇h̴̞̤̀͋ä̵̬̫̔ò̵̞̮̇s̷̠̲̑͂"). A single grapheme can consist of several Unicode code points: each letter of the Zalgo example is a base letter followed by a stack of combining marks. In the options, you can set the output separator that is placed between all graphemes, and you can enumerate the graphemes by putting a counter number before each of them. We added the three most common counter formats, but you can also specify your own custom counter format. Another useful option lets you manage the whitespace graphemes. You can leave all spaces, tabs, and newlines untouched, visualize them by replacing them with special Unicode symbols, or remove them completely.
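To make the idea of a multi-code-point grapheme concrete, here is a minimal Python sketch. It is not how this tool is implemented; it is a simplified approximation that assumes a grapheme is a base code point plus any following combining marks, with zero-width joiners gluing neighbours together. The function name split_graphemes is hypothetical. The full rules are defined in Unicode Standard Annex #29 and cover more cases than this.

```python
import unicodedata

ZWJ = "\u200d"  # zero width joiner, used to build emoji sequences


def split_graphemes(text: str) -> list[str]:
    """Approximate grapheme splitting (a simplification, not full UAX #29).

    A code point is attached to the previous cluster when it is a
    combining mark (general category M*), when it is a ZWJ, or when
    the previous code point was a ZWJ.
    """
    clusters: list[str] = []
    join_next = False  # True right after a ZWJ
    for ch in text:
        is_mark = unicodedata.category(ch).startswith("M")
        if clusters and (is_mark or ch == ZWJ or join_next):
            clusters[-1] += ch
        else:
            clusters.append(ch)
        join_next = ch == ZWJ
    return clusters


# "o" + COMBINING DIAERESIS is two code points but one grapheme.
sample = "o\u0308k"
print(len(sample), len(split_graphemes(sample)))  # 3 code points, 2 graphemes
```

This sketch deliberately ignores emoji skin-tone modifiers, flag sequences, Hangul syllable composition, and CRLF pairs, so it may split those differently from a full implementation.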
Grapheme extractor examples
Chicken Evolution
In this example, we split a chicken evolution illustrated with Unicode emojis into graphemes. We add numbering with parentheses to it and remove all whitespace characters to make meaningful graphemes more visible.
🥚 ➜ 🐣 ➜ 🐤 ➜ 🐓
1) 🥚 2) ➜ 3) 🐣 4) ➜ 5) 🐤 6) ➜ 7) 🐓
Required options
These options will be used automatically if you select this example.
Place this character between extracted graphemes.
Put numbers in front of graphemes.
Select a predefined numbering format, or a custom format.
Remove all spaces, tabs, and newlines.
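The combination of options in this example (drop whitespace, then number what remains) can be sketched in Python as follows. This is an illustration under stated assumptions, not the tool's own code: the function name enumerate_graphemes and the "%n) " template for the parentheses format are hypothetical, and the input is passed as an already-split list of graphemes. Calling list() on the string is enough here only because every symbol in this example is a single code point.

```python
def enumerate_graphemes(graphemes, separator="\n", number_format="%n) "):
    """Drop whitespace graphemes, then prefix each remaining one with a counter.

    "%n" in number_format is replaced by the 1-based position,
    counted after the whitespace has been removed.
    """
    kept = [g for g in graphemes if not g.isspace()]
    return separator.join(
        number_format.replace("%n", str(position)) + g
        for position, g in enumerate(kept, start=1)
    )


# Each emoji and arrow below is one code point, so list() splits it correctly.
chicken = "🥚 ➜ 🐣 ➜ 🐤 ➜ 🐓"
print(enumerate_graphemes(list(chicken), separator=" "))
# 1) 🥚 2) ➜ 3) 🐣 4) ➜ 5) 🐤 6) ➜ 7) 🐓
```

Numbering after filtering is what keeps the counter consecutive, which matches the example output above.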
Unicode Greek Phrase
In this example, we use a saying attributed to the ancient Greek philosopher Socrates, which translates to "I know that I know nothing." We extract all Greek graphemes from it and put pipe characters between them for better visualization.
ἓν οἶδα ὅτι οὐδὲν οἶδα
ἓ|ν| |ο|ἶ|δ|α| |ὅ|τ|ι| |ο|ὐ|δ|ὲ|ν| |ο|ἶ|δ|α
Required options
These options will be used automatically if you select this example.
Place this character between extracted graphemes.
Put numbers in front of graphemes.
Spaces, tabs, and newlines remain untouched in extracted graphemes.
Unicode Medals
In this example, we extract all graphemes from a list of Olympic emoji medals. We use a custom numbering format here and place the grapheme position number "%n" in square brackets. We also enable the option that visualizes all spaces, tabs, and newlines using special Unicode characters – space as a "⎵", tab as a "⇥", and newline as a "↵".
Gold:	🥇
Silver:	🥈
Bronze:	🥉
[1] G [2] o [3] l [4] d [5] : [6] ⇥ [7] 🥇 [8] ↵ [9] S [10] i [11] l [12] v [13] e [14] r [15] : [16] ⇥ [17] 🥈 [18] ↵ [19] B [20] r [21] o [22] n [23] z [24] e [25] : [26] ⇥ [27] 🥉
Required options
These options will be used automatically if you select this example.
Place this character between extracted graphemes.
Put numbers in front of graphemes.
Select a predefined numbering format, or a custom format.
If you choose a custom numbering format, then specify it here.
Spaces, tabs, and newlines are replaced by symbols "⎵", "⇥", and "↵" respectively.
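A rough Python sketch of the two options used in this example, whitespace visualization and a custom "%n" counter format, might look like this. The helper names visualize_whitespace and number_with_template are hypothetical and the code only illustrates the behaviour described above; it is not taken from the tool. As before, list() stands in for real grapheme splitting, which is safe here because the sample contains no combining sequences.

```python
# Replacement symbols for space, tab, and newline, as listed in the options.
VISIBLE_WHITESPACE = str.maketrans({" ": "⎵", "\t": "⇥", "\n": "↵"})


def visualize_whitespace(graphemes):
    """Replace whitespace graphemes with their visible symbols."""
    return [g.translate(VISIBLE_WHITESPACE) for g in graphemes]


def number_with_template(graphemes, template="[%n]"):
    """Prefix each grapheme with the template, expanding %n to its position."""
    return [
        f"{template.replace('%n', str(position))} {g}"
        for position, g in enumerate(graphemes, start=1)
    ]


medals = "Gold:\t🥇\nSilver:\t🥈"
numbered = number_with_template(visualize_whitespace(list(medals)))
print(" ".join(numbered[:8]))
# [1] G [2] o [3] l [4] d [5] : [6] ⇥ [7] 🥇 [8] ↵
```

Unlike the whitespace-removal option, visualization keeps every whitespace grapheme in the count, so the tab and the newline receive their own numbers.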
Pro tips
You can pass input to this tool via the ?input query argument and it will automatically compute the output. Here's how to type it in your browser's address bar:
https://onlineunicodetools.com/extract-unicode-graphemes?input=%F0%9F%A5%9A%20%E2%9E%9C%20%F0%9F%90%A3%20%E2%9E%9C%20%F0%9F%90%A4%20%E2%9E%9C%20%F0%9F%90%93&separator=%0A&enumerate-graphemes=true&numbering-format=number-with-parentheses&remove-whitespaces=true
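If you want to generate such links programmatically, the percent-encoding can be left to Python's standard library. The sketch below is only an illustration: the function name build_tool_url is hypothetical, and the parameter names and values are copied from the example URL above rather than from any documented API, so other values are untested assumptions.

```python
from urllib.parse import quote, urlencode

BASE_URL = "https://onlineunicodetools.com/extract-unicode-graphemes"


def build_tool_url(text: str) -> str:
    """Build a link that preloads `text` with the options from the example URL."""
    params = {
        "input": text,
        "separator": "\n",  # encoded as %0A
        "enumerate-graphemes": "true",
        "numbering-format": "number-with-parentheses",
        "remove-whitespaces": "true",
    }
    # quote_via=quote encodes spaces as %20 instead of "+", as in the example.
    return f"{BASE_URL}?{urlencode(params, quote_via=quote)}"


print(build_tool_url("🥚 ➜ 🐣 ➜ 🐤 ➜ 🐓"))
```

The text is encoded as UTF-8 before escaping, which is why each emoji turns into four percent-escaped bytes.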
All unicode tools
Didn't find the tool you were looking for? Let us know what tool we are missing and we'll build it!
Quickly find code positions of all Unicode values.
Quickly decode code positions to Unicode values.
Quickly encode Unicode values to UTF-8 encoding.
Quickly encode Unicode values to UTF-16 encoding.
Quickly encode Unicode values to UTF-32 encoding.
Quickly create a picture from Unicode symbols.
Quickly generate random Unicode text in a given range.
Quickly generate all Unicode values from the given code point interval.
Quickly filter Unicode symbols that are within the given code point interval.
Quickly split Unicode data into graphemes.
Quickly sort Unicode glyphs in increasing or decreasing order.
Quickly find the length of Unicode text.
Quickly increase Unicode code point values.
Quickly decrease Unicode code point values.
Quickly reverse the order of symbols in Unicode text.
Quickly rotate Unicode characters to the left and right.
Quickly create multiple copies of Unicode text.
Quickly extract all characters from Unicode text.
Quickly split Unicode data into pieces.
Quickly split Unicode text into chunks of constant length.
Quickly merge Unicode snippets together.
Quickly shorten Unicode text to the given length.
Quickly left-pad Unicode text with any character.
Quickly right-pad Unicode text with any character.
Quickly align Unicode data to the center.
Quickly align Unicode data to the right.
Quickly convert ordinary numbers to Unicode numbers in various fonts.
Quickly convert ordinary letters to Unicode letters in various fonts.
Quickly convert ordinary text to fancy Unicode text.
Quickly combine input Unicode with diacritical marks.
Quickly circularly rearrange Unicode symbols.
Quickly URL-escape Unicode symbols.
Quickly encode Unicode data to HTML entities.
Quickly encode Unicode values to base64.
Quickly encode Unicode values to a data URI.
Quickly convert Unicode data to base-2 (binary).
Quickly convert Unicode data to base-8 (octal).
Quickly convert Unicode data to base-10 (decimal).
Quickly convert Unicode data to base-16 (hexadecimal).
Quickly create a picture from Unicode emojis.
Release Zalgo on your Unicode text.
Coming soon
These Unicode tools are on the way.
Name Unicode Symbols
Spell out the names of Unicode characters in the input text.
URL-decode Unicode
URL-unescape Unicode text.
Convert Binary to Unicode
Convert base-2 data to Unicode encoding.
Convert Octal to Unicode
Convert base-8 data to Unicode encoding.
Convert Decimal to Unicode
Convert base-10 data to Unicode encoding.
Convert Hex to Unicode
Convert base-16 data to Unicode encoding.
Convert Unicode to Any Base
Convert Unicode text to any radix.
Convert Any Base to Unicode
Convert any radix data to Unicode.
Convert Unicode to ASCII
Convert Unicode characters to ASCII characters.
Convert ASCII to Unicode
Convert ASCII characters to Unicode characters.
Convert Unicode to Latin1
Convert Unicode text to Latin1 encoding.
Convert Latin1 to Unicode
Convert Latin1 encoded data to Unicode.
Convert Unicode to Bytes
Convert Unicode characters to raw bytes.
Convert Bytes to Unicode
Convert raw bytes to Unicode.
Remove Combining Characters
Delete diacritical marks from Unicode data.
Remove Zalgo from Unicode
Make Unicode Zalgo text readable again.
Validate Unicode
Check if the given Unicode has valid encoding.
Convert Unicode to Punycode
Encode Unicode text to Punycode encoding.
Convert Punycode to Unicode
Decode Punycode encoding to Unicode.
Decode Base64 to Unicode
Convert base64 data to Unicode text.
Encode Unicode to Data URI
Convert Unicode to a valid data URL.
Decode Data URI to Unicode
Convert a valid data URL to Unicode text.
Convert HTML to Unicode
Decode HTML entities to Unicode data.
Convert UTF8 to Unicode
Decode UTF8 encoding to Unicode.
Convert UTF16 to Unicode
Decode UTF16 encoding to Unicode.
Convert UTF32 to Unicode
Decode UTF32 encoding to Unicode.
Convert Unicode to Uppercase
Convert all Unicode characters to uppercase.
Convert Unicode to Lowercase
Convert all Unicode characters to lowercase.
Convert Unicode to Randomcase
Randomize case of all Unicode characters.
JSON Stringify Unicode
Encode Unicode to JSON.
JSON Parse Unicode
Decode JSON to Unicode.
Shuffle Unicode Symbols
Randomly rearrange the order of input graphemes.
Analyze Unicode
Print statistics about Unicode data and code points, etc.