Search Results

    Text to Unicode

    Convert characters into Unicode format (U+XXXX)

    1


    Unicode Output:


    Share

    What Is the Text to Unicode Tool?

    The Text to Unicode tool converts each character in your text into its Unicode code point in the standard U+XXXX format, with your choice of delimiter separating the values in the output. Type or paste your text, choose how you want the code points separated, click Convert, and the Unicode representations appear in the output. It is used for inspecting the Unicode values of specific characters, generating code point sequences for documentation or development, working with character encoding, and checking how unusual or non-Latin characters are represented in the Unicode standard.

    The U+XXXX format is the canonical way to reference characters in Unicode documentation and specification work, so having a tool that produces it directly saves a fair amount of manual lookup.

    How to Use This Tool

    1. Paste or type your text into the input box above.
    2. Choose your preferred delimiter to separate the code points in the output: space, comma, or new line. Match it to how you plan to use the result.
    3. Click Convert and the tool outputs a U+XXXX code point for every character in your input, separated by your chosen delimiter.
    4. Copy the result using the Copy button, or select and copy manually. Use Clear to reset the input.

    When Would You Use This?

    Looking up the Unicode code point for a specific character, especially one that is not on a standard keyboard, to reference it in documentation, a specification, a bug report, or a code comment that needs to identify the character precisely.

    Converting a string to Unicode code points for use in a programming context where characters need to be referenced by their code point values rather than their literal form, such as in escape sequences or character encoding operations.

    Checking how accented letters, special symbols, emoji, or characters from non-Latin writing systems are represented in Unicode, particularly when debugging encoding issues or verifying that a character is what you think it is.

    Examples

    Converting a simple word

    Input   : Hello

    Delimiter: Space

    Output  : U+0048 U+0065 U+006C U+006C U+006F

    Converting a phrase with a space character

    Input   : Hi there

    Delimiter: Space

    Output  : U+0048 U+0069 U+0020 U+0074 U+0068 U+0065 U+0072 U+0065

    (U+0020 is the code point for a space character)

    Comma-separated output

    Input   : Test

    Delimiter: Comma

    Output  : U+0054,U+0065,U+0073,U+0074

    Accented character

    Input   : é

    Delimiter: Space

    Output  : U+00E9

    (é is a single precomposed Unicode character with code point U+00E9)

    New line separated output

    Input   : OK

    Delimiter: New line

    Output  : U+004F
    U+004B

    Frequently Asked Questions

    How do I convert text to Unicode code points?

    Paste your text into the input box, choose a delimiter, and click Convert. Each character is mapped to its U+XXXX code point and the output appears with your chosen formatting.

    What is a Unicode code point?

    A Unicode code point is a number assigned to a specific character in the Unicode standard. It is typically written in the format U+XXXX where XXXX is a hexadecimal number. Every character in Unicode, from basic Latin letters to emoji, has a unique code point.

    What does U+0041 mean?

    U+ indicates a Unicode code point, and 0041 is the hexadecimal value of that code point. U+0041 corresponds to the Latin capital letter A. You can look up any U+XXXX value in a Unicode chart to find the character it represents.

    How do I find the Unicode code point for a character?

    Type or paste the character into the input box of this tool and click Convert. The output will show its code point in U+XXXX format. You can also look characters up in the official Unicode character database if you need additional information about them.

    What is the Unicode code point for a space?

    A space character has the Unicode code point U+0020. It is one of the first printable characters in the Unicode standard, inherited directly from ASCII.

    How do I convert text to Unicode in Python?

    Use the ord() function to get the code point as an integer: ord('A') returns 65. To format it as U+XXXX: 'U+{:04X}'.format(ord('A')) returns 'U+0041'. For a list of characters: [f'U+{ord(c):04X}' for c in "Hello"]. For a quick conversion without code, use this tool.

    What is the difference between Unicode and UTF-8?

    Unicode is the standard that assigns code points to characters. UTF-8 is one of several encoding formats that specifies how those code points are stored as bytes. A code point like U+0041 always represents the letter A, but how that gets stored in a file depends on the encoding used.

    Can I convert emoji to Unicode code points?

    Yes. Emoji are standard Unicode characters and have code points just like letters. For example, the smiley face emoji has the code point U+1F600. Some emoji are composed of multiple code points combined, so a single visible emoji may produce more than one U+XXXX value in the output.