  1. What are Unicode, UTF-8, and UTF-16? - Stack Overflow

    Feb 18, 2022 · Encoding basics Note: If you know how UTF-8 and UTF-16 are encoded, skip to the next section for practical applications. UTF-8: For the standard ASCII (0-127) characters, the UTF-8 …

  2. What is the difference between UTF-8 and Unicode?

    Mar 14, 2009 · The main difference between UTF-8, UTF-16, and UTF-32 character encodings is how many bytes they require to represent a character in memory: UTF-8 uses a minimum of 1 byte, but if …

  3. unicode - UTF-8, UTF-16, and UTF-32 - Stack Overflow

    UTF-8 is the de-facto standard in most modern software for saved files. More specifically, it's the most widely used encoding for HTML and configuration and translation files (Minecraft, for example, …

  4. Unicode, UTF, ASCII, ANSI format differences - Stack Overflow

    Mar 31, 2009 · What is the difference between the Unicode, UTF8, UTF7, UTF16, UTF32, ASCII, and ANSI encodings? In what way are these helpful for programmers?

  5. ASCII vs Unicode + UTF-8 - Stack Overflow

    Jan 12, 2019 · Yes, except that UTF-8 is an encoding scheme. Other encoding schemes include UTF-16 (with two different byte orders) and UTF-32. (For some confusion, a UTF-16 scheme is called …

  6. "for line in..." results in UnicodeDecodeError: 'utf-8' codec can't ...

    UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 150: invalid continuation byte I opened the file with NotePad & counted 150th position: that was a Cyrillic symbol.

  7. What's the difference between UTF-8 and UTF-8 with BOM?

    The UTF-8 BOM is a sequence of bytes at the start of a text stream (0xEF, 0xBB, 0xBF) that allows the reader to more reliably guess a file as being encoded in UTF-8. Normally, the BOM is …

  8. Is it possible to force Excel recognize UTF-8 CSV files automatically?

    May 14, 2011 · UTF-8, by definition, does not use nor should use the BOM characters. The way Excel reads BOM screws up the advantage of UTF-8 over Unicode, which is backwards compatibility with …

  9. Using UTF-8 Encoding (CHCP 65001) in Command Prompt / Windows ...

    Jul 21, 2019 · Note: This answer shows how to switch the character encoding in Windows consoles (terminals) to (BOM-less) UTF-8 system-wide (code page 65001), so that shells such as cmd.exe …

  10. Unicode (UTF-8) reading and writing to files in Python

    The point of UTF-8 is to be able to encode 21-bit characters (Unicode) as an 8-bit data stream (because that's the only thing all computers in the world can handle). But since most OSs predate the Unicode …