About

utf-8

Tag Info

Info Newest Frequent Score Active Unanswered

UTF-8 (Unicode Transformation Format, 8 bits) is a character encoding that describes each Unicode code point using a byte sequence of one to six bytes. It is backwards-compatible with ASCII while still supporting representation of all Unicode code points.

UTF-8 is a character encoding that can describe the set of unicode code points in byte sequences of one to six bytes.

UTF-8 is the most widely used character encoding, and is recommended for use on the Internet. It is the standard character encoding on linux and other recent unix-like operating systems. It was designed to be backwards-compatible with ascii while still supporting representation of all Unicode code points.

The algorithm for encoding code points in UTF-8 is described in RFC 3629.

History Usage Guidance history

Stats

created	12 years, 3 months ago
viewed	14 times
active	11 years, 8 months ago
editors	2

Top Answerers

200_success

146k22 gold badges191 silver badges481 bronze badges

G. Sliepen

69.3k3 gold badges75 silver badges180 bronze badges

vnp

58.7k4 gold badges55 silver badges144 bronze badges

Toby Speight

88.3k14 gold badges104 silver badges327 bronze badges

chux

36.4k2 gold badges43 silver badges97 bronze badges

Recent Hot Answers

Transcode UCS-4BE to UTF-8

Transcode UCS-4BE to UTF-8

Transcode UCS-4BE to UTF-8

Transcode UCS-4BE to UTF-8

Transcode UCS-4BE to UTF-8

more »

About

Stats

Top Answerers

Recent Hot Answers

Related Tags