Unicode System

Wednesday, February 01, 2017



Unicode is a universal, international character encoding standard that can represent most of the world's written languages.

Before Unicode, there were several separate encoding systems:


1. ASCII - supports English as used in the United States.
2. ISO 8859-1 - supports Western European languages.
3. KOI-8 - supports Russian.
4. GB18030 and BIG-5 - support Chinese.

This caused two problems:

1. A particular code value corresponds to different letters in the various language standards.
2. The encodings for languages with large character sets have variable length. Some common characters are encoded as a single byte, while others require two or more bytes.
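The first problem can be seen directly in Java. The following is only a minimal sketch: the class name CodeValueDemo and the helper decode are made up for illustration, and it assumes the running JDK ships the KOI8-R charset (ISO-8859-1 is always available, KOI8-R is not guaranteed). It decodes the same byte value, 0xE1, under two of the older standards and prints the resulting code points.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class CodeValueDemo {
    // Hypothetical helper: decode one byte value using the given charset.
    public static String decode(int byteValue, Charset charset) {
        return new String(new byte[] { (byte) byteValue }, charset);
    }

    public static void main(String[] args) {
        Charset latin1 = StandardCharsets.ISO_8859_1;
        // Assumption: this JDK provides KOI8-R; forName throws if it does not.
        Charset koi8r = Charset.forName("KOI8-R");

        String western = decode(0xE1, latin1); // Latin small letter a with acute
        String russian = decode(0xE1, koi8r);  // Cyrillic capital letter A

        // Print code points instead of glyphs to avoid console encoding issues.
        System.out.printf("ISO-8859-1: U+%04X%n", (int) western.charAt(0));
        System.out.printf("KOI8-R:     U+%04X%n", (int) russian.charAt(0));
    }
}
```

The same byte yields U+00E1 under ISO-8859-1 and U+0410 under KOI8-R, so text is unreadable unless the reader knows which standard the writer used.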


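To solve these problems, a new encoding standard called Unicode was developed, which supports most of the world's languages. Unicode was originally designed as a 2-byte (16-bit) encoding, so Java also uses 2 bytes for its char type. Unicode has since grown beyond 16 bits, so a Java char now holds one UTF-16 code unit, and a character that does not fit in 16 bits is stored as two chars (a surrogate pair).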


Lowest value of a Java char: \u0000

Highest value of a Java char: \uFFFF (Unicode itself extends to U+10FFFF)
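These limits can be checked with the standard java.lang.Character constants. The sketch below is illustrative only: the class name CharRangeDemo and the helper unitsFor are invented for this example, and the code point 0x1F600 is just one arbitrary character above the 16-bit range.

```java
public class CharRangeDemo {
    // Hypothetical helper: how many 16-bit chars Java needs for a code point.
    public static int unitsFor(int codePoint) {
        return Character.charCount(codePoint);
    }

    public static void main(String[] args) {
        // Range of the 16-bit char type.
        System.out.printf("min char = U+%04X%n", (int) Character.MIN_VALUE);
        System.out.printf("max char = U+%04X%n", (int) Character.MAX_VALUE);
        // Largest code point defined by Unicode.
        System.out.printf("max code point = U+%X%n", Character.MAX_CODE_POINT);

        // A character above the 16-bit range is stored as a surrogate pair.
        String big = new String(Character.toChars(0x1F600));
        System.out.println("chars used: " + big.length());
        System.out.println("code points: " + big.codePointCount(0, big.length()));
        System.out.println("units for 'A': " + unitsFor('A'));
    }
}
```

Here big.length() is 2 while codePointCount reports 1, which is why code that must handle every Unicode character should work with code points (int) rather than individual char values.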
