Thai Industrial Standard 620-2533

Thai Industrial Standard 620-2533 is commonly known as the most common character set and character encoding for the Thai signature by the acronym TIS -620. The standard was from the Thai Industrial Standards Institute ( TISI ), an organ of the Royal Thai Government, approved and is the only valid standard in the Kingdom of Thailand.

The descriptive name of the standard is: "Standard for Thai letter codes for use in the computer " ( Thai: รหัส สำหรับ อักขระ ไทย ที่ ใช้ กับ คอม พิ ว เต อร ).

The suffix " 2533 " refers to the year after the Buddhist calendar (1990 ), in which the standard was published. The previous version, TIS- 620-2529 (1986 ), so that no longer applies.

Structure

TIS -620 is a conventional ASCII extension that is fully compatible with 7-bit ASCII encoding and the Thai letters in 8 -bit hexadecimal range between A1 and FB. Due to the complex placement of the Thai vowels and tone marks TIS -620 is only used to exchange information. For a correct view additional rendering engine for Thai text is needed.

Variants

An almost identical version of TIS -620 was adapted in 1999 as ISO 8859-11. The only difference is that in ISO 8859-11 character A0 ( Hex) is defined as a non-breaking space, while it is indeed reserved in TIS -620, but not defined. (In practice, this small difference is usually ignored. )

The character set ISO 8859-11 has been registered as ISO -IR -166 at Ecma International, but this variant also contains explicit escape sequences to mark the beginning and end of a Thai word. ( In Thai no spaces between the words are set. )

The Windows code page 874 is also based on TIS -620, but a few more characters added.

The order of the characters in TIS -620 has also been adopted in Unicode ( ISO 10646 ). The Unicode block Thai ranges from U 0 E01 to U 0 E7F. TIS -620 character can easily be converted to UTF -16. You just have to add the prefix to each byte 0E and remove the hex number on the value of A0.

You may display in the browser must be enlarged to show all the signs read.

In the upper table 20 is the regular SPACE character. The values ​​00- 1F, 7F. 80 - 9F, A0, DB -DE and FC -FF are in TIS -620 assigned to any character.

767254
de