GSM 03.38
In mobile telephony GSM 03.38 or 3GPP 23.038 is a character set used in the Short Message Service of GSM based cell phones. It is defined in GSM recommendation 03.38. Messages sent via this encoding can be encoded in the default GSM 7-bit alphabet, the 8-bit data alphabet, and the 16-bit UCS-2 alphabet. Support of the GSM 7-bit alphabet is mandatory for GSM handsets and network elements, but characters in languages such as Arabic, Chinese, Korean or Japanese languages must be encoded using the 16-bit UCS-2 character encoding or an extended national language shift table.
GSM 7-bit default alphabet and extension table of 3GPP TS 23.038 / GSM 03.38
The standard encoding for GSM messages is the 7-bit default alphabet as defined in the 23.038 recommendation.
Seven-bit characters must be encoded into octets following one of three packing modes:
CBS: using this encoding, it is possible to send up to 93 characters (packed in up to 82 octets) in one SMS message in a Cell Broadcast Service.