Материал: part05

Внимание! Если размещение файла нарушает Ваши авторские права, то обязательно сообщите нам

DICOM PS3.5 2020a - Data Structures and Encoding Page 141

Table I.3-1. Character Sets and Escape Sequences Used in the Examples

Character Component		Value of	ISO	Standard	ESC	Code	Character Set:
Set	Group	(0008,0005)DefinedRegistration		for Code	Sequence Element Purpose of Use
Description		Term	Number	Extension
Korean	First:	Value 1:	ISO-IR 6			GL	ISO 646:
	Single-byte	none
	Second:	Value 1:	ISO-IR 6			GL	ISO 646:
	Ideographic	none					For delimiters
		Value 2:	ISO-IR 149	ISO 2022	ESC 02/04	GR	KS X 1001:
		ISO 2022 IR 149			02/09 04/03		Hangul and Hanja
		ISO 2022 IR 149					Hangul and Hanja
	Third:	Value 1:	ISO-IR 6			GL	ISO 646:
	Phonetic	none					For delimiters
		Value 2:	ISO-IR 149	ISO 2022	ESC 02/04	GR	KS X 1001:
		ISO 2022 IR 149			02/09 04/03		Hangul and Hanja
		ISO 2022 IR 149					Hangul and Hanja

- Standard -

Page 142

DICOM PS3.5 2020a - Data Structures and Encoding

- Standard -

DICOM PS3.5 2020a - Data Structures and Encoding

Page 143

J Character Sets and Person Name Value Representation using Unicode UTF-8, GB18030 and GBK (Informative)

The Unicode UTF-8 character set and the GB18030 character set may be used for multiple languages. Some of these languages mayalsobeencodedusingothercharactersetsthataredefinedelsewhereintheDICOMStandard.AsUnicodeUTF-8andGB18030 encodings do not allow ISO 2022 character set replacement, these must be used for all strings in a single SOP Instance. This may have implications for the character set selected for the encoding of the SOP Instance.

Since the GBK character set is fully code point compatible to the larger character set of GB 18030, and the specific examples of GB 18030 encoding this in Annex (J.3 and J.4) include only the Chinese characters falling in the common coding area between the two standards, these examples are used to demonstrate the person name and text encoding in both standards. Examples specific to GBK are not necessary.

J.1 Example of Person Name Value Representation in the Chinese Language Using Unicode

Example J.1-1. Example of Person Name Value Representation in the Chinese Language Using Unicode

Person names in the Chinese language may be written in Hanzi (ideographic characters), and/or Latin (alphabetic characters). The Latin representation may be derived using pinyin or another Romanization method, or may be a chosen "westernized" name. The two component groups should be written in the order of alphabetic, then ideographic; the phonetic component group is typically not used (see Table 6.2-1). In this example the traditional script is used.

Note

1.Somehealthcareinformationsystemsmayencodea"westernized"namewithotherpatientaliasesinaseparateattribute, e.g., Other Patient Names (0010,1091).

2.Some environments using Chinese language may use the third name component, e.g., for the Yi or Mongolian script, with or without the first name component. This would be similar to the Japanese and Korean name component usage.

In the example below, the Specific Character Set attribute (0008,0005) would contain:

(0008,0005) ISO_IR 192

Text string:

Wang^XiaoDong= ^ =

Character encoded representation is:

0x57

0x61

0x6e

0x67

0x5e

0x58

0x69

0x61

0x6f

0x44

0x6f

0x6e

0x67

0x3d

0xe7

0x8e

0x8b

0x5e

0xe5

0xb0

0x8f

0xe6

0x9d

0xb1

0x3d

Note

The underlined bytes correspond to the Unicode code points for the Chinese characters:

(U+738B)

(U+5C0F)

- Standard -

Page 144

DICOM PS3.5 2020a - Data Structures and Encoding

(U+6771)

and the corresponding UTF-8 encodings are: UTF-8 (U+738b) = 0xe7 0x8e 0x8b

UTF-8 (U+5c0f U+6771) = 0xe5 0xb0 0x8f 0xe6 0x9d 0xb1

J.2 Example of Long Text Value Representation in the Chinese Language Using Unicode

Example J.2-1. Example of Long Text Value Representation in the Chinese Language Using Unicode

The following is an example of a Long Text value representation that includes ASCII and ISO 10646 character set.

Specific Character Set:

(0008,0005) ISO_IR 192

Text string:

The first line includes .

The second line includes , too.

The third line.

Character encoded representation is:

0x54 0x68 0x65 0x20 0x66 0x69 0x72 0x73 0x74 0x20 0x6c 0x69 0x6e 0x65 0x20 0x69 0x6e 0x63 0x6c 0x75 0x64 0x65 0x73 0xe4 0xb8 0xad 0xe6 0x96 0x87 0x2e 0x0d 0x0a 0x54 0x68 0x65 0x20 0x73 0x65 0x63 0x6f 0x6e 0x64 0x20 0x6c 0x69 0x6e 0x65 0x20 0x69 0x6e 0x63 0x6c 0x75 0x64 0x65 0x73 0xe4 0xb8 0xad 0xe6 0x96 0x87 0x2c 0x20 0x74 0x6f 0x6f 0x2e 0x0d 0x0a 0x54 0x68 0x65 0x20 0x74 0x68 0x69 0x72 0x64 0x20 0x6c 0x69 0x6e 0x65 0x2e 0x0d 0x0a

Note

The underlined byte codes correspond to the Unicode code points for the Chinese characters:

(U+4E2D) 0xe4 0xb8 0xad

(U+6587) 0xe6 0x96 0x87

J.3 Example of Person Name Value Representation in the Chinese Language Using GB18030

ExampleJ.3-1.ExampleofPersonNameValueRepresentationintheChineseLanguageUsingGB18030

- Standard -

DICOM PS3.5 2020a - Data Structures and Encoding

Page 145

Note

See notes to Section J.1.

In the example below, the Specific Character Set attribute (0008,0005) would contain: (0008,0005) GB18030

Text string:

Wang^XiaoDong= ^ =

Character encoded representation is:

0x57 0x61 0x6e 0x67 0x5e 0x58 0x69 0x61 0x6f 0x44 0x6f 0x6e 0x67 0x3d 0xcd 0xf5 0x5e 0xd0 0xa1 0xb6 0xab 0x3d

Note

The GB18030 encodings for the Chinese characters used here are:

(CDF5 in GB18030)

(D0A1 in GB18030)

(B6AB in GB18030)

J.4 Example of Long Text Value Representation in the Chinese Language Using GB18030

Example J.4-1. Example of Long Text Value Representation in the Chinese Language Using GB18030

The following is an example of a Long Text value representation that includes ASCII and GB18030 character set.

Specific Character Set:

(0008,0005) GB18030

Text string:

The first line includes .

The second line includes , too.

The third line.

Character encoded representation is:

0x54 0x68 0x65 0x20 0x66 0x69 0x72 0x73 0x74 0x20 0x6c 0x69 0x6e 0x65 0x20 0x69 0x6e 0x63 0x6c 0x75 0x64 0x65 0x73 0xd6 0xd0 0xce 0xc4 0x2e 0x0d 0x0a 0x54 0x68 0x65 0x20 0x73 0x65 0x63 0x6f 0x6e 0x64 0x20 0x6c 0x69 0x6e 0x65 0x20 0x69 0x6e 0x63 0x6c 0x75 0x64 0x65 0x73 0xd6 0xd0 0xce 0xc4 0x2c 0x20 0x74 0x6f 0x6f 0x2e 0x0d 0x0a 0x54 0x68 0x65 0x20 0x74 0x68 0x69 0x72 0x64 0x20 0x6c 0x69 0x6e 0x65 0x2e 0x0d 0x0a

Note

The underlined byte codes correspond to the GB18030 encodings for the Chinese characters used:

(D6D0 in GB18030)

- Standard -

Смотрите также:


«ДОХОДЫ, РАСХОДЫ И ПРИБЫЛЬ КОММЕРЧЕСКОГО БАНКА.»
«ДОХОДЫ, РАСХОДЫ И ПРИБЫЛЬ КОММЕРЧЕСКОГО БАНКА.»
Значение, сущность и содержание социально — педагогической деятельности в организации для детей-сирот и детей, оставшихся без попечения родителей
Проактивные методы PR-деятельности российских авиационных компаний «Россия», «Азимут»
__RGR2
__RGR2
_10_Эмиль Золя для эл версии
_11_А. Франс для эл версии
_3 тема - Диффузия