Skip to main content

International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
  · Data Customizer
 

Related Websites

Unicode Consortium

Common Locale Data

IBM Open Source

Globalize
Your E-Business

Sun: Java i18n forum

 

ICU  >  Demonstrations  > 

ICU Data Library Customizer

This tool will generate a data library that can only be used with the 4.2 series of ICU. The help page provides information on how to use this tool.

+
Click on a table header to sort the list
Data Item Description Kilobytes
Partial table for ISO-2022426
Latin 1 (EBCDIC ibm-1140 with modified line endings)3
Simplified Chinese (GB18030)226
Urdu3
Cyrillic (EBCDIC)3
Turkish (EBCDIC)3
Latin 1 for Open systems (EBCDIC)3
hp-roman83
Arabic (ISO-8859-6)3
Farsi (EBCDIC)3
Farsi4
Baltic (EBCDIC)3
Estonian (EBCDIC)3
Ukrainian (EBCDIC)3
Ukrainian3
Ukrainian4
Vietnamese3
Vietnamese (EBCDIC)3
Cyrillic Belarusian4
Lao (EBCDIC)3
Lao3
Devanagari (EBCDIC)3
Latin 1 (EBCDIC with Euro)3
German (EBCDIC with Euro)3
Denmark (EBCDIC with Euro)3
Sweden (EBCDIC with Euro)3
Italian (EBCDIC with Euro)3
Spain (EBCDIC with Euro)3
UK and Ireland (EBCDIC with Euro)3
France (EBCDIC with Euro)3
Latin 1 (EBCDIC with Euro)3
Icelandic (EBCDIC with Euro)3
Latin 2 (EBCDIC with Euro)3
Cyrillic (EBCDIC with Euro)3
Turkish (EBCDIC with Euro)3
Baltic Multilingual (EBCDIC with Euro)3
Estonian (EBCDIC with Euro)3
Ukrainian (EBCDIC with Euro)3
Thai (EBCDIC with Euro)3
Thai (TIS-620 with Euro update)3
Vietnamese (EBCDIC with Euro)3
Ukrainian (KOI8-U)4
Central Europe (windows-1250 without Euro update)3
Cyrillic (windows-1251 without Euro update)3
Latin 1 (windows-1252 without Euro update)3
Greek (windows-1253 without Euro update)3
Turkish (windows-1254 without Euro update)3
Hebrew (windows-1255 without Euro update)3
Arabic (windows-1256 without Euro update)4
Baltic (windows-1257 without Euro update)3
Vietnamese (windows-1258 without Euro update)3
Hebrew (EBCDIC update of ibm-424)3
Adobe-Standard-Encoding3
Korean (KSC_5601)3
Korean (KSC_5601)131
Korean (EBCDIC)141
Traditional Chinese (EBCDIC)118
Traditional Chinese (Big5)115
Traditional Chinese (Big5-HKSCS-2004 with mappings for Unicode 3.1)178
Simplified Chinese (GB2312)75
Simplified Chinese (GBK)115
Simplified Chinese (EBCDIC)154
Japanese (EBCDIC)162
Japanese (EBCDIC)4
Japanese (DBCS)161
Arabic (EBCDIC update of ibm-420)3
German (EBCDIC)3
Denmark (EBCDIC)3
Sweden (EBCDIC)3
Italian (EBCDIC)3
Spain (EBCDIC)3
UK and Ireland (EBCDIC)3
Japanese with only Katakana (EBCDIC)3
France (EBCDIC)3
Japanese (EUC-JP)96
Japanese (EUC-JP)97
Latin 1 (EBCDIC)3
Arabic (EBCDIC)3
Hebrew (EBCDIC)3
USA (PC-DOS)4
Arabic (EBCDIC update of ibm-421)3
Hebrew (EBCDIC update of ibm-803)3
Greek (similar to ISO-8859-7 with Euro update)3
Greek (EBCDIC update of ibm-875)3
Latin 1 (EBCDIC)3
Hebrew (ISO-8859-8)3
Japanese with only Katakana (EBCDIC with Euro update of ibm-1027)3
Central Europe (windows-1250)3
Cyrillic (windows-1251)3
Latin 1 (windows-1252)3
Greek (windows-1253)3
Turkish (windows-1254)3
Hebrew (windows-1255 variant)3
Arabic (windows-1256 variant)4
Baltic (windows-1257 variant)3
Vietnamese (windows-1258)3
Traditional Chinese (Big5-HKSCS-2001 with mappings for Unicode 3.0)131
Partial table for ISO-202271
Arabic (PC-DOS)3
Greek (PC-DOS)4
Baltic (PC-DOS)4
Hebrew (EBCDIC)3
Greek (ISO-8859-7:1987 without Euro update)3
Thai (EBCDIC)3
Japanese with only Katakana (EBCDIC Euro update of ibm-290)3
Latin 1 (PC-DOS)4
Greek (PC-DOS)4
Latin 2 (PC-DOS)4
Cyrillic (PC-DOS)4
Hebrew (PC-DOS)3
Turkish (PC-DOS)4
Latin 1 (PC-DOS with Euro update)3
Portuguese (PC-DOS)4
Icelandic (PC-DOS)4
Hebrew (PC-DOS)4
French Canadian (PC-DOS)4
Arabic (PC-DOS)4
Norwegian and Dutch (PC-DOS)4
Russian (PC-DOS)4
Hebrew (PC-DOS update of ibm-862)4
Urdu4
Greek (PC-DOS)3
Latin 2 (EBCDIC)3
Icelandic (EBCDIC)3
Thai (TIS-620)3
Greek (EBCDIC)3
Russian (KOI8-R)3
Greek (ISO-8859-7:2003 with Euro update)3
Baltic (ISO-8859-13 with Euro update)4
Estonian (update of ibm-922)4
Greek (EBCDIC with Euro update of ibm-875 and ibm-4971)3
Latin 2 (ISO-8859-2)4
Latin 3 (ISO-8859-3)3
Baltic (ISO-8859-4)3
Cyrillic (ISO-8859-5)3
Hebrew (ISO-8859-8 variant)3
Urdu (EBCDIC)3
Turkish (ISO-8859-9)3
Baltic (ISO-8859-13)3
Estonian3
Latin 9 (ISO-8859-15)3
Japanese (EBCDIC)97
Korean (EBCDIC)139
Simplified Chinese (EBCDIC)82
Traditional Chinese (EBCDIC)118
Japanese (EBCDIC)96
Japanese (Shift-JIS78)83
Japanese (Shift-JIS with byte 0x5C <-> Yen mapping)88
Japanese (Shift-JIS with byte 0x5C <-> backslash mapping)88
Hebrew (windows-1255)3
Arabic (windows-1256)4
Baltic (windows-1257)3
Korean (KSC_5601)127
Korean (KSC_5601)127
Traditional Chinese (Big5 without Euro update)15
Japanese (EUC-JP)99
Traditional Chinese (EUC-TW)147
Korean (EUC-KR)100
Korean (DBCS)119
Partial table for ISO-2022102
Latin 6 (ISO-8859-10)3
Thai (ISO-8859-11)2
Latin 8 (ISO-8859-14)3
Partial table for ISO-202272
Partial table for ISO-202270
Partial table for LMBCS8
Macintosh4
Central Europe (Macintosh)3
Turkish (Macintosh)4
Greek (Macintosh)4
Cyrillic (Macintosh)4
Thai (TIS-620)3
Simplified Chinese (GBK)111
Korean (KSC_5601)132
Traditional Chinese (Big5)12
GSM cellphone encoding3
Latin 13
Arabic3
ISO-7 OLD IRV2
French (7-bit iso-ir-69)2
German (7-bit iso-ir-21)2
Italian (7-bit iso-ir-15)2
British English (7-bit iso-ir-4)2
Spanish (7-bit iso-ir-85)2
Portuguese (7-bit iso-ir-84)2
Norwegian (7-bit iso-ir-60)3
ibm-10172
ibm-10182
ibm-10192
ibm-10202
ibm-10212
ibm-10233
Arabic (ibm-1046)3
ibm-11003
ibm-11012
ibm-11023
ibm-11032
ibm-11042
ibm-11052
ibm-11062
ibm-11072
Arabic with French4
Thai (Euro update of ibm-1129)3
Vietnamese (Euro update of ibm-1129)3
Vietnamese (EBCDIC)3
Cyrillic for Kazakhstan (EBCDIC)3
Cyrillic (KOI8-RU)3
Cyrillic for Kazakhstan3
Adobe (Postscript) Latin-13
Simplified Chinese (DBCS subset of ibm-1388, ibm-4933)114
Latin 2 (PC-DOS update of ibm-852)3
Japanese (EBCDIC update of ibm-930)96
Japanese (EUC-JP variant)92
Japanese (DBCS subset of ibm-5039)82
Korean (DBCS subset of ibm-1363)132
Simplified Chinese (EBCDIC)152
Simplified Chinese (DBCS subset of ibm-1381)98
Simplified Chinese79
Simplified Chinese (DBCS subset of ibm-1383)94
Simplified Chinese (EBCDIC)151
Arabic (PC-DOS update of ibm-864)4
Arabic (PC-DOS update of ibm-864)4
Traditional Chinese (DBCS subset of ibm-1370)120
Latin 1 (EBCDIC)3
IBM Symbol Set 7 (EBCDIC)4
German, Belgium (EBCDIC)3
Portuguese (EBCDIC)3
German (EBCDIC)2
APL (EBCDIC symbols for A Programming Language)3
Japanese (DBCS subset of ibm-930 and ibm-939)94
Japanese (DBCS subset of ibm-943)85
Japanese with only Katakana (EBCDIC)3
Arabic (EBCDIC)3
Korean (DBCS subset of ibm-1364)139
Simplified Chinese (DBCS subset of ibm-1388)153
Latin 2 (PC-DOS update of ibm-852)3
Cyrillic (PC-DOS update of ibm-855)3
Hebrew (PC-DOS update of ibm-856)3
Arabic (PC-DOS update of ibm-864)3
Japanese (Shift-JIS variant for HP computers)77
Japanese (DBCS subset of ibm-1350, JIS X208-1990)87
Japanese (DBCS subset of ibm-1350, JIS X212)80
Korean (DBCS subset of ibm-21450)100
Arabic (Euro update of ibm-1008)3
Devanagari (ISCII variant)3
Cyrillic4
Korean (DBCS subset of ibm-933)137
Traditional Chinese (DBCS subset of ibm-5033)116
Simplified Chinese (DBCS subset of ibm-5031)80
Cyrillic (Euro update of ibm-1125)4
Cyrillic Belarusian (Euro update of ibm-1131)4
Latin 9 (PC-DOS)4
Arabic (EBCDIC update of ibm-420)3
Cyrillic (Euro update of ibm-855)4
Cyrillic (EBCDIC)3
Japanese with only Katakana2
Japanese (JIS X 0201)3
Traditional Chinese (DBCS subset of ibm-1371)116
Hebrew (Euro and Sequel update of ibm-856)4
Arabic3
Turkish (EBCDIC)3
Greek (Euro update of ibm-869)3
Japanese (DBCS subset of ibm-5050)61
Arabic (Update of ibm-1046)3
Latin 9 (EBCDIC with Euro)3
Korean (DBCS subset of ibm-944)132
Traditional Chinese (DBCS subset of ibm-938)118
Simplified Chinese (DBCS subset of ibm-936)104
Japanese (DBCS subset of ibm-943)113
Korean110
Simplified Chinese80
Traditional Chinese (DBCS subset of ibm-950)118
Traditional Chinese116
Korean (DBCS subset of ibm-949)128
Japanese (DBCS subset of ibm-954, JIS X208-1990)95
Japanese (DBCS subset of JIS X212-199088
Japanese (DBCS subset of ibm-957)74
Simplified Chinese (DBCS subset of ibm-5488)114
Latin 10 (ISO-8859-16)3
+
Click on a table header to sort the list
Data Item Description Kilobytes
Grapheme cluster break rules14
 13
Greek0.2
English0.1
English (United States)0.1
English (United States, Computer)0.2
Japanese0.2
Line break rules92
0.2
Root0.5
Sentence break rules27
 27
Thai0.2
Thai word break rules244
Title casing break rules13
Word break rules29
POSIX style word break rules29
Japanese word break rules30
+
Click on a table header to sort the list
Data Item Description Kilobytes
Afrikaans14
Afrikaans (Namibia)0.1
Afrikaans (South Africa)0.1
Arabic14
Arabic (United Arab Emirates)0.1
Arabic (Bahrain)0.1
Arabic (Algeria)0.1
Arabic (Egypt)0.1
Arabic (Iraq)0.1
Arabic (Jordan)0.1
Arabic (Kuwait)0.1
Arabic (Lebanon)0.1
Arabic (Libya)0.1
Arabic (Morocco)0.1
Arabic (Oman)0.1
Arabic (Qatar)0.1
Arabic (Saudi Arabia)0.1
Arabic (Sudan)0.1
Arabic (Syria)0.1
Arabic (Tunisia)0.1
Arabic (Yemen)0.1
Assamese14
Assamese (India)0.1
Azerbaijani17
Azerbaijani (Latin)0.1
Azerbaijani (Latin, Azerbaijan)0.1
Belarusian0.1
Belarusian (Belarus)0.1
Bulgarian0.1
Bulgarian (Bulgaria)0.1
Bengali0.1
Bengali (India)0.1
Catalan14
Catalan (Spain)0.1
Czech15
Czech (Czech Republic)0.1
Welsh0.1
Danish17
Danish (Denmark)0.1
German30
German0.1
German (Austria)0.1
German (Belgium)0.1
German (Switzerland)0.1
German (Germany)0.1
German (Luxembourg)0.1
German (PHONEBOOK)0.1
Greek0.5
Greek (Greece)0.1
English14
English (Australia)0.1
English (Belgium)0.5
English (Botswana)0.1
English (Canada)0.1
English (United Kingdom)0.1
English (Hong Kong SAR China)0.1
English (Ireland)0.1
English (India)0.1
English (Malta)0.1
English (New Zealand)0.1
English (Philippines)0.1
English (Singapore)0.1
English (United States)0.1
English (United States, Computer)0.1
English (U.S. Virgin Islands)0.1
English (South Africa)0.1
English (Zimbabwe)0.1
Esperanto16
Spanish29
Spanish0.1
Spanish (Argentina)0.1
Spanish (Bolivia)0.1
Spanish (Chile)0.1
Spanish (Colombia)0.1
Spanish (Costa Rica)0.1
Spanish (Dominican Republic)0.1
Spanish (Ecuador)0.1
Spanish (Spain)0.1
Spanish (Guatemala)0.1
Spanish (Honduras)0.1
Spanish (Mexico)0.1
Spanish (Nicaragua)0.1
Spanish (Panama)0.1
Spanish (Peru)0.1
Spanish (Puerto Rico)0.1
Spanish (Paraguay)0.1
Spanish (El Salvador)0.1
Spanish (United States)0.1
Spanish (Uruguay)0.1
Spanish (Venezuela)0.1
Spanish (TRADITIONAL)0.1
Estonian17
Estonian (Estonia)0.1
Persian15
Persian (Afghanistan)0.1
Persian (Iran)0.1
Finnish34
Finnish (Finland)0.1
Faroese17
Faroese (Faroe Islands)0.1
French14
French (Belgium)0.1
French (Canada)0.1
French (Switzerland)0.1
French (France)0.1
French (Luxembourg)0.1
Irish0.1
Irish (Ireland)0.1
Gujarati14
Gujarati (India)0.1
Hawaiian19
Hebrew0.5
Hebrew (Israel)0.1
Hindi29
Hindi0.1
Hindi (India)0.1
Hindi (DIRECT)0.1
Croatian16
Croatian (Croatia)0.1
Hungarian18
Hungarian (Hungary)0.1
Indonesian0.1
Indonesian (Indonesia)0.1
Indonesian0.1
Indonesian (Indonesia)0.1
 222
Icelandic18
Icelandic (Iceland)0.1
Italian14
Italian (Switzerland)0.1
Italian (Italy)0.1
Hebrew0.1
Hebrew (Israel)0.1
Japanese814
Japanese (Japan)0.1
Kazakh15
Kazakh (Kazakhstan)0.1
Kalaallisut17
Kalaallisut (Greenland)0.1
Khmer17
Kannada0.5
Kannada (India)0.1
Korean976
Korean (South Korea)0.1
Konkani0.1
Lithuanian17
Lithuanian (Lithuania)0.1
Latvian16
Latvian (Latvia)0.1
Macedonian0.1
Macedonian (Macedonia)0.1
Malayalam15
Marathi0.5
Marathi (India)0.1
Malay0.1
Malay (Brunei)0.1
Malay (Malaysia)0.1
Maltese15
Maltese (Malta)0.1
Norwegian Bokmål17
Norwegian Bokmål (Norway)0.1
Dutch0.1
Dutch (Belgium)0.1
Dutch (Netherlands)0.1
Norwegian Nynorsk17
Norwegian Nynorsk (Norway)0.1
Norwegian0.1
Norwegian (Norway)0.1
Oromo14
Oromo (Ethiopia)0.1
Oromo (Kenya)0.1
Oriya14
Punjabi0.5
Punjabi (Arabic)0.1
Punjabi (Arabic, Pakistan)0.1
Punjabi (Gurmukhi)0.1
Punjabi (Gurmukhi, India)0.1
Punjabi (India)0.1
Polish16
Polish (Poland)0.1
Pashto15
Pashto (Afghanistan)0.1
Portuguese0.1
Portuguese (Brazil)0.1
Portuguese (Portugal)0.1
4
Romanian16
Romanian (Romania)0.1
Root109
Russian15
Russian (Russia)0.1
Russian (Ukraine)0.1
Serbo-Croatian0.1
Serbo-Croatian (Bosnia and Herzegovina)0.1
Serbo-Croatian (Serbia)0.1
Sinhala28
Sinhala (Sri Lanka)0.1
Slovak17
Slovak (Slovakia)0.1
Slovenian15
Slovenian (Slovenia)0.1
Albanian15
Albanian (Albania)0.1
Serbian0.1
Serbian (Bosnia and Herzegovina)0.1
Serbian (Cyrillic)0.1
Serbian (Cyrillic, Bosnia and Herzegovina)0.1
Serbian (Cyrillic, Montenegro)0.1
Serbian (Cyrillic, Serbia)0.1
Serbian (Latin)0.1
Serbian (Latin, Bosnia and Herzegovina)0.1
Serbian (Latin, Montenegro)0.1
Serbian (Latin, Serbia)0.1
Serbian (Montenegro)0.1
Serbian (Serbia)0.1
Swedish35
Swedish (Finland)0.1
Swedish (Sweden)0.1
Tamil0.5
Tamil (India)0.1
Telugu14
Telugu (India)0.1
Thai14
Thai (Thailand)0.1
Turkish16
Turkish (Turkey)0.1
 139
Ukrainian14
Ukrainian (Ukraine)0.1
Urdu15
Urdu (India)0.1
Urdu (Pakistan)0.1
Vietnamese22
Vietnamese (Vietnam)0.1
Chinese1340
Chinese0.1
Chinese (China)0.1
Chinese (Hong Kong SAR China)0.1
Chinese (Simplified Han)0.1
Chinese (Simplified Han, China)0.1
Chinese (Simplified Han, Singapore)0.1
Chinese (Traditional Han)0.2
Chinese (Traditional Han, Hong Kong SAR China)0.1
Chinese (Traditional Han, Macau SAR China)0.1
Chinese (Traditional Han, Taiwan)0.1
Chinese (Macau SAR China)0.1
Chinese (Singapore)0.1
Chinese (Taiwan)0.1
Chinese (Taiwan, STROKE)0.1
Chinese (Pinyin Romanization)0.1
+
Click on a table header to sort the list
Data Item Description Kilobytes
Afrikaans4
Amharic2
Arabic15
Azerbaijani4
Belarusian10
Bulgarian5
Catalan15
Czech9
Welsh7
Danish3
German8
Greek17
English9
Esperanto2
Spanish13
Estonian2
Persian2
Persian (Afghanistan)2
Finnish2
Faroese7
French7
French (Belgium)7
French (Switzerland)7
Irish12
Hebrew22
Hindi4
Croatian10
Hungarian2
Armenian2
Indonesian2
Icelandic7
Italian11
Japanese2
Georgian3
Kalaallisut4
Korean2
Lithuanian5
Latvian4
Macedonian5
Malay2
Maltese30
Norwegian Bokmål3
Dutch5
Norwegian Nynorsk3
Polish9
Portuguese11
Portuguese (Portugal)9
1
Romanian6
Root17
Russian10
Slovak8
Slovenian9
Albanian4
Serbian8
Serbian (Latin)10
Swedish12
Tamil2
Thai2
Turkish4
Ukrainian10
Vietnamese1
Chinese7
Chinese (Traditional Han)7
+
Click on a table header to sort the list
Data Item Description Kilobytes
Greek0.2
English0.3
Root472
+
Click on a table header to sort the list
Data Item Description Kilobytes
Afrikaans9
Afrikaans (Namibia)0.2
Afrikaans (South Africa)0.1
Amharic8
Amharic (Ethiopia)0.2
Arabic76
Arabic (United Arab Emirates)0.2
Arabic (Bahrain)0.3
Arabic (Algeria)0.7
Arabic (Egypt)0.2
Arabic (Iraq)0.3
Arabic (Jordan)1
Arabic (Kuwait)0.3
Arabic (Lebanon)1
Arabic (Libya)0.3
Arabic (Morocco)0.7
Arabic (Oman)0.3
Arabic (Qatar)0.6
Arabic (Saudi Arabia)0.6
Arabic (Sudan)0.3
Arabic (Syria)1
Arabic (Tunisia)1
Arabic (Yemen)0.6
Assamese4
Assamese (India)0.2
Azerbaijani80
Azerbaijani (Azerbaijan)0.1
Azerbaijani (Cyrillic)2
Azerbaijani (Cyrillic, Azerbaijan)0.2
Azerbaijani (Latin)0.1
Azerbaijani (Latin, Azerbaijan)0.2
Belarusian27
Belarusian (Belarus)0.1
Bulgarian101
Bulgarian (Bulgaria)0.1
Bengali74
Bengali (Bangladesh)0.1
Bengali (India)1
Tibetan4
Tibetan (China)0.2
Tibetan (India)0.2
Catalan101
Catalan (Spain)0.1
Czech46
Czech (Czech Republic)0.1
Welsh10
Welsh (United Kingdom)0.1
Danish88
Danish (Denmark)0.1
German90
German (Austria)1
German (Belgium)0.8
German (Switzerland)1
German (Germany)0.1
German (Liechtenstein)0.4
German (Luxembourg)0.3
Greek117
Greek (Cyprus)0.2
Greek (Greece)0.1
English120
English (Australia)3
English (Belgium)1
English (Botswana)0.8
English (Belize)0.8
English (Canada)3
English (United Kingdom)3
English (Hong Kong SAR China)6
English (Ireland)3
English (India)0.7
English (Jamaica)0.4
English (Marshall Islands)0.2
English (Malta)0.8
English (Namibia)0.4
English (New Zealand)3
English (Philippines)0.3
English (Pakistan)0.7
English (Zimbabwe)0.1
English (Singapore)0.7
English (Trinidad and Tobago)0.5
English (United States)0.2
English (United States, Computer)0.4
English (U.S. Virgin Islands)0.2
English (South Africa)1
English (Zimbabwe)0.8
Esperanto14
Spanish108
Spanish (Argentina)1
Spanish (Bolivia)0.1
Spanish (Chile)1
Spanish (Colombia)0.6
Spanish (Costa Rica)0.1
Spanish (Dominican Republic)0.3
Spanish (Ecuador)0.8
Spanish (Spain)0.1
Spanish (Guatemala)0.8
Spanish (Honduras)0.8
Spanish (Mexico)0.4
Spanish (Nicaragua)0.3
Spanish (Panama)0.7
Spanish (Peru)0.7
Spanish (Puerto Rico)0.8
Spanish (Paraguay)0.3
Spanish (El Salvador)0.3
Spanish (United States)1
Spanish (Uruguay)0.3
Spanish (Venezuela)0.3
Estonian46
Estonian (Estonia)0.1
Basque21
Basque (Spain)0.1
Persian72
Persian (Afghanistan)7
Persian (Iran)0.2
Finnish107
Finnish (Finland)0.1
Faroese7
Faroese (Faroe Islands)0.2
French135
French (Belgium)0.7
French (Canada)3
French (Switzerland)0.8
French (France)0.1
French (Luxembourg)0.4
French (Monaco)0.1
French (Senegal)0.1
Irish36
Irish (Ireland)0.2
Galician36
Galician (Spain)0.1
Swiss German89
Swiss German (Switzerland)0.1
Gujarati34
Gujarati (India)0.2
Manx2
Manx (United Kingdom)0.1
Hausa3
Hausa (Ghana)0.1
Hausa (Latin)0.1
Hausa (Latin, Ghana)0.1
Hausa (Latin, Niger)0.1
Hausa (Latin, Nigeria)0.1
Hausa (Niger)0.1
Hausa (Nigeria)0.1
Hawaiian2
Hawaiian (United States)0.2
Hebrew55
Hebrew (Israel)0.2
Hindi72
Hindi (India)0.2
Croatian133
Croatian (Croatia)0.1
Hungarian92
Hungarian (Hungary)0.1
Armenian8
Armenian (Armenia)0.1
Armenian (Armenia, Revised Orthography)0.8
Indonesian18
Indonesian (Indonesia)0.1
Sichuan Yi3
Sichuan Yi (China)0.2
Indonesian0.1
Indonesian (Indonesia)0.1
Icelandic49
Icelandic (Iceland)0.2
Italian67
Italian (Switzerland)0.8
Italian (Italy)0.1
Hebrew0.1
Hebrew (Israel)0.1
Japanese73
Japanese (Japan)0.2
Japanese (Japan, TRADITIONAL)0.1
Georgian45
Georgian (Georgia)0.2
Kazakh2
Kazakh (Cyrillic)0.1
Kazakh (Cyrillic, Kazakhstan)0.1
Kazakh (Kazakhstan)0.1
Kalaallisut2
Kalaallisut (Greenland)0.2
Khmer12
Khmer (Cambodia)0.1
Kannada36
Kannada (India)0.2
Korean83
Korean (South Korea)0.2
Konkani6
Konkani (India)0.2
Cornish2
Cornish (United Kingdom)0.1
Lithuanian90
Lithuanian (Lithuania)0.1
Latvian65
Latvian (Latvia)0.1
Macedonian59
Macedonian (Macedonia)0.1
Malayalam131
Malayalam (India)0.2
Marathi35
Marathi (India)0.2
Malay11
Malay (Brunei)0.9
Malay (Malaysia)0.1
Maltese31
Maltese (Malta)0.2
Norwegian Bokmål89
Norwegian Bokmål (Norway)0.1
Nepali30
Nepali (India)0.2
Nepali (Nepal)0.1
Dutch93
Dutch (Belgium)0.9
Dutch (Netherlands)0.1
Norwegian Nynorsk88
Norwegian Nynorsk (Norway)0.1
Norwegian0.1
Norwegian (Norway)0.1
Norwegian (Norway, NY)0.1
Oromo1
Oromo (Ethiopia)0.2
Oromo (Kenya)0.2
Oriya35
Oriya (India)0.2
Punjabi4
Punjabi (Arabic)2
Punjabi (Arabic, Pakistan)0.2
Punjabi (Gurmukhi)0.1
Punjabi (Gurmukhi, India)0.2
Punjabi (India)0.1
Punjabi (Pakistan)0.1
Polish81
Polish (Poland)0.1
Pashto5
Pashto (Afghanistan)0.2
Portuguese126
Portuguese (Brazil)0.1
Portuguese (Portugal)39
6
Romanian75
Romanian (Moldova)0.1
Romanian (Romania)0.1
Root60
Russian101
Russian (Russia)0.1
Russian (Ukraine)0.9
Serbo-Croatian0.1
Serbo-Croatian (Bosnia and Herzegovina)0.1
Serbo-Croatian (Serbia and Montenegro)0.1
Serbo-Croatian (Serbia)0.1
Sinhala3
Sinhala (Sri Lanka)0.1
Slovak50
Slovak (Slovakia)0.1
Slovenian69
Slovenian (Slovenia)0.1
Somali6
Somali (Djibouti)0.2
Somali (Ethiopia)0.2
Somali (Kenya)0.2
Somali (Somalia)0.2
Albanian11
Albanian (Albania)0.1
Serbian176
Serbian (Bosnia and Herzegovina)0.1
Serbian (Serbia and Montenegro)0.1
Serbian (Cyrillic)0.1
Serbian (Cyrillic, Bosnia and Herzegovina)1
Serbian (Cyrillic, Serbia and Montenegro)0.1
Serbian (Cyrillic, Montenegro)0.1
Serbian (Cyrillic, Serbia)0.1
Serbian (Cyrillic, Serbia)0.1
Serbian (Latin)172
Serbian (Latin, Bosnia and Herzegovina)0.1
Serbian (Latin, Serbia and Montenegro)0.1
Serbian (Latin, Montenegro)0.6
Serbian (Latin, Serbia)0.1
Serbian (Latin, Serbia)0.1
Serbian (Montenegro)0.1
Serbian (Serbia)0.1
Serbian (Serbia)0.1
Swedish109
Swedish (Finland)0.4
Swedish (Sweden)0.1
Swahili5
Swahili (Kenya)0.3
Swahili (Tanzania)0.1
Tamil37
Tamil (India)0.2
Telugu35
Telugu (India)0.2
Thai99
Thai (Thailand)0.2
Thai (Thailand, TRADITIONAL)0.1
Tigrinya1
Tigrinya (Eritrea)1
Tigrinya (Ethiopia)0.2
Turkish87
Turkish (Turkey)0.1
Ukrainian98
Ukrainian (Ukraine)0.1
Urdu8
Urdu (India)0.3
Urdu (Pakistan)0.2
Uzbek0.5
Uzbek (Afghanistan)0.1
Uzbek (Arabic)0.9
Uzbek (Arabic, Afghanistan)0.2
Uzbek (Cyrillic)0.1
Uzbek (Cyrillic, Uzbekistan)0.2
Uzbek (Latin)3
Uzbek (Latin, Uzbekistan)0.2
Uzbek (Uzbekistan)0.1
Vietnamese27
Vietnamese (Vietnam)0.1
Chinese85
Chinese (China)0.1
Chinese (Hong Kong SAR China)0.1
Chinese (Simplified Han)0.1
Chinese (Simplified Han, China)0.2
Chinese (Simplified Han, Hong Kong SAR China)0.2
Chinese (Simplified Han, Macau SAR China)0.2
Chinese (Simplified Han, Singapore)0.5
Chinese (Traditional Han)64
Chinese (Traditional Han, Hong Kong SAR China)2
Chinese (Traditional Han, Macau SAR China)0.5
Chinese (Traditional Han, Taiwan)0.2
Chinese (Macau SAR China)0.1
Chinese (Singapore)0.1
Chinese (Taiwan)0.1
Zulu3
Zulu (South Africa)0.1
+
Click on a table header to sort the list
Data Item Description Kilobytes
CLDR locale identifier mapping information18
CLDR timezone information89
CLDR numbering systems information3
CLDR plural grammer rule information5
Unicode property names24
RFC 3491 string prep profile20
RFC 3530 case sensitive string prep profile13
RFC 3530 case insensitive string prep profile20
RFC 3530 mixed string prep profile13
RFC 3722 string prep profile20
RFC 3920 node identifiers string prep profile20
RFC 3920 resource identifiers string prep profile13
RFC 4011 string prep profile13
RFC 4013 string prep profile13
RFC 4505 string prep profile13
RFC 4518 case sensitive string prep profile14
RFC 4518 case insensitive string prep profile20
Supplemental CLDR information (ISO-4217, timezones, calendars and more)202
Unicode character names177
Olson timezone information154
+ Base Data (119 KB)
Click on a table header to sort the list
Data Item Description Kilobytes
cnvalias.icuCharset alias table55
confusables.cfu 64
ubidi.icuUnicode bidirectional properties for ICU4J17
ucase.icuUnicode casing properties for ICU4J19
unorm.icuUnicode normalization properties for ICU4J115
uprops.icuUnicode general properties for ICU4J79

Please specify which edition of ICU will use this data.



 

The estimated uncompressed size of this data library is 16685 KB

The ICU 4.0 Data generate a data library that can be used with the 4.0 series of ICU.

The ICU 3.8 Data generate a data library that can be used with the 3.8 series of ICU.

+ Advanced Options
Item Filtering  


Groups