From a175314c3e5827eb193872241446f2f8f5c9d33c Mon Sep 17 00:00:00 2001 From: Daniel Baumann Date: Sat, 4 May 2024 20:07:14 +0200 Subject: Adding upstream version 1:10.5.12. Signed-off-by: Daniel Baumann --- sql/share/charsets/README | 39 +++++++++++++++++++++++++++++++++++++++ 1 file changed, 39 insertions(+) create mode 100644 sql/share/charsets/README (limited to 'sql/share/charsets/README') diff --git a/sql/share/charsets/README b/sql/share/charsets/README new file mode 100644 index 00000000..3c5b3206 --- /dev/null +++ b/sql/share/charsets/README @@ -0,0 +1,39 @@ +This directory holds configuration files that enable MySQL to work with +different character sets. It contains: + +charset_name.xml + Each charset_name.xml file contains information for a simple character + set. The information in the file describes character types, + lower- and upper-case equivalencies and sorting orders for the + character values in the set. + +Index.xml + The Index.xml file lists all of the available charset configurations, + including collations. + + Each collation must have a unique number. The number is stored + IN THE DATABASE TABLE FILES and must not be changed. + + The max-id attribute of the element must be set to + the largest collation number. + +Compiled in or configuration file? + When should a character set be compiled in to MySQL's string library + (libmystrings), and when should it be placed in a charset_name.xml + configuration file? + + If the character set requires the strcoll functions or is a + multi-byte character set, it MUST be compiled in to the string + library. If it does not require these functions, it should be + placed in a charset_name.xml configuration file. + + If the character set uses any one of the strcoll functions, it + must define all of them. Likewise, if the set uses one of the + multi-byte functions, it must define them all. See the manual for + more information on how to add a complex character set to MySQL. + +Syntax of configuration files + The syntax is very simple. Words in array elements are + separated by arbitrary amounts of whitespace. Each word must be a + number in hexadecimal format. The ctype array has 257 words; the + other arrays (lower, upper, etc.) take up 256 words each after that. -- cgit v1.2.3