diff options
Diffstat (limited to 'man/deb-src-symbols.pod')
-rw-r--r-- | man/deb-src-symbols.pod | 387 |
1 files changed, 387 insertions, 0 deletions
diff --git a/man/deb-src-symbols.pod b/man/deb-src-symbols.pod new file mode 100644 index 0000000..f44d21f --- /dev/null +++ b/man/deb-src-symbols.pod @@ -0,0 +1,387 @@ +# dpkg manual page - deb-src-symbols(5) +# +# Copyright © 2007-2011 Raphaël Hertzog <hertzog@debian.org> +# Copyright © 2009-2010 Modestas Vainius <modestas@vainius.eu> +# Copyright © 2012-2015 Guillem Jover <guillem@debian.org> +# +# This is free software; you can redistribute it and/or modify +# it under the terms of the GNU General Public License as published by +# the Free Software Foundation; either version 2 of the License, or +# (at your option) any later version. +# +# This is distributed in the hope that it will be useful, +# but WITHOUT ANY WARRANTY; without even the implied warranty of +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +# GNU General Public License for more details. +# +# You should have received a copy of the GNU General Public License +# along with this program. If not, see <https://www.gnu.org/licenses/>. + +=encoding utf8 + +=head1 NAME + +deb-src-symbols - Debian's extended shared library template file + +=head1 SYNOPSIS + +B<debian/>I<package>B<.symbols.>I<arch>, +B<debian/symbols.>I<arch>, +B<debian/>I<package>B<.symbols>, +B<debian/symbols> + +=head1 DESCRIPTION + +The symbol file templates are shipped in Debian source packages, and its +format is a superset of the symbols files shipped in binary packages, +see L<deb-symbols(5)>. + +=head2 Comments + +Comments are supported in template symbol files. Any line with ‘#’ as +the first character is a comment except if it starts with ‘#include’ +(see section B<Using includes>). +Lines starting with ‘#MISSING:’ are special comments documenting +symbols that have disappeared. + +=head2 Using #PACKAGE# substitution + +In some rare cases, the name of the library varies between architectures. +To avoid hardcoding the name of the package in the symbols file, you can +use the marker I<#PACKAGE#>. It will be replaced by the real package +name during installation of the symbols files. Contrary to the +I<#MINVER#> marker, I<#PACKAGE#> will never appear in a symbols file +inside a binary package. + +=head2 Using symbol tags + +Symbol tagging is useful for marking symbols that are special in some way. Any +symbol can have an arbitrary number of tags associated with it. While all tags are +parsed and stored, only some of them are understood by +B<dpkg-gensymbols> and trigger special handling of the symbols. See +subsection B<Standard symbol tags> for reference of these tags. + +Tag specification comes right before the symbol name (no whitespace is allowed +in between). It always starts with an opening bracket B<(>, ends with a +closing bracket B<)> and must contain at least one tag. Multiple tags are +separated by the B<|> character. Each tag can optionally have a value which +is separated form the tag name by the B<=> character. Tag names and values +can be arbitrary strings except they cannot contain any of the special B<)> +B<|> B<=> characters. Symbol names following a tag specification can +optionally be quoted with either B<'> or B<"> characters to allow +whitespaces in them. However, if there are no tags specified for the symbol, +quotes are treated as part of the symbol name which continues up until the +first space. + + (tag1=i am marked|tag name with space)"tagged quoted symbol"@Base 1.0 + (optional)tagged_unquoted_symbol@Base 1.0 1 + untagged_symbol@Base 1.0 + +The first symbol in the example is named I<tagged quoted symbol> and has two +tags: I<tag1> with value I<i am marked> and I<tag name with space> +that has no value. The second symbol named I<tagged_unquoted_symbol> is +only tagged with the tag named I<optional>. The last symbol is an +example of the normal untagged symbol. + +Since symbol tags are an extension of the B<deb-symbols>(5) format, they +can only be part of the symbols files used in source packages (those files +should then be seen as templates used to build the symbols files that are +embedded in binary packages). When +B<dpkg-gensymbols> is called without the B<-t> option, it will +output symbols files compatible to the B<deb-symbols>(5) format: +it fully processes symbols according to the requirements of their standard tags +and strips all tags from the output. On the contrary, in template mode +(B<-t>) all symbols and their tags (both standard and unknown ones) +are kept in the output and are written in their original form as they were +loaded. + +=head2 Standard symbol tags + +=over + +=item B<optional> + +A symbol marked as optional can disappear from the library at any time and that +will never cause B<dpkg-gensymbols> to fail. However, disappeared optional +symbols will continuously appear as MISSING in the diff in each new package +revision. This behaviour serves as a reminder for the maintainer that such a +symbol needs to be removed from the symbol file or readded to the library. When +the optional symbol, which was previously declared as MISSING, suddenly +reappears in the next revision, it will be upgraded back to the “existing” +status with its minimum version unchanged. + +This tag is useful for symbols which are private where their disappearance do +not cause ABI breakage. For example, most of C++ template instantiations fall +into this category. Like any other tag, this one may also have an arbitrary +value: it could be used to indicate why the symbol is considered optional. + +=item B<arch=>I<architecture-list> + +=item B<arch-bits=>I<architecture-bits> + +=item B<arch-endian=>I<architecture-endianness> + +These tags allow one to restrict the set of architectures where the symbol +is supposed to exist. The B<arch-bits> and B<arch-endian> tags +are supported since dpkg 1.18.0. When the symbols list is updated with +the symbols +discovered in the library, all arch-specific symbols which do not concern +the current host architecture are treated as if they did not exist. If an +arch-specific symbol matching the current host architecture does not exist +in the library, normal procedures for missing symbols apply and it may +cause B<dpkg-gensymbols> to fail. On the other hand, if the +arch-specific symbol is found when it was not supposed to exist (because +the current host architecture is not listed in the tag or does not match +the endianness and bits), it is made arch neutral (i.e. the arch, arch-bits +and arch-endian tags are dropped and the symbol will appear in the diff due +to this change), but it is not considered as new. + +When operating in the default non-template mode, among arch-specific symbols +only those that match the current host architecture are written to the +symbols file. On the contrary, all arch-specific symbols (including those +from foreign arches) are always written to the symbol file when operating +in template mode. + +The format of I<architecture-list> is the same as the one used in the +B<Build-Depends> field of I<debian/control> (except the enclosing +square brackets []). For example, the first symbol from the list below +will be considered only on alpha, any-amd64 and ia64 architectures, +the second only on linux architectures, while the third one anywhere +except on armel. + + (arch=alpha any-amd64 ia64)64bit_specific_symbol@Base 1.0 + (arch=linux-any)linux_specific_symbol@Base 1.0 + (arch=!armel)symbol_armel_does_not_have@Base 1.0 + +The I<architecture-bits> is either B<32> or B<64>. + + (arch-bits=32)32bit_specific_symbol@Base 1.0 + (arch-bits=64)64bit_specific_symbol@Base 1.0 + +The I<architecture-endianness> is either B<little> or B<big>. + + (arch-endian=little)little_endian_specific_symbol@Base 1.0 + (arch-endian=big)big_endian_specific_symbol@Base 1.0 + +Multiple restrictions can be chained. + + (arch-bits=32|arch-endian=little)32bit_le_symbol@Base 1.0 + +=item B<allow-internal> + +dpkg-gensymbols has a list of internal symbols that should not +appear in symbols files as they are usually only side-effects of +implementation details of the toolchain (since dpkg 1.20.1). +If for some reason, you really want one of those symbols to be included in +the symbols file, you should tag the symbol with B<allow-internal>. +It can be necessary for some low level toolchain libraries like “libgcc”. + +=item B<ignore-blacklist> + +A deprecated alias for B<allow-internal> (since dpkg 1.20.1, +supported since dpkg 1.15.3). + +=item B<c++> + +Denotes I<c++> symbol pattern. See B<Using symbol patterns> subsection +below. + +=item B<symver> + +Denotes I<symver> (symbol version) symbol pattern. See B<Using symbol +patterns> subsection below. + +=item B<regex> + +Denotes I<regex> symbol pattern. See B<Using symbol patterns> subsection +below. + +=back + +=head2 Using symbol patterns + +Unlike a standard symbol specification, a pattern may cover multiple real +symbols from the library. B<dpkg-gensymbols> will attempt to match each +pattern against each real symbol that does I<not> have a specific symbol +counterpart defined in the symbol file. Whenever the first matching pattern is +found, all its tags and properties will be used as a basis specification of the +symbol. If none of the patterns matches, the symbol will be considered as new. + +A pattern is considered lost if it does not match any symbol in the library. By +default this will trigger a B<dpkg-gensymbols> failure under B<-c1> or +higher level. However, if the failure is undesired, the pattern may be marked +with the I<optional> tag. Then if the pattern does not match anything, it +will only appear in the diff as MISSING. Moreover, like any symbol, the pattern +may be limited to the specific architectures with the I<arch> tag. Please +refer to B<Standard symbol tags> subsection above for more information. + +Patterns are an extension of the B<deb-symbols>(5) format hence they are +only valid in symbol file templates. Pattern specification syntax is not any +different from the one of a specific symbol. However, symbol name part of the +specification serves as an expression to be matched against I<name@version> +of the real symbol. In order to distinguish among different pattern types, a +pattern will typically be tagged with a special tag. + +At the moment, B<dpkg-gensymbols> supports three basic pattern types: + +=over + +=item B<c++> + +This pattern is denoted by the I<c++> tag. It matches only C++ symbols by +their demangled symbol name (as emitted by B<c++filt>(1) utility). This +pattern is very handy for matching symbols which mangled names might vary +across different architectures while their demangled names remain the same. One +group of such symbols is I<non-virtual thunks> which have architecture +specific offsets embedded in their mangled names. A common instance of this +case is a virtual destructor which under diamond inheritance needs a +non-virtual thunk symbol. For example, even if _ZThn8_N3NSB6ClassDD1Ev@Base on +32bit architectures will probably be _ZThn16_N3NSB6ClassDD1Ev@Base on 64bit +ones, it can be matched with a single I<c++> pattern: + + libdummy.so.1 libdummy1 #MINVER# + [...] + (c++)"non-virtual thunk to NSB::ClassD::~ClassD()@Base" 1.0 + [...] + +The demangled name above can be obtained by executing the following command: + + $ echo '_ZThn8_N3NSB6ClassDD1Ev@Base' | c++filt + +Please note that while mangled name is unique in the library by definition, +this is not necessarily true for demangled names. A couple of distinct real +symbols may have the same demangled name. For example, that's the case with +non-virtual thunk symbols in complex inheritance configurations or with most +constructors and destructors (since g++ typically generates two real symbols +for them). However, as these collisions happen on the ABI level, they should +not degrade quality of the symbol file. + +=item B<symver> + +This pattern is denoted by the I<symver> tag. Well maintained libraries have +versioned symbols where each version corresponds to the upstream version where +the symbol got added. If that's the case, you can use a I<symver> pattern to +match any symbol associated to the specific version. For example: + + libc.so.6 libc6 #MINVER# + (symver)GLIBC_2.0 2.0 + [...] + (symver)GLIBC_2.7 2.7 + access@GLIBC_2.0 2.2 + +All symbols associated with versions GLIBC_2.0 and GLIBC_2.7 will lead to +minimal version of 2.0 and 2.7 respectively with the exception of the symbol +access@GLIBC_2.0. The latter will lead to a minimal dependency on libc6 version +2.2 despite being in the scope of the "(symver)GLIBC_2.0" pattern because +specific symbols take precedence over patterns. + +Please note that while old style wildcard patterns (denoted by "*@version" in +the symbol name field) are still supported, they have been deprecated by new +style syntax "(symver|optional)version". For example, "*@GLIBC_2.0 2.0" should +be written as "(symver|optional)GLIBC_2.0 2.0" if the same behaviour is needed. + +=item B<regex> + +Regular expression patterns are denoted by the I<regex> tag. They match by +the perl regular expression specified in the symbol name field. A regular +expression is matched as it is, therefore do not forget to start it with the +I<^> character or it may match any part of the real symbol +I<name@version> string. For example: + + libdummy.so.1 libdummy1 #MINVER# + (regex)"^mystack_.*@Base$" 1.0 + (regex|optional)"private" 1.0 + +Symbols like "mystack_new@Base", "mystack_push@Base", "mystack_pop@Base" etc. +will be matched by the first pattern while e.g. "ng_mystack_new@Base" won't. +The second pattern will match all symbols having the string "private" in their +names and matches will inherit I<optional> tag from the pattern. + +=back + +Basic patterns listed above can be combined where it makes sense. In that case, +they are processed in the order in which the tags are specified. For example, +both: + + (c++|regex)"^NSA::ClassA::Private::privmethod\d\(int\)@Base" 1.0 + (regex|c++)N3NSA6ClassA7Private11privmethod\dEi@Base 1.0 + +will match symbols "_ZN3NSA6ClassA7Private11privmethod1Ei@Base" and +"_ZN3NSA6ClassA7Private11privmethod2Ei@Base". When matching the first pattern, +the raw symbol is first demangled as C++ symbol, then the demangled name is +matched against the regular expression. On the other hand, when matching the +second pattern, regular expression is matched against the raw symbol name, then +the symbol is tested if it is C++ one by attempting to demangle it. A failure +of any basic pattern will result in the failure of the whole pattern. +Therefore, for example, "__N3NSA6ClassA7Private11privmethod\dEi@Base" will not +match either of the patterns because it is not a valid C++ symbol. + +In general, all patterns are divided into two groups: aliases (basic I<c++> +and I<symver>) and generic patterns (I<regex>, all combinations of +multiple basic patterns). Matching of basic alias-based patterns is fast (O(1)) +while generic patterns are O(N) (N - generic pattern count) for each symbol. +Therefore, it is recommended not to overuse generic patterns. + +When multiple patterns match the same real symbol, aliases (first I<c++>, +then I<symver>) are preferred over generic patterns. Generic patterns are +matched in the order they are found in the symbol file template until the first +success. Please note, however, that manual reordering of template file entries +is not recommended because B<dpkg-gensymbols> generates diffs based on the +alphanumerical order of their names. + +=head2 Using includes + +When the set of exported symbols differ between architectures, it may become +inefficient to use a single symbol file. In those cases, an include directive +may prove to be useful in a couple of ways: + +=over + +=item * + +You can factorize the common part in some external file +and include that file in your I<package>.symbols.I<arch> file by +using an include directive like this: + + #include "I<packages>.symbols.common" + +=item * + +The include directive may also be tagged like any symbol: + + (tag|...|tagN)#include "file-to-include" + +As a result, all symbols included from I<file-to-include> will be considered +to be tagged with I<tag> ... I<tagN> by default. You can use this feature +to create a common I<package>.symbols file which includes architecture +specific symbol files: + + common_symbol1@Base 1.0 + (arch=amd64 ia64 alpha)#include "package.symbols.64bit" + (arch=!amd64 !ia64 !alpha)#include "package.symbols.32bit" + common_symbol2@Base 1.0 + +=back + +The symbols files are read line by line, and include directives are processed +as soon as they are encountered. This means that the content of the included +file can override any content that appeared before the include directive and +that any content after the directive can override anything contained in the +included file. Any symbol (or even another #include directive) in the included +file can specify additional tags or override values of the inherited tags in +its tag specification. However, there is no way for the symbol to remove +any of the inherited tags. + +An included file can repeat the header line containing the SONAME of the +library. In that case, it overrides any header line previously read. +However, in general it's best to avoid duplicating header lines. One way +to do it is the following: + + #include "libsomething1.symbols.common" + arch_specific_symbol@Base 1.0 + +=head1 SEE ALSO + +B<deb-symbols>(5), +B<dpkg-shlibdeps>(1), +B<dpkg-gensymbols>(1). |