 

gennorm(8)





NAME

       gennorm - compile normalization data from the Unicode Character
       Database


SYNOPSIS

       gennorm [ -h, -?, --help ] [ -v, --verbose ] [ -u, --unicode version ]
       [ -c, --copyright ] [ -s, --sourcedir source ]
       [ -d, --destdir destination ] [ suffix ]


DESCRIPTION

       gennorm reads a subset of the Unicode Character Database files and
       compiles their normalization information into a binary form. The
       resulting file, unorm.dat, can then be read directly by ICU, or used
       by pkgdata(8) for incorporation into a larger archive or library.

       The files read by gennorm are described in the FILES section. If suffix
       is passed on the command line, gennorm looks for each of these files
       under a name whose basename has a dash and suffix appended. For
       example, the file UnicodeData.txt would be looked for under the name
       UnicodeData-suffix.txt.
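
       The naming rule above can be sketched in shell. This is only an
       illustration: the helper name suffixed_name and the suffix value 3.2
       are hypothetical and are not part of gennorm.

```shell
# Hypothetical helper: derive the file name gennorm would look for
# when a suffix is given, following the rule described above.
suffixed_name() {
    file=$1
    suffix=$2
    if [ -z "$suffix" ]; then
        # No suffix: the name is used unchanged.
        printf '%s\n' "$file"
        return
    fi
    base=${file%.*}     # part before the last dot, e.g. UnicodeData
    ext=${file##*.}     # part after the last dot, e.g. txt
    printf '%s-%s.%s\n' "$base" "$suffix" "$ext"
}

suffixed_name UnicodeData.txt 3.2
suffixed_name DerivedNormalizationProperties.txt 3.2
```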


OPTIONS

       -h, -?, --help
              Print help about usage and exit.

       -v, --verbose
              Display extra informative messages during execution.

       -u, --unicode version
              Specify  which version of Unicode the Unicode Character Database
              refers to.  Defaults to 3.0.0.

       -c, --copyright
              Include a copyright notice in the binary data.

       -s, --sourcedir source
              Set the source directory to source. The default source
              directory is specified by the environment variable ICU_DATA.

       -d, --destdir destination
              Set the destination directory to destination. The default
              destination directory is specified by the environment variable
              ICU_DATA.
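
       A typical invocation combining these options might look like the
       sketch below. The directories ./unidata/ and ./out/ and the version
       3.2.0 are assumed values chosen for illustration; only the option
       names come from this page. The command runs only if gennorm is found
       on the PATH.

```shell
# Assumed locations; adjust to wherever the UCD files actually live.
src=./unidata/
dst=./out/

# Build the command line from the long options documented above.
gennorm_cmd="gennorm --verbose --unicode 3.2.0 --sourcedir $src --destdir $dst"

if command -v gennorm >/dev/null 2>&1; then
    mkdir -p "$dst"
    # Unquoted on purpose so the string splits into separate arguments
    # (safe here because the assumed paths contain no spaces).
    $gennorm_cmd
else
    echo "gennorm not found; would run: $gennorm_cmd"
fi
```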


ENVIRONMENT

       ICU_DATA  Specifies the directory containing ICU data. Defaults to
                 /usr/lib/icu/3.2/. Some tools in ICU depend on the presence
                 of the trailing slash, so make sure the value ends with one
                 whenever ICU_DATA is set.
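
       One way to guard against a missing trailing slash is sketched below.
       The helper name ensure_trailing_slash is hypothetical; it is a shell
       convenience, not something ICU provides.

```shell
# Hypothetical helper: print the given path with exactly one
# trailing slash appended if it does not already end with one.
ensure_trailing_slash() {
    case $1 in
        */) printf '%s\n' "$1" ;;
        *)  printf '%s/\n' "$1" ;;
    esac
}

# Normalize ICU_DATA only when it is set and non-empty, so that the
# built-in default still applies otherwise.
if [ -n "${ICU_DATA:-}" ]; then
    ICU_DATA=$(ensure_trailing_slash "$ICU_DATA")
    export ICU_DATA
fi
```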


FILES

       The following files are read by gennorm  and  are  looked  for  in  the
       source directory.

       UnicodeData.txt     The main file in the Unicode Character Database.
                           Contains character properties, combining class
                           information, decompositions, names, and so on.

       DerivedNormalizationProperties.txt
                           Derived properties useful in dealing with
                           normalization forms.
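
       Before running gennorm it can be useful to confirm that both input
       files are present in the source directory. The following is a minimal
       sketch; check_inputs is a hypothetical helper, and it checks only the
       unsuffixed file names listed above.

```shell
# Hypothetical helper: return 0 if both input files are readable in
# the given directory, 1 otherwise. Missing files go to stderr.
check_inputs() {
    dir=$1
    missing=0
    for f in UnicodeData.txt DerivedNormalizationProperties.txt; do
        if [ ! -r "$dir/$f" ]; then
            echo "missing: $dir/$f" >&2
            missing=1
        fi
    done
    return "$missing"
}
```

       For example, check_inputs ./unidata && echo "inputs ok" prints the
       confirmation only when both files are found.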


VERSION

       3.2


COPYRIGHT

       Copyright (C) 2000-2002 IBM, Inc. and others.


SEE ALSO

       pkgdata(8)

ICU MANPAGE                     16 January 2001                     GENNORM(8)
