Using the cprnc tool to compare two history files

cprnc is a tool shared by both CAM and CLM to compare two NetCDF history files. It differences every field that has a time-axis that is also shared on both files, and reports a summary of the difference. The summary includes the three largest differences, as well as the root mean square (RMS) difference. It also gives some summary information on the field as well. You have to enter at least one file, and up to two files. With one file it gives you summary information on the file, and with two it gives you information on the differences between the two. At the end it will give you a summary of the fields compared and how many fields were different and how many were identical.

Options:

-m = do NOT align time-stamps before comparing
-v = verbose output
-ipr
-jpr
-kpr

See the cprnc README file for more details which is repeated here:

cprnc README
------------

cprnc is a generic tool for analyzing a netcdf file or comparing
two netcdf files.


Quick Start Guide:
------------------

> setenv INC_NETCDF $local_netcdf_include_path
> setenv LIB_NETCDF $local_netcdf_lib_path
> gmake 
> ./cprnc file1          # for analyze mode
> ./cprnc file1 file2    # for compare mode

 Usage: cprnc  [-v] [-d dimname:start[:count]] file1 [file2]
 -m: Compare each time sample. Default is false, i.e. match "time" 
     coordinate values before comparing
 -v: Verbose output
 -d dimname:start[:count]
     Print variable values for the specified dimname index subrange.


Users Guide:
------------

cprnc is a Fortran-90 application. It relies on netcdf version 3 or
later and uses the f90 netcdf interfaces.  It requires a netcdf include
file and a netcdf library.  The Makefile assumes those are in 
/usr/local/include and /usr/local/lib respectively.  Users can override 
these defaults by setting environment variables INC_NETCDF and LIB_NETCDF 
to the local values of the netcdf paths.  The Makefile has entries for 
IRIX64, AIX, Darwin, Cray X1E, OSF1, SUN, and Linux.  On Linux, entries 
exist for pgf90 and lf95 compilers. The Makefile is quite extensible for 
new architectures.  gmake should be used to build cprnc.  The Makefile
relies on a hard-wired Depends file.

cprnc generates an ascii output file via standard out.  It initially
summarizes some characteristics of the input file[s].  A compare file is
generally 132 characters wide and an analyze file is less than 80 
characters wide.

In analyze mode, the output for a field looks like

                      (   lon,   lat,  time, -----)
              259200  (   587,   134,     1) (   269,    59,     1)
 FX1           96369   8.273160400390625E+02  0.000000000000000E+00
            avg abs field values:   9.052845920820910E+01

and a guide to this information is printed at the top of the file

                      (  dim1,  dim2,  dim3,  dim4)
              ARRSIZ1 ( indx1, indx2, indx3) file 1
 FIELD        NVALID           MAX                   MIN


The first 10 characters of the field name are identified in the first
  dozen columns of the third line.  
The first line summarizes the names of the dimensions of the field
The second line summarizes the indices of the maximum and minimum value 
  of the field for the first three dimensions.  If the fourth dimension 
  exists, it's always assumed to be time.  Time is handled separately.
The third line summarizes the number of valid values in the array
  and the maximum and minimum value over those valid values.  Invalid
  values are values that are identified to be "fill" value.
The last line summarizes some overall statistics including the average 
  absolute value of the valid values of the field.

In comparison mode, the output (132 chars wide) for a field looks like

               96369  (   lon,   lat,  time)
              259200  (   422,   198,     1) (   203,   186,     1)           (    47,   169,      \ 
   1)         (   224,   171,     1)
 FIRA          96369   1.466549530029297E+02 -3.922052764892578E+01   1.4E+02  \ 
   -3.037954139709473E+01 1.0E+00 -3.979958057403564E+00
               96369   1.321966247558594E+02 -1.603044700622559E+01             \ 
   1.084177169799805E+02          3.982142448425293E+00
              259200  (   156,    31,     1) (   573,   178,     1)           (
          avg abs field values:    6.778244097051392E+01    rms diff: 1.4E+01   avg rel diff(npos):  \ 
    4.6E-02
                                   5.960437961084186E+01                        avg decimal  \ 
   digits(ndif):  1.2 worst:  0.0

and a guide to this information is printed at the top of the file

              NDIFFS  (  dim1,  dim2,  dim3,  dim4, ... )
              ARRSIZ1 ( indx1, indx2, indx3, ... ) file 1
 FIELD        NVALID1          MAX1                  MIN1            DIFFMAX  VALUES                 \ 
   RDIFMAX  VALUES
              NVALID2          MAX2                  MIN2
              ARRSIZ2 ( indx1, indx2, indx3, ...) file 2

The information content is identical to the information in analyze
mode with the following additions.  Two additional lines are added
in the main body.  Lines 4 and 5 are identical to line 3 and 2
respectively but are associated with file 2 instead of file 1.
In addition, the right hand side of lines 2, 3, and 4 contain
information about the maximum difference, the location and values
of the maximum difference, the relative difference and the location
and values of the maximum relative difference.  The last two line
summarize some overall statistics including average absolute values
of the field on the two files, rms difference, average relative 
difference, average number of digits that match, and the worst
case for the number of digits that match.

At the end of the output file, a summary is presented that looks like

SUMMARY of cprnc:
  A total number of    119 fields were compared
          of which      83 has non-zero differences
          including      0 that had inconsistent size/shape
  A total number of      0 fields in file 1 were analyzed (non-compare mode)
  A total number of     10 fields in file 1 could not be analyzed at all
  A total number of      0 fields in file 2 were analyzed (non-compare mode)

This summarizes the number of fields that were compared, that differed
and the number of fields that could not be compared but were analyzed.


Developers Guide:
-----------------

The tool works as follows.

Fields can be analyzed if they are int, float or double and 
have between 0 and n dimensions 

In general, fields that appear on both files are 
compared.  If they are sizes, no difference
statistics are computed and only a summary of the fields on
the files are presented.  If  fields only appear
on one file, those fields are analyzed.

The unlimited dimension is treated uniquely.  In general, for files
that have a dimension named "time", the time axes are compared
and matching time values on the two files are compared one
timestep at a time.  Time values that don't match are skipped.
To override the matching behaviour, use cprnc -m.  In this mode, 
timestamps are compared in indexical space.  In analyze mode,
the fields are analyzed one timestamp at a time.  In general,
if there is a "time" axis, it will be the outer-most loop in
the output analysis.  In compare mode, fields with a time axis
and a timestamp that are not common between the two files are
ignored.


Note: To compare files with OUT a time axis you can use the cprnc.ncl NCL script in models/lnd/clm/tools/ncl_scripts. It won't give you the details on the differences but will report if the files are identical or different.