man exifprobe (Commandes) - probe and report structure and metadata content of camera image files

NAME

exifprobe - probe and report structure and metadata content of camera image files

SYNOPSIS



exifprobe [options] filename(s)

DESCRIPTION

Exifprobe reads image files produced by digital cameras (including several so-called "raw" file formats) and reports the structure of the files and the auxilliary data and metadata contained within them. In addition to TIFF, JPEG, and EXIF, the program understands several formats which may contain "raw" camera data, including MRW, CIFF/CRW, JP2/JPEG2000, RAF, and X3F, as well as most most TIFF-derived "raw" formats, including DNG, ORF, CR2, NEF, K25/KDC/DCR, and PEF. Other TIFF or JPEG-derived formats (which begin with a TIFF header or JPEG marker) should also be readable. The program attempts to display ALL information in the image file, in a manner which mimics the structure of the file as closely as possible.

Where possible, output is not limited to “known” data items. I.e. for tagged file formats, unknown tags will be identified by tag number and type, and values shown without interpretation. Proprietary, untagged or fixed format files do not permit this, but unknown or undefined data can usually be dumped in a hex/ascii format for examination, so long as the file structure is understood. The program will report the contents of any properly structured TIFF IFD or CIFF directory encountered, even when entry tags are not recognized. Recognized TIFF, TIFF/EP, DNG, and CIFF tags are expanded, including EXIF2.2 sections and camera MakerNotes which are found to be in TIFF IFD format. TIFF and/or JPEG sections found in MRW, RAF or JP2 files will be reported, along with the “native” sections of those formats. JP2 boxes will be reported whether known or unknown, and expanded if known. Unknown boxes cannot be expanded, since JP2 is not tagged below the box (or sub-box) level.

An effort is made to identify all sub-images (in any image format) contained in multi-image files; the location, size, and format of such images is reported, and a hex/ascii dump of the image data may be requested. Image data is not processed, but the program will recognize and report all standard JPEG and JPEG2000 markers (including JPEG APPn markers) and will expand APP0 (JFIF/JFXX) and APP1 (EXIF) sections.

Since the program does not attempt to display images or modify the contents of files, it can often recover from and report failures or warn about structural oddities which would confuse a display or image edit program.

There are a wide variety of output formats, selectable in detail by the “lower case” options described in the OPTIONS section below. These options select which items to print, and within narrow confines, how and where to print them. A large number of combinations of options exist, and some of those combinations may not be sensible. In order to avoid the need for constant creativity or invention, three “prefabricated” output formats are provided, selected by the upper case option letters.

The structural (-S) output format (default) provides a description of the image file which mimics as closely as possible the layout and structure of the data in the image file, including file offsets of headers, section and segment markers, fully described TIFF IFDs, CIFF directories, or JP2 boxes, and the location of actual image and/or thumbnail data. The contents of each section are indented relative to the beginning of the section, and ”offset” values for TIFF IFDs and CIFF directories are reported at the offsets where they are found (usually following the entry list for TIFF, or in the HEAP for CIFF). The peculiar “reverse” structures of CIFF and X3F formats are handled sensibly.

The report format (-R) shows the “logical” structure of the image file, but eliminates addresses, offsets, IFD value types and counts, etc., and prints “offset” directory values inline, while otherwise preserving the primary structure of the data.

The list format (-L) omits all structural data. It writes only “tag” values from TIFF, Exif, and MakerNote IFDs, CIFF or other format directories or JP2 boxes, including section and image offsets and sizes. Identifiable values from non-tagged formats are written in a similar manner. This format may be useful for extracting information for photo galleries.

The structural format is default. This format provides maximum information about the contents of an image file, and may reveal information (sometimes important) which other formats (or image info programs) may hide.

In all formats, the filename, file type, file size, an image summary, and a summary file format will be displayed (even when all other output is disabled by option). The image summary includes a summary report, for each subimage found, giving the image type, compression type (if any), pixel size, data length, file offset where found, and section of the file which includes or references the image. In some cases, short remarks may be included for images mentioned but not found, etc. The summary concludes with the number of images found, and number of images not found (if any). The summary is followed by a listing of format sections found (TIFF/JPEG/EXIF, etc.) and a type identifier for TIFF-derived types (e.g. CR2) where possible.

An environment variable may be set to a list of options to customize the default output behavior (see below).

MakerNotes

Camera-generated images which contain EXIF sections may also contain sections introduced by a MakerNote tag, which may contain information about camera or firmware settings used to produce the image. The structure and contents of MakerNote sections is not mandated by the Exif specification, but many camera MakerNotes are written in TIFF IFD format, possibly offset following an ID string or new TIFF header (or both), and sometimes with inventive handling of “offsets”. Exifprobe currently understands and automatically detects such schemes and prints the contents of the IFD (and the ID string, if present). This detection is not dependent upon make or model of camera. Make and Model information will usually be available from the first TIFF IFD in the file; this information may be used to interpret the MakerNote information for “known” cameras; otherwise, tag numbers, sizes, types, and raw values from the IFD will be shown (if permitted by option settings).

Some camera makes are known to use more than one version of MakerNote, depending upon model. If an unknown model from that maker is encountered, the note will be briefly examined and a noteversion assigned automatically if possible. If that fails, the note will be displayed without interpretation.

MakerNotes which are not in a recognizable IFD format will be reported (start and end offsets) in structural (-S) and report (-R) formats , and the beginning of the note section hex/ascii dumped. The remainder of the note may be dumped, in whole or in part, by the -M option (see below).

In list (-L) format, the starting file offset and length supplied by the MakerNote tag will be reported, and three “pseudo” tags which report the offset (MakerNote.Offset), size (MakerNote.Length) and scheme (MakerNote.Scheme) will appear.

JPEG APPn

In JPEG interchange format files, APP0 (JFIF,JFXX) and APP1 (Exif) segments will be fully decoded, and the “printable” portions of APP12 sections will be displayed. APP3 (Meta) sections will be expanded and the contained TIFF IFD will be displayed, although little interpretation is done. Other APP markers will be reported, and the sections may be hex/ascii dumped in whole or in part using the -A option. APP1 sections not marked as Exif will be treated as unknown.

ANSI Color

The program (by default) emits ANSI color escape sequences to highlight Exif, MakerNote , and Interoperability sub-sections. Errors and warnings are highlighted in red. These sequences are effective, of course, only for terminals or terminal emulators (e.g. xterm) which respond to ANSI color escape sequences. If a pager is used to display the output when these sequences are present, a “raw” option to the pager may be required (e.g. less -R). The use of these sequences may be toggled by the -c option. LIST mode turns color sequences off.

The program may be compiled without support for color sequences.

OUTPUT FORMATS

In all formats, and regardless of option setting, the first three lines of output for each file processed are the filename, image type (TIFF, CIFF, JP2, etc.), and the file size. If the type (taken from the file header) specifies a data byte order, the byte order will be indicated with the type as `II' (Intel byte order) or `MM' (Motorola byte order). The image summary and summary format will always be printed at the end.

Structural Format

Structural format output begins with a display of the file header. The header is followed by lines of the form <IFD0> ... </IFD0> <APP0> ... </APP0> <DIRECTORY> ... </DIRECTORY> etc. to indicate the beginning and end of each “section” of the file. Actual section names will, of course, depend upon the file format and/or the tags encountered. Only the TIFF IFD format is described here; other formats are similar, except that JP2 box names are printed inside square (rather than angle) brackets, and MRW section names inside curly braces.

Within sections, directory entries, subdirectories, the contents of known APP sections, JPEG segment markers, etc. are printed. Non-jpeg image data sections will be shown with a few lines of hex/ascii dump of the beginning of the data.

Each line of output is preceded by a file offset given in hex and decimal. File offsets are preceded by the character `@', except that section end markers are preceded by `-' and the character `>' may be used to mark sections which are located outside the IFD in which they are declared. If that section includes a subsection which is similarly afflicted, the '>' is replaced by '+' in the subsection. In JP2 files, the '@' is replaced by '=', for no particular reason.

JPEG and JPEG2000 segment markers are written with the marker name, and the decoded values of any information associated with the marker.

TIFF information is written in a manner which reflects the structure of the IFD, with all values interpreted according to the applicable specification where possible. All IFD fields are reported. The following fields will appear on each line (in the order given, following the file offset):

•
Tag number in hex and decimal representations, enclosed in brackets.
•
Tag name (where known); names for unknown tags are created as a hex representation of the tag number prefixed by the string 'TAG_'.
•
The TIFF type number, name, and byte count for the associated value, enclosed in square brackets.
•
The “value/offset” for the entry. If the value fits in the four bytes of the entry, the value is printed directly.

If the value for the entry did not fit in the four bytes of the entry, then the value found is an offset to the actual location of the data; that offset is printed preceded by an '@' symbol. The actual value will be printed later, at the file offset where it was found (except in some non-conforming MakerNote IFDs). If the value requires interpretation (e.g. TIFF Orientation) it is followed by an '=' sign and the interpretation, enclosed in double quotes (e.g. “0,0 top left”).

The list of entries will be followed by a line giving the offset to the next IFD (often 0) which is always found at the end of a TIFF IFD entry list.

If there were offset entries found in the list above, the TIFF (and Exif) specification requires that they will be located next in the file, immediately following the dirctory entries. This stricture is frequently ignored in MakerNotes and TIFF-derived formats. A line reporting the beginning of these offset values will be printed immediately after the next IFD offset, followed by one line for each offset entry, with the tag name repeated, followed by the actual value, followed by its interpretation (if any).

Multiple values in entries are printed on a single line, but large lists will be elided, with just the first two or three values shown, followed by an ellipsis, followed by the last value, the number of values, and the offset of the last value in the list. The full value list may be printed using the -eA option.

In structural format, ascii strings in the entry are printed for the entire length given in the IFD entry, including nulls and non-ascii values (if present), which are printed in `backslashed' octal notation. The -ea option may be used to force ascii values to be printed only up to the first null. This option is often necessary for CIFF format files, and is enabled by default in “list” mode.

Entries are indented slightly from the start identifier for the IFD, and subsegments (e.g. an Exif IFD, SubIFD, or MakerNote) will be further indented in order to indicate the structure of the file.

The resulting output displays the contents of the IFD much as it appears in the file (see the TIFF or EXIF specifications for descriptions of the IFD format).

Finally, the start and end of actual image data for the primary image (and possibly thumbnail or reduced-resolution image) is reported at the end. For JPEG images, this usually includes display of the JPEG segment markers within the image. Binary format image data will be shown with a brief hex/ascii dump of the beginning of the data, between start and end markers.

Note that values preceded by `@' are always offsets from the beginning of the file to the actual value. IFD offsets are usually recorded in the file as offsets relative to the beginning of the TIFF header (which is offset from the beginning of the file in JPEG APP1 files) but are adjusted by exifprobe to show offset from the beginning of the file. If it is important to see the recorded value, the -er option may be used to print the recorded value in parentheses, following the adjusted file offset.

Report Format

The report format (-R) displays all sections and segments of the image file, including start and end of sections, but eliminates much of the “cruft” of the structural format by eliminating address/offset information and much of the `internal' information from the TIFF IFD (tag number, type and count). Offset values are printed inline with the tag name. The output is indented to show the logical structure of the image file, but is much less difficult to view than the structural format.

List format

The list format (-L) suppresses structural information, writing only content in the format tagname = value or tagname = value = “what value means”. For non-tagged file formats, the tagname will be replaced by a fixed identifier for the item. In LIST format, “long” tagnames are used, which include the names of all parent sections of the section in which the data is found. Long tagnames can be toggled off, although this is unwise if the file contains multiple image sections.

The “value” of tags or items which represent an offset to a subsection or image are printed in list format as “@offset:length”.

The List format is used by the auxilliary script exifgrep, which permits selective extraction of information e.g. for photo galleries, and output in (almost) “shell variable” format.

Custom Formats

The -Z option “zeroes” all option flags (except the longnames modifier), after which the lower-case options may be used to set desired options. The lower-case options are `toggles', which may also be used to turn off items in the pre-defined formats.

As an example, the command:

exifprobe -Z -et somefile.tif

may be used to list just the TIFF and Exif tags, without values (or anything else) in “long name” format.

exifprobe -eNnT somefile.tif

will print in structural format, suppressing output of hex and decimal tag numbers, and tag type and count.

The “zero-level” output still reports the file data and image summary as described above.

OPTIONS

The environment variable EXIFPROBE_OPTIONS may be set to any valid option string, which will be evaluated before command line options. E.g.

export EXIFPROBE_OPTIONS='-L -c'

will make list format the default output format, and re-enable the color sequences turned off by -L.

Options are evaluated from left to right, so -Z should be given first, while -D (decimal only) or -X (hex only) should be given last.

-S
Structure mode: (default) almost everything; offset values not inline
-R
Report mode: like structural, but only tagnames and decimal values, indented, inline
-L
List mode: print only tags and values (including interpreted values); no section info; no color
-Z
turn off (zero) all optional output. Prints only filename, filetype, filesize, image summary, and file format.
-c
toggle use of ANSI color control sequences to emphasize EXIF sections. (default 'on' except list mode, unless compiled with no color support)
-a
toggle printing of addresses (file offsets) in hex and decimal
-I
three-way toggle indent (after address -> before -> none)
-o
toggle “inline” print of offset IFD values
-p[items]
toggle print identifiers for:
s
- sections (IFDs, APPn)
g
- segments (JPEG segments)
a
- JPEG APP0...APPn entries
l
- long names (dot-separated list of parent sections preceding item name)
e
- entries. Includes tag names, numbers, types, values.
m
- print MakerNote scheme description
M
- watch debug of MakerNote scheme detection
-e[items]
toggle print IFD entry items:
t
- tagname
n
- tag number in decimal
N
- tag number in hex
T
- entry type and count
v
- value in decimal
V
- value in hex
o
- file offset to value in decimal
O
- file offset to value in hex
r
- relative (unadjusted) offset in decimal
R
- also print “raw” values where normal values are computed (e.g. rational values, or some MakerNote values where APEX values must be computed from a raw value).
A
- print ALL elements of multiple-value tags
a
- ascii "ignore length" (stop at first null)
-D
limit all enabled numerical values to decimal only (addresses, tag numbers, offsets, values)
-X
limit all enabled numerical values to hex only (addresses, tag numbers, offsets, values)
-U[len|a]
dump len (or all) bytes of UNDEFINED data found in TIFF IFDS in hex/ascii form (but only if the structure of the data is not known)
-M[len|a]
dump len (or all) bytes of unrecognized MakerNotes in hex/ascii form (but only if the structure of the data is not known)
-A[len|a]
dump len (or all) bytes of unrecognized JPEG APP segments in hex/ascii form (but only if the structure of the data is not known)
-B[len|a]
dump len (or all) bytes of binary image data or failed JPEG image data
-C[make]+[model]
print a list of camera makes/models matching make or model as substrings. `+' by itself prints everything
-O start_offset
start processing at file offset start_offset
-n
print filename at beginning of each line of output (useful when grepping multiple files in LIST mode)
-N noteversion
force use of note version noteversion when interpreting MakerNotes. Useful only if you know what you're doing.
-m make
Force the makernote code to interpret the note according to the make given, rather than that contained in the file.
-l model
force the makernote code to interpret the note according to the model given, rather than that contained in the file.
-t
This option has effect only if set in EXIFPROBE_OPTIONS. If set when command line options are processed, color will be be off by default if the output is not to a tty. Any command line option which toggles or sets color (e.g. “-R”) will turn color back on.
-u
Print “raw” Unicode data. Normally 16 bit data is printed as though the high byte is zero (which is often the case). Writing the nulls would annoy most ascii terminal devices, so the default is more hospitable. The -u option forces printing of the full value.
-h
print a help message
-V
print program version and copyright

SEE ALSO

exifgrep(1)

The TIFF6 specification:

https://partners.adobe.com/asn/developer/PDFS/TN/TIFF6.pdf

The Exif 2.2 specification:

http://tsc.jeita.or.jp/avs/data/cp3451.pdf

The JFIF specification:

http://www.w3.org/Graphics/JPEG/jfif3.pdf

The TIFF/EP specification:

http://www.map.tu.chiba-u.ac.jp/IEC/100/TA2/recdoc/N4378.pdf

The CIFF specification http://xyrion.org/ciff/CIFFspecV1R04.pdf

The X3F public specification http://www.sd9.org.uk/X3F_Format.pdf

The JPEG2000 public draft (outdated) http://www.jpeg.org/public/fcd15444-1.pdf

DIAGNOSTICS

Most diagnostics are printed “inline” to stdout, in red if color is enabled, and the program attempts to proceed.

BUGS

Interpretation of MakerNote data for specific cameras is incomplete and probably always will be. The X3F specification is incomplete, and the final JPEG2000/JP2 specification is not freely available; support for these formats is therefore not complete, and may not be entirely accurate.

The RAF file format support is preliminary (there is no published specification).

Floating point values read from the file are expected to be in IEEE format (or at least, native format); i.e. no conversions are attempted.

ANSI color sequence support should use termcap/terminfo facilities; it does not.

AUTHOR



Duane H. Hesser

dhh@virtual-cafe.com