ESW PART-FW: OpenBIOS DeTokenizer detok (A User's Guide)

The easiest way to describe the different optional output formats would be by creating an example of a source file that has been Tokenized and displaying the output of the DeTokenizer, applied to its resultant FCode binary, with the various options.

Our example source file looks like this:

With no options selected, the DeTokenizer output looks like this:

The "verbose" option adds a display of the hex value of each token processed, (as well as a signature block), thus:

The "offsets" option shows the position of the tokens relative to the start of the first FCode block after a PCI header (if one is present) and the destination-offset of each branch. If more than one FCode header follows a single PCI header, the offset-counter will continue; if a new PCI header is encountered, the offset-counter will be reset and will begin counting again from zero after the end of the latest PCI header.

Without the "verbose" option, i.e., with just the "offsets" option by itself, the DeTokenizer output looks like this:

Combining the "verbose" and "offsets" options results in something that looks like this:

There's another option called "line numbers" but it only numbers the lines of output. It's easy enough to describe, and so needs no illustration.

The command-line format is simply:

detok [options] fc-file [fc-file ...]

The output of this DeTokenizer is directed to STDOUT, so there is no "Output file" option per se. Simply redirect the output to the file in which you wish to keep the results, using the standard Shell conventions.

Command-Line option Switches are case-sensitive; only one option has an applicable argument, and that one is a file name. Its case sensitivity is, of course, dependent on the Host Operating System.

Print a brief help message and then exit.

Verbose -- display additional information: the hex value of each token processed, as well as a signature block.

Offsets -- display the positions of the tokens relative to the start of the first FCode block after a PCI header (if one is present), and the destination-offset of each branch.

Note that the combination of the Verbose and Offsets options yields the maximum amount of useful information.

Line Numbers -- display the sequential number of each line of output.

Note that the -n and -o options are mutually exclusive; if both are specified, -o will be favoured.

Process All input. Do not stop when end0 has been encountered. This option is usually not needed, but may be useful in cases where a file has been corrupted or when something very strange has been Tokenized...

Pre-load Additional FCodes before processing. These might be, for instance, a set of vendor-specific FCodes that were generated for a specific vendor's products by a Tokenizer customized for that specific vendor. A detailed discussion of the "Additional FCodes" file will be presented in a separate dedicated section.

Some vendors' FCode drivers contain non-standard FCode tokens. In order to accommodate those situations, provision is made to specify the names of the FCodes in question. The -f command-line option permits the user to specify an "Additional FCodes List" file, which will be read before detokenization begins and which will contain the list of "Additional FCodes" to be recognized.

The format of the file is as follows:

One entry, consisting of an FCode and its name, on a line. The FCode Number is given first, in the form of a hex number, preceded by an optional 0x or 0X (Thus: 0x602 or 0X602 or simply 602 are all equivalent.) At least one blank space separates the FCode Number from the Name, which must be on the same line. Any number of blanks are permitted, and any text that follows the Name is permitted and will be ignored.
Blank lines are permitted and will be ignored.
Comment lines are permitted and will be ignored. A comment-line starts with either a pound-sign ( # ) or a backslash ( \ ).
FCode Numbers are limited to the range 0x10..0x7ff Numbers smaller than 0x10 are the leading-byte of a two-byte FCode, and numbers from 0x800 and up are assigned by the tokenizer. Lines with numbers outside the permitted range will be ignored, and a message will be printed.
FCode numbers that are already assigned will not be permitted to be overwritten. Lines with numbers that are already assigned will be ignored, and a message will be printed.

If the file cannot be read, that will be regarded as an immediate failure and cause the program to exit.

Special Functions

In addition to non-standard FCode tokens with simple behavior, some vendors' FCode drivers also contain non-standard FCode tokens with complex behavior. An example that was recently encountered is "double(lit)" which precedes a double-length (i.e., 64-bit) literal. This DeTokenizer is structured to allow the creation of a list of pre-defined Special Function names, each of which has a special behavior associated with it. When one of those names occurs in the "Additional FCodes List" file, it will be recognized; the FCode Number given with it is assigned to it. When that FCode number is encountered, the assigned special behavior will be exercised.

Adding to the list of Special Function names, and associating a new behavior with the added function, requires modifying the DeTokenizer code, but the infrastructure that is already in place should make this a manageable task for even a modestly skilled programmer.

At the present writing, only one such Special Function name is supported, and that one is, of course, double(lit)

Its associated special behavior is to collect the next eight bytes from the FCode input stream and display them as a double-length literal.

If you modify the DeTokenizer to recognize additional Special Function names, please update this document to list them and describe their special behaviors. Thank you.

OpenBIOS DeTokenizer detok

(A User's Guide)

Table of Contents

Overview

Output Formats

Sample source file:

DeTokenizer output with no options selected:

DeTokenizer output with the "verbose" option selected:

DeTokenizer output with the "offsets" option selected:

DeTokenizer output with both the "verbose" and "offsets" options selected:

Command-Line Format

Command-Line Options

Switches

The "Additional FCodes" file

Special Functions

End Of Document