NAME

bisonc++input - Organization of bisonc++’s grammar file(s)

DESCRIPTION

Bisonc++ derives from bison++(1), originally derived from bison(1). Like these programs bisonc++ generates a parser for an LALR(1) grammar. Bisonc++ generates C++ code: an expandable C++ class.

Refer to bisonc++(1) for a general overview. This manual page covers the structure and organization of bisonc++’s grammar file(s).

Bisonc++’s grammar file has the following generic outline:

    directives (see the next section)
    %%
    grammar rules

Grammar rules have the following generic form:

    nonterminal:
        production-rules
    ;

Production rules consist of zero or more sequences of terminal tokens, nonterminal tokens and/or action blocks. When multiple production rules are used they must be separated from each other by vertical bars. Action blocks are C++ compound statements.

This manual page contains the following sections:

o: DESCRIPTION: this section;
o: DIRECTIVES: bisonc++’s grammar-specification directives;
o: POLYMORPHIC SEMANTIC VALUES: how to use polymorphic semantic values in parsers generated by bisonc++;
o: DOLLAR NOTATIONS: available $-shorthand notations with single, union, and polymorphic semantic value types.
o: RESTRICTIONS ON TOKEN NAMES: name restrictions for user-defined symbols;
o: OBSOLETE SYMBOLS: symbols available to bison(1), but not to bisonc++;
o: USING SYMBOLIC TOKENS IN CLASSES OTHER THAN THE PARSER CLASS; how to refer to tokens defined in the grammar;
o: EXAMPLE: an example of using bisonc++;
o: SEE ALSO: references to other programs and documentation;
o: AUTHOR: at the end of this man-page.

UNDERSCORES

Starting with version 6.02.00 bisonc++ reserved identifiers no longer end in two underscore characters, but in one. This modification was necessary because according to the C++ standard identifiers having two or more consecutive underscore characters are reserved by the language. In practice this could require some minor modifications of existing source files using bisonc++’s facilities, most likely limited to changing Tokens__ into Tokens_ and changing Meta__ into Meta_.

The complete list of affected names is:

Enums:

DebugMode_, ErrorRecovery_, Return_, Tag_, Tokens_

Enums values:

PARSE_ABORT_, PARSE_ACCEPT_, UNEXPECTED_TOKEN_, sizeofTag_

Type / namespace designators:

Meta_, PI_, STYPE_

Member functions:

clearin_, errorRecovery_, errorVerbose_, executeAction_, lex_, lookup_, nextCycle_, nextToken_, popToken_, pop_, print_, pushToken_, push_, recovery_, redoToken_, reduce_, savedToken_, shift_, stackSize_, startRecovery_, state_, token_, top_, vs_,

Protected data members:

d_acceptedTokens_, d_actionCases_, d_debug_, d_nErrors_, d_requiredTokens_, d_val_, idOfTag_, s_nErrors_

DIRECTIVES

Quite a few directives can be specified in the initial section of the grammar specification file. If command-line options for directives are available, then their specifications take precedence over the corresponding directives in the grammar file. Once class header or implementation header files exist directives affecting those files are ignored.

Directives accepting a `filename’ do not accept path names, i.e., they cannot contain directory separators (/); directives accepting a ’pathname’ may contain directory separators. A ’pathname’ using blank characters should be surrounded by double quotes.

Some directives may generate errors. This happens when their specifications conflict with the contents of files bisonc++ cannot modify (e.g., a parser class header file exists, but doesn’t define a namespace, but in a later run the a %namespace directive was provided).

To resolve such errors the offending directive could be omitted, the existing file could be removed, or the existing file could be hand-edited according to the directive’s specification.