Next: The Bison Declarations Section, Previous: The prologue, Up: Outline of a Bison Grammar [Contents][Index]
The functionality of Prologue sections can often be subtle and
inflexible. As an alternative, Bison provides a %code
directive with
an explicit qualifier field, which identifies the purpose of the code and
thus the location(s) where Bison should generate it. For C/C++, the
qualifier can be omitted for the default location, or it can be one of
requires
, provides
, top
. See %code Summary.
Look again at the example of the previous section:
%{ #define _GNU_SOURCE #include <stdio.h> #include "ptypes.h" %}
%union {
long n;
tree t; /* tree
is defined in ptypes.h. */
}
%{ static void print_token (yytoken_kind_t token, YYSTYPE val); %}
…
Notice that there are two Prologue sections here, but there’s a subtle
distinction between their functionality. For example, if you decide to
override Bison’s default definition for YYLTYPE
, in which
Prologue section should you write your new
definition?5
You should
write it in the first since Bison will insert that code into the parser
implementation file before the default YYLTYPE
definition. In
which Prologue section should you prototype an internal function,
trace_token
, that accepts YYLTYPE
and yytoken_kind_t
as
arguments? You should prototype it in the second since Bison will insert
that code after the YYLTYPE
and yytoken_kind_t
definitions.
This distinction in functionality between the two Prologue sections is
established by the appearance of the %union
between them. This
behavior raises a few questions. First, why should the position of a
%union
affect definitions related to YYLTYPE
and
yytoken_kind_t
? Second, what if there is no %union
? In that
case, the second kind of Prologue section is not available. This
behavior is not intuitive.
To avoid this subtle %union
dependency, rewrite the example using a
%code top
and an unqualified %code
. Let’s go ahead and add
the new YYLTYPE
definition and the trace_token
prototype at
the same time:
%code top { #define _GNU_SOURCE #include <stdio.h> /* WARNING: The following code really belongs * in a '%code requires'; see below. */ #include "ptypes.h" #define YYLTYPE YYLTYPE typedef struct YYLTYPE { int first_line; int first_column; int last_line; int last_column; char *filename; } YYLTYPE; }
%union {
long n;
tree t; /* tree
is defined in ptypes.h. */
}
%code { static void print_token (yytoken_kind_t token, YYSTYPE val); static void trace_token (yytoken_kind_t token, YYLTYPE loc); }
…
In this way, %code top
and the unqualified %code
achieve the
same functionality as the two kinds of Prologue sections, but it’s
always explicit which kind you intend. Moreover, both kinds are always
available even in the absence of %union
.
The %code top
block above logically contains two parts. The first
two lines before the warning need to appear near the top of the parser
implementation file. The first line after the warning is required by
YYSTYPE
and thus also needs to appear in the parser implementation
file. However, if you’ve instructed Bison to generate a parser header file
(see Bison Declaration Summary), you probably want that line to appear
before the YYSTYPE
definition in that header file as well. The
YYLTYPE
definition should also appear in the parser header file to
override the default YYLTYPE
definition there.
In other words, in the %code top
block above, all but the first two
lines are dependency code required by the YYSTYPE
and YYLTYPE
definitions.
Thus, they belong in one or more %code requires
:
%code top { #define _GNU_SOURCE #include <stdio.h> }
%code requires { #include "ptypes.h" }
%union {
long n;
tree t; /* tree
is defined in ptypes.h. */
}
%code requires { #define YYLTYPE YYLTYPE typedef struct YYLTYPE { int first_line; int first_column; int last_line; int last_column; char *filename; } YYLTYPE; }
%code { static void print_token (yytoken_kind_t token, YYSTYPE val); static void trace_token (yytoken_kind_t token, YYLTYPE loc); }
…
Now Bison will insert #include "ptypes.h"
and the new YYLTYPE
definition before the Bison-generated YYSTYPE
and YYLTYPE
definitions in both the parser implementation file and the parser header
file. (By the same reasoning, %code requires
would also be the
appropriate place to write your own definition for YYSTYPE
.)
When you are writing dependency code for YYSTYPE
and YYLTYPE
,
you should prefer %code requires
over %code top
regardless of
whether you instruct Bison to generate a parser header file. When you are
writing code that you need Bison to insert only into the parser
implementation file and that has no special need to appear at the top of
that file, you should prefer the unqualified %code
over %code
top
. These practices will make the purpose of each block of your code
explicit to Bison and to other developers reading your grammar file.
Following these practices, we expect the unqualified %code
and
%code requires
to be the most important of the four Prologue
alternatives.
At some point while developing your parser, you might decide to provide
trace_token
to modules that are external to your parser. Thus, you
might wish for Bison to insert the prototype into both the parser header
file and the parser implementation file. Since this function is not a
dependency required by YYSTYPE
or YYLTYPE
, it doesn’t make
sense to move its prototype to a %code requires
. More importantly,
since it depends upon YYLTYPE
and yytoken_kind_t
, %code
requires
is not sufficient. Instead, move its prototype from the
unqualified %code
to a %code provides
:
%code top { #define _GNU_SOURCE #include <stdio.h> }
%code requires { #include "ptypes.h" }
%union {
long n;
tree t; /* tree
is defined in ptypes.h. */
}
%code requires { #define YYLTYPE YYLTYPE typedef struct YYLTYPE { int first_line; int first_column; int last_line; int last_column; char *filename; } YYLTYPE; }
%code provides { void trace_token (yytoken_kind_t token, YYLTYPE loc); }
%code { static void print_token (FILE *file, int token, YYSTYPE val); }
…
Bison will insert the trace_token
prototype into both the parser
header file and the parser implementation file after the definitions for
yytoken_kind_t
, YYLTYPE
, and YYSTYPE
.
The above examples are careful to write directives in an order that reflects
the layout of the generated parser implementation and header files:
%code top
, %code requires
, %code provides
, and then
%code
. While your grammar files may generally be easier to read if
you also follow this order, Bison does not require it. Instead, Bison lets
you choose an organization that makes sense to you.
You may declare any of these directives multiple times in the grammar file. In that case, Bison concatenates the contained code in declaration order. This is the only way in which the position of one of these directives within the grammar file affects its functionality.
The result of the previous two properties is greater flexibility in how you may organize your grammar file. For example, you may organize semantic-type-related directives by semantic type:
%code requires { #include "type1.h" } %union { type1 field1; } %destructor { type1_free ($$); } <field1> %printer { type1_print (yyo, $$); } <field1>
%code requires { #include "type2.h" } %union { type2 field2; } %destructor { type2_free ($$); } <field2> %printer { type2_print (yyo, $$); } <field2>
You could even place each of the above directive groups in the rules section of
the grammar file next to the set of rules that uses the associated semantic
type.
(In the rules section, you must terminate each of those directives with a
semicolon.)
And you don’t have to worry that some directive (like a %union
) in the
definitions section is going to adversely affect their functionality in some
counter-intuitive manner just because it comes first.
Such an organization is not possible using Prologue sections.
This section has been concerned with explaining the advantages of the four
Prologue alternatives over the original Yacc Prologue.
However, in most cases when using these directives, you shouldn’t need to
think about all the low-level ordering issues discussed here.
Instead, you should simply use these directives to label each block of your
code according to its purpose and let Bison handle the ordering.
%code
is the most generic label.
Move code to %code requires
, %code provides
, or %code top
as needed.
However, defining YYLTYPE
via a C macro is not
the recommended way. See Data Type of Locations
Next: The Bison Declarations Section, Previous: The prologue, Up: Outline of a Bison Grammar [Contents][Index]