man HTML::Mason::Compiler () - Compile Mason component source
NAME
HTML::Mason::Compiler - Compile Mason component source
SYNOPSIS
package My::Funky::Compiler;
use base qw(HTML::Mason::Compiler);
DESCRIPTION
The compiler starts the compilation process by calling its lexer's CWlex method and passing itself as the CWcompiler parameter. The lexer then calls various methods in the compiler as it parses the component source.
PARAMETERS TO THE new() CONSTRUCTOR
- allow_globals
-
List of variable names, complete with prefix (CW$@%), that you intend
to use as globals in components. Normally global variables are
forbidden by CWstrict, but any variable mentioned in this list is
granted a reprieve via a use vars statement. For example:
allow_globals => [qw($DBH %session)]
In a mod_perl environment, CW$r (the request object) is automatically added to this list. - default_escape_flags
-
Escape flags to apply to all <% %> expressions by default. The current
valid flags are
h - escape for HTML ('<' => '<', etc.) u - escape for URL (':' => '%3A', etc.)
The developer can override default escape flags on a per-expression basis; see the escaping expressions section of the developer's manual. If you want to set multiple flags as the default, this should be given as a reference to an array of flags. - enable_autoflush
- True or false, default is true. Indicates whether components are compiled with support for autoflush. The component can be compiled to a more efficient form if it does not have to check for autoflush mode, so you should set this to 0 if you can.
- lexer
- The Lexer object to associate with this Compiler. By default a new object of class lexer_class will be created.
- lexer_class
- The class to use when creating a lexer. Defaults to HTML::Mason::Lexer.
- preprocess
- Sub reference that is called to preprocess each component before the compiler does it's magic. The sub is called with a single parameter, a scalar reference to the script. The sub is expected to process the script in-place. This is one way to extend the HTML::Mason syntax with new tags, etc., although a much more flexible way is to subclass the Lexer or Compiler class. See also postprocess_text and postprocess_perl.
- postprocess_text
- Sub reference that is called to postprocess the text portion of a compiled component, just before it is assembled into its final subroutine form. The sub is called with a single parameter, a scalar reference to the text portion of the component. The sub is expected to process the string in-place. See also preprocess and postprocess_perl.
- postprocess_perl
- Sub reference that is called to postprocess the Perl portion of a compiled component, just before it is assembled into its final subroutine form. The sub is called with a single parameter, a scalar reference to the Perl portion of the component. The sub is expected to process the string in-place. See also preprocess and postprocess_text.
- use_source_line_numbers
- True or false, default is true. Indicates whether component line numbers that appear in error messages, stack traces, etc. are in terms of the source file instead of the object file. Mason does this by inserting '#line' directives into compiled components. While source line numbers are more immediately helpful, object file line numbers may be more appropriate for in-depth debugging sessions.
ACCESSOR METHODS
All of the above properties have read-only accessor methods of the same name.
You cannot change any property of a compiler after it has been created - among other things, this would potentially invalidate any existing cached component objects or object files. Your best bet is to create different compiler objects and load them into different interpreters.
METHODS
There are several methods besides the compilation callbacks below that a Compiler subclass needs to implement.
- compile(comp_source => <string>, name => <string>, comp_class => <string>)
- The comp_class parameter may be ignored by the compiler.
- object_id
- This method should return a unique id for the given compiler object. This is used by the interpreter when determining the object directory, for example.
Compilation Callbacks
These are methods called by the Lexer while processing a component source. You may wish to override some of these methods if you're implementing your own custom Compiler class.
- start_component()
- This method is called by the Lexer when it starts processing a component.
- end_component()
- This method is called by the Lexer when it finishes processing a component.
- start_block(block_type => <string>)
- This method is called by the Lexer when it encounters an opening Mason block tag like CW<%perl> or CW<%args>. Its main purpose is to keep track of the nesting of different kinds of blocks within each other. The type of block (init, once, etc.) is passed via the block_type parameter.
- end_block(block_type => <string>)
- This method is called by the Lexer when it encounters a closing Mason block tag like CW</%perl> or CW</%args>. Like CWstart_block(), its main purpose is to help maintain syntactic integrity.
- *_block(block => <string>, [ block_type => <string> ])
- Several compiler methods like CWdoc_block(), CWtext_block(), and CWraw_block() are called by the Lexer after CWstart_block() when it encounters blocks of certain types. These methods actually do the work of putting the body of a block into the compiled data structure. The methods that follow this pattern are CWinit_block(), CWperl_block(), CWdoc_block(), CWtext_block(), and CWraw_block(). The last method is called for all CW<%once>, CW<%cleanup>, CW<%filter>, CW<%init>, CW<%perl>, and CW<%shared> blocks.
- text(text => <string>)
- Inserts the text contained in a CWtext parameter into the component for verbatim output. This is called when the lexer finds plain text in a component.
- variable_declaration( type => <string>, name => <string>, default => <string> )
- Inserts a variable declaration from the CW<%args> section into the component. The type will be either $, @, or %, indicating a scalar, array, or hash. The name is the variable name without the leading sigil. The default is everything found after the first => on an CW<%args> block line, and may include a comment.
- key_value_pair(block_type => <string>, key => <string>, value => <string>)
- Inserts a key-value pair from a CW<%flags> or CW<%attr> section into the component. The block_type parameter will be either flags or attr.
- start_named_block(block_type => <string>, name => <name>)
- Analogous to item_start_block, but starts a named block (CW<%method> or CW<%def>).
- end_named_block()
- Called by the Lexer to end a named block.
- substitution(substitution => <string>, escape => <string>)
- Called by the Lexer when it encounters a substitution tag (CW<% ... %>). The value of the escape parameter will be everything found after the pipe (|) in the substitution tag, and may be more than one character such as nh.
- component_call(call => <string>)
- Called by the Lexer when it encounters a component call tag without embedded content (CW<& ... &>). The call parameter contains the entire contents of the tag.
- component_content_call(call => <string>)
- Called by the Lexer when it encounters a component call tag with embedded content (CW<&| ... &>).
- component_content_call_end()
- Called by the Lexer when it encounters an ending tag for a component call with content (CW</&>). Note that there is no corresponding CWcomponent_call_end() method for component calls without content, because these calls don't have ending tags.
- perl_line(line => <string>)
- Called by the Lexer when it encounters a CW%-line.
SUBCLASSING
We recommend that any parameters you add to Compiler be read-only, because the compiler object_id is only computed once on creation and would not reflect any changes to Lexer parameters.
SEE ALSO
HTML::Mason, HTML::Mason::Admin, HTML::Mason::Interp