public class PerlTokenMaker extends AbstractJFlexCTokenMaker
This implementation was created using
JFlex 1.4.1; however, the generated file
was modified for performance. Memory allocation needs to be almost
completely removed to be competitive with the handwritten lexers (subclasses
of AbstractTokenMaker
, so this class has been modified so that
Strings are never allocated (via yytext()), and the scanner never has to
worry about refilling its buffer (needlessly copying chars around).
We can achieve this because RText always scans exactly 1 line of tokens at a
time, and hands the scanner this line as an array of characters (a Segment
really). Since tokens contain pointers to char arrays instead of Strings
holding their contents, there is no need for allocating new memory for
Strings.
The actual algorithm generated for scanning has, of course, not been modified.
If you wish to regenerate this file yourself, keep in mind the following:
PerlTokenMaker.java
file will contain two
definitions of both zzRefill
and yyreset
.
You should hand-delete the second of each definition (the ones
generated by the lexer), as these generated methods modify the input
buffer, which we'll never have to do.yylex()
on the generated scanner
directly; rather, you should use getTokenList
as you would
with any other TokenMaker
instance.AbstractJFlexCTokenMaker.CStyleInsertBreakAction
Modifier and Type | Field and Description |
---|---|
static int |
BACKTICKS |
static int |
CHAR_LITERAL |
static int |
HEREDOC_EOF_SINGLE_QUOTED
lexical states
|
static int |
HEREDOC_EOF_UNQUOTED |
static int |
HEREDOC_EOT_SINGLE_QUOTED |
static int |
HEREDOC_EOT_UNQUOTED |
static int |
INTERNAL_HEREDOC_EOF_SINGLE_QUOTED
Token type specific to PerlTokenMaker; this signals that we are inside
an single quoted EOF heredoc.
|
static int |
INTERNAL_HEREDOC_EOF_UNQUOTED
Token type specific to PerlTokenMaker; this signals that we are inside
an unquoted/double quoted/backtick EOF heredoc.
|
static int |
INTERNAL_HEREDOC_EOT_SINGLE_QUOTED
Token type specific to PerlTokenMaker; this signals that we are inside
an single quoted EOT heredoc.
|
static int |
INTERNAL_HEREDOC_EOT_UNQUOTED
Token type specific to PerlTokenMaker; this signals that we are inside
an unquoted/double quoted/backtick EOT heredoc.
|
static int |
INTERNAL_POD
Token type specific to PerlTokenMaker; this signals we are in a POD
block.
|
static int |
POD |
static int |
STRING |
static int |
YYEOF
This character denotes the end of file
|
static int |
YYINITIAL |
offsetShift, s, start
currentToken, firstToken, previousToken
Constructor and Description |
---|
PerlTokenMaker()
Constructor.
|
PerlTokenMaker(java.io.InputStream in)
Creates a new scanner.
|
PerlTokenMaker(java.io.Reader in)
Creates a new scanner
There is also a java.io.InputStream version of this constructor.
|
Modifier and Type | Method and Description |
---|---|
void |
addToken(char[] array,
int start,
int end,
int tokenType,
int startOffset)
Adds the token specified to the current linked list of tokens.
|
java.lang.String[] |
getLineCommentStartAndEnd(int languageIndex)
Returns the text to place at the beginning and end of a
line to "comment" it in this programming language.
|
boolean |
getMarkOccurrencesOfTokenType(int type)
Returns whether tokens of the specified type should have "mark
occurrences" enabled for the current programming language.
|
Token |
getTokenList(javax.swing.text.Segment text,
int initialTokenType,
int startOffset)
Returns the first token in the linked list of tokens generated
from
text . |
void |
yybegin(int newState)
Enters a new lexical state
|
char |
yycharat(int pos)
Returns the character at position pos from the
matched text.
|
void |
yyclose()
Closes the input stream.
|
int |
yylength()
Returns the length of the matched text region.
|
Token |
yylex()
Resumes scanning until the next regular expression is matched,
the end of input is encountered or an I/O-Error occurs.
|
void |
yypushback(int number)
Pushes the specified amount of characters back into the input stream.
|
void |
yyreset(java.io.Reader reader)
Resets the scanner to read from a new input stream.
|
int |
yystate()
Returns the current lexical state.
|
java.lang.String |
yytext()
Returns the text matched by the current regular expression.
|
createInsertBreakAction, getCurlyBracesDenoteCodeBlocks, getInsertBreakAction, getShouldIndentNextLineAfter
yybegin
addNullToken, addToken, addToken, createOccurrenceMarker, getClosestStandardTokenTypeForInternalType, getLanguageIndex, getLastTokenTypeOnLine, getNoTokensIdentifiedYet, getOccurrenceMarker, isIdentifierChar, isMarkupLanguage, resetTokenList, setLanguageIndex
public static final int YYEOF
public static final int HEREDOC_EOF_SINGLE_QUOTED
public static final int HEREDOC_EOT_SINGLE_QUOTED
public static final int HEREDOC_EOT_UNQUOTED
public static final int STRING
public static final int BACKTICKS
public static final int YYINITIAL
public static final int HEREDOC_EOF_UNQUOTED
public static final int CHAR_LITERAL
public static final int POD
public static final int INTERNAL_HEREDOC_EOF_UNQUOTED
public static final int INTERNAL_HEREDOC_EOF_SINGLE_QUOTED
public static final int INTERNAL_HEREDOC_EOT_UNQUOTED
public static final int INTERNAL_HEREDOC_EOT_SINGLE_QUOTED
public static final int INTERNAL_POD
public PerlTokenMaker()
public PerlTokenMaker(java.io.Reader in)
in
- the java.io.Reader to read input from.public PerlTokenMaker(java.io.InputStream in)
in
- the java.io.Inputstream to read input from.public void addToken(char[] array, int start, int end, int tokenType, int startOffset)
addToken
in interface TokenMaker
addToken
in class TokenMakerBase
array
- The character array.start
- The starting offset in the array.end
- The ending offset in the array.tokenType
- The token's type.startOffset
- The offset in the document at which this token
occurs.public java.lang.String[] getLineCommentStartAndEnd(int languageIndex)
getLineCommentStartAndEnd
in interface TokenMaker
getLineCommentStartAndEnd
in class TokenMakerBase
languageIndex
- The language index at the offset in question.
Since some TokenMaker
s effectively have nested
languages (such as JavaScript in HTML), this parameter tells the
TokenMaker
what sub-language to look at.null
value for either means there
is no string to add for that part. A value of
null
for the array means this language
does not support commenting/uncommenting lines.public boolean getMarkOccurrencesOfTokenType(int type)
TokenTypes.IDENTIFIER
.
Subclasses can override this method to support other token types, such
as TokenTypes.VARIABLE
.getMarkOccurrencesOfTokenType
in interface TokenMaker
getMarkOccurrencesOfTokenType
in class AbstractJFlexCTokenMaker
type
- The token type.public Token getTokenList(javax.swing.text.Segment text, int initialTokenType, int startOffset)
text
. This method must be implemented by
subclasses so they can correctly implement syntax highlighting.text
- The text from which to get tokens.initialTokenType
- The token type we should start with.startOffset
- The offset into the document at which
text
starts.Token
in a linked list representing
the syntax highlighted text.public final void yyreset(java.io.Reader reader)
reader
- the new input streampublic final void yyclose() throws java.io.IOException
java.io.IOException
public final int yystate()
public final void yybegin(int newState)
yybegin
in class AbstractJFlexTokenMaker
newState
- the new lexical statepublic final java.lang.String yytext()
public final char yycharat(int pos)
pos
- the position of the character to fetch.
A value from 0 to yylength()-1.public final int yylength()
public void yypushback(int number)
number
- the number of characters to be read again.
This number must not be greater than yylength()!public Token yylex() throws java.io.IOException
java.io.IOException
- if any I/O-Error occurs