scala3-compiler-bootstrapped/dotty.tools/dotty.tools.dotc/dotty.tools.dotc.parsing/Scanners/Scanner

Scanner

dotty.tools.dotc.parsing.Scanners$.Scanner

class Scanner(source: SourceFile, val startFrom: Offset, profile: Profile, allowIndent: Boolean)(using x$5: Context) extends ScannerCommon

Attributes

Graph
Supertypes: class ScannerCommon

trait TokenData

class CharArrayReader

class Object

trait Matchable

class Any
Show all

Members list

Type members

Inherited classlikes

Attributes

Inherited from:: CharArrayReader
Supertypes: class CharArrayReader

class Object

trait Matchable

class Any

Value members

Concrete methods

Parse character literal if current character is followed by ', or follow with given op and return a symbol literal token

Attributes

Return a list of all the comment positions

Attributes

Returns the closest docstring preceding the position supplied

Attributes

Handle newlines, possibly inserting an INDENT, OUTDENT, NEWLINE, or NEWLINES token in front of the current token. This depends on whether indentation is significant or not.

Indentation is significant if indentSyntax is set, and we are not inside a {...}, [...], (...), case ... => pair, nor in a if/while condition (i.e. currentRegion is empty).

There are three rules:

Insert NEWLINE or NEWLINES if
- the closest enclosing sepRegion is { ... } or for ... do/yield, or we are on the toplevel, i.e. currentRegion is empty, and
- the previous token can end a statement, and
- the current token can start a statement, and
- the current token is not a leading infix operator, and
- if indentation is significant then the current token starts at the current indentation width or to the right of it.
The inserted token is NEWLINES if the current token is preceded by a whitespace line, or NEWLINE otherwise.
Insert INDENT if
- indentation is significant, and
- the last token can start an indentation region.
- the indentation of the current token is strictly greater than the previous indentation width, or the two widths are the same and the current token is one of : or match.
The following tokens can start an indentation region:

: = => <- if then else while do try catch finally for yield match

Inserting an INDENT starts a new indentation region with the indentation of the current token as indentation width.
Insert OUTDENT if
- indentation is significant, and
- the indentation of the current token is strictly less than the previous indentation width,
- the current token is not a leading infix operator.
Inserting an OUTDENT closes an indentation region. In this case, issue an error if the indentation of the current token does not match the indentation of some previous line in an enclosing indentation region.

If a token is inserted and consumed, the original source token is still considered to start a new line, so the process that inserts an OUTDENT might repeat several times.

Indentation widths are strings consisting of spaces and tabs, ordered by the prefix relation. I.e. a <= b iff b.startsWith(a). If indentation is significant it is considered an error if the current indentation width and the indentation of the current token are incomparable.

Attributes

Is the current token in a position where a modifier is allowed?

Attributes

The indentation width of the given offset.

Attributes

Insert token at assumed offset in front of current one.

Attributes

A leading symbolic or backquoted identifier is treated as an infix operator if

it does not follow a blank line, and
it is followed by at least one whitespace character and a token that can start an expression.
if the operator appears on its own line, the next line must have at least the same indentation width as the operator. See pos/i12395 for a test where this matters. If a leading infix operator is found and the source version is 3.0-migration, emit a change warning.

Attributes

The next token after this one. The token is computed via fetchToken, so complex two word tokens such as CASECLASS are not recognized. Newlines and indent/unindent tokens are skipped. Restriction: lookahead is illegal if the current token is INTERPOLATIONID

Attributes

Produce next token, filling TokenData fields of Scanner.

Attributes

Insert an token if next token closes an indentation region. Exception: continue if indentation region belongs to a match and next token is case.

Attributes

Join CASE + CLASS => CASECLASS, CASE + OBJECT => CASEOBJECT SEMI + ELSE => ELSE, COLON following id/)/] => COLONfollow
Insert missing OUTDENTs at EOF

Attributes

Skip on error to next safe point.

Attributes

Skip matching pairs of (...) or [...] parentheses.

Attributes

Returns a string representation of the object.

The default representation is platform dependent.

Attributes

Returns: a string representation of the object.
Definition Classes: Any

The token for given name. Either IDENTIFIER or a keyword.

Attributes

Inherited methods

Attributes

Inherited from:: ScannerCommon

Attributes

Inherited from:: TokenData

Switch whether unicode should be decoded

Attributes

Inherited from:: CharArrayReader

Generate an error at the given offset

Attributes

Inherited from:: ScannerCommon

Implements CharArrayReader's error method

Attributes

Inherited from:: ScannerCommon

Attributes

Inherited from:: ScannerCommon

Finish an IDENTIFIER with this.name.

Attributes

Inherited from:: ScannerCommon

Clear buffer and set name and token. If target is different from this, don't treat identifiers as end tokens.

Attributes

Inherited from:: ScannerCommon

Attributes

Inherited from:: CharArrayReader

signal an error where the input ended in the middle of a token

Attributes

Inherited from:: ScannerCommon

Attributes

Inherited from:: ScannerCommon

Is current token first one after a newline?

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: CharArrayReader

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: ScannerCommon

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: TokenData

Is last character a unicode escape \uxxxx?

Attributes

Inherited from:: CharArrayReader

Attributes

Inherited from:: CharArrayReader

A new reader that takes off at the current character position

Attributes

Inherited from:: CharArrayReader

Advance one character; reducing CR;LF pairs to just LF

Attributes

Inherited from:: CharArrayReader

Advance one character, leaving CR;LF pairs intact. This is for use in multi-line strings, so there are no "potential line ends" here.

Attributes

Inherited from:: CharArrayReader

append Unicode character to "litBuf" buffer

Attributes

Inherited from:: ScannerCommon

Attributes

Inherited from:: ScannerCommon

Clear buffer and set string

Attributes

Inherited from:: ScannerCommon

Attributes

Inherited from:: ScannerCommon

Concrete fields

The current region. This is initially an Indented region with zero indentation width.

Attributes

We need one token lookahead and one token history

Attributes

Inherited fields

the base of a number

Attributes

Inherited from:: TokenData

Attributes

Inherited from:: ScannerCommon

the last read character

Attributes

Inherited from:: CharArrayReader

The offset one past the last read character

Attributes

Inherited from:: CharArrayReader

the last error offset

Attributes

Inherited from:: ScannerCommon

The offset before the last read character

Attributes

Inherited from:: CharArrayReader

the offset of the character following the token preceding this one

Attributes

Inherited from:: TokenData

the offset of the newline immediately preceding the token, or -1 if token is not preceded by a newline.

Attributes

Inherited from:: TokenData

The start offset of the current line

Attributes

Inherited from:: CharArrayReader

A character buffer for literals

Attributes

Inherited from:: ScannerCommon

the name of an identifier

Attributes

Inherited from:: TokenData

the offset of the first character of the current token

Attributes

Inherited from:: TokenData

the string value of a literal

Attributes

Inherited from:: TokenData

the next token

Attributes

Inherited from:: TokenData

In this article

Generated with