XhtmlParser

Inherited from: MarkupParser

Inherited from: MarkupHandler

<! attlist := ATTLIST

Inherited from: MarkupParser

Inherited from: MarkupParser

Inherited from: MarkupParser

Inherited from: TokenTests

Inherited from: TokenTests

Inherited from: ConstructingHandler

content1 ::=  '<' content1 | '&' charref ...

Inherited from: MarkupParser

'<' content1 ::=  ...

Inherited from: MarkupParser

[22]     prolog      ::= XMLDecl? Misc* (doctypedecl Misc*)?
[23]     XMLDecl     ::= '<?xml' VersionInfo EncodingDecl? SDDecl? S? '?>'
[24]     VersionInfo ::= S 'version' Eq ("'" VersionNum "'" | '"' VersionNum '"')
[25]     Eq          ::= S? '=' S?
[26]     VersionNum  ::= '1.0'
[27]     Misc        ::= Comment | PI | S

Inherited from: MarkupParser

Inherited from: ConstructingHandler

Inherited from: MarkupHandler

callback method invoked by MarkupParser after end-tag of element.

Value Params

label: the local name
pos: the position in the source file
pre: the prefix

Inherited from

MarkupHandler

callback method invoked by MarkupParser after start-tag of element.

Value Params

attrs: the attributes (metadata)
label: the local name
pos: the position in the sourcefile
pre: the prefix

Inherited from

MarkupHandler

Inherited from: MarkupParser

'<' element ::= xmlTag1 '>'  { xmlExpr | '{' simpleExpr '}' } ETag
             | xmlTag1 '/' '>'

Inherited from: MarkupParser

<! element := ELEMENT

Inherited from: MarkupParser

Inherited from: MarkupHandler

<! element := ELEMENT

Inherited from: MarkupParser

Inherited from: ConstructingHandler

Inherited from: MarkupParser

Inherited from: MarkupParserCommon

Inherited from: MarkupParser

Inherited from: MarkupParser

externalID ::= SYSTEM S syslit
               PUBLIC S pubid S syslit

Inherited from: MarkupParser

Inherited from: ExternalSources

As the current code requires you to call nextch once manually after construction, this method formalizes that suboptimal reality.

Inherited from: MarkupParser

"rec-xml/#ExtSubset" pe references may not occur within markup declarations

Inherited from: MarkupParser

These are 99% sure to be redundant but refactoring on the safe side.

Inherited from: TokenTests

Inherited from: TokenTests

See [5] of XML 1.0 specification.

Name ::= ( Letter | '_' ) (NameChar)*

See [5] of XML 1.0 specification.

Inherited from: TokenTests

See [4] and [4a] of Appendix B of XML 1.0 specification.

NameChar ::= Letter | Digit | '.' | '-' | '_' | ':' | #xB7
           | CombiningChar | Extender

See [4] and [4a] of Appendix B of XML 1.0 specification.

Inherited from: TokenTests

where Letter means in one of the Unicode general categories { Ll, Lu, Lo, Lt, Nl }.

NameStart ::= ( Letter | '_' | ':' )

where Letter means in one of the Unicode general categories { Ll, Lu, Lo, Lt, Nl }.

We do not allow a name to start with :. See [4] and Appendix B of XML 1.0 specification

Inherited from: TokenTests

Inherited from: TokenTests

(#x20 | #x9 | #xD | #xA)+

Inherited from: TokenTests

(#x20 | #x9 | #xD | #xA)

Inherited from: TokenTests

Returns true if the encoding name is a valid IANA encoding. This method does not verify that there is a decoder available for this encoding, only that the characters are valid for an IANA encoding name.

Value Params

ianaEncoding: The IANA encoding name.

Inherited from

TokenTests

Inherited from: MarkupParser

Inherited from: MarkupHandler

Inherited from: MarkupParser

Inherited from: MarkupParser

Inherited from: MarkupParser

Inherited from: MarkupParser

this method tells ch to get the next character when next called

Inherited from: MarkupParser

'N' notationDecl ::= "OTATION"

Inherited from: MarkupParser

Inherited from: MarkupHandler

Inherited from: MarkupHandler

parses document type declaration and assigns it to instance variable dtd.

<! parseDTD ::= DOCTYPE name ... >

Inherited from: MarkupParser

Inherited from: MarkupHandler

Inherited from: MarkupHandler

Inherited from: MarkupParser

Inherited from: ConstructingHandler

<? prolog ::= xml S?
// this is a bit more lenient than necessary...

Inherited from: MarkupParser

[12]       PubidLiteral ::=        '"' PubidChar* '"' | "'" (PubidChar - "'")* "'"

Inherited from: MarkupParser

Inherited from: MarkupParser

Inherited from: MarkupParser

append Unicode character to name buffer

Inherited from: MarkupParser

Inherited from: MarkupHandler

Inherited from: MarkupParser

Inherited from: MarkupParser

Inherited from: MarkupParser

Apply a function and return the passed value

Inherited from: MarkupParserCommon

Execute body with a variable saved and restored after execution

Inherited from: MarkupParserCommon

attribute value, terminated by either ' or ". value may not contain <.

     AttValue     ::= `'` { _ } `'`
                    | `"` { _ } `"`

Inherited from: MarkupParser

Inherited from: ConstructingHandler

prolog, but without standalone

Inherited from: MarkupParser

Inherited from: MarkupParser

Inherited from: MarkupHandler

Inherited from: MarkupParserCommon

Inherited from: MarkupParserCommon

attribute value, terminated by either ' or ". value may not contain <.

Value Params

endCh: either ' or "

Inherited from

MarkupParserCommon

parse attribute and create namespace scope, metadata

[41] Attributes    ::= { S Name Eq AttValue }

Inherited from: MarkupParser

'<! CharData ::= [CDATA[ ( {char} - {char}"]]>"{char} ) ']]>'

see [15]

Inherited from: MarkupParser

Inherited from: MarkupParserCommon

Inherited from: MarkupParserCommon

CharRef ::= "&#" '0'..'9' {'0'..'9'} ";" | "&#x" '0'..'9'|'A'..'F'|'a'..'f' { hexdigit } ";"

see [66]

Inherited from: MarkupParserCommon

Comment ::= ''

see [15]

Inherited from: MarkupParser

scan [S] '=' [S]

Inherited from: MarkupParserCommon

[42] '<' xmlEndTag ::= '<' '/' Name S? '>'

Inherited from: MarkupParserCommon

entity value, terminated by either ' or ". value may not contain <.

     AttValue     ::= `'` { _  } `'`
                    | `"` { _ } `"`

Inherited from: MarkupParser

Inherited from: MarkupParser

actually, Name ::= (Letter | '_' | ':') (NameChar)* but starting with ':' cannot happen Name ::= (Letter | '_') (NameChar)*

see [5] of XML 1.0 specification

pre-condition: ch != ':' // assured by definition of XMLSTART token post-condition: name does neither start, nor end in ':'

Inherited from: MarkupParserCommon

'?' {Char})]'?>'

see [15]

Inherited from: MarkupParserCommon

scan [3] S ::= (#x20 | #x9 | #xD | #xA)+

Inherited from: MarkupParserCommon

skip optional space S?

Inherited from: MarkupParserCommon

parse a start or empty tag. [40] STag ::= '<' Name { S Attribute } [S] [44] EmptyElemTag ::= '<' Name { S Attribute } [S]

Inherited from: MarkupParserCommon

Take characters from input stream until given String "until" is seen. Once seen, the accumulated characters are passed along with the current Position to the supplied handler function.

Inherited from: MarkupParserCommon

Inherited from: MarkupParserCommon

Inherited from: MarkupParserCommon

<? prolog ::= xml S ... ?>

Inherited from: MarkupParser

XhtmlParser

Type members

Inherited types

Value members

Inherited methods

Concrete fields

Inherited fields