eu.cdevreeze.yaidom.parse

DocumentParserUsingDom

final class DocumentParserUsingDom extends AbstractDocumentParser

DOM-based Document parser.

A typical non-trivial way to create an instance is as follows, assuming a class MyEntityResolver that extends EntityResolver and a class MyErrorHandler that extends ErrorHandler:

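import javax.xml.parsers.{DocumentBuilder, DocumentBuilderFactory}
import eu.cdevreeze.yaidom.parse.DocumentParserUsingDom

val dbf = DocumentBuilderFactory.newInstance()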
dbf.setNamespaceAware(true)

def createDocumentBuilder(dbf: DocumentBuilderFactory): DocumentBuilder = {
  val db = dbf.newDocumentBuilder()
  db.setEntityResolver(new MyEntityResolver)
  db.setErrorHandler(new MyErrorHandler)
  db
}

val docParser = DocumentParserUsingDom.newInstance(dbf, createDocumentBuilder _)

If we want the DocumentBuilderFactory to be a validating one, using an XML Schema, we could obtain the DocumentBuilderFactory as follows:

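import java.io.File
import javax.xml.XMLConstants
import javax.xml.parsers.DocumentBuilderFactory
import javax.xml.transform.stream.StreamSource
import javax.xml.validation.SchemaFactory

val schemaFactory = SchemaFactory.newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI)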
val schemaSource = new StreamSource(new File(pathToSchema))
val schema = schemaFactory.newSchema(schemaSource)

val dbf = {
  val result = DocumentBuilderFactory.newInstance()
  result.setNamespaceAware(true)
  result.setSchema(schema)
  result
}

A custom EntityResolver can be used to retrieve DTDs locally, or even to suppress DTD resolution altogether. The latter can be coded as follows (see http://stuartsierra.com/2008/05/08/stop-your-java-sax-parser-from-downloading-dtds), at the risk of losing some information:

import org.xml.sax.{EntityResolver, InputSource}

class MyEntityResolver extends EntityResolver {
  override def resolveEntity(publicId: String, systemId: String): InputSource = {
    // This dirty hack may not work on IBM JVMs
    new InputSource(new java.io.StringReader(""))
  }
}
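To make the effect of such a resolver concrete, here is a minimal sketch that uses only JAXP (no yaidom classes). It assumes a hypothetical class EmptyEntityResolver, equivalent to MyEntityResolver above, and a made-up document whose DOCTYPE points at a placeholder URI that is never fetched:

```scala
import java.io.StringReader
import javax.xml.parsers.DocumentBuilderFactory
import org.xml.sax.{EntityResolver, InputSource}

// Hypothetical resolver: answers every external entity request with empty
// content, so the parser never tries to fetch the DTD.
class EmptyEntityResolver extends EntityResolver {
  override def resolveEntity(publicId: String, systemId: String): InputSource =
    new InputSource(new StringReader(""))
}

object DtdSuppressionDemo {
  // Illustrative document; the DTD location is a placeholder, not a real resource.
  val xml: String =
    """<?xml version="1.0"?>
      |<!DOCTYPE root SYSTEM "http://example.invalid/missing.dtd">
      |<root><child/></root>""".stripMargin

  // Parses the document with DTD resolution suppressed and returns the
  // local name of the root element.
  def parseRootName(): String = {
    val dbf = DocumentBuilderFactory.newInstance()
    dbf.setNamespaceAware(true)
    val db = dbf.newDocumentBuilder()
    db.setEntityResolver(new EmptyEntityResolver)
    val doc = db.parse(new InputSource(new StringReader(xml)))
    doc.getDocumentElement.getLocalName
  }
}
```

Because the DTD is replaced by empty content, anything it would have declared (entities, default attribute values) is lost, which is the loss of information mentioned above.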

For completeness, here is a custom ErrorHandler class that simply prints parse exceptions to standard output:

import org.xml.sax.{ErrorHandler, SAXParseException}

class MyErrorHandler extends ErrorHandler {
  def warning(exc: SAXParseException): Unit = { println(exc) }
  def error(exc: SAXParseException): Unit = { println(exc) }
  def fatalError(exc: SAXParseException): Unit = { println(exc) }
}
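Printing is often not enough when validating against a schema. As a hedged variation, the sketch below collects the reported exceptions instead, so the caller can inspect them after parsing. It uses only JAXP; the class CollectingErrorHandler, the object ValidationDemo and the inline schema are illustrative assumptions, not part of yaidom:

```scala
import java.io.StringReader
import javax.xml.XMLConstants
import javax.xml.parsers.DocumentBuilderFactory
import javax.xml.transform.stream.StreamSource
import javax.xml.validation.SchemaFactory
import org.xml.sax.{ErrorHandler, InputSource, SAXParseException}
import scala.collection.mutable.ListBuffer

// Hypothetical handler: records every reported problem instead of printing it.
class CollectingErrorHandler extends ErrorHandler {
  val problems: ListBuffer[SAXParseException] = ListBuffer.empty
  def warning(exc: SAXParseException): Unit = { problems += exc; () }
  def error(exc: SAXParseException): Unit = { problems += exc; () }
  def fatalError(exc: SAXParseException): Unit = { problems += exc; () }
}

object ValidationDemo {
  // Illustrative schema: a root element containing one integer-valued child.
  val xsd: String =
    """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
      |  <xs:element name="root">
      |    <xs:complexType>
      |      <xs:sequence>
      |        <xs:element name="child" type="xs:int"/>
      |      </xs:sequence>
      |    </xs:complexType>
      |  </xs:element>
      |</xs:schema>""".stripMargin

  // Parses the given XML with schema validation switched on and returns
  // the number of problems reported to the error handler.
  def problemCount(xml: String): Int = {
    val schema = SchemaFactory
      .newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI)
      .newSchema(new StreamSource(new StringReader(xsd)))
    val dbf = DocumentBuilderFactory.newInstance()
    dbf.setNamespaceAware(true)
    dbf.setSchema(schema)
    val handler = new CollectingErrorHandler
    val db = dbf.newDocumentBuilder()
    db.setErrorHandler(handler)
    db.parse(new InputSource(new StringReader(xml)))
    handler.problems.size
  }
}
```

A handler like this should not be shared between concurrent parses, since it accumulates state; creating a fresh one inside the DocumentBuilder creation function is one way to avoid that.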

If more flexibility is needed in configuring the DocumentParser than this class offers, consider writing a wrapper DocumentParser that wraps a DocumentParserUsingDom but adapts the parse method.

A DocumentParserUsingDom instance can be re-used multiple times from the same thread. If the DocumentBuilderFactory is thread-safe, the instance can even be re-used from multiple threads. Typically, however, a DocumentBuilderFactory cannot be trusted to be thread-safe. In a web application, one (safe) way to deal with that is to use one DocumentBuilderFactory instance per request.
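Where creating a factory per request is too costly, one alternative is to confine each factory to a single thread. The following is a minimal sketch of that idea using a ThreadLocal and plain JAXP; the object name PerThreadFactories is hypothetical, and this is a swapped-in technique rather than something this class provides:

```scala
import javax.xml.parsers.DocumentBuilderFactory

// Hypothetical helper: hands out one DocumentBuilderFactory per thread, so a
// factory instance is never touched by two threads at once.
object PerThreadFactories {
  private val factories = new ThreadLocal[DocumentBuilderFactory] {
    // Called lazily, once per thread, on the first call to get()
    override def initialValue(): DocumentBuilderFactory = {
      val dbf = DocumentBuilderFactory.newInstance()
      dbf.setNamespaceAware(true)
      dbf
    }
  }

  // Returns the factory that belongs to the calling thread.
  def current(): DocumentBuilderFactory = factories.get()
}
```

A parser built from PerThreadFactories.current() (for example with DocumentParserUsingDom.newInstance, as shown earlier) would then have to stay on the thread that created it.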

Linear Supertypes

AbstractDocumentParser, DocumentParser, AnyRef, Any

Instance Constructors

  1. new DocumentParserUsingDom(docBuilderFactory: DocumentBuilderFactory, docBuilderCreator: (DocumentBuilderFactory) ⇒ DocumentBuilder, converterToDocument: ConverterToDocument[Document])


Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  5. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. val converterToDocument: ConverterToDocument[Document]

  7. val docBuilderCreator: (DocumentBuilderFactory) ⇒ DocumentBuilder

  8. val docBuilderFactory: DocumentBuilderFactory

  9. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  10. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  11. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  12. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  13. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  14. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  15. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  16. final def notify(): Unit

    Definition Classes
    AnyRef
  17. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  18. def parse(inputSource: InputSource): Document

    Parses the input source into a yaidom Document. Closes the input stream or reader afterwards.

    Definition Classes
    DocumentParserUsingDom → DocumentParser
  19. final def parse(file: File): Document

    Parses the content of the given File into a eu.cdevreeze.yaidom.simple.Document.

    Definition Classes
    AbstractDocumentParser → DocumentParser
  20. final def parse(uri: URI): Document

    Parses the content of the given URI into a eu.cdevreeze.yaidom.simple.Document.

    Definition Classes
    AbstractDocumentParser → DocumentParser
  21. final def parse(inputStream: InputStream): Document

    Parses the content of the given input stream into a eu.cdevreeze.yaidom.simple.Document.

    Definition Classes
    AbstractDocumentParser → DocumentParser
  22. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  23. def toString(): String

    Definition Classes
    AnyRef → Any
  24. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  26. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  27. def withConverterToDocument(newConverterToDocument: ConverterToDocument[Document]): DocumentParserUsingDom

    Returns an adapted copy having the passed ConverterToDocument. This method makes it possible to use an adapted converter, which may be needed depending on the JAXP implementation used.
