Class/Object

net.ruippeixotog.scalascraper.browser

JsoupBrowser

Related Docs: object JsoupBrowser | package browser

Permalink

class JsoupBrowser extends Browser

A Browser implementation based on jsoup, a Java HTML parser library. JsoupBrowser provides powerful and efficient document querying, but it doesn't run JavaScript in the pages. As such, it is limited to working strictly with the HTML send in the page source.

Currently, JsoupBrowser does not keep separate cookie stores for different domains and paths. In each request all cookies set previously will be sent, regardless of the domain they were set on. If you do requests to different domains and do not want this behavior, use different JsoupBrowser instances.

As the documents parsed by JsoupBrowser instances are not changed after loading, Document and Element instances obtained from them are guaranteed to be immutable.

Linear Supertypes
Browser, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. JsoupBrowser
  2. Browser
  3. AnyRef
  4. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new JsoupBrowser(userAgent: String = "jsoup/1.8")

    Permalink

    userAgent

    the user agent with which requests should be made

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. def clearCookies(): Unit

    Permalink
    Definition Classes
    JsoupBrowserBrowser
  6. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. def cookies(url: String): Map[String, String]

    Permalink
    Definition Classes
    JsoupBrowserBrowser
  8. def defaultRequestSettings(conn: Connection): Connection

    Permalink
    Attributes
    protected[this]
  9. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  10. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  11. def executeRequest(conn: Connection): Response

    Permalink
    Attributes
    protected[this]
  12. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  13. def get(url: String): Document

    Permalink
    Definition Classes
    JsoupBrowserBrowser
  14. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  15. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  16. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  17. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  18. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  19. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  20. def parseFile(file: File, charset: String): Document

    Permalink
    Definition Classes
    JsoupBrowserBrowser
  21. def parseFile(path: String): Document

    Permalink
    Definition Classes
    Browser
  22. def parseFile(path: String, charset: String): Document

    Permalink
    Definition Classes
    Browser
  23. def parseFile(file: File): Document

    Permalink
    Definition Classes
    Browser
  24. def parseInputStream(inputStream: InputStream, charset: String): Document

    Permalink

    Closes the InputStream after reading

    Closes the InputStream after reading

    Definition Classes
    JsoupBrowserBrowser
  25. def parseResource(name: String, charset: String = "UTF-8"): Document

    Permalink
    Definition Classes
    Browser
  26. def parseString(html: String): Document

    Permalink
    Definition Classes
    JsoupBrowserBrowser
  27. def post(url: String, form: Map[String, String]): Document

    Permalink
    Definition Classes
    JsoupBrowserBrowser
  28. def processResponse(res: Response): Document

    Permalink
    Attributes
    protected[this]
  29. def requestSettings(conn: Connection): Connection

    Permalink
  30. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  31. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  32. val userAgent: String

    Permalink

    the user agent with which requests should be made

    the user agent with which requests should be made

    Definition Classes
    JsoupBrowserBrowser
  33. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  34. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  35. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from Browser

Inherited from AnyRef

Inherited from Any

Ungrouped