Skip to content

Releases: fimad/scalpel

0.6.2.1

23 Oct 21:35
Compare
Choose a tag to compare
  • Match Content-Type case-insensitively.

0.6.2

18 Apr 17:09
Compare
Choose a tag to compare
  • Add the monad transformer ScraperT.

0.6.1

19 Jan 06:14
Compare
Choose a tag to compare
  • Support GHC 8.8.

0.6.0

18 Feb 05:01
Compare
Choose a tag to compare

Breaking Changes

  • anySelector now captures text nodes. This causes different results when used
    with a plural scraper (e.g. chroots). Usage with a singular scraper (e.g.
    chroot) should be unaffected.
  • The dependency on curl has been replaced with http-client and
    http-client-tls. This has the following observable changes.
    • scrapeURLWithOpts is removed.
    • The Config type used with scrapeURLWithConfig no longer contains a list
      of curl options. Instead it now takes a Maybe Manager from http-client.
    • The Decoder function type now takes in a Response type from
      http-client.
    • scrapeURL will now throw an exception if there is a problem connecting to
      a URL.

Other Changes

  • Remove Ord constraint from public APIs.
  • Add atDepth operator which allows for selecting nodes at a specified depth
    in relation to another node (#21).
  • Fix issue selecting malformed HTML where "a" // "c" would not match
    <a><b><c></c></a></b>.
  • Add textSelector for selecting text nodes.
  • Add SerialScraper type and associated primitives (#48).

0.5.1

25 Apr 03:17
Compare
Choose a tag to compare
  • Fix bug (#59, #54) in DFS traversal order.

0.5.0

05 Feb 04:50
Compare
Choose a tag to compare
  • Split scalpel into two packages: scalpel and scalpel-core. The latter
    does not provide networking support and does not depend on curl.

0.4.1

19 Jan 02:28
Compare
Choose a tag to compare
  • Added notP attribute predicate.

0.4.0

17 Oct 00:46
Compare
Choose a tag to compare
  • Add the chroot tricks (#23 and #25) to README.md and added examples.
    • Fix backtracking that occurs when using guard and chroot.
    • Fix bug where the same tag may appear in the result set multiple times.
  • Performance optimizations when using the (//) operator.
  • Make Scraper an instance of MonadFail. Practically this means that failed
    pattern matches in <- expressions within a do block will evaluate to mzero
    instead of throwing an error and bringing down the entire script.
  • Pluralized scrapers will now return the empty list instead mzero when there
    are no matches.
  • Add the position scraper which provides the index of the current sub-tree
    within the context of a chroots's do-block.

0.3.1

28 May 21:56
Compare
Choose a tag to compare
  • Added the innerHTML and innerHTMLs scraper.
  • Added the match function which allows for the creation of arbitrary
    attribute predicates.
  • Fixed build breakage with GHC 8.0.1.

0.3.0.1

31 Jan 23:25
Compare
Choose a tag to compare
  • Make tag and attribute matching case-insensitive.