An Elixir library for parsing and extracting data from HTML and XML with CSS or XPath selectors.
MIT License
Published by mischov over 6 years ago
Document
functions now raise if an unknown node_id
is provided, when before they might have raised or might have handle the situation gracefullyget_root_ids/1
, get_node_ids/1
, and fetch_node/2
to Document
Document.delete_node/2
, courtesy of @willbarrettmix format
projectPublished by mischov over 6 years ago
meeseeks_html5ever v0.8.1
, which supports OTP 20.2Published by mischov about 7 years ago
meeseeks_html5ever v0.8.0
, which removes panics related to calling mark_script_already_started
and get_template_contents
, and removes synchronous parsing, which did not correctly handle panics and broke the <1ms contract on first callPublished by mischov about 7 years ago
meeseeks_html5ever v0.7.0
, which fixes an erroneous panic related to calling remove_from_parent
on a node with no parentPublished by mischov about 7 years ago
Published by mischov about 7 years ago
Published by mischov over 7 years ago
Document.html/1
and Document.tree/1
extractorsMeeseeks.html/1
and Meeseeks.tree/1
to accept a Document
Published by mischov over 7 years ago
meeseeks_html5ever v0.6.1
, which supports OTP 20Published by mischov over 7 years ago
meeseeks_html5ever v0.6.0
, which supports parsing XMLMeeseeks.parse/2
which takes either :html
or :xml
as the second argument to specify how the source gets parsedMeeseeks.data/1
to handle CDATA when parsing HTMLPublished by mischov over 7 years ago
Context.new/1
to Context.prepare_for_selection/1
Context.with_accumulator/2
to Context.add_accumulator/2
meeseeks_html5ever v0.5.0
Document.ProcessingInstruction
nodes from tuple-treesprocessing-instruction
functionality in Meeseeks.XPath
(when possible)Document.ProcessingInstruction
node typeSelect.select/3
and Meeseeks.select/3
Context.ensure_accumulator!/1
Published by mischov over 7 years ago
Selector.match?/3
callback to Selector.match/4
, which now takes a context and can return a {boolean, context}
tuple in addition to returning a boolean.Meeseeks.XPath
)Selector.filters/1
callback to the Selector
behaviour and update selection to allow for filtering matches before proceedingMeeseeks.Context
to allow selectors and the selection process to store stateMeeseeks.Accumulator
behaviour and update Accumulator.{All, One}
to use itNode
and Root
selectorsAncestors
, AncestorsOrSelf
, Children
, Descendants
, DescendantsOrSelf
, NextSiblings
, Parent
, PreviousSiblings
, and Self
selector combinatorsparent
, ancestors
, and previous_siblings
queries to Document
meeseeks_html5ever v0.4.6
, which correctly parses namespaced elements and doesn't try to bring in html5ever 0.16.0
html
extractor to add namespaces to elementsDocument
and Result
opaque inspected valuesPublished by mischov over 7 years ago
meeseeks_html5ever v0.4.4
, which permits Elixir 1.3Published by mischov over 7 years ago
html5ever_elixir
with meeseeks_html5ever
:not()
to accept multiple selectorsDocument.new/1
to Parser.parse_tuple_tree/1
Document.get_nodes/1
and Accumulator.return/1
(for Accumulator.All
)Published by mischov over 7 years ago
dataset
extractor that mimics the HTMLElement.dataset APIPublished by mischov over 7 years ago
Published by mischov over 7 years ago
Published by mischov over 7 years ago
Published by mischov over 7 years ago