• Stars
    star
    548
  • Rank 81,119 (Top 2 %)
  • Language
    TypeScript
  • License
    BSD 2-Clause "Sim...
  • Created almost 13 years ago
  • Updated 2 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

a CSS selector compiler & engine

css-select NPM version Node.js CI Downloads Coverage

A CSS selector compiler and engine

What?

As a compiler, css-select turns CSS selectors into functions that tests if elements match them.

As an engine, css-select looks through a DOM tree, searching for elements. Elements are tested "from the top", similar to how browsers execute CSS selectors.

In its default configuration, css-select queries the DOM structure of the domhandler module (also known as htmlparser2 DOM). To query alternative DOM structures, see Options below.

Features:

  • ๐Ÿ”ฌ Full implementation of CSS3 selectors, as well as most CSS4 selectors
  • ๐Ÿงช Partial implementation of jQuery/Sizzle extensions (see cheerio-select for the remaining selectors)
  • ๐Ÿง‘โ€๐Ÿ”ฌ High test coverage, including the full test suites from Sizzle, Qwery and NWMatcher and .
  • ๐Ÿฅผ Reliably great performance

Why?

Most CSS engines written in JavaScript execute selectors left-to-right. That means thet execute every component of the selector in order, from left to right. As an example: For the selector a b, these engines will first query for a elements, then search these for b elements. (That's the approach of eg. Sizzle, Qwery and NWMatcher.)

While this works, it has some downsides: Children of as will be checked multiple times; first, to check if they are also as, then, for every superior a once, if they are bs. Using Big O notation, that would be O(n^(k+1)), where k is the number of descendant selectors (that's the space in the example above).

The far more efficient approach is to first look for b elements, then check if they have superior a elements: Using big O notation again, that would be O(n). That's called right-to-left execution.

And that's what css-select does โ€“ and why it's quite performant.

How does it work?

By building a stack of functions.

Wait, what?

Okay, so let's suppose we want to compile the selector a b, for right-to-left execution. We start by parsing the selector. This turns the selector into an array of the building blocks. That's what the css-what module is for, if you want to have a look.

Anyway, after parsing, we end up with an array like this one:

[
    { type: "tag", name: "a" },
    { type: "descendant" },
    { type: "tag", name: "b" },
];

(Actually, this array is wrapped in another array, but that's another story, involving commas in selectors.)

Now that we know the meaning of every part of the selector, we can compile it. That is where things become interesting.

The basic idea is to turn every part of the selector into a function, which takes an element as its only argument. The function checks whether a passed element matches its part of the selector: If it does, the element is passed to the next function representing the next part of the selector. That function does the same. If an element is accepted by all parts of the selector, it matches the selector and double rainbow ALL THE WAY.

As said before, we want to do right-to-left execution with all the big O improvements. That means elements are passed from the rightmost part of the selector (b in our example) to the leftmost (which would be c of course a).

For traversals, such as the descendant operating the space between a and b, we walk up the DOM tree, starting from the element passed as argument.

//TODO: More in-depth description. Implementation details. Build a spaceship.

API

const CSSselect = require("css-select");

Note: css-select throws errors when invalid selectors are passed to it. This is done to aid with writing css selectors, but can be unexpected when processing arbitrary strings.

CSSselect.selectAll(query, elems, options)

Queries elems, returns an array containing all matches.

  • query can be either a CSS selector or a function.
  • elems can be either an array of elements, or a single element. If it is an element, its children will be queried.
  • options is described below.

Aliases: default export, CSSselect.iterate(query, elems).

CSSselect.compile(query, options)

Compiles the query, returns a function.

CSSselect.is(elem, query, options)

Tests whether or not an element is matched by query. query can be either a CSS selector or a function.

CSSselect.selectOne(query, elems, options)

Arguments are the same as for CSSselect.selectAll(query, elems). Only returns the first match, or null if there was no match.

Options

All options are optional.

  • xmlMode: When enabled, tag names will be case-sensitive. Default: false.
  • rootFunc: The last function in the stack, will be called with the last element that's looked at.
  • adapter: The adapter to use when interacting with the backing DOM structure. By default it uses the domutils module.
  • context: The context of the current query. Used to limit the scope of searches. Can be matched directly using the :scope pseudo-class.
  • relativeSelector: By default, selectors are relative to the context, which means that no parent elements of the context will be matched. (Eg. a b c with context b will never give any results.) If relativeSelector is set to false, selectors won't be absolutized and selectors can test for parent elements outside of the context.
  • cacheResults: Allow css-select to cache results for some selectors, sometimes greatly improving querying performance. Disable this if your document can change in between queries with the same compiled selector. Default: true.
  • pseudos: A map of pseudo-class names to functions or strings.

Custom Adapters

A custom adapter must match the interface described here.

You may want to have a look at domutils to see the default implementation, or at css-select-browser-adapter for an implementation backed by the DOM.

Supported selectors

As defined by CSS 4 and / or jQuery.

  • Type (<tagname>): Selects elements by their tag name.
  • Descendant ( ): Selects elements that are descendants of the specified element.
  • Child (>): Selects elements that are direct children of the specified element.
  • Parent (<): Selects elements that are direct parents of the specified element. This follows an old proposal that has been made obsolete by the :has() pseudo-class.
  • Adjacent sibling (+): Selects elements that are the next sibling of the specified element.
  • General sibling (~): Selects elements that are siblings of the specified element.
  • Attribute ([attr=foo]), with supported comparisons:
    • [attr] (existential): Selects elements with the specified attribute, whatever its value.
    • =: Selects elements with the specified attribute and value.
    • ~=: Selects elements with the specified attribute and value, separated by spaces.
    • |=: Selects elements with the specified attribute and value, separated by hyphens.
    • *=: Selects elements with the specified attribute and value, anywhere in the attribute value.
    • ^=: Selects elements with the specified attribute and value, beginning at the beginning of the attribute value.
    • $=: Selects elements with the specified attribute and value, ending at the end of the attribute value.
    • !=: Selects elements with the specified attribute and value, not equal to the specified value.
    • i and s can be added after the comparison to make the comparison case-insensitive or case-sensitive (eg. [attr=foo i]). If neither is supplied, css-select will follow the HTML spec's case-sensitivity rules.
  • Selector lists (,): Selects elements that match any of the specified selectors.
  • Universal (*): Selects all elements.
  • Pseudos:
    • :not: Selects elements that do not match the specified selector.
    • :contains: Selects elements that contain the specified text.
    • :icontains: Selects elements that contain the specified text, case-insensitively.
    • :has: Selects elements that have descendants that match the specified selector.
    • :root: Selects the root element.
    • :empty: Selects elements that have no children.
    • :first-child: Selects elements that are the first element child of their parent.
    • :last-child: Selects elements that are the last element child of their parent.
    • :first-of-type: Selects elements that are the first element of their type.
    • :last-of-type: Selects elements that are the last element of their type.
    • :only-of-type: Selects elements that are the only element of their type.
    • :only-child: Selects elements that are the only element child of their parent.
    • :nth-child: Selects elements that are the nth element child of their parent.
    • :nth-last-child: Selects elements that are the nth element child of their parent, counting from the last child.
    • :nth-of-type: Selects elements that are the nth element of their type.
    • :nth-last-of-type: Selects elements that are the nth element of their type, counting from the last child.
    • :any-link: Selects elements that are links.
    • :link: Selects elements that are links and have not been visited.
    • :visited, :hover, :active (these depend on optional Adapter methods, so these will only match elements if implemented in Adapter)
    • :checked: Selects input elements that are checked, or option elements that are selected.
    • :disabled: Selects input elements that are disabled.
    • :enabled: Selects input elements that are not disabled.
    • :required: Selects input elements that are required.
    • :optional: Selects input elements that are not required.
    • jQuery extensions:
      • :parent: Selects elements that have at least one child.
      • :header: Selects header elements.
      • :selected: Selects option elements that are selected.
      • :button: Selects button elements, and input elements of type button.
      • :input: Selects input, textarea, select, and button elements.
      • :text: Selects input elements of type text.
      • :checkbox: Selects input elements of type checkbox.
      • :file: Selects input elements of type file.
      • :password: Selects input elements of type password.
      • :reset: Selects input elements of type reset.
      • :radio: Selects input elements of type radio.
    • :is, as well as the aliases :where, and the legacy alias :matches: Selects elements that match any of the given selectors.
    • :scope: Selects elements that are part of the scope of the current selector. This uses the context from the passed options.

License: BSD-2-Clause

Security contact information

To report a security vulnerability, please use the Tidelift security contact. Tidelift will coordinate the fix and disclosure.

css-select for enterprise

Available as part of the Tidelift Subscription

The maintainers of css-select and thousands of other packages are working with Tidelift to deliver commercial support and maintenance for the open source dependencies you use to build your applications. Save time, reduce risk, and improve code health, while paying the maintainers of the exact dependencies you use. Learn more.

More Repositories

1

htmlparser2

The fast & forgiving HTML and XML parser
TypeScript
4,057
star
2

domhandler

Handler for htmlparser2, to get a DOM
TypeScript
311
star
3

entities

Encode & decode HTML & XML entities with ease & speed
TypeScript
293
star
4

readabilitySAX

a fast and platform independent readability port (JS)
HTML
237
star
5

css-what

a CSS selector parser
TypeScript
217
star
6

domutils

Utilities for working with htmlparser2's DOM
TypeScript
180
star
7

bitfield

A bitfield implementation using buffers, compliant with the BitTorrent spec.
TypeScript
80
star
8

nth-check

Parses and compiles CSS nth-checks to highly optimized functions.
TypeScript
52
star
9

cornet

transform streaming html using css selectors
JavaScript
44
star
10

domelementtype

all the types of nodes in htmlparser2's dom
TypeScript
27
star
11

high5

html 5 tokenizer
JavaScript
24
star
12

boolbase

two functions: one that returns true, one that returns false
JavaScript
14
star
13

inline

inline all images, stylesheets and scripts of a webpage
JavaScript
11
star
14

binopsy

Reimplementation of binary-parser supporting serialization and streaming
JavaScript
10
star
15

SimpleQueue

A simple FIFO queue
TypeScript
6
star
16

node-minreq

minimalistic request library for node
JavaScript
6
star
17

webshelf

my node knockout 2012 project
JavaScript
2
star
18

minschema

a (html form) schema builder & validator
JavaScript
1
star
19

node-fsi-dropbox

DEPRECATED
JavaScript
1
star
20

encoding-sniffer

HTML encoding sniffer, with stream support
TypeScript
1
star
21

fb55

Config files for my GitHub profile.
1
star
22

fb55.github.io

HTML
1
star
23

YQL-Tables-for-Google-Data-API

Tables to authenticate at and use the Google Data API
JavaScript
1
star
24

ReadableFeeds

runs readabilitySAX on feeds
JavaScript
1
star
25

level-insert

insert documents into a db with autoincrementing keys
JavaScript
1
star
26

funexp

UNFINISHED a functional regular expression library
JavaScript
1
star