minimalcss
A Node library to extract the minimal CSS used in a set of URLs with puppeteer.
Used to find what minimal CSS is needed to render on first load, even with
document.onload
executed.
This minimal CSS is also known as critical path CSS and ultimately a web performance technique to make web pages load faster at initial load.
What does it do
You supply a list of URLs that it opens (one at a time) and for each page
it downloads all external CSS files (e.g.
<link rel="stylesheet" href="bootstrap.min.css">
) and uses the DOM and
document.querySelector
to investigate which selectors, in the CSS, are
actually in the DOM. That minimal payload of CSS is all you need to load
the URLs styled without having to make it block on CSS.
Under the hood it relies on the excellent puppeteer library which uses the Headless Chome Node API. This means it runs (at the time of writing) Chrome 62 and this library is maintained by the Google Chrome team.
The CSS to analyze (and hopefully minimize) is downloaded automatically just
like a browser opens and downloads CSS as mentioned in the DOM as <link>
tags.
The CSS is parsed by CSSTree and the minification and compression is done with CSSO. An AST of each CSS payload is sent into the Headless Chrome page evaluation together with a callback that compares with the DOM and then each minimal CSS payload is concatenated into one big string which then CSSO compresses into one "merged" and minified CSS payload.
Usage
Install:
yarn add minimalcss --dev
You can install it globally if you like:
yarn global add minimalcss
npm install [--save-dev|--global] minimalcss
Now you can run it:
./node_modules/.bin/minimalcss https://example.com/ https://example.com/aboutus > minimal.min.css
Prior art
minimalcss
isn't the first library to perform this task. What's unique and
special about minimalcss
is that it uses the Chrome Headless browser.
-
penthouse - uses puppeteer (since version 1.0) and CSSTree. Supports only 1 URL at a time and can't you have to first save the CSS files it should process.
-
critical - uses
penthouse
(see above) with its "flaws" meaning you can only do 1 URL (or HTML string) and you have to prepare the CSS files too. -
UnCSS - uses jsdom to render and execute JavaScript. Supports supplying multiple URLs but still requires to manually supply the CSS files to process.
-
mincss - Python project that uses lxml.html to analyze the HTML statically (by doing a
GET
of the URL as if done by a server). I.e. it can't load the HTML as a real browser would and thus does not support a DOM with possible JavaScript mutations on load. It can optionally usePhantomJS
to extract the HTML.
Killer features
-
You don't need to specify where the CSS is. It gets downloaded and parsed automatically.
-
It uses puppeteer and CSSTree which are both high quality projects that are solid and well tested.
-
The CSS selectors downloaded is compared to the DOM before and after JavaScript code has changed the DOM. That means you can extract the critical CSS needed to display properly before the JavaScript has kicked in.
-
Ability to analyze the remaining CSS selectors to see which keyframe animations that they use and use this to delete keyframe definitions that are no longer needed.
-
You can specify a viewport, which might cause the page to render slightly different. It does not create the minimal CSS only on DOM that is visible though.
-
If the CSS contains
@font-face { ... }
rules whose name is never used in any remaining CSS selector, the wholefont-face
block is removed.
Help needed
Let's make this a thriving community project!
Help needed with features, tooling, and much testing in real web performance optimization work.
API
const minimalcss = require('minimalcss');
minimalcss.version
Get version Just prints out the current version.
minimalcss.run(options)
Run a minimization Returns a promise. The promise returns an object containing, amongst other things, the minified minimal CSS as a string. For example:
minimalcss
.minimize({ url: 'https://example.com/' })
.then(result => {
console.log('OUTPUT', result.finalCss.length, result.finalCss);
})
.catch(error => {
console.error(`Failed the minimize CSS: ${error}`);
});
That result
object that is returned by the minimize
function contains:
finalCss
- the minified minimal CSS as a string.stylesheetContents
- an object of stylesheet URLs as keys and their content as text.
Optionally, you can supply a list of URLs like this:
minimalcss
.minimize({ urls: ['https://example.com/page1', 'https://example.com/page2'] })
...
and minimalcss
will try to merge the minimal critical CSS across all pages.
But we aware that this can be "dangerous" because of the inherit order of CSS.
API Options
Calling minimalcss.run(options)
takes an object whose only mandatory
key is urls
or url
. Other optional options are:
debug
- all console logging during page rendering are included in the stdout. Also, any malformed selector that cause errors indocument.querySelector
will be raised as new errors.skippable
- function which takes request as an argument and returns boolean. If it returns true then given request will be aborted (skipped). Can be used to block requests to Google Analytics etc.loadimages
- If set totrue
, images will actually load.withoutjavascript
- If set tofalse
it will skip loading the page first without JavaScript. By defaultminimalcss
will evaluate the DOM as plain as can be, and then with JavaScript enabled and waiting for network activity to be idle.disableJavaScript
- By default JavaScript is enabled. If set totrue
it will ignorewithoutjavascript
option and loading the page only one time without JavaScript.browser
- Instance of a Browser, which will be used instead of launching another one.userAgent
- specific user agent to use (string)viewport
- viewport object as specified in page.setViewportpuppeteerArgs
- Args sent to puppeteer when launching. List of strings for headless Chrome.cssoOptions
- CSSO compress function optionstimeout
- Maximum navigation time in milliseconds, defaults to 30 seconds, pass 0 to disable timeout.ignoreCSSErrors
- By default, any CSS parsing error throws an error inminimalcss
. If you know it's safe to ignore (for example, third-party CSS resources), set this totrue
.ignoreJSErrors
- By default, any JavaScript error encountered by puppeteerignoreRequestErrors
- When CSS files return 404 or another request errorminimalcss
will ignore this instead of throwing an error. will be thrown byminimalcss
. If you know it's safe to ignore errors (for example, on third-party webpages), set this totrue
.styletags
- If set totrue
, on-page<style>
tags are parsed along with external stylesheets. By default, only external stylesheets are parsed.enableServiceWorkers
- By default all Service Workers are disabled. This option enables them as is.whitelist
- Array of css selectors that should be left in final CSS. RegExp patterns are supported (e.g.['sidebar', icon-.*, .*-error]
).
Warnings
Google Fonts
Suppose you have this in your HTML:
<link href="https://fonts.googleapis.com/css?family=Lato" rel="stylesheet">
then, minimalcss
will consider this an external CSS stylesheet, load it
and include it in the minimal CSS.
The problem is that Google Fonts will respond to that URL dynamically based
on the user agent. In other words a different CSS payload depending on who's
asking. So, the user agent when minimalcss
runs will be whatever
puppeteer
uses and it might not be the best CSS for other user agents.
So to avoid this predicament use the skippable
option. On the command line
you can do that like this:
./node_modules/.bin/minimalcss --skip fonts.googleapis.com https://example.com
With the API, you can do it like this:
minimalcss
.minimize({
url: 'https://example.com',
skippable: request => {
return !!request.url().match('fonts.googleapis.com');
}
})
.then(result => {
...
});
Multiple URLs
minimalcss
can accept multiple URLs when figuring out the minimal CSS for
all those URLs, combined. But be careful, this can be dangerous. If
you have one URL with this HTML:
<head>
<link rel="stylesheet" href="base.css">
<link rel="stylesheet" href="specific.css">
</head>
and another URL with...:
<head>
<link rel="stylesheet" href="base.css">
</head>
When combining these, it will optimize the CSS in this order:
base.css
specific.css
base.css
But if specific.css
was meant to override something in base.css
in the
first URL, that might get undone when base.css
becomes the last CSS
to include.
See this issue for another good example why running minimalcss
across multiple URLs.
cheerio
About When minimalcss
evaluates each CSS selector to decide whether to keep it
or not, some selectors might not be parseable. Possibly, the CSS employs
hacks for specific browsers that
cheerio doesn't support. Or
there might be CSS selectors that no browser or tool can understand
(e.g a typo by the CSS author). If there's a problem parsing a CSS selector,
the default is to swallow the exception and let the CSS selector stay.
Also by default, all these warnings are hidden. To see them use the --debug
flag (or debug
API option). Then the CSS selector syntax errors are
printed on stderr
.
@font-face
About minimalcss
will remove any @font-face
rules whose name is not mentioned
in any of the CSS selectors. But be aware that you might have a
@font-face { font-family: MyName; }
in some /static/foo.css
but separately
you might have an inline style sheet that looks like this:
<style type="text/css">
div.something { font-family: MyName; }
</style>
In this case the @font-face { font-family: MyName; }
would be removed even
though it's mentioned from somewhere else.
About Blobs
If your document uses Blob
to create injectable stylesheets into the DOM,
minimalcss
will not be able to optimize that. It will be not be
included in the final CSS.
Development
First thing to get started, once you've cloned the repo is to install all the dependencies:
yarn
Testing
Testing is done with jest
. At the
beginning of every test, a static file server is started on localhost
and a puppeteer
browser instance is created for every test.
To run the tests:
yarn jest
Best way to get into writing tests is to look at existing tests and copy.
Prettier
All code is expected to conform with Prettier according
to the the .prettierrc
file in the root of the project.
To check that all your code conforms, run:
yarn lintcheck
Use without a server
This blog post
demonstrates technique to use minimalcss
when you don't yet have a server.
Using the http-server
package you can start a server right before you run
and shut down as soon as you're done.
License
Copyright (c) 2017-2020 Peter Bengtsson. See the LICENSE file for license rights and limitations (MIT).