  • Stars: 294
  • Rank: 136,022 (Top 3 %)
  • Language: Ruby
  • License: MIT License
  • Created over 10 years ago
  • Updated 2 months ago

Rails HTML Sanitizers

This gem is responsible for sanitizing HTML fragments in Rails applications. Specifically, this is the set of sanitizers used to implement the Action View SanitizerHelper methods sanitize, sanitize_css, strip_tags and strip_links.

Rails HTML Sanitizer is only intended to be used with Rails applications. If you need similar functionality but aren't using Rails, consider using the underlying sanitization library Loofah directly.

Usage

Sanitizers

All sanitizers respond to sanitize, and are available in variants that use either HTML4 or HTML5 parsing, under the Rails::HTML4 and Rails::HTML5 namespaces, respectively.

NOTE: The HTML5 sanitizers are not supported on JRuby. Users may programmatically check for support by calling Rails::HTML::Sanitizer.html5_support?.

FullSanitizer

full_sanitizer = Rails::HTML5::FullSanitizer.new
full_sanitizer.sanitize("<b>Bold</b> no more!  <a href='more.html'>See more here</a>...")
# => Bold no more!  See more here...

or, if you insist on parsing the content as HTML4:

full_sanitizer = Rails::HTML4::FullSanitizer.new
full_sanitizer.sanitize("<b>Bold</b> no more!  <a href='more.html'>See more here</a>...")
# => Bold no more!  See more here...

LinkSanitizer

link_sanitizer = Rails::HTML5::LinkSanitizer.new
link_sanitizer.sanitize('<a href="example.com">Only the link text will be kept.</a>')
# => Only the link text will be kept.

or, if you insist on parsing the content as HTML4:

link_sanitizer = Rails::HTML4::LinkSanitizer.new
link_sanitizer.sanitize('<a href="example.com">Only the link text will be kept.</a>')
# => Only the link text will be kept.

SafeListSanitizer

This sanitizer is also available as an HTML4 variant, but for simplicity we'll document only the HTML5 variant below.

safe_list_sanitizer = Rails::HTML5::SafeListSanitizer.new

# sanitize via an extensive safe list of allowed elements
safe_list_sanitizer.sanitize(@article.body)

# sanitize only the supplied tags and attributes
safe_list_sanitizer.sanitize(@article.body, tags: %w(table tr td), attributes: %w(id class style))

# sanitize via a custom scrubber
safe_list_sanitizer.sanitize(@article.body, scrubber: ArticleScrubber.new)

# prune nodes from the tree instead of stripping tags and leaving inner content
safe_list_sanitizer = Rails::HTML5::SafeListSanitizer.new(prune: true)

# the sanitizer can also sanitize css
safe_list_sanitizer.sanitize_css('background-color: #000;')

Scrubbers

Scrubbers are objects responsible for removing nodes or attributes you don't want in your HTML document.

This gem includes two scrubbers: Rails::HTML::PermitScrubber and Rails::HTML::TargetScrubber.

Rails::HTML::PermitScrubber

This scrubber allows you to permit only the tags and attributes you want.

scrubber = Rails::HTML::PermitScrubber.new
scrubber.tags = ['a']

html_fragment = Loofah.fragment('<a><img/ ></a>')
html_fragment.scrub!(scrubber)
html_fragment.to_s # => "<a></a>"

By default, inner content is left, but it can be removed as well.

scrubber = Rails::HTML::PermitScrubber.new
scrubber.tags = ['a']

html_fragment = Loofah.fragment('<a><span>text</span></a>')
html_fragment.scrub!(scrubber)
html_fragment.to_s # => "<a>text</a>"

scrubber = Rails::HTML::PermitScrubber.new(prune: true)
scrubber.tags = ['a']

html_fragment = Loofah.fragment('<a><span>text</span></a>')
html_fragment.scrub!(scrubber)
html_fragment.to_s # => "<a></a>"

Rails::HTML::TargetScrubber

Where PermitScrubber picks out tags and attributes to permit in sanitization, Rails::HTML::TargetScrubber targets them for removal. See https://github.com/flavorjones/loofah/blob/main/lib/loofah/html5/safelist.rb for the tag list.

Note: by default, it scrubs anything that Loofah's HTML5::Scrub.allowed_element? does not permit.

scrubber = Rails::HTML::TargetScrubber.new
scrubber.tags = ['img']

html_fragment = Loofah.fragment('<a><img/ ></a>')
html_fragment.scrub!(scrubber)
html_fragment.to_s # => "<a></a>"

Similarly to PermitScrubber, nodes can be fully pruned.

scrubber = Rails::HTML::TargetScrubber.new
scrubber.tags = ['span']

html_fragment = Loofah.fragment('<a><span>text</span></a>')
html_fragment.scrub!(scrubber)
html_fragment.to_s # => "<a>text</a>"

scrubber = Rails::HTML::TargetScrubber.new(prune: true)
scrubber.tags = ['span']

html_fragment = Loofah.fragment('<a><span>text</span></a>')
html_fragment.scrub!(scrubber)
html_fragment.to_s # => "<a></a>"

Custom Scrubbers

You can also create custom scrubbers in your application if you want to.

class CommentScrubber < Rails::HTML::PermitScrubber
  def initialize
    super
    self.tags = %w( form script comment blockquote )
    self.attributes = %w( style )
  end

  def skip_node?(node)
    node.text?
  end
end

See Rails::HTML::PermitScrubber documentation to learn more about which methods can be overridden.

Custom Scrubber in a Rails app

Using the CommentScrubber from above, you can use this in a Rails view like so:

<%= sanitize @comment, scrubber: CommentScrubber.new %>

A note on HTML entities

Rails HTML sanitizers are intended to be used by the view layer, at page-render time. They are not intended to sanitize persisted strings that will be sanitized again at page-render time.

Proper HTML sanitization will replace some characters with HTML entities. For example, text containing a < character will be updated to contain &lt; to ensure that the markup is well-formed.

This is important to keep in mind because HTML entities will render improperly if they are sanitized twice.

A concrete example showing the problem that can arise

Imagine the user is asked to enter their employer's name, which will appear on their public profile page. Then imagine they enter JPMorgan Chase & Co..

If you sanitize this before persisting it in the database, the stored string will be JPMorgan Chase &amp; Co.

When the page is rendered, if this string is sanitized a second time by the view layer, the HTML will contain JPMorgan Chase &amp;amp; Co., which will render as "JPMorgan Chase &amp; Co.".

Another problem that can arise is rendering the sanitized string in a non-HTML context (for example, if it ends up being part of an SMS message). In this case, it may contain inappropriate HTML entities.
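
The double-escaping described above can be reproduced with nothing but the standard library. This sketch uses ERB::Util.html_escape as a stand-in for each escaping pass; that choice is an assumption made for illustration and is not the code path this gem uses.

```ruby
require "erb"

raw_input = "JPMorgan Chase & Co."

# First pass: what would be stored if the string were escaped before persisting.
escaped_once = ERB::Util.html_escape(raw_input)

# Second pass: what the page would contain if the stored string were escaped again.
escaped_twice = ERB::Util.html_escape(escaped_once)

puts escaped_once  # => JPMorgan Chase &amp; Co.
puts escaped_twice # => JPMorgan Chase &amp;amp; Co.
```

A browser decodes entities only once, so the second string displays a literal "&amp;" to the reader.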

Suggested alternatives

You might simply choose to persist the untrusted string as-is (the raw input), and then ensure that the string will be properly sanitized by the view layer.

That raw string, if rendered in a non-HTML context (like SMS), must also be sanitized by a method appropriate for that context. You may wish to look into using Loofah or Sanitize to customize how this sanitization works, including omitting HTML entities in the final string.

If you really want to sanitize the string that's stored in your database, you may wish to look into Loofah::ActiveRecord rather than use the Rails HTML sanitizers.

A note on module names

In versions < 1.6, the only module defined by this library was Rails::Html. Starting in 1.6, we define three additional modules:

  • Rails::HTML for general functionality (replacing Rails::Html)
  • Rails::HTML4 containing sanitizers that parse content as HTML4
  • Rails::HTML5 containing sanitizers that parse content as HTML5 (if supported)

The following aliases are maintained for backwards compatibility:

  • Rails::Html points to Rails::HTML
  • Rails::HTML::FullSanitizer points to Rails::HTML4::FullSanitizer
  • Rails::HTML::LinkSanitizer points to Rails::HTML4::LinkSanitizer
  • Rails::HTML::SafeListSanitizer points to Rails::HTML4::SafeListSanitizer

Installation

Add this line to your application's Gemfile:

gem 'rails-html-sanitizer'

And then execute:

$ bundle

Or install it yourself as:

$ gem install rails-html-sanitizer

Support matrix

branch   ruby support   actively maintained   security support
1.6.x    >= 2.7         yes                   yes
1.5.x    >= 2.5         no                    while Rails 6.1 is in security support
1.4.x    >= 1.8.7       no                    no

Read more

Loofah is what underlies the sanitizers and scrubbers of rails-html-sanitizer.

The node argument passed to some methods in a custom scrubber is an instance of Nokogiri::XML::Node.

Contributing to Rails HTML Sanitizers

Rails HTML Sanitizers is the work of many contributors. You're encouraged to submit pull requests, propose features and discuss issues.

See CONTRIBUTING.

Security reports

Trying to report a possible security vulnerability in this project? Please check out the Rails project's security policy for instructions.

License

Rails HTML Sanitizers is released under the MIT License.
