MPark.Patterns
This is an experimental library that has evolved to P1371, being proposed for C++23.
Introduction
MPark.Patterns is an experimental pattern matching library for C++17.
It determines whether a given value matches a pattern and, if it does, binds the desired portions of the value to a handler.
Pattern matching has been introduced to many programming languages outside of the functional world, and this library draws inspiration from languages such as Haskell, OCaml, Rust, Scala, and Swift.
#include <cstdio>
#include <mpark/patterns.hpp>
void fizzbuzz() {
using namespace mpark::patterns;
for (int i = 1; i <= 100; ++i) {
match(i % 3, i % 5)(
pattern(0, 0) = [] { std::printf("fizzbuzz\n"); },
pattern(0, _) = [] { std::printf("fizz\n"); },
pattern(_, 0) = [] { std::printf("buzz\n"); },
pattern(_, _) = [i] { std::printf("%d\n", i); });
}
}
int main() {
fizzbuzz();
}
Basic Syntax
using namespace mpark::patterns;
IDENTIFIERS(<identifier>...); // optional
match(<expr>...)(
pattern(<pattern>...) = [](<binding>...) { /* ... */ },
pattern(<pattern>...) = [](<binding>...) { /* ... */ },
// ...
);
Types of Patterns
Expression Pattern
An expression pattern matches if the value compares equal to the given expression.
Requirements
Given an expression e
and a value x
, e == x
must be a valid expression.
Syntax
<expr>
Examples
int factorial(int n) {
using namespace mpark::patterns;
return match(n)(pattern(0) = [] { return 1; },
// ^ expression
pattern(_) = [n] { return n * factorial(n - 1); });
}
Arg Pattern
An arg pattern matches any value, and binds the value by passing it to the corresponding handler. This pattern can be repeated in a single pattern, and they match independent of each other.
Requirements
None.
Syntax
arg
arg(<pattern>)
Examples
int factorial(int n) {
using namespace mpark::patterns;
return match(n)(pattern(0) = [] { return 1; },
pattern(arg) = [](auto n) { return n * factorial(n - 1); });
// ^^^ arg
}
using P = std::pair<int, int>;
P p = {101, 202};
using namespace mpark::patterns;
match(p)(pattern(arg(ds(101, _))) = [](P &p) { std::cout << "pair!\n"; },
// ^^^ pass the pair!
pattern(_) = [] { std::cout << "not a pair\n"; });
// prints: "pair!"
Wildcard Pattern
A wildcard pattern matches any value and discards the value by not passing it to the corresponding handler. This pattern can be repeated within a single pattern, and they match independent of each other.
Requirements
None.
Syntax
_
(underscore)
Examples
int factorial(int n) {
using namespace mpark::patterns;
return match(n)(pattern(0) = [] { return 1; },
pattern(_) = [n] { return n * factorial(n - 1); });
// ^ wildcard
}
Identifier Pattern
An identifier pattern is similar to arg
and _
in that it matches any value.
Two major differences are:
- Ability to operate on names rather than just
arg
or_
. - When an identifier is repeated within a single pattern, the values bound to the repeated identifiers must compare equal for the pattern to match.
There are two types of identifier patterns: binding and discarding.
Requirements
If no identifiers are repeated, none.
Otherwise, let x
be a repeated identifer and v, vs...
be the group of values
matched by the repeated instances of x
. Then (... && (v == vs))
must be a
valid expression. The same requirement applies to any other repeated identifiers
in a single pattern.
Syntax
IDENTIFIERS(<identifier>...);
<identifier>
<identifier>(<pattern>)
If <identifier>
has a leading _
(underscore), it is a discarding
identifier. Otherwise, it is a binding identifier.
Examples
void is_same(int lhs, int rhs) {
using namespace mpark::patterns;
IDENTIFIERS(x, y);
match(lhs, rhs)(
pattern(x, x) = [](auto) { std::cout << "same\n"; },
// ^ ^ binding identifier (repeated)
pattern(x, y) = [](auto, auto) { std::cout << "diff\n"; }
// ^ ^ binding identifier
);
}
is_same(101, 101); // prints: "same"
is_same(101, 202); // prints: "diff"
void is_same(int lhs, rhs) {
using namespace mpark::patterns;
IDENTIFIERS(_x, _y);
match(42, 42)(
pattern(_x, _x) = [] { std::cout << "same\n"; },
// ^ ^ discarding identifier (repeated)
pattern(_x, _y) = [] { std::cout << "diff\n"; }
// ^ ^ discarding identifier
);
}
is_same(101, 101); // prints: "same"
is_same(101, 202); // prints: "diff"
std::tuple<int, int> t = {101, 202};
std::optional<int> o = 101;
using namespace mpark::patterns;
IDENTIFIERS(x, y);
match(t, o)(
pattern(ds(x, x), some(x)) = [](auto x) {
std::cout << "They're all " << x << "!\n";
},
pattern(ds(x, y), some(x)) = [](auto x, auto y) {
std::cout << "I recognize x-y-x!\n";
},
pattern(_) = [] { std::cout << "I don't know.\n"; });
// prints: "I recognize x-y-x!"
Destructure Pattern
A destructure pattern matches values that hold multiple values.
Requirements
Given a value x
of type T
, T
must satisfy one of the following conditions:
std::is_array_v<T> == true
, orstd::is_class_v<T> == true
andstd::tuple_size<T>
is a complete type, orstd::is_aggregate_v<T> == true
If (2), the following conditions must also be satisfied.
std::tuple_size<T>::value
must be a well-formed integer constant expression, andx.get<I>()
orget<I>(x)
must be a valid expression for allI
in[0, std::tuple_size<T>::value)
For aggregate types (3), the current implementation only supports types that contain upto 16 non-static data members.
NOTE: These requirements are very similar to the requirements for C++17 Structured Binding.
Syntax
ds(<pattern>...)
Examples
auto t = std::make_tuple(101, "hello", 1.1);
// C++17 Structured Binding:
const auto & [x, y, z] = t;
// ...
// MPark.Patterns:
using namespace mpark::patterns;
IDENTIFIERS(x, y, z);
match(t)(
pattern(ds(x, y, z)) = [](auto &&x, auto &&y, auto &&z) { /* ... */ });
// ^^^^^^^^^^^ destructure
NOTE: The top-level is wrapped by a tuple
, allowing us to write:
void fizzbuzz() {
for (int i = 1; i <= 100; ++i) {
using namespace mpark::patterns;
match(i % 3, i % 5)(
pattern(0, 0) = [] { std::cout << "fizzbuzz\n"; },
pattern(0, _) = [] { std::cout << "fizz\n"; },
pattern(_, 0) = [] { std::cout << "buzz\n"; },
pattern(_, _) = [i] { std::cout << i << '\n'; });
}
}
Optional Pattern
An optional pattern matches values that can be dereferenced,
and tested as a bool
.
Requirements
Given a value x
of type T
, T
must satisfy all of the following conditions:
*x
is a valid expression, andx
is contextually convertible tobool
Syntax
some(<pattern>)
none
Examples
int *p = nullptr;
using namespace mpark::patterns;
match(p)(pattern(some(_)) = [] { std::cout << "some\n"; },
pattern(none) = [] { std::cout << "none\n"; });
// prints: "none"
std::optional<int> o = 42;
using namespace mpark::patterns;
match(o)(
pattern(some(arg)) = [](auto x) { std::cout << "some(" << x << ")\n"; },
pattern(none) = [] { std::cout << "none\n"; });
// prints: "some(42)"
As Pattern
An as pattern matches values that is capable of holding a value of multiple types.
Requirements
Given a value x
of type T
, and a pattern as<U>(<pattern>)
,
T
must satisfy one of the following conditions:
std::is_polymorphic_v<T> == true
, orstd::is_class_v<T> == true
andstd::variant_size<T>
is a complete type, orx.any_cast<U>()
orany_cast<U>(&x)
is a valid expression
- If (1),
dynamic_cast<U'>(&x)
must be a valid expression, whereU'
is pointer toU
with the same cv-qualifiers asdecltype(x)
- If (2),
x.get_if<U>()
orget_if<U>(&x)
must be a valid expression
Syntax
as<U>(<pattern>)
Examples
struct Shape { virtual ~Shape() = default; };
struct Circle : Shape {};
struct Square : Shape {};
std::unique_ptr<Shape> shape = std::make_unique<Square>();
using namespace mpark::patterns;
match(shape)(pattern(some(as<Circle>(_))) = [] { std::cout << "Circle\n"; },
pattern(some(as<Square>(_))) = [] { std::cout << "Square\n"; });
// prints: "Square"
using str = std::string;
std::variant<int, str> v = 42;
using namespace mpark::patterns;
match(v)(pattern(as<int>(_)) = [] { std::cout << "int\n"; },
pattern(as<str>(_)) = [] { std::cout << "str\n"; });
// prints: "int"
using str = std::string;
std::any a = str("hello");
using namespace mpark::patterns;
match(a)(pattern(as<int>(_)) = [] { std::cout << "int\n"; },
pattern(as<str>(_)) = [] { std::cout << "str\n"; });
// prints: "str"
Visit Pattern
A visit pattern matches values that is capable of holding a value of closed set of types, and can be visited.
Requirements
Given a value x
, The following code must be valid.
using std::visit;
visit([](auto &&) {}, x);`
Syntax
vis(<pattern>)
Examples
using str = std::string;
std::variant<int, str> v = "hello world!";
struct Visitor {
void operator()(int n) const { std::cout << "int: " << n << '\n'; }
void operator()(const str &s) const { std::cout << "str: " << s << '\n'; }
};
using namespace mpark::patterns;
match(v)(pattern(vis(arg)) = Visitor{});
// prints: "str: hello world!".
Alternation Pattern
An alternation pattern matches values that match any of the given patterns.
Requirements
None.
Syntax
anyof(<pattern>...)
Examples
std::string s = "large";
using namespace mpark::patterns;
match(s)(
pattern(anyof("big", "large", "huge")) = [] { std::cout << "big!\n"; },
pattern(anyof("little", "small", "tiny")) = [] { std::cout << "small.\n"; },
pattern(_) = [] { std::cout << "unknown size.\n"; });
Variadic Pattern
A variadic pattern matches 0 or more values that match a given pattern.
Requirements
- A variadic pattern can only appear within a destructure pattern.
- A variadic pattern can only appear once.
Syntax
variadic(<pattern>)
Examples
auto x = std::make_tuple(101, "hello", 1.1);
using namespace mpark::patterns;
match(x)(
pattern(ds(variadic(arg))) = [](const auto &... xs) {
int dummy[] = { (std::cout << xs << ' ', 0)... };
(void)dummy;
});
// prints: "101 hello 1.1 "
This could also be used to implement C++17 std::apply
:
template <typename F, typename Tuple>
decltype(auto) apply(F &&f, Tuple &&t) {
using namespace mpark::patterns;
return match(std::forward<T>(t))(
pattern(ds(variadic(arg))) = std::forward<F>(f));
}
and even C++17 std::visit
:
template <typename F, typename... Vs>
decltype(auto) visit(F &&f, Vs &&... vs) {
using namespace mpark::patterns;
return match(std::forward<Vs>(vs)...)(
pattern(variadic(vis(arg))) = std::forward<F>(f));
}
We can even get a little fancier:
int x = 42;
auto y = std::make_tuple(101, "hello", 1.1);
using namespace mpark::patterns;
match(x, y)(
pattern(arg, ds(variadic(arg))) = [](const auto &... xs) {
int dummy[] = { (std::cout << xs << ' ', 0)... };
(void)dummy;
});
// prints: "42 101 hello 1.1 "
Pattern Guards
While pattern matching is very useful to match values against patterns, it mainly focuses on equality comparisons. Pattern guards are used to test whether the bound values satisfy some predicate.
WHEN
A WHEN
clause appears inside the handler as the only expression.
A WHEN
clause differs from an if
statement in that if the conditional
expression evaluates to false
, the WHEN
clause falls through to the
next case, whereas the if
statement would stay in the handler.
Requirements
The WHEN
clause may only appear once inside the handler as the only
expression, and the conditional expression must be contextually
convertible to bool
.
Syntax
WHEN(<condition>) { /* ... */ };
Examples
using namespace mpark::patterns;
IDENTIFIERS(x, y);
match(101, 202)(
pattern(x, x) = [](int) { std::cout << "EQ\n"; },
pattern(x, y) = [](int x, int y) { WHEN(x > y) { std::cout << "GT\n"; }; },
pattern(x, y) = [](int x, int y) { WHEN(x < y) { std::cout << "LT\n"; }; });
// prints: "LT"
Pattern Statements
if_let
This construct is simply syntactic sugar for the following situation:
std::optional<int> o = 42;
using namespace mpark::patterns;
match(o)(
pattern(some(arg)) = [](auto x) { ... },
pattern(_) = [] {});
For this situation, if_let
provides a cleaner interface:
std::optional<int> o = 42;
using namespace mpark::patterns;
if_let (pattern(some(arg)) = o) = [](auto x) {
// ...
};
for_let
Suppose we want to match each element in a range-based for
loop with pattern.
We could imagine being able to write something like:
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
for ((0, x) : pairs) {
std::cout << x << ' ';
}
// prints: "1 0 2 "
With if_let
we can say something close-ish:
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
for (const auto &p : pairs) {
using namespace mpark::patterns;
if_let (pattern(ds(0, arg)) = p) = [](auto x) {
std::cout << x << ' ';
};
}
// prints: "1 0 2 "
for_let
allows the above example to be written more concisely:
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
using namespace mpark::patterns;
for_let (pattern(ds(0, arg)) = pairs) = [](auto x) {
std::cout << x << ' ';
};
// prints: "1 0 2 "
for_let
also provides the break
and continue
statements with
return Break;
and return Continue;
respectively.
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
using namespace mpark::patterns;
for_let (pattern(ds(0, arg)) = pairs) = [](auto x) {
if (x == 1) return Break;
std::cout << x << ' ';
return Continue;
};
// prints: "0 "
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
using namespace mpark::patterns;
for_let (pattern(ds(0, arg)) = pairs) = [](auto x) {
if (x == 0) return Continue;
std::cout << x << ' ';
return Continue;
};
// prints: "1 2 "
Requirements
This library requires a standard conformant C++17 compiler. The following compilers are continously tested:
Compiler | Operating System | Version String |
---|---|---|
GCC 7.4.0 | Ubuntu 16.04.6 LTS | g++-7 (Ubuntu 7.4.0-1ubuntu1 |
GCC 8.1.0 | Ubuntu 16.04.6 LTS | g++-8 (Ubuntu 8.1.0-5ubuntu1~16.04) 8.1.0 |
Clang 5.0.0 | Ubuntu 16.04.6 LTS | clang version 5.0.0-3~16.04.1 (tags/RELEASE_500/final) |
Clang 6.0.0 | Ubuntu 16.04.6 LTS | clang version 6.0.0-1ubuntu2~16.04.1 (tags/RELEASE_600/final) |
Clang 7.1.0 | Ubuntu 16.04.6 LTS | clang version 7.1.0-svn353565-1 |
CMake Variables
-
MPARK_PATTERNS_INCLUDE_TESTS
:BOOL
(default:OFF
)Build tests.
Unit Tests
Refer to test/README.md.
License
Distributed under the Boost Software License, Version 1.0.