TokuMX
TokuMX is a high-performance distribution of MongoDB, a document-oriented database with built-in sharding and replication, built on Tokutek's Fractal Tree indexes.
TokuMX has the same binaries, supports the same drivers, data model, and features of MongoDB, because it shares much of its code with MongoDB. The benefits of using TokuMX include:
- Faster writes and less I/O on large data sets.
- Concurrent read/write access to collections with document-level locking.
- Compression of all data by default.
- MVCC transactional semantics.
- Zero fragmentation of data or indexes, removing the
need for maintenance like
compact
orrepairDatabase
.
TokuMX is fully compatible with MongoDB 2.4, except that TokuMX does not support:
- Text indexes and search
- Geospatial indexing and queries
Documentation
For the most part, the MongoDB documentation applies to TokuMX as well. For exceptions, additional features, and TokuMX specific strategies and behaviors, the TokuMX Users Guide takes precedence.
Packaging
TokuMX is available for 64-bit Linux systems. Tokutek builds and provides packages for Centos 5 and 6, Debian 7, Fedora 20, and Ubuntu 12.04, 12.10, 13.04, 13.10, and 14.04. For other distributions of Linux, Tokutek provides a binary tarball that can be installed standalone.
Community downloads can be found here.
Enterprise downloads, including the hot backup utility, can be found here and come with 30 days of free support for evaluation.
Installation instructions for Linux distributions and OSX can be found here.
Migrating from MongoDB
TokuMX stores data on disk completely differently from MongoDB. Therefore it is necessary to export any existing data from MongoDB and import it into TokuMX. Depending on your existing MongoDB database and your application's availability requirements, you can choose from a number of strategies for converting your data set to TokuMX.
- Single server
- Replica set
- Offline (with downtime)
- Online (with no downtime)
- Sharded cluster
- Offline, individual shards
- Offline, all data at once
- Online
For details of how to perform these types of data migrations, see the documentation.
Building TokuMX
See docs/building.md.
Drivers
All MongoDB drivers work seamlessly with TokuMX.
Replication
Replication in TokuMX is managed similarly to MongoDB and uses much of the same terminology, administrative commands, and election mechanisms, and write concern works the same. There are some significant improvements to replication performance in TokuMX, including significantly reduced I/O load due to writes on secondaries.
The details of the replication oplog are different in TokuMX. Applications that read the oplog and rely on the format its contents will need to change. Therefore, "mixed" replica sets containing MongoDB and TokuMX nodes do not work.
Sharding
Sharding in TokuMX is managed similarly to MongoDB and uses much of the same terminology and administrative commands. There are some significant improvements to sharding, including significantly less intrusive chunk migrations, and clustering shard keys that make range queries much faster.
The internal mechanisms used to migrate chunks between shards are different in TokuMX from MongoDB, and therefore "mixed" clusters containing MongoDB and TokuMX shards do not work.
Multi-statement transactions (beginTransaction
, rollbackTransaction
,
and commitTransaction
commands) do not work through mongos
.
Monitoring
MongoDB Management Service monitoring partially works with TokuMX. Some monitored statistics work normally (e.g. opcounters), and some report things that are MongoDB-specific, and therefore report nothing for TokuMX (e.g. btree misses). MMS can be used but is an incomplete solution.
Other on-site and hosted solutions can be used. db.serverStatus()
reports TokuMX equivalent metrics for many commonly-tracked MongoDB
metrics, and db.engineStatus()
has an exhaustive list of many metrics
that can be tracked.
Support
Enterprise support subscriptions are available from Tokutek. This includes an enterprise hot backup tool.
Community
Mailing lists:
- [email protected] for news and help running a TokuMX database and developing applications using TokuMX.
- [email protected] for discussion about developing TokuMX itself.
IRC:
- irc.freenode.net/tokutek for discussion about TokuMX and TokuDB.
Issue tracker:
- TokuMX JIRA for problems with TokuMX.
- TokuFT JIRA for problems with TokuFT.
License
Most TokuMX source files are made available under the terms of the GNU Affero General Public License (AGPL). See individual files for details.
As an exception, the files in the client/, debian/, rpm/, utils/mongoutils, and all subdirectories thereof are made available under the terms of the Apache License, version 2.0.
The TokuKV Fractal Tree Indexing library is made available under the terms of the GNU General Public License (GPL) version 2, with an additional grant of a patent license. See README-TOKUKV for details.