  • Stars: 166
  • Rank: 227,748 (Top 5 %)
  • Language: C++
  • License: GNU General Publi...
  • Created over 7 years ago
  • Updated 4 months ago

Repository Details

Reference implementation of the ZIM specification

Libzim

The Libzim is the reference implementation of the ZIM file format. It is a software library to read and write ZIM files on many systems and architectures. More information about the ZIM format and the openZIM project is available at https://openzim.org/.

Disclaimer

This document assumes you have some knowledge of software compilation. If you experience difficulties with the dependencies or with the Libzim compilation itself, we recommend having a look at kiwix-build.

Preamble

Although the Libzim can be compiled/cross-compiled on/for many systems, the following documentation explains how to do it on POSIX ones. It is primarily intended for GNU/Linux systems and has been tested on recent releases of Ubuntu and Fedora.

Dependencies

The Libzim relies on many third-party software libraries. They are prerequisites to the Libzim compilation. The following libraries need to be available:

  • LZMA (package liblzma-dev on Ubuntu)
  • ICU (package libicu-dev on Ubuntu)
  • Zstd (package libzstd-dev on Ubuntu)
  • Xapian - optional (package libxapian-dev on Ubuntu)
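
As a rough sketch for Ubuntu, the four packages listed above could be installed in one go. The helper name `install_libzim_deps` and the variable `LIBZIM_DEPS` are hypothetical and used only for illustration; the package names are the ones given in the list and will differ on other distributions.

```shell
# Package names are the Ubuntu ones listed above; other distributions differ.
LIBZIM_DEPS="liblzma-dev libicu-dev libzstd-dev libxapian-dev"

# Hypothetical helper: wraps the install so the package list can be reviewed first.
install_libzim_deps() {
  # Intentionally unquoted so each package becomes its own argument.
  sudo apt-get install -y $LIBZIM_DEPS
}

echo "Packages to install: $LIBZIM_DEPS"
```

Running `install_libzim_deps` afterwards performs the actual installation. Since Xapian is optional, `libxapian-dev` can be dropped from the list if search support is not needed.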

To test the code:

To build the documentation you need the packages:

These dependencies may or may not be packaged by your operating system. They may also be packaged but only in an older version. The compilation script will tell you if one of them is missing or too old. In the worst case, you will have to download and compile a more recent version by hand.

If you want to install these dependencies locally, then ensure that Meson (through pkg-config) will properly find them.
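
A minimal sketch of what this could look like, assuming the dependencies were installed under a hypothetical prefix `$HOME/.local` and that their pkg-config module names are `liblzma`, `icu-i18n`, `libzstd` and `xapian-core` (the names these projects usually ship, but check your own installation). The `lib/pkgconfig` subdirectory is also an assumption; some systems use `lib64/pkgconfig` or an architecture-specific directory instead.

```shell
# Hypothetical prefix where the dependencies were installed by hand.
PREFIX="$HOME/.local"

# Prepend the prefix's pkg-config directory, keeping any existing value.
export PKG_CONFIG_PATH="$PREFIX/lib/pkgconfig${PKG_CONFIG_PATH:+:$PKG_CONFIG_PATH}"

# Report which modules pkg-config can see (module names are assumptions).
if command -v pkg-config >/dev/null 2>&1; then
  for module in liblzma icu-i18n libzstd xapian-core; do
    if pkg-config --exists "$module"; then
      echo "found:   $module"
    else
      echo "missing: $module"
    fi
  done
else
  echo "pkg-config is not installed" >&2
fi
```

Any module reported as missing here is likely to be reported as missing by Meson too, since Meson asks pkg-config for the same information.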

Environment

The Libzim builds using Meson version 0.43 or higher. Meson itself relies on Ninja, Pkg-config and a few other compilation tools. Install them first:

  • Meson
  • Ninja
  • Pkg-config

These tools should be packaged if you use a recent operating system. If not, have a look at the Troubleshooting section.

Compilation

Once all dependencies are installed, you can compile Libzim with:

meson . build
ninja -C build

By default, it will compile dynamically linked libraries. All binary files will be created in the build directory, which Meson creates automatically. If you want statically linked libraries, you can add the --default-library=static option to the Meson command.

If you want to build the documentation, you need to pass the -Ddoc=true option and run the doc target:

meson . build -Ddoc=true
ninja -C build doc

Depending on your system, the ninja command may be called ninja-build.
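
One way to cope with both names in a script is to detect the executable once and reuse it. This is only a sketch; the variable name `NINJA` is a hypothetical choice.

```shell
# Pick whichever name the Ninja executable has on this system.
if command -v ninja >/dev/null 2>&1; then
  NINJA=ninja
elif command -v ninja-build >/dev/null 2>&1; then
  NINJA=ninja-build
else
  NINJA=""
  echo "Ninja not found; see the Troubleshooting section." >&2
fi

echo "Ninja command: ${NINJA:-none}"
```

The build step can then be written as `"$NINJA" -C build`, whichever name the distribution uses.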

By default, Libzim tries to compile with Xapian (and will generate an error if Xapian is not found). You can build without Xapian by passing the option -Dwith_xapian=false:

meson . build -Dwith_xapian=false
ninja -C build

If Libzim is compiled without Xapian, all search APIs are removed. You can test whether an installed version of Libzim was compiled with or without Xapian by checking whether LIBZIM_WITH_XAPIAN is defined.

Testing

The ZIM files needed by the unit tests are not included in this repository. By default, Meson will use an internal directory in your build directory, but you can specify another directory with the option test_data_dir:

meson . build -Dtest_data_dir=<A_DIR_WITH_TEST_DATA>

Whether or not you specify a directory, you need an extra step to download the data. You can either:

  • Get the data from the repository openzim/zim-testing-suite and put it in the directory yourself.
  • Use the script download_test_data.py, which will download and extract the data for you.
  • Ask ninja to do it for you with ninja download_test_data once the project is configured.

The simple workflow is:

meson . build # Configure the project (using default directory for test data)
cd build
ninja # Build
ninja download_test_data # Download the test data
meson test # Test

It is possible to deactivate all tests that use ZIM test data files by passing none to the test_data_dir option:

meson . build -Dtest_data_dir=none
cd build
ninja
meson test # Run all tests except those needing test ZIM files.

If the automated tests fail or time out, be aware that some tests need up to 16GB of memory. You can skip those specific tests with:

SKIP_BIG_MEMORY_TEST=1 meson test

Some tests check error detection in a multithreaded environment and need to sleep to let the threads work (and detect the error). How long to wait depends on your computer. If the error_in_creator test fails, you probably need to extend the waiting time. This can be done by setting the environment variable WAIT_TIME_FACTOR_TEST to a float factor. The waiting time will be multiplied by this factor.

WAIT_TIME_FACTOR_TEST=2 meson test

Installation

If you want to install the Libzim and the headers you have just compiled on your system, run:

ninja -C build install

You might need to run the command as root (or using sudo), depending on where you want to install the libraries. After the installation succeeds, you may need to run ldconfig (as root).

Uninstallation

If you want to uninstall the Libzim:

ninja -C build uninstall

As with the installation, you might need to run the command as root (or using sudo).

Troubleshooting

If you need to install Meson "manually":

virtualenv -p python3 ./ # Create virtualenv
source bin/activate      # Activate the virtualenv
pip3 install meson       # Install Meson
hash -r                  # Refresh bash paths

If you need to install Ninja "manually":

git clone https://github.com/ninja-build/ninja.git
cd ninja
git checkout release
./configure.py --bootstrap
mkdir ../bin
cp ninja ../bin
cd ..

If the compilation still fails, you might need a more recent version of a dependency than the one packaged by your Linux distribution. In that case, try a source tarball distributed by the problematic upstream project, or even build directly from its source code repository.

License

GPLv2 or later, see COPYING for more details.

More Repositories

  1. zimit (Python, 335 stars): Make a ZIM file from any Web site and surf offline!
  2. mwoffliner (TypeScript, 285 stars): Mediawiki scraper: all your wiki articles in one highly compressed ZIM file
  3. sotoki (Python, 217 stars): StackExchange websites to ZIM scraper
  4. gutenberg (Python, 130 stars): Scraper for downloading the entire ebooks repository of project Gutenberg
  5. zim-tools (C++, 127 stars): Various ZIM command line tools
  6. zimfarm (Python, 83 stars): Farm operated by bots to grow and harvest new zim files
  7. python-libzim (Python, 63 stars): Libzim binding for Python: read/write ZIM files in Python
  8. youtube (Python, 48 stars): Create a ZIM file from a Youtube channel/username/playlist
  9. warc2zim (Python, 44 stars): Command line tool to convert a file in the WARC format to a file in the ZIM format
  10. zim-requests (37 stars): Want a new ZIM file? Propose ZIM content improvements or fixes? Here you are!
  11. zimwriterfs (C++, 36 stars): [ARCHIVED] Create ZIM files based from a directory on your local filesystem
  12. node-libzim (C++, 27 stars): Libzim binding for Node.js: read/write ZIM files in Javascript
  13. ifixit (Python, 25 stars): iFixit to ZIM scraper
  14. wp1 (Python, 24 stars): Wikipedia 1.0 engine & selection tools
  15. nautilus (Python, 21 stars): Turns a collection of documents into a browsable ZIM file
  16. python-scraperlib (Python, 19 stars): Collection of Python code to re-use across Python-based scrapers
  17. wikihow (Python, 16 stars): WikiHow scraper
  18. ted (Python, 13 stars): Provide the best of TED.com for offline usage!
  19. zimit-frontend (Vue, 9 stars): Zimit Public Web UI
  20. kolibri (Python, 8 stars): Convert a Kolibri channel in ZIM file(s)
  21. openedx (Python, 8 stars): Open edX (to zim) scraper
  22. phet (JavaScript, 7 stars): Scraper for PhET Science & Math Interactive Simulations
  23. zip2zim (JavaScript, 6 stars): [ARCHIVED] Convert Zip Files to Zim Files
  24. wp1_selection_tools (Perl, 6 stars): Create selections with the best articles of a WM project
  25. zimreader-java (Java, 5 stars): [ARCHIVED] ZIM file reader in Java
  26. freecodecamp (Python, 4 stars): FreeCodeCamp.org scraper (to ZIM)
  27. cms (Python, 4 stars): ZIM file Publishing Platform
  28. docker-publish-action (Python, 4 stars): Docker Publish Action for OpenZIM projects
  29. education-numerique (Python, 3 stars): Éducation & Numérique scraper
  30. zim-testing-suite (PHP, 3 stars): This repository contains testing zim files for libzim and other openzim repositories.
  31. overview (2 stars): 🎈 Start here for current projects, how to get involved, and joining community calls. A resource for new and veteran members of the offline community
  32. zimfarm-client (Python, 2 stars): Command line tool to deal with the Zimfarm
  33. nautilus-webui (Python, 1 star): SaaS Web UI for nautilus
  34. python-storagelib (Python, 1 star): S3 Cache wrapper to use within Kiwix/OpenZIM/Offspot projects
  35. zimreader-tntnet (CSS, 1 star): [ARCHIVED] ZIM file reader using tntnet HTTP server
  36. devdocs (Python, 1 star): devdocs.io to ZIM scraper
  37. _python-bootstrap (Python, 1 star): Sample openZIM Python project bootstrap
  38. xapian-meson (C++, 1 star): Xapian ( 1.4.23) source code with meson build system
  39. lilote (JavaScript, 1 star): Generate a Lilote ZIM file from a Lilote export JSON