Fonduer Tutorials
Introduction Tutorials
We have several introductory tutorials to help get you started with using Fonduer.
Hardware Tutorial
In this tutorial, we build a Fonduer application to tackle the task of extracting maximum storage temperatures for specific transistor part numbers from their datasheets (PDF format).
Wiki Tutorial
In this tutorial, we use Fonduer to extract the place of of birth for American presidents from their wikipedia html pages (HTML format).
Hardware Figures Tutorial
In this advanced tutorial, we build a Fonduer application
to tackle the task of extracting images for specific transistors from their
datasheets. This demonstrates how Fonduer
can be used for image data in
additional to text.
Quick Start
docker-compose up
Installation
Dependencies
We use a few applications that you'll need to install and be sure are on your PATH.
For OS X using homebrew:
brew install poppler
brew install postgresql
brew install libpng freetype pkg-config
On Debian-based distros:
sudo apt install libxml2-dev libxslt-dev python3-dev libpq-dev
sudo apt build-dep python-matplotlib
sudo apt install poppler-utils
sudo apt install postgresql
For the Python dependencies, we recommend using a virtualenv. Once you have cloned the repository, change directories to the root of the repository and run
virtualenv -p python3 .venv
Once the virtual environment is created, activate it by running
source .venv/bin/activate
Any Python libraries installed will now be contained within this virtual
environment. To deactivate the environment, simply run deactivate
.
Then, install Fonduer and any other python dependencies by running:
pip install -r requirements.txt
Running
After installing all the requirements, just run:
jupyter notebook