  • Stars: 735
  • Rank: 61,652 (Top 2%)
  • Language: Python
  • License: Apache License 2.0
  • Created: over 3 years ago
  • Updated: 3 months ago

Repository Details

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.

NVIDIA Merlin

NVIDIA Merlin is an open source library that accelerates recommender systems on NVIDIA GPUs. The library enables data scientists, machine learning engineers, and researchers to build high-performing recommenders at scale. Merlin includes tools to address common feature engineering, training, and inference challenges. Each stage of the Merlin pipeline is optimized to support hundreds of terabytes of data and is accessible through easy-to-use APIs. For more information, see NVIDIA Merlin on the NVIDIA developer website.

Benefits

NVIDIA Merlin is a scalable and GPU-accelerated solution, making it easy to build recommender systems from end to end. With NVIDIA Merlin, you can:

  • Transform data (ETL) for preprocessing and engineering features.
  • Accelerate your existing training pipelines in TensorFlow, PyTorch, or FastAI by leveraging optimized, custom-built data loaders.
  • Scale large deep learning recommender models by distributing large embedding tables that exceed available GPU and CPU memory.
  • Deploy data transformations and trained models to production with only a few lines of code.

Components of NVIDIA Merlin

NVIDIA Merlin consists of the following open source libraries:

NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data. The library can quickly and easily manipulate terabyte-size datasets that are used to train deep learning based recommender systems. The library offers a high-level API that can define complex data transformation workflows. With NVTabular, you can:

  • Prepare datasets quickly and easily for experimentation so that you can train more models.
  • Process datasets that exceed GPU and CPU memory without having to worry about scale.
  • Focus on what to do with the data and not how to do it by using abstraction at the operation level.
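
To make the idea of processing data that does not fit in memory concrete, here is a minimal sketch in plain Python. It does not use the NVTabular API. The function names (`fit_vocabulary`, `transform`) and the toy data are hypothetical and chosen only for illustration. The sketch encodes a categorical column in two passes over chunks of rows, so only one chunk needs to be in memory at a time:

```python
from collections import Counter
from typing import Dict, Iterable, Iterator, List

Chunk = List[dict]  # one in-memory slice of a larger dataset


def fit_vocabulary(chunks: Iterable[Chunk], column: str) -> Dict[str, int]:
    """Pass 1: count category values chunk by chunk and assign integer ids.

    Id 0 is reserved for values that were not seen during fitting.
    """
    counts: Counter = Counter()
    for chunk in chunks:
        counts.update(row[column] for row in chunk)
    # Most frequent values get the smallest ids; ties break alphabetically.
    ordered = sorted(counts, key=lambda value: (-counts[value], value))
    return {value: index + 1 for index, value in enumerate(ordered)}


def transform(chunks: Iterable[Chunk], column: str, vocab: Dict[str, int]) -> Iterator[Chunk]:
    """Pass 2: replace each category value with its id, one chunk at a time."""
    for chunk in chunks:
        yield [{**row, column: vocab.get(row[column], 0)} for row in chunk]


# Hypothetical toy chunks standing in for files that do not fit in memory together.
chunks = [
    [{"user": "a", "item": "shoes"}, {"user": "b", "item": "hat"}],
    [{"user": "a", "item": "shoes"}, {"user": "c", "item": "scarf"}],
]
vocab = fit_vocabulary(chunks, "item")
encoded = [row for chunk in transform(chunks, "item", vocab) for row in chunk]
```

The split into a fitting pass and a transforming pass is the part worth noting: statistics are gathered over the whole dataset first, and the transformation can then be applied to any chunk independently.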

HugeCTR
HugeCTR is a GPU-accelerated training framework that can scale large deep learning recommendation models by distributing training across multiple GPUs and nodes. HugeCTR contains optimized data loaders with GPU-acceleration and provides strategies for scaling large embedding tables beyond available memory. With HugeCTR, you can:

  • Scale embedding tables over multiple GPUs or nodes.
  • Load a subset of an embedding table into a GPU in a coarse-grained, on-demand manner during the training stage.
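
As a rough illustration of what scaling an embedding table over several devices can mean, the following plain-Python sketch splits the rows of a table across shards. It does not use HugeCTR, and the modulo placement rule is an assumption made for the example, not a description of HugeCTR's actual strategy. The class name `ShardedEmbedding` is hypothetical, and each shard is an ordinary dictionary standing in for the memory of one GPU:

```python
import random
from typing import Dict, List


class ShardedEmbedding:
    """Embedding table whose rows are split across several shards (stand-ins for GPUs)."""

    def __init__(self, num_rows: int, dim: int, num_shards: int, seed: int = 0) -> None:
        rng = random.Random(seed)
        self.num_shards = num_shards
        # Each shard stores only the rows it owns, keyed by global row id.
        self.shards: List[Dict[int, List[float]]] = [{} for _ in range(num_shards)]
        for row_id in range(num_rows):
            vector = [rng.uniform(-0.1, 0.1) for _ in range(dim)]
            self.shards[self.owner(row_id)][row_id] = vector

    def owner(self, row_id: int) -> int:
        """Modulo placement (an assumption): row i lives on shard i % num_shards."""
        return row_id % self.num_shards

    def lookup(self, row_ids: List[int]) -> List[List[float]]:
        """Fetch each requested row from the shard that owns it."""
        return [self.shards[self.owner(row_id)][row_id] for row_id in row_ids]


table = ShardedEmbedding(num_rows=10, dim=4, num_shards=3)
vectors = table.lookup([0, 7, 9])
shard_sizes = [len(shard) for shard in table.shards]
```

No single shard holds the full table, so the total table size is bounded by the combined memory of all shards rather than by one device.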

Merlin Models
The Merlin Models library provides standard models for recommender systems, aiming for high-quality implementations that range from classic machine learning models to highly advanced deep learning models. With Merlin Models, you can:

  • Accelerate your ranking model training by up to 10x by using performant data loaders for TensorFlow, PyTorch, and HugeCTR.
  • Iterate rapidly on feature engineering and model exploration by mapping datasets created with NVTabular into a model input layer automatically. The model input layer enables you to change either without impacting the other.
  • Assemble connectable building blocks for common RecSys architectures so that you can create new models quickly and easily.

Transformers4Rec
The Transformers4Rec library provides sequential and session-based recommendation. The library provides modular building blocks that are compatible with standard PyTorch modules. You can use the building blocks to design custom architectures such as multiple towers, multiple heads and tasks, and losses. With Transformers4Rec, you can:

  • Build sequential and session-based recommenders from any sequential tabular data.
  • Take advantage of the integration with NVTabular for seamless data preprocessing and feature engineering.
  • Perform next-item prediction as well as classic binary classification or regression tasks.
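
To show what next-item prediction on sequential tabular data looks like as a data problem, here is a small plain-Python sketch. It does not use Transformers4Rec or PyTorch. The helper names and the event tuples are hypothetical. The sketch groups interaction events into time-ordered sessions and then derives (history, next item) training pairs from each session:

```python
from typing import Dict, List, Tuple


def sessions_from_events(events: List[Tuple[str, int, str]]) -> Dict[str, List[str]]:
    """Group (session_id, timestamp, item_id) events into time-ordered item lists."""
    grouped: Dict[str, List[Tuple[int, str]]] = {}
    for session_id, timestamp, item_id in events:
        grouped.setdefault(session_id, []).append((timestamp, item_id))
    # Sorting the (timestamp, item_id) pairs orders each session by time.
    return {sid: [item for _, item in sorted(rows)] for sid, rows in grouped.items()}


def next_item_examples(items: List[str]) -> List[Tuple[List[str], str]]:
    """Turn one session into (history, next item) training pairs."""
    return [(items[:i], items[i]) for i in range(1, len(items))]


# Hypothetical interaction log; rows may arrive out of order.
events = [
    ("s1", 3, "c"), ("s1", 1, "a"), ("s1", 2, "b"),
    ("s2", 1, "x"), ("s2", 2, "y"),
]
sessions = sessions_from_events(events)
examples = {sid: next_item_examples(items) for sid, items in sessions.items()}
```

A session of length n yields n - 1 training pairs, which is why sessions with a single interaction contribute nothing to next-item training.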

Merlin Systems
Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems like feature stores, nearest neighbor search, and exploration strategies into end-to-end recommendation pipelines that can be served with Triton Inference Server. With Merlin Systems, you can:

  • Start with an integrated platform for serving recommendations built on Triton Inference Server.
  • Create graphs that define the end-to-end process of generating recommendations.
  • Benefit from existing integrations with popular tools that are commonly found in recommender system pipelines.
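
The idea of a graph that defines the end-to-end process can be sketched in plain Python as a chain of stages, the simplest kind of graph. This sketch does not use Merlin Systems or Triton Inference Server. The stage functions and the in-memory dictionaries (`CANDIDATES`, `ALREADY_SEEN`, `ITEM_SCORE`) are hypothetical stand-ins for a candidate index, a feature store, and a ranking model:

```python
from typing import Callable, List

Stage = Callable[[dict], dict]

# Hypothetical in-memory stand-ins for external services.
CANDIDATES = {"u1": ["i1", "i2", "i3", "i4"]}
ALREADY_SEEN = {"u1": {"i2"}}
ITEM_SCORE = {"i1": 0.2, "i2": 0.9, "i3": 0.7, "i4": 0.5}


def retrieve(state: dict) -> dict:
    """Fetch candidate items for the requesting user."""
    return {**state, "items": list(CANDIDATES.get(state["user"], []))}


def filter_seen(state: dict) -> dict:
    """Drop items the user has already interacted with."""
    seen = ALREADY_SEEN.get(state["user"], set())
    return {**state, "items": [item for item in state["items"] if item not in seen]}


def rank(state: dict) -> dict:
    """Order the remaining items by score and keep the top k."""
    ordered = sorted(state["items"], key=lambda item: ITEM_SCORE[item], reverse=True)
    return {**state, "items": ordered[: state["k"]]}


def run_pipeline(stages: List[Stage], request: dict) -> dict:
    """Pass the request through each stage in order."""
    state = dict(request)
    for stage in stages:
        state = stage(state)
    return state


result = run_pipeline([retrieve, filter_seen, rank], {"user": "u1", "k": 2})
```

Because every stage takes and returns the same kind of state, stages can be reordered, replaced, or extended without touching the others.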

Merlin Core
Merlin Core provides functionality that is used throughout the Merlin ecosystem. With Merlin Core, you can:

  • Use a standard dataset abstraction for processing large datasets across multiple GPUs and nodes.
  • Benefit from a common schema that identifies key dataset features and enables Merlin to automate routine modeling and serving tasks.
  • Simplify your code by using a shared API for constructing graphs of data transformation operators.
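
To illustrate how a common schema can automate routine modeling decisions, here is a toy sketch in plain Python. The classes `ToyColumn` and `ToySchema`, the method `names_with_tag`, and the tag strings are hypothetical. They only borrow the idea and do not mirror the Merlin Core API. Each column carries tags, and downstream code selects columns by tag instead of hard-coding column names:

```python
from dataclasses import dataclass, field
from typing import FrozenSet, List


@dataclass(frozen=True)
class ToyColumn:
    """One dataset column together with the tags that describe its role."""
    name: str
    dtype: str
    tags: FrozenSet[str] = field(default_factory=frozenset)


@dataclass
class ToySchema:
    """An ordered collection of tagged columns."""
    columns: List[ToyColumn]

    def names_with_tag(self, tag: str) -> List[str]:
        """Return the names of all columns carrying the given tag, in schema order."""
        return [column.name for column in self.columns if tag in column.tags]


schema = ToySchema([
    ToyColumn("user_id", "int64", frozenset({"categorical", "user"})),
    ToyColumn("item_id", "int64", frozenset({"categorical", "item"})),
    ToyColumn("price", "float32", frozenset({"continuous", "item"})),
    ToyColumn("click", "int8", frozenset({"target"})),
])
# A model builder could derive its inputs and targets from tags alone.
model_inputs = schema.names_with_tag("categorical") + schema.names_with_tag("continuous")
targets = schema.names_with_tag("target")
```

Adding a new feature then means adding one tagged column to the schema, and code that selects by tag picks it up without further changes.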

Installation

The simplest way to use Merlin is to run a Docker container. NVIDIA GPU Cloud (NGC) provides containers that include all the Merlin component libraries and their dependencies, and the containers receive unit and integration testing. For more information, see the Containers page.

To develop and contribute to Merlin, review the installation documentation for each component library. The development environment for each Merlin component is easily set up with conda or pip:

Component          Installation Steps
HugeCTR            https://nvidia-merlin.github.io/HugeCTR/master/hugectr_contributor_guide.html
Merlin Core        https://github.com/NVIDIA-Merlin/core/blob/stable/README.md#installation
Merlin Models      https://github.com/NVIDIA-Merlin/models/blob/stable/README.md#installation
Merlin Systems     https://github.com/NVIDIA-Merlin/systems/blob/stable/README.md#installation
NVTabular          https://github.com/NVIDIA-Merlin/NVTabular/blob/stable/README.md#installation
Transformers4Rec   https://github.com/NVIDIA-Merlin/Transformers4Rec/blob/stable/README.md#installation

Example Notebooks and Tutorials

A collection of end-to-end examples is available in the form of Jupyter notebooks. The example notebooks demonstrate how to:

  • Download and prepare a dataset.
  • Preprocess data and engineer features.
  • Train deep learning recommendation models with TensorFlow, PyTorch, FastAI, HugeCTR, or Merlin Models.
  • Deploy the models to production with Triton Inference Server.

These examples are based on different datasets and provide a wide range of real-world use cases.

Merlin Is Built On

RAPIDS cuDF
Merlin relies on cuDF for GPU-accelerated DataFrame operations used in feature engineering.

Dask
Merlin relies on Dask to distribute and scale feature engineering and preprocessing within NVTabular and to accelerate data loading in Merlin Models and HugeCTR.

Triton Inference Server
Merlin leverages Triton Inference Server to provide GPU-accelerated serving for recommender system pipelines.

Feedback and Support

To report bugs or get help, please open an issue.

More Repositories

  1. Transformers4Rec (Python, 1,076 stars): Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.
  2. NVTabular (Python, 1,030 stars): NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte-scale datasets used to train deep learning based recommender systems.
  3. HugeCTR (C++, 947 stars): HugeCTR is a high-efficiency GPU framework designed for training click-through rate (CTR) estimation models.
  4. dataloader (Python, 401 stars): The Merlin dataloader lets you rapidly load tabular data for training deep learning models with TensorFlow, PyTorch, or JAX.
  5. models (Python, 253 stars): Merlin Models is a collection of deep learning recommender system model reference implementations.
  6. competitions (Jupyter Notebook, 196 stars): Solutions to Recommender Systems competitions.
  7. HierarchicalKV (Cuda, 125 stars): HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on high-bandwidth memory (HBM) of GPUs and in host memory. It can also be used as a generic key-value storage.
  8. systems (Python, 88 stars): Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature stores, nearest neighbor search, and exploration strategies) into end-to-end recommendation pipelines that can be served with Triton Inference Server.
  9. publications (Jupyter Notebook, 61 stars)
  10. distributed-embeddings (Python, 42 stars): distributed-embeddings is a library for building large embedding-based models in TensorFlow 2.
  11. gcp-ml-ops (Python, 41 stars): MLOps pipeline for NVIDIA Merlin on GKE.
  12. core (Python, 19 stars): Core Utilities for NVIDIA Merlin.
  13. nvtabular_triton_backend (C++, 2 stars): Triton Backend for NVTabular.