• Stars
    star
    591
  • Rank 72,787 (Top 2 %)
  • Language
    C++
  • License
    Apache License 2.0
  • Created about 8 years ago
  • Updated about 1 month ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

oneAPI Data Analytics Library (oneDAL)

oneAPI Data Analytics Library

Installation   |   Documentation   |   Support   |   Examples   |   How to Contribute   

Build Status License Join the community on GitHub Discussions

oneAPI Data Analytics Library (oneDAL) is a powerful machine learning library that helps you accelerate big data analysis at all stages: preprocessing, transformation, analysis, modeling, validation, and decision making.

The library implements classical machine learning algorithms. The boost in their performance is achieved by leveraging the capabilities of Intel® hardware.

oneDAL is part of oneAPI. The current branch implements version 1.1 of oneAPI Specification.

Usage

There are different ways for you to build high-performance data science applications that use the advantages of oneDAL:

  • Use oneDAL C++ interfaces with or without SYCL support (learn more)
  • Use Intel(R) Extension for Scikit-learn* to accelerate existing scikit-learn code without changing it
  • Use daal4py, a standalone package with Python API for oneDAL Deprecation Notice: The Java interfaces are removed from the oneDAL library.

Installation

Check System Requirements before installing oneDAL.

You can download the specific version of oneDAL or install it from sources.

Examples

C++ Examples:

Python Examples:

Other Examples

Documentation

oneDAL documentation:

Other related documentation:

Apache Spark MLlib

oneDAL library is used for Spark MLlib acceleration as part of OAP MLlib project and allows you to get a 3-18x increase in performance compared to the default Apache Spark MLlib.

Technical details: FPType: double; HW: 7 x m5.2xlarge AWS instances; SW: Intel DAAL 2020 Gold, Apache Spark 2.4.4, emr-5.27.0; Spark config num executors 12, executor cores 8, executor memory 19GB, task cpus 8

Scaling

oneDAL supports distributed computation mode that shows excellent results for strong and weak scaling:

oneDAL K-Means fit, strong scaling result oneDAL K-Means fit, weak scaling results

Technical details: FPType: float32; HW: Intel Xeon Processor E5-2698 v3 @2.3GHz, 2 sockets, 16 cores per socket; SW: Intel® DAAL (2019.3), MPI4Py (3.0.0), Intel® Distribution Of Python (IDP) 3.6.8; Details available in the article https://arxiv.org/abs/1909.11822

Support

Ask questions and engage in discussions with oneDAL developers, contributers, and other users through the following channels:

You may reach out to project maintainers privately at [email protected].

Security

To report a vulnerability, refer to Intel vulnerability reporting policy.

Contribute

We welcome community contributions. Check our contributing guidelines to learn more.

License

oneDAL is distributed under the Apache License 2.0 license. See LICENSE for more information.

oneMKL FPK microlibs are distributed under Intel Simplified Software License. Refer to third-party-programs-mkl.txt for details.

More Repositories

1

oneTBB

oneAPI Threading Building Blocks (oneTBB)
C++
5,245
star
2

oneDNN

oneAPI Deep Neural Network Library (oneDNN)
C++
3,420
star
3

oneAPI-samples

Samples for Intel® oneAPI Toolkits
C++
820
star
4

oneDPL

oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html
C++
707
star
5

oneMKL

oneAPI Math Kernel Library (oneMKL) Interfaces
C++
560
star
6

SYCLomatic

202
star
7

level-zero

oneAPI Level Zero Specification Headers and Loader
C++
178
star
8

oneVPL

oneAPI Video Processing Library (oneVPL) dispatcher, tools, and examples
C++
173
star
9

oneCCL

oneAPI Collective Communications Library (oneCCL)
C++
172
star
10

oneAPI-spec

oneAPI Specification source files
Python
165
star
11

oneapi-ci

Sample configuration files for using oneAPI in CI systems
Shell
88
star
12

oneVPL-intel-gpu

C++
86
star
13

oneAPI-tab

oneAPI Technical Advisory Board (TAB) Meeting Notes
70
star
14

level-zero-tests

oneAPI Level Zero Conformance & Performance test content
C++
43
star
15

distributed-ranges

Distributed ranges is a generalization of C++ ranges for distributed data structures.
C++
40
star
16

Velocity-Bench

C++
37
star
17

oneVPL-cpu

oneAPI Video Processing Library (oneVPL) CPU implementation. This GitHub repository is no longer active. See ReadMe for more information.
C++
25
star
18

unified-runtime

C++
24
star
19

unified-memory-framework

A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management. UMF allows users to manage multiple memory pools characterized by different attributes, allowing certain allocation types to be isolated from others and allocated using different hardware resources as required.
C
17
star
20

SYCLomatic-test

LLVM
14
star
21

level-zero-spec

Python
14
star
22

drone-navigation-inspection

AI Starter Kit for AI applications in Drone technology using Intel® Optimized Tensorflow*
Python
13
star
23

predictive-asset-health-analytics

AI Starter Kit for Predictive Asset Maintenance using Intel® optimized version of XGBoost
HTML
13
star
24

ishmem

Intel® SHMEM - Device initiated shared memory based communication library
C++
11
star
25

text-data-generation

AI Starter Kit for AI Unstructured Synthetic Data Generation using Intel® Extension for Pytorch
Python
9
star
26

traffic-camera-object-detection

AI Starter Kit for traffic camera object detection using Intel® Extension for Pytorch
Python
9
star
27

intelligent-indexing

AI Starter Kit for Intelligent Indexing of Incoming Correspondence using Intel® Extension for Scikit-learn*
Python
8
star
28

invoice-to-cash-automation

Ai starter kit for trade promotion and claim documents categorization using pytorch* and Tensorflow*
Python
7
star
29

demand-forecasting

AI Starter Kit for demand forecasting using Intel® Optimized Tensorflow*
Python
7
star
30

product-recommendations

AI Starter Kit for product recommendation system using Intel® Extension for Scikit-learn*
Python
6
star
31

ai-transcribe

AI Starter Kit for the implementation of an AI transcribe system using Intel® Extension for PyTorch*
Python
6
star
32

level-zero-intel-gpu

5
star
33

computational-fluid-dynamics

AI Starter Kit for fluid Flow Profiling using Intel® Optimized Tensorflow*
Python
5
star
34

disease-prediction

AI Starter Kit for the implementation of AI-based NLP Disease Prediction system using Intel® Extension for PyTorch* and Intel® Neural Compressor
Python
5
star
35

historical-assets-document-process

AI Starter Kit for Historical Assets document processing using Intel® Extension for Pytorch
Python
5
star
36

loan-default-risk-prediction

AI Starter Kit to predict probability of a loan default from client using Intel® optimized version of XGBoost
5
star
37

digital-twin

AI Starter Kit to build a MOSFET Digital Twin for Design Exploration using Intel® optimized version of XGBoost
Python
4
star
38

medical-imaging-diagnostics

AI Starter Kit for image-based abnormalities for different diseases classification using Intel® Optimized Tensorflow*
Python
4
star
39

network-intrusion-detection

AI Starter Kit for Network Intrusion Detection using Intel® Extension for Scikit-learn*
Python
4
star
40

structural-damage-assessment

AI Starter Kit for applications in Satellite Image processing using Intel® Extension for Pytorch
Python
4
star
41

ai-structured-data-generation

AI Starter Kit to generate structured synthetic data using Intel® Distribution of Modin
Python
3
star
42

purchase-prediction

AI Starter Kit for Purchase Prediction model using Intel® Extension for Scikit-learn*
Python
3
star
43

visual-quality-inspection

AI Starter Kit for Quality Visual Inspection using Intel® Extension for Pytorch
Python
3
star
44

customer-segmentation

AI Starter Kit for Customer Segmentation for Online Retail using Intel® Extension for Scikit-learn*
Python
3
star
45

powerline-fault-detection

AI Starter Kit for detect faulty signals in power line voltage using Intel® Extension for Scikit-learn*
Python
3
star
46

distributed-ranges-tutorial

C++
3
star
47

image-data-generation

AI Starter Kit for Synthetic Image Generation using Intel® Optimized Tensorflow*
Python
2
star
48

customer-chatbot

AI Starter Kit for Customer Chatbot using Intel® Extension for Pytorch
Python
2
star
49

unified-runtime-spec

2
star
50

visual-process-discovery

AI Starter Kit for Visual Process Discovery using Intel® Extension for Pytorch
Python
2
star
51

vertical-search-engine

AI Starter Kit for Semantic Vertical Search Engines using Intel® Extension for Pytorch
Python
2
star
52

document-automation

AI Starter Kit for Named Entity Recognition using Intel® Optimized Tensorflow (version 2.9.0 with oneDNN)
Python
2
star
53

voice-data-generation

AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch
Python
1
star
54

order-to-delivery-time-forecasting

AI Starter Kit of a delivery time forecasting solution using Intel® optimized version of XGBoost
1
star
55

customer-churn-prediction

AI Starter Kit for customer churn prediction using Intel® Extension for Scikit-learn*
Python
1
star
56

credit-card-fraud-detection

AI Starter Kit for Credit Card Fraud Detection model using Intel® Extension for Scikit-learn*
Python
1
star