• Stars
    star
    499
  • Rank 88,341 (Top 2 %)
  • Language
    Jupyter Notebook
  • License
    Apache License 2.0
  • Created over 3 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

AutoX is an efficient automl tool, which is mainly aimed at data mining tasks with tabular data.

English | 简体中文 logo

AutoX是什么?

AutoX一个高效的自动化机器学习工具。 它的特点包括:

  • 效果出色: AutoX在多个kaggle数据集上,效果显著优于其他解决方案(见效果对比)。
  • 简单易用: AutoX的接口和sklearn类似,方便上手使用。
  • 通用: 适用于分类和回归问题。
  • 自动化: 无需人工干预,全自动的数据清洗、特征工程、模型调参等步骤。
  • 灵活性: 各组件解耦合,能单独使用,对于自动机器学习效果不满意的地方,可以结合专家知识,AutoX提供灵活的接口。
  • 比赛上分点总结:整理并公开历史比赛的上分点。

AutoX包含什么内容

加入社区

AutoX社区

框架

autox_competition

autox_competition framework

autox_recommend

autox_recommend framework

autox_video

autox_video framework

如何为AutoX贡献

如何为AutoX贡献

目录

安装

github仓库安装

git clone https://github.com/4paradigm/autox.git
pip install ./autox

pip安装

## pip安装包可能更新不及时,建议用github安装方式安装最新版本
!pip install automl-x -i https://www.pypi.org/simple/

快速上手

社区案例

汽车销量预测

比赛案例

见demo文件夹

数据集下载链接:https://pan.baidu.com/s/1p38OuP8_FJp2P_wJwhdFiw?pwd=8mxf

效果对比

不同任务下的效果提升百分比

data_type 对比AutoGluon 对比H2o
binary classification 20.44% 2.98%
regression 37.54% 39.66%
time-series 28.40% 32.46%

详细数据集对比

data_type single-or-multi data_name metric AutoX AutoGluon H2o
binary classification single-table Springleaf auc 0.78865 0.61141 0.78186
binary classification-nlp single-table stumbleupon auc 0.87177 0.81025 0.79039
binary classification single-table santander auc 0.89196 0.64643 0.88775
binary classification multi-table IEEE accuracy 0.920809 0.724925 0.907818
regression single-table ventilator mae 0.755 8.434 4.221
regression single-table Allstate Claims Severity mae 1137.07885 1173.35917 1163.12014
regression single-table zhidemai mse 1.0034 1.9466 1.1927
regression single-table Tabular Playground Series - Aug 2021 rmse 7.87731 10.3944 7.8895
regression single-table House Prices rmse 0.13043 0.13104 0.13161
regression single-table Restaurant Revenue rmse 2133204.32146 31913829.59876 28958013.69639
regression multi-table Elo Merchant Category Recommendation rmse 3.72228 3.80801 22.88899
regression-ts single-table Demand Forecasting smape 13.79241 25.39182 18.89678
regression-ts multi-table Walmart Recruiting wmae 4660.99174 5024.16179 5128.31622
regression-ts multi-table Rossmann Store Sales RMSPE 0.13850 0.20453 0.35757
regression-cv single-table PetFinder rmse 20.1327 23.1732 21.0586

AutoX成就

企业支持

比赛获奖

TODO

功能开发完成后,发布相应的使用demo

  • 多分类任务

若有其他希望AutoX支持的功能,欢迎提issue! 欢迎填写用户调研问卷,让AutoX变得更好!

错误排查

错误信息 解决办法

More Repositories

1

OpenMLDB

OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.
C++
1,583
star
2

k8s-vgpu-scheduler

OpenAIOS vGPU scheduler for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory.
Go
261
star
3

openaios-platform

OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development platform built upon OpenAIOS for enterprises to develop and deploy AI applications for production.
Go
93
star
4

DemoApps

demo applications that show how to deploy offline feature engineering solutions to online in one minute with fedb and nativespark
Python
35
star
5

OpenEmbedding

OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.
C++
31
star
6

pskiplist

An implementation of the persistent skiplist based on Intel Optane Persistent Memory. It is with Intel's pmemkv as an storage engine
C++
14
star
7

FeatInsight

FeatInsight is a feature platform based on OpenMLDB
Vue
12
star
8

prpc

C++
7
star
9

tsfe

Time Series Feature Extraction for Federated Learning
Python
7
star
10

openmldb-docs-zh

This is the repo for OpenMLDB's Chinese documentations
Python
7
star
11

rfcs

RFCs for 4Paradigm Open Source Community
6
star
12

hybridsql-asserts

HybridSQL third-party libraries
Shell
3
star
13

parameter-server

C++
3
star
14

openmldb-exporter

OpenMLDB metric exporter for Prometheus
Python
2
star
15

openmldb-k8s

This is the tool to support running the OpenMLDB's online engine on Kubernetes
Smarty
2
star
16

OpenMLDBSQLEmulator

Java
1
star
17

openmldb-go-sdk

Official OpenMLDB Golang SDK
Go
1
star