• Stars
    star
    1,538
  • Rank 29,243 (Top 0.6 %)
  • Language
    Java
  • License
    Apache License 2.0
  • Created almost 11 years ago
  • Updated over 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

for千亿数据即席分析

项目简介

    数据越来越多,传统的关系型数据库支撑不了,分布式数据仓库又非常贵。几十亿、几百亿、甚至几千亿的数据量,如何才能高效的分析?
mdrill是由阿里妈妈开源的一套数据的软件,针对TB级数据量,能够仅用10台机器,达到秒级响应,数据能实时导入,可以对任意的维度进行组合与过滤。
    mdrill作为数据在线分析处理软件,可以在几秒到几十秒的时间,分析百亿级别的任意组合维度的数据。
在阿里10台机器完成每日30亿的数据存储,其中10亿为实时的数据导入,20亿为离线导入。目前集群的总存储1000多亿80~400维度的数据。
目前有阿里、腾讯、京东、联想、一号店、美团、大街网、亚信、恒隆兴等多家公司在使用。

mdrill的特性

1.满足大数据查询需求:adhoc每天的数据量为30亿条,随着日积月累,数据会越来越大,mdrill采用列存储,索引,分布式技术,适当的分区等满足用户对数据的实时在线分析的需求。
2.支持增量更新:离线形式的mdrill数据支持按照分区方式的增量更新。
3.支持实时数据导入:在仅有10台机器的情况下,支持每天10亿级别(高峰每小时2亿)的实时导入。
4.响应时间快:列存储、倒排索引、高效的数据压缩、内存计算,各种缓存、分区、分布式处理等等这些技术,使得mdrill可以仅在几秒到几十秒的时间分析百亿级别的数据。
5.低成本:目前在阿里adhoc仅仅使用10台48G内存的PC机,但确存储了超过千亿规模的数据。

版本下载

版本下载

资源列表

mdrill contributors

jstorm Core contributors 点击进入

mdrill数据量的增长

时间点

数据量

事件

12年12月

小于2亿

adhoc首次上线

13年1月

20~30亿

由2台机器扩容到了10台

13年5月2日

100亿

首次过百亿

13年7月24日

400亿

首次开源

13年11月

1000亿

全文检索模式ods_allpv_ad_d上线

13年12月

1500亿

实时数据以及无线数据的接入

14年2月

3200亿

11台机器,硬盘使用率30%

14年3月28日

4900亿

11台机器,硬盘使用率60%

其他

More Repositories

1

arthas

Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas
Java
34,428
star
2

easyexcel

快速、简洁、解决大文件内存溢出的java处理Excel工具
Java
30,946
star
3

p3c

Alibaba Java Coding Guidelines pmd implements and IDE plugin
Kotlin
29,294
star
4

nacos

an easy-to-use dynamic service discovery, configuration and service management platform for building cloud native applications.
Java
28,956
star
5

canal

阿里巴巴 MySQL binlog 增量订阅&消费组件
Java
27,786
star
6

druid

阿里云计算平台DataWorks(https://help.aliyun.com/document_detail/137663.html) 团队出品,为监控而生的数据库连接池
Java
27,644
star
7

spring-cloud-alibaba

Spring Cloud Alibaba provides a one-stop solution for application development for the distributed solutions of Alibaba middleware.
Java
27,254
star
8

fastjson

FASTJSON 2.0.x has been released, faster and more secure, recommend you upgrade.
Java
25,603
star
9

flutter-go

flutter 开发者帮助 APP,包含 flutter 常用 140+ 组件的demo 演示与中文文档
Dart
23,552
star
10

Sentinel

A powerful flow control component enabling reliability, resilience and monitoring for microservices. (面向云原生微服务的高可用流控防护组件)
Java
21,947
star
11

weex

A framework for building Mobile cross-platform UI
C++
18,204
star
12

ice

🚀 ice.js: The Progressive App Framework Based On React(基于 React 的渐进式应用框架)
TypeScript
17,772
star
13

DataX

DataX是阿里云DataWorks数据集成的开源版本。
Java
14,952
star
14

ARouter

💪 A framework for assisting in the renovation of Android componentization (帮助 Android App 进行组件化改造的路由框架)
Java
14,228
star
15

lowcode-engine

An enterprise-class low-code technology stack with scale-out design / 一套面向扩展设计的企业级低代码技术体系
TypeScript
13,869
star
16

hooks

A high-quality & reliable React Hooks library.
TypeScript
13,353
star
17

tengine

A distribution of Nginx with some advanced features
C
12,583
star
18

vlayout

Project vlayout is a powerfull LayoutManager extension for RecyclerView, it provides a group of layouts for RecyclerView. Make it able to handle a complicate situation when grid, list and other layouts in the same recyclerview.
Java
10,804
star
19

formily

📱🚀 🧩 Cross Device & High Performance Normal Form/Dynamic(JSON Schema) Form/Form Builder -- Support React/React Native/Vue 2/Vue 3
TypeScript
10,716
star
20

COLA

🥤 COLA: Clean Object-oriented & Layered Architecture
Java
9,964
star
21

ali-dbhub

已迁移新仓库,此版本将不再维护
8,454
star
22

MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
C++
8,307
star
23

atlas

A powerful Android Dynamic Component Framework.
Java
8,120
star
24

rax

🐰 Rax is a progressive framework for building universal application. https://rax.js.org
JavaScript
7,979
star
25

otter

阿里巴巴分布式数据库同步系统(解决中美异地机房)
Java
7,967
star
26

anyproxy

A fully configurable http/https proxy in NodeJS
JavaScript
7,726
star
27

fish-redux

An assembled flutter application framework.
Dart
7,341
star
28

AndFix

AndFix is a library that offer hot-fix for Android App.
C++
6,954
star
29

flutter_boost

FlutterBoost is a Flutter plugin which enables hybrid integration of Flutter for your existing native apps with minimum efforts
Dart
6,832
star
30

x-render

🚴‍♀️ 阿里 - 很易用的中后台「表单 / 表格 / 图表」解决方案
TypeScript
6,765
star
31

transmittable-thread-local

📌 TransmittableThreadLocal (TTL), the missing Java™ std lib(simple & 0-dependency) for framework/middleware, provide an enhanced InheritableThreadLocal that transmits values between threads even using thread pooling components.
Java
6,750
star
32

jvm-sandbox

Real - time non-invasive AOP framework container based on JVM
Java
6,601
star
33

BizCharts

Powerful data visualization library based on G2 and React.
TypeScript
6,066
star
34

freeline

A super fast build tool for Android, an alternative to Instant Run
Java
5,497
star
35

UltraViewPager

UltraViewPager is an extension for ViewPager to provide multiple features in a single ViewPager.
Java
5,004
star
36

jetcache

JetCache is a Java cache framework.
Java
4,774
star
37

AliSQL

AliSQL is a MySQL branch originated from Alibaba Group. Fetch document from Release Notes at bottom.
C++
4,689
star
38

AliOS-Things

面向IoT领域的、高可伸缩的物联网操作系统,可去官网了解更多信息https://www.aliyun.com/product/aliosthings
C
4,540
star
39

dexposed

dexposed enable 'god' mode for single android application.
Java
4,483
star
40

QLExpress

QLExpress is a powerful, lightweight, dynamic language for the Java platform aimed at improving developers’ productivity in different business scenes.
Java
4,361
star
41

BeeHive

🐝 BeeHive is a solution for iOS Application module programs, it absorbed the Spring Framework API service concept to avoid coupling between modules.
Objective-C
4,286
star
42

HandyJSON

A handy swift json-object serialization/deserialization library
Swift
4,185
star
43

x-deeplearning

An industrial deep learning framework for high-dimension sparse data
PureBasic
4,185
star
44

butterfly

🦋Butterfly,A JavaScript/React/Vue2 Diagramming library which concentrate on flow layout field. (基于JavaScript/React/Vue2的流程图组件)
JavaScript
4,168
star
45

Tangram-Android

Tangram is a modular UI solution for building native page dynamically including Tangram for Android, Tangram for iOS and even backend CMS. This project provides the sdk on Android.
Java
4,107
star
46

coobjc

coobjc provides coroutine support for Objective-C and Swift. We added await method、generator and actor model like C#、Javascript and Kotlin. For convenience, we added coroutine categories for some Foundation and UIKit API in cokit framework like NSFileManager, JSON, NSData, UIImage etc. We also add tuple support in coobjc.
Objective-C
4,014
star
47

jstorm

Enterprise Stream Process Engine
Java
3,917
star
48

dragonwell8

Alibaba Dragonwell8 JDK
Java
3,826
star
49

LuaViewSDK

A cross-platform framework to build native, dynamic and swift user interface - 强大轻巧灵活的客户端动态化解决方案
Objective-C
3,707
star
50

f2etest

F2etest是一个面向前端、测试、产品等岗位的多浏览器兼容性测试整体解决方案。
JavaScript
3,562
star
51

Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Java
3,479
star
52

GGEditor

A visual graph editor based on G6 and React
TypeScript
3,405
star
53

fastjson2

🚄 FASTJSON2 is a Java JSON library with excellent performance.
Java
3,353
star
54

cobar

a proxy for sharding databases and tables
Java
3,207
star
55

macaca

Automation solution for multi-platform. 多端自动化解决方案
3,159
star
56

designable

🧩 Make everything designable 🧩
TypeScript
3,120
star
57

GraphScope

🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
C++
3,103
star
58

lightproxy

💎 Cross platform Web debugging proxy
TypeScript
3,063
star
59

pont

🌉数据服务层解决方案
TypeScript
3,016
star
60

euler

A distributed graph deep learning framework.
C++
2,849
star
61

beidou

🌌 Isomorphic framework for server-rendered React apps
JavaScript
2,736
star
62

sentinel-golang

Sentinel Go enables reliability and resiliency for Go microservices
Go
2,684
star
63

pipcook

Machine learning platform for Web developers
TypeScript
2,497
star
64

kiwi

🐤 Kiwi-国际化翻译全流程解决方案
TypeScript
2,489
star
65

yugong

阿里巴巴去Oracle数据迁移同步工具(全量+增量,目标支持MySQL/DRDS)
Java
2,480
star
66

tsar

Taobao System Activity Reporter
C
2,446
star
67

jvm-sandbox-repeater

A Java server-side recording and playback solution based on JVM-Sandbox
Java
2,395
star
68

ChatUI

The UI design language and React library for Conversational UI
TypeScript
2,383
star
69

TProfiler

TProfiler是一个可以在生产环境长期使用的性能分析工具
Java
2,377
star
70

tidevice

tidevice can be used to communicate with iPhone device
Python
2,310
star
71

higress

Cloud Native API Gateway | 云原生API网关
Go
2,257
star
72

tair

A distributed key-value storage system developed by Alibaba Group
C++
2,128
star
73

dubbo-spring-boot-starter

Dubbo Spring Boot Starter
Java
2,099
star
74

RedisShake

redis-shake is a tool for synchronizing data between two redis databases. Redis-shake 是一个用于在两个 redis之 间同步数据的工具,满足用户非常灵活的同步、迁移需求。
Go
2,077
star
75

uirecorder

UI Recorder is a multi-platform UI test recorder.
JavaScript
2,052
star
76

LVS

A distribution of Linux Virtual Server with some advanced features. It introduces a new packet forwarding method - FULLNAT other than NAT/Tunneling/DirectRouting, and defense mechanism against synflooding attack - SYNPROXY.
C
1,947
star
77

EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
Python
1,946
star
78

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Python
1,910
star
79

alpha

Alpha是一个基于PERT图构建的Android异步启动框架,它简单,高效,功能完善。 在应用启动的时候,我们通常会有很多工作需要做,为了提高启动速度,我们会尽可能让这些工作并发进行。但这些工作之间可能存在前后依赖的关系,所以我们又需要想办法保证他们执行顺序的正确性。Alpha就是为此而设计的,使用者只需定义好自己的task,并描述它依赖的task,将它添加到Project中。框架会自动并发有序地执行这些task,并将执行的结果抛出来。
HTML
1,873
star
80

GCanvas

A lightweight cross-platform graphics rendering engine. (超轻量的跨平台图形引擎) https://alibaba.github.io/GCanvas
C
1,857
star
81

Tangram-iOS

Tangram is a modular UI solution for building native page dynamically, including Tangram for Android, Tangram for iOS and even backend CMS. This project provides the sdk on iOS platform.
Objective-C
1,857
star
82

testable-mock

换种思路写Mock,让单元测试更简单
Java
1,800
star
83

LazyScrollView

An iOS ScrollView to resolve the problem of reusability in views.
Objective-C
1,775
star
84

compileflow

🎨 core business process engine of Alibaba Halo platform, best process engine for trade scenes. | 一个高性能流程编排引擎
Java
1,705
star
85

SREWorks

Cloud Native DataOps & AIOps Platform | 云原生数智运维平台
Java
1,696
star
86

EasyCV

An all-in-one toolkit for computer vision
Python
1,677
star
87

MongoShake

MongoShake is a universal data replication platform based on MongoDB's oplog. Redundant replication and active-active replication are two most important functions. 基于mongodb oplog的集群复制工具,可以满足迁移和同步的需求,进一步实现灾备和多活功能。
Go
1,648
star
88

xquic

XQUIC Library released by Alibaba is a cross-platform implementation of QUIC and HTTP/3 protocol.
C
1,604
star
89

lowcode-demo

An enterprise-class low-code technology stack with scale-out design / 一套面向扩展设计的企业级低代码技术体系
TypeScript
1,536
star
90

ilogtail

Fast and Lightweight Observability Data Collector
C++
1,529
star
91

EasyRec

A framework for large scale recommendation algorithms.
Python
1,488
star
92

clusterdata

cluster data collected from production clusters in Alibaba for cluster management research
Jupyter Notebook
1,477
star
93

havenask

C++
1,463
star
94

async_simple

Simple, light-weight and easy-to-use asynchronous components
C++
1,455
star
95

Virtualview-Android

A light way to build UI in custom XML.
Java
1,454
star
96

kt-connect

A toolkit for Integrating with your kubernetes dev environment more efficiently
Go
1,453
star
97

tb_tddl

1,410
star
98

react-intl-universal

Internationalize React apps. Not only for Component but also for Vanilla JS.
JavaScript
1,316
star
99

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
Python
1,292
star
100

bindingx

🚀 Bind actions to effects.
JavaScript
1,249
star