🇨🇳 Made in China

Discover China's Leading Open Source Projects: Explore top-notch open source initiatives hailing from the vibrant tech community of China.

TOP Scala Projects

1
lw-lin/CoolplaySpark

lw-lin/CoolplaySpark

酷玩 Spark: Spark 源代码解析、Spark 类库等
Scala
3,430
star
2
geekyouth/SZT-bigdata

geekyouth/SZT-bigdata

深圳地铁大数据客流分析系统🚇🚄🌟
Scala
2,250
star
3
smallnest/C1000K-Servers

smallnest/C1000K-Servers

⚡ High performance websocket servers implemented by Spray-can, Netty, undertow, jetty, Vert.x, Grizzly, node.js and Go. It supports 1,200,000 active websocket connections
Scala
1,506
star
4
jacksu/utils4s

jacksu/utils4s

scala、spark使用过程中,各种测试用例以及相关资料整理
Scala
1,086
star
5
tminglei/slick-pg

tminglei/slick-pg

Slick extensions for PostgreSQL
Scala
838
star
6
XiaoMi/MiNLP

XiaoMi/MiNLP

XiaoMi Natural Language Processing Toolkits
Scala
781
star
7
CSUG/HouseMD

CSUG/HouseMD

HouseMD is an awesome diagnosing tool better than BTrace
Scala
700
star
8
MethodJiao/PkpmSpark

MethodJiao/PkpmSpark

awesome 三维数据挖掘 数据分析 & 推荐
Scala
620
star
9
xubo245/SparkLearning

xubo245/SparkLearning

Learning Apache spark,including code and data .Most part can run local.
Scala
602
star
10
Centaur/repox

Centaur/repox

Make sbt more responsive
Scala
468
star
11
luochana/News_recommend

luochana/News_recommend

基于Spark的新闻推荐系统,包含爬虫项目、web网站以及spark推荐系统
Scala
341
star
12
Ldpe2G/DeepLearningForFun

Ldpe2G/DeepLearningForFun

Implementation of some interesting ideas of deeplearning.
Scala
339
star
13
baolibin/Bigdata

baolibin/Bigdata

大数据处理相关技术学习之路(持续更新中...)。 Bigdata整理 --> 慢慢滴~ 大数据相关技术包括离线处理,实时处理,OLAP等,如hadoop、spark、flink、hive、hbase、oozie...以及大数据项目,如用户画像、数据仓库等,欢迎感兴趣的小伙伴一起来开发...
Scala
264
star
14
scalad/LayIM

scalad/LayIM

基于HTML5 WebSocket的一款IM即时通讯软件,使用Gradle集成了Scala、SpringBoot、Spring MVC、Mybatis、Redis等,前端使用了LayIm框架
Scala
262
star
15
zhengruifeng/spark-libFM

zhengruifeng/spark-libFM

An implement of Factorization Machines (LibFM)
Scala
247
star
16
LeechanX/Netflix-Recommender-with-Spark

LeechanX/Netflix-Recommender-with-Spark

基于Apache Spark的Netflix电影的离线与实时推荐系统
Scala
247
star
17
titicaca/spark-iforest

titicaca/spark-iforest

Isolation Forest on Spark
Scala
227
star
18
smallnest/douban-recommender

smallnest/douban-recommender

基于Spark ML实现的豆瓣电影推荐系统
Scala
224
star
19
neoremind/kraps-rpc

neoremind/kraps-rpc

A RPC framework leveraging Spark RPC module
Scala
212
star
20
Qihoo360/XSQL

Qihoo360/XSQL

Unified SQL Analytics Engine Based on SparkSQL
Scala
210
star
21
cookeem/CookIM

cookeem/CookIM

Distributed web chat application base websocket built on akka.
Scala
209
star
22
daizikaikou/learningSpark

daizikaikou/learningSpark

学习spark写的scala代码,工具使用的是IDEA2017.1.6,欢迎star
Scala
208
star
23
SidneyXu/AndroidDemoIn4Languages

SidneyXu/AndroidDemoIn4Languages

Comparison among Java, Groovy, Scala, Kotlin in Android Development.
Scala
195
star
24
eryk/squant

eryk/squant

SQuant是使用scala语言编写的量化开发工具箱,提供开箱即用的A股股票数据和外汇数据(docker镜像),以及高效的回测框架与交易模块。方便Java/Scala爱好者进行量化投资研究。 QQ群:281599099,微信公众号:Python量化交易实战。对,我已经转python了。。。
Scala
186
star
25
LinMingQiang/sparkstreaming

LinMingQiang/sparkstreaming

💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Scala
182
star
26
qifun/stateless-future

qifun/stateless-future

Asynchronous programming in fully featured Scala syntax.
Scala
177
star
27
yaooqinn/spark-authorizer

yaooqinn/spark-authorizer

A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apache Kyuubi
Scala
171
star
28
aliyun/aliyun-emapreduce-datasources

aliyun/aliyun-emapreduce-datasources

Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Scala
168
star
29
JerryLead/SparkLearning

JerryLead/SparkLearning

Learning to write Spark examples
Scala
156
star
30
youzan/gatling-dubbo

youzan/gatling-dubbo

A gatling plugin for running load tests on Apache Dubbo(https://github.com/apache/incubator-dubbo) and other java ecosystem.
Scala
149
star
31
TianLangStudio/DataXServer

TianLangStudio/DataXServer

为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
Scala
144
star
32
LoveLonelyTime/Bergamot

LoveLonelyTime/Bergamot

An exquisite superscalar RV32GC processor.
Scala
138
star
33
qindongliang/streaming-offset-to-zk

qindongliang/streaming-offset-to-zk

一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目
Scala
135
star
34
liguohua-bigdata/simple-flink

liguohua-bigdata/simple-flink

Scala
133
star
35
xiaogp/recsys_spark

xiaogp/recsys_spark

Spark SQL 实现 ItemCF,UserCF,Swing,推荐系统,推荐算法,协同过滤
Scala
131
star
36
19801201/SpinalHDL_CNN_Accelerator

19801201/SpinalHDL_CNN_Accelerator

CNN accelerator implemented with Spinal HDL
Scala
131
star
37
alibaba/SparkCube

alibaba/SparkCube

SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
Scala
130
star
38
chucheng92/SwordOffer

chucheng92/SwordOffer

🔥剑指offer题解(Java & Scala实现)
Scala
127
star
39
STHSF/TextRank

STHSF/TextRank

基于PageRank的TextRank方法, 可以应用于中文关键词、短语、摘要提取程序,代码使用Scala编写。
Scala
123
star
40
Qihoo360/XLearning-XDML

Qihoo360/XLearning-XDML

extremely distributed machine learning
Scala
123
star
41
SidneyXu/JGSK

SidneyXu/JGSK

Java,Groovy,Scala,Kotlin 四种语言的特点对比
Scala
122
star
42
dyweb/scrala

dyweb/scrala

Unmaintained 🐳 ☕ 🕷️ Scala crawler(spider) framework, inspired by scrapy, created by @gaocegege
Scala
113
star
43
JerryLead/ApacheSparkBook

JerryLead/ApacheSparkBook

Scala
107
star
44
BaiGang/spark_multiboost

BaiGang/spark_multiboost

An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.
Scala
107
star
45
Kent7306/akkaflow

Kent7306/akkaflow

akkaflow是一个基于akka架构上构建的分布式高可用DAG工作流调度工具,可以把子节点分配在集群机器上并行执行,高效利用集群资源。
Scala
106
star
46
aliyun/MaxCompute-Spark

aliyun/MaxCompute-Spark

MaxCompute spark demo for building a runnable application.
Scala
106
star
47
PasaLab/marlin

PasaLab/marlin

A Distributed Matrix Operations Library Built on Top of Spark
Scala
105
star
48
qingmang-team/chanamq

qingmang-team/chanamq

Open source AMQP messaging broker based on Akka
Scala
103
star
49
IronmanJay/UserBehaviorAnalysis

IronmanJay/UserBehaviorAnalysis

模拟电商系统上线运行一段时间后,根据收集到大量的用户行为数据,利用大数据技术(Flink)进行深入挖掘和分析,进而得到感兴趣的商业指标并增强对风险的控制。 整体可以分为用户行为习惯数据和业务行为数据两大类。用户的行为习惯数据包括了用户的登录方式、上线的时间点及时长、点击和浏览页面、页面停留时间以及页面跳转等等,从中进行流量统计和热门商品的统计,并深入挖掘用户的特征;业务行为数据分为两类:一类是能够明显地表现出用户兴趣的行为,比如对商品的收藏、喜欢、评分和评价,对数据进行深入分析,得到用户画像,进而对用户给出个性化的推荐商品列表;另一类则是常规的业务操作,关注异常状况以做好风控,比如登录和订单支付。
Scala
102
star
50
qiniu/QStreaming

qiniu/QStreaming

A simplified, lightweight ETL pipeline framework for build stream/batch processing applications on top of Apache Spark
Scala
101
star
51
GuoNingNing/fire-spark

GuoNingNing/fire-spark

Spark 脚手架工程,标准化 spark 开发、部署、测试流程。
Scala
93
star
52
zlb1028/learning-flink

zlb1028/learning-flink

Scala
91
star
53
wangzaixiang/scala-sql

wangzaixiang/scala-sql

scala SQL api
Scala
89
star
54
titicaca/spark-gbtlr

titicaca/spark-gbtlr

Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Scala
88
star
55
xiaofateng/BinlogUpdatetoHive

xiaofateng/BinlogUpdatetoHive

mysql数据实时增量导入hive
Scala
86
star
56
share23/Food_Recommender

share23/Food_Recommender

基于 Spark Streaming + ALS 的餐饮推荐系统
Scala
85
star
57
oeljeklaus-you/SparkCore

oeljeklaus-you/SparkCore

Spark源码分析,主要包含SparkContext源码、Executor进程启动、Stage划分、Task执行和Spark2.0的新特性
Scala
82
star
58
howardlau1999/yatcpu

howardlau1999/yatcpu

Yet another toy CPU.
Scala
82
star
59
xlturing/spark-journey

xlturing/spark-journey

spark实例代码
Scala
79
star
60
qf6101/topwords

qf6101/topwords

Implementation of paper: Deng K, Bol P K, Li K J, et al. On the unsupervised analysis of domain-specific Chinese texts[J]. Proceedings of the National Academy of Sciences, 2016: 201516510.
Scala
77
star
61
Centaur/scalaconsole

Centaur/scalaconsole

Scala REPL in a GUI
Scala
74
star
62
hibayesian/spark-fm

hibayesian/spark-fm

A parallel implementation of factorization machines based on Spark
Scala
73
star
63
Ldpe2G/PCANet

Ldpe2G/PCANet

convert the matlab code of PCANet to C++ & Scala
Scala
71
star
64
aiyanbo/sbt-dependency-updates

aiyanbo/sbt-dependency-updates

⬆️ SBT plugin that can check Maven and Ivy repositories for dependency and plugin updates
Scala
70
star
65
TsinghuaDatabaseGroup/AI4DBCode

TsinghuaDatabaseGroup/AI4DBCode

Codes for building an AI-native database
Scala
69
star
66
wanghan0501/UserSessionBehaviorOfflineAnalysis

wanghan0501/UserSessionBehaviorOfflineAnalysis

四川大学拓思爱诺用户session行为数据离线分析项目
Scala
67
star
67
xieyuheng/study

xieyuheng/study

Study of language design and implementation.
Scala
67
star
68
rison168/spark-profile-tags

rison168/spark-profile-tags

基于Spark企业级用户画像项目
Scala
67
star
69
massquantity/dismember

massquantity/dismember

Advanced Retrieval Algorithms for Decomposing Large-Scale Candidate Set into Pieces.
Scala
64
star
70
jrthe42/aloha

jrthe42/aloha

Aloha: a distributed task scheduling and management framework
Scala
64
star
71
wulei-bj-cn/potatoes

wulei-bj-cn/potatoes

Scala
63
star
72
scalad/SpringBoot-Scala

scalad/SpringBoot-Scala

可以说近几年Spark的流行带动了Scala的发展,它集成了面向对象编程和函数式编程的各种特性,Scala具有更纯Lambda表粹的函数式业务逻辑解决方案,其语法比Java8后Lambda更加简洁方便,SpringBoot为Spring提供了一种更加方便快捷的方式,不再要求写大量的配置文件,作为一名Scala爱好者,使用SpringBoot结合Scala将大大节省我们开发的时间以及代码量
Scala
61
star
73
wulei-bj-cn/learn-spark

wulei-bj-cn/learn-spark

Scala
61
star
74
godpan/akka-demo

godpan/akka-demo

some demo for akka
Scala
60
star
75
molikto/mlang

molikto/mlang

Towards changing things and see if it proofs
Scala
59
star
76
goodrain/realtime-message-system

goodrain/realtime-message-system

Based akka distributed real-time message exchange system
Scala
58
star
77
notyy/scalaSnippet

notyy/scalaSnippet

在工作中和各种scala培训中积累的代码片段
Scala
55
star
78
yaooqinn/spark-ranger

yaooqinn/spark-ranger

已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.
Scala
54
star
79
yizt/aiia_elec_miner

yizt/aiia_elec_miner

“AIIA”杯-国家电网-电力专业领域词汇挖掘
Scala
54
star
80
liumingmusic/HadoopLearning

liumingmusic/HadoopLearning

全套大数据基础学习教程,包含最基础的centos、maven。大数据主要包含hdfs、mr、yarn、hbase、kafka、scala、sparkcore、sparkstreaming、sparksql。教程包含所有的源代码演示以及在线文档说明。
Scala
53
star
81
xubo245/CarbonDataLearning

xubo245/CarbonDataLearning

Apache CarbonData Learning
Scala
53
star
82
yaooqinn/itachi

yaooqinn/itachi

A library that brings useful functions from various modern database management systems to Apache Spark
Scala
53
star
83
JerryCatLeung/deepwalk_node2vector_eges

JerryCatLeung/deepwalk_node2vector_eges

将deepwalk、node2vector和阿里的文章:Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba 用代码实现
Scala
52
star
84
JoeWoo/hadoop-spark-hive-cluster-docker

JoeWoo/hadoop-spark-hive-cluster-docker

hadoop-spark-hive-cluster-docker
Scala
52
star
85
renchunxiao/scala-learn

renchunxiao/scala-learn

scala 编程的基础知识,以及 快学scala 书中的习题
Scala
52
star
86
cloudwu/efkbgfx

cloudwu/efkbgfx

A bgfx renderer for effekseer runtime
Scala
50
star
87
xlturing/spark-streaming-action

xlturing/spark-streaming-action

The code of book: Spark Streaming Action
Scala
49
star
88
pkeropen/BigData-News

pkeropen/BigData-News

基于Spark2.2新闻网大数据实时系统项目
Scala
49
star
89
thestyleofme/user-behavior-analysis

thestyleofme/user-behavior-analysis

基于flink的用户行为分析
Scala
49
star
90
thestyleofme/flink-explore

thestyleofme/flink-explore

基于canal/kafka conenct的mysql/oracle数据实时同步、flink rest api、flink sql以及udf
Scala
49
star
91
frb502/spark-skewed-join-hint

frb502/spark-skewed-join-hint

SparkSQL自定义Hint优化器解决热点数据导致JOIN数据倾斜问题
Scala
48
star
92
jxnu-liguobin/SpringBoot-SecKill-Scala

jxnu-liguobin/SpringBoot-SecKill-Scala

Scala语言实现的慕课网秒杀系统增强版(含Java版),Scala v1
Scala
47
star
93
YCG09/xgbspark-text-classification

YCG09/xgbspark-text-classification

XGBoost on Spark for Chinese Text Classification
Scala
47
star
94
sjyttkl/spark_learning

sjyttkl/spark_learning

尚硅谷大数据Spark-2019版最新 Spark 学习
Scala
47
star
95
zhangslob/learning-spark

zhangslob/learning-spark

零基础学习spark,大数据学习
Scala
46
star
96
foldright/sbt-one-log

foldright/sbt-one-log

🌳 sbt-one-log resolve the logging dependencies chaos in your development, just make logging work as you expect and follow the best practice, automatically.
Scala
46
star
97
ojlm/pea

ojlm/pea

分布式压测引擎. A distributed stress tool based on gatling
Scala
45
star
98
chensoul/learning-spark

chensoul/learning-spark

Learning to write Spark examples
Scala
44
star
99
TopSpoofer/hbrdd

TopSpoofer/hbrdd

一个为spark批量导入数据到hbase的库
Scala
43
star
100
jizhang/spark-sandbox

jizhang/spark-sandbox

A playground for Spark jobs.
Scala
43
star