  • Stars: 101
  • Rank 335,933 (Top 7 %)
  • Language: HTML
  • License: Creative Commons ...
  • Created about 8 years ago
  • Updated about 1 year ago

Repository Details

R Code to accompany the book Introduction to Data Mining by Tan, Steinbach and Kumar (Code by Michael Hahsler)

R Companion for Introduction to Data Mining

This repository contains slides and documented R examples to accompany several chapters of the popular data mining textbook:

Pang-Ning Tan, Michael Steinbach, Anuj Karpatne and Vipin Kumar, Introduction to Data Mining, Addison Wesley, 1st or 2nd edition.

The slides and examples are used in my course CS 7331 - Data Mining, taught at SMU, and will be regularly updated and improved. The code examples are now compiled into the free online book An R Companion for Introduction to Data Mining, which is published under the Creative Commons Attribution license, so you can share and adapt them freely. Please open an issue for corrections or to suggest improvements.

Covered Chapters

Chapter                                                | Slides                    | R Code Companion | Sample Textbook Chapters
1. Introduction                                        | Slides                    | R Code           |
2. Data                                                | Slides: Data, Exploration | R Code           |
3. Classification: Basic Concepts and Techniques       | Slides                    | R Code           | Read Chapter 3
4. Classification: Alternative Techniques              | Slides                    | R Code           |
5. Association Analysis: Basic Concepts and Algorithms | Slides                    | R Code           | Read Chapter 5
7. Cluster Analysis: Basic Concepts and Algorithms     | Slides                    | R Code           | Read Chapter 7

Instructor Resources

  • PowerPoint presentation files for a data mining course can be found in the repository directory slides. The slides show an R symbol at the bottom wherever R code examples are available.
  • Datasets for projects: Datasets can be found at https://www.kaggle.com/datasets
  • More instructional material can be found on the course website of CS 7331 - Data Mining.

License

All code and documents in this repository are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

For questions please contact Michael Hahsler.

More Repositories

1. dbscan (C++, 278 stars): Density Based Clustering of Applications with Noise (DBSCAN) and Related Algorithms - R package
2. recommenderlab (R, 207 stars): recommenderlab - Lab for Developing and Testing Recommender Algorithms - R package
3. arules (R, 181 stars): Mining Association Rules and Frequent Itemsets with R
4. rBLAST (R, 99 stars): Interface for the Basic Local Alignment Search Tool (BLAST) - R-Package
5. seriation (R, 72 stars): Infrastructure for Ordering using Seriation - R Package
6. TSP (R, 61 stars): Traveling Salesperson Problem - R package
7. arulesViz (R, 52 stars): Visualizing Association Rules and Frequent Itemsets with R
8. CS7320-AI (Jupyter Notebook, 44 stars): Examples for an AI course following the textbook Artificial Intelligence: A Modern Approach by Russell and Norvig.
9. stream (R, 36 stars): A framework for data stream modeling and associated data mining tasks such as clustering and classification. - R Package
10. pomdp (R, 14 stars): R package for Partially Observable Markov Decision Processes
11. CS2341 (C++, 13 stars): Code Examples for Data Structures with C++
12. streamMOA (R, 12 stars): Interface for data stream clustering algorithms implemented in the MOA (Massive Online Analysis) framework.
13. rMSA (TeX, 8 stars): Interface to Popular Multiple Sequence Alignment Tools - R-package
14. arulespy (TeX, 7 stars): Python interface to arules for association rule mining
15. arulesNBMiner (Java, 6 stars): Mining NB-Frequent Itemsets and NB-Precise Rules - R Package
16. qap (Fortran, 4 stars): Heuristics for the Quadratic Assignment Problem (QAP) - R package
17. ShinyApp_DB_HelloWorld (R, 3 stars): An example for a simple ShinyApp that connects to a remote database (SQLServer)
18. mdp (R, 2 stars): R package for Discrete-Time Markov Decision Processes
19. fit_dist (R, 2 stars): Simple R script to fit distributions to data
20. rRDP (R, 1 star): Seamlessly interfaces RDP classifier.
21. streamConnect (R, 1 star): Connecting Stream Mining Components Using Web Services
22. rEMM (R, 1 star)
23. pomdpSolve (C, 1 star): Provides Cassandra's pomdp-solve program.