• Stars
    star
    166
  • Rank 226,533 (Top 5 %)
  • Language
    Python
  • Created about 9 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Topological Data Analysis in Python

PyTDA - Topological Data Analysis (TDA) for Python

Important Notice

This repository is NOT a Python package. Codes in this repository are for demonstration and described in the blog entries listed below. And the codes in this repository run in Python 2.7 only.

However, there will be an optimized code found in the package mogutda, and you can refer to the codes in my another repository: MoguTDA You can also install the package mogutda by typing on the command prompt:

pip install -U mogutda

The package mogutda runs in Python 2.7, 3.5, and 3.6.

Introduction

PyTDA contains Python codes that demonstrate the numerical calculation of algebraic topology in an application to topological data analysis (TDA).

Topological data analysis aims at studying the shapes of the data, and draw some insights from them. A lot of machine learning algorithms deal with distances, which are extremely useful, but they miss the information the data may carry from their geometry.

Demo Codes and Blog Entries

Codes in this repository are demo codes for a few entries of my blog, Everything about Data Analytics, and the corresponding entries are:

Wolfram Demonstration

Richard Hennigan put a nice Wolfram Demonstration online explaining what the simplicial complexes are, and how homologies are defined:

Other TDA Packages

It is recommended that for real application, you should use the following packages for efficiency, because my codes serve the pedagogical purpose only.

C++

Python

R

References

  • Afra J. Zomorodian. Topology for Computing (New York, NY: Cambridge University Press, 2009). [Amazon]
  • Afra J. Zomorodian. "Topological Data Analysis," Proceedings of Symposia in Applied Mathematics (2011). [link]
  • Afra Zomorodian, Gunnar Carlsson, “Computing Persistent Homology,” Discrete Comput. Geom. 33, 249-274 (2005). [pdf]
  • Gunnar Carlsson, “Topology and Data”, Bull. Amer. Math. Soc. 46, 255-308 (2009). [link]
  • P. Y. Lum, G. Singh, A. Lehman, T. Ishkanov, M. Vejdemo-Johansson, M. Alagappan, J. Carlsson, G. Carlsson, “Extracting insights from the shape of complex data using topology”, Sci. Rep. 3, 1236 (2013). [link]
  • Robert Ghrist, “Barcodes: The persistent topology of data,” Bull. Amer. Math. Soc. 45, 1-15 (2008). [pdf]

More Repositories

1

PyShortTextCategorization

Various Algorithms for Short Text Mining
Python
467
star
2

MoguTDA

Topological Data Analysis in Python: Simplicial Complex
Python
97
star
3

PyWMD

Word Mover's Distance (WMD) in Python
Jupyter Notebook
36
star
4

pyqentangle

Quantum Entanglement in Python
Python
19
star
5

MoguNumerics

Python, a collection of simple numerical routines
Python
10
star
6

econ_inequality

Gini coefficients
Python
9
star
7

GraphFlow

Algorithms for Graph Flow Analysis
Python
9
star
8

TextMiningInR

Sample text mining procedures in R
R
5
star
9

RQEntangle

Quantum Entanglement in R
R
5
star
10

AprioriSc

Scala implementation of Apriori Algorithm
Scala
4
star
11

finance-tools

Python
4
star
12

TFNumPy

Numerical Python using TensorFlow
Python
3
star
13

PyBibleNLP2

Bible NLP and IR with Python
Python
3
star
14

ml-experiment

Machine Learning Experiment Framework
Python
3
star
15

SammonEmbedding

Sammon Embedding
Python
2
star
16

Word2VecModelTextWrangling

Reading Word2Vec Model in text format
Java
2
star
17

FinanceSimulation

Financial Simulation
Python
2
star
18

IPythonNotebooks

Jupyter Notebook
2
star
19

JShortTextMining

Short text mining in Java
Scala
1
star
20

WordNetSimilarity

Demonstration of using WS4J package
Scala
1
star
21

WebScrapingTasks

Some web scraping tasks
Python
1
star
22

SSSpend

Python
1
star
23

python-cloud-api

Python Cloud API
Python
1
star
24

canonical_ensemble

Simulating canonical ensemble
Python
1
star
25

OptionPricing

Option Pricing Project
Python
1
star
26

PyReadability

Readability of a text: calculation of various metrics
Python
1
star
27

RSpellCorrection

Spell Corrector, an R Wrapper on the Scala implementation
R
1
star
28

ZeiselSingleCellToSCEObject

Conversion of original Zeisel's single-cell experiment dataset to SingleCellExperiment object
R
1
star
29

ImageSimilarities

Calculate the similarities between images
Python
1
star
30

PyBibleNLP

Extracting Bible verses for natural language processing study
Python
1
star
31

GaussianMixtures

Gaussian Mixtures and EM
Python
1
star
32

rInfo

R Package for Information Metrics
R
1
star
33

KerasLogicGates

Implementation of logic gates using neural network (with Keras)
Python
1
star