• Stars
    star
    202
  • Rank 193,691 (Top 4 %)
  • Language
    Shell
  • Created over 4 years ago
  • Updated about 2 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Introducción a la Minería de Datos

CC5205

Repositorio del curso Minería de Datos dictado en el Departamento de Ciencias de la Computación de la Universidad de Chile

Slides y Videos

  1. Clase 1: Introducción Motivación, video 1, video 2

  2. Clase 2: Datos I, video 1, video 2

  3. Clase 3: Datos II , video 1, video 2, video 3

  4. Clase 4: Análisis Exploratorio de Datos, video 1, video 2, video 3, video 4

  5. Clase 5: Clasificación I, video 1, video 2

  6. Clase 6: Clasificación II - Framework, video 1, video 2, video 3, video 4

  7. Clase 7: Clasificación III - Algoritmos de Clasificación (árboles, KNN, Naive Bayes), video 1, video 2, video 3

  8. Clase 8: Clasificación IV - Support Vector Machines, video 1, video 2, video 3, video 4

  9. Clase 9: Clasificación V - Caso de Estudio: Rumores en Twitter, paper, video

  10. Clase 10: Clustering I - Introducción, video 1

  11. Clase 11: Clustering II - Algoritmos de Clustering, video 1, video 2, video 3, video 4

  12. Clase 12: Clustering III - Validación de Clusters, video

  13. Clase 13: Reglas de Asociación, video 1, video 2, video 3, video 4

  14. Clase 15: Selección y Reducción de Atributos, video 1, video 2, video 3

  15. Clase 16: Modelos Lineales y Redes Neuronales, video 1, video 2

Material Extra

  1. Material Extra 1: Repaso Matemático

  2. Material Extra 2: Límites Estadísticos de la Minería de Datos

  3. Material Extra 3: Clustering de Series de Tiempo

  4. Material Extra 4: Clustering - Casos de Estudio

  5. Material Extra 5: Privacidad en Minería de Datos

Links

  1. Libro: Introduction to Data Mining (Second Edition)
  2. Repositorio antiguo del curso por Mauricio Quezada
  3. Proyectos de años anteriores
  4. Hands-on Machine Learning with Scikit-Learn, Keras and TensorFlow: Notebooks
  5. Perfil de Hans Rosling en TED
  6. Python Machine Learning book code repository
  7. Machine learning examples: A collection of machine learning examples and tutorials
  8. KDnuggets: sitio Web muy popular sobre DM, ML, AI, etc
  9. Centroid Initialization Methods for k-means Clustering - KDnuggets
  10. Nested Cross-Validation for Machine Learning with Python
  11. Mathematics for Machine Learning
  12. FAISS a library for very fast clustering
  13. Data Transformation: Standardization vs Normalization
  14. Machine learning sucks at covid by Cory Doctorow

More Repositories

1

beto

BETO - Spanish version of the BERT model
491
star
2

spanish-word-embeddings

Spanish word embeddings computed with different methods and from different corpora
355
star
3

CC6205

Natural Language Processing
TeX
230
star
4

CC6204

Material del curso de Deep Learning de la Universidad de Chile
Jupyter Notebook
197
star
5

wefe

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Python
173
star
6

CC6104

Teaching material of the course "Statistical Thinking" of the Department of Computer Science at the University of Chile.
TeX
97
star
7

lightweight-spanish-language-models

ALBETO and DistilBETO are versions of ALBERT and DistilBERT pre-trained exclusively on Spanish corpora.
Python
29
star
8

rivertext

RiverText is a framework that standardizes the Incremental Word Embeddings proposed in the state-of-art. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Python
18
star
9

GLUES

Resources for GLUE benchmark in Spanish
15
star
10

PracticaProfesional

Everything related to practica profesional
11
star
11

relela

Representations for Learning and Language
HTML
8
star
12

speedy-gonzales

Code for "Speedy Gonzales: A Collection of Fast Task-Specific Models for Spanish"
HTML
7
star
13

SNEC

Special Needs Education Corpus project
Jupyter Notebook
2
star
14

RiverText

Machine Learning for Text Sreams
2
star
15

word-embeddings-benchmarks

Python
1
star