• Stars
    star
    3
  • Rank 3,943,898 (Top 79 %)
  • Language
    Jupyter Notebook
  • Created almost 6 years ago
  • Updated over 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

In this project we want to implement a solution that is capable of grouping Wikipedia articles based on the articles content or other features. We want to find feasible metrics to cluster articles into differentcategories. In the beginning we want to find very general categories e.g. science, politics, sports,movies, etc. and if possible, we want specify the categories into more detail, e.g. football player, actionmovie, midterm election, etc.. For this approach we want to use clustering.Possible extension: After we created a cluster we want to find a way to propose tags of related clustersfor a new article which is not in the cluster itself