• Stars
    star
    4
  • Rank 3,304,323 (Top 66 %)
  • Language
    Jupyter Notebook
  • Created over 4 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

This project can classify a website from 11 different classes based on the text and metadata content scraped from the website. The machine learning model used for classification is Logistic Regression. It was trained on the text and metadata content from 59K+ websites using TF-IDF features after basic text pre-processing. It got an accuracy of 77.9%