There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Repository Details
This project can classify a website from 11 different classes based on the text and metadata content scraped from the website. The machine learning model used for classification is Logistic Regression. It was trained on the text and metadata content from 59K+ websites using TF-IDF features after basic text pre-processing. It got an accuracy of 77.9%