• Stars
    star
    1
  • Language
    Python
  • Created over 1 year ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A Big Data Architecture for Early Identification and Categorization of Dark Web Sites. The solution is built using Big Data technologies (Kubernetes, Kafka, Kubeflow, and MinIO), continuously discovering onion services in different sources, deduplicating them using MinHash LSH, and categorizing with the BERTopic topic modeling.