• Stars
    star
    6
  • Rank 2,529,666 (Top 51 %)
  • Language
    Java
  • License
    Apache License 2.0
  • Created over 12 years ago
  • Updated almost 12 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Packages the ARCInputFormat used in Common Crawl in a small jar file that can be used in MapReduce jobs. Implements HdfsARCSource. See README for details