Repository Details
Packages the ARCInputFormat used by Common Crawl into a small JAR file that can be used in MapReduce jobs. Implements HdfsARCSource. See the README for details.