There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Repository Details
This project contains some Hadoop code for working with the TREC Knowledge Base Acceleration dataset. In particular, it provides classes to read/write topic files, read/write run files, and expose the documents in the Thrift files as Hadoop-readable objects.