biblio-duplicate-detection
This repository addresses the detection of near-duplicate Russian-language bibliographic references. To obtain the references, it also solves an auxiliary task: extracting bibliographic references from scientific documents. To build a database of unique references, the references are indexed with a search engine.
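
The description above does not say which similarity measure is used, so the following is only a minimal sketch of one common approach to near-duplicate detection: character-shingle Jaccard similarity over normalized reference strings. The function names (`normalize`, `shingles`, `jaccard`, `is_near_duplicate`), the shingle size `k=3`, and the `0.7` threshold are illustrative assumptions, not the repository's actual API or settings.

```python
import re


def normalize(reference: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace.

    For str patterns, \\w is Unicode-aware, so Cyrillic letters are kept.
    """
    text = reference.lower()
    text = re.sub(r"[^\w\s]", " ", text)
    return " ".join(text.split())


def shingles(text: str, k: int = 3) -> set:
    """Return the set of character k-grams (shingles) of the text."""
    if len(text) < k:
        return {text} if text else set()
    return {text[i:i + k] for i in range(len(text) - k + 1)}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity |A ∩ B| / |A ∪ B|; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def is_near_duplicate(ref_a: str, ref_b: str, threshold: float = 0.7) -> bool:
    """Hypothetical decision rule: similarity at or above an assumed threshold."""
    sim = jaccard(shingles(normalize(ref_a)), shingles(normalize(ref_b)))
    return sim >= threshold


if __name__ == "__main__":
    # Two spellings of the same (made-up) reference, differing only in punctuation.
    a = "Иванов И. И. Методы поиска дубликатов. М.: Наука, 2010. 200 с."
    b = "Иванов И.И. Методы поиска дубликатов. - М.: Наука, 2010. - 200 с."
    print(is_near_duplicate(a, b))
```

Pairwise comparison like this scales quadratically with the number of references, which is one reason a search-engine index is useful: it can narrow each new reference down to a small set of candidates before the more expensive similarity check is run.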