84cc0124-1389-4285-9825-6efc837725dd

Detect Similar Documents Using C#

Discovering identical content or similar content is done using an algorithm called Finding Document Distance. Using this algorithm, applications for finding copyright infringements, detecting duplicate content and similar content can be effectively created. You can…


84cc0124-1389-4285-9825-6efc837725dd

Detect Similar Documents Using PHP

Discovering identical content or similar content is done using an algorithm called Finding Document Distance. Using this algorithm, applications for finding copyright infringements, detecting duplicate content and similar content can be effectively created. You can…




tumblr_n4fo75ieCR1qa3t7wo1_1280

Install Tokyo Tyrant in Linux

WHAT IS TOKYO TYRANT? Tokyo Cabinet (TC) is a data repository which is tailored for high speed retrieval. It is not a full fledged DBMS like Mysql or MongoDB . At its core, it is…


python

Recursive directory walker in Python

This code does a recursive listing of files and folders from a specified path. Though there are built in functions like glob() which can do this in one line, this is more to demonstrate recursion…