Sherlock is a universal, extensible system for collecting documents distributed across the network (e.g., on the World-Wide Web), indexing them, and offering full-text search capabilities. It is designed to run under any UNIX-like operating system, although it has been tested only on Linux.
Sherlock has been developed by Martin Mares and Pavel Machek. Please send all comments, bug reports and suggestions to the authors.
If you want to see how Sherlock really works or to use it for indexing your own web site, just download the freeware version. Although it doesn't offer all the capabilities of the full version running here, it's still very useful and it has smaller resource requirements.
You can also look at a paper explaining the internal operation of the search engine.
The Sherlock system is modular. Currently implemented modules include:
gived
.