Nutch Logo

This project is not covered by Drupal’s security advisory policy.

Intro

This module is under active development for a Drupal 6 upgrade have a look at the roadmap/progress Nutch Roadmap

Nutch is a web crawler/indexer/search engine that is based on Lucene. It is a Java tool.

This module allows you to have basic control over the Nutch crawl lifecycle through the Drupal web interface.

Drupal 4.7

The Drupal 4.7 version is combined with the OpenSearch Client module and you can offer the search results through Drupal as well.

Drupal 6 and beyond...

The Drupal 6 version of module will integrate with Apache Solr Module allowing you to combine native Drupal indexes and crawled results

This is a work in progress and participation from Nutch and Drupal developers is alway welcomed.

There is no documentation yet, but if you are familiar with Drupal and have managed to get Nutch running from the Nutch documentation, you'll figure the rest out.

There is now a working group to discuss this module and other Nutch/Lucene related efforts: http://groups.drupal.org/lucene-and-nutch

This project is Sponsored by Axis Twelve Ltd

Project information

Releases