Search API Solr NLP adds support for Solr's OpenNLP integration to Search API Solr.

Due to the requirement for trained models for natural language processing that might take a lot of space, this functionality encapsulated in an optional module and not included in Search API Solr itself.

Search API Solr NLP adds some new fulltext field types to Solr config sets, for example "nouns". Using this field type you can filter any text for nouns. The resulting data stored in the index is great source for features like auto complete or spell check.

Installation

This module has to be installed via composer to pull its dependencies. These contain a set of pre-trained models for different languages from apache.org. The first download ever of these files takes some time. Composer isn't fast on fetching those files. So it isn't an error when the first time download takes longer! Afterwards these files are cached.

Your Solr hosting provider must allow you to provide or upload "your" Solr config-set. Once this module is installed, the generated config-sets of Search API Solr will automatically contain all required files.

The default distribution of Solr already contains the required OpenNLP libraries.

If you use Solr Cloud you might have to adjust a zookeeper setting. one way is to set the environment variable accordingly:
SOLR_OPTS=$SOLR_OPTS -Djute.maxbuffer=50000000

Supporting organizations: 
sponsored the 1.0.0 release
sponsors further development

Project information

Releases