This project was a contracting work to add an implementation of the Daitch-Mokotoff Soundex codec to the Apache Commons Codec project in order to use it as a token filter within lucene.

The project available at github is a standalone version to be used together with lucene.

The actual implementation was merged into the commons-codec repo.

Information about the filter can be found in the latest documentation for lucene / solr.

The enable this filter you can add the following snippet to your solr configuration:

  <tokenizer class="solr.StandardTokenizerFactory"/>
  <filter class="solr.DaitchMokotoffSoundexFilterFactory" inject="true"/>