UpdateRequestProcessor is a mechanism in Solr for changing documents as they are submitted for indexing. The processors provide advanced functions such as language identification, duplicate detection, intelligent defaults, integration with external text processing pipelines, and - most recently - schemaless schema mapping.
UpdateRequestProcessor factories (a.k.a. Update Request Processors or URPs) can be chained, and multiple chains can be defined for a Solr collection. A chain is assigned to a request handler with the update.chain parameter, which can be defined in the configuration file or passed as part of the URL. For the full syntax, consult the Solr Reference Guide.
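As a rough illustration of chaining, the solrconfig.xml fragment below defines one chain and assigns it to the /update handler. The chain name "mychain" is hypothetical, and the processors are only one possible combination; treat it as a sketch rather than a recommended setup.

```xml
<!-- A chain named "mychain" (hypothetical name) with three processors, run in order. -->
<updateRequestProcessorChain name="mychain">
  <!-- Trim whitespace from string values before anything else sees them. -->
  <processor class="solr.TrimFieldUpdateProcessorFactory"/>
  <!-- Log the update, then hand the document to the index. -->
  <processor class="solr.LogUpdateProcessorFactory"/>
  <processor class="solr.RunUpdateProcessorFactory"/>
</updateRequestProcessorChain>

<!-- Assign the chain to a request handler through the update.chain parameter. -->
<requestHandler name="/update" class="solr.UpdateRequestHandler">
  <lst name="defaults">
    <str name="update.chain">mychain</str>
  </lst>
</requestHandler>
```

The same chain can instead be selected per request by adding update.chain=mychain to the update URL.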
Here you will find the full set of UpdateRequestProcessor factories, presented in their inheritance hierarchy. Abstract classes that you cannot use directly are marked with underlined italics. Clicking on a class name opens the corresponding JavaDoc page.
Most of the UpdateRequestProcessor factories are located in solr-core-5.5.0.jar (dist/), so any entry without a location indicated can be found in that jar.
A factory to generate an UpdateRequestProcessor for each request.
Base class that can be extended by any UpdateRequestProcessorFactory designed to add a default value to the document in an AddUpdateCommand when that field is not already specified.
An update processor that adds a constant default value to any document being added that does not already have a value in the specified field.
An update processor that adds a newly generated Date value of "NOW" to any document being added that does not already have a value in the specified field.
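A minimal sketch of the two default-value processors described above, assuming they correspond to DefaultValueUpdateProcessorFactory and TimestampUpdateProcessorFactory. The field names "status" and "indexed_at" and the value "draft" are hypothetical.

```xml
<!-- Both elements belong inside an updateRequestProcessorChain. -->

<!-- Adds status=draft only when the document has no value for "status". -->
<processor class="solr.DefaultValueUpdateProcessorFactory">
  <str name="fieldName">status</str>
  <str name="value">draft</str>
</processor>

<!-- Adds the current time ("NOW") only when "indexed_at" is missing. -->
<processor class="solr.TimestampUpdateProcessorFactory">
  <str name="fieldName">indexed_at</str>
</processor>
```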
AddSchemaFieldsUpdateProcessorFactory (Sample mentions: 1)
This processor will dynamically add fields to the schema if an input document contains one or more fields that don't match any field or dynamic field in the schema.
Clones the values found in any matching source field into a configured dest field.
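For illustration, a cloning processor might be configured as follows, assuming this entry refers to CloneFieldUpdateProcessorFactory. The field names "title" and "title_copy" are hypothetical.

```xml
<!-- Goes inside an updateRequestProcessorChain. -->
<!-- Copies every value of "title" into "title_copy" before indexing. -->
<processor class="solr.CloneFieldUpdateProcessorFactory">
  <str name="source">title</str>
  <str name="dest">title_copy</str>
</processor>
```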
This Factory generates an UpdateProcessor that helps to enforce Version constraints on documents based on per-document version numbers using a configured name of a versionField.
Update Processor Factory for managing automatic "expiration" of documents.
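A sketch of an expiration setup, assuming this entry refers to DocExpirationUpdateProcessorFactory. The field names and the 300-second period are illustrative choices, not defaults taken from this page.

```xml
<!-- Goes inside an updateRequestProcessorChain. -->
<processor class="solr.processor.DocExpirationUpdateProcessorFactory">
  <!-- How often (in seconds) expired documents are deleted in the background. -->
  <int name="autoDeletePeriodSeconds">300</int>
  <!-- Per-document time-to-live, read from this field if present. -->
  <str name="ttlFieldName">_ttl_</str>
  <!-- Field that receives the computed expiration date. -->
  <str name="expirationFieldName">_expire_at_</str>
</processor>
```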
Base class for implementing Factories for FieldMutatingUpdateProcessors and FieldValueMutatingUpdateProcessors.
Concatenates multiple values for fields matching the specified conditions using a configurable delimiter which defaults to ", ".
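As an example of overriding the default delimiter, assuming this entry refers to ConcatFieldUpdateProcessorFactory; the field name "authors" is hypothetical.

```xml
<!-- Goes inside an updateRequestProcessorChain. -->
<!-- Joins all values of "authors" into a single value separated by "; ". -->
<processor class="solr.ConcatFieldUpdateProcessorFactory">
  <str name="fieldName">authors</str>
  <str name="delimiter">; </str>
</processor>
```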
Replaces any list of values for a field matching the specified conditions with the count of the number of values for that field.
Replaces any CharSequence values found in fields matching the specified conditions with the lengths of those CharSequences (as an Integer).
Base class for processors that want to mutate selected fields to only keep a subset of the original values.
Keeps only the first value of fields matching the specified conditions.
Keeps only the last value of fields matching the specified conditions.
An update processor that keeps only the maximum value from any selected fields where multiple values are found.
An update processor that keeps only the minimum value from any selected fields where multiple values are found.
Removes duplicate values found in fields matching the specified conditions.
Strips all HTML Markup in any CharSequence values found in fields matching the specified conditions.
Ignores & removes fields matching the specified conditions from any document being added to the index.
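"Matching the specified conditions" refers to the field selectors shared by the field-mutating processors. A small sketch using a regex selector, assuming this entry refers to IgnoreFieldUpdateProcessorFactory; the "tmp_" prefix is a hypothetical naming convention.

```xml
<!-- Goes inside an updateRequestProcessorChain. -->
<!-- Drops every field whose name starts with "tmp_" from incoming documents. -->
<processor class="solr.IgnoreFieldUpdateProcessorFactory">
  <str name="fieldRegex">tmp_.*</str>
</processor>
```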
Abstract base class for numeric parsing update processor factories.
Attempts to mutate selected fields that have only CharSequence-typed values into Float values.
Attempts to mutate selected fields that have only CharSequence-typed values into Integer values.
An update processor that parses configured fields of any document being added using PreAnalyzedField with the configured format parser.
An update processor that applies a configured regex to any CharSequence values found in the selected fields, and replaces any matches with the configured replacement string.
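A possible configuration that collapses runs of whitespace into a single space, assuming this entry refers to RegexReplaceProcessorFactory; the field name "content" is hypothetical.

```xml
<!-- Goes inside an updateRequestProcessorChain. -->
<processor class="solr.RegexReplaceProcessorFactory">
  <str name="fieldName">content</str>
  <!-- One or more whitespace characters... -->
  <str name="pattern">\s+</str>
  <!-- ...are replaced with a single space. -->
  <str name="replacement"> </str>
</processor>
```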
Trims leading and trailing whitespace from any CharSequence values found in fields matching the specified conditions and returns the resulting String.
Truncates any CharSequence values found in fields matching the specified conditions to a maximum character length.
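For instance, truncation could be applied to all string-typed fields like this, assuming this entry refers to TruncateFieldUpdateProcessorFactory; the limit of 100 characters is an arbitrary choice.

```xml
<!-- Goes inside an updateRequestProcessorChain. -->
<!-- Cuts values in all fields of type solr.StrField down to 100 characters. -->
<processor class="solr.TruncateFieldUpdateProcessorFactory">
  <str name="typeClass">solr.StrField</str>
  <int name="maxLength">100</int>
</processor>
```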
Gives system administrators a way to ignore explicit commit or optimize requests from clients.
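A sketch of rejecting client commits outright, assuming this entry refers to IgnoreCommitOptimizeUpdateProcessorFactory. The status code and message are illustrative; the processor can also be left to acknowledge such requests without acting on them.

```xml
<!-- Goes inside an updateRequestProcessorChain, before the processor that runs the update. -->
<processor class="solr.IgnoreCommitOptimizeUpdateProcessorFactory">
  <!-- Status returned to clients that send an explicit commit or optimize. -->
  <int name="statusCode">403</int>
  <str name="responseMessage">Explicit commits are not allowed on this collection.</str>
</processor>
```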
LangDetectLanguageIdentifierUpdateProcessorFactory in solr-langid-5.5.0.jar (dist/) (Sample mentions: 1)
Identifies the language of a set of input fields using http://code.google.com/p/language-detection
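Because this factory lives outside solr-core, the sketch below uses the full class name. The input fields "title" and "text" and the output field "language_s" are hypothetical, and the langid jar and its dependencies are assumed to be on the classpath already.

```xml
<!-- Goes inside an updateRequestProcessorChain. -->
<processor class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory">
  <lst name="defaults">
    <!-- Fields whose text is used for detection. -->
    <str name="langid.fl">title,text</str>
    <!-- Field that receives the detected language code. -->
    <str name="langid.langField">language_s</str>
  </lst>
</processor>
```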
A No-Op implementation of DistributingUpdateProcessorFactory that always returns null.
Factory which creates RegexBoostProcessors.
StatelessScriptUpdateProcessorFactory (Sample mentions: 1)
An update request processor factory that enables the use of update processors implemented as scripts which can be loaded by the SolrResourceLoader (usually via the conf dir for the SolrCore).
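A minimal configuration might look like the following. The script file name is hypothetical, and the file is assumed to sit in the core's conf directory.

```xml
<!-- Goes inside an updateRequestProcessorChain. -->
<processor class="solr.StatelessScriptUpdateProcessorFactory">
  <!-- Script file resolved by the SolrResourceLoader, typically from conf/. -->
  <str name="script">update-script.js</str>
</processor>
```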
UIMAUpdateRequestProcessorFactory in solr-uima-5.5.0.jar (dist/)
Updates document(s) to be indexed with UIMA-extracted information.
<processor class="solr.CustomUpdateRequestProcessorFactory">
Only non-core URPs require the full class name, including the package name.