Google News
logo
Hadoop - Quiz(MCQ)
The tokens are passed through a Lucene ____________ to produce NGrams of the desired length.
A)
Collfilter
B)
ShngleFil
C)
SingleFilter
D)
ShingleFilter

Correct Answer :   ShingleFilter


Explanation : The tools that the collocation identification algorithm are embedded within either consume tokenized text as input or provide the ability to specify an implementation of the Lucene Analyzer class perform tokenization in order to form ngrams.

Advertisement