Q.1.
Mahout provides ____________ libraries for common and primitive Java collections.
Q.2.
Point out the correct statement.
Q.3.
_________ does not restrict contributions to Hadoop-based implementations.
Q.4.
Mahout provides an implementation of a ______________ identification algorithm that scores collocations using the log-likelihood ratio.
Q.5.
Point out the wrong statement.
Q.6.
The tokens are passed through a Lucene ____________ to produce NGrams of the desired length.
Q.7.
The _________ collocation identifier is integrated into the process that is used to create vectors from sequence files of text keys and values.
Q.8.
____________ generates NGrams and counts frequencies for ngrams and their head and tail subgrams.
Q.9.
A key of type ___________ is generated which is used later to join ngrams with their heads and tails in the reducer phase.
Q.10.
________ phase merges the counts for unique ngrams or ngram fragments across multiple documents.
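
Illustrative sketch for Q.4 to Q.10: the questions above refer to producing ngrams from tokens, counting each ngram together with its head and tail subgrams, and scoring collocations with the log-likelihood ratio. The plain-Python sketch below shows that general idea for bigrams only. It is not Mahout's implementation and uses neither Lucene nor Hadoop; every function name here is hypothetical, and the contingency-table layout (k11, k12, k21, k22) is an assumption chosen for illustration.

```python
from collections import Counter
from math import log


def bigrams(tokens):
    """Return adjacent token pairs (2-grams) in document order."""
    return list(zip(tokens, tokens[1:]))


def count_grams(documents):
    """Count bigrams plus their head (first word) and tail (second word) subgrams.

    Counts are merged across all documents, which is the role a
    combine/reduce step would play in a distributed setting.
    """
    ngram_counts, head_counts, tail_counts = Counter(), Counter(), Counter()
    for tokens in documents:
        for head, tail in bigrams(tokens):
            ngram_counts[(head, tail)] += 1
            head_counts[head] += 1
            tail_counts[tail] += 1
    return ngram_counts, head_counts, tail_counts


def _x_log_x(x):
    # Convention: 0 * log(0) is treated as 0.
    return x * log(x) if x > 0 else 0.0


def _entropy(*counts):
    # Unnormalized entropy term used in the entropy form of the LLR statistic.
    return _x_log_x(sum(counts)) - sum(_x_log_x(c) for c in counts)


def log_likelihood_ratio(k11, k12, k21, k22):
    """Log-likelihood ratio for a 2x2 contingency table of counts."""
    row = _entropy(k11 + k12, k21 + k22)
    col = _entropy(k11 + k21, k12 + k22)
    mat = _entropy(k11, k12, k21, k22)
    # Clamp tiny negative values caused by floating-point rounding.
    return max(0.0, 2.0 * (row + col - mat))


def score_bigrams(documents):
    """Join each bigram with its head and tail counts and score it."""
    ngram_counts, head_counts, tail_counts = count_grams(documents)
    total = sum(ngram_counts.values())
    scores = {}
    for (head, tail), k11 in ngram_counts.items():
        k12 = head_counts[head] - k11  # head followed by a different word
        k21 = tail_counts[tail] - k11  # tail preceded by a different word
        k22 = total - k11 - k12 - k21  # bigrams containing neither
        scores[(head, tail)] = log_likelihood_ratio(k11, k12, k21, k22)
    return scores
```

Calling score_bigrams on a list of tokenized documents returns a dictionary mapping each bigram to its score; higher scores suggest the two words co-occur more often than chance would predict.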