Maximum N-grams a keyword should have (Default: 3
).
Minimum N-grams a keyword should have (Default: 1
).
Number of Keywords to extract (Default: 30
).
the words to be filtered out (Default: English stop words from MLlib)
Threshold to filter keywords (Default: -1
).
Threshold to filter keywords (Default: -1
). By default it is disabled.
Each keyword will be given a keyword score greater than 0. (The lower the score better the keyword).
This sets the upper bound for the keyword score.
Window size for Co-Occurrence (Default: 3
).
Window size for Co-Occurrence (Default: 3
).
Yake will construct a co-occurrence matrix. You can set the window size for the co-occurrence matrix construction
with this parameter.
Example: windowSize=2
will look at two words to both left and right of a candidate word.