Maximum N-grams a keyword should have (Default: 3
).
Minimum N-grams a keyword should have (Default: 1
).
Number of Keywords to extract (Default: 30
).
the words to be filtered out (Default: English stop words from MLlib)
Threshold to filter keywords (Default: -1
).
Threshold to filter keywords (Default: -1
). By default it is disabled. Each keyword will
be given a keyword score greater than 0. (The lower the score better the keyword). This sets
the upper bound for the keyword score.
Window size for Co-Occurrence (Default: 3
).
Window size for Co-Occurrence (Default: 3
). Yake will construct a co-occurrence matrix.
You can set the window size for the co-occurrence matrix construction with this parameter.
Example: windowSize=2
will look at two words to both left and right of a candidate word.