com.github.jelmerk.spark.knn.bruteforce
Param that indicates whether to not return the a candidate when it's identifier equals the query identifier Default: false
Param that indicates whether to not return the a candidate when it's identifier equals the query identifier Default: false
Param for number of neighbors to find (> 0).
Param for number of neighbors to find (> 0). Default: 5
Param that specifies the number of index replicas to create when querying the index.
Param that specifies the number of index replicas to create when querying the index. More replicas means you can execute more queries in parallel at the expense of increased resource usage. Default: 0
Param for the output format to produce.
Param for the output format to produce. One of "full", "minimal" Setting this to minimal is more efficient when all you need is the identifier with its neighbors
Default: "full"
Param that specifies the number of threads to use.
Param that specifies the number of threads to use. Default: number of processors available to the Java virtual machine
Param for the column name for the query identifier.
Param for the column name for the query identifier.
Param for the column name for the query partitions.
Param for the column name for the query partitions.
Param for the threshold value for inclusion.
Param for the threshold value for inclusion. -1 indicates no threshold Default: -1
Model produced by
BruteForceSimilarity
.