Constructor with default index analyzer, query analyzer and Lucene similarity
Input DataFrame
Instantiate a LuceneRDD with DataFrame
Spark DataFrame
Index Analyzer name
Query Analyzer name
Lucene scoring similarity, i.e., BM25 or TF-IDF
Instantiate a LuceneRDD with an iterable
Input type
Elements to index
Index Analyzer name
Query Analyzer name
Lucene scoring similarity, i.e., BM25 or TF-IDF
Spark Context
Instantiate a LuceneRDD given an RDD[T]
Generic type
RDD of type T
Index Analyzer name
Query Analyzer name
Lucene scoring similarity, i.e., BM25 or TF-IDF
Deduplication via blocking
Entities DataFrame to deduplicate
Function that maps Row to Lucene Query String
Columns on which exact match is required
Number of top-K query results
Lucene analyzer at index time
Lucene analyzer at query time
Lucene similarity metric (BM25 or TF-IDF)
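The blocking idea behind deduplication can be sketched with plain Scala collections: records are only compared with other records that share the same value on the blocking columns, and each record keeps its top-K best matches. This is a minimal sketch and not the library's implementation. The `Person` type, the `dedup` helper and the shared-token count (a crude stand-in for Lucene analysis and BM25 / TF-IDF scoring) are all assumptions made for illustration, as is the choice to exclude a record from its own results.

```scala
// Hypothetical record type; the library itself works on Spark Rows.
final case class Person(id: Int, name: String, city: String)

object BlockDedupSketch {
  // Lowercased word tokens: a crude stand-in for a Lucene analyzer.
  private def tokens(s: String): Set[String] =
    s.toLowerCase.split("\\W+").filter(_.nonEmpty).toSet

  // Number of shared tokens: a stand-in for a Lucene similarity score.
  private def sharedTokens(a: String, b: String): Int =
    (tokens(a) intersect tokens(b)).size

  // For each record id, return the ids of up to topK candidate duplicates
  // found inside the same block (records with the same blocking key).
  def dedup(entities: Seq[Person],
            blockKey: Person => String,
            topK: Int): Map[Int, Seq[Int]] =
    entities.groupBy(blockKey).values.flatMap { block =>
      block.map { p =>
        val ranked = block
          .filter(_.id != p.id)                         // skip the record itself
          .map(o => (o.id, sharedTokens(p.name, o.name)))
          .filter(_._2 > 0)                             // drop non-matches
          .sortBy { case (id, s) => (-s, id) }          // best score first, ties by id
          .take(topK)
          .map(_._1)
        p.id -> ranked
      }
    }.toMap
}
```

Blocking on `city` here means that two "John Smith" records in different cities are never compared, which is the trade-off blocking makes: far fewer comparisons in exchange for missing duplicates that disagree on the blocking columns.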
Entity linkage between two DataFrames by blocking / filtering on one or more columns.
DataFrame of queries to be linked against the entities DataFrame
DataFrame of entities to be linked with the queries DataFrame
Function that converts each Row to a query string in Lucene query syntax
List of query columns used as the HashPartitioner key
List of entity columns used as the HashPartitioner key
Number of linked results
Lucene analyzer at index time
Lucene analyzer at query time
Lucene similarity metric (BM25 or TF-IDF)
Returns an RDD of Tuple2, where _1 is the query and _2 holds its top-k linked results as SparkScoreDoc.
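The shape of blocked entity linkage can be illustrated without Spark. In the sketch below, queries and entities are grouped by the values of their respective blocking columns, each query is turned into a query string, and only entities in the matching block are scored. Everything here is an assumption made for illustration: `Rec` stands in for a Spark Row, `searchField` is a hypothetical parameter naming the entity field to search, and counting shared tokens replaces real Lucene query parsing and scoring.

```scala
object BlockLinkageSketch {
  // Stand-in for a Spark Row: column name -> value.
  type Rec = Map[String, String]

  // Lowercased word tokens: a crude stand-in for a Lucene analyzer.
  private def tokens(s: String): Set[String] =
    s.toLowerCase.split("\\W+").filter(_.nonEmpty).toSet

  // Returns one pair per query: _1 is the query, _2 its top-K linked
  // entities with their scores, best first.
  def link(queries: Seq[Rec],
           entities: Seq[Rec],
           rowToQuery: Rec => String,
           queryBlockCols: Seq[String],
           entityBlockCols: Seq[String],
           searchField: String, // hypothetical: entity field to match against
           topK: Int): Seq[(Rec, Seq[(Rec, Int)])] = {
    // Group entities by the values of their blocking columns.
    val blocks: Map[Seq[String], Seq[Rec]] =
      entities.groupBy(e => entityBlockCols.map(c => e.getOrElse(c, "")))

    queries.map { q =>
      val key = queryBlockCols.map(c => q.getOrElse(c, ""))
      val wanted = tokens(rowToQuery(q))
      val hits = blocks.getOrElse(key, Seq.empty)
        .map(e => (e, (wanted intersect tokens(e.getOrElse(searchField, ""))).size))
        .filter(_._2 > 0)  // keep only entities that share a token
        .sortBy(-_._2)     // highest score first (stable for ties)
        .take(topK)
      (q, hits)
    }
  }
}
```

The query and entity blocking columns are passed separately because the two DataFrames need not use the same column names; only the values have to agree for a query and an entity to land in the same block.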
Get the configured analyzers, or fall back to English if none is configured
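The fallback behaviour can be sketched as a lookup with a default. This is an illustration only: the configuration is modelled as a plain `Map`, and the key names `index.analyzer` and `query.analyzer` as well as the analyzer name `"en"` are hypothetical placeholders, not the library's actual configuration keys or values.

```scala
object AnalyzerConfigSketch {
  // Hypothetical name for the English analyzer used as the default.
  val DefaultAnalyzer = "en"

  // Returns (indexAnalyzer, queryAnalyzer), using the default for any
  // key that is missing or blank in the configuration.
  def analyzers(conf: Map[String, String]): (String, String) = {
    def lookup(key: String): String =
      conf.get(key).map(_.trim).filter(_.nonEmpty).getOrElse(DefaultAnalyzer)
    (lookup("index.analyzer"), lookup("query.analyzer"))
  }
}
```

Resolving the two analyzers independently lets a deployment override only the index-time analyzer, for example, while the query-time analyzer keeps the default.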
Return project information, e.g., version number and build time