Azure tables are accessed using HTTP/HTTPS, with maximum batch size of 100 records per Azure Table partition key.
Converts an iterator of records that are already sorted by a key into an iterator of pages of records with the same key.
Converts an iterator of records that are already sorted by a key into an iterator of pages of records with the same key.
Created by Alexei Perelighin on 2017/12/18
data type of the records in the iterator
Created by Alexei Perelighin on 18/12/17.
Creates a converter from spark Row into Azure EntityProperty.
Creates a converter from spark Row into Azure EntityProperty.
Created by alexeipab on 18/12/17.
Created by Alexei Perelighin on 2018/03/25.
Azure tables are accessed using HTTP/HTTPS, with maximum batch size of 100 records per Azure Table partition key. Therefore there is a lot of lag between the batches and spending one spark executor per connection will be very wasteful. To work around it, this class will open multiple threads each having its own connection to an Azure Table and will push data from the blocking queue.
Created by Alexei Perelighin on 2018/03/01