Sends the random content to every receiver subscribed with 1/2 second delay.
Case class for converting RDD to DataFrame
A sample actor as receiver, is also simplest.
A sample word count program demonstrating the use of plugging in Actor as Receiver Usage: ActorWordCount <hostname> <port> <hostname> and <port> describe the AkkaSystem that Spark Sample feeder is running on.
Custom Receiver that receives data over a socket.
Consumes messages from one or more topics in Kafka and does wordcount.
Use this singleton to get or register an Accumulator.
A sample feeder actor
Produces a count of events received from Flume.
Produces a count of events received from Flume.
Counts words in new text files created in the given directory Usage: HdfsWordCount <directory> <directory> is the directory that Spark Streaming will use to find and read new text files.
Consumes messages from one or more topics in Kafka and does wordcount.
A simple Mqtt publisher for demonstration purposes, repeatedly publishes Space separated String Message "hello mqtt demo for spark streaming"
A sample wordcount with MqttStream stream
Counts words in UTF8 encoded, '\n' delimited text received from the network every second.
Receives text from multiple rawNetworkStreams and counts how many '\n' delimited lines have the word 'the' in them.
Counts words in text encoded with UTF8 received from the network every second.
Lazily instantiated singleton instance of SQLContext
A simple publisher for demonstration purposes, repeatedly publishes random Messages every one second.
Use DataFrames and SQL to count words in UTF8 encoded, '\n' delimited text received from the network every second.
Counts words cumulatively in UTF8 encoded, '\n' delimited text received from the network every second starting with initial value of word count.
Utility functions for Spark Streaming examples.
Illustrates the use of the Count-Min Sketch, from Twitter's Algebird library, to compute windowed and global Top-K estimates of user IDs occurring in a Twitter stream.
Illustrates the use of the HyperLogLog algorithm, from Twitter's Algebird library, to compute a windowed and global estimate of the unique user IDs occurring in a Twitter stream.
Calculates popular hashtags (topics) over sliding 10 and 60 second windows from a Twitter stream.
Use this singleton to get or register a Broadcast variable.
A sample wordcount with ZeroMQStream stream