streaming_course

To start the project

Start zookeeper server in one terminal

/usr/bin/zookeeper-server-start config/zookeeper.properties

Start kafka server in a separate terminal

/usr/bin/kafka-server-start config/producer.properties

Start producer server in a separate terminal

python kafka_server.py

Start consumer listener

/usr/bin/kafka-console-consumer --bootstrap-server localhost:9092 --topic police.department.calls --from-beginning

Start Streaming App

spark-submit --conf spark.ui.port=3000 --packages org.apache.spark:spark-sql-kafka-0-10_2.11:2.3.4 --master local[*] data_stream.py

Questions:

How did changing values on the SparkSession property parameters affect the throughput and latency of the data?

The more memory allocated to executors (spark.executor.memory), the higher the throughput and the lower the latency I observed.
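One way to put numbers on throughput and latency is to average the progress reports that Structured Streaming keeps for a running query. The sketch below assumes each report is a dict shaped like the entries of PySpark's `StreamingQuery.recentProgress` (with `processedRowsPerSecond` and `durationMs["triggerExecution"]`). The helper name `summarize_progress` and the sample values are hypothetical, not taken from this project.

```python
from statistics import mean


def summarize_progress(progress_reports):
    """Average throughput and batch latency over a list of progress dicts."""
    # Rows processed per second, skipping reports that lack the field
    rates = [
        p["processedRowsPerSecond"]
        for p in progress_reports
        if p.get("processedRowsPerSecond") is not None
    ]
    # Wall-clock time per micro-batch in milliseconds
    latencies = [
        p["durationMs"]["triggerExecution"]
        for p in progress_reports
        if "triggerExecution" in p.get("durationMs", {})
    ]
    return {
        "avg_rows_per_second": mean(rates) if rates else 0.0,
        "avg_batch_ms": mean(latencies) if latencies else 0.0,
    }


# Hypothetical sample reports for illustration only, not measurements
sample_reports = [
    {"processedRowsPerSecond": 100.0, "durationMs": {"triggerExecution": 200}},
    {"processedRowsPerSecond": 300.0, "durationMs": {"triggerExecution": 400}},
]
print(summarize_progress(sample_reports))
```

In data_stream.py this could be called as `summarize_progress(query.recentProgress)` once per configuration, assuming `query` is the object returned by `writeStream...start()`, so that runs with different property values can be compared on the same two numbers.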

What were the 2-3 most efficient SparkSession property key/value pairs? Through testing multiple variations on values, how can you tell these were the most optimal?

spark.default.parallelism, set according to the available cores. My machine has 4 cores, so I set this value to 4. With this setting, more batch outputs were displayed in the terminal than in otherwise similar runs with other values.
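A minimal sketch of how these two properties might be collected and applied when building the session. The helper names `build_spark_conf` and `apply_conf` are hypothetical, and the `"2g"` default for executor memory is an assumed placeholder rather than a value tested in this project. Only the standard `builder.config(key, value)` call is relied on.

```python
import os


def build_spark_conf(cores=None, executor_memory="2g"):
    """Return SparkSession properties as a dict of strings."""
    # Fall back to the machine's core count when none is given
    cores = cores or os.cpu_count() or 1
    return {
        "spark.default.parallelism": str(cores),
        "spark.executor.memory": executor_memory,  # assumed value
    }


def apply_conf(builder, conf):
    """Apply each key/value pair to a SparkSession-style builder."""
    for key, value in conf.items():
        builder = builder.config(key, value)
    return builder


conf = build_spark_conf(cores=4)
print(conf)
```

With PySpark installed, usage would look something like `apply_conf(SparkSession.builder.master("local[*]"), conf).getOrCreate()`. Keeping the properties in one dict makes it easier to rerun the app with a different set of values and compare the results.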
