Setup Apache Spark environment on Windows | Apache Spark

Apache Spark is an easy-to-use, unified platform for all kinds of big data processing, equipped with a rich set of APIs for different application needs, such as Spark DataFrame and Spark SQL for structured data processing, Spark Streaming and Structured Streaming for streaming applications, Spark MLlib for machine learning applications, Spark GraphX for graph analytics […]
Apache Spark Application Execution Mode | Apache Spark

Apache Spark is a powerful processing platform for big data applications that supports different types of big data processing. In this article we will explore how an Apache Spark application can be executed in multiple modes, depending on the environment architecture and the application requirements. Before going into details, if you would like to set up […]
Apache Kafka | How Does It Work?

In this article we are going through the Kafka streaming tool: we will define what Apache Kafka is, its components and structure, and its behavior, and finally end with a real streaming app. Apache Kafka is an open-source stream-processing software platform, written in Scala and Java and created by LinkedIn data engineers in 2011; the technology was handed over […]