• Courses
  • Knowledge Hub
  • Cheat Sheets
  • Market Place
  • Plans and Pricing
  • Contact Us
  • Become an Instructor
DataValley
Category
Cloud Computing
Data Engineering
Data Modeling
ETL (Data Integration)
Data Science
Python
Webinars & Events
{{ search }}
Log in Sign Up

Login/Sign Up

Courses Favorites 0

Search

Category
Cloud Computing
Data Engineering
Data Modeling
ETL (Data Integration)
Data Science
Python
Webinars & Events
{{ search }}

Menu

  • Courses
  • Knowledge Hub
  • Cheat Sheets
  • Market Place
  • Plans and Pricing
  • Contact Us
  • Become an Instructor

Apache Kafka and Apache Spark Integration | Apache Kafka | Apache Spark

June 25, 2020mtarek Apache Kafka, Apache Spark, Big Data, Streaming

Introduction Apache Kafka is a scalable, high performance, low latency platform that allows reading and writing streams of data like a messaging system. We can start writing Kafka applications using Java fairly easily, check our previous article on how to design a Kafka pipeline in Java. If you research the variety of real-world use-cases for Kafka, you will very […]

Create a Kafka Pipeline using Java Application | Apache Kafka

June 6, 2020mtarek Apache Kafka, Big Data, Data Engineering, Streaming

Introduction This Article is about Programming Apache Kafka producer and consumer using Java language, as we’ll see, using Java we’ll be able to reproduce what the CLI does and even more. Prerequisites Kafka Installation and configuration article ( To setup cluster will be used in this article) Any java programming editor Ex. (Netbeans – IntelliJ […]

Setup Apache Kafka Environment | Apache Kafka

May 3, 2020mtarek Apache Kafka, Big Data, Data Engineering, Streaming

Introduction This article is about configuring and starting an Apache Kafka server on a Windows OS and Linux. This guide will also provide instructions to set up Java and Apache Zookeeper, and after the setup we will create a simple pipeline to test our installation. Kafka on windows Make sure you have the following prerequisites […]

Apache Kafka Components

April 14, 2020mtarek Apache Kafka, Big Data, Data Engineering, Streaming

What Is Apache Kafka? Apache Kafka is an open source project, initially created by LinkedIn, that is designed to be a distributed, partitioned, replicated commit log service. It provides the functionality of a messaging system, but with a unique design, which we will investigate in more detail in this Article. Kafka was designed with a […]

Learn

Courses
Cheat Sheets
Market Place
Plans and Pricing

About

DataValley is the e-learning platform for everything data science. From beginners to gurus, data geeks of all levels can find something at DataValley to help them enhance their skills.

Contact

DataValley Technologies.

[email protected]

Copyright © 2021 DataValley Technologies.
Search