Importance of Schema Registry on Kafka Based Data Streaming Pipelines

5 years ago

Needless to say Apache Kafka delivers messages to both real-time and batch consumers without performance degradation and in addition to… Read More

Orchestrating Multi-Brokers Kafka Cluster through CLI Commands

5 years ago

This short article aims to highlight the list of commands to manage a running multi-broker multi-topic Kafka cluster utilizing built-in… Read More

Crafting a Multi-Node Multi-Broker Kafka Cluster- A Weekend Project

5 years ago

Over the most recent couple of years, there has been a huge development in the appropriation of Apache Kafka. Kafka… Read More

Setup Zookeeper Cluster – A minute chore

5 years ago

  Despite the fact that Apache Zookeeper's functionalities are not legitimately noticeable to end-client however it remains as the spine… Read More

Alternative way of loading or importing data into Hive tables running on top of HDFS based data lake.

5 years ago

Preceding pen down the article, might want to stretch out appreciation to all the wellbeing teams beginning from cleaning/sterile group… Read More

Resolve Apache Zookeeper starting issue installed on multi-node cluster

5 years ago

This miniature article explains how to resolve the error "Error: Could not find or load main class org.apache.zookeeper.server.quorum.QuorumPeerMain" when we… Read More

Why disintegration of Apache Zookeeper from Kafka is in pipeline

6 years ago

  The main objective of this article is to highlight why to cut the bridge between Apache Zookeeper and Kafka… Read More

Install and Configuration of Apache Hive-3.1.2 on multi-node Hadoop-3.2.0 cluster with MySQL for Hive metastore

6 years ago

The apache Hive is a data warehouse system built on top of the  Apache Hadoop.  Hive can be utilized for… Read More

Apache Hadoop-3.2.0 installation on the multi-node cluster with an alternative backup recovery

6 years ago

The objective of this article is to explain how we can deploy the latest version of Apache Hadoop (Stable release:… Read More

How checksum smartly manages data integrity in HDFS (Hadoop Distributed File System).

6 years ago

Ensuring data integrity is basic necessity or back bond in big data processing environment to achieve accurate outcome. Of course,… Read More