Hadoop Operations by Eric Sammer

4 years ago

Source:- http://en.bookfi.net/ Also you can download the EPUB version or read online at http://en.bookfi.net/book/2244756 Please download here to get the… Read More

Pro Hadoop by Jason Venner

4 years ago

Source:- http://en.booklid.org/ Alternatively you can download or read online at http://en.booklid.org/book/1067086 Here also you can download the PDF version Read More

Big Data in Practice How 45 Successful Companies Used Big Data Analytics to Deliver Extraordinary Results

4 years ago

Big Data in Practice How 45 Successful Companies Used Big Data Analytics to Deliver Extraordinary Results by Bernard Marr Source:-… Read More

Data governance and security mechanism in distributed data storage system

6 years ago

We are very much aware that the traditional data storage mechanism is incapable to hold the massive volume of lightning… Read More

A Credible approach of Big Data processing and subsequent analysis on telecom data to minimize crime, combat terrorism, unsocial activities, etc.

7 years ago

Telecom providers have a treasure trove of captive data – customer data, CDR (call detail records), call center interactions, tower… Read More

Deleting Solr log files/folder from Standby NameNode could be the disaster when Primary NameNode is active in the HDP (Hortonworks Data Platform) Hadoop Cluster

7 years ago

Most of us know that we use Apache Ambari for managing, provisioning and monitor different components of a Hortonworks Hadoop… Read More

Fault tolerance enhancement on Apache Hadoop 3.0.0-alpha2 by supporting more than 2 NameNodes.

7 years ago

NameNode is the most critical resource in Hadoop core cluster. Once very large files loaded into the Hadoop Distributed File… Read More

Basic Understanding of Stateful data Streaming supported by Apache Flink

7 years ago

The technologies related to Big Data processing platform are enhancing the maturity in order to efficiently execute the streaming data… Read More

Apache Flink – A 4G Data Processing Engine

7 years ago

Analyzing streaming data in large-scale systems is becoming a focal point day by day to take accurate business decisions due… Read More

Steering number of mapper (MapReduce) in sqoop for parallelism of data ingestion into Hadoop Distributed File System (HDFS)

7 years ago

To import data from most the data source like RDBMS, SQOOP internally use mapper. Before delegating the responsibility to the… Read More