In today’s world, Internet plays a major factor to generate and propagate information from various sources. Social media, Email, What’sApp, E-News Paper etc are playing a crucial role on circulation followed by creation of information. These type of information often include text and multimedia contents. These information or data methodically can’t be persisted in database and it is referred as unstructured data.
Here in (HDFS)we do not have to design schema in row column structure to accumulate data. Data can be ingested directly in HDFS and process eventually in a distributed manner to get desire result. The unstructured data is closely associate with term call “BIG DATA” which is tuning the entire world of information towards a new direction and Hadoop is enabling us to process thousand of petabyte of unstructured data.
Written by
Gautam Goswami
Page: 1 2
Incremental computation in data streaming means updating results as fresh data comes in, without redoing… Read More
We call this an event when a button is pressed, a sensor detects a temperature… Read More
Apache Paimon is made to function well with constantly flowing data, which is typical of… Read More
A data fabric is an innovative system designed to seamlessly integrate and organize data from… Read More
Big data technologies' quick development has brought attention to the necessity of a smooth transition… Read More
Data is being generated from various sources, including electronic devices, machines, and social media, across… Read More