Tech Threads

Big Data Generation and its sources

To understand how Big Data is generated, it helps to look at the sources that produce it today. These sources show how quickly data volumes are growing, and they suggest that the exponential growth of digitization will leave the entire globe awash in oceans of data in the near future. Big Data generation sources can be classified as follows.


– Media:- Media and communication outlets (articles, audio, video, emails, blogs, podcasts)
– Machine:- Data generated by computers and machines, generally without human intervention (server logs, phone calls, sensors, business process logs, etc.)
– Social:- Digital material created on social media platforms such as Facebook and LinkedIn (texts, photos, videos, tweets, etc.)
– Historical:- Data about our environment (weather, traffic, census) and archived documents, forms, or records.
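The classification above can be sketched as a small lookup structure. This is only an illustrative sketch, not part of any library: the names `BigDataSource`, `EXAMPLES`, and `sources_for` are hypothetical, and the example data types are simply the ones listed in the categories above.

```python
from enum import Enum


class BigDataSource(Enum):
    """The four source categories from the list above (hypothetical names)."""
    MEDIA = "Media"
    MACHINE = "Machine"
    SOCIAL = "Social"
    HISTORICAL = "Historical"


# Example data types per category, copied from the classification above.
EXAMPLES = {
    BigDataSource.MEDIA: ["article", "audio", "video", "email", "blog", "podcast"],
    BigDataSource.MACHINE: ["server log", "phone call", "sensor", "business process log"],
    BigDataSource.SOCIAL: ["text", "photo", "video", "tweet"],
    BigDataSource.HISTORICAL: ["weather", "traffic", "census",
                               "archived document", "form", "record"],
}


def sources_for(data_type: str) -> list[BigDataSource]:
    """Return every category whose example list contains data_type."""
    key = data_type.strip().lower()
    return [source for source, items in EXAMPLES.items() if key in items]
```

Note that a data type can belong to more than one category: video, for instance, appears under both Media and Social, so `sources_for("video")` returns both.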

Written by
Gautam Goswami

Can be reached for real-time POC development and hands-on technical training at gautambangalore@gmail.com, as well as for help with designing, developing, or handling any Hadoop/Big Data related task. Gautam is an advisor and an educator. Before that, he worked as a Sr. Technical Architect across different technologies and business domains in numerous countries.
He is passionate about sharing knowledge through blogs and training workshops on Big Data technologies, systems, and related technologies.

 

