The info graphics representing the basic concept of Data Lake where we can use the approach ELT (Extraction, loading and then transformation) against traditional ETL (Extraction, Transformation and then loading)process. ETL process implies to traditional data warehousing system where structured data format follows (raw and column).By leveraging HDFS (Hadoop Distributed File System), we can develop data lake to store any format data in order to process and analysis. Directly data can be loaded in the Lake without transformation, later transformation can be performed on demand basis. Data Lake concept is offering a tremendous advantage and benefit.
Can be reached for real-time POC development and hands-on technical training at gautambangalore@gmail.com. Besides, to design, develop just as help in any Hadoop/Big Data handling related task. Gautam is a advisor and furthermore an Educator as well. Before that, he filled in as Sr. Technical Architect in different technologies and business space across numerous nations.
He is energetic about sharing information through blogs, preparing workshops on different Big Data related innovations, systems and related technologies.
Page: 1 2
The data serialization format is a key factor when dealing with stream processing, as it… Read More
In today's AI-powered systems, real-time data is essential rather than optional. Real-time data streaming has… Read More
In the current fast-paced digital age, many data sources generate an unending flow of information,… Read More
According to the International Chamber of Shipping, the maritime industry has increased fourfold in the… Read More
Lately, companies, in their efforts to engage in real-time decision-making by exploiting big data, have… Read More
Lurking in the shadows of every organization is a silent giant—dark data. Undiscovered log files,… Read More