Hi everyone,
Need your assistance regarding building a data pipeline. So the architecture will be comprise of three components namely data lake, data warehouse and olap cubes.
So we need to design a data pipeline architecture for these three components.
1) Data from the source system will be collected under data lake and will be saved in a data ware house environment.The data warehouse environment will have the data in de normalized structure and further the data are sent to a data base /olap cubes for slicing and dicing of data.
can anyone tell me which tools and technolgies can be used. assuming that the whole life cycle tools needs to be automated.
e.g : data from datalake are stored in a environment namely data warehouse but we should not create any table for a database. the tool should be able to adapt as per the incoming source data.
Thanks,
Karthik.