Posts

Showing posts with the label data analysis

How Data Mesh architecture and Data Catalogs help decentralized data teams.

Image
Not too long ago, Data Administrators had to change their long habit of having a monolith database. They were forced to accept and agree to the  Polyglot persistence - the developer's teams have started to choose different data storage and technologies that would support each application team's data model requirements. The time has arrived to break down  also the Data Lake monolith paradigm .  Refactoring monolith Data Lake makes a lot of sense.   The central data lake as well as the central data team is often a huge bottleneck . The central data team is usually busy with fixing broken data pipes and taking care of constant data changes made by the domain owners/development teams.  Data Mesh architecture is coming to the rescue here. Instead of a centralized data team, there would be multiple decentralised domain data teams, producing data sets or consuming other teams' data sets. Domain data team usually knows their domain data very well and are aware ...

Are you familiar with DATAIKU?

Image
If you want to make DATA a part of EVERYDAY decision-making, then you must try this amazing Data Analysis Platform. Dataiku is a tool for everyone, it has Notebooks and Python for Coders, Visual data flows for Clickers, relationships, statistics and visual data forecasting for Decision Makers. It's technology agnostic, you can install it on a public cloud, use it as SaaS service or install on-premises. You also can choose ANY DATA PROCESSING ENGINE that will process your workload, use Azure Synapse, Spark or Sql Server and analyze the data WITHOUT ANY DATA MOVEMENT, in "a spreadsheet" like manner.  Dataiku has many enterprise-scale features, like build-in flow audit, Data Quality features, easy deployments between Dataiku environments and much more. https://www.dataiku.com/