Analytics

How You Can Contribute to ClearML's MLOps Platform

ClearML is an open source MLOps platform, and we love the community that’s been growing around us over the last few years. In this post, we’ll give you an overview of the structure of the ClearML codebase so you know what to do when you want to contribute to our community. Prefer to watch the video? Click below: First things first. Let’s take a look at our GitHub page and corresponding repositories. Later on, we’ll cover the most important ones in detail.

Data modeling techniques for data warehousing

When setting up a modern data stack, data warehouse modeling is often the very first step. It is important to create an architecture that supports the data models that you wish to build. I often see people going straight to writing complex transformations before thinking about how they want to organize the databases, schemas, and tables within their warehouse. To succeed, it is key to design your data warehouse with your models in mind before starting the modeling process.

Data Governance Framework Policy - What Do You Need to Know?

According to IDCs Global Datasphere, 64.2 ZB of data was created in 2020 alone. This number is projected to grow by 23% annually from 2020-2025. Therefore, we need data governance frameworks for efficient data management and control. This will help us extract maximum value out of such high volumes of data. Such frameworks would be required for data integrity, data protection, and data security. Indeed, according to BDO, the average data breach cost has been estimated to be around USD 3.8 million.

Transaction Support in Cloudera Operational Database (COD)

CDP Operational Database enables developers to quickly build future-proof applications that are architected to handle data evolution. It helps developers automate and simplify database management with capabilities like auto-scale, and is fully integrated with Cloudera Data Platform (CDP). For more information and to get started with COD, refer to Getting Started with Cloudera Data Platform Operational Database (COD).

How to Deploy Transaction Support on Cloudera Operational Database (COD)

Cloudera Operational Database enables developers to quickly build future-proof applications that are architected to handle data evolution. It helps developers automate and simplify database management with capabilities like auto-scale, and is fully integrated with Cloudera Data Platform (CDP). For more information and to get started with COD, refer to our article Getting Started with Cloudera Data Platform Operational Database (COD).

What is Data Governance? Accountability and Quality Control in Analytics

Effective control and governance over your data assets is vital for long-term business success. By keeping your data available, reliable and usable for analysis, consumption and sharing, you can ensure data quality, data security and data reliability is consistently met. However, many organizations today struggle to implement governance frameworks over their data, which has once again highlighted the importance of data governance. So, what is data governance?