Systems | Development | Analytics | API | Testing

BI

Complete ETL Process Overview (design, challenges and automation)

The Extract, Transform, and Load process (ETL for short) is a set of procedures in the data pipeline. It collects raw data from its sources (extracts), cleans and aggregates data (transforms) and saves the data to a database or data warehouse (loads), where it is ready to be analyzed. A well-engineered ETL process provides true business value and benefits such as: Novel business insights. The entire ETL process brings structure to your company’s information.

Data Governance and Strategy for the Global Enterprise

While the word “data” has been common since the 1940s, managing data’s growth, current use, and regulation is a relatively new frontier. Governments and enterprises are working hard today to figure out the structures and regulations needed around data collection and use. According to Gartner, by 2023 65% of the world’s population will have their personal data covered under modern privacy regulations.

Cloudera DataFlow Functions for Public Cloud powered by Apache NiFi

Since its initial release in 2021, Cloudera DataFlow for Public Cloud (CDF-PC) has been helping customers solve their data distribution use cases that need high throughput and low latency requiring always-running clusters. CDF-PC’s DataFlow Deployments provides a cloud-native runtime to run your Apache NiFi flows through auto scaling Kubernetes clusters as well as centralized monitoring and alerting and improved SDLC for developers.

Serverless NiFi Flows with DataFlow Functions: The Next Step in the DataFlow Service Evolution

Cloudera DataFlow for the Public Cloud (CDF-PC) is a cloud-native service for Apache NiFi within the Cloudera Data Platform (CDP). CDF-PC enables organizations to take control of their data flows and eliminate ingestion silos by allowing developers to connect to any data source anywhere with any structure, process it, and deliver to any destination using a low-code authoring experience.

Announcing GA of DataFlow Functions

Today, we’re excited to announce that DataFlow Functions (DFF), a feature within Cloudera DataFlow for the Public Cloud, is now generally available for AWS, Microsoft Azure, and Google Cloud Platform. DFF provides an efficient, cost optimized, scalable way to run NiFi flows in a completely serverless fashion. This is the first complete no-code, no-ops development experience for functions, allowing users to save time and resources.

The Ultimate Guide to Choosing the Best JavaScript Charting Library

Charting libraries are in great demand, and their creation and use are becoming increasingly popular in languages such as JavaScript. As evidence, several JavaScript charting libraries are available, both commercial and open-source, with a wide range of functionalities to meet the demands of users. But how can a developer make an informed decision and choose the best JavaScript charting library? It's a difficult question, but we're here to assist!

How to add multiple charts to a report

In this video you will learn how to add multiple charts, or visualizations, to a single data table as you build a report. You'll learn about using the Auto Chart feature, as well as manually selecting your own chart types. In addition to adding charts, you will learn how to add text, graphics, and images to your report. Once you are finished adding charts and other visual elements, you will learn how to properly save your multi-chart report.

Simplify Data Access Control | infoSecur

In this episode of “Powered by Snowflake” host Daniel Myers sits down with infoSecur’s Founder and CEO, Michael Magalsky. infoSecur is a centralized tool, used across all structured data environments and database sources to manage data policies and access down to the cell level across your data cloud. The “Powered by Snowflake" video series features conversations with technology leaders who are building businesses and applications on top of Snowflake.