Systems | Development | Analytics | API | Testing

BI

Data Enrichment Using Cloudera Data Engineering

In this video, we'll walk through an example on how you can use Cloudera Data Engineering to pull in multiple datasets from a Hive data warehouse and go through the process of enriching the data through the use of Apache Spark. We'll then run this Spark job from within Cloudera Data Engineering so that we can follow the progress and see details about the job's execution.

Stephanie Stillman Talks About Data Sharing And The Data Marketplace | Behind the Data Cloud

Today on Behind The Data Cloud, Daniel Meyers interviews Snowflake Product Manager Stephanie Stillman and they talk about how she entered the data industry, data sharing, and the data marketplace. Behind the Data Cloud is a builder-focused video series.

Architecting a data lineage system for BigQuery

Democratization of data within an organization is essential to help users derive innovative insights for growth. In a big data environment, traceability of where the data in the data warehouse originated and how it flows through a business is critical. This traceability information is called data lineage. Being able to track, manage, and view data lineage helps you to simplify tracking data errors, forensics, and data dependency identification.

15 of the Best Data Analytics Tools of 2021

The importance of effective data analytics within an organization is widely accepted by business leaders at this point. With use cases for data analysis spanning every department—from IT management, financial planning, marketing analytics, and so on—the right data analytics tools can have a significant impact on a company’s profitability and growth.

Stitch vs. Talend vs. Xplenty: A Head-to-Head Comparison

Five differences between Stitch, Talend, and Xplenty: Organizations store data in many destinations, making that data difficult to analyze. Legacy systems, SaaS locations, in-house databases, apps, you name it — by storing data in all kinds of places, companies can complicate data analytics considerably. Storing data in a warehouse or a lake makes more sense.

Cloudera Operational Database application development concepts

Cloudera Operational Database is now available in three different form-factors in Cloudera Data Platform (CDP). If you are new to Cloudera Operational Database, see this blog post. And, check out the documentation here. In this blog post, we’ll look at both Apache HBase and Apache Phoenix concepts relevant to developing applications for Cloudera Operational Database.

Joining the Data Cloud

Join executives from Allianz Benelux and Emirates to hear why their organizations are joining the Data Cloud. The Data Cloud is transforming companies across financial services, transportation, and other industries. As leaders develop strategies to support the next 3–5 years of innovation, the Data Cloud is becoming a critical enabler for the success of their enterprises. Learn how these companies are seizing the opportunity with Snowflake, and see the broader impact Snowflake’s cloud data platform is having on their organizations.

Using COD and CML to build applications that predict stock data

No, not really. You probably won’t be rich unless you work really hard… As nice as it would be, you can’t really predict a stock price based on ML solely, but now I have your attention! Continuing from my previous blog post about how awesome and easy it is to develop web-based applications backed by Cloudera Operational Database (COD), I started a small project to integrate COD with another CDP cloud experience, Cloudera Machine Learning (CML).