Systems | Development | Analytics | API | Testing

Data Enrichment Using Cloudera Data Engineering

In this video, we'll walk through an example on how you can use Cloudera Data Engineering to pull in multiple datasets from a Hive data warehouse and go through the process of enriching the data through the use of Apache Spark. We'll then run this Spark job from within Cloudera Data Engineering so that we can follow the progress and see details about the job's execution.

CDP Public Cloud: SSH Key Deployment

This video covers how to deploy SSH keys in CDP Public Cloud. It touches on how to generate a new SSH key pair and steps through the process of deploying it for a workload user through the Cloudera Management Console Web UI, as well as using the CDP command-line tool. It discusses the security implications of using the Cloudbreak user for login on data hub hosts, and explains why workload user credentials should be used instead in most cases. It also demonstrates using the deployed SSH keys for login to data hub hosts.

Accelerate Application Development with the Operational Database Demo Highlight

Cloudera Operational Database is a fast, flexible, dbPaaS database that enables faster application development. It simplifies application planning as it grows in scale and importance, and is a great fit for many application types including mobile, web, gaming, ad-tech, IoT, and ML model serving.

Demo: Cloudera DataFlow on Data Hub

Cloudera DataFlow for Data Hub makes hybrid use cases possible by extending on-premises flow management, streams messaging, and stream processing and analytics capabilities to the public cloud. Watch an integrated demo of Cloudera DataFlow on Data Hub to understand how easy it is to ingest, process, and analyze your streaming data across multiple public cloud clusters.

Data Exploration & Reporting with Cloudera Data Warehouse

In this video, we’ll go over how you can use both Cloudera Public Cloud to both Ingest data through Cloudera Data Engineering as well as explore it through Hue and Impala within Cloudera Data Warehouse. You'll see how easy it is to run queries that give you insight into your data and how you can use a built in data visualization tool to then create a dashboard to share your results.

Introducing CDE: Purpose Built Tooling For Accelerating Data Pipelines Demo Highlight

Spark has become the de-facto processing framework for ETL and ELT workflows for good reason, but for many enterprises working with Spark has been challenging and resource-intensive. Leveraging Kubernetes to fully containerize workloads, DE provides a built-in administration layer that enables one-click provisioning of autoscaling resources with guardrails, as well as a comprehensive job management interface for streamlining pipeline delivery. DE enables a single pane of glass for managing all aspects of your data pipelines.

Validating Jet Engine Predictive Models Using Cloudera Machine Learning

In this video, we’ll go over how to use Cloudera Machine Learning (CML) to validate a complex predictive model. Using a publicly available NASA dataset that simulates how jet engines degrade over time, we’ll use machine learning concepts in a cloud environment to go from simulation data to a cost benefit analysis in just a few steps. We’ll also show how we can run experiments to track specific metrics from many different scenarios that our predictive model could possibly be implemented in.

Faster Application Development with Cloudera Operational Database (COD) Demo Highlight

IT is no longer relegated to the IT group. Lines of business are building new business applications that can drive their business’s top and/or bottom lines. These applications are increasingly stateless -- meaning that they rely on their underlying operational database to manage their state and work with IT to build, deploy and manage the database infrastructure. The application development lifecycle is accelerating with the broad adoption of cloud and the rise of dbPaaS where the database is fully managed and self-optimizes for the applications. In this session, we will show you how the Cloudera Operational Database offers an accelerated on-ramp to app development by offering a modern multi-model database that eliminates infrastructure management.

Automated Deployment of Apache Spark Jobs in Cloudera Data Engineering

In this video we're going to go over some more advanced features of the Cloudera Data Engineering Experience. Using some publicly accessible Paycheck Protection Data, you'll see how to automatically setup Spark jobs to deploy by using the CDE CLI, making development and deployment times much quicker and painless. We'll also take the development cycle through to the end and get some visualization of the finished reports using the aforementioned PPP data.