Cloudera

Cloudera's Applied ML Prototype Catalog Continues to Grow

Here at Cloudera, we’re committed to making the lives of data practitioners as painless as possible. For data scientists, we continue to provide new Applied Machine Learning Prototypes (AMPs), which are open source and available on GitHub. These pre-built reference examples are complete end-to-end data science projects. In Cloudera Machine Learning (CML), you can deploy them with a single click, bringing data scientists that much closer to providing value.

Hello, Spark! An intro to Apache Spark using PySpark in the Cloud

If you’re new to the world of large-scale data analytics, this session is for you! We’ll cover the basics of what problems Apache Spark can solve, why and when to use Spark, and how Spark enables efficient use of time and computing hardware. We’ll also demonstrate how easy it is to run a PySpark job in the public cloud using the Data Science Workbench and Cloudera Data Engineering products.

Streaming Edge Data Collection and Global Data Distribution

In the first blog of the Universal Data Distribution blog series, we discussed the emerging need within enterprise organizations to take control of their data flows. From origin through all points of consumption both on-prem and in the cloud, all data flows need to be controlled in a simple, secure, universal, scalable, and cost-effective way.

Data & The Culture Transformation

TechCrunch and Cloudera invite you to a conversation about the data transformation underway that is changing how information is used and the very nature of business. The emerging data ecosystem will allow enterprises to work collaboratively with customers, partners and even competitors around the world to integrate disparate data sources for a more complete picture of their business’ present and future.

The Power of Exploratory Data Analysis and Visualization for ML

Data scientists and machine learning engineers in enterprise organizations need to fully understand their data in order to properly analyze it, build models, and power machine learning use cases across their business. The lack of tooling specifically designed for data discovery, exploration, and preliminary analysis makes this a significant challenge for these teams.

Moving Enterprise Data From Anywhere to Any System Made Easy

Since 2015, the Cloudera DataFlow team has been helping the largest enterprise organizations in the world adopt Apache NiFi as their enterprise standard data movement tool. Over the last few years, we have had a front-row seat to our customers’ hybrid cloud journeys as they expand their data estates across the edge, on-premises, and multiple cloud providers.

Technical Demo - Universal Data Distribution With Cloudera DataFlow for Public Cloud

A hands-on demo of Cloudera Data Platform’s Universal Data Distribution (UDD) service using Cloudera DataFlow (CDF) for the public cloud. The demo shows how to build ingest pipelines that move data from anywhere in the business to any other system, software, or workflow. In particular, it shows how the UDD service automates ingest and data delivery across multiple public cloud providers into other analytic systems.