Systems | Development | Analytics | API | Testing

July 2021

Five Strategies to Accelerate Data Product Development

With this first article of the two-part series on data product strategies, I am presenting some of the emerging themes in data product development and how they inform the prerequisites and foundational capabilities of an Enterprise data platform that would serve as the backbone for developing successful data product strategies.

Beginner's Guide to Cloudera Operational Database

My name is Shanmukha Kota and I am a recent graduate from University at Buffalo. I interned with Cloudera last summer and joined Cloudera as a software engineer a couple of weeks ago and this is my first experience with CDP and CDP Operational Database. For a new hire college graduate in the industry with only academic experience with HBase, I can only say it is very simple and easy to set up and work with CDP Operational Database.

Accelerate Offloading to Cloudera Data Warehouse (CDW) with Procedural SQL Support

Did you know Cloudera customers, such as SMG and Geisinger, offloaded their legacy DW environment to Cloudera Data Warehouse (CDW) to take advantage of CDW’s modern architecture and best-in-class performance? In addition to substantial cost savings upon moving to CDW, Geisinger is also able to search through hundreds of million patient note records in seconds providing better treatment to their patients.

Future of Data Meetup: Hello, Kafka! (An Introduction to Apache Kafka)

Our “Hello, “ series of introductory “Big Data” topic-focused meetups returns to Boston in July as we deliver our fifth event. This meetup will introduce you to Apache Kafka without assuming you’ve heard anything about the Apache development project, the problems that Kafka was designed to solve or the role it currently plays in modern enterprise data architectures.

A Reference Architecture for the Cloudera Private Cloud Base Data Platform

The release of Cloudera Data Platform (CDP) Private Cloud Base edition provides customers with a next generation hybrid cloud architecture. This blog post provides an overview of best practice for the design and deployment of clusters incorporating hardware and operating system configuration, along with guidance for networking and security as well as integration with existing enterprise infrastructure.

Optimizing Risk and Exposure Management - Roundtable Highlights

We recently hosted a roundtable focused on optimizing risk and exposure management with data insights. For financial institutions and insurers, risk and exposure management has always been a fundamental tenet of the business. Now, risk management has become exponentially complicated in multiple dimensions. In this session we explored what firms are doing to approach the uncertainty with more predictability.

Demo Jam Live: Perform Flink stream processing and analytics using SQL

Is your business looking for a simpler way to access digital information faster? Do you know your developer and analytics teams, who have SQL skills, can now easily create streaming analytics for your business needs? This new demo jam webinar will showcase Cloudera Streaming Analytics with SQL Stream Builder and demonstrate how easy it is to create streaming queries using Apache Flink. Just like the previous session, this will be a no-slide, highly interactive demo-only session where you get to choose what you want to see based on live polling. This session is led by Kenny Gorman, Product Owner of Streaming Processing and Erik Beebe, Principal Stream Processing Engineer.

Delivering Modern Enterprise Data Engineering with Cloudera Data Engineering on Azure

After the launch of CDP Data Engineering (CDE) on AWS a few months ago, we are thrilled to announce that CDE, the only cloud-native service purpose built for enterprise data engineers, is now available on Microsoft Azure. CDP Data Engineering offers an all-inclusive toolset that enables data pipeline orchestration, automation, advanced monitoring, visual profiling, and a comprehensive management toolset for streamlining ETL processes and making complex data actionable across your analytic teams.

Cloudera Operational Database Replication in a Nutshell

In this previous blog post we provided a high-level overview of Cloudera Replication Plugin, explaining how it brings cross-platform replication with little configuration. In this post, we will cover how this plugin can be applied in CDP clusters and explain how the plugin enables strong authentication between systems which do not share mutual authentication trust.

4 Considerations When Building Your Government Data Strategy

If you’ve followed Cloudera for a while, you know we’ve long been singing the praises—or harping on the importance, depending on perspective—of a solid, standalone enterprise data strategy. While certainly not a new concept, Government missions are wholly dependent on real time access/analysis of data (wherever it may be (legacy data centers or public cloud) to render insight to support operational decisions.

Two Ways to Migrate Hortonworks DataFlow to Cloudera Flow Management

Hortonworks DataFlow (HDF) 3.5.2 was released at the end of 2020. The new releases will not continue under HDF as Cloudera brings the best and latest of Apache NiFi in the new Cloudera Flow Management (CFM) product. Getting the latest improvements and new features of NiFi is one of many reasons for you to move your legacy deployments of NiFi on this new platform. To that end, we released a few blog posts to help you migrate from HDF to CFM.