Systems | Development | Analytics | API | Testing

Latest News

Happy Birthday Apache HBase! 10 years of resilience, stability, and performance

Apache HBase became a top-level project with Apache 10 years ago and Cloudera began contributing to it at the same time (2010). Over this time, it has become one of the largest and most popular open-source tools in big data and one of the most popular NoSQL databases.

The Checkered Flag for Autonomous Vehicles

People intuitively know that self-driving or autonomous cars present complex engineering challenges. Vehicle assembly is the easy part – we’ve been doing that for 100 years. The real challenge is a data challenge, acquiring and managing the data needed to run the vehicles’ brain, eyes, and ears. Autonomous driving technology complexity lies in the ability to ingest, store, analyze, and deploy large volumes of data & the high bandwidth needs of data-in-motion.

How Harnessing Your Data in Motion Can Drive Mission Success

Today’s globalized organizations demand a new standard for communicating and sharing information. That includes data-rich content that moves through environments, networks, and locales. From being stored, analyzed, and shared, to quickly and effectively moving between environments, to spinning up in clusters and informing endless applications—data is more critical than ever.

Operational Database Application Support

This blog post is part of a series on Cloudera’s Operational Database (OpDB) in CDP. Each post goes into more details about new features and capabilities. Start from the beginning of the series with, Operational Database in CDP. This blog post gives you an overview of the languages, frameworks, and applications supported by Cloudera’s OpDB.

Automating BigQuery exports to an email

Data accessibility and analysis is a crucial part of getting value from your data. While there are many methods to view data when it comes to BigQuery, one common way is to export query results as an email on a scheduled basis. This lets end users get an email with a link to the most recent query results, and is a good solution for anyone looking for daily statistics on business processes, monthly summaries of website metrics, or weekly business reviews.

Connected Manufacturing - How data and analytics are reshaping manufacturing

Manufacturing has historically been laggards in its adoption of emerging technologies as business processes, from the production line to back-office operations, have inherently been tied to legacy applications. But that is rapidly changing. Faced with competitive pressures, and driven by technological enhancements across broad sectors of the economy, today’s manufacturing leaders are seeking new ways to improve productivity, reduce downtime, and streamline operations and the supply chain.

Apache YuniKorn (Incubating) 0.8 release: What's new and upcoming?

Apache YuniKorn (Incubating) is a standalone resource scheduler that aims to bring advanced scheduling capabilities for Big Data workloads onto containerized platforms. Please read YuniKorn: a universal resources scheduler to learn about the rationale and architecture. Since the time of our last post, we are delighted to update that YuniKorn was accepted by the Apache incubator in Jan 2020!

How Customer Success helped take a SaaS company from Niche to Visionary in just 3 years

It was three years ago, just after the Gartner Magic Quadrant (MQ) came out, that everything in customer success changed at Yellowfin. We had made it into the MQ again but we weren’t positioned where we believed we should be. We were a product-driven company that had been first to market for many of the functionalities that have since become expected in what BI vendors offer today.

Introducing MLOps And SDX for Models in Cloudera Machine Learning

It seems everyone is talking about machine learning (ML) these days — and ML’s use in products and services we consume everyday continues to be increasingly ubiquitous. But for many enterprise organizations, the promise of embedding ML models across the business and scaling use cases remains elusive. So what about ML makes it difficult for enterprises to adopt at scale?

Building an application to predict customer churn

Too often, companies are finding out after the fact that customers have stopped using their product or service, without enough notice to have done anything about it. The term customer churn is used to describe the loss of existing customers. These are people or organizations that were using a company’s products and/or services and have decided not to use them anymore, in favor of a competitor. Tracking customer churn is a key business metric for most companies.