Monthly Archive

Cloudera launches observability offering for the hybrid cloud

Jul 31, 2023 By Cloudera In Cloudera

Cloudera Observability ultimately aims to make end users more productive, so they can focus on driving insight and value from their data rather than tweaking and tuning their analytics.

View Video

Cloudera

Deploying NiFi Flow | Impala Critical Exception

Jul 28, 2023 By Cloudera In Cloudera

Part of the Visibility and Transparency series, this video walks you through building an end-to-end data ingestion NiFi Flow for the Impala Critical Query Failure Monitoring use case.

View Video

Cloudera

Analytics
BI

The Art of Data Leadership | A discussion with Chief Digital Officer, Ray Kunik

Jul 24, 2023 By Cloudera In Cloudera

Our Chief Data & Analytics Officer, Shayde Christian, sits down for a buzzworthy conversation with Chief Digital Officer Raymond L. Kunik Jr. to discuss the “other” CDO role, the science behind work-life integration, the impact and applications of #AI, and its correlation with a pretty sweet hobby.

View Video

Cloudera

Analytics
BI

Why Reinvent the Wheel? The Challenges of DIY Open Source Analytics Platforms

Jul 24, 2023 By Andreas Skouloudis In Cloudera

In their effort to reduce their technology spend, some organizations that leverage open source projects for advanced analytics often consider either building and maintaining their own runtime with the required data processing engines or retaining older, now obsolete, versions of legacy Cloudera runtimes (CDH or HDP).

Read Post

Cloudera

Create and use a Webhook Notification in SQL Stream Builder

Jul 21, 2023 By Cloudera In Cloudera

A brief demo of how to create and use a webhook notification in SQL Stream Builder.

View Video

Cloudera

Analytics
BI

Boosting Object Storage Performance with Ozone Manager

Jul 19, 2023 By Ritesh Shukla In Cloudera

Ozone is an Apache Software Foundation project to build a distributed storage platform that caters to the demanding performance needs of analytical workloads, content distribution, and object storage use cases. The Ozone Manager is a critical component of Ozone. It is a replicated, highly available service that is responsible for managing the metadata for all objects stored in Ozone. As Ozone scales to exabytes of data, it is important to ensure that Ozone Manager can perform at scale.

Read Post

Cloudera

Applied Machine Learning Prototypes | The Future of Machine Learning

Jul 19, 2023 By Cloudera In Cloudera

Applied Machine Learning Prototypes, or AMPs, are pre-built applications that can be used as a starting point for your next machine learning project. These prototypes are designed to save time and resources by providing a tested and reliable solution to common machine learning problems. Cloudera + Dell + AMD.

View Video

Cloudera

Unlock the Full Potential of Hive

Jul 18, 2023 By Shirish Deshmukh In Cloudera

In the realm of big data analytics, Hive has been a trusted companion for summarizing, querying, and analyzing huge and disparate datasets. But let’s face it, navigating the world of any SQL engine is a daunting task, and Hive is no exception. As a Hive user, you will find yourself wanting to go beyond surface-level analysis and dive deep into the intricacies of how a Hive query is executed.

Read Post

Cloudera

One Big Cluster Stuck: Environment Health Scorecard

Jul 17, 2023 By Shayde Christian In Cloudera

Throughout the One Big Cluster Stuck series we’ve explored impactful best practices to gain control of your Cloudera Data Platform (CDP) environment and significantly improve its health and performance. We’ve shared code, dashboards, and tools to help you on your health improvement journey. We’d like to provide one last tool.

Read Post

Cloudera

Transforming Banking with AI/ML | OCBC and Cloudera Unleashing Data-Driven Insights

Jul 17, 2023 By Cloudera In Cloudera

Highlights for OCBC + Cloudera: LLM End User Meetup
Learn how Cloudera can help you trust your enterprise AI at https://www.cloudera.com/why-cloudera/enterprise-ai.html

View Video

Cloudera

Analytics
BI

From Hive Tables to Iceberg Tables: Hassle-Free

Jul 14, 2023 By Srinivas Rishindra Pothireddi In Cloudera

For more than a decade now, the Hive table format has been a ubiquitous presence in the big data ecosystem, managing petabytes of data with remarkable efficiency and scale. But as data volumes, data variety, and data usage grow, users face many challenges when using Hive tables because of their antiquated directory-based table format. Some of the common issues include constrained schema evolution, static partitioning of data, and long planning time because of S3 directory listings.

Read Post

Cloudera

12 Times Faster Query Planning With Iceberg Manifest Caching in Impala

Jul 13, 2023 By Riza Suminto In Cloudera

Iceberg is an emerging open table format designed for large analytic workloads. The Apache Iceberg project continues developing an implementation of the Iceberg specification in the form of a Java library. Several compute engines, such as Impala, Hive, Spark, and Trino, support querying data in the Iceberg table format by adopting this Java library provided by the Apache Iceberg project.

Read Post

Cloudera

Integrating Cloudera Data Warehouse with Kudu Clusters

Jul 11, 2023 By Varun Jaitly In Cloudera

Apache Impala and Apache Kudu make a great combination for real-time analytics on streaming data for time series and real-time data warehousing use cases. More than 200 Cloudera customers have implemented Apache Kudu with Apache Spark for ingestion and Apache Impala for real-time BI use cases successfully over the last decade, with thousands of nodes running Apache Kudu.

Read Post

Cloudera

Cloudera Data Catalog | Data Stewardship, Data Lakes, & GDPR in Pharma

Jul 10, 2023 By Cloudera In Cloudera

Explore the captivating world of Data Stewardship with a focus on Cloudera's Data Catalog. In this friendly and professional session, our esteemed speaker, Hemanth, will share his expertise and knowledge to foster collaboration and discussion among participants, as we delve into the intricacies of Data Lakes and GDPR compliance within the Pharma industry. During this interactive session, Hemanth will expertly guide participants through key concepts related to Cloudera Data Catalog, including.

View Video

Cloudera

Analytics
BI

Hive Acid Table Replication

Jul 7, 2023 By Cloudera In Cloudera

This video shows how Cloudera Data Platform gives you the ability to recover your Hive ACID tables in the event of a cluster failure.

View Video

Cloudera

Analytics
BI

Calving Apache Iceberg

Jul 3, 2023 By Cloudera In Cloudera

Apache Iceberg is an open-source high-performance format for huge analytic tables that brings the reliability and simplicity of SQL tables to big data. It enables engines like Spark, Trino, Flink, Presto, Hive, and Impala to work with the same tables, simultaneously and safely. Discover how Apache Iceberg can transform the way you store and manage your big data, and take your analytics to the next level.

View Video

Cloudera

Analytics
BI
