Monthly Archive

Faster Analytics with Cloudera Data Warehouse (CDW) Demo Highlight

Jul 30, 2020 By Cloudera In Cloudera

The cloud-led journey to digital transformation requires organizations to become significantly more data-driven, yet traditional data warehouses have difficulty with new data volumes, new data types, and a variety of use cases. In this session, we will show you how Cloudera Data Warehouse offers a guide to your cloud journey by offering a modern hybrid cloud solution for an unprecedented scale that delivers insight to every part of your organization, faster while saving costs.

View Video

Cloudera

Read more about Faster Analytics with Cloudera Data Warehouse (CDW) Demo Highlight

Meeting Medical Device Data Privacy, Governance, and Security Challenges

Jul 30, 2020 By Michael Ger In Cloudera

Medical devices have become increasingly complex as technology evolves, and the sheer number of these devices now being worn or implanted has grown exponentially over the past few years. There are currently over 500,000 different types of smart, connected medical devices in use that have the ability to collect, share, or store private patient data and protected health information (PHI)(1).

Read Post

Cloudera

Read more about Meeting Medical Device Data Privacy, Governance, and Security Challenges

Zeppelin Architecture and Operational Workflow

Jul 29, 2020 By Cloudera In Cloudera

This video describes Zeppelin Architecture and Operational Workflow.

View Video

Cloudera

Analytics
BI

Read more about Zeppelin Architecture and Operational Workflow

The reinvention of the Telco: From Pipe to Processor

Jul 28, 2020 By Vijay Raja In Cloudera

The next generation of 5G networks are unlocking a mind-bending array of new use cases. Blistering speed, super low latency, and access to more powerful mobile hardware bring VR, AR and ultra high-definition experiences into sharp focus for the near future. But there’s a bigger shift being driven by 5G, and it’s not actually about speed at all. It’s about re-thinking the modern telco business model.

Read Post

Cloudera

Read more about The reinvention of the Telco: From Pipe to Processor

Building a Scalable Process Using NiFi, Kafka and HBase on CDP

Jul 28, 2020 By Tui Leauanae In Cloudera

Navistar is a leading global manufacturer of commercial trucks. With a fleet of 350,000 vehicles, unscheduled maintenance and vehicle breakdowns created ongoing disruption to their business. Navistar required a diagnostics platform that would help them predict when a vehicle needed maintenance to minimize downtime.

Read Post

Cloudera

Read more about Building a Scalable Process Using NiFi, Kafka and HBase on CDP

Enabling high-speed Spark direct reader for Apache Hive ACID tables

Jul 27, 2020 By Anishek Agarwal In Cloudera

Apache Hive supports transactional tables which provide ACID guarantees. There has been a significant amount of work that has gone into hive to make these transactional tables highly performant. Apache Spark provides some capabilities to access hive external tables but it cannot access hive managed tables. To access hive managed tables from spark Hive Warehouse Connector needs to be used.

Read Post

Cloudera

Read more about Enabling high-speed Spark direct reader for Apache Hive ACID tables

Digital Transformation is Way More than Just Digital

Jul 24, 2020 By Michael Ger In Cloudera

Over the last 25 years, I have an unparalleled front seat to the digital transformation that is now accelerating in the connected manufacturing and automotive industry. Not many people have had the opportunity to witness the transformation and be as active in this area as I have; I consider myself lucky.

Read Post

Cloudera

Read more about Digital Transformation is Way More than Just Digital

The benefits of building an on-demand data lake in healthcare

Jul 22, 2020 By Abbas Mooraj In Cloudera

This blog was written in partnership with Navdeep Alam, Senior Director, Global Data Warehouse, IQVIA Healthcare is unique. It isn’t defined like other businesses by how much revenue can be generated, but more in terms of achieving positive health outcomes, better value, and saving lives through the rapid development of new treatments and therapies.

Read Post

Cloudera

Read more about The benefits of building an on-demand data lake in healthcare

The Rise Of Connected Manufacturing - How Data Is Driving Innovation Part II

Jul 20, 2020 By Cindy Maike In Cloudera

A Shift Towards Industry 4.0 Is Improving Manufacturing Efficiency And Increasing Innovation In Part II of our series with Michael Ger, Managing Director of Manufacturing and Automotive at Cloudera, he looks in greater detail at how AI, big data, and machine learning are impacting connected living and the evolution of autonomous driving.

Read Post

Cloudera

Read more about The Rise Of Connected Manufacturing - How Data Is Driving Innovation Part II

Cloudera Operational Database experience (dbPaaS) available as Technical Preview

Jul 20, 2020 By Krishna Maheshwari In Cloudera

The Cloudera Operational Database (COD) experience is a managed dbPaaS solution which abstracts the underlying cluster instance as a Database. It can auto-scale based on the workload utilization of the cluster and will be adding the ability to auto-tune (better performance within the existing infrastructure footprint) and auto-heal (resolve operational problems automatically) later this year.

Read Post

Cloudera

Read more about Cloudera Operational Database experience (dbPaaS) available as Technical Preview

Operational Database Scalability

Jul 17, 2020 By Liliana Kadar In Cloudera

Cloudera’s Operational Database provides unparalleled scale and flexibility for applications, enabling enterprises to bring together and process data of all types and from more sources, while providing developers with the flexibility they need. In this blog, we’ll look into capabilities that make Operational Database the right choice for hyperscale.

Read Post

Cloudera

Read more about Operational Database Scalability

Minimizing Cloud Concentration Risk for Financial Services Institutions, Regulators and Cloud Service Providers

Jul 16, 2020 By Richard Harmon In Cloudera

Since the financial crisis of 2008, regulators have been consistently working to identify emerging risks that can potentially result in financial stability events. The growth in cloud adoption across the Financial Services Industry (FSI) and the associated increase in reliance on third-party infrastructure providers has gained the attention of regulators at global, regional, and national levels.

Read Post

Cloudera

Read more about Minimizing Cloud Concentration Risk for Financial Services Institutions, Regulators and Cloud Service Providers

Connected Manufacturing Insights from the Edge with Cloudera DataFlow

Jul 15, 2020 By David LeGrand In Cloudera

Connected Manufacturing’s Pivot to an Enterprise Data Solution Connected Manufacturing is at a turning point and it is catalyzed by a real, measurable change and shift in data types – real-time and time-series data is growing 50% faster than latent or static data forms and streaming analytics projected to grow at a 28% CAGR, leaving legacy data platforms that specialize in static historical data solutions, functioning on-prem or in discrete clouds, inadequate in addressing today’s rea

Read Post

Cloudera

Read more about Connected Manufacturing Insights from the Edge with Cloudera DataFlow

Building an effective data approach in a hybrid cloud world

Jul 14, 2020 By Caitriona Snell In Cloudera

“In today’s world of disruption and transformation, there are a few key things that all organizations are trying to figure out: how to remain relevant to their customer base, how to deal with the pressure of disruption in their industry and, undoubtedly, how to look to technology to help deliver a better service.” Paul Mackay Today we are sitting down with Marc Beierschoder, Analytics & Cognitive Offering Lead at Deloitte Germany and Paul Mackay, the EMEA Cloud Lead at Cloudera to dis

Read Post

Cloudera

Read more about Building an effective data approach in a hybrid cloud world

Public cloud: security fright or delight?

Jul 13, 2020 By Wim Stoop In Cloudera

Learning additional languages is a common practice in the Netherlands. In primary school, we learn English and secondary school offers French, German, and a host of other options. Learning a new language and speaking it well is tricky.

Read Post

Cloudera

Read more about Public cloud: security fright or delight?

CDP Private Cloud ends the battle between agility & control in the data center

Jul 13, 2020 By Tom Deane In Cloudera

As a BI Analyst, have you ever encountered a dashboard that wouldn’t refresh because other teams were using it? As a data scientist, have you ever had to wait 6 months before you could access the latest version of Spark? As an application architect, have you ever been asked to wait 12 weeks before you could get hardware to onboard a new application?

Read Post

Cloudera

Read more about CDP Private Cloud ends the battle between agility & control in the data center

Apache Hadoop YARN in CDP Data Center 7.1: What's new and how to upgrade

Jul 10, 2020 By Szilard Nemeth In Cloudera

This blogpost will cover how customers can migrate clusters and workloads to the new Cloudera Data Platform – Data Center 7.1 (CDP DC 7.1 onwards) plus highlights of this new release. CDP DC 7.1 is the on-premises version of Cloudera Data Platform.

Read Post

Cloudera

Read more about Apache Hadoop YARN in CDP Data Center 7.1: What's new and how to upgrade

Overview of the Operational Database performance in CDP

Jul 9, 2020 By Liliana Kadar In Cloudera

This article gives you an overview of Cloudera’s Operational Database (OpDB) performance optimization techniques. Cloudera’s Operational Database can support high-speed transactions of up to 185K/second per table and a high of 440K/second per table. On average, the recorded transaction speed is about 100K-300K/second per node. This article provides you an overview of how you can optimize your OpDB deployment in either Cloudera Data Platform (CDP) Public Cloud or Data Center.

Read Post

Cloudera

Read more about Overview of the Operational Database performance in CDP

Eliminate the pitfalls on your path to public cloud

Jul 8, 2020 By Wim Stoop In Cloudera

As organizations look to get smarter and more agile in how they gain value and insight from their data, they are now able to take advantage of a fundamental shift in architecture. In the last decade, as an industry, we have gone from monolithic machines with direct-attached storage to VMs to cloud. The main attraction of cloud is due to its separation of compute and storage – a major architectural shift in the infrastructure layer that changes the way data can be stored and processed.

Read Post

Cloudera

Read more about Eliminate the pitfalls on your path to public cloud

How to run queries periodically in Apache Hive

Jul 8, 2020 By Zoltan Haindrich In Cloudera

In the lifecycle of a data warehouse in production, there are a variety of tasks that need to be executed on a recurring basis. To name a few concrete examples, scheduled tasks can be related to data ingestion (inserting data from a stream into a transactional table every 10 minutes), query performance (refreshing a materialized view used for BI reporting every hour), or warehouse maintenance (executing replication from one cluster to another on a daily basis).

Read Post

Cloudera

Read more about How to run queries periodically in Apache Hive

Introducing FlinkSQL in Cloudera Streaming Analytics

Jul 7, 2020 By Marton Balassi In Cloudera

Our 1.2.0.0 release of Cloudera Streaming Analytics Powered by Apache Flink brings a wide range of new functionality, including support for lineage and metadata tracking via Apache Atlas, support for connecting to Apache Kudu and the first iteration of the much-awaited FlinkSQL API. Flink’s SQL interface democratizes stream processing, as it caters to a much larger community than the currently widely used Java and Scala APIs focusing on the Data Engineering crowd.

Read Post

Cloudera

Read more about Introducing FlinkSQL in Cloudera Streaming Analytics

Systems | Development | Analytics | API | Testing

Faster Analytics with Cloudera Data Warehouse (CDW) Demo Highlight

Meeting Medical Device Data Privacy, Governance, and Security Challenges

Zeppelin Architecture and Operational Workflow

The reinvention of the Telco: From Pipe to Processor

Building a Scalable Process Using NiFi, Kafka and HBase on CDP

Enabling high-speed Spark direct reader for Apache Hive ACID tables

Digital Transformation is Way More than Just Digital

The benefits of building an on-demand data lake in healthcare

The Rise Of Connected Manufacturing - How Data Is Driving Innovation Part II

Cloudera Operational Database experience (dbPaaS) available as Technical Preview

Operational Database Scalability

Minimizing Cloud Concentration Risk for Financial Services Institutions, Regulators and Cloud Service Providers

Connected Manufacturing Insights from the Edge with Cloudera DataFlow

Building an effective data approach in a hybrid cloud world

Public cloud: security fright or delight?

CDP Private Cloud ends the battle between agility & control in the data center

Apache Hadoop YARN in CDP Data Center 7.1: What's new and how to upgrade

Overview of the Operational Database performance in CDP

Eliminate the pitfalls on your path to public cloud

How to run queries periodically in Apache Hive

Introducing FlinkSQL in Cloudera Streaming Analytics

Monthly Archive

Follow Us