Systems | Development | Analytics | API | Testing

Analytics

The BigQuery admin reference guide: Resource Hierarchy

Starting this week, we’re adding new content to the BigQuery Spotlight Youtube series. Throughout the summer we’ll be adding new videos and blog posts focused on helping new BigQuery architects and administrators master the fundamentals. You can find complimentary material for the topics discussed in the official BigQuery documentation.

AWS Data Pipeline Best Practices

Knowing best practices for Amazon Web Services (AWS) data pipelines is essential for modern companies handling large datasets and requiring secure ETL (Extract, Transform, Load) processes. In this article, we discuss AWS data pipeline best practices to ensure top performance and streamlined processes — without complications that can impede the execution of data transfer.

5 Steps to Prevent PII Data Breaches

When it was revealed in September 2017, the massive Equifax data breach made international headlines. As one of the three major credit agencies in the United States, Equifax is responsible for processing personally identifiable information (PII) such as individuals’ names, addresses, and social security numbers. According to Equifax, 143 million people were affected by the data breach, making it one of the biggest cybersecurity disasters in history.

Migrate Hive data from CDH to CDP public cloud

Many Cloudera customers are making the transition from being completely on-prem to cloud by either backing up their data in the cloud, or running multi-functional analytics on CDP Public cloud in AWS or Azure. The Replication Manager service facilitates both disaster recovery and data migration across different environments.

Getting to Know the Apache Hadoop Technology Stack

With technology innovations raging at incredible speeds over the past few decades, new and exciting platforms for gathering, storing, transforming, and manipulating data are entering the market every day. Apache Hadoop was one of these disrupters when it entered the market in 2006, offering distributed storage and big data processing using a network of many computers.

The Ultimate Guide to HIPAA

The Healthcare Insurance Portability and Accountability Act (HIPAA) has been an important federal law in healthcare since 1996. Part of its purpose was to create standards meant to protect sensitive patient information, and it took on even more important once the digitalization of patient health records became widespread. Now it’s required for certain types of businesses to protect patient health information—or face fines that range from $100 to $50,000 per violation.

Deploying applications on CDP Operational Database (COD)

CDP Operational Database Experience (COD) is a PaaS offering on the Cloudera Data Platform (CDP). COD enables you to create a new operational database with a few clicks and auto-scales based on your workload. Behind the scenes, COD automatically manages cluster deployment and configuration, reducing overheads related to setting up new database instances. Additionally, auto-scaling eliminates the need to size a cluster for your workloads.

Ecommerce analytics 101: The ultimate guide

To grow your eCommerce business ahead of your competitors you need to rely on analytics. Ecommerce analytics is the compass that replaces your gut feelings as you scale your e-shop to higher grounds and more online sales. In this ultimate guide to eCommerce analytics we will look at: What is eCommerce analytics? Why is eCommerce analytics crucial for the success of your store? What are the best metrics and KPIs to track for eCommerce?