Systems | Development | Analytics | API | Testing

%term

Apache Hadoop YARN in CDP Data Center 7.1: What's new and how to upgrade

This blogpost will cover how customers can migrate clusters and workloads to the new Cloudera Data Platform – Data Center 7.1 (CDP DC 7.1 onwards) plus highlights of this new release. CDP DC 7.1 is the on-premises version of Cloudera Data Platform.

5 Challenges of Simplifying DevOps for Data Apps

The benefits of building a DevOps culture for software companies are clear. DevOps practices integrate once-siloed teams across the software development lifecycle, from Dev to QA to Ops, resulting in both faster innovation and improved product quality. As a result, most software development teams have deployed tools to enable DevOps practices across their workflow.

Top Seven E-commerce Platforms in 2020

The introduction of e-commerce stores has made life so easy for the people. It does not make a difference if you are the consumer or a seller. For a seller, it provides the opportunity to express the worth of their brand and product(s). For a consumer, it gives them an all in one platform, where they can shop for multiple categories. With this, the most significant ease for both parties is to opt for e-commerce business is that you can do all this without taking a step out of their house.

A Dose Of Data Science Demystification

Join two data engineers and analysts in pulling back the curtain on real customer engagements, showing how to select and implement advanced data science and analytic techniques. In this session we will discuss our implementation of two data science models at a large agricultural products manufacturer: a propensity-to-buy model and a recommendation engine. We will discuss how each of these models works and how they were implemented for our client.

Make Your Data Fabrics Work Better

To gain the full benefits of the DataOps strategy, your data lakes must change. The traditional concept of bringing all data to one place, whether on-premises or in the cloud, raises questions of timing, scale, organization and budget. The answer? Data fabric. It replaces traditional data lake organization concepts with a more flexible and economical architecture. In this session, we'll define what a data fabric is, show you how you can begin organizing around the concept, and discuss how to align it to your business objectives.

How to add business logic with content-based router

To explain the usage of our content-based router that enables you to add some business logic to your integration flow, let’s imagine that I have received a Google Spreadsheet file with unsorted leads data, for example, from downloading a coupon on a website. To convert these leads to customers, I’d like to offer them something of interest with a specifically targeted campaign based on the country of residence. For that, I need to filter and split the list.

Demand for Data Grows in Agriculture

Agriculture (Ag) is the oldest and largest industrial vertical in the world, and its importance continues to grow as it becomes more challenging for people to access healthy and fresh food. A recent Agriculture Analytics Market report, released by Markets and Markets, estimates that by 2023, the global agriculture analytics market size will grow from 585 million to 1.2 billion dollars as demands for real-time data analysis and improved operations increase.

How to Create a Python Stack

All programming languages provide efficient data structures that allow you to logically or mathematically organize and model your data. Most of us are familiar with simpler data structures like lists (or arrays) and dictionaries (or associative arrays), but these basic array-based data structures act more as generic solutions to your programming needs and aren’t really optimized for performance on custom implementations. There’s much more than programming languages bring to the table.