Systems | Development | Analytics | API | Testing

Data Science

Building a Machine Learning Application With Cloudera Data Science Workbench And Operational Database, Part 1: The Set-Up & Basics

Introduction Python is used extensively among Data Engineers and Data Scientists to solve all sorts of problems from ETL/ELT pipelines to building machine learning models. Apache HBase is an effective data storage system for many workflows but accessing this data specifically through Python can be a struggle. For data professionals that want to make use of data stored in HBase the recent upstream project “hbase-connectors” can be used with PySpark for basic operations.

Data Science vs. Data Engineering: What You Need to Know

According to The Economist, “the world’s most valuable resource is no longer oil, but data.” Despite the value of enterprise data, much has been written about the so-called “data science shortage”: the supposed lack of professionals with knowledge of how to use and manipulate big data. A 2018 study by LinkedIn estimated that there were more than 151,000 unfilled jobs in the U.S. requiring data science skills.

How Has COVID-19 Impacted Data Science?

The COVID-19 pandemic disrupted supply chains and brought economies around the world to a standstill. In turn, businesses need access to accurate, timely data more than ever before. As a result, the demand for data analytics is skyrocketing as businesses try to navigate an uncertain futured. However, the sudden surge in demand comes with its own set of challenges.

The Modern Data Science Stack

Automated data integration can help you jumpstart your machine learning efforts. Learn about the modern data science stack. It’s an oft-cited factoid that data scientists spend only 20% of their time doing actual data science work, while the rest of their time is spent on doing what is often delightfully referred to as “data munging” — that is, obtaining, cleaning, and preparing data for analysis.

3 Snowflake Features That Make Data Science Easier

Data science is proving to be a major competitive advantage for companies. While business intelligence (BI) helps companies with reporting and historical analysis, data science goes a step further and predicts the future. It can leverage much more data from many more sources, and using machine learning (ML) principles, it automatically identifies patterns and trends to model, predict, or forecast future outcomes.

A Dose Of Data Science Demystification

Join two data engineers and analysts in pulling back the curtain on real customer engagements, showing how to select and implement advanced data science and analytic techniques. In this session we will discuss our implementation of two data science models at a large agricultural products manufacturer: a propensity-to-buy model and a recommendation engine. We will discuss how each of these models works and how they were implemented for our client.