Building an Automated ML Pipeline with a Feature Store Using Iguazio & Snowflake

When operationalizing machine learning and deep learning, a production-first approach is essential for moving from research and development to scalable production pipelines faster and more effectively. Without the need to refactor code, add glue logic, and spend significant effort on data and ML engineering, more models make it to production, with fewer issues such as drift.

Data Vault Techniques on Snowflake: Streams and Tasks on Views

Snowflake removes the need to perform maintenance tasks on your data platform and provides you with the freedom to choose your data model methodology for the cloud. When attempting to keep the cost of data processing low, both data volume and velocity can make things challenging.

SCIM (System for Cross-domain Identity Management)

The identity team at Cloudera has been working to add System for Cross-domain Identity Management (SCIM) support to Cloudera Data Platform (CDP), and we’re happy to announce the general availability of SCIM on Azure Active Directory! Part One, CDP SCIM Support for Active Directory, covered the core elements of CDP’s SCIM support for Azure AD.

About the State of Value Stream Management in 2022

Value Stream Management (VSM) is about empowering delivery organizations to measure, mitigate, and monitor complexity. Simply put, it aims to improve the flow of value in your organization. The VSM Consortium recently released its highly anticipated report, “The State of Value Stream Management 2022”. In this post we recap some of the findings and look at them specifically from a software engineering and platform engineering point of view.

DataOps Observability Designed for Data Teams

Today every company is a data company. And even with all the great new data systems and technologies, it’s people—data teams—who unlock the power of data to drive business value. But today’s data teams are getting bogged down. They’re struggling to keep pace with the increased volume, velocity, variety, complexity—and cost—of the modern data stack. That’s where Unravel DataOps observability comes in.

Why ETL is Critical for Ecommerce Data Success & How to Start

It’d be hard to find anyone who’d say that taking a data-driven approach to business decisions is not worthwhile. Yet many businesses aren’t doing it because, as simple as it may sound on paper, it takes a great deal of strategic planning to pull off. One of the most crucial tools for data-driven decision-making is ETL.

How To Deploy a HuggingFace Model (Seamlessly)

What if I want to serve a HuggingFace model on ClearML? Where do I start? Machine learning engineers know by now that a good model serving engine is invaluable when serving models in production. These days, NVIDIA’s Triton inference engine is a popular option for this, but it is lacking in some respects.