Analytics

Filter more pay less with the latest Cloudera Data Warehouse runtime!

One of the most effective ways to improve performance and minimize cost in database systems today is by avoiding unnecessary work, such as data reads from the storage layer (e.g., disks, remote storage), transfers over the network, or even data materialization during query execution. Since its early days, Apache Hive improves distributed query execution by pushing down column filter predicates to storage handlers like HBase or columnar data format readers such as Apache ORC.

How to use Snowflake Guides & Labs | Behind The Data Cloud

Developers, in this episode, you’ll learn how to kick off quickly with Snowflake Guides as well as how to access a repository of open source projects in Snowflake Labs. We’ll also reveal Snowflake’s Awesome List which contains key resources, learning opportunities, and open source demos. We switch things up with Daniel Myers from Developer Relations taking a turn as our guest, with Snowflake Community Manager Elsa Mayer acting as host. If you enjoy this episode, make sure to subscribe and share this video with a colleague.

Google BigQuery is a Leader in The 2021 Forrester Wave: Cloud Data Warehouse

We are thrilled to announce that Google has been named a Leader in The Forrester Wave™: Cloud Data Warehouse, Q1 2021 report. For more than a decade, BigQuery, our petabyte-scale cloud data warehouse, has been in a class of its own. We're excited to share this recognition and we want to thank our strong community of customers and partners for voicing their opinion. We believe this report validates the alignment of our strategy with our customers’ analytics needs.

Decentralized Data Teams Helped With Low Code

When a company is small, having a fully centralized data team may not be an issue. As you grow, however, problems can start to arise. You have one structure that’s supporting all of your business units, and they may not be able to dedicate sufficient time and resources to individual business units. This can lead to delays in surfacing important insights and decisions made on old or inaccurate data.

Prepare Your Data - The Self-Service Data Roadmap, Session 2 of 4

In this webinar, Unravel CDO and VP Engineering Sandeep Uttamchandani describes the second step for any large, data-driven project: the Prep phase. Having found the data you need in the Discover phase, it's time to get your data ready. You must structure, clean, enrich, and validate static data, and ensure that "live," updated or streamed data events are continually ready for processing.