Systems | Development | Analytics | API | Testing

BI

Yellowfin does it again - Gartner 2021 Magic Quadrant

Once again, Yellowfin has been recognized in the Gartner Magic Quadrant. This is the eighth time that we have been recognized and for the second year, we are in the visionary quadrant. Yellowfin is also the only Australian vendor to be included. Gartner has recognized Yellowfin for three things in particular - innovation, breadth of capability and openness.

Exploring Data & Dashboard Creation on CDP Public Cloud

In this video, we'll walk through an example on how you can use Cloudera Data Warehouse to both easily run ad hoc queries against data as well as turn the results of those queries into beautiful, interactive, data visualizations and dashboards that show off the results of your data exploration.

How to use Snowflake Guides & Labs | Behind The Data Cloud

Developers, in this episode, you’ll learn how to kick off quickly with Snowflake Guides as well as how to access a repository of open source projects in Snowflake Labs. We’ll also reveal Snowflake’s Awesome List which contains key resources, learning opportunities, and open source demos. We switch things up with Daniel Myers from Developer Relations taking a turn as our guest, with Snowflake Community Manager Elsa Mayer acting as host. If you enjoy this episode, make sure to subscribe and share this video with a colleague.

Filter more pay less with the latest Cloudera Data Warehouse runtime!

One of the most effective ways to improve performance and minimize cost in database systems today is by avoiding unnecessary work, such as data reads from the storage layer (e.g., disks, remote storage), transfers over the network, or even data materialization during query execution. Since its early days, Apache Hive improves distributed query execution by pushing down column filter predicates to storage handlers like HBase or columnar data format readers such as Apache ORC.

Prepare Your Data - The Self-Service Data Roadmap, Session 2 of 4

In this webinar, Unravel CDO and VP Engineering Sandeep Uttamchandani describes the second step for any large, data-driven project: the Prep phase. Having found the data you need in the Discover phase, it's time to get your data ready. You must structure, clean, enrich, and validate static data, and ensure that "live," updated or streamed data events are continually ready for processing.

CDP Endpoint Gateway provides Secure Access to CDP Public Cloud Services running in private networks

Cloudera Data Platform (CDP) Public Cloud allows users to deploy analytic workloads into their cloud accounts. These workloads cover the entire data lifecycle and are managed from a central multi-cloud Cloudera Control Plane. CDP provides the flexibility to deploy these resources into public or private subnets. Nearly unanimously, we’ve seen customers deploy their workloads to private subnets.

Why DataOps is Critical for Your Business

Data is often compared to oil – it powers today’s organizations, just like the fossil fuel powered companies of the past. Just like oil, the data that companies collect needs to be refined, structured, and easily analyzed in order for it to really provide value in the form of gaining actionable insights. Every organization today is in the process of harnessing the power of their data using advanced analytics, which is likely running on a modern data stack.

Analyzing Python package downloads in BigQuery

The Google Cloud Public Datasets program recently published the Python Package Index (PyPI) dataset into the marketplace. PyPI is the standard repository for Python packages. If you’ve written code in Python before, you’ve probably downloaded packages from PyPI using pip or pipenv. This dataset provides statistics for all package downloads, along with metadata for each distribution. You can learn more about the underlying data and table schemas here.