Analytics

How to optimize your cloud data costs: 4 steps to reduce cloud data platform costs

If you have managed a cloud data platform, you have undoubtedly gotten that call. You know the one, it's usually from finance or the office of the CFO, inquiring about your monthly spend. And it usually comes in one of two forms: While both are clear and present dangers to cloud data platform owners, they don’t have to be.

Consumer GPUs vs Datacenter GPUs for CV: The Surprising Cost-Effective Winner

We recently rolled out our very own GPU autoscaler in Collaboration with Genesis Cloud and it has been quite a success. Also recently, YOLOv8 by Ultralytics was unveiled, the new king of object detection, segmentation and classification. In this blogpost we’ll see that you can train a computer vision model using the ClearML/Genesis Cloud autoscaler at a fraction of the cost of competing cloud services like AWS or GCP. And it even runs 100% off of green energy! 😎

Iceberg Tables: Catalog Support Now Available

As announced at Snowflake Summit 2022, Iceberg Tables combines unique Snowflake capabilities with Apache Iceberg and Apache Parquet open source projects to support your architecture of choice. As part of the latest Iceberg release, we’ve added catalog support to the Iceberg project to ensure that engines outside of Snowflake can interoperate with Iceberg Tables.

ThoughtSpot for Google Cloud Platform

ThoughtSpot is partnering with Google Cloud to expand self-service analytics capabilities beyond the dashboards! Now you can use AI-powered search to query Google BigQuery in real-time, access the Looker semantic layer to obtain reliable and standardized data models, and close the productivity loop with ThoughtSpot plugins for Google Sheets, Connected Sheets, and Slides.

How Manufacturers Drive Profits with Connected Products

It’s been a decade since “connected” objects—commonly referred to as “the internet of things” (IoT)— reached broad audiences. Connected toothbrushes, sensors embedded in sneakers, and smart watches have started to change consumer behavior through a data-driven, gamified approach. Technology has rapidly evolved to handle large data volumes at high velocities and big data analytics. AI has become more democratized.

5 engineering tools every analytics and data engineer needs to know

Are you considering venturing into the world of analytics engineering? Analytics engineers are the newest addition to data teams and sit somewhere between data engineers and data analysts. They are technical, business savvy, and love to learn. A huge part of an analytics engineer’s role is learning new modern data tools to implement within data stacks.

Kubeflow Vs. MLflow Vs. MLRun: Which One is Right for You?

The open source ML tooling ecosystem has become vast in the last few years, with many tools both overlapping in their capabilities as well as complimenting each other nicely. In part because AI/ML is a still-immature practice, the messaging around what all these tools can accomplish can be quite vague. In this article, we’ll dive into three tools to better understand their capabilities, and how they fit into the ML lifecycle.

Setting up Google BigQuery as a data warehouse in minutes

In this tutorial, learn how to set up a new Google BigQuery cloud-based data warehouse account and extract data from all your data sources using Stitch in less than 3 minutes. Stitch partners with the most common data warehouses and data lakes to help move your data from sources like Shopify, MongoDB, LinkedIn Ads, Zapier, Hubspot, SendGrid, Google Analytics, and more. Google Analytics. Watch this step-by-step tutorial on how to set up Google BigQuery for data storage.