Analytics

What are ETL tools?

Dec 4, 2020 By Brandon Chen In Fivetran

Thinking of building out an ETL process or refining your current one? Read more to learn about how ETL tools give you time to focus on building data models. ETL stands for extract-transform-load, and is commonly used when referring to the process of data integration. Extract refers to pulling data from a particular data source. Transforms are used to make that data into a processable format. Load is the final step to drop the data into the designated target.

Read Post

Fivetran

Read more about What are ETL tools?

Achieve Pin-Point Historical Analysis of Your Salesforce Data

Dec 3, 2020 By Amy Peterson In Fivetran

Want to look at how data has changed over time? Simply enable history mode, a Fivetran feature that data analysts can turn on for specific tables to analyze historical data. The feature achieves Type 2 Slowly Changing Dimensions (Type 2 SCD), meaning a new timestamped row is added for every change made to a column. We launched history mode for Salesforce in May and have been delighted with the response.

Read Post

Fivetran

Read more about Achieve Pin-Point Historical Analysis of Your Salesforce Data

Moving Big Data and Streaming Data Workloads to AWS

Dec 3, 2020 By Unravel In Unravel

Cloud migration may be the biggest challenge, and the biggest opportunity, facing IT departments today - especially if you use big data and streaming data technologies, such as Cloudera, Hadoop, Spark, and Kafka. In this 55-minute webinar, Unravel Data product marketer Floyd Smith and Solutions Engineering Director Chris Santiago describe how to move workloads to AWS EMR, Databricks, and other destinations on AWS, fast and at the lowest possible cost.

View Video

Unravel

Read more about Moving Big Data and Streaming Data Workloads to AWS

How to Transform and Load Data into MongoDB

Dec 3, 2020 By Xplenty In Xplenty

Teri will demonstrate how to build a data pipeline that helps a health organization cleanse and mask their sensitive PII data, before centralizing it in a Mongo database which is optimized for handling large data volumes.

View Video

Xplenty

Read more about How to Transform and Load Data into MongoDB

Fivetran vs. MuleSoft vs. Xplenty : An ETL Comparison

Dec 3, 2020 By Mark Smallcombe In Integrate

The key differences between Fivetran, MuleSoft, and Xplenty: Hiring a data scientist or engineer can cost up to $140,000 per year —something many businesses can't afford. Still, organizations need to pull data from different locations into a data lake or warehouse for business insights. An Extract, Transform, and Load (ETL) platform makes this process easier, but few organizations have the technical or coding know-how to make it happen.

Read Post

Integrate

Read more about Fivetran vs. MuleSoft vs. Xplenty : An ETL Comparison

How leading organizations govern their data to find success

Dec 2, 2020 By Talend In Talend

With the increased focus on delivering value customers, it is imperative to build a next generation customer hub that delivers high quality and governed data. In this video we will share best practices for implementing a comprehensive data governance approach. Learn how to leverage the capabilities of the Talend Data Fabric to deploy a forward-looking data management architecture that detects and retrieves metadata from across databases and applications, builds data lineage, and adds traceability.

View Video

Talend

Analytics
BI

Read more about How leading organizations govern their data to find success

Data Automation: How to do it properly

Dec 2, 2020 By Keboola In Keboola

Economists have predicted that a leisurely 15-hour workweek awaits us in the future, with robots taking over the menial tasks so that we’ll be free to explore the more cognitively stimulating aspects of our jobs. Sounds like science fiction, right?

Read Post

Keboola

Read more about Data Automation: How to do it properly

How to configure clients to connect to Apache Kafka Clusters securely - Part 1: Kerberos

Dec 2, 2020 By Andre Araujo In Cloudera

This is the first installment in a short series of blog posts about security in Apache Kafka. In this article we will explain how to configure clients to authenticate with clusters using different authentication mechanisms.

Read Post

Cloudera

Read more about How to configure clients to connect to Apache Kafka Clusters securely - Part 1: Kerberos

Hive vs. SQL: Which One Performs Data Analysis Better?

Dec 2, 2020 By Mark Smallcombe In Integrate

Key differences between Hive and SQL: Big data requires powerful tools. Successful organizations query, manage and analyze thousands of data sets from hundreds of data sources. This is where tools like Hive and SQL come in. Although very different, both query and program big data. But which tool is right for your organization? In this review, we compare Hive vs. SQL on features, prices, support, user scores, and more.

Read Post

Integrate

Read more about Hive vs. SQL: Which One Performs Data Analysis Better?

Data in Discussion: Navigating the Machine Learning Lifecycle

Dec 1, 2020 By Cloudera In Cloudera

In this video, enterprise data and machine learning experts Sam Charrington (TWIML) and Sushil Thomas (Cloudera ML) discuss what is required to effectively operationalize ML in the enterprise — from requirements across the ML lifecycle to enabling decision-makers.

View Video