Systems | Development | Analytics | API | Testing

Latest News

3 Reasons Extract, Load & Transform is a Bad Idea

Extract, Load, Transform (ELT) technology makes it easy for organizations to pull data from databases, applications, and other sources, and move it into a data lake. But companies pay for this convenience in many ways. ELT solutions can have a negative impact on data privacy, data quality, and data management.

ETL with Apache Airflow

Written in Python, Apache Airflow is an open-source workflow manager used to develop, schedule, and monitor workflows. Created by Airbnb, Apache Airflow is now being widely adopted by many large companies, including Google and Slack. Being a workflow management framework, Apache Airflow differs from other frameworks in that it does not require exact parent-child relationships. Instead, you only need to define parents between data flows, automatically organizing them into a DAG (directed acyclic graph).

Create a Salesforce ETL Pipeline in 30 Minutes

Salesforce is one of the world’s most popular CRM (customer relationship management) software platforms, helping businesses of all sizes and industries beat their competitors and better serve their clients. But instead of keeping your Salesforce data inside the CRM platform itself, you can make better use of this information by moving it into a target data warehouse.

ELT: Easy to Deploy, Easy to Outgrow

Extract, load, transform (ELT) technology is a type of data pipeline that ingests data from one or more sources, loads the data into its destination (typically a data lake), and then allows end-users to perform ad-hoc transformations on it as needed. ELT can perform mass extraction of all data types, including raw data, without the need to set up transformation rules and filters before data loading.

What is File Transfer Protocol?

Transferring files between two or more machines is an essential part of the ETL (extract, transform, load) process. Of course, there are multiple ways to move data, including flat file databases. For example, you can physically copy the data onto a USB drive or send it to the recipient via email. But methods like these are far less efficient than sending data via FTP. So what is FTP exactly, and how do you use it to transfer files and data? Keep reading for all the answers.

Say Goodbye to Data Quality with ELT

ELT is a three-step process that first extracts raw, structured, and unstructured data from source databases, applications, data stores, and other repositories. It then loads that data into a data lake and transforms it as needed by analysts. Since it doesn't move the data to an intermediate staging area or transform it before loading, the extraction process is speedy. You don’t need to pick and choose what data loads into the data lake or wait for it to be processed.