Analytics

What is File Transfer Protocol?

Transferring files between two or more machines is an essential part of the ETL (extract, transform, load) process. Of course, there are multiple ways to move data, including flat file databases. For example, you can physically copy the data onto a USB drive or send it to the recipient via email. But methods like these are far less efficient than sending data via FTP. So what is FTP exactly, and how do you use it to transfer files and data? Keep reading for all the answers.

The Ethics of AI Comes Down to Conscious Decisions

This blog post was written by Pedro Pereira as a guest author for Cloudera. Right now, someone somewhere is writing the next fake news story or editing a deepfake video. An authoritarian regime is manipulating an artificial intelligence (AI) system to spy on technology users. No matter how good the intentions behind the development of a technology, someone is bound to corrupt and manipulate it. Big data and AI amplify the problem. “If you have good intentions, you can make it very good.

The Complete Guide to GDPR Compliance

The General Data Protection Regulation (GDPR) is a landmark piece of legislation that affects how organizations can handle, process, and store the personal data of European Union (EU) citizens and residents. But what does the GDPR require exactly, and how can you be sure that your organization complies with it? We go over everything you need to know in this all-in-one guide to GDPR compliance.

Say Goodbye to Data Quality with ELT

ELT is a three-step process that first extracts raw, structured, and unstructured data from source databases, applications, data stores, and other repositories. It then loads that data into a data lake and transforms it as needed by analysts. Since it doesn't move the data to an intermediate staging area or transform it before loading, the extraction process is speedy. You don’t need to pick and choose what data loads into the data lake or wait for it to be processed.

AutoZone: Exceeding customer expectations with speed of service

“Talend is amazing because it’s open, flexible, and visual. The robustness and reliability of Talend have made it an integral part of our solution set. It’s easy to learn and fast to ramp up.” – Jason Vogel, IT Manager, AutoZone AutoZone is America’s #1 vehicle solutions provider. It was founded in 1979 and has since expanded to more than 6,400 stores across three countries, with over 96,000 employees.

Real-time Change Data Capture for data replication into BigQuery

Businesses hoping to make timely, data-driven decisions know that the value of their data may degrade over time and can be perishable. This has created a growing demand to analyze and build insights from data the moment it becomes available, in real-time.