Analytics

How we build custom data extractors to meet client ETL Needs

On one of our webinars in April 2020 we talked about the developer portal and how our developer community are pushing the Keboola Connection platform into places that often surprise our own core team. Our partners often are the creative ones, adding their knowledge and expertise to expand our platform in service of our shared customers and their varying needs. This is a guest post, written by Johnathan Brook, Solutions Architect at 4 Mile Analytics.

Databases Demystified Lesson 1 Introduction to Databases and SQL

In the first episode of Databases Demystified with Michael Kaminsky, we give a high-level overview of the most important concepts in databases. We start with a brief history of databases going from the invention of relational databases through present day and we talk about the differences between analytical and transactional databases, distributed and single-node databases, and in-memory vs on-disk databases We finish up talking briefly about SQL and what makes it special.

5 Challenges of Building Data Applications

Fast-growing software companies are building data applications for a variety of uses, from marketing apps that provide customer insights, to IoT apps that handle device feedback, and data analytics apps that process both historical and near real-time data. But developers often face obstacles when building, designing, and supporting applications that need to parse large volumes of information.

Are Your Machine Learning Models Wrong?

In addition to the very real negative impact on every person around the world, the COVID-19 pandemic is driving business disruptions and closures at an unprecedented scale. Enormous government stimulus programs are resulting in explosions in fiscal deficits, regulators are relaxing capital constraints on banks and central banks are supporting economic stability with a range of interest rate cuts and other stimulus measures.

Databases Demystified: Lesson 7: Distributed Databases Part 2

Episode 7 of Michael Kaminsky's Databases Demystified. Learn about new issues we face in distributed databases and all about the CAP theorem. We'll talk about leader and follower nodes, what happens when distributed databases lose connection with a node, and what CAP stands for: consistency, availability, and partition tolerance.

Databases Demystified Lesson 4: Transactions Part 1

In this episode of Michael Kaminksy's Databases Demystified, we learn all about what a transaction is, and what ACID means. Learn why database constraints are important, and what the commands "begin" "commit" and "rollback" mean. We talk about atomicity, consistency, isolation, and durability and why transactions are so important.

Databases Demystified Lesson 6: Distributed Databases Part 1

Welcome to episode 6 of Michael Kaminsky's Databases Demystified. In this lesson, we introduce a fascinating and incredibly important topic: distributed databases. We discuss "nodes" and "clusters" and we cover the two major paradigms in distributed databases: big-compute databases and high-availability databases.