Why you need metadata management and how to approach it

As your data operations evolve, they become messier. Diverse data sources, each with its own data model, multiple movements of data throughout your platform, and cobbled-together infrastructure that has grown in complexity with every deployment make it hard to identify, trace, classify, and understand your data assets. The symptom can be as simple as an analyst spending hours trying to figure out where a data attribute in a table came from and whether it is trustworthy.

How to do data transformation in your ETL process?

Working with raw or unprocessed data often leads to poor decision-making. This explains why data scientists, engineers, and other analytic professionals spend over 80% of their time finding, cleaning, and organizing data. Accordingly, the ETL process - the foundation of all data pipelines - devotes an entire stage, the T, to transformations: the act of cleaning, molding, and reshaping data into a valuable format.

CNC: The journey from Excel spreadsheets to automated data pipelines and fast, reliable insights

Founded in 1991, CNC (Czech News Center) is one of the largest media companies in the Czech Republic. They offer dozens of print and online publications to the Czech market, including Blesk, Aha!, and E15. A commitment to journalistic integrity has enabled their growth, and they now reach millions of readers. They are currently undergoing a vast digitalization process with the aim of becoming the fastest-growing and largest media house in the Czech Republic.

Get the most out of Shopify Analytics

Running an eCommerce store is very much like flying a plane - you can reach unprecedented heights, but you won't be able to do it blindfolded. You have to see where you are going to touch the skies. eCommerce analytics gives you the guidance to make the right choice and scale your online store to new heights. In this article, we will take a deep dive into Shopify Analytics. Shopify offers analytics as an out-of-the-box default service to all Shopify store owners and admins.

Ecommerce analytics 101: The ultimate guide

To grow your eCommerce business ahead of your competitors, you need to rely on analytics. eCommerce analytics is the compass that replaces your gut feelings as you scale your e-shop to higher ground and more online sales. In this ultimate guide to eCommerce analytics, we will look at: What is eCommerce analytics? Why is eCommerce analytics crucial for the success of your store? What are the best metrics and KPIs to track for eCommerce?

Looking for an ETL tool? Stop. Right. Here.

You have started your data journey. You know you need to somehow collect data from various sources and land it in a data warehouse or data lake of some sort. Right now you’re browsing tools and calculating costs - there’s one for extraction, another one for transformations, there’s an ETL tool. What if we told you there’s a better way?

Get control over your data pipelines with data orchestration

Enterprises are tapping and leveraging big data to get ahead of the competition. As Peter Sondergaard, ex-Executive Vice President at Gartner, said: "Information is the oil of the 21st century, and analytics is the combustion engine." The problem with the combustion engine is that it does not scale well. As companies grow, the data platforms they previously relied on for analytics start to break apart.

Product announcement: Say hello to the new and improved Storage UI

Keboola’s Storage UI now comes with a new, slicker look that will improve the user experience for all Keboola veterans. (New to Keboola? Do not fear. Simply follow along with the guided tour when you sign up for free, and you can unlock all the new features after June 2nd.)

How to get started with data lineage

Modern enterprises leverage over 400 data sources to stay ahead of the competition. The sheer volume and complexity of data operations raise a central challenge for intrepid organizations: How to cut the complexity of data operations? Enterprises turn to data lineage for the answer. Data lineage is the process of recording and visualizing data assets as they flow through your system.