Systems | Development | Analytics | API | Testing

December 2022

From Data Warehouse to Lakehouse

This is a guest post for Integrate.io written by Bill Inmon, an American computer scientist recognized as the "father of the data warehouse." Inmon wrote the first book and first magazine column about data warehousing, held the first conference about this topic, and was the first person to teach data warehousing classes.

Top 6 Airbyte Alternatives

The data-driven culture cultivated in modern-day organizations is focused on deriving the best possible business insights from their data. With data scattered across the globe, these organizations' most significant challenge is to break the silos of their decentralized data and gather new data for analysis in real-time. To address the data silo problem, data engineering brought forward solutions like ETL, ELT, and data integration tools.

What Is Data Observability in a Data Pipeline?

The five things you need to know about data observability in a data pipeline are: Becoming a data-driven organization is a vital goal for businesses of all sizes and industries—but this is easier said than done. Too many companies fail to attain the fundamental principle of data observability: knowing the existence and status of all the enterprise data at their fingertips.

Seven Benefits of Investing in Cross-Functional Data Projects

Nowadays most people in organizations understand how visibility into data adds overall value and there is a general dedication to be and remain data driven and increase overall data literacy. At the same time, sometimes there are limitations to how much organizations want to invest or augment their investment in data projects. It's important to make sure that companies have support across departments to budget appropriately for their data needs.

Fulfilling End-User Expectations When Building New Computer Systems

This is a guest post for Integrate.io written by Bill Inmon, an American computer scientist recognized as the "father of the data warehouse." Inmon wrote the first book and first magazine column about data warehousing, held the first conference about this topic, and was the first person to teach data warehousing classes.

Is Data Integration a Critical Element in Data Analytics?

The five reasons why data integration is a critical element for data analytics are: According to research by IDC and Tableau, 83 percent of CEOs say that they want their company to be “more data-driven.” The study finds that data-driven organizations have observed many positive impacts, from faster time to market to more new customers. Of course, becoming a truly data-driven company is easier said than done—and data analytics is the way to do it.

Making Data Actionable

Too many data teams focus on data movement and creating data pipelines without aligning those activities to business outcomes. Data teams are meant to make data flows work so much of their focus involves managing data connections across the data ecosystem. Pulling data from Salesforce or ensuring reverse ETL from a data warehouse helps support successful data movement and overall data pipeline development. It doesn't always align to a more effective supply chain or cost savings.

Data Warehousing and Data Mesh: Different Types of Goals

The world is full of different types of goals. Consider football. The goal in football is at the end of the field. A runner either crosses the goal or they don't, when trying to make a touchdown. Or, consider basketball. In basketball, when a player shoots the ball, the player’s shot either goes through the net or it doesn’t. Alternatively, consider ice hockey. When a hockey player shoots the puck, it either goes into the net — or it doesn’t.

All the Features A Robust Data Lake Should Have

From databases to data warehouses and, finally, to data lakes, the data landscape is changing rapidly as volumes and sources of data increase. With a growth projection of almost 30%, the data lake market will grow from USD 3.74 billion in 2020 to USD 17.6 billion by 2026. Also, from the 2022 Data and AI Summit, it is clear that data lake architecture is the future of data management and governance.

Key criteria in software selection - Support

I used to work with organizations to support their software evaluations, selections, and implementations. What regularly started as a desire for greater visibility into data, generally turned into a need for better management of data across their ecosystems. Basically, organizations cannot look at individual data and analytics projects without evaluating their needs in a cohesive manner. The reality is that selecting a new set of tools for data optimization requires a lot of effort.

Defining a Technical Architecture

When doing long-term planning for an organization, it is really helpful to start with a statement of what the architecture is and what that architecture will encompass. The architectural definition of the technology that will be used will serve as a long-term guide for making technical decisions. In addition, a properly built architecture serves as an instrument of focus, direction, and prioritization for the organization.