GEODIS Distribution & Express, a subsidiary of GEODIS, is the leader in France for reliable last-mile delivery service (deliveries within 24 to 48 hours). In 2020 alone, its 115 agencies handled 100 million parcels and carried out 5,000 rounds per day in more than 35 countries across Europe. Nathalie Mandjee, Business Intelligence Manager at GEODIS Distribution & Express, discovered that this part of the company was growing into a profit center for the larger business.
Data volumes are increasing exponentially with no sign of slowing. Experts predict that by 2025, the global volume of data will reach 181 zettabytes, more than four times the pre-COVID level of 2019. Data analysts at Centogene agree: “Every mouse click, keyboard button press, swipe or tap is used to shape business decisions. Everything is about data these days - data is information, and information is power.”
The dynamic and interconnected world of global ecommerce, cryptocurrencies, and alternative payments puts increased pressure on anti-financial-crime measures to keep pace and transform alongside these initiatives. Consumers worldwide are projected to use mobile devices to make more than 30.7 billion ecommerce transactions by 2026, a five-fold increase over the 6.1 billion predicted for 2022.
BigQuery is Google Cloud’s fully managed, serverless data platform that supports querying using ANSI SQL. BigQuery also has a data lake storage engine that unifies SQL queries with other open source processing frameworks such as Apache Spark, TensorFlow, and Dask. BigQuery storage provides an API layer that lets OSS engines process data, which makes it possible to combine programming languages such as Python with structured SQL on the same data platform.
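As a rough illustration of combining Python with SQL, the sketch below composes a query string in Python and hands it to the `google-cloud-bigquery` client library. It uses the standard query client rather than the storage API layer mentioned above. The table `my_project.my_dataset.events` and the column `event_type` are hypothetical names chosen for the example, and the code assumes default Google Cloud credentials are configured.

```python
def build_top_values_query(table: str, column: str, limit: int = 10) -> str:
    """Compose a SQL query that counts the most frequent values in a column.

    The identifiers are interpolated directly into the string, so this is
    only suitable for trusted, hard-coded table and column names.
    """
    if limit <= 0:
        raise ValueError("limit must be positive")
    return (
        f"SELECT {column}, COUNT(*) AS n\n"
        f"FROM `{table}`\n"
        f"GROUP BY {column}\n"
        f"ORDER BY n DESC\n"
        f"LIMIT {limit}"
    )


def run_query(sql: str) -> list:
    """Run the SQL in BigQuery and return the rows as plain dictionaries."""
    # Imported lazily so the query builder works without the client library.
    from google.cloud import bigquery  # pip install google-cloud-bigquery

    client = bigquery.Client()  # assumes default credentials and project
    rows = client.query(sql).result()  # blocks until the query job finishes
    return [dict(row.items()) for row in rows]


# Hypothetical table and column names, for illustration only.
sql = build_top_values_query("my_project.my_dataset.events", "event_type", limit=5)
print(sql)
```

Calling `run_query(sql)` would then execute the statement against a real project. It is left out of the script above because it needs network access and credentials.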
Apache Kafka is an open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. Administrators, developers, and data engineers who use Kafka clusters struggle to understand what is happening in their Kafka implementations.
In part 1 of this blog we discussed how Cloudera DataFlow for the Public Cloud (CDF-PC), the universal data distribution service powered by Apache NiFi, can make it easy to acquire data from wherever it originates and move it efficiently to make it available to other applications in a streaming fashion.