Analytics

AWS, Qlik, and SAP Data: Turning the Lifeblood of Business into Value and Action

One of my favorite analogies is that data is the lifeblood of the business. Before you roll your eyes at me (I see it now), hear me out. At your annual physical, when you get your blood work done, think of how much information is uncovered about your overall health from a tiny vial of your blood. From those 10 CCs they extract comes back pages of information regarding your cell counts, glucose, cholesterol, and other information.

Apache Kafka Message Compression

Apache Kafka® supports incredibly high throughput. It’s been known for feats like supporting 20 million orders per hour to get COVID tests out to US citizens during the pandemic. Kafka's approach to partitioning topics helps achieve this level of scalability. Topic partitions are the main "unit of parallelism" in Kafka. What’s a unit of parallelism? It’s like having multiple cashiers in the same store instead of one.

Technology Spotlight: Open Data Lakehouse for Private Cloud

Cloudera emphasizes the importance of trusted data for reliable AI. We've introduced Open Data Lakehouse for private cloud, incorporating Apache Iceberg for enhanced data management and security. This empowers analysts and data scientists with direct access to all data, including real-time streaming. Iceberg's capabilities reduce silos, lower storage costs, and mitigate business risks. We also focus on scalability, introducing features like snapshots and user quotas in Apache Ozone. Cloudera prioritizes enterprise readiness with Zero Downtime Upgrades and broader hardware and software support.