Systems | Development | Analytics | API | Testing

January 2024

Streams Replication Manager Prefixless Replication

Replication is a crucial capability in distributed systems to address challenges related to fault tolerance, high availability, load balancing, scalability, data locality, network efficiency, and data durability. It forms a foundational element for building robust and reliable distributed architectures. It is also important to have multiple options (like normal and prefixless replication) to do the replication process, since every solution has its own advantages.

Achieving Trusted AI in Manufacturing

In the dynamic landscape of modern manufacturing, AI has emerged as a transformative differentiator, reshaping the industry for those seeking the competitive advantages of gained efficiency and innovation. As we navigate the fourth and fifth industrial revolution, AI technologies are catalyzing a paradigm shift in how products are designed, produced, and optimized.

Metadata Management and Data Governance with Cloudera SDX

In this article, we will walk you through the process of implementing fine grained access control for the data governance framework within the Cloudera platform. This will allow a data office to implement access policies over metadata management assets like tags or classifications, business glossaries, and data catalog entities, laying the foundation for comprehensive data access control.

Setting up and Getting Started with Cloudera's New SQL AI Assistant

As described in our recent blog post, an SQL AI Assistant has been integrated into Hue with the capability to leverage the power of large language models (LLMs) for a number of SQL tasks. It can help you to create, edit, optimize, fix, and succinctly summarize queries using natural language. This is a real game-changer for data analysts on all levels and will make SQL development faster, easier, and less error-prone.

Monitoring Cloudera DataFlow Deployments With Prometheus and Grafana

Cloudera DataFlow for the Public Cloud (CDF-PC) is a complete self-service streaming data capture and movement platform based on Apache NiFi. It allows developers to interactively design data flows in a drag and drop designer, which can be deployed as continuously running, auto-scaling flow deployments or event-driven serverless functions. CDF-PC comes with a monitoring dashboard out of the box for data flow health and performance monitoring.

What's new in Cloudera DataFlow 2.7: Change Data Capture with NiFi & Productivity improvements

Learn how the latest Cloudera DataFlow release enables Change Data Capture use cases and improves developer productivity with new features like deployment configuration export, flow version tagging and new monitoring capabilities.

AI and LLMs: How to navigate these technologies to build trusted AI

From customer success and fraud detection to process automation and code completion, Varun Jaitly, Santiago Giraldo, and Robert Hryniewicz discuss the AI and GenAI use cases that separate early enterprise leaders, and the key obstacles businesses across verticals must overcome for successful LLM development and deployment.