
August 2023

Deploying an LLM ChatBot Augmented with Enterprise Data

The release of ChatGPT pushed interest in, and expectations of, Large Language Model (LLM) based use cases to record heights. Every company is looking to experiment with, qualify, and eventually release LLM-based services to improve its internal operations and to level up its interactions with users and customers. At Cloudera, we have been working with our customers to help them benefit from this new wave of innovation.

The Art of Data Leadership | A discussion with Synchrony's Head of Provisioning, Ram Karnati

Did you know there are 1.4 million open tech jobs, but global educational institutions only produce 400,000 qualified individuals annually to fill them? In our third episode of The Art of Data Leadership, Synchrony’s Ram Karnati argues that the answer lies with #AI: “Day-to-day coding is going to get easier; AI is going to take care of it. So, the next phase of talent transformation will not be focused on being the best coder out there. People will start to look for generalists who also understand business.”

Breaking the ice around Apache Iceberg to drive next-gen analytics

Watch the Apache Iceberg meetup presented by Cloudera and LinkedIn. Learn how you can improve your BI analytical performance with Iceberg. See how easy it is to adopt Iceberg without any disruption to your business. Learn from LinkedIn how they have deployed DBT, Coral, and Iceberg to boost SQL workflows.

How to Ensure Supply Chain Security for AI Applications

Machine Learning (ML) is at the heart of the boom in AI applications, revolutionizing various domains. From powering intelligent Large Language Model (LLM) based chatbots like ChatGPT and Bard, to enabling text-to-image AI generators like Stable Diffusion, ML continues to drive innovation. Its transformative impact advances multiple fields, from genetics to medicine to finance. Without exaggeration, ML has the potential to profoundly change lives, if it hasn’t already.

Streaming Data Pipeline Development

In this interactive Meetup session, Tim will lead participants through how to best build streaming data pipelines. He will cover how to build applications from some common use cases and highlight tips, tricks, best practices and patterns. He will show how to build the easy way and then dive deep into the underlying open source technologies, including Apache NiFi, Apache Flink, Apache Kafka and Apache Iceberg.

HDFS Snapshot Best Practices

The snapshots feature of the Apache Hadoop Distributed Filesystem (HDFS) enables you to capture point-in-time copies of the file system and protect your important data against corruption and against user or application errors. This feature is available in all versions of Cloudera Data Platform (CDP), Cloudera Distribution for Hadoop (CDH) and Hortonworks Data Platform (HDP).