Latest News

How to Use Apache Iceberg in CDP's Open Lakehouse

In June 2022, Cloudera announced the general availability of Apache Iceberg in the Cloudera Data Platform (CDP). Iceberg is a 100% open table format, developed through the Apache Software Foundation, which helps users avoid vendor lock-in and implement an open lakehouse. The general availability covers Iceberg running within some of the key data services in CDP, including Cloudera Data Warehouse (CDW), Cloudera Data Engineering (CDE), and Cloudera Machine Learning (CML).

What is the Impact of Using a Business Analytics Dashboard?

It's common knowledge today that business intelligence (BI) dashboards are powerful tools that can help you track and analyze your business performance, identify trends and patterns, and make better decisions based on data, not gut feeling. There are several benefits to using a business intelligence dashboard, and it can have a big impact on your business.

How to Choose the Best Data Visualization for Your Reporting

Data visualization involves visually imparting information to communicate complex data. Graphics such as charts help simplify data so more people can understand the insights embedded within a dashboard or report. Businesses use data visualizations to find and highlight patterns and trends in big data sets, with many data visualization examples available today, including graphs, maps and plots to help others grasp data conveniently.

Sustainability Reporting: A Modern Finance Imperative

ESG reporting is rapidly becoming a key focus area for finance teams around the world. ESG stands for “environmental, social, and governance.” It’s a set of standards through which companies can report metrics that indicate how well their activities align with environmental stewardship and social issues. In late 2021, the International Accounting Standards Board (IASB) announced the creation of a new ESG reporting standard.

Zero-ETL approach to analytics on Bigtable data using BigQuery

Modern businesses are increasingly relying on real-time insights to stay ahead of their competition. Whether it's to expedite human decision-making or fully automate decisions, such insights require the ability to run hybrid transactional analytical workloads that often involve multiple data sources. BigQuery is Google Cloud’s serverless, multi-cloud data warehouse that simplifies analytics by bringing together data from multiple sources.

Applying Fine Grained Security to Apache Spark

Apache Spark, with its rich data APIs, has been the processing engine of choice in a wide range of applications, from data engineering to machine learning, but its security integration has been a pain point. Many enterprise customers need finer granularity of control, in particular at the column and row level (commonly known as Fine Grained Access Control or FGAC).

The Human Side of the Equation - How Stories Bring Data to Life (Part 2)

As data science has taken center stage in a lot of organizations, many are relearning what they’ve already known – that dry, mathematical calculations don’t inspire and don’t stick. It’s the story that matters. In this second of a two-part blog series, we look at some best practices for data storytelling and how Qlik analytics can help.

Balancing data sharing and compliance is hard - but it doesn't have to be

If there is a single most delicate aspect to the balance of data sharing and compliance, it lies in the process of creating a single source of truth. This project involves many departments across the company: sales, customer support, and of course, IT. The more stakeholders are involved, the more the project's complexity rises, since each party brings its own objectives.