Systems | Development | Analytics | API | Testing

Building a Machine Learning Application With Cloudera Data Science Workbench And Operational Database, Part 2: Querying/ Loading Data

In this installment, we’ll discuss how to do Get/Scan Operations and utilize PySpark SQL. Afterward, we’ll talk about Bulk Operations and then some troubleshooting errors you may come across while trying this yourself. Read the first blog here. Get/Scan Operations In this example, let’s load the table ‘tblEmployee’ that we made in the “Put Operations” in Part 1. I used the same exact catalog in order to load the table. Executing table.show() will give you:

Digital Transformation: What Does It Mean for a CIO?

As organisations adopt increasingly complex digital transformation initiatives, the need for strong organisational leadership from the CIO (Chief Information Officer) has never been more apparent. In particular, the CIO plays a key role in setting the digital transformation agenda for the organisation, sponsoring the delivery of component initiatives and ensuring senior stakeholder buy-in throughout the journey.

Apache NiFi - the data movement enabler in a hybrid cloud environment

Cloudera provides its customers with a set of consistent solutions running on-premises and in the cloud to ensure customers are successful in their data journey for all of their use cases, regardless of where they are deployed. Cloudera DataFlow provides Apache NiFi in both the Cloudera Data Platform Private Cloud Base (on-premises) and Public Cloud (AWS, Azure, and Google Cloud) products in this hybrid cloud strategy.

Peloton & Qlik: The Analytics of It All

Ok, I’ll admit it… I’m one of those people, I own a Peloton – and it’s awesome. But, as a data professional, I’ve struggled with getting decent metrics about how I’m doing and trying to see if I’m making progress with my fitness level. How can I discern performance stats to answer basic questions to gauge my performance over time?

What is a Good Example of Load Testing?

Whenever I get asked why do I need load testing or why is load testing so important, over and over again I come up with this simple example. Let’s say that you send an email campaign to 10,000 subscribers of your website/blog. What will happen to your website if 1000 subscribers open the email they’ve just received and the email is super interesting and they decide to click the action button in the email, link which will take them back to your website.

Best Performance Testing Tools

Implementing the best performance testing tools allows for an optimized end user experience and improved web performance. In order to execute accurate and effective performance testing, it is important for QA engineers to have access to the right set of tools. With the plethora of performance testing tools, it has become tedious to pick the right tool for your use case. Let’s explore our list of the best performance testing tools.

Enabling Self-Service Business Insights with Cloudera Data Warehouse

Requests to Central IT for data warehousing services can take weeks or months to deliver. Central IT teams at large organizations face a proliferation of IT projects arising from the complexities of markets and from the needs of internal lines of business (LoBs). At the same time, Central IT must juggle cost and risk.