Troubleshooting Apache Spark

Sep 9, 2021 By Unravel In Unravel

Apache Spark is the leading technology for big data processing, on-premises and in the cloud. Spark powers advanced analytics, AI, machine learning, and more. Spark provides a unified infrastructure for all kinds of professionals to work together to achieve outstanding results.

Spark Troubleshooting Solutions - DataOps, Spark UI or logs, Platform or APM Tools

Sep 9, 2021 By Floyd Smith In Unravel

Spark is known for being extremely difficult to debug. But this is not all Spark’s fault. A failing Spark job can stem from the infrastructure Spark is running on, inappropriate Spark configuration, issues in Spark itself, the currently running Spark job, other Spark jobs running at the same time, or interactions among these layers.

Unravel

Sep 5, 2021

Unravel helps you monitor, manage, and improve your data pipelines in the cloud and on-premises – to drive more reliable performance in the applications that power your business.

Migrating Data Pipelines from Enterprise Schedulers to Airflow

Sep 2, 2021 By Sara Petrie In Unravel

At Airflow Summit 2021, Unravel’s co-founder and CTO, Shivnath Babu, and Senior Software Engineer Hari Nyer delivered a talk titled “Lessons Learned while Migrating Data Pipelines from Enterprise Schedulers to Airflow.” This story, along with the slides and videos included in it, comes from the presentation.

Accelerate Amazon EMR for Spark & More

Aug 19, 2021 By Unravel In Unravel

Amazon EMR is growing in popularity, and is emerging as the leading platform for big data processing on AWS. EMR is the preferred platform for “lift and shift” migration of existing Hadoop and Spark workloads to the cloud, with minimal refactoring. You get better control, enhanced flexibility, and greater responsiveness.

Data Pipeline HealthCheck

Aug 17, 2021 By Sara Petrie In Unravel

At Airflow Summit 2021, Unravel’s co-founder and CTO, Shivnath Babu, led a talk titled “Data Pipeline HealthCheck for Correctness, Performance & Cost Efficiency.” This story, along with the slides and videos included in it, comes from the presentation.

Driving Data Governance and Data Products at ING Bank France

Aug 12, 2021 By Sara Petrie In Unravel

In this episode of Data+AI Battlescars, Sandeep Uttamchandani, Unravel Data’s CDO, speaks with Samir Boualla, CDO at ING Bank France, one of the largest banks in the world. They cover his battlescars in driving data governance across business teams and building data products. As Chief Data Officer, Samir is responsible for several teams that govern, develop, and manage data infrastructure and data assets to deliver value to the business.

Spark Troubleshooting, Part 1 - Ten Challenges

Aug 6, 2021 By Floyd Smith In Unravel

“The most difficult thing is finding out why your job is failing, which parameters to change. Most of the time, it’s OOM errors…” Jagat Singh, Quora

Spark has become one of the most important tools for processing data – especially non-relational data – and deriving value from it. And Spark serves as a platform for the creation and delivery of analytics, AI, and machine learning applications, among others.

Simplifying Data Management at LinkedIn Part 2

Jun 18, 2021 By Sara Petrie In Unravel

In the second of this two-part episode of Data+AI Battlescars, Sandeep Uttamchandani, Unravel Data’s CDO, speaks with Kapil Surlaker, VP of Engineering and Head of Data at LinkedIn. In part one, they covered LinkedIn’s challenges related to metadata management and data access APIs. This second part dives deep into data quality.

Simplifying Data Management at LinkedIn Part 1

Jun 17, 2021 By Sara Petrie In Unravel

In the first of this two-part episode of Data+AI Battlescars, Sandeep Uttamchandani, Unravel Data’s CDO, speaks with Kapil Surlaker, VP of Engineering and Head of Data at LinkedIn. In this first part, they cover LinkedIn’s challenges related to metadata management and data access APIs. Part two will dive deep into data quality.
