Latest News

Spark Troubleshooting Solutions - DataOps, Spark UI or logs, Platform or APM Tools

Spark is known for being extremely difficult to debug. But this is not all Spark’s fault. Problems in running a Spark job can be the result of problems with the infrastructure Spark is running on, inappropriate configuration of Spark, Spark issues, the currently running Spark job, other Spark jobs running at the same time – or interactions among these layers.

Optimizing your BigQuery incremental data ingestion pipelines

When you build a data warehouse, a key question is how to ingest data from the source system into the warehouse. If a table is small, you can fully reload it on a regular basis; if it is large, a common technique is to perform incremental table updates. This post demonstrates how you can enhance incremental pipeline performance when you ingest data into BigQuery.

Supporting Transformation with an Integrated Data Platform. Three Common Questions Answered.

In recent years there has been increased interest in how to safely and efficiently extend enterprise data platforms and workloads into the cloud. CDOs are under increasing pressure to reduce costs by moving data and workloads to the cloud, similar to what has happened with business applications during the last decade. Our upcoming webinar is centered on how an integrated data platform supports the data strategy and goals of becoming a data-driven company.

Early-stage growth: Why shifting the founder mindset is critical to acquiring your first 10 customers

Growth. It’s the mountain every startup founder must learn to climb in order to run a successful business. And as with any great mountain, the journey to the top never feels more daunting than at the base. How your startup earns its first 10 customers will set the tone for the rest of the trek and determine how fast your team reaches the summit — if at all.

The role of a CDO with Cosmo, Chief Destiny Officer

Have you ever wished you had a crystal ball? We tracked down a CDO who actually uses one. See, Cosmo, CDO is not a Chief Data Officer — he’s a Chief Destiny Officer. We’re all about data at Talend, but sometimes it’s good to see things from another perspective. We sat down with Cosmo to ask him about his job, his background, and his methods.

Spectacular growth: Beaumotica accelerates expansion with data-driven insights from Talend

Beaumotica combines smart lighting, design, and top brands to create the perfect mood and atmosphere for any room. And with help from Talend, the company can now combine data, analytics, and automation to optimize business decisions and accelerate growth. Last year alone the company tripled its business and expanded into new territories across Europe. Based in The Netherlands, Beaumotica has been growing steadily since 2007.

With Stitch, Simba is losing no sleep over aggressive growth plans

“If we didn’t have Stitch, we would have to recruit and hire data engineers, buy space for hundreds of millions of rows that we’re sinking into the database, and on and on. For us, Stitch is essential.” –Tomasz Eitner, BI and Data Analyst, Simba Sleep

Simba Sleep has always been a data-driven company. Before the firm was even formally launched, the founders purchased research profiles from more than 10 million sleepers, including 180 million body profile data points.

Our reflections on the 2021 Gartner Magic Quadrant for Data Integration Tools

“The data integration tool market is seeing renewed momentum, driven by requirements for hybrid and multi-cloud data integration, augmented data management, and data fabric designs.” This is what Gartner assesses in its latest Magic Quadrant for Data Integration Tools report. And that assessment makes perfect sense. Data is the lifeblood of an organization.

Optimizing Cloudera Data Engineering Autoscaling Performance

The shift to the cloud has been accelerating, and with it, a push to modernize the data pipelines that fuel key applications. That is why cloud-native solutions that take advantage of capabilities such as disaggregated storage and compute, elasticity, and containerization are more important than ever. At Cloudera, we introduced Cloudera Data Engineering (CDE) as part of our Enterprise Data Cloud product, Cloudera Data Platform (CDP), to meet these challenges.