Analytics

Multi-Raft - Boost up write performance for Apache Hadoop-Ozone

Apache Hadoop-Ozone is a new-era object storage solution for Big Data platforms. It is scalable and strongly consistent. Ozone uses the Raft protocol, implemented by Apache Ratis (Incubating), to achieve high availability in its distributed system. My team at Tencent started introducing Ozone as a backend object store in production a few months ago, and we're onboarding more and more data warehouse users.

Speed Up Development With Powered by Fivetran

Powered by Fivetran (PBF) provides a simple framework for developers to go beyond internal analytics projects to build data pipelines into their applications within the Fivetran platform. With no engineering overhead, you can easily access hundreds of customer accounts across countless Fivetran-supported data sources, including advertising platforms, CRM systems, databases, web events and more.

The Rise Of Connected Manufacturing And How Data Is Driving Innovation, Part I

This interview was conducted by Cindy Maike, VP Industry Solutions. The shift towards Industry 4.0 is improving manufacturing efficiency, and the factory of the future will increasingly be driven by technologies like the Internet of Things (IoT), automation, artificial intelligence (AI), and cloud computing.

MLRun Functions DEMO: Python Jupyter (Open-Source Data Science Orchestration + Experiment Tracking)

MLRun is a generic and convenient mechanism for data scientists and software developers to build, run, and monitor machine learning (ML) tasks and pipelines on a scalable cluster while automatically tracking executed code, metadata, inputs, and outputs. Deployment options: on-premises or bare metal, including edge AI and analytics. Customers include NetApp, Quadient, Payoneer (and many more).

Git-based CI / CD for Machine Learning & MLOps

For decades, machine learning engineers have struggled to manage and automate ML pipelines in order to speed up model deployment in real business applications. Similar to how software developers leverage DevOps to increase efficiency and speed up release velocity, MLOps streamlines the ML development lifecycle by delivering automation, enabling collaboration across ML teams and improving the quality of ML models in production while addressing business requirements.

Auto-TLS in Cloudera Data Platform Data Center

Wire encryption protects data in motion, and Transport Layer Security (TLS) is the most widely used security protocol for wire encryption. TLS provides authentication, privacy and data integrity between applications communicating over a network by encrypting the packets transmitted between endpoints. Users interact with Hadoop clusters via browser or command line tools, while applications use REST APIs or Thrift.

Using Your Existing API to Become a Snowflake Data Marketplace Provider, Part 1

Many data providers who participate in Snowflake Data Marketplace are already using Snowflake Cloud Data Platform as their primary data store, and they can share secure slices of their data via Global Snowflake, Snowflake’s global data sharing feature, with any other Snowflake consumer regardless of which cloud or Snowflake region each is using. But other potential data providers, especially data enrichment companies, are not yet using Snowflake themselves.