Cypress: a distributed transactional file system in YTsaurus

Photo
Grigory Reznikov

YTsaurus

About speaker

I am a software engineer interested in distributed storage systems and efficient data processing. For the last four years, I’ve been lucky to be a part of YTsaurus team working on metadata storage, distributed transactions, and multiple other things.

About speakers's company

YTsaurus is a data storage and processing system that became open-sourced recently.

4 July, 11:10, «Hall 3»

Abstracts

YTsaurus is a data storage and processing system that recently became open-sourced. This system includes storage for huge-sized tables, efficient data processing engines allowing execution of ad-hoc analytical queries as well as building of data processing pipelines, and also efficient OLTP key-value storage. As a result of 12 years of development by experienced developers, YTsaurus proved itself to be a reliable, efficient and convenient way to manage data for different purposes. YTsaurus scales well, allows to store exabytes of data and processes them using millions of CPUs.

Cypress is a distributed file system used in the core of YTsaurus. In this talk, we will discuss the Cypress interface and its functionality, as well as demonstrate how the Cypress features can be used to simplify working with data.

We will pay special attention to transactions and discuss how transactions allow using Cypress as a distributed coordination system.

The talk was accepted to the conference program