Data Management Platform over YTsaurus: The Path from User to Contributor

Photo
Maxim Pchelin

Nebius

About speaker

Product manager of Analytical Platform with over 10 years of experience in BI-DWH development. Has worked in market leading companies in FinTech, consulting, GameDev, and IT spheres. Advanced from being a BI developer to the head of BI, DWH, and DevOps teams. After that, focused on the development of data services as a business product. Developed the product side of DMP services for DWH, BI, and analysts in Yandex GO. Currently, leads the product stream of analytical platform at Nebius.

About speakers's company

We are a startup with offices in the Netherlands, Serbia and Israel. Our ambition is to create a world-class ecosystem of full-fledged cloud and ML-driven solutions for the B2B market. Our team has experience in building data centers and supercomputers, developing ML-technologies with millions of daily users, and launching full-stack public cloud platforms.

4 July, 14:40, «Hall 3»

Abstracts

What do you do when your business wants a fast, reliable and convenient data management platform (DMP) for DWH without spending money on paid solutions? How do you build a high-quality system for processing petabytes of data using open-source tools and modify them if necessary?

In the talk, we will share our experiences in creating DMP that ensures day-to-day operation of DWH for Yandex Taxi, Eats, Grocery, Delivery, and others. We will present our technology stack, primarily based on YTsaurus ― the brand new open-source alternative to Hadoop. We will tell you how we built our platform over it, when we used other tools (like ClickHouse or Greenplum), and how we made improvements to YTsaurus.

Photo
Vladimir Verstov

Yandex Go

About speaker

DMP (data management platform) head of development at Yandex GO. Over 12 years of experience in IT. At university was interested in parallel and distributed computing, developed his own computer-aided design system, and has a Ph.D. in two subjects. 5 years of experience in enterprise development in consulting. Has progressed from system analyst to Team&Tech Lead. For the last 6 years, has been developing DWH services for Yandex GO.

About speakers's company

Yandex Go is a superapp provided by a group of companies that operates mobility and delivery businesses in more than 20 countries. We use a stack of our own technologies: mapping, routing, and navigation as well as smart order distribution technologies based on machine learning. The app has been available in Serbia since 2018.

4 July, 14:40, «Hall 3»

Abstracts

What do you do when your business wants a fast, reliable and convenient data management platform (DMP) for DWH without spending money on paid solutions? How do you build a high-quality system for processing petabytes of data using open-source tools and modify them if necessary?

In the talk, we will share our experiences in creating DMP that ensures day-to-day operation of DWH for Yandex Taxi, Eats, Grocery, Delivery, and others. We will present our technology stack, primarily based on YTsaurus ― the brand new open-source alternative to Hadoop. We will tell you how we built our platform over it, when we used other tools (like ClickHouse or Greenplum), and how we made improvements to YTsaurus.

The talk was accepted to the conference program