Big Data Platform at Pinterest: Now and in the Future (talk given in English)

Track: Big Data Platform Architecture in Practice

Speaker: Yongsheng Wu (武永胜) | Head of Big Data and Machine Learning Platform, Pinterest

Room: 爱晚亭 (Aiwan Pavilion)

About the Speaker

Track speaker: Yongsheng Wu (武永胜)

Pinterest Head of Big Data and Machine Learning Platform

Yongsheng Wu leads the Big Data and Machine Learning Platform team at Pinterest, which provides a unified big data and machine learning platform that enables engineers to derive trustworthy, actionable insights and apply ML to complex problems with ease and confidence. Before that, Yongsheng was an early engineer on the infrastructure team, helping scale Pinterest from ~10M MAUs to 200+M MAUs. He also led the teams that spearheaded Pinterest's transition to a microservices-based architecture, moved search from a Lucene/Solr-based system to a highly scalable, efficient, performant and extensible home-grown search infrastructure, and offered asynchronous job processing, distributed caching, storage and serving systems as services to all engineers at Pinterest.

Before Pinterest, Yongsheng worked at Twitter, Salesforce, Seven Networks and Oracle. He holds a Master's degree in Computer Science from Stanford and is an alumnus of USTC in China.

About the Talk

Talk: Big Data Platform at Pinterest - Now and in the Future (talk given in English)

The big data platform at Pinterest is a public-cloud-based platform operating at massive scale (100+ PB of data, hundreds of billions of new events per day, ~PB of new data ingested per day), focused on empowering all engineers.

In this talk, Yongsheng will take a deep dive into the current technology landscape of the big data platform at Pinterest, covering data ingestion (real-time events, logging, database snapshots, incremental database dumps), batch and streaming data processing platforms (Hadoop, Spark, Flink, Kafka Streams), query platforms (Hive, Presto, Spark SQL), and Pinterest's homegrown workflow engine. He will also offer insights into how these technologies will evolve within Pinterest's ecosystem in the future. Beyond the key technologies powering the platform, Yongsheng will cover what enables Pinterest engineering to encourage ownership and accountability, improve platform efficiency, and free the platform team from being overwhelmed by operations and support so that it can focus on advancing the platform.

You will leave this talk with insight into how the big data ecosystem works at Pinterest, and inspired to re-envision your own big data platform.

