27 subscribers
התחל במצב לא מקוון עם האפליקציה Player FM !
One Click to Run Apache Spark as a Service on Kubernetes (DoK Day EU 2022) // Bo Yang
Manage episode 329944570 series 2865115
https://go.dok.community/slack
From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE)
It is still challenging to run Apache Spark and other big data processing workload on Kubernetes, especially in large scale. People need to address various issues like resource isolation, queuing, and cost efficiency. This session will share details about those challenges and how to address them. We will also present a convenient (one-click) way to deploy Apache Spark on Kubernetes, and dramatically lower the barrier to use Spark.
Bo has been working in big data area for 10 years, and gained intensive experience of running Apache Spark in large scale while he worked in in Uber. In the Kubernetes era, he focuses on building world class Apache Spark Service, and making the tooling available to the Spark and Kubernetes communities.
243 פרקים
Manage episode 329944570 series 2865115
https://go.dok.community/slack
From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE)
It is still challenging to run Apache Spark and other big data processing workload on Kubernetes, especially in large scale. People need to address various issues like resource isolation, queuing, and cost efficiency. This session will share details about those challenges and how to address them. We will also present a convenient (one-click) way to deploy Apache Spark on Kubernetes, and dramatically lower the barrier to use Spark.
Bo has been working in big data area for 10 years, and gained intensive experience of running Apache Spark in large scale while he worked in in Uber. In the Kubernetes era, he focuses on building world class Apache Spark Service, and making the tooling available to the Spark and Kubernetes communities.
243 פרקים
כל הפרקים
×
1 Implementing Data & Databases on K8s within the Dutch Government | DoKC Town Hall 44:54

1 Unsticking Ourselves from Glue: Migrating PayIt’s Data Pipelines to Argo Workflows and Hera | DoKC Town Hall 23:17

1 Repel Boarders! How to find a Kubernetes operator that really protects your data | DoKC Town Hall 19:22

1 DoK + Apache Spark | DoKC Town Hall 19:52

1 DoK @ Comcast - Deliver Business Outcomes & Improved DevX with Data Services on K8s | DoKC Town Hall 16:43

1 DoK Talks - What is Kafka? The rise of one of the world's most used streaming data technologies // Abbey Russell 15:28

1 DoK Talks - (almost)Everything you need to know about stateful cloud native network applications // W Watson 43:39

1 The Outer Nerd #001 - Dungeons & Dragons - Why should you care? // Abhi Vaidyanatha, Fabian Met & Chase Christensen 58:25

1 DoK Talks #155 - Databases at the edge with K3s and ARM devices // Sergio Méndez 49:40

1 DoK Talks #154 - StatefulSets in K8 // Srinivas Karnati 31:55

1 Data-driven Diversity, Equity, and Inclusion // Lisa-Marie Namphy, Melissa Logan, Tiffany Jachja, Audra Montenegro & Cortney Nickerson (DoK Day North America 2022) 19:50

1 Formula 1 telemetry processing using Apache Kafka on Kubernetes // Paolo Patierno (DoK Day North America 2022) 15:36

1 Choosing Kubernetes for Stateful Applications // Akshay Ram & Peter Schuurman (DoK Day North America 2022) 18:31

1 Kubernetes 360º - Data driven observability - from Secrets to logs // Ben Hirschberg (DoK Day North America 2022) 17:11

1 Shifting Left Stateful Applications In Kubernetes // Viktor Farcic (DoK Day North America 2022) 15:52

1 Medical - Healthcare Data on Kubernetes // Olyvia Rakshit & Prasad Dorbala (DoK Day North America 2022) 13:41

1 Highly Available Postgres Clusters In Kubernetes // John Long & Jonathan Gonzalez (DoK Day North America 2022) 15:04

1 Inter-Cluster PostreSQL on Kubernetes // Julian Fischer (DoK Day North America 2022) 17:07

1 Open Source Databases on Kubernetes- Best Practices // Peter Zaitsev (DoK Day North America 2022) 16:04

1 The Kubernetes Native Database // Jeffrey Carpenter (DoK Day North America 2022) 16:26

1 Databases on Kubernetes: Why are they important? // With Bhavin Shah, Xing Yang, Gabriele Bartolini & Patrick McFadin (DoK Day North America 2022) 34:51

1 Data streaming on Kubernetes // Yaniv Ben Hemo (DoK Day North America 2022) 13:51

1 Architecting Your First Event Driven Serverless Streaming Applications on K8 // Timothy Spann (DoK Day North America 2022) 13:29

1 Fybrik - A Kubernetes based platform for governed data use // Flora Gilboa-Solomon, Alexey Roytman, Maryna Strelchuk & Barry Hijkoop (DoK Day North America 2022) 20:59

1 The Challenges of Data Processing On Kubernetes - A look at Spark, Flink, Dask, and Ray // Holden Karau (DoK Day North America 2022) 20:09

1 Scaling our SaaS offering to thousands of clusters // Dax McDonald (DoK Day North America 2022) 21:04

1 Why we decided to migrate our Jaeger storage to ClickHouse on Kubernetes // Arul Jegadish Francis (DoK Day North America 2022) 13:48

1 Building a Digital Factory for the Sheet Metal Industry // Elie Assi (From the DoK Day North America 2022) 20:48

1 How we built our Big Data Stack (almost) entirely on top of Kubernetes // Neylson Crepalde (From DoK Day NA 2022) 16:00

1 Dok Talks #153 - CRD Panel // Eyar Zilberman & Álvaro Hernández 58:05

1 Dok #152-Running PostgreSQL in Kubernetes:from day 0 to day 2 with CloudNativePG // Gabriele Bartolini 1:03:50

1 Dok Talks #148 - Cost and Kubernetes // Chris Love 45:25

1 Dok Talks #151 - Analytics with Apache Superset and ClickHouse // Vijay Anand Ramakrishnan 33:00

1 Dok Talks #150 - Building a Simple Postgres Async Streaming Cluster // Julian Fischer 1:04:45

1 DoK Talks #149 - Overcoming challenges with protecting and migrating data in multi-cloud K8s environments // Sebastian Glab & Martin Phan 47:40

1 DoK Talks #147 - Evaluating Cloud Native Storage Vendors // Dinesh Majrekar 1:00:03

1 Dok Talks #146 - OpenFeature - Making feature flags a commodity // Oleg Nenashev 1:01:30

1 DoK Talks #145 - Making Hard Things Easy is Hard // Kurt Rinehart 57:40

1 DoK Talks #144 - We will Dok You! - The journey to adopt stateful workloads on k8s // Guy Menahem 1:06:30

1 DoK Talks #142 - Kubernetes Cluster Upgrade Strategies and Data: Best Practices for your Stateful Workload // Peter Schuurman 58:45

1 DoK Talks #144 - Mastering MongoDB on Kubernetes, the power of operators // Arek Borucki 1:00:50

1 DoK Specials - Why are Operators paramount to running stateful workloads on Kubernetes? 53:36

1 DoK Talks #141 - Dossier: multi-tenant distributed Jupyter Notebooks // Iacoppo Colonnelli & Dario Tranchitella 1:00:10

1 DoK Talks #140 - Data protection of stateful environment // Timothy Dewin 42:35

1 DoK Talks #139 - Private DBaaS on Kubernetes // Sergey Pronin 53:25
ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.