Player FM - Internet Radio Done Right
27 subscribers
Checked 1+ y ago
הוסף לפני four שנים
תוכן מסופק על ידי Data on Kubernetes Community. כל תוכן הפודקאסטים כולל פרקים, גרפיקה ותיאורי פודקאסטים מועלים ומסופקים ישירות על ידי Data on Kubernetes Community או שותף פלטפורמת הפודקאסט שלהם. אם אתה מאמין שמישהו משתמש ביצירה שלך המוגנת בזכויות יוצרים ללא רשותך, אתה יכול לעקוב אחר התהליך המתואר כאן https://he.player.fm/legal.
Player FM - אפליקציית פודקאסט
התחל במצב לא מקוון עם האפליקציה Player FM !
התחל במצב לא מקוון עם האפליקציה Player FM !
פודקאסטים ששווה להאזין
בחסות
Join us each week as celebrity guests pitch an idea for a film based on one of the SUPER niche sub-genres on Netflix. From ‘Steamy Crime Movies from the 1970s’ to ‘Australian Dysfunctional Family Comedies Starring A Strong Female Lead’, our celebrity guests will pitch their wacky plot, their dream cast, the marketing stunts, and everything in between. By the end of every episode, Jimmy Carr, Comedian by night / “Netflix Executive” by day, will decide whether the pitch is greenlit or condemned to development hell! New episodes on Wednesdays starting May 28th! Listen on all podcast platforms and watch on the Netflix is a Joke YouTube Channel . The Big Pitch is a co-production by Netflix and BBC Studios Audio.…
DoK Season 1 Extras - #1 - Is my data secure in K8s? // Asier Azaceta
Manage episode 283453641 series 2865115
תוכן מסופק על ידי Data on Kubernetes Community. כל תוכן הפודקאסטים כולל פרקים, גרפיקה ותיאורי פודקאסטים מועלים ומסופקים ישירות על ידי Data on Kubernetes Community או שותף פלטפורמת הפודקאסט שלהם. אם אתה מאמין שמישהו משתמש ביצירה שלך המוגנת בזכויות יוצרים ללא רשותך, אתה יכול לעקוב אחר התהליך המתואר כאן https://he.player.fm/legal.
Bart Farrell interviews Asier Azaceta, Cloud Security Architect in the IBM European Centre of Competence
243 פרקים
Manage episode 283453641 series 2865115
תוכן מסופק על ידי Data on Kubernetes Community. כל תוכן הפודקאסטים כולל פרקים, גרפיקה ותיאורי פודקאסטים מועלים ומסופקים ישירות על ידי Data on Kubernetes Community או שותף פלטפורמת הפודקאסט שלהם. אם אתה מאמין שמישהו משתמש ביצירה שלך המוגנת בזכויות יוצרים ללא רשותך, אתה יכול לעקוב אחר התהליך המתואר כאן https://he.player.fm/legal.
Bart Farrell interviews Asier Azaceta, Cloud Security Architect in the IBM European Centre of Competence
243 פרקים
所有剧集
×
1 Implementing Data & Databases on K8s within the Dutch Government | DoKC Town Hall 44:54
44:54
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי44:54
Implementing Data & Databases on K8s within the Dutch Government Presented by Sebastiaan Mannem , Director at Mannem Solutions A small walkthrough of projects within the Dutch government running databases on OpenShift. This talk shares success stories, provides a proven recipe to `get it done,` and debunks some of the FUD. Related Links DoKC Website - https://dok.community/ DoKC Meetups - https://www.meetup.com/data-on-kubernetes-community/ Join Slack - https://join.slack.com/t/dokcommunity/shared_invite/zt-1vgv7ymz7-YtLFvZicrcLP9fS3o_r2_w…

1 Unsticking Ourselves from Glue: Migrating PayIt’s Data Pipelines to Argo Workflows and Hera | DoKC Town Hall 23:17
23:17
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי23:17
Unsticking Ourselves from Glue: Migrating PayIt’s Data Pipelines to Argo Workflows and Hera Presented by Matt Menzenski, Senior Software Engineering Manager, Payitgov At PayIt, we’ve been deploying applications to Kubernetes almost since the beginning of the company. Our data workloads, however, have run instead in AWS Glue. This has worked well enough for the reporting use cases that have been the main focus of this team historically. However, at the beginning of 2022, the PayIt data team began building out a new data platform, and in the process, ran into a number of challenges with Glue. In this talk, I will share the difficulties that we encountered with building, deploying, and orchestrating ETL pipelines in AWS Glue, our decision process for moving those workloads into Kubernetes, and the ELT architecture that we’ve arrived at today. Related Links DoKC Website - https://dok.community/ DoKC Meetups - https://www.meetup.com/data-on-kubernetes-community/ Join Slack - https://join.slack.com/t/dokcommunity/shared_invite/zt-1vgv7ymz7-YtLFvZicrcLP9fS3o_r2_w…

1 Repel Boarders! How to find a Kubernetes operator that really protects your data | DoKC Town Hall 19:22
19:22
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי19:22
Repel Boarders! How to find a Kubernetes operator that really protects your data Presented by Robert Hodges, Altinity Operators are a godsend for managing data in Kubernetes. But how about protecting it? We'll explore security threats to cloud native databases and show what protection you should look for in operators. Finally we'll introduce a new Data on Kubernetes Community project to develop security standards for database operators in Kubernetes. Related Links DoKC Website - https://dok.community/ DoKC Meetups - https://www.meetup.com/data-on-kubernetes-community/ Join Slack - https://join.slack.com/t/dokcommunity/shared_invite/zt-1vgv7ymz7-YtLFvZicrcLP9fS3o_r2_w…

1 DoK + Apache Spark | DoKC Town Hall 19:52
19:52
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי19:52
DoK + Apache Spark Presented by Holden Karau, Spark Committer and Open Source Engineer at Netflix In this brief talk, Holden will cover some of the best practices from trying to deploy both small and large scale Spark on Kube. Related Links DoKC Website - https://dok.community/ DoKC Meetups - https://www.meetup.com/data-on-kubernetes-community/ Join Slack - https://join.slack.com/t/dokcommunity/shared_invite/zt-1vgv7ymz7-YtLFvZicrcLP9fS3o_r2_w…

1 DoK @ Comcast - Deliver Business Outcomes & Improved DevX with Data Services on K8s | DoKC Town Hall 16:43
16:43
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי16:43
DoK @ Comcast: Delivering Business Outcomes & Improved DevX with Data Services Running on Kubernetes Presented by Greg Otto, Executor Director, DevX Platforms & Charles Ju, Principal Engineer Transforming how to deliver measurable value using data on Kubernetes, while providing psychological safety. If you just sighed, you’re one of the many people leaders and engineers who know the challenges of making these changes. In this talk, we will share our transformation journey, the “Months to Minutes” outcomes we achieved, the architecture approach, and the human journey from one of our engineers. Related Links DoKC Website - https://dok.community/ DoKC Meetups - https://www.meetup.com/data-on-kubernetes-community/ Join Slack - https://join.slack.com/t/dokcommunity/shared_invite/zt-1vgv7ymz7-YtLFvZicrcLP9fS3o_r2_w…

1 DoK Talks - What is Kafka? The rise of one of the world's most used streaming data technologies // Abbey Russell 15:28
15:28
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי15:28
Abbey Russell, PM at Cockroach Labs, shared the backstory on how and why Kafka was created. Along the way, you'll learn about - Who Franz Kafka was - Kafka's earliest use at Linkedin in 2010 - Why organizations like Uber/Coursera/Mailchimp use it today - Future of Data Streaming To find out more about how organizations are benefitting from running streaming workloads on Kubernetes, check out our 2022 research report- https://dok.community/data-on-kubernetes-2022-report/ https://dokcommunity.slack.com/ https://dok.community/ About Data on Kubernetes Community (DoKC) DoKC is an independent tech community where practitioners go to run data workloads on Kubernetes. It is managed by Constantia.io. We produce a range of resources for end users looking to run DoK including: - DoK Landscape An interactive landscape of products and services for running data workloads on Kubernetes. (https://dok.community/landscape/) - Videos and Blogs Hundreds of videos and blogs including how tos, tutorials, use cases, case studies, and more. (https://dok.community/landscape/) (https://dok.community/blog/) - DoK Reports DoKC works with an independent research firm to annually produce the industry’s only DoK-focused report. (https://dok.community/dok-reports/) - Livestreams and Local Meetups Weekly livestreams on DoK-related topics including demos, tutorials, visionary talks, panels, and more. (https://www.meetup.com/data-on-kubern...) - DoK Days @ KubeCon We host hybrid events twice a year for people to get together and learn in-person or online. Details are published on our main website. (https://dok.community/) Connect With Us: https://dok.community/ https://twitter.com/DoKcommunity https://www.linkedin.com/company/6667... https://github.com/dokc…

1 DoK Talks - (almost)Everything you need to know about stateful cloud native network applications // W Watson 43:39
43:39
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי43:39
https://go.dok.community/slack https://dok.community/ https://youtu.be/KjiK6eXYO34 DoK Talk with W Watson, Founder at Vulk Co-op

1 The Outer Nerd #001 - Dungeons & Dragons - Why should you care? // Abhi Vaidyanatha, Fabian Met & Chase Christensen 58:25
58:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי58:25
https://dokcommunity.slack.com/ https://dok.community/ ABSTRACT OF THE TALK Fabian, Chris and Abhi will discuss their passion for roleplaying games, and what they can teach us about the power of community, improvisation, and using our creativity.

1 DoK Talks #155 - Databases at the edge with K3s and ARM devices // Sergio Méndez 49:40
49:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי49:40
https://go.dok.community/slack https://dok.community/ https://youtu.be/KjiK6eXYO34 ABSTRACT OF THE TALK In this talk Sergio is going to present different ways to store data at the edge using different databases and Long Horn as a storage class. All this running on a Raspberry Pi and showing and small application using a database running at the edge. BIO Sergio Méndez is a systems engineer and professor of operating systems at USAC Guatemala university. His work at the university is related to teaching and researching cloud native technologies with his students. He has experience working on DevOps, and MLOps using open source technologies at work. About open source communities, he is involved in the CNCF Community, promoting students into the CNCF Ecosystem and hosting a Cloud Native meetup in Guatemala. He has been a speaker at several conferences such as KubeCon, WTFisCloudNative, and Kubernetes Community Days. He is also a Linkerd Ambassador and author of Edge Computing systems with Kubernetes with Packt publisher. KEY TAKE-AWAYS FROM THE TALK How to storage data at the edge using databases…

1 DoK Talks #154 - StatefulSets in K8 // Srinivas Karnati 31:55
31:55
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי31:55
https://go.dok.community/slack https://dok.community/ Link: https://youtu.be/n_thXwyJNSU ABSTRACT OF THE TALK Deploying Stateless applications is easy but this is not the case for Stateful applications. StatefulSets are the K8s API object that helps to manage stateful application. Learn about what Stateful sets are, how to create, How it differs from Deployments. BIO Passionate about Cloud Native, Kubernetes, Developer relations KEY TAKE-AWAYS FROM THE TALK This talk is focused on basics of StatefulSet, how StatefulSet differs from Deployments, How to manage Stateful app using StatefulSet…

1 Data-driven Diversity, Equity, and Inclusion // Lisa-Marie Namphy, Melissa Logan, Tiffany Jachja, Audra Montenegro & Cortney Nickerson (DoK Day North America 2022) 19:50
19:50
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי19:50
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY )

1 Formula 1 telemetry processing using Apache Kafka on Kubernetes // Paolo Patierno (DoK Day North America 2022) 15:36
15:36
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי15:36
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) Video - https://youtu.be/4cPVRWOK-_E ABSTRACT Apache Kafka is the de facto data streaming platform used for ingesting vast amounts of data and processing them in real-time. Low latency analytics are vital if users are to react to events as fast as possible and to effectively shape future decision making. Together with Kubernetes, it allows to develop cloud oriented analytics solutions which are highly scalable. During this session, we will explore an Apache Kafka based solution running on Kubernetes to help a Formula 1 team gain insights during a race. The “real” data will come from a well-known racing game and will be processed by our application in real-time, making us feel like real track-side F1 engineers!…

1 Choosing Kubernetes for Stateful Applications // Akshay Ram & Peter Schuurman (DoK Day North America 2022) 18:31
18:31
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי18:31
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) Video - https://youtu.be/Y4tdy9lctEI ABSTRACT Learn how customers are increasingly deploying stateful applications on Kubernetes to benefit from portability, economies of scale, and built-in orchestration capabilities. This talk will include how customers choose between using Kuberentes, or a data Software as a Service (SaaS) and stateful capabilities of Kubernetes across two dimensions - the application orchestration and the storage layer. Also learn about MariaDB SKYSQL, a database software as a service that runs thousands of StatefulSet Pods across multiple zones and regions on Kubernetes.…

1 Kubernetes 360º - Data driven observability - from Secrets to logs // Ben Hirschberg (DoK Day North America 2022) 17:11
17:11
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי17:11
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) Video - https://youtu.be/A1ch4AhKoeQ ABSTRACT If there’s one thing that everyone can agree on - it’s that the sheer scale and complexity of Kubernetes operations is growing constantly. What’s more, cloud native environments are becoming more and more expensive to operate and manage, as well as increasingly difficult to secure. On the bright side, there is a growing ecosystem of exceptional open source tools to help overcome this complexity, and provide greater situational awareness to what’s happening in your many and multiple Kubernetes clusters. In this talk, we’ll take a deep dive on how to understand everything that is happening with stateful workloads in your Kubernetes clusters from health to security - including the management and monitoring of your clusters, troubleshooting and debugging, through full security scanning in multiple data perspectives- from configMaps, Secrets, hostPaths and many more security driven checks for proper role-based access control, vulnerabilities and misconfigurations. We will demonstrate how you can have a 360º-view of your Kubernetes clusters in a single open source dashboard leveraging tools like Lens and Kubescape, and ultimately enforce policies with Open Policy Agent.…

1 Shifting Left Stateful Applications In Kubernetes // Viktor Farcic (DoK Day North America 2022) 15:52
15:52
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי15:52
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) Video - https://youtu.be/LymPjH6HA3E ABSTRACT Stateless apps are easy to manage. More often than not, a Kubernetes Deployment, with a Service, Ingress, and Horizontal Pod Autoscaler (HPA) is enough. Almost everyone can do it. But, when it comes to stateful applications, things become a bit more complicated. We might need a database and storage. We might need to manage database users and schema. We might need to consider quite a few other things. Stateful apps are harder for everyone, especially if we want to shift left and enable developers to do it themselves. In this talk, we'll try to make the management of stateful applications easy for everyone. We'll accomplish that by creating easy-to-consume services that are made specifically for the needs of our organizations. We'll see how to create new Kubernetes Custom Resource Definitions (CRDs) and controllers using Crossplane. Those controllers will envelop all the tools, resources, and processes we might need. As a result, ops can focus on creating such services while everyone else can consume them (create and manage everything related to stateful apps) without opening JIRA tickets and waiting for others to complete their tasks.…
D
Data on Kubernetes Community

1 Medical - Healthcare Data on Kubernetes // Olyvia Rakshit & Prasad Dorbala (DoK Day North America 2022) 13:41
13:41
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי13:41
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT Healthcare organizations are transforming their applications and embracing digital platforms for efficient patient care. Today, compute at the edge, plays a critical role in deploying innovative healthcare applications that promise new approaches to patient care. Connected care platforms at scale are turning to Kubernetes infrastructure for deployment of telehealth and telemedicine apps. These kinds of applications often need real time image processing and inferencing at the edge, which in turn, require a distributed application platform on Kubernetes. They also need secure foundations for a regulatory compliant framework for wider adoption. To elaborate further, a distributed platform or a multi-cluster infrastructure is necessary for healthcare applications. All medical application platforms are data intense. For eg. Medical applications that need to stream video data, i.e move large quantities of data need secure connections from edge to cloud. Smart healthcare applications are also seeing the value of multi-cloud deployments. The ability to choose different technologies like AI, data storage systems or some specific relational database systems is driving multi-cloud adoption in healthcare solutions. A multi-cluster connectivity fabric is thus needed for data intensive healthcare applications to distribute across geographical boundaries. The fabric needs to have inbuilt automation and the smarts to make the best use of compute resources in a distributed infrastructure, for improved patient healthcare.…
D
Data on Kubernetes Community

1 Highly Available Postgres Clusters In Kubernetes // John Long & Jonathan Gonzalez (DoK Day North America 2022) 15:04
15:04
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי15:04
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT A practical session about running Highly Available PostgreSQL in Kubernetes. The primary objective will be to demonstrate how to set up a reliable architecture in a Kubernetes cluster to achieve low RTO and RPO. This will be covered by going over the various Kubernetes native components of our fully open source operator CloudNativePG[0]. It will be demonstrated how these components allow for a simple adoption of PostgreSQL into Kubernetes environments. If time and technical constraints permit there will be a live demo at the end of the presentation; if for some reason that is not possible there will be a link to a recorded demo. https://cloudnative-pg.io/…
D
Data on Kubernetes Community

1 Inter-Cluster PostreSQL on Kubernetes // Julian Fischer (DoK Day North America 2022) 17:07
17:07
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי17:07
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT In this talk you’ll explore how to run a PostgreSQL cluster across multiple Kubernetes clusters. Learn what challenges arise when using asynchronous streaming replication in a set of Kubernetes clusters spanning across several geographical regions. It will be discussed how such a topology can be established using Kubernetes and Kubernetes Add-Ons. You will learn which qualities the system has, where such a distributed system plays its strengths and where it may be problematic to use it. At the end of the talk you will have an increased understanding about how to create multi-regional data service instances using PostgreSQL as an example.…
D
Data on Kubernetes Community

1 Open Source Databases on Kubernetes- Best Practices // Peter Zaitsev (DoK Day North America 2022) 16:04
16:04
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי16:04
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT So you’re looking to run your Open Source Database on Kubernetes. What best practices should you follow and what pitfalls should you avoid ? In this presentation we will look at how to run stateful applications on Kubernetes overall as well as what is particularly important for databases - we will cover high availability, security, backups and disaster recovery. Finally we will show how these practices can be implemented with Percona Operators for MySQL, MongoDB, PostgreSQL - one of the leading solutions to run Open Source Databases on Kubernetes…
D
Data on Kubernetes Community

1 The Kubernetes Native Database // Jeffrey Carpenter (DoK Day North America 2022) 16:26
16:26
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי16:26
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT In the software industry we’re fond of terms that define major trends, like “cloud native”, “Kubernetes native” and “serverless”. As more and more organizations move stateful workloads to Kubernetes, we’ve started to see these terms applied to data infrastructure, where they can get overtaken by marketing hype unless we work to define them. In this talk, we’ll examine two different databases, TiDB and Apache Cassandra, in order to identify what it means for a database to be Kubernetes native and why it matters. We’ll look at points including: - The differences between cloud native, Kubernetes native, and serverless - How databases become Kubernetes native - Benefits of Kubernetes native databases - How Kubernetes can better support databases…
D
Data on Kubernetes Community

1 Databases on Kubernetes: Why are they important? // With Bhavin Shah, Xing Yang, Gabriele Bartolini & Patrick McFadin (DoK Day North America 2022) 34:51
34:51
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי34:51
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT Kubernetes has crossed the chasm, but what about stateful applications and databases? Join us for this panel discussion and learn more about how organizations are deploying different databases like PostgreSQL and Cassandra on Kubernetes, what are the benefits of running databases on Kubernetes, and how the ecosystem is working towards making these things boring, so you can focus on your applications! We will have an interactive discussion with the hosts and guests of the Kubernetes Bytes podcast, and open it up to the audience to ask questions and learn more about the what, why, and how about Databases on Kubernetes!…
D
Data on Kubernetes Community

1 Data streaming on Kubernetes // Yaniv Ben Hemo (DoK Day North America 2022) 13:51
13:51
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי13:51
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT I will cover what is the current data streaming on k8s landscape, why it is important, use cases, and what are the challenges needed to solve
D
Data on Kubernetes Community

1 Architecting Your First Event Driven Serverless Streaming Applications on K8 // Timothy Spann (DoK Day North America 2022) 13:29
13:29
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי13:29
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT Once you have built a topic in Apache Pulsar, you will quickly see the need to build event-driven applications. This can require a lot of decisions on what framework to use, where to run it, how to deploy it, and how to manage these applications on Kubernetes cloud natively. I will walk you through step-by-step in building Pulsar Functions which is the easy way to design, test, develop, integrate, deploy, monitor, and manage serverless streaming applications in Java and Python. Together we will build a full application as an Apache Pulsar function and enjoy the power of running it in the cloud for IoT events and add any routing, transformation, or machine learning that we need to accomplish our business requirements. Through FunctionMesh we run on Kubernetes natively. In this talk, you will deploy ML functions to transform real-time data on Kubernetes.…
D
Data on Kubernetes Community

1 Fybrik - A Kubernetes based platform for governed data use // Flora Gilboa-Solomon, Alexey Roytman, Maryna Strelchuk & Barry Hijkoop (DoK Day North America 2022) 20:59
20:59
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי20:59
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT Data is the foundation for business value. However, in many enterprises, it is spread across different data stores, public/private clouds, and on-premises. The use of data is governed by regulatory requirements and enterprise policies and enterprises face dynamic data residency challenges. Fybrik ( https://fybrik.io ) is an open-source, Kubernetes-based infrastructure that enables easier access to data distributed across hybrid and multi-cloud environments, while enforcing data governance policy according to the business’s needs. In this talk, we will share use cases from a pilot project carried out with ING (a multinational banking corporation). Fybrik enabled ING the use of their data across geographical borders, across clusters in a secure, automated, and compliant manner.…
D
Data on Kubernetes Community

1 The Challenges of Data Processing On Kubernetes - A look at Spark, Flink, Dask, and Ray // Holden Karau (DoK Day North America 2022) 20:09
20:09
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי20:09
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT This talk will go through both the improvements that have been made in Kubernetes for batch analytic workloads as well as some of the current pain experienced by users and developers moving their workloads to Kube. In this talk you will learn about how we “cheated” back in the YARN and Mesos days to make things go fast, why Kubernetes doesn’t like those cheats, and what some alternatives are.…
D
Data on Kubernetes Community

1 Scaling our SaaS offering to thousands of clusters // Dax McDonald (DoK Day North America 2022) 21:04
21:04
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי21:04
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) ABSTRACT Sourcegraph is a code intelligence platform that helps our customers to understand their code better. As we have scaled up, we are starting to run hundreds of instances for our customers in separate kubernetes clusters. Running dozens of distinct clusters with a stateful application that scales horizontally and vertically has been challenging. Sourcegraph's inherently stateful nature requires thoughtful planning to achieve resilience while making tradeoffs to keep costs down. This talk will discuss our methodologies to allow our cluster to scale while minimizing drift across multiple customer deployments. I will also cover our use of Terraform, ArgoCD, and Helm to manage the application and infrastructure. Finally, this talk will discuss our disaster recovery strategies that allowed our organization to achieve SOCII compliance while iterating quickly. If you are looking to scale out a managed offering at your organization, learn from our mistakes and come to this talk!…
D
Data on Kubernetes Community

1 Why we decided to migrate our Jaeger storage to ClickHouse on Kubernetes // Arul Jegadish Francis (DoK Day North America 2022) 13:48
13:48
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי13:48
From the DoK Day North America 2022 ( https://youtu.be/YWTa-DiVljY ) Abstract We at OpsVerse provide a DevOps tools platform with fully-managed open source-based tools. One of our key offerings is a holistic observability platform. Metrics and logs are straightforward to aggregate, however traces – which are collected using CNCF Jaeger – were left with some holes in advanced insights. In this talk, I will explain why we decided to switch our Jaeger storage backend to ClickHouse, and why we decided to run it on Kubernetes. I will also share tips and tricks to migrate and run ClickHouse on Kubernetes.…
D
Data on Kubernetes Community

1 Building a Digital Factory for the Sheet Metal Industry // Elie Assi (From the DoK Day North America 2022) 20:48
20:48
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי20:48
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) Abstract We develop systems to digitize the sheet metal industry with the belief that they should cooperate with each other in an open way. We are convinced that the future lies in creating a software ecosystem that interconnects all levels of the company and even manages to communicate with supplier and customer systems, making for more agile management throughout the entire value chain. One of our newer projects is providing a multi-tenant cloud service to our clients. The chosen database for the service we are developing is MongoDB. After testing with docker containers on virtual machines, we decided to finally host the service in Kubernetes because we wanted something to be able to scale at large for our clients being a service that will be consumed by a multitude of our clients, so to the database in order to reduce latency.…
D
Data on Kubernetes Community

1 How we built our Big Data Stack (almost) entirely on top of Kubernetes // Neylson Crepalde (From DoK Day NA 2022) 16:00
16:00
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי16:00
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) Abstract Working with Terabytes of data is a major challenge for organizations both in terms of architecture and cost. In recent years, a new paradigm has emerged in the world of Big Data, that is, implementing the entire architecture for processing massive data from a microservices approach. This architecture proved to be scalable, efficient, with excellent performance, failsafe, portable between different cloud providers and at a lower cost than conventional architectures. In this talk, I will present how my team managed to implement this architecture based on Kubernetes and what were the main gains we had with the innovation.…
D
Data on Kubernetes Community

1 Dok Talks #153 - CRD Panel // Eyar Zilberman & Álvaro Hernández 58:05
58:05
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי58:05
https://go.dok.community/slack https://dok.community We are going to speak about CRDs, and discuss considering them as higher level entities that we normally consider them. CRDs normally are kind of a byproduct of an operator. But in reality, they can be considered as the user-facing API of the operator surface. And as such, we would like to introduce conversations about even standardizing some of them, or creating "specifications" out of some CRDs. Take this for example: there are some CRDs that could be, potentially, common across multiple operators. Think of a CRD to represent an S3 (or any other storage object) bucket. For operators that store data in object stores, each and every one of them are creating their own versions. But shouldn't we have some "standardized" `ObjectStorage` CRD, without a controller, that all could use? We'll be joined by John Sanda of DataStax who works on the K8ssandra project, Alvaro Hernandez who is the CEO of Ongres and Postgres superhero, and Eyar Zilberman Co-Founder and Chief Product Officer at Datree.…
D
Data on Kubernetes Community

1 Dok #152-Running PostgreSQL in Kubernetes:from day 0 to day 2 with CloudNativePG // Gabriele Bartolini 1:03:50
1:03:50
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:03:50
https://go.dok.community/slack https://dok.community With: Gabriele Bartolini - Vice President/CTO of Cloud Native and Kubernetes, EDB Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK Imagine this: you have a virtual infrastructure based on Kubernetes, made up of virtual data centers, possibly spread across multiple Kubernetes clusters and regions. Your infrastructure could even be hosted on premises or on different cloud service providers. Infrastructure as Code is a requirement. You’ve been tasked to run Postgres databases, alongside your applications. The good news is that you can leverage a fully open source stack with Kubernetes, PostgreSQL and the CloudNativePG operator, and deploy your Postgres database in the same way you deploy applications. Join me in this webinar to discover the key role that you have to make this succeed, starting from day 0 through day 2 operations. I’ll share some examples and best practices for running Postgres databases in Kubernetes, before peeking at the new features we are developing for the months to come. BIO A long time open source programmer and entrepreneur, Gabriele has a degree in Statistics from the University of Florence. After having consistently contributed to the growth of 2ndQuadrant and its members through nurturing a lean and devops culture, he is now leading the Cloud Native initiative at EDB. Gabriele lives in Prato, a small but vibrant city located in the northern part of Tuscany, Italy - known for having hosted the first European PostgreSQL conferences. His second home is Melbourne, Australia, where he studied at Monash University and worked in the ICT sector. He loves playing the Blues with his Fender Stratocaster, but his major passions are called Elisabeth and Charlotte!…
D
Data on Kubernetes Community

1 Dok Talks #148 - Cost and Kubernetes // Chris Love 45:25
45:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי45:25
https://go.dok.community/slack https://dok.community With: Chris Love - Managing Partner, LionKube Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK Using Kubernetes to run data workloads costs less than running the same workloads on separate servers. But how do we save at least twenty to thirty percent more? We need the data and a cost-savings culture. We will explore a project called Open Cost: cost monitoring for cloud-native environments. Secondly, we will explore why company culture is more important than cost savings tools. BIO Chris Love is a Google Cloud Certified hybrid Multi-cloud Fellow and a Co-founder of Lionkube. Chris co-authored the book Core Kubernetes. He has over 25 years of software and IT engineering experience with companies including Google, Oracle, VMWare, Cisco, Johnson & Johnson, and others. As a thought leader within Kubernetes and the DevOps community, Chris Love has contributed to many open source projects, including Kubernetes, kops (former AWS SIG lead), Bazel (contributed to Kubernetes rules), and Terraform (an early contributor to the VMware plugin). His professional interests include Kubernetes, IT culture transformation, containerization technologies, automated testing frameworks and practices, and DevOps. Love also enjoys speaking around the world about DevOps, Kubernetes, and Technology and mentoring people in the IT and Software industry. Outside of work Love enjoys skiing, volleyball, yoga, and other outdoor activities that come with living in Colorado. He’s also been a practicing martial artist for over 20 years. If you’re interested in having virtual coffee or have questions for Chris, you can contact him at @chrislovecnm on Twitter or LinkedIn. KEY TAKE-AWAYS Two things will save us money running a stateful workload hosted in Kubernetes; data and company culture.…
D
Data on Kubernetes Community

1 Dok Talks #151 - Analytics with Apache Superset and ClickHouse // Vijay Anand Ramakrishnan 33:00
33:00
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי33:00
https://go.dok.community/slack https://dok.community With: Vijay Anand Ramakrishnan - Database Administrator, ChistaDATA Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK This talk concerns performing analytical tasks with Apache Superset with ClickHouse as the data backend. ClickHouse is a super fast database for analytical tasks, and Apache Superset is an Apache Software foundation project meant for data visualization and exploration. Performing analytical tasks using this combo is super fast since both the software are designed to be scalable and capable of handling data of petabyte scale. BIO Vijay Anand is based out of Chennai (India), working as a Database Administrator in ChistaDATA. He has extensive experience in ClickHouse, Python and has contributed as a technical lead in multiple organizations building ClickHouse based solutions. His areas of interest include database design, building software solutions using open source technologies. He is the author of a book on ClickHouse titled "Up and Running with ClickHouse". KEY TAKE-AWAYS Real time analytics, Data exploration and Visualization…
D
Data on Kubernetes Community

1 Dok Talks #150 - Building a Simple Postgres Async Streaming Cluster // Julian Fischer 1:04:45
1:04:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:04:45
https://go.dok.community/slack https://dok.community With: Julian Fischer - CEO, anynines GmbH Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK In this talk you will learn how to build the a Postgres service with Kubernetes. See how asynchronous replication is set up using a Kubernetes resources including, a headless Service, StatefulSet, ConfigMap and Secret. BIO Julian Fischer, CEO of anynines, has dedicated his career to the improvement and automation of software operations. In more than fifteen years,he has built several application platforms with various automation tools. His latest passions are data service automation, Cloud Foundry, BOSH and Kubernetes. KEY TAKE-AWAYS At the end of the talk you will know the Kubernetes resources a controller had to manage as input to building an actual Operator.…
D
Data on Kubernetes Community

1 DoK Talks #149 - Overcoming challenges with protecting and migrating data in multi-cloud K8s environments // Sebastian Glab & Martin Phan 47:40
47:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי47:40
https://go.dok.community/slack https://dok.community/ With: Sebastian Glab - Cloud Architect, CloudCasa by Catalogic Martin Phan - Field CTO – Americas, CloudCasa by Catalogic Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK If you are running or planning a multi-cloud or even a multi-cluster environment, there are several considerations in implementing a data protection solution – especially if you plan on an organic home-grown, do-it-yourself option. This talk will highlight challenges and best practices around centralized management of configuration, credentials, compliance across multiple accounts, regions, providers etc. We will also highlight the deviations in CSI driver implementations of various storage vendors and cloud providers. Finally, we will cover the various recovery options available in the market today. Kubernetes cloud services are popular since they mitigate, but do not eliminate, the difficulties of operating a Kubernetes environment. This is especially true for protecting the stateful configuration and data of your Kubernetes applications, where the inherent high-availability and infrastructure as code are not a substitute for have cloud-native backup and disaster recovery capabilities. Further, many companies now have multi-cloud strategies for their cloud-native applications. These challenges can be addressed with backup applications that are both Kubernetes managed service and multi-cloud aware in order to snapshot, copy, restore, and migrate Kubernetes workloads (resources and data) running on AKS, EKS and GKE. Capturing information from cloud accounts and how the cluster and storage resources are configured allows 1) centralized visibility into all cloud accounts and the clusters and resources in the accounts including for compliance; 2) cross-account, cross-cluster, and cross-region data restores; 3) automation of the cluster and data restores including for Dev, Test, and Production recovery use cases. BIO Sebastian Glab is a Cloud Architect for CloudCasa and he resides in Poland. He is responsible for integrating the different cloud providers with the CloudCasa service, and making sure that all clusters in the cloud service get discovered and protected. In his free time, he plays volleyball and develops his own projects. Martin Phan is the Field CTO in North America for CloudCasa by Catalogic Software. With over 20+ years of experience in the software-industry, he takes pride in supporting, developing, implementing, and selling enterprise software and data protection solutions to help customer solve their backup and recovery challenges. KEY TAKE-AWAYS FROM THE TALK 1) Challenges and best practices around centralized management of configuration, credentials, compliance across multiple accounts, regions, providers etc. 2) Advantages of cloud awareness and Kubernetes managed service awareness for application and data recovery and security 3) Examples of overcoming Container Storage Interface (CSI) deviations 4) Various recovery options available in the market today.…
D
Data on Kubernetes Community

1 DoK Talks #147 - Evaluating Cloud Native Storage Vendors // Dinesh Majrekar 1:00:03
1:00:03
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:00:03
https://go.dok.community/slack https://dok.community/ With: Dinesh Majrekar - CTO, Civo Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK In a continuation of a talk given at DoK day at KubeCon EU 2022, join Dinesh Majrekar, Civo's CTO as they walk through their evaluation process of the CNCF Storage market. Civo offers managed Kubernetes clusters powered by K3s to customers around the world. We manage thousands of Virtual Machines and stateful customer data within multiple data centres across several continents. In late 2021, Civo had the opportunity to evaluate the CNCF storage landscape to move to a new technology stack. During the migration project, Civo evaluated Mayastor, Ondat, Ceph and Longhorn against the following metrics: Scalability Performance Ease of Support Attendants will see practical examples on how they could carry out their own similar evaluation and see some of the results of the Civo research project. BIO Dinesh is CTO at Civo. Having worked in the hosting industry for many years, Dinesh has a passion for creating solutions that operate at scale. This not only applies to the technology stack, but for nurturing engineers through their career.…
D
Data on Kubernetes Community

1 Dok Talks #146 - OpenFeature - Making feature flags a commodity // Oleg Nenashev 1:01:30
1:01:30
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:01:30
https://go.dok.community/slack https://dok.community/ With: Oleg Nenashev - Community Builder and Developer Advocate, Dynatrace Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK With the evolution of software delivery methods, feature flags became one of the ways to deliver features and get user feedback. Modern applications may have hundreds of feature flags describing customizations, preview features and environment specifics. Feature flags are not just a single toggle anymore, their values are often determined at runtime depending on the instance, user context and inputs from the control plane. In this session we will talk about OpenFeature - a new collaboration by many feature flag vendors and end users. OpenFeature, a CNCF sandbox project, is an open standard for feature flag management, created to support a robust feature flag ecosystem using cloud native technologies and to basically make feature flags a commodity. OpenFeature provides a unified API and SDKs, and a developer-first, cloud-native implementation, with extensibility for open source and commercial offerings. BIO Oleg works on open source programs and Keptn community at the Dynatrace Open Source Program Office. He is a TOC chair and community ambassador in the Continuous Delivery Foundation. Oleg is a core maintainer and board member in Jenkins where he writes code, mentors contributors, and organizes community events and Google Summer of Code in the project. Open source software and open hardware advocate. Oleg has a PhD degree in hardware design and volunteers in the Free and Open Source Silicon Foundation. KEY TAKE-AWAYS FROM THE TALK Feature flags at scale 101. Feature flags are an important topic for stateful apps in K8s, and for progressive delivery and other modern approaches. OpenFeature is like OTel but for feature flags. You can try it now, here is a live demo to follow.…
D
Data on Kubernetes Community

1 DoK Talks #145 - Making Hard Things Easy is Hard // Kurt Rinehart 57:40
57:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי57:40
https://go.dok.community/slack https://dok.community/ https://youtu.be/6eSWOUzCb4w With: Kurt Rinehart - Director of Information Engineering, Section Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK Making complicated things simple is a prime objective of technology. In addition to technical challenges there is an interfacing challenge. At Section we're automating all dimensions of hosting containerized workloads and solving the "location challenge" is one aspect of this. Making it simple to get the locations you want evokes many trade-offs and can collide with past habits that may no longer be suitable. BIO Dr. Kurt Rinehart is the Director of Information Engineering at Section. Kurt holds a PhD in Natural Resource Management, specialising in applying quantitative decision analysis to complex conservation problems. After working as a quantitative consultant, Kurt found and fell in love with startup life and Section. His current responsibilities include designing, developing, and managing autonomic computing systems and providing analytics and data science support across every facet of Section. Kurt is married with 2 kids and a dog, and plays guitar for The Intolerables (theintolerables.com). KEY TAKE-AWAYS FROM THE TALK - Good agile practice and feedback loops is critical. - "Trust but verify" is a good philosophy to keep in mind. - When developers are your users, there are extra challenges in finding the happy balance.…
D
Data on Kubernetes Community

1 DoK Talks #144 - We will Dok You! - The journey to adopt stateful workloads on k8s // Guy Menahem 1:06:30
1:06:30
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:06:30
https://go.dok.community/slack https://dok.community/ https://youtu.be/AjvwG53yLMY With: Guy Menahem - Solution Architect, Komodor Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK Stateful workloads are the heart of any application, yet they remain confusing and complicated even to daily K8s practitioners. That’s why many organizations shy away from migrating their data - their prized possession - to the unfamiliar stateful realm of Kubernetes. After meeting with many organizations in the adoption phase, I discovered what works best, what to avoid, and how critical it is to gain confidence and the right knowledge in order to successfully adopt stateful workloads. In this talk I will demonstrate how to optimally adopt Kubernetes and stateful workloads in a few steps, based on what I’ve learned from observing dozens of different adoption journeys. If you are taking your first steps in data on K8s or contemplating where to start - this talk is for you! BIO - A Developer turned Solution Architect. - Working at Komodor, a startup building the first K8s-native troubleshooting platform. - Love everything in infrastructure: storage, networks & security - from 70’s era mainframes to cloud-native. - All about “plan well, sleep well”. KEY TAKE-AWAYS FROM THE TALK - Understand how critical stateful workloads are for any system, and that the key challenges to migrating it to Kubernetes are knowledge and confidence. - How to build the foundational knowledge required to overcome adoption challenges by creating a learning path for individuals and teams. - How to gain confidence to run stateful workloads on Kubernetes with support from the community (and yourself!)…
D
Data on Kubernetes Community

1 DoK Talks #142 - Kubernetes Cluster Upgrade Strategies and Data: Best Practices for your Stateful Workload // Peter Schuurman 58:45
58:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי58:45
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK How do you make sure your Stateful Workloads remain available when your Kubernetes infrastructure updates? This talk will discuss different strategies of upgrading a Kubernetes cluster, and how you can manage risk for your workload. The talk will showcase demos of each upgrade strategy. BIO Peter is a Senior Software Engineer on GKE at Google. He works on improving Kubernetes for Stateful workloads. His main focus is on enhancing the Kubernetes ecosystem for high availability applications. KEY TAKE-AWAYS FROM THE TALK The mechanics of different upgrade strategies, when to apply a particular upgrade strategy depending on your Stateful workload and how to mitigate risk to your application’s availability.…
D
Data on Kubernetes Community

1 DoK Talks #144 - Mastering MongoDB on Kubernetes, the power of operators // Arek Borucki 1:00:50
1:00:50
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:00:50
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK During my first talk for DoK community I want to walk you through the world of NoSQL database MongoDB and Kubernetes Operators - Community Edition, Enterprise Edition (MongoDB and Ops Manager on K8s), and Atlas operator, highlight the most important capabilities, talk about use cases and challenges, the theory will be mixed with a live demos! BIO I'm a SRE / NoSQL / DevOps professional. I hold CKA, CKAD, CKS, also I’m MongoDB Certified DBA and MongoDB Champion. I have experience with multiple cloud providers, Kubernetes, different types of K8s operators (Strimzi, RabbitMQ Cluster Operator), but especially MongoDB K8s Operator. I also work with KEDA. Since 2017, I have been a speaker at MongoDB conferences all around the world (USA, China, Europe). KEY TAKE-AWAYS FROM THE TALK I would like to share the best practices of running NoSQL database - MongoDB on Kubernetes also I want to show how to manage Atlas (MongoDB cloud) via K8s operator https://www.mongodb.com/developer/community-champions/arkadiusz-borucki/…
D
Data on Kubernetes Community

1 DoK Specials - Why are Operators paramount to running stateful workloads on Kubernetes? 53:36
53:36
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי53:36
In this panel with Sylvain Kalache, Head of Content at the DoK Community, drives a conversation featuring Nic Vermandé- Principal Developer Advocate at Ondat, Julian Fischer- CEO at anynines, and Sergey Pronin- Group Product Manager at Percona.
D
Data on Kubernetes Community

1 DoK Talks #141 - Dossier: multi-tenant distributed Jupyter Notebooks // Iacoppo Colonnelli & Dario Tranchitella 1:00:10
1:00:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:00:10
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK When providing data analysis as a service, one must tackle several problems. Data privacy and protection by design are crucial when working on sensitive data. Performance and scalability are fundamental for compute-intensive workloads, e.g. training Deep Neural Networks. User-friendly interfaces and fast prototyping tools are essential to allow domain experts to experiment with new techniques. Portability and reproducibility are necessary to assess the actual value of results. Kubernetes is the best platform to provide reliable, elastic, and maintainable services. However, Kubernetes alone is not enough to achieve large-scale multi-tenant reproducible data analysis. OOTB support for multi-tenancy is too rough, with only two levels of segregation (i.e. the single namespace or the entire cluster). Offloading computation to off-cluster resources is non-trivial and requires the user's manual configuration. Also, Jupyter Notebooks per se cannot provide much scalability (they execute locally and sequentially) and reproducibility (users can run cells in any order and any number of times). The Dossier platform allows system administrators to manage multi-tenant distributed Jupyter Notebooks at the cluster level in the Kubernetes way, i.e. through CRDs. Namespaces are aggregated in Tenants, and all security and accountability aspects are managed at that level. Each Notebook spawns into a user-dedicated namespace, subject to all Tenant-level constraints. Users can rely on provisioned resources, either in-cluster worker nodes or external resources like HPC facilities. Plus, they can plug their computing nodes in a BYOD fashion. Notebooks are interpreted as distributed workflows, where each cell is a task that one can offload to a different location in charge of its execution. BIO Iacopo Colonnelli is a Computer Science research fellow. He received his Ph.D. with honours in Modeling and Data Science at Università di Torino with a thesis on novel workflow models for heterogeneous distributed systems, and his master’s degree in Computer Engineering from Politecnico di Torino with a thesis on a high-performance parallel tracking algorithm for the ALICE experiment at CERN. His research focuses on both statistical and computational aspects of data analysis at large scale and on workflow modeling and management in heterogeneous distributed architectures. Dario is an SWE that turned DevOps, and he's regretting this choice day by day. Besides making memes on Twitter that gain more reactions than technical discussions, leading the development of Open Source projects at CLASTIX, an Open Source-based start-up focusing on Multi-Tenancy in Kubernetes. KEY TAKE-AWAYS FROM THE TALK From this talk, people will learn: - The different requirements of Data analysis as a service - How to configure for multi-tenancy at the cluster level with Capsule - How to write distributed workflows as Notebooks with Jupyter Workflows - How to combine all these aspects into a single platform: Dossier All the software presented in the talk is OpenSource, so attendees can directly play with them and include them in their experiments with no additional restrictions.…
D
Data on Kubernetes Community

1 DoK Talks #140 - Data protection of stateful environment // Timothy Dewin 42:35
42:35
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי42:35
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK More and more we see stateful workloads pop up in Kubernetes clusters. These workloads generate data that is unique and is ephemeral. During this talk we will discuss the challenges of stateful workloads and how you can successfully protect BIO Working over a decade in IT as a technical expert for Veeam Software. Specializing in backup for the modern hybrid cloud. Passion for scripting and programming. Husband of Lena, father of Lev. KEY TAKE-AWAYS FROM THE TALK Data backup of Kubernetes. DR for Kubernetes…
D
Data on Kubernetes Community

1 DoK Talks #139 - Private DBaaS on Kubernetes // Sergey Pronin 53:25
53:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי53:25
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Percona is committed to deliver solutions to run open source databases anywhere without lock in. As part of this commitment, we have created Operators to run MySQL, PostgreSQL and MongoDB on Kubernetes. Learn how Percona Monitoring and Management (PMM) allows you to enable developers to deploy and manage databases anywhere with private Database-as-a-service capability backed by Operators. BIO Product and technology leader. Worked in various fields: internet service providers, financial sector and merge & acquisition business. Currently leads product @ Percona focusing on cloud native technologies for open source databases KEY TAKE-AWAYS FROM THE TALK Learn how Percona Monitoring and Management (PMM) allows you to enable developers to deploy and manage databases anywhere with private Database-as-a-service capability backed by Operators. You will get high level overview of Percona's Operators structure and how private DBaaS can boost the productivity of your engineering and IT teams.…
D
Data on Kubernetes Community

1 DoK Talks #138 - Build your own social media analytics with Apache Kafka // Jakub Scholz 56:25
56:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי56:25
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Apache Kafka is more than just a messaging broker. It has a rich ecosystem of different components. There are connectors for importing and exporting data, different stream processing libraries, schema registries and a lot more. The first part of this talk will explain the Apache Kafka ecosystem and how the different components can be used to load data from social networks and use stream processing and machine learning to analyze them. The second part will show a demo running on Kubernetes which will use Kafka Connect to load data from Twitter and analyze them using the Kafka Streams API. After this talk, the attendees should be able to better understand the full advantages of the Apache Kafka ecosystem especially with focus on Kafka Connect and Kafka Streams API. And they should be also able to use these components on top of Kubernetes. BIO Jakub works at Red Hat as Senior Principal Software Engineer. He has long-term experience with messaging and currently focuses mainly on Apache Kafka and its integration with Kubernetes. He is one of the maintainers of the Strimzi project which provides tooling for running Apache Kafka on Kubernetes. Before joining Red Hat he worked as messaging and solution architect in the financial industry. KEY TAKE-AWAYS FROM THE TALK The key takeaway of this talk is that Apache Kafka is more than just a messaging broker. It is a platform and ecosystem of different components which can be used to solve complex tasks when dealing with events or processing data. The talk demonstrates this on loading tweets from Twitter and processing them using the different parts of the Kafka ecosystem. The whole talk and its demos are running on Kubernetes using the Strimzi project. So it also shows how to easily run all the different components on top of Kubernetes with the help of few simple YAML files.…
D
Data on Kubernetes Community

1 DoK Talks #137 - How to build your own “Doordash” app // Yaniv Ben Hemo 57:50
57:50
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי57:50
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK The entire app is built in microservices, running on k8s pods and uses k8s-native message broker called memphis WORKSHOP Learn how to build a cloud-native, distributed, event-driven food delivery application using a message broker, and deploy it to a kubernetes cluster. We will also discuss the importance of using a message broker within our app, and what are the benefits it provides to a more resilient application. The workshop is based on this article - https://medium.com/memphis-dev/how-to-build-your-own-wolt-app-b220d738bb71 Strongly advise to install before - memphis{dev} broker - https://github.com/memphisdev/memphis-brokerminikube - https://minikube.sigs.k8s.io/docs/start/node.js - https://nodejs.org/ If any questions arise, please raise a question on our Discord channel - https://discord.gg/WZpysvAeTf BIO Develop since I was 10 years old KEY TAKE-AWAYS FROM THE TALK Learn the basics of building and deploying cloud-native, distributed, event-driven app…
D
Data on Kubernetes Community

1 DoK Talks #136 - Building a mesh for databases from scratch and why // Maxwell Miao 47:45
47:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי47:45
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK In this talk, Maxwell is going to share his thoughts about Service Mesh and database operations, called Database Mesh, and figure out the design of Pisanix. Running databases on Kubernetes is hard, Database Mesh is building a new standard of databases management on cloud. It tries to make database operation more easier, and provides a framework for different kind of databases. Such as using a CRD called VirtualDatabase to describe a virtual database, developers will only need to know about this, meanwhile DBAs will have to bind TrafficStrategy and DatabaseEndpoint to VirtualDatabase, let Pisanix running like a MySQL proxy for applications. More features like SecurityClaim and TrafficQoS will be added to simplify the database operation work. BIO * Focus on SaaS and Database Mesh * Previously worked at 株式会社ネットスターズ、ChuanyangTech、AntFinancial、Yeepay * Linuxer & Gopher & Rustacean,open source advocator,cloud native envanglist * “CaaSOne” Community Cofounder * Contributor of multiple cloud native open source projects * Participated in writing Analysis and practice of Tensorflow、Istio Handbook KEY TAKE-AWAYS FROM THE TALK What's the difference of Service Mesh and Database Mesh. Why do people need a mesh for databases. What does Pisanix can help DBAs daily work. The current status of Pisanix.…
D
Data on Kubernetes Community

1 DoK Specials - Learn by doing in the DoK Community // Bart Farrell 15:55
15:55
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי15:55
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK As a community we are committed to making learning how to run stateful workloads on Kubernetes as accessible and inclusive as possible. Using our DoK Landscape which we launched at Kubecon, we will be doing a guided "tour" of the technologies featured on there. We will go one by one, providing tutorials with concrete instructions for you tp get hands on experience. At the end of each sprint we will share our learnings in public. BIO CNCF Ambassador. Data on Kubernetes (@Dokcommunity) Community Leader, amateur rapper, tech from the human side.…
D
Data on Kubernetes Community

1 DoK Talks #135 - DoK isn't just Database on Kubernetes // Patrick McFadin 46:00
46:00
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי46:00
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK What about your streaming and analytic workloads? If you are all-in on Kubernetes you can't forget about these important parts of your infrastructure. I'll talk about the current state of the art. Why organizations may hesitate to go beyond deploying databases in Kubernetes and most important, some key things you need to be successful. BIO Patrick McFadin is the co-author of the upcoming O’Reilly book “Managing Cloud-Native Data on Kubernetes” He currently works at DataStax in Developer Relations and as a contributor to the Apache Cassandra project. Patrick has worked as Chief Evangelist for Apache Cassandra and as a consultant for DataStax, where he had a great time building some of the largest deployments in production. Previous to DataStax, he held positions as Chief Architect, Engineering Lead and Database DBA/Developer. KEY TAKE-AWAYS FROM THE TALK People should walk away with a better understanding of what it takes to deploy streaming and analytic workloads in Kubernetes.…
D
Data on Kubernetes Community

1 DoK Talks #134 - Introducing CloudNativePG // Gabriele Bartolini & Leonardo Cecchi 1:05:20
1:05:20
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:05:20
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK CloudNativePG is an open source operator for the orchestration of Postgres workloads with a primary and an arbitrary number of read-only replicas that recently applied for the CNCF Sandbox. In this presentation we'll go through the main capabilities and run a live demo showing some of the main characteristics. BIO A long time open source programmer and entrepreneur, Gabriele has a degree in Statistics from the University of Florence. After having consistently contributed to the growth of 2ndQuadrant and its members through nurturing a lean and devops culture, he is now leading the Cloud Native initiative at EDB. Gabriele lives in Prato, a small but vibrant city located in the northern part of Tuscany, Italy - known for having hosted the first European PostgreSQL conferences. His second home is Melbourne, Australia, where he studied at Monash University and worked in the ICT sector. He loves playing the Blues with his Fender Stratocaster, but his major passions are called Elisabeth and Charlotte! Leonardo has been in computer science from the beginning, and programming has always been his obsession. The passion began when he was 6, with his father teaching him the rudiments of C programming. He is a long time PostgreSQL enthusiast, using it since 1998. He is one of the main contributors of the CloudNativePG project and a maintainer. In his free time, when he’s not having fun with Kubernetes and new programming languages, he’s a classical guitarist. KEY TAKE-AWAYS FROM THE TALK Run Postgres inside Kubernetes, alongside applications. Understand how to develop Cloud Native applications that rely on Postgres for data access through SQL. Understand how business continuity of Postgres databases can be done with CloudNativePG…
D
Data on Kubernetes Community

1 Dok Talks #133 - My First 90 days with Clickhouse // Alkin Tezuysal 47:20
47:20
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי47:20
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK This talk will tell the story of an analytics use case database from a non-OLAP and ACID-compliant RDBMS (MySQL) perspective. I will cover the basics of the Clickhouse database Sample Clickhouse installation in a lab environment. We are configuring Clickhouse for essential operations. We will load the sample data set and monitor it. We will query and visualize the results. This talk will also base on how Kubernetes can help Clickhouse implementation via an operator. Conclusions will include Do's and Don't of this emerging technology. Best practices and some advice around ingesting and analyzing terabytes of data efficiently. BIO Alkin Tezuysal has extensive experience in open source relational databases, working in various sectors for large corporations. With over 25 years of industry experience, he has acquired skills for managing large projects from the ground up to production. For the past decade, he's been focused on e-commerce, SaaS, and MySQL technologies. Alkin has managed and architected database topologies for high-volume sites. He has several years of experience in 24X7 support and operational tasks and improving database systems for major companies. He has led global operations teams on Tier 1/2/3 support for MySQL customers. He currently holds the position of EVP - Global Services at fast-growing startup ChistaDATA Inc. He's also co-author of the upcoming MySQL Cookbook 4th Edition. KEY TAKE-AWAYS FROM THE TALK Introduction to OLAP database from OLTP DBA…
D
Data on Kubernetes Community

1 DoK Specials - DEI Panel - We can do better 57:55
57:55
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי57:55
https://go.dok.community/slack https://dok.community/ With: Melissa Logan - Director, Data on Kubernetes Lisa-Marie Namphy - Head of Developer Relations, Cockroach Labs Alexandra Rowell - Community Manager, SignalWire Cortney Nickerson - Developer Advocate, Kubeshop Iraide Ruiz - Engineering Manager, Twilio Tiffany Jachja - Engineering Manager, Vox Media Bart Farrell - Head of Community, Data on Kubernetes Community Diversity Equity and Inclusion can't just be words on a website or in an onboarding presentation. How can we make sure these are living, breathing pillars in our organizations? How can we do better? To take a closer look we got Melissa Logan, Director of the Data on Kubernetes, with an all-star panel to see what's happening and what can be improved.…
D
Data on Kubernetes Community

1 DoK Talks #132 - Time-series on SQL Server on Kubernetes on ARM64… without SQL Server! // Álvaro Hernández 1:05:15
1:05:15
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:05:15
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Oh wow. What a weird title. Full of terms that don’t fit together. Or do they? This talk is for believers, those who believe that the author will deliver what the title says. Here’s the promise on what you will see: * Time-series specialized storage and functions on SQL Server. * SQL Server running on Kubernetes. * Using ARM64 (aarch64) nodes. * In reality, without using SQL Server, but rather Postgres Babelfish! The talk will be focused on a demo to show all these components working together. Surely the demo will fail badly, so get ready your rotten tomatoes! The good news is that you will be able to reproduce the demo on your environment too, as all the software used is open source and source code will be provided. BIO Álvaro is a passionate database and software developer. Founder of OnGres (one of the founding sponsors of DoK), he has been dedicated to PostgreSQL and R&D in databases for more than two decades. Álvaro is at heart an open source advocate and developer. He has created software like StackGres, a Platform for running Postgres on Kubernetes. As a well-known member of the PostgreSQL Community, Álvaro founded the non-profit Fundación PostgreSQL and the Spanish PostgreSQL User Group. He has contributed, among others, the SCRAM authentication library to the Postgres JDBC driver. You can find him frequently speaking at PostgreSQL, database, cloud (becoming an AWS Data Hero in 2019), and Java conferences. In the last 10 years, Álvaro has completed more than 100 tech talks ( https://aht.es ). KEY TAKE-AWAYS FROM THE TALK A key to progress is to combine technologies in novel ways to bring new capabilities to environments that typically lack them. For example, using Postgres Babelfish (a SQL Server-compatible Postgres flavor) and Timescaledb (time-series extension for Postgres) we can deliver open source time-series native capabilities on top of SQL Server protocol, available to SQL Server users.…
D
Data on Kubernetes Community

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) There are plenty Kubernetes Operators for MySQL, including our own at Percona. In this session we will review existing Operators for MySQL and discuss why we are building a new one. Sergey is a passionate technology "driver". After graduation worked in various fields: internet service provider, financial sector and M&A business. Main focal points were infrastructure and products around it. At Percona as a Group Product Manager drives forward Kubernetes and Cloud databases solutions.…
D
Data on Kubernetes Community

1 Why run Postgres in Kubernetes (DoK Day EU 2022) // Gabriele Bartolini 10:02
10:02
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי10:02
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Postgres should run inside your Kubernetes cluster. Yes, inside, not outside Kubernetes. After all, a database should be seen as an application, a special type of application - for which it is legitimate to require an additional level of care and attention. However, the small price you pay for this is worth the return on investment that your organization receives by running microservice applications, plus database combos, inside your Cloud Native infrastructure. In this session, I will cover why it is important to make such a decision. Discover the challenges and the opportunities that running Postgres inside Kubernetes presents. I will discuss what we all expect from a good Postgres operator, including self-healing, high availability, scalability, backup and recovery, performance, and - last but not least - security. I will then go beyond technical aspects to spark conversations about the holistic improvements that running a Cloud Native database brings to your organization. A long time open source programmer and entrepreneur, Gabriele has a degree in Statistics from the University of Florence. After having consistently contributed to the growth of 2ndQuadrant and its members through nurturing a lean and devops culture, he is now leading the Cloud Native initiative at EDB. Gabriele lives in Prato, a small but vibrant city located in the northern part of Tuscany, Italy - known for having hosted the first European PostgreSQL conferences. His second home is Melbourne, Australia, where he studied at Monash University and worked in the ICT sector. He loves playing the Blues with his Fender Stratocaster, but his major passions are called Elisabeth and Charlotte!…
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Kubernetes SIG Storage is responsible for ensuring storage is available for containers in a pod when the pod is scheduled on a node. There is the Container Storage Interface (CSI) for block and file storage that allows storage providers to write CSI drivers. There is also a COSI sub-project that is trying to add object storage support in Kubernetes. In this session, Xing will give an update on some of the features that SIG Storage is working on and discuss what might be coming in the future. Xing Yang is a Tech Lead in the Cloud Native Storage team at VMware. She is a co-chair of the CNCF TAG Storage, a co-chair of the Kubernetes SIG Storage, a co-chair of the Kubernetes Data Protection WG, and a maintainer in Kubernetes CSI. Before joining VMware, Xing was the Lead Architect of OpenSDS while working for Futurewei. She also worked at Dell EMC for many years and has developed deep expertise in storage, data protection, disaster recovery, cloud, and virtualization technologies.…
D
Data on Kubernetes Community

1 What we've learned from running a PostgreSQL managed service on Kubernetes (DoK Day EU 2022) // Oleksii Kliukin 11:06
11:06
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי11:06
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Kubernetes is an emerging platform of choice for deploying and running PostgresSQL. Deploying 100 Postgres clusters is as easy as deploying one, and there is no need to tinker with tools like Ansible or Puppet. Resource sharing can be applied when it makes sense, allowing to run multiple Postgres databases in isolation on a single instance, each storing the data on a dedicated persistent volume. There are great open-source tools out there to deal with high-availability and backups than support or can be easily integrated into the Kubernetes workflow. Monitoring and alerting is easy to implement. People reported success in running Postgres on Kubernetes before. But there are also rough edges, like memory management or certain Postgres maintenance operations, such as installing extensions, that normally cause unnecessary database downtimes on Kubernetes. They are less of a problem for in-house deployments, but may become a deciding factor when running a managed service, competing with other such services running on bare-metal servers or virtual machines that are free of those issues. In this talk, I will share some of our learnings from running a managed PostgreSQL/TimescaleDB service on Kubernetes on AWS for a little more than a year: I’ll start with the motivation of running managed PostgreSQL on Kubernetes, the benefits and drawbacks. I’ll describe the architecture of the managed PostgreSQL cloud on Kubernetes I’ll zoom in on how we solved some of the Kubernetes-specific issues within our cloud, such as upgrading extensions without downtimes, taming the dreaded OOM killer, and doing regular maintenance and PostgreSQL major upgrades. I’ll share how open-source tools from the PostgreSQL ecosystem helps us to run the service and explain how we use them in a slightly non-trivial way. Oleksii has been working with PostgresSQL for almost 20 years, and has been deploying Postgres on Kubernetes since 2016, when his team at Zalando started the internal managed PostgreSQL service based on the in-house and open-source postgresql-operator. Around 2015, with some other team members, he stared working on a PostgreSQL HA project that later became Patroni. Long before that he was hacking PosgreSQL source code to implement binary replication on PostgreSQL 7.x, authoring some PostgreSQL extensions and contributing to the core PostgreSQL itself. He started PosgreSQL meetups in Berlin in 2015 and hopes to get back to meeting in-person somewhere in 2022. Being Ukrainian, he lives in Berlin for a bit more than 9 years with his wife, two children and numerous plants…
D
Data on Kubernetes Community

1 Weathering The Cloud Storm- Modern Data Management Patterns for Reliability and Availability (DoK Day EU 2022) // Denis Magda 10:46
10:46
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי10:46
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) “Zero downtime” and “always-on” are illusions. All systems fail sooner or later, whether it’s a regional e-commerce website or a major cloud region hosting thousands of applications. That’s why, instead of chasing these illusions, it’s worth focusing on the nines of availability. Based on true stories, this session walks you through modern data availability and reliability patterns used by architects whose applications withstood major cloud outages. With the focus on the data storage layer and Kubernetes, you’ll learn: * How to architect the data layer in Kubernetes with the server, zone, and region-level resiliency in mind. * How to find a compromise between latency and availability for multi-region deployments. * How to ensure the data layer remains reliable (i.e., always returns expected data) even during a major incident. Denis Magda has spent half of his career working on distributed systems, applications, and databases. His experience spans from the development of distributed database engines and high-performance applications to training and education on the topic of distributed and cloud computing. Presently, Denis runs the Developer Relations team at Yugabyte and serves a PMC Member for Apache Ignite. He started his professional career at Sun Microsystems and Oracle, where he led one of the Java development groups and worked on technology evangelism efforts.…
D
Data on Kubernetes Community

1 Using Kubernetes to deliver a “serverless” service (DoK Day EU 2022) // Jim Walker 20:21
20:21
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי20:21
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Serverless promises to change the way we consume software. It allows us to potentially pay for only that which we use and can help drive down operational costs to the minimal amount of resources necessary. Architecting for serverless requires a unique look at app logic and the way it is deployed. It takes a combination of the logical and physical worlds. An architectural pattern has emerged where we can scale ephemeral compute separate from services that need to persist. We use Kubernetes to deliver exactly this. A “serverless” experience that is driven and enabled by compute pods and storage pods. We also have used our experience running thousands of database clusters on Kubernetes to automate the operational expertise of managing a distributed database. In this talk, we will take a dive deep into the architecture of our application and share: * A definition and outline of the challenges of serverless * How we reworked our logic for a serverless approach * How we use Kubernetes to gain serverless autoscaling Jim is a recovering developer turned evangelist who loves useful, cool, cutting-edge tech. He loves to translate and distill complex concepts into compelling, more simple explanations that broader communities can consume. He is an advocate of the developer and an active participant in several open source communities.…
D
Data on Kubernetes Community

1 The many uses of Kubernetes cross cluster migration of persistent data (DoK Day EU 2022) // Ryan Kaw 7:39
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Multiple clusters exist in most Kubernetes environments today, and number of clusters will increase overtime. The reasons for having multiple Kubernetes clusters are many, for example, overcoming scale limits, reducing complexity, geo separation, redundancy and having separate production, staging, and development environments. Once you have multiple K8S clusters, it can be useful to have the ability to easily move or duplicate workloads across these different clusters. Kubernetes does not have a native method to allow migration or duplication of workloads across clusters. Fortunately, there are tools that provide this functionality. In this presentation we will explore the different uses cases for cross cluster migration, and what is involved, and how these migration tools work. We'll cover some popular uses cases, such as, Disaster Recovery, Test/Dev, and performance testing. Migration could entail moving the entire cluster, or individual workloads. The components that need to be moved would include configuration and resources stored in etcd, and persistent data residing on PVCs. We'll cover the uses cases and challenges for migration, and run through an example of using one of these migration tools.…
D
Data on Kubernetes Community

1 The future of data on Kubernetes with Adobe and CNCF (DoK Day EU 2022) // Joseph Sandoval, Xing Yang & Sylvain Kalache 17:29
17:29
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי17:29
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Some data-intensive workloads are easier to run in Kubernetes than others. Why? What needs to improve? Join us as we deep dive with Adobe and the CNCF about how easy (or not) it is to run different types of data workloads on Kubernetes – and what is being done both inside and outside of Kubernetes to make data workloads easier. Xing Yang is a Tech Lead in the Cloud Native Storage team at VMware. She is a co-chair of the CNCF TAG Storage, a co-chair of the Kubernetes SIG Storage, a co-chair of the Kubernetes Data Protection WG, and a maintainer in Kubernetes CSI. Before joining VMware, Xing was the Lead Architect of OpenSDS while working for Futurewei. She also worked at Dell EMC for many years and has developed deep expertise in storage, data protection, disaster recovery, cloud, and virtualization technologies. I find myself working at the intersection of OpenSource, Cloud Native software, SRE, and DevOps. My goal is to help bring a pragmatic opinion and shepherd organizations to stay focused on delivering value to their internal and external customers…
D
Data on Kubernetes Community

1 The Data on Kubernetes Landscape (DoK Day EU 2022) // Melissa Logan & Sylvain Kalache 10:25
10:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי10:25
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) We know from the first Data on Kubernetes Report that 90% of respondents believe Kubernetes is ready for stateful workloads, but significant challenges remain. The DoK Community continues to grow and build a unique space where people share knowledge and have conversations that are shaping the next decade of data on Kubernetes. Melissa Logan and Sylvain Kalache will discuss the growth of DoK, DoKC, and introduce a new project to help end users on their journey to running data-intensive workloads on Kubernetes more easily and with better control.…
D
Data on Kubernetes Community

1 Testing the Mettle- Evaluating data solutions for large-scale production to check who stacks up (DoK Day EU 2022) // Dinesh Majrekar 9:26
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) The state of the CNCF Storage options has exploded in the past few years, but if you had to choose a project to use today, how would you go about comparing each offering and choosing who to partner with for your future growth? Civo offers managed Kubernetes clusters powered by K3s to customers around the world. We manage thousands of Virtual Machines and stateful customer data within multiple data centres across several continents. In late 2021, Civo had the opportunity to evaluate the CNCF storage landscape to move to a new technology stack. Learn about the steps required to evaluate the market, testing vendor claims in the real world and what needs to be considered outside of the purely technological aspects of any product selection. In late 2021, Civo's Director of Innovation, Dinesh, spoke about the intricacies of migrating between two vendors (https://www.youtube.com/watch?v=GlUsPnSZI_Y&list=PLHgdNuGxrJt2-xlW_l2q1BAE3e4TsPnQo). Now join him as he talks about the wider project and some of business implications of hosting Data on Kubernetes Dinesh is Director of Innovation at Civo and has been the lead architect for the new Civo Stack Platform. Having worked in the hosting industry for many years, Dinesh has a passion for creating solutions that operate at scale. This not only applies to the technology stack, but for nurturing engineers through their career.…
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) The Rap God project acts as a great entry point to many incoming open-source enthusiasts who are interested in learning about the cloud native ecosystem. The Rap-God project uses Kubernetes orchestration for a stateful case which is an emerging topic, the Rap God project acts as a demonstration of how to use such features of Kubernetes. The project will be using Stateful sets that’d deploy Apache Cassandra (for its first cycle) and eventually it’ll be implementing the same API endpoints for various databases that will be with/on Kubernetes. We in the community intend to do this with PersistenceVolumes and Persistent Volume Claims. Keeping in mind the issues, various developers face, we also will be making options for storage classes. The project will allow the members to explore how they can customize the whole storage class setup according to their setup. The project will be bringing Helm, Cassandra, Kubernetes and Argo under its watch and shall actively expand on its implementation with the further iterations. Abhijith Ganesh is an undergrad computer science major, currently pursuing his Freshman year. His areas of interest include DevOps, Kuberenetes and Open Source Projects. He is an active member of the DoK Community where he is currently an intern. He is also member of the Pyrsia and SeaQL communities.…
D
Data on Kubernetes Community

1 Serverless Event Streaming Applications as Functions on K8 (DoK Day EU 2022) // Timothy Spann 8:43
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) We will walk through how to build serverless event streaming applications as functions running in a function mesh on kubernetes with cloud native messaging via Apache Pulsar. In this talk, you will deploy ML functions to transform real-time data on Kubernets. Tim Spann is a Developer Advocate @ StreamNative where he works with Apache Pulsar, Apache Flink, Apache NiFi, Apache MXNet, TensorFlow, Apache Spark, big data, the IoT, machine learning, and deep learning. Tim has over a decade of experience with the IoT, big data, distributed computing, streaming technologies, and Java programming. Previously, he was a Principal Field Engineer at Cloudera, a Senior Solutions Architect at AirisData and a senior field engineer at Pivotal. He blogs for DZone, where he is the Big Data Zone leader, and runs a popular meetup in Princeton on big data, the IoT, deep learning, streaming, NiFi, the blockchain, and Spark. Tim is a frequent speaker at conferences such as IoT Fusion, Strata, ApacheCon, Data Works Summit Berlin, DataWorks Summit Sydney, and Oracle Code NYC. He holds a BS and MS in computer science. https://www.datainmotion.dev/p/about-me.html https://dzone.com/users/297029/bunkertor.html https://conferences.oreilly.com/strata/strata-ny-2018/public/schedule/speaker/185963…
D
Data on Kubernetes Community

1 Running Kafka on Kubernetes, across three clouds at Adobe (DoK Day EU 2022) // Adi Muraru 16:48
16:48
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי16:48
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Adobe runs dozens of Kafka clusters spread across both public (AWS and Azure) and private clouds to power the Adobe Experience Platform message bus. In this talk, we share the experience of running Kafka as a stateful Kubernetes workload in a cloud-native environment using the open-source Koperator and the tools we built to automate the whole lifecycle. While running a stateful service like Kafka on Kubernetes may be intimidating at the first glance, we share our thought process, the tools, and the results that can make this a reality in any organization. The Kubernetes Operator pattern helped us automate all the operational aspects for the lifecycle of the cluster; abstract away the cloud specifics allowing us to focus on Kafka; achieve increased resilience and elasticity; implement automated Kafka rebalancing using CruiseControl, and harness all the metrics to implement an observable environment. We also plan to demo how these all come together. Join us as we share our team's learnings on: - Considerations for running a stateful service in a multi-tenant Kubernetes cluster: - Pod sizing in terms of memory and CPU - Scheduling - Persistent Volume Claims (PVC) tuning - Resiliency topics - How to handle regular Ethos upgrades and maintenance - How to bring custom monitoring solutions - Advanced network topics such as using dedicated VIPs If you or your team ever wanted to learn how to run your own stateful service in Kubernetes and/or see what's possible, come join us! Computer Science Ph.D. and Principal Scientist at Adobe having experience in complex, large scale distributed systems. My focus currently is on designing and building the message streaming platform for Adobe Experience Platform using Apache Kafka. The endeavor is unique as it spans both Amazon AWS and Microsoft Azure clouds and handles billions of messages a day.…
D
Data on Kubernetes Community

1 Running a database on local NVMes on Kubernetes (DoK Day EU 2022) // Tomáš Nožička & Maciej Zimnoch 9:42
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Running a database on Kubernetes with persistent storage is relatively easy but when it comes to performance it won’t match local NVMes. This talk will show you how to set up the local NVMes for Kubernetes, how to handle the application and cluster lifecycle in a safe manner and share our experience with running ScyllaDB with local NVMes on different Kubernetes cloud providers. Tomas leads the development of Scylla Operator (https://github.com/scylladb/scylla-operator), a Kubernetes operator to manage ScyllaDB. Previously, he worked on a self-hosted, auto-upgrading Kubernetes control plane for RedHat OpenShift. Tomas is an Emeritus Kubernetes SIG-Apps approver. Maciej is a Go and C++ enthusiast. He is a software engineer working on ScyllaDB management tools. Previously he worked in network companies where he delivered multiple features to SDN solutions and LTE networks.…
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Redis is a widely used open-source in-memory data store and cache that has become a key component in the development of scalable microservice systems. While all of the main cloud providers provide fully managed Redis services (Amazon ElastiCache, Azure Cache for Redis, and GCP Memorystore), it may also be simply implemented in Kubernetes if you require additional control over the Redis configurations. This session will lead you through setting up a Redis cluster on Kubernetes with data persistence. The lesson would concentrate on establishing a "prod"-ready Redis cluster that considers storage and persistence even after restarts and discusses recommended practices and considerations. With the help of monitoring and hardware optimizations for your cluster, you'll learn about measuring your performance and best practices for Redis on Kubernetes. We would conclude whether operators or Helm chart will be a more mature option for your Data on Kubernetes needs with Redis and a few tips as a production checklist. Ryan Gray is a Technical Consultant with Portainer.io. Before that he spent a couple of decades building ISPs, carrier Ethernet networks, data centers, and software. He is currently obsessed with container orchestration, GitOps, edge computing, and why naming things is so hard. Hrittik is a Community Advocate at Portainer and a student in his sophomore year. Hrittik focuses on building tech communities and software with the help of cloud native technologies. He has been a speaker at conferences like the Cloud Summit and conducted various events that have collectively empowered students and professionals on DevOps and Cloud.…
D
Data on Kubernetes Community

1 PV TrashCan - Protection against accidental deletion of PVs or Namespaces (DoK Day EU 2022) // Veda Talakad, Aditya Kulkarni & Aditya Dani 11:07
11:07
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי11:07
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Accidental PVC delete or namespace delete can cause the Persistent Volume to get deleted. Such volumes lose their data and the stateful applications lose their state. By the use of Persistent Volume TrashCan, users can get a grace period to undo such unintended delete operation. The deleted Persistent Volumes are staged for delayed deletes. They continue to live even after being deleted from k8 perspective, for a configurable time(retention period) and based on the system’s usage. The storage class of the PVC can dictate if they need to be staged for a delayed delete. StorageClass can also allow for configurable retention period. To recover a deleted PersistentVolume, users can create a new namespace with the same name and reapply the original PVC spec. The PVC will reference a special StorageClass to indicate that the new PersistentVolume needs to be restored from the TrashCan. This will allow the application to restart with the right state and data. This talk will showcase how to overcome one of the admin’s pain point seen in field involving accidental deletions of PVCs by using advanced storage management solutions in Kubernetes. Veda Talakad is a Software professional with BS in electronics and communications mostly working in storage domain. Some of the professional areas of interests include scalability in distributed environment and cloud-native technologies for data management, Aditya Kulkarni: I have 10 years of experiance in Enterprise Data management. I have worked on All Flash Array at Netapp from the device driver layer to WAFL Filesystem. Next, I worked at Portworx where I developed and enhanced the number one data platform for Kubernetes. Then I moved to Pure Storage as part of Portworx acquisition. Aditya Dani is an architect at Portworx by PureStorage, that enables users to run any cloud-native data service, in any cloud, using any Kubernetes platform. He is one of the contributors to the open source project Stork that provides storage awareness to the Kubernetes scheduler. Prior to Portworx, Aditya worked for Amazon on their Music Recommendations Engine. He enjoys working on distributed systems and loves programming in Go.…
D
Data on Kubernetes Community

1 Protecting data with CSI Volume Snapshots on Kubernetes (DoK Day EU 2022) // Grant Griffiths 11:10
11:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי11:10
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) The container storage interface (CSI) is a contract between different container orchestrators (Kubernetes, Nomad, etc) and storage plugins. This contract is a set of gRPC services for provisioning, utilizing, and snapshotting storage volumes. In this talk, we will focus on one aspect of the CSI spec: Volume Snapshots. The talk will cover volume snapshots on Kubernetes CSI, with a deep dive on Kubernetes CSI snapshots. A short demo on the basics of the CSI snapshots on Kubernetes will help show how easy it is to start using CSI snapshots. After covering the basics on both container schedulers, we’ll deep dive into how backup systems can programmatically interact with the Kubernetes CSI snapshots client. This example will be based on Stork, an open-source backup system that has implemented the CSI spec. I believe this talk will be beneficial to the community because it covers some storage-related topics that have not been covered much in previous conferences. I’m hoping to bring the unique perspective of a Kubernetes CSI and Nomad CSI contributor, while also covering the CSI plugin maintainer and end-user perspective. With this wide audience, many conference attendees will be able to learn about stateful applications, CSI volume snapshots, and developing CSI drivers for any container scheduler. Grant Griffiths is a core contributor and reviewer for the Kubernetes CSI and SIG Storage communities working at Portworx. For kubernetes-CSI, he helped bring the external-snapshotter to Beta in Kubernetes 1.17 and GA in Kubernetes 1.20, improved support for secrets, release tooling, and fixed various snapshot controller bugs. At Portworx, he works on control plane components such as the CSI Driver, SDK, CLI, and security. Previously, Grant was a Senior Software Engineer at GE Digital where he built Data Services in Go on k8s. He has given talks at various community conferences: Two talks at GopherCon London 2018 https://youtu.be/gm34Aph9v0M https://youtu.be/MAopsd6kdY0 KubeCon Europe 2020: https://youtu.be/Yt5wpZ6raDA HashiTalks: Deploy 2021: https://youtu.be/PXVatM1xZxg…
D
Data on Kubernetes Community

1 Operator Lifecycle Management (DoK Day EU 2022) // Julian Fischer 15:21
15:21
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי15:21
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) The ability to extend Kubernetes with Custom Resource Definitions and respective controllers has led to the OperatorSDK, which became the de facto standard for data service automation on Kubernetes. There are countless operator implementations available, and new operators are being released on a daily basis. Organizations managing hundreds of Kubernetes clusters for dozens of developer teams are also challenged to manage the lifecycle of hundreds of Kubernetes operators. The goal is to keep the operational overhead to a minimum. In this talk, a closer look into the lifecycle of operators will be presented. With an understanding of how operators evolve, it becomes clear what challenges during operator upgrades. A brief overview of lifecycle management tools such as Helm, OLM, and Carvel is presented in this context. In particular, it will be discussed whether these tools can help, which restrictions apply and where further development would be desirable. At the end of this talk, you will know what operator lifecycle management is about, what its challenges are, and which tools may be used to reduce operational friction. Julian Fischer, CEO of anynines, has dedicated his career to the automation of software operations. In more than fifteen years, he has built several application platforms. He has been using Kubernetes, Cloud Foundry, and BOSH in recent years. Within platform automation, Julian has a strong focus on data service automation at scale.…
D
Data on Kubernetes Community

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) FoundationDB is an open-source distributed transactional Key-Value store that is used by multiple companies like Apple, Snowflake and VMWare Tanzu (previously Wavefront). This talk will cover the design of the FoundationDB operator and lessons learned from operating FoundationDB on Kubernetes. We will discuss some of the missing pieces in Kubernetes to make it easier to operate FoundationDB on top of it and how we solved those challenges in the operator. We will focus on the pieces of the FoundationDB operator that are different to most other operators and why we decided to implement those pieces like they are. We will also discuss how to run an high available FoundationDB cluster on top of Kubernetes and what different choices a user has. We will also cover some challenges that arise when running stateful services at scale on top of Kubernetes and how they can be managed. At the end of this talk we will give an outlook for future design changes and planned features in our operator. The main take-away from this talk is to understand how to run and operate FoundationDB on Kubernetes. Johannes started his journey in the Kubernetes eco-system in early 2015, onboarding projects and applications onto Kubernetes. Since 2020 Johannes works as an SRE for FoundationDB at Apple and is co-leading the development of the open source FoundationDB operator.…
D
Data on Kubernetes Community

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) It is still challenging to run Apache Spark and other big data processing workload on Kubernetes, especially in large scale. People need to address various issues like resource isolation, queuing, and cost efficiency. This session will share details about those challenges and how to address them. We will also present a convenient (one-click) way to deploy Apache Spark on Kubernetes, and dramatically lower the barrier to use Spark. Bo has been working in big data area for 10 years, and gained intensive experience of running Apache Spark in large scale while he worked in in Uber. In the Kubernetes era, he focuses on building world class Apache Spark Service, and making the tooling available to the Spark and Kubernetes communities.…
D
Data on Kubernetes Community

1 Microservices and Kubernetes for your Full Data Lifecycle (DoK Day EU 2022) // Steve Pousty 14:26
14:26
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי14:26
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Data doesn’t magically appear in our data centers. There are usually several phases and several storage locations along its journey throughout your organization. New architectural patterns, such as microservices, and new technology, such as Kubernetes are changing how we can think about and manage the large volumes of data coming at us. In this talk we will begin by quickly introducing the architecture and technology and how they make our lives better. From there it’s live demo time combining Java microservices, a processing caching service, a messaging layer, and a relational database all running in Kubernetes . This application will be handling frequently updated data, generating alerts on specific data events, and simultaneously populating a system of record. Come for the discussion, hands on demo, and witty banter! Steve is a Dad, Partner, Son, and Senior Developer Readiness Engineer for VMware Tanzu. In addition to showing off all the great work of the Tanzu team, he helps drive Tanzu developer experience. He can teach you about Containers, Kubernetes, Data Analysis, Java, Python, PostgreSQL, Microservices, and some JavaScript. He has deep subject area expertise in GIS/Spatial, Statistics, and Ecology. Before Tanzu, Steve was a developer Advocate for Crunchy Data, DigitalGlobe, Red Hat, LinkedIn, deCarta, and ESRI. Steve has a Ph.D. in Ecology and can easily be bribed with offers of bird watching or fly fishing. He has spoken at hundreds of conferences and done over 100 workshops including Monktoberfest, Red Hat Summit, MongoNY, JavaOne, FOSS4G, ODSC, AjaxWorld, GeoWeb, Where2.0, and OSCON.…
D
Data on Kubernetes Community

1 Leveraging Running Stateful Workloads on Kubernetes for the Benefit of Developers (DoK Day EU 2022) // Arsh Sharma, Lapo Elisacci & Ramiro Berrelleza 14:02
14:02
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי14:02
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Kubernetes comes with a lot of useful features like Volumes and StatefulSets, which make running stateful workloads simple. Interestingly, when combined with the right tools, these features can make Kubernetes very valuable for developers wanting to run massive production databases in development! This is exactly what was seen at "Extendi". The developers at Extendi deal with a large amount of data in their production Kubernetes clusters. But when developing locally, they didn't have an easy way of replicating this data. This replication was needed because it allowed developers to test new features instantaneously without worrying if they would work as expected when pushed to production. But replicating a 100Gb+ production database for development wasn't turning out to be an easy task! This is where leveraging Kubernetes + remote development environments came to the rescue. Running data on Kubernetes turned out to be way faster than any of the traditional approaches because of Kubernetes' ability to handle stateful workloads exceptionally well. And since Extendi already used Kubernetes in production - the setup process was fairly simple. This talk will cover practical steps on how leveraging Kubernetes based development environments allowed dev teams at Extendi to run production data on Kubernetes during development using features like Volume Snapshots, having a huge positive impact on developer productivity. Arsh is a Developer Experience Engineer at Okteto. He is an active contributor to the upstream Kubernetes project and was awarded the Kubernetes Contributor Award for his contributions in 2021. Arsh has written blogs and spoken about different topics in the cloud-native ecosystem at various conferences before, including KubeCon + CloudNativeCon + Open Source Summit China 2021. He has also been on the Kubernetes Release Team since the 1.23 release. He also serves as the New Contributor Ambassador for the Documentation Special Interest Group of the Kubernetes project and continuously mentors new folks in the community. Previously, he worked at VMware and was an active contributor to other CNCF projects, including cert-manager and Kyverno. Lapo is a Software Engineer currently leading the development team of a Social Listening and Audience Intelligence platform. He started coding at the early age of 14 and since he turned his passion into a real job, he has always been looking for boosting his knowledge by constantly researching for newer and newer technologies. Active on Ruby Open Source projects Ramiro Berrelleza is one of the founders of Okteto. He has spent most of his career (and his free time) building cloud services and developer tools. Before starting Okteto, Ramiro was an Architect at Atlassian and a Software Engineer at Microsoft Azure. Originally from Mexico, he currently lives in San Francisco.…
D
Data on Kubernetes Community

1 Kanister & Kopia - An Open-Source Data Protection Match Made in Heaven (DoK Day EU 2022) // Pavan Navarathna 13:38
13:38
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי13:38
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Cloud-native applications comprise various components, including data services, storage systems, and related Kubernetes objects. Each component requires its own data protection tools, strategy, and domain expertise. A robust solution aligned with business requirements often involves complex workflows. What if there was a way to coordinate the implementation of these workflows while optimizing how backups are moved into storage? During this talk, Pavan will demonstrate how two open-source tools, Kanister and Kopia, work together to optimize backup and recovery for Kubernetes applications. - Kanister allows domain experts to capture application-specific workflows in Blueprints to extend and share. - Kopia uses state-of-the-art encryption algorithms (AES-256 or ChaCha20) and compresses data to save bandwidth and storage. This talk is for anyone interested in running stateful workloads in production on Kubernetes. Attendees will leave armed with a streamlined way to protect Kubernetes applications. Pavan joined Kasten by Veeam in March 2018, where he manages a team of cloud-native engineers developing creative solutions for managing and protecting stateful application data in Kubernetes. Prior to Kasten, he was a part of NetApp’s Cloud Data Services Group, where he worked on the data protection stack for a cloud-integrated backup appliance. Pavan is an active contributor to open-source projects including Kanister, a framework for application-level data management on Kubernetes; and Kopia, a fast and secure backup tool. He holds a Master's degree in Computer Science from the University of Florida.…
D
Data on Kubernetes Community

1 Is your database in Kubernetes production ready (DoK Day EU 2022) // Mykola Marzhan 15:21
15:21
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי15:21
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) It only looks simple to run databases in Kubernetes. In fact, it is too many things needed to be considered before running any database in Kubernetes. Failover and traffic switching, replication and data consistency/loss after failover, upgrades, DB and node-level configuration, CNI, backups, monitoring, etc. After this talk, you will have a complete list of questions that should be checked before running a database in production. Mykola has been shorting “Time-to-market” in software companies for more than ten years. Most of his career he has been focused on the *development* of monitoring, update and deployment systems.…
D
Data on Kubernetes Community

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) How can you keep your data secure and how can you transfer them on a secure way? You will learn to encrypt your data, that you can use them Kubernetes based in a Multi Cloud environment. Sarah Julia Kriesch is working as a DevOps Consultant for Accenture in Germany. She has studied Computer Science at the TH Nürnberg and worked as a Linux System Administrator/ System Engineer for an ISP and in the area of Cloud Computing before. She is also contributing to Open Source since around 10 years.…
D
Data on Kubernetes Community

1 Growing up fast - Kubernetes and Real-Time Analytic Applications (DoK Day EU 2022) // Robert Hodges 15:30
15:30
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי15:30
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Kubernetes is turning into a preferred platform for real-time analytic app that crunch billions of events per day and return insights in seconds. In this talk we'll introduce the standard analytic app design pattern of fast event streams coupled with low-latency data warehouses, using open source projects. We'll then walk through deploying the pipeline on Kubernetes from ingest to end user access. We'll touch on use of operators, scaling, monitoring, upgrade, security, and approaches to adding custom components. Attendees can expect to leave with concrete lessons about how to stand up low-latency analytics quickly on Kubernetes. Robert Hodges is CEO of Altinity, an enterprise provider for ClickHouse data warehouse. He's also a database geek with experience on at least 20 DBMS types. Robert caught the Kubernetes bug at VMware in 2018.…
D
Data on Kubernetes Community

1 Graph in Kubernetes Panel (DoK Day EU 2022) // Wey Gu, Cheukting Ho & Feynman Zhou 20:15
20:15
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי20:15
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Graph databases are the fastest growing data store in the world. According to Gartner, the application of graph processing and graph DBMSs will grow at 100 percent annually through 2022 to continuously accelerate data preparation and enable more complex and adaptive data science. However, it is often difficult for data and analytics professionals to distinguish between different implementation models, and to fit them to their use case. This panel will speak directly to Kubetnetes users and provide them with the context they need to run stateful workloads. Topics for discussion would be: - graph databases on Kubenetes challenges, lessons learnt, and patterns for deployment - graph uses case on k8s, including scaling knowledge graphs - K8s makes graph better - technical patterns to use graph databases to get the maximum out of k8s - the panels’ learnings from running graph data on Kubernetes…
D
Data on Kubernetes Community

1 From Laptop to Cloud. Developing Cloud-Native Applications with Containerized Databases (DoK Day EU 2022) - Nic Vermandé 17:16
17:16
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי17:16
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) With the advent of microservices in Kubernetes, individual developer teams now manage their own data, middleware, and databases. Automated tests and CI/CD pipelines have to be revisited to include these new requirements. This session will discuss and demonstrate how to use Kustomize and Tekton to provide Kube-Native automated workflows taking into account new parameters such as database operators, StorageClass and PVC. In this talk, Nic is also going to provide some insights on how to optimize Tekton to work with multiple workspaces and overcome some of the affinity limitations. The demonstration will focus on building a comics cards web application using a flask-based frontend and leveraging postgreSQL as the database. It will cover the automation of multiple lifecycle stages: - local laptop testing automation on K3S with Kustomize - production deployment using Flux, Tekton and the Zalando PostreSQL operator Nicolas is an experienced hands-on technologist, evangelist, and product owner who has been working in the fields of Cloud-Native technologies, Open Source Software, Virtualization, and Datacenter networking for the past 17 years. Passionate about enabling users and building cool tech solving real-life problems, you'll often see him speaking at global tech conferences and online events, spreading the word, and walking the walk with customers and users.…
D
Data on Kubernetes Community

1 Disaggregated Container Attached Storage - Yet Another Topology with What Purpose (DoK Day EU 2022) // Nick Connolly 9:32
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) The storage topology in vogue seems to cycle every few years. Internal storage is followed by centralized Storage Area Networks only to be superseded by one-size-fits-all Hyperconverged models - until scalability constraints led to distributed storage. Then comes NVMe, offering blistering speeds that all of these storage stacks struggle with. Kubernetes inspires Container Attached Storage aspiring to be the perfect model, so why is disaggregated storage now making an appearance? This talk considers the motivations behind yet another storage topology and examines a modern, flexible architecture for delivering high-performance storage under Kubernetes. Nick Connolly is a pioneer of storage virtualisation and the Chief Scientist at DataCore, where his background in real-time computing and multiprocessing led to the creation of a world-class high-performance storage stack on Windows. He holds patents ranging from highly scalable algorithms through to data protection techniques. Recently he has been working with OpenEBS to bring the power and performance of NVMe to Kubernetes.…
D
Data on Kubernetes Community

1 Datashim - a framework for declarative management of datasets on Kubernetes (DoK Day EU 2022) // Srikumar Venugopal 15:36
15:36
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי15:36
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Many ML pipelines depend on shared filesystems for input, output and intermediate data storage. Standards such as CSI have made it possible for applications in Kubernetes to access a variety of data storage systems. Yet, data scientists still have to deal with low-level details of data access in order to execute their pipelines in Kubernetes. Datashim is a framework that manages the lifecycle of a Dataset object, a CustomResourceDefinition that represents a source of data. Datashim takes care of the details of data access while Kubernetes pods can declaratively access the data by referencing a Dataset in their specifications. This talk will describe Datashim and the Dataset object, discuss its use in ML pipelines, and demonstrate how its pluggable architecture is designed for the development of caching, scheduling and governance plugins. Datashim is an incubating project of the Linux Foundation Data and AI Foundation Srikumar Venugopal is a Research Scientist in IBM Research Europe in Dublin, Ireland. His research interests lie in the area of cloud computing and large-scale distributed systems, specifically in the topics of middleware, resource management, and scalability. He is the co-founder and current lead for the Datashim project.…
D
Data on Kubernetes Community

1 Build your own social media analytics with Apache Kafka (DoK Day EU 2022) // Jakub Scholz 10:22
10:22
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי10:22
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Apache Kafka is more than just a messaging broker. It has a rich ecosystem of different components. There are connectors for importing and exporting data, different stream processing libraries, schema registries and a lot more. This talk will show how to use it to read data from social networks such as Twitter, process them and use machine learning to analyze them. And all of it will be of course running on top of Kubernetes. Jakub works at Red Hat as Senior Principal Software Engineer. He has long-term experience with messaging and currently focuses mainly on Apache Kafka and its integration with Kubernetes. He is one of the maintainers of the Strimzi project which provides tooling for running Apache Kafka on Kubernetes. Before joining Red Hat he worked as messaging and solution architect in the financial industry.…
D
Data on Kubernetes Community

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) What does Kubernetes provide that allows us to reduce the complexity of Apache Cassandra while making it better suited for cloud native deployments? That was the question we started with as we began a mission to bring Cassandra closer to Kubernetes and eliminate the redundancy. Many great open source databases have been adapted to run on Kubernetes, without relying on the deep ecosystem of projects that it takes to run in Kubernetes(there is a difference). This talk will discuss the design and implementation of the Astra Serverless Database which re-architected Apache Cassandra to run only on Kubernetes infrastructure. Built to be optimized for multi-tenancy and auto-scaling, we set out with a design goal to completely separate compute and storage. Decoupling different aspects of Cassandra into scaleable services and relying on the benefits of Kubernetes and it's ecosystem created a simpler more powerful database service than a stand alone, bare-metal Cassandra cluster. The entire system is now built on Apache Cassandra, Stargate, Etcd, Prometheus, and object-storage like Minio or Ceph. In this talk we will discuss the downstream changes coming to several open source projects based on the work we have done. Jake is a lead developer and software architect at DataStax with over 20 years of experience in the areas of distributed systems, finance, and manufacturing. He is a member of the Apache Foundation and is on the project committee of the Apache Cassandra, Arrow, and Thrift projects. Jake has a reputation for developing creative solutions to solve difficult problems and fostering a culture of trust and innovation. He believes the best software is built by small diverse teams who are encouraged to think freely. Jake received his B.S. in Computer Science from Lehigh University along with a minor in Cognitive Science.…
D
Data on Kubernetes Community

1 Autoscaling Stateful Workloads in Kubernetes (DoK Day EU 2022) // Mohammad Fahim Abrar & Md. Kamol Hasan 10:14
10:14
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי10:14
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Managing stateful workloads in a containerized environment has always been a concern. However, as Kubernetes developed, the whole community worked hard to bring stateful workloads to meet the needs of their enterprise users. As a result, Kubernetes introduced StatefulSets which supports stateful workloads since Kubernetes version 1.9. Users of Kubernetes now can use stateful applications like databases, AI workloads, and big data. Kubernetes support for stateful workloads comes in the form of StatefulSets. And as we all know, Kubernetes lets us automate many administration tasks along with provisioning and scaling. Rather than manually allocating resources, we can generate automated procedures that save time, it lets us respond faster when peaks in demand, and reduce costs by scaling this down when resources are not required. So, it’s really important to capture autoscaling in terms of stateful workloads in Kubernetes for better fault tolerance, high availability, and cost management. There are still a few challenges regarding Autoscaling Stateful Workloads in Kubernetes. They are related to horizontal/vertical scaling and automating the scaling process. In Horizontal Scaling when we are scaling up the workloads, we need to make sure that the infant workloads join the existing workloads in terms of collaboration, integration, load-sharing, etc. And make sure that no data is lost, also the ongoing tasks have to be completed/transferred/aborted while scaling down the workloads. If the workloads are in primary-standby architecture, we need to make sure that scale-up or scale-down happens on standby workloads first, so that the failovers are minimized. While scaling down some workloads, we also need to ensure that the targeted workloads are excluded from the voting to prevent quorum loss. Similarly, while scaling up some workloads, we need to ensure that new workloads join the voting. When new resources are required, we have to make the tradeoff between vertical scaling and horizontal scaling. And when it comes to Automation, we have to determine how to generate resource (CPU/memory) recommendations for the workloads. Also, when to trigger the autoscaling? Let’s say, a group of workloads may need to be autoscaled together. For example, In sharded databases, each shard is represented by one StatefulSet. But, all the shards are treated similarly by the database operator. Each shard may have its own recommendations. So, we have to find a way to scale them with the same recommendations. Also, we need to determine what happens when an autoscaling operation fails and what will happen to the future recommendations after the failure? There can be some workloads that may need a managed restart. For example, in a database, secondary nodes may need to be restarted before the primary. In this case, how to do a managed restart while autoscaling? Also, we need to figure out what happens when the workloads are going through maintenance? We will try to answer some of those questions throughout our session. ----- Fahim is a Software Engineer, working at AppsCode Inc. He has been involved with Kubernetes project since 2018 and is very enthusiastic about Kubernetes and open source in general. ----- MD Kamol Hasan is a Professional Software Developer with expertise in Kubernetes and backend development in Go. One of the lead engineers of KubeDB and KubeVault projects. Competitive contest programmer participated in different national and international programming contests including ACM ICPC, NCPC, etc…
D
Data on Kubernetes Community

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 ( https://youtu.be/Xi-h4XNd5tE ) Come here about our experience scaling Cassandra on EKS to over 1000 nodes and 20 million transactions per second. This session will cover the lessons learned, successes, failures, and tools used to get there. Usability is Matt’s mission. He has worked with Federal, Fortune 500, and small businesses to help collect, mine and interact with data. When solving a problem, Mr. Overstreet synthesizes experience from a liberal arts and technical background. Matt has previously presented community webinars for DataStax and spoken at the search focused Haystack conference.…
D
Data on Kubernetes Community

1 Dok Talks #131 - How to win friends and influence businesses // Fabian Met 1:00:48
1:00:48
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:00:48
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK In this talk I share my personal experience where when I was working for a client the company had a hard time innovating and did not have a lot of faith in their software/infrastructure engineers. From a 'distance' I saw a lot of potential where they already had knowledgable engineers but lacked the experience to get started as a company but also as individual engineers. I created a community at that client, filled with their engineers. It was open to everyone who wanted to learn and we saw everyone's input as equal. Because of that safe space the company now runs almost all their IT in the cloud in a way that made both the engineers and the managers happy. All simply by just being a nice guy! BIO Hi my name is Fabian Met and I have been working in the cloud native space for the past couple of years. I love to learn new tech and do so almost every day! When I learn something new I then always use that knowledge to teach people how tech that might seem difficult at first can actually be quite manageable! Making sure that people do not experience these huge hills they sometimes face trying to learn something new is what I love to do! My role as Techfluencer means that I will help companies and people alike to see the benefit and required investment to start innovating and using new technology. I don't do this as an authoritative figure, instead I love to talk to people and hear their stories and challenges. Listening to them and talking to them usually leads to insight for both parties and possible solutions that previously might not have seen feasible! KEY TAKE-AWAYS FROM THE TALK Even though you might not be in a position of power, I will show you that no matter who you are, be that a manager, engineer or even the janitor you can help make positive change in your company!…
D
Data on Kubernetes Community

1 Dok Talks #130- Leaning on Kubernetes Portability to Manage Databases Anywhere // Robert Hodges 1:04:45
1:04:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:04:45
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK What if databases only ran in a single place? That would be useless. But it's what we get with most database-as-a-service offerings in the public cloud. Kubernetes has the potential to change the game and let users run complex database anywhere they want. This talk tells the story of how we built a portable ClickHouse cloud service on Kubernetes. It runs on AWS and GCP, of course. Even better, we're working on managing databases in *any* Kubernetes cluster, even those behind firewalls and NAT gateways. We can pull it off because Kubernetes is truly portable. We'll share what we've learned about using Kubernetes portability create a new breed of managed database. We'll speculate about what this portends for the future of database management. BIO Robert Hodges has worked with over 20 different DBMS types starting with M204 back in 1983. His current favorite is ClickHouse, a popular open source data warehouse. He caught the Kubernetes bug at VMware in 2018 and has been working on it ever since. In his day job he runs Altinity, an enterprise software and services provider for ClickHouse data warehouses. KEY TAKE-AWAYS FROM THE TALK Kubernetes is a great data platform even for complex, stateful systems like data warehouses. This talk shows how Kubernetes' portability is not just a convenience. It is a powerful tool that allows users to locate data anywhere it's convenient but still have a SaaS user experience for administrators.…
D
Data on Kubernetes Community

1 Dok Talks #129 - Databases Operations and the Cloud // Barak Nissim 51:20
51:20
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי51:20
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK This session walks through the basics of how data is represented in Kubernetes, from the grounds up and explores how databases - as a top consumers of data in this space - are handling multi-node and multi-cluster operations. we will review how operators help and why having a data layer is important when building a kubernetes-native application. BIO With over 15 years of IT experience from core datacenter infrastructure to advanced cloud technologies, Barak leads ionir’s product and solutions portfolio, engaging with customers, partners, and acts as kubernetes-native data advocate. KEY TAKE-AWAYS FROM THE TALK - Running Data on Kubernetes is fun and easy - Running business critical workloads as Databases require a data layer to improve availability, scale and bursting. - Multi cloud with Data is important for bursting and migrations…
D
Data on Kubernetes Community

1 Dok Talks #126- Automatically Instrument Kubernetes Apps with OpenTelemetry // James Blackwood-Sewell 1:03:40
1:03:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:03:40
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK The rise of Kubernetes has triggered an exponential growth of metric and trace data. This talk explores capturing and persisting telemetry data from Kubernetes apps without any specific code changes using the OpenTelemetry Operator. BIO James Blackwood-Sewell is currently a Principal Developer Advocate at TimescaleDB in Sydney. Before that he meandered through the PostgreSQL, Kubernetes and Prometheus landscapes as the Chief Architect at Jirotech. KEY TAKE-AWAYS FROM THE TALK Storing observability data from Kubernetes on Kubernetes doesn't need to be hard - sometimes you don't even need to change your code to do it!…
D
Data on Kubernetes Community

1 Dok Talks #128- Getting Started with the Kubernetes Secrets Store CSI Driver // Kim Schlesinger 53:10
53:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי53:10
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK In Kubernetes, it can be difficult to keep application API keys, access tokens and passwords safe. There are several different approaches to solving this problem, and in this talk Kim will demonstrate how to install Hashicorp Vault and the Secrets Store CSI Driver so that your applications can access secrets stored in ephemeral volumes. BIO Kim is technologist and educator, and she currently works as a Developer Advocate at DigitalOcean. Kim lives in Denver, Colorado in the USA. KEY TAKE-AWAYS FROM THE TALK How the Kubernetes Secrets Store CSI Driver with Vault uses Kubernetes volumes…
D
Data on Kubernetes Community

1 Dok Talks #127 - Flux for Helm Users! // Scott Rigby 1:21:35
1:21:35
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:21:35
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Welcome Helm users! CNCF Flux has a best-in-class way to use Helm according to GitOps principles. For you, that means improved security, reliability, and velocity - no more being on the pager on the weekends or having painful troubleshooting or rollback when things go wrong. Built on Kubernetes controller-runtime, Flux’s Helm Controller is an example of a mature software agent that uses Helm’s SDK to full effect. Flux’s biggest addition to Helm is a structured declaration layer for your releases that automatically gets reconciled to your cluster based on your configured rules: ⭐️ The Helm client commands let you imperatively do things ⭐️ Flux Helm Custom Resources let you declare what you want the Helm SDK to do automatically. In addition, Scott will show how to use Helm Charts to run reliable stateful workloads. If you want to follow along with Scott’s demo, here are a couple of resources to help you prepare ahead of time: 📄 Flux for Helm Users Docs: https://fluxcd.io/docs/use-cases/helm/ 📄 Flux Guide: Manage Helm Releases: https://fluxcd.io/docs/guides/helmrel... BIO Scott is a Brooklyn based interdisciplinary artist and Developer Advocate at Weaveworks. He co-founded the Basekamp art and research group in 1998 and the massively collaborative Plausible Artworlds international network. In technology he enjoys helping develop open source software that anyone can use, most recently projects in the cloud native landscape including co-maintaining Helm and Flux. In daily decisions, large or small, he tries to help make the world a better place for everyone. KEY TAKE-AWAYS FROM THE TALK Scott will take you on a tour of Flux’s Helm Controller, share the additional benefits Flux adds to Helm and then walk through a live demo of how to manage helm releases using Flux.…
D
Data on Kubernetes Community

1 Dok Talks #125- Mission and Vision of the Rap-God-Project // Abhijith Ganesh 18:55
18:55
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי18:55
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Explanation of how the how-to-dok project evolved into rap-god-api BIO An aspiring tech enthusiast who's massively into Data Science, Kubernetes and Operations(DevOps)
D
Data on Kubernetes Community

1 Dok Talks #124 - Intro to Druid on Kubernetes // Sergio Ferragut 54:40
54:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי54:40
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK This talk will provide a high-level overview of Kubernetes, Helm charts and how they can be used to deploy Apache Druid clusters of any size. We'll review how Kubernetes functionality enables resilience and self-healing, historical tiers through node group affinity, middle manager scaling through Kubernetes autoscaling to optimize ingestion capacity and some of the gotchas along the way. BIO Sergio Ferragut is a database veteran turned Developer Advocate at Imply. His experience includes 16 years at Teradata in professional services and engineering roles. He has direct experience in building analytics applications spanning the retail, supply chain, pricing optimization and IoT spaces. Sergio has worked at multiple technology start-ups including APL and Splice Machine where he helped guide product design and field messaging.…
D
Data on Kubernetes Community

1 Dok Talks #123 - Can Data Become a Declarative Resource? // Roey Libfeld, Michael Greenberg & Uri Zaidenwerg 1:07:10
1:07:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:07:10
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Most K8s users find stateful K8s deployments challenging, to say the least, when persistent data is involved the declarative, portable, self-healing nature that we are accustomed to in stateless deployments is replaced with complexity, resiliency issues and portability restrictions. While K8s is a standard, portable between environments, data is restricted to its physical location and its provider. In this session, we will review the challenges facing stateful K8s, and how to free data to enjoy the same level of portability as the application layer. KEY TAKE-AWAYS FROM THE TALK To assure stateful deployments enjoy the same declarative nature we came to love in stateless deployments, we the K8s users should adopt technologies that do not restrict data to a physical location or to a specific public cloud vendor, designing portable agnostic K8s apps should be…
D
Data on Kubernetes Community

1 Dok Talks #122 - Operationalizing a Data Infrastructure Stack on Kubernetes 36:35
36:35
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי36:35
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Kubernetes is massively powerful, but there are still a large number of details that are needed to get right before really leveraging to build a robust data infrastructure. I'll dive into the complexities we've uncovered at Plural and how we're solving them, from cloud configuration to observability to managing upgrade lifecycles. BIO Career backend engineer turned tech founder…
D
Data on Kubernetes Community

1 Dok Student Sessions - Contributing to Cloud Native Glossary // Kunal Verma 38:05
38:05
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי38:05
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK In this session, we'll be talking about a new open source project in the CNCF community i.e. the Cloud Native Glossary. The main aim of the glossary is to be the centre-point for all the terms related to the cloud-native ecosystem & we are actively looking for awesome contributors from the community. I'll be talking about how one can get involved & start their contributions in this initiative. If you are someone wanting to get involved with the CNCF community, this is your chance!!! BIO I'm Kunal Verma, a sophomore pursuing Bachelor's in IT from India. I am a DevOps enthusiast and also passionate about Open-source! Currently, I am exploring the DevOps and cloud-native ecosystem and contributing to the Kubernetes, Data On Kubernetes Community & the Layer5 community. Really like to interact with folks and be a part of awesome communities and learn more and more!…
D
Data on Kubernetes Community

1 Dok Talks #121 - Running Stateful Apps in Kubernetes Made Simple // Steve Buchanan 1:00:40
1:00:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:00:40
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Eventually the time will come to run a stateful app in Kubernetes. This can be a scary thing adding more moving parts to a Kubernetes cluster and deploying as well as managing your app on Kubernetes when it requires state. In this talk Steve Buchanan will take you through a journey of understanding how storage works in Kubernetes, how to Persistent state with pods, what storage options are available with Azure Kubernetes Service, best practices, and a demo of deploying a stateful app to AKS. BIO Steve Buchanan is a Principal Program Manager with a leading global tech giant focused on improving the cloud. He is a Pluralsight author, the author of eight technical books, and a former 10-time Microsoft MVP. He has presented at tech events, including, DevOps Days, Open Source North, Midwest Management Summit (MMS), Microsoft Ignite, BITCon, Experts Live Europe, OSCON, Inside Azure management, and user groups. He has been featured in several publications including the Star Tribune (the 5th largest newspaper in the US). He stays active in the technical community and enjoys blogging about his adventures in the world of IT at www.buchatech.com KEY TAKE-AWAYS FROM THE TALK Overview of Storage in Kubernetes covering Storage Classes, Persistent Volumes, & Persistent Volume Claims. Overview of Azure Storage, Best Practices to running stateful apps in Kubernetes.…
D
Data on Kubernetes Community

1 Dok Talks #120 - A Gentle Introduction to Building Data Intensive Applications // Joe Karlsson 1:01:50
1:01:50
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:01:50
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK We all know that data intensive applications have had explosive growth in the past decade. Data now drives significant portions of our lives, from making sense of IoT swarm data to detecting fraudulent transactions at your bank. As software developers, we all have unprecedented opportunities to build data applications that generate value from these massive datasets. Join Bart Farrell and Joe Karlsson for this talk as we discuss gotchas and best practices of building deploying and scaling data intensive apps. KEY TAKE-AWAYS FROM THE TALK Build a data-intensive application from scratch using only SQL. We will discuss gotchas and best practices of building deploying and scaling a key-value store. BIO Joe Karlsson (He/They) is a Database Engineer turned Developer Advocate (and massive data nerd) currently working at SingleStore. He empowers developers to think creatively when building applications with a massive amount of data, through demos, blogs, videos, or whatever else developers need. Joe's career has taken him from building out database best practices and demos for MongoDB, architecting and building one largest eCommerce websites in North America at Best Buy, and teaching at one of the most highly-rated software development boot camps on Earth. Joe is also a TEDx Speaker, film buff, and avid [TikToker]…
D
Data on Kubernetes Community

1 Dok Talks #119 - Cloud-Native Data Pipelines // Hakan Lofcali 53:25
53:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי53:25
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK This talk walks you through our stack, architecture, and processes. We develop tools to deploy and run data-driven applications in a cloud-native environment. We will give a whirlwind tour on developing a Java Quarkus application, a CICD stack powered by GitHub Actions / ArgoCD, building and deploying containerized Kafka Streams applications at runtime with Jib container builder. Having introduced the above common understanding, we will give a high-level overview of how we utilize modern Kubernetes and Cloud tooling to manage multiple clusters in different organizations together with our customers. BIO DataCater commoditizes data pipeline development lifecycle by applying software engineering and cloud native practices to data work. Hakan is a Software / Data Engineer and CTO of DataCater. He worked and built his knowledge around Software, Data Engineering, and Cloud-Native Computing in severely different environments. From early start-up to hyper-scaler AWS. From sports media companies to highly regulated FSI enterprises. The experiences gained, problems encountered, and solutions found led to him co-founding DataCater to enhance tooling in the Data space.…
D
Data on Kubernetes Community

1 Dok Talks #118 - Troubleshooting ClickHouse Performance // Shiv Lyer 1:02:50
1:02:50
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:02:50
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK This talk is about how I use several tools, technologies and processes to troubleshoot ClicHouse Performance. I will be talking about multiple Linux Toolkits, Trace Profilers like DTrace, BPF etc. and also ClickHouse System Tables. This talk also covers best practices/checklist / run-book for building "High Performance ClickHouse Infrastructure Operations" BIO Open Source Database Systems Geek in MySQL, MariaDB, PostgreSQL and ClickHouse with core expertise in performance, scalability, high availability and database reliability engineering, Shiv currently is the founder and principal of MinervaDB Inc., an enterprise-class 24*7 Consultative Support and Managed Services Provider for MySQL, MariaDB and PostgreSQL. Shiv also is the Founder and Principal of ChistaDATA Inc., an independent 24*7 Consultative Support and Managed Service Provider for ClickHouse. Shiv in the past worked for companies like MySQL AB, SUN Microsystems, AOL, eBay, PayPal, PalominoDB and Percona. Shiv also is a frequent speaker in open source conferences worldwide. KEY TAKE-AWAYS FROM THE TALK Troubleshooting ClickHouse Performance intuitively…
D
Data on Kubernetes Community

1 Dok Talks #117 - Why you should care about data mesh // Luke Feeney 58:40
58:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי58:40
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Data mesh is a new approach for designing modern data architectures by embracing organizational constructs as well as technological aspects, data management, governance, etc. The idea is that data should be easily accessible and interconnected across the entire business. This talk will look at the concept of data mesh architecture. It will discuss implementing data mesh architecture in organizations and will relate data mesh to updates at TerminusDB. BIO Co-founder of TerminusDB, the open source document graph database. KEY TAKE-AWAYS FROM THE TALK Data mesh TerminusDB Document graphs Git for data https://terminusdb.com/blog/…
D
Data on Kubernetes Community

1 Dok Specials - Ask Us Anything About Postgres // Gabriele Bartolini, Ryan Booz & Álvaro Hernández 1:03:30
1:03:30
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:03:30
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK What's the deal with Postgres in Kubernetes? To get some answers as well as more questions, we're bringing together Álvaro Hernandez from Ongres, Ryan Booz from Timescale, and Gabriele Bartolini from EDB for an "Ask Us Anything" session. Send us your questions on Slack!…
D
Data on Kubernetes Community

1 Dok Specials - Ask Patrick and Jeff Anything About Data on Kubernetes // Patrick McFadin & Jeff Carpenter 1:02:15
1:02:15
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:02:15
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Patrick is a Data on Kubernetes Community veteran. He did the very first session "Is k8s even ready for data?" in July 2020 and has seen the growth of the community since then. Jeff Carpenter is a Software Engineer at Datastax where he works on the S targate.io project. If you have questions you want to be answered in the session, please feel free to message Bart on Slack.…
D
Data on Kubernetes Community

1 Dok Specials - Unravel the key to your Kubernetes secrets 2:14:47
2:14:47
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי2:14:47
https://go.dok.community/slack https://dok.community/
D
Data on Kubernetes Community

1 Dok Talks #116 - Nebula Graph: Open Source Distributed Graph Database // Wey (Siwei) Gu 1:07:10
1:07:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:07:10
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Nebula Graph Demystified Graph on K8s Know-How of Graph Database BIO Open Source believer, builder, singer and Graph Magic advocate(to scale the magic to help wizards) https://siwei.io/en/about/ KEY TAKE-AWAYS FROM THE TALK Why& How graph db would help real life problems Arch and Design the Nebula Graph Database and the Intro. of the Project Implementation of the Nebula K8s operator Graph Database Know How in actions on: Building an intelligent Voice Robot MetaData governance and Lineage Building a corporation shareholding relations system…
D
Data on Kubernetes Community

1 Dok Special - Show me the money: The business side of DoK // Evan Powell, Brian Schechter & Misha Herscu 57:10
57:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי57:10
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Running stateful workloads on Kubernetes isn't just a technical question. Without keeping the business value it provides in mind, it becomes a moot point. In order to drive these conversations forward, we'll be joined by Melissa Logan (Director at the DoKC), Evan Powell (Adviser/investor who was instrumental in launching the DoKC), Brian Schecther (Partner at Primary Venture Partners). KEY TAKE-AWAYS FROM THE TALK - What are the economic drivers of Data on Kubernetes adoption? - What are the business or economic drivers in the way of Data on Kubernetes adoption? - How will Data on Kubernetes evolve in the coming years given other open source trends?…
D
Data on Kubernetes Community

1 Dok Talks #115 - What More Can I Learn From My OpenTelemetry Traces? // John Pruitt 1:00:45
1:00:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:00:45
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Of the three observability data types supported by OpenTelemetry (metrics, logs, and traces) the latter is the one with most potential. Tracing gives users insights into how requests are processed by microservices in a modern, cloud-native architecture. Jaeger and Grafana can visualize a single trace, showing how an individual request traversed your entire system. This helps for distributed debugging and analysis, but using traces only this way is limiting. What if you stored tracing data in a SQL database? You could ask global questions about your system. You could find slow communication paths, where the error rate spiked since the last deployment, or where the request rate suddenly dropped. Thus, tracing can be used proactively to help you spot issues before your customers do. This talk will show you how to do all the above by ingesting OpenTelemetry traces into a PostgreSQL/TimescaleDB database, and building custom dashboards using SQL to make the most out of your tracing data. BIO John Pruitt is a software engineer at Timescale. His work focuses on database/SQL development for the Promscale open-source observability tool, and currently on adding support for OpenTelemetry tracing. Prior to joining Timescale, John grew the DBA team at Shipt. Most of the balance of his career was spent building custom time-series applications in the energy industry and leading data warehousing efforts at regional banks. KEY TAKE-AWAYS FROM THE TALK - What is distributed tracing - Why viewing individual traces is of limited value - How SQL can be used to analyze and visualize traces - What insights can be unlocked using SQL against traces…
D
Data on Kubernetes Community

1 Dok Talks #114 - Helm for Beginners with Portainer // Hrittik Roy 51:55
51:55
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי51:55
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Helm allows you to install packages in your cluster, much like you would use apt, yum on your laptop. Just define the components that you want to install in your application, and it will take care of the installation and configuration of these components, saving you from headaches. This talk will go through the package manager for Kubernetes and its components. In the end, we would deploy an application with Helm to our cluster with Portainer. BIO Hrittik is a Portainer Advocate, a software engineer at Ayedo GmbH and a student in his sophomore year. Hrittik focuses on building tech communities and software with the help of cloud native technologies. He has been a speaker at conferences like the Cloud Summit and conducted various events that have collectively empowered students and professionals on DevOps and Cloud. Connect with him here: https://linktr.ee/hrittikhere KEY TAKE-AWAYS FROM THE TALK - Understand Helm, Charts and Repositories - Understand the need for the package manager - Install Application with Helm via Portainer…
D
Data on Kubernetes Community

1 Dok Talks #113 - Developing Stateful Application on Kubernetes // Rob Pacheco 53:15
53:15
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי53:15
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Modern web applications are typically comprised of multiple services which utilize storage in a variety of ways. Utilizing storage in Kubernetes introduces challenges that are not obvious while developing locally. We’ll introduce these challenges and show good and bad ways to handle them. We’ll dive into the details of an application along with its deployment in Kubernetes to understand why certain storage patterns are problematic and how they can be modified to behave well in a Kubernetes-based deployment. BIO Rob Pacheco is currently leading cloud operations at Rumble. Prior to Rumble, Rob spent time at Vision Government Solutions as well as Black Duck Software re-architecting, securing, and containerizing its products to run within containerized environments, including Kubernetes and Google Kubernetes Engine. Rob is also the author of the liveProject “Creating and Managing Cloud Native Services in Kubernetes”, and the upcoming liveVideo “Surviving Kubernetes Deployments as an Application Developer” KEY TAKE-AWAYS FROM THE TALK * Behavior stateful primitives in Kubernetes * Common storage patterns in web applications * Patterns for well-behaved storage in a Kubernetes cluster…
D
Data on Kubernetes Community

1 Dok Talks #112 - Production Postgres Made Easy on Kubernetes // Jonathan Katz 1:02:45
1:02:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:02:45
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK It's one thing to get a database up and running on Kubernetes; it's another to run it in production. There are many "Day 2" features to think about when running any database in production, including high availability, backups + RTO/RPO, monitoring, security, and disaster recovery. Combining Kubernetes with the Operator pattern can help make this easier, and add advanced features like self-healing, simple scaling, secure credential management, and more. PGO, the open source Postgres Operator from Crunchy Data, makes it easy to manage production Postgres clusters on Kubernetes the GitOps way. Using simple manifests, PGO deploys HA Postgres clusters with self-healing capabilities that apps can securely connect to. In this talk we will review the important concepts to know for running databases in production, how PGO implements these features, and then spend most of the time demoing how all of the key "Day 2" features work with a PGO-managed Postgres databases connected with an application! BIO Jonathan S. Katz is the VP of Platform Engineering at Crunchy Data, with a focus on managing PGO, an open source Postgres Operator. Jonathan is a member of the PostgreSQL Core Team and involved in various governance aspects of the PostgreSQL Global Development Group. He serves as a Secretary and Director of the nonprofit PostgreSQL Community Association of Canada and is a Director of the nonprofit United States PostgreSQL Association. He has given talks at conferences all over the world for over a decade on topics involving Postgres, from application developer uses to operations, particularly focusing on Kubernetes over the past several years. Prior to Crunchy Data, Jonathan was CTO at VenueBook, and before that, VP of Technology at Paperless Post. At both companies Jonathan developed robust platforms using PostgreSQL, taking advantage of its many features, from complex data types to its ability to stream logical changes. Jonathan graduated from Tufts University with a B.S. in Computer Science and a B.A. in Mathematics.…
D
Data on Kubernetes Community

1 Dok Talks #111 - Scheduled Scaling with Dask and Argo Workflows 1:05:30
1:05:30
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:05:30
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Complex computational workloads in Python are a common sight these days, especially in the context of processing large and complex datasets. Battle-hardened modules such as Numpy, Pandas, and Scikit-Learn can perform low-level tasks, while tools like Dask makes it easy to parallelize these workloads across distributed computational environments. Meanwhile, Argo Workflows offers a Kubernetes-native solution to provisioning cloud resources in Kubernetes and triggering workflows on a regular schedule. Being Kubernetes-native, Argo Workflows also meshes nicely with other Kubernetes tools. This talk discusses the combination of these two worlds by showcasing a set-up for Argo-managed workflows which schedule and automatically scale-out Dask-powered data pipelines in Python. BIO Former academic in the field of renewable energy simulation and energy systems analysis. Currently responsible for architecting and maintaining the cloud- and data strategy at ACCURE Battery Intelligence KEY TAKE-AWAYS FROM THE TALK Argo Workflows + Dask is a nice combination for data-processing pipelines. There are a a few "gotchyas" to be on the look-out for, but in nevertheless this is still a generally-applicable and powerful combination. https://github.com/sevberg…
D
Data on Kubernetes Community

1 Dok Talks #110 - Overview of the Oracle Database Operator for Kubernetes // Paulo Simoes 44:25
44:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי44:25
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK An introduction to OraOperator, the open source project that makes Oracle Database Kubernetes-Native. BIO Paulo works as the Developer Evangelist at Oracle focuses on Microservices, Application Development with Converged Oracle Database, and Event Mesh. His current technical expertise focuses on Cloud Native architecture and best practices, including containerization and microservices development. KEY TAKE-AWAYS FROM THE TALK Oracle Database as a Kubernetes-native platform. Simplified way to operator Oracle Database.…
D
Data on Kubernetes Community

1 Dok Talks #109 - Benchmarking for PostgreSQL workloads in Kubernetes / Part 2 // Gabriele Bartolini 1:04:35
1:04:35
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:04:35
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK 6 months have passed since our last DoK webinar about benchmarking PostgreSQL workloads in a Kubernetes environment. In the meantime, many things have happened at EDB, and we’re happy to share what we’ve learned in this timeframe. We’ll use cnp-bench and cnp-sandbox to help us describe some of the challenges we might face when running PostgreSQL workloads, how to spot them, and what actions to take to make your databases healthier and more longeve. cnp-bench is a collection of Helm charts that help run storage and database benchmarks, using popular open source tools like fio, pgbench, and HammerDB. cnp-sandbox is a Helm chart that sets up a Prometheus/Grafana stack, including basic metrics and dashboards for Cloud Native PostgreSQL, the Kubernetes operator developed by EDB. Both cnp-sandbox and cnp-bench are open source and recommended for development, testing, and pre-production environments only. BIO A long time open-source programmer and entrepreneur, Gabriele has a degree in Statistics from the University of Florence. After having consistently contributed to the growth of 2ndQuadrant and its members through nurturing a lean and devops culture, he is now leading the Cloud Native initiative at EDB. Gabriele lives in Prato, a small but vibrant city located in the northern part of Tuscany, Italy - famous for having hosted the first European PostgreSQL conferences. His second home is Melbourne, Australia, where he studied at Monash University and worked in the ICT sector. He loves playing the Blues with his Fender Stratocaster, but his major passions are called Elisabeth and Charlotte! KEY TAKE-AWAYS FROM THE TALK - A methodology for benchmarking a PostgreSQL database in Kubernetes - Open source set of tools for benchmarking a PostgreSQL database in Kubernetes - Reasons why benchmarking both the storage and the database is important https://github.com/EnterpriseDB/cnp-sandbox https://github.com/EnterpriseDB/cnp-bench…
D
Data on Kubernetes Community

1 Dok Talks #108 - Postgres on Kubernetes Applied at Scale in Zalando // Álvaro Hernández & Alexander Kukushkin 1:02:20
1:02:20
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:02:20
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Some of the topics that may or may not be discussed on this live talk could be: - Other than Patroni and the operator, what else was needed to fully switch to Kubernetes? - What is left, what is the future going to bring to the world of Kubernetes and Postgres operators? Expect strong opinions, and facts. Live. Uncensored. Postgres. On. Kubernetes. BIO Álvaro is a passionate database and software developer. He founded and works as the Founder & CEO of OnGres ( https://ongres.com ). He has been dedicated to PostgreSQL and R&D in databases for two decades. An open source advocate and developer at heart, Álvaro is a well-known member of the PostgreSQL Community, to which he has contributed founding the non-profit Fundación PostgreSQL ( https://postgresql.fund ) and the Spanish PostgreSQL User Group. You can find him frequently speaking at PostgreSQL, database, cloud, and Java conferences. Every year, Álvaro travels approximately three-four times around the globe—in 2020 (or now more likely 2021 due to COVID-19), he will hit the milestone of having delivered 100 tech talks. AWS Data Hero (2019). ------------ During his professional career, Alexander touched PostgreSQL from all possible sides: as a Web Developer, as System Administrator, and as a Database Engineer now. Five years ago he started working on Patroni project and achieved quite a big success with it. Nowadays Patroni is used in the heart of PostgreSQL-as-a-Service on Kubernetes at Zalando to run 1000+ PostgreSQL HA clusters. Besides that, Alexander periodically reports bugs and contributes patches to PostgreSQL and some other open-source projects (usually Postgres related).…
D
Data on Kubernetes Community

1 DoK Talks #107- Cross region replication in Percona Kubernetes Operators // Ege Gunes 45:40
45:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי45:40
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK With Percona Kubernetes Operators you can migrate your data to Kubernetes or have a disaster recovery site to recover it in unfortunate times. I'll talk about the implementation, challenges and caveats of such setups and what the future may bring.…
D
Data on Kubernetes Community

1 DoK Talks #106- Cloud native data warehousing with Kubernetes // Mark Cusack & Matthew Ripley 59:25
59:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי59:25
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Yellowbrick's cloud data warehouse is natively integrated with Kubernetes. Not only does Kubernetes provide the elasticity, resilience and upgradeability expected of a modern data warehouse, it also enables Yellowbrick to be deployed anywhere: in public clouds, private clouds and at the network edge. During this talk, we will dive into the details of how we use Kubernetes to power our data warehouse solution, how we've created a SQL interface to Kubernetes, and give a live demo. We'll also talk about the business value that enterprises can derive from such a flexible solution built on Kubernetes. KEY TAKE-AWAYS FROM THE TALK You'll come away with: an understanding of the business value derived from a Kubernetes-based data warehouse; insight into our SQL interface to Kubernetes; and the overall capabilities of Yellowbrick BIO Before joining Yellowbrick as CTO, Mark was Vice President for Data and Analytics at Teradata, where he led a variety of product management and technology teams in data warehouse and advanced analytics groups. He was also Chief Architect of Teradata’s IoT Analytics effort. Mark joined Teradata in 2014 when Teradata acquired the startup RainStor, where he was a co-founding developer and Chief Architect. Prior to RainStor, Mark was a lead scientist in the UK Ministry of Defence. Mark holds a Ph.D. in computational physics from Newcastle University, UK, with a thesis centered on discovering the electronic and non-linear optical properties of quantum dots. As a research fellow at Newcastle, he developed new techniques to model these novel quantum structures using large-scale parallel and distributed computing approaches..…
D
Data on Kubernetes Community

1 DoK Special: Mental Health and Covid-19: Retrospective and Perspective // Andrea Dobson, Erin Grinshteyn and Julia simon 59:15
59:15
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי59:15
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Fear, anxiety, uncertainty, frustration- any of these sound familiar since the pandemic started? How is your organization addressing the challenges of Covid-19 on mental health? We will start this session with an overview from Erin Grinshteyn, who is an Associate Professor at the University of San Francisco and has done research about the impact of fear in various collectives. From there we will have Andrea Dobson, Head of People at Container Solutions who is also a psychologist, and Julia Simon, HR Professional and Leader at the #burnout group in the CNCF…
D
Data on Kubernetes Community

1 DoK Talks #105 - Run Graph Database on K3s with KubeSphere // Feynman Zhou & Wey Gu 1:02:10
1:02:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:02:10
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Graph databases are fundamentally designed with a focus on the relationship between data sets. Running stateful workloads like databases on a container platform has always been a challenge. How to unleash the power of your connected data on Kubernetes with cloud-native technologies? Nebula Graph is a highly performant linearly scalable graph database available for use via a shared-nothing distributed model. KubeSphere is an open source container platform built on Kubernetes, helping developers to implement application delivery and build observability with ease. In this talk, maintainers from KubeSphere and Nebula Graph community will demonstrate how to leverage Operator to deploy and manage a graph database on K3s with KubeSphere. KEY TAKE-AWAYS FROM THE TALK - KubeSphere and K3s Walkthrough - Graph Database and Nebula Graph: Learn and Practice - How to use KubeKey to install K3s and OpenEBS within minutes - Run Nebula Graph on K3s with KubeSphere - Build cloud native nbservability for Database application BIO Feynman is a CNCF ambassador and community manager. He is growing and maintaining the KubeSphere open source community, which helps users to widely adopt Kubernetes and reduce the learning curve of using cloud-native technologies. He focuses on technical writing and advocacy and outreacy in cloud-native area.…
D
Data on Kubernetes Community

1 DoK Talks #104- How to enable self-service Infrastructure by shifting your Data left with Kubernetes // Nic Vermande 1:08:30
1:08:30
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:08:30
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK On-demand infrastructure services have become a key requirement for developers and DevOps engineers to provision and consume the resources they need. While it can be achieved in public cloud with the support of a rich API ecosystem, there's a lack of common framework across CSP's. As the next-generation Cloud Operating System, Kubernetes provides a variety of services that can be used to reach the same goal, but with a common framework across any type environment: on-premises, private or public clouds. In this talk, we're going to explore the solutions and patterns available for building end-to-end CI/CD pipelines that include persistent data services. BIO Nicolas is an experienced hands-on technologist, evangelist and product owner who has been working in the fields of Cloud-Native technologies, Open Source Software, Virtualization and Datacenter networking for the past 17 years. Passionate about enabling users and building cool tech solving real-life problems, you'll often see him speaking at global tech conferences and online events, spreading the word and walking the walk with customers.…
D
Data on Kubernetes Community

1 DoK Talks#103 -Performant and Version-Aware Analytics With Spark & lakeFS on K8s // Itai Admi 39:25
39:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי39:25
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Spark and lakeFS are revolutionizing large scale data processing that is version-aware. Is it possible to run this architecture over Kubernetes? We’ll cover the fastest way to get this environment up and running, and the benefits you get with it. Finally we’ll show how horizontal scaling and the lakeFS Hadoop Filesystem avoid processing bottlenecks as workloads increase. BIO Itai is a R&D team leader at Treeverse, the company behind open-source lakeFS. He thrives on finding creative solutions for complex problems, especially if it involves code. Previously, Itai worked at Microsoft and Ridge on data infrastructure, tooling, and performance. Itai received his B.Sc degree in Computer Science and an MBA from Tel Aviv University. KEY TAKE-AWAYS FROM THE TALK - Importance of building reproducible data pipelines. - Managing your data the same way you're managing your code.…
D
Data on Kubernetes Community

1 DoK Talks #102- From Enemy to Evangelist // Rick Vasquez 1:01:40
1:01:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:01:40
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK A few short years ago I was a huge opponent of putting any data in Kubernetes, now I'm one of the most vocal proponents of doing so. Listen to how the ecosystem changed my mind. BIO An experienced technologist who has been hands-on in solving data problems for as long as I can remember. Currently working on changing the way we use storage in the datacenter.…
D
Data on Kubernetes Community

1 DoK Talks #100- CAPE for data backup/restore on kubernetes // Sanjeev Ganjihal 1:02:41
1:02:41
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:02:41
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK CAPE is an open source solution offering advanced kubernetes features for DR, data migration and mobility, multi-cluster app deployment etc. I will demo how to install CAPE and also cover how to backup and restore data using CAPE KEY TAKE-AWAYS Learn about a new open source tool for kubernetes backup/restore BIO I come from a strong infrastructure background with expertise in infrastructure automation, containers, kubernetes, DevOps and service mesh.…
D
Data on Kubernetes Community

1 DoK Talks #101- Redpanda: how to build a storage engine for kubernetes // Alexander Gallego 1:01:25
1:01:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:01:25
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK How to build a stateful, transactional streaming storage engine on top of kubernetes. KEY TAKE-AWAYS FROM THE TALK 1. Building storage engines in kubernetes is hard 2. You need tiered storage (S3 or google cloud bucket, Azure Blob) for total disaster recovery 3. You also need sound replication inside the cluster 4. You need a simple architecture so you can scale the pods easily BIO Alex Gallego is the founder and CEO of Vectorized, where he & the team hack on Redpanda, a modern streaming platform for mission critical workloads. Prior to Vectorized, he was a principal engineer at Akamai, as well as co-founder and CTO of Concord.io, a high performance stream processing engine built in C++ and acquired by Akamai in 2016.…
D
Data on Kubernetes Community

1 DoK Talks #99- ETL/ELT on Kubernetes with Airbyte: K8s Development Insights // Abhi Vaidyanatha 1:00:45
1:00:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:00:45
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK ETL/ELT on Kubernetes is currently an unsolved problem. There are a lot of different approaches vying for a spot as the de facto method, but none are clear winners. Considering that the cloud-native landscape is built for deploying Dockerized, open-source software, many of the closed-source solutions fall flat and don't mesh with the trajectory of the community. Airbyte is an open-source ETL/ELT tool that harmonizes well with the cloud-native landscape and lives to enable your stateful workloads on Kubernetes. Previously, I have talked about a theoretical deployment on Kubernetes and the nuances behind deploying an ETL/ELT pipeline in such an environment. Now, I'm looking to follow that up with how we actually implemented that strategy as we launched our K8s beta. Additionally, I'll dive into some of the nitty gritty details that we needed to figure out in order to get this all working... stuff that isn't really found online! Overall, this will be a really unique case of getting to do a retrospective on what we planned our architecture to look like and following up with some great development insights as we solidified the final implementation. KEY TAKE-AWAYS FROM THE TALK - Quick overview of Airbyte and open-source ETL/ELT [5 minutes] - Why run your ETL/ELT in K8s? [3 minutes] - A quick recap on the previous talk (what we thought the architecture would look like) [5 minutes] - Display the actual architecture and implementation [10 minutes] -> Talk about how to communicate with k8s pods on STDOUT and STDIN pipes -> Describe parent-child process termination strategy -> Describe persistence layer/strategy and config storage - Quick demo of an Airbyte deployment on K8s [10 minutes] BIO Abhi is a confused economist who enjoys writing backend code for data management software. He now spends most of his time doing developer relations in the data integration space, where he looks to evangelize open source technologies. In his spare time he is a DJ, drummer, and competitive Super Smash Bros. player. He is a staunch advocate of proper semicolon usage, Oxford commas and overused grammar jokes.…
D
Data on Kubernetes Community

1 DoK Talks #98- It´s not me, it´s you: Migrating between third party storage solutions at scale // Dinesh Majrekar 48:30
48:30
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי48:30
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK With the help of 3rd parties, running stateful workloads can be a “trivial” experience but what happens when you need to migrate between vendors? Civo offers managed Kubernetes clusters powered by K3s to customers around the world. We manage thousands of Virtual Machines and stateful customer data within multiple data centres across the globe. During recent vendor contract renewal negotiations, it was decided to move to a new storage provider. This required the team to find a way of having the 2 storage vendors co-exist on existing superclusters, and have the ability to safely migrate data between the 2 vendors. Join Civo’s Director of Innovation, Dinesh, as he talks through the research, testing and migration process carried out in the past few months. BIO Dinesh is Director of Innovation at Civo and has been the lead architect for the new Civo Stack Platform. Having worked in the hosting industry for many years, Dinesh has a passion for creating solutions that operate at scale. This not only applies to the technology stack, but for nurturing engineers through their career.…
D
Data on Kubernetes Community

1 DoK Talks #97- Learn about Developing a Multicluster Operator with K8ssandra Operator // John Sanda 1:02:10
1:02:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:02:10
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Cassandra is a highly scalable database with an architecture that makes it well suited for multi-region workloads. A Kubernetes cluster often spans across multiple zones within a single region. Multi-region Kubernetes clusters are less common though due to the challenges that they present. This has led to a growing number of multi-cluster solutions. In this presentation John Sanda introduces K8ssandra Operator. It is designed from the ground up for multi-cluster deployments. John will discuss how to reconcile objects across multiple clusters, how to manage secrets, pitfalls to avoid, and testing strategies. BIO John Sanda is a DataStax engineer working on the K8ssandra project. He is passionate about Cassandra and Kubernetes and loves being involved in open source. Prior to joining DataStax John worked for a year at The Last Pickle as an Apache Cassandra consultant. Prior to that, he spent a number of years at Red Hat as an engineer. It was during that time John got involved with Cassandra when he redesigned a metrics data store and built it with Cassandra in place of an RDBMS. He had his first initial exposure to Cassandra and Kubernetes when the metrics storage engine was later used in OpenShift.…
D
Data on Kubernetes Community

1 DoK Talks #96- Persistent Disk or StatefulSet? The right way and the wrong way to make apps persist state inside a K8s cluster // Neil Cresswell 50:35
50:35
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי50:35
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK There is confusion amongst Kubernetes users as to when you should make a Deployment with a PVC and when you should use a StatefulSet with a PVC. There is also a general lack of understanding when it comes to disk access policies, what RWO/RWX mean and what they allow you to do. These concepts are complicated and require a deep level of understanding in order to avoid users making bad decisions that they come to regret later. In this talk we will explore when you should use each type and what things you need to think about before making a decision. We'll also explore in detail how to safely deploy a multi-replica application with persistence using Portainer. As part of the talk we will explore disk access policy, understand what RWO really is and how RWX disk access changes the equation in regards to persistence. TAKE-AWAYS From this talk delegates will learn the difference between persistent disk and StatefulSet. They will learn when to use each (and when not to) and what are the pitfalls they should be aware of when deploying multi replica apps. They will also learn what is disk access policy, what is RWO and how RWX disk access changes the equation in regards to persistence BIO Neil is the CEO and Co-Founder of Portainer.io, based in Auckland New Zealand. Neil is a career consultant, having spent 25 years in the roles of systems engineering, IT consulting, and more recently IT management. Neil started his career with 12 years at IBM as his foundation, where he had a leading role in Server, Storage, and Virtualisation systems engineering. Neil moved from IBM into self-employment with two business partners, who together created one of NZ’s (and later ASEANs) leading VMware consulting practices, ViFX. During this time, Neil was based in Singapore, and was responsible for the design and deployment of the largest and most complex VMware deployments in Asia Pacific. This experience saw Neil consulting to CIOs, CISO’s and CTO’s all over the region, picking up a vast array of intelligence on what these senior IT leaders expect from a product before it can obtain widespread adoption in their organisations. Neil then moved into a CEO role at a start-up Cloud Service Provider, whereby he grew that business from nothing to $8M ARR in just 2 years. Neil’s exposure to Docker technologies came 4 years ago, when at the helm of the Cloud Service Provider, he wanted to bring a “Containers as a Service” offering to market (well before many of the hyper-scales had such offerings). Neil attempted to find self-service portals that could be deployed by the service provider as a way to help his customers consume the service, but there were none available, so Neil had Portainer built. Portainer has now grown into an entity of its own right, and using Neil’s knowledge of the CIO’s requirements, knowledge of the technology, and knowledge of infrastructure management, Neil has crafted Portainer into a strong contender in the Container management space. Neil now runs the Portainer.io business, and is responsible for the overall product strategy and the team entrusted to deliver on that strategy.…
D
Data on Kubernetes Community

1 DoK Talks #95 I´ve got 99 Workloads and 53 of Them Are Data // Michelle Gienow 1:02:10
1:02:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:02:10
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Workloads tell the true story of how Kubernetes is being used in production these days. We talked to 200 architects and DevOps engineers about the nitty gritty deployment details, especially around data-intensive workloads. The result is a report with quantitative information around workload numbers and types, preferred production environments, and what kind of teams are running it all. KEY TAKE-AWAYS Organizations are increasingly running data-intensive workloads on Kubernetes. Serverless technology, the cutting-edge tech whose moment always seems to be on the cusp of arriving, is finally here - and this has implications for the database. BIO Recovering journalist turned front end developer; Jamstack evangelist and Kubernetes curious. I like coffee, 8 bit video games and building things.…
D
Data on Kubernetes Community

1 DoK Talks #94- Security and SRE // Tammy Butow & Prima Virani 1:02:20
1:02:20
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:02:20
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Prima and Tammy join us to discuss the bridges between Security and SRE. How can these two teams work best together? What can they learn from each other? Prima is a Security Engineer and Tammy is a Site Reliability Engineer. They are both Australians living in the USA with 10+ years of experience each working in tech. TALK TAKEAWAYS 1.You'll learn tips for SRE and Security teams to work together 2.You'll learn what SREs can learn from Security and vice versa 3.You'll learn about the new field of DevSecOps and how it can help your organisation improve BIO Tammy Bryant Butow is a principal SRE at Gremlin, where she works on chaos engineering—the facilitation of controlled experiments to identify improvements. Gremlin's enterprise Chaos Engineering platform makes it easy to build more reliable applications in order to prevent outages, innovate faster, and earn customer trust. Previously, Tammy led SRE teams at Dropbox responsible for the databases and storage systems used by over 500 million customers and was an IMOC (incident manager on call), where she was responsible for managing and resolving high-severity incidents across the company. She has also worked in infrastructure engineering, security engineering, and product engineering. Tammy is the cofounder of Girl Geek Academy, a global movement to teach one million women technical skills by 2025. Tammy is an Australian and enjoys riding bikes, skateboarding, snowboarding, and surfing. She also loves mosh pits, crowd surfing, metal, and hardcore punk. Prima is a seasoned Security professional who has worked in a variety of industries such as Consumer Tech, Oil & Gas, Media, and Fin-tech. She is a Senior Security Engineer on the SIRT team at Segment where she enjoys creating automation tooling for Incident Response and occasionally dabbles in Security DevOps. She loves sharing her experiences with the industry and has spoken at many meetups and conferences globally including, but not limited to, Agile India 2020, MacDevOpsCon Vancouver 2019, and Grace Hopper Conference 2017.…
D
Data on Kubernetes Community

1 DoK Talks #93- Uncovering Hidden Delays In The CI/CD DevOps Pipeline // Barak Nissim 47:00
47:00
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי47:00
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Discussion around the drivers for DevOps data management and the primary data wait cases in the typical value stream. You’ll discover how data services solutions that enable instant data recovery and data copy will immediately accelerate your CI/CD pipeline. TAKE-AWAYS We’ll discuss: - the specific data wait time cases that could be bogging down your processes - the infrastructure cost-efficiency of a data services platform for kubernetes - the uplifting impact on productivity across developers, testers, and operations BIO With over 15 years of IT experience from core datacenter infrastructure to advanced cloud technologies, Barak leads ionir’s product and solutions portfolio, engaging with customers, partners, and acts as cloud-native storage advocate.…
D
Data on Kubernetes Community

1 DoK Talks #92- eBPF from the view of a storage developer // Richard Kovacs 58:37
58:37
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי58:37
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK eBPF is one of the key technologies nowadays. There are several existing technologies in network or observability fields but not much in storage space. This presentation tells my research story and tries to define some of the possibilities of the technology. eBPF is a key technology thats found in the market today. It is apparent that there are several existing technologies in both network or observability fields but not much in the way of storage space. During the webinar Richard Kovacs, Integration Engineer will take you through: - His research story & findings on eBPF - Outline some of the possibilities of the technology - Demonstrate findings via a live demo BIO Richard is a Kubernetes integration engineer at StorageOS with many years of DevOps background. His main focuses are Go micro-service and Kubernetes operator development. Richard is passionate about technology and innovation. His constant curiosity drives him to learn, gain knowledge and be an expert in this area. He also loves getting involved with open-source communities. He is also a frequent speaker at local conferences and community events. KEY TAKE-AWAYS FROM THE TALK - What is eBPF and why is it useful. - What are the key features of eBPF in storage space. - How to write our first eBPF plugin.…
D
Data on Kubernetes Community

1 DoK Talks #91- Leveraging Druid Operator to manage Apache Druid on Kubernetes // Adheip Singh 55:50
55:50
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי55:50
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK This talk will introduce Druid-Operator ( https://github.com/druid-io/druid-operator ) and how Kubernetes and Operator framework can be used to write an operator that enables provisioning, management, and scaling of a complex cluster of Apache Druid to 1000s of nodes. This talk will cover why Kubernetes and the operator framework is a perfect fit for managing a complex stateful application. Learnings and pitfalls faced while writing the Druid-Operator and the special considerations to be taken into account when managing and scaling a stateful application. BIO Adheip Singh is a software engineer at Rill Data Inc. He is maintainer of druid-operator and kubectl-druid-plugin. Active contributor the operator ecosystem. TALK TAKEAWAYS 1.This session will help anyone looking for running Apache Druid on Kubernetes 2.It will help understand how someone can write operators to simplify the management of complex applications 3.Special considerations to take into account when managing and scaling a stateful application. 4.How to scope an operator within multiple namespaces 5.Druid operator features and how it scales and manages the underlying statefulsets and deployments.…
D
Data on Kubernetes Community

1 DoK Student Sessions- Data on Kubernetes: What's a poor DBA to do? // Derek Downey 1:11:15
1:11:15
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:11:15
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Database Administrators (DBAs) have jobs today because someone recognized that data is critical. It's critical to businesses, who need it to make good decisions. It's critical to users, who need to know how many people liked their recent post. DBAs developed process and maybe even automation to ensure that this critical data is available and secure, while accessing it is fast. And now someone wants them to put that critical data on Kubernetes and they've got some concerns. In this session, you will learn how to DBA through the lens of exploring typical tasks of Data Availability, Security and Performance on Kubernetes. And if you're not a DBA, hopefully you will gain some empathy for your poor DBA.…
D
Data on Kubernetes Community

1 DoK Talks #90- Build Your Kubernetes Operator with the Right Tool! // Rafal Leszko 1:03:20
1:03:20
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:03:20
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK You want to build a Kubernetes Operator for your software. Which tool to choose? Operator SDK with Helm, Ansible, or Go? Or maybe start from scratch with Python, Java, or any other programming language? And what is the right phase in the Operator Capability/Maturity Model that you should provide? In my talk I'll present: - Different ways of building Kubernetes Operators - Demo of building the same Operator using different tools - Methods used by the most popular Operators (Couchbase, Prometheus, MongoDB) - Operator Capability Model and how it affects your toolkit - Our journey with Hazelcast Operator BIO Tech Lead of the Cloud-Native Team at Hazelcast, author of the book "Continuous Delivery with Docker and Jenkins", trainer, and conference speaker. He specializes in Java development, Cloud environments, and Continuous Delivery. A former employee in a number of companies and scientific organizations: Google, CERN, AGH University, and more.…
D
Data on Kubernetes Community

1 DoK Talks #89- An introduction to Azure Arc Enabled Data Services // Chris Adkin 58:40
58:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי58:40
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Azure Arc enabled Data Services allows organizations to run the Azure dialect of database-as-a-service on Kubernetes on virtually any cloud or infrastructure. This session will cover the architecture of this new product, how it is deployed, the dialects of database engine that it currently supports and the storage considerations for deploying this service. BIO Chris has been working with databases for over twenty years in various roles ranging from DBA to developer to architect. Chris currently works at Pure Storage as a solutions architect and one of his main focus areas is databases running on Kubernetes, Chris is also a co-organizer of the Azure Cloud Native Data Platform meetup user group.…
D
Data on Kubernetes Community

1 DoK Talks #88- NSA & CISA K8s hardening Guidance: data & secrets protection // Leonid Sandler 1:11:05
1:11:05
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:11:05
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK The National Security Agency (NSA) and the Cybersecurity and Infrastructure Security Agency (CISA) have published comprehensive recommendations for strengthening the security of an organization’s Kubernetes system to help companies make their Kubernetes environment more difficult to compromise. one of the key areas this report covers is how to configure K8s to prevent data breaches. in this session, Leonid Sandler, ARMO CTO and Co-Founder, will review this guidance, and its impact on K8s data protection and will reveal everything you should know before choosing Kubernetes secret management method. BIO Leonid Sandler is CTO and co-founder of ARMO, 20+ years software security experience, NDS CTO Software Security, Designed Cloud Security products serving the world largest content providers. Built & managed Product and Development team of over 130 people KEY TAKE-AWAYS FROM THE TALK * How to use NSA & CISA K8s hardening guidance * How to protect K8s secrets…
D
Data on Kubernetes Community

1 DoK Talks #87- Cloud Native Microgateway Architecture // Leonardo Silva 1:04:00
1:04:00
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:04:00
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Building a gateway api architecture is not an easy task. In this presentation Leonardo will talk about how the microgateway strategy can be used to resiliently and reliably expose apis in a cloud native environment while increasing the productivity of engineering teams. BIO Experienced Senior Solutions Architect and DevSecOps Engineer with a demonstrated history of working in the information technology and services industry. Skilled in Microservices architecture, AWS Cloud Architecture, DevOps, and Kubernetes. KEY TAKE-AWAYS FROM THE TALK Use microgateway architecture to expose APIs instead of monolithic approaches. You can combine API gateways, kubernetes and istio features to increase your system's resilience whithout losing control of security and compliance You can apply gitops techniques to improve teams performance in real world projects.…
D
Data on Kubernetes Community

1 DoK Student Sessions- Intro to Apache Cassandra // Aleks Volochnev 2:02:35
2:02:35
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי2:02:35
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK What do companies like Apple, Netflix, Huawei or Instagram have in common? Aside from billions of requests, petabytes of data, and millions of users around the world, all of these companies use Apache Cassandra™ - the NoSQL decentralised database, well known by data experts for its scaling capabilities, high availability and incredible performance. Join us for the Introduction to Apache Cassandra workshop to learn the fundamentals of one of the most powerful databases of our time, make your step into NoSQL world and take your chance to get a voucher for free Cassandra Developer or Cassandra Administrator certification exam. BIO After many years in formation technologies as a developer, technical lead, DevOps Engineer and architect, Aleks focused himself on cloud computing and distributed systems. Professional Cloud Architect and Developer Advocate, he shares his knowledge and expertise in the field of high-performant and disaster tolerant systems.…
D
Data on Kubernetes Community

1 DoK Talks #86- Understanding performance design and benchmarking in Kubernetes // Dean Lewis 1:04:27
1:04:27
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:04:27
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK This session will talk about the considerations and tools available when it comes to performance and design of Kubernetes running in a virtualised environment. BIO Working at VMware as a Cloud Management specialist focusing on automation and cloud native technologies. KEY TAKE-AWAYS FROM THE TALK From this session you will learn what tools are readily available today to help you benchmark your platforms.…
D
Data on Kubernetes Community

1 DoK Talks #83- (Further) Exploring DataOps and Kubernetes // Randy Abernethy 57:10
57:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי57:10
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Randy will discuss the increasing importance of DataOps to the modern analytics-driven enterprise and how it integrates with continuous development and operations environments. He’ll illustrate how Kubernetes enables DataOps across four common patterns used for continuous data integration: batch, streaming, microservice and ETL solutions. Randy will also explain how these design patterns – in concert with automation, CI/CD and containerization – can be applied to high-scale data integration scenarios, creating an environment that supports rapid development of elastic analytics solutions. BIO Randy Abernethy is a Managing Partner at RX-M, a cloud native advisory and training firm. He is a tech entrepreneur, startup advisor, financial technology pioneer, Apache Thrift committer, Cloud Native Ambassador and author. Randy’s work allows him to pursue his passion for large scale distributed systems and all things cloud native. Interacting with hundreds of technology professionals each year, he stays highly connected with the latest software and platform trends and developments. Randy enjoys collaborating and networking with individuals around the world.…
D
Data on Kubernetes Community

1 DoK Talks #85- I lost my cluster, can I get it back? // Salman Iqbal 1:30:25
1:30:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:30:25
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK We all heard about how Kubernetes is self healing. However, what happens if you lose your Kubernetes cluster and your data? Is there a way to recover that? This talk will explore the why and how on backing up and restoring Kubernetes cluster. The talk will focus on three scenarios for backup and restore: - Kubernetes control plane - Applications - Data A comparison of different techniques will be presented with some live demos! BIO Salman works as an MLOps Engineer at Appvia and a Kuberenetes Instructor at Learnk8s. He has worked with a number of organisations in setting up Machine Learning platforms for teams to operate at scale. He is also a Certified Kubernetes Administrator and a strong advocate for Cloud Native technologies. He co-founded Cloud Native Wales, an initiative to bring together a community that shares ideas on how to use cloud native technologies effectively. You can also find him on YouTube as Soulman Iqbal where he tries to explain cloud native concepts by simplifying them.…
D
Data on Kubernetes Community

1 DoK Talks #84- Build your IDP with Otomi, the first and only Kubernetes-based O.S // Sander Rodenhuis 1:06:45
1:06:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:06:45
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Kubernetes is a platform for building platforms and is not really developer-friendly. That's why red Kubes created Otomi, the first and only Kubernetes-based operating system. Install Otomi with a single Helm Chart, login, and start deploying applications with only a few clicks. Use Otomi to build your Enterprise Container Platform or Internal Developer Platform in minutes. In this talk, I'm going to introduce Otomi. We'll start by looking at the architecture and how you can install Otomi on any Kubernetes cluster. But don't expect a long and boring slide deck, we'll quickly take a look at a live install and navigate through all the Otomi features. BIO Sander is the founder and CEO of Red Kubes, the company behind Otomi. Otomi is an open-source extension for Kubernetes to support the most common DevOps and SRE use cases. Before Sander founded Red Kubes, he worked as a freelance consultant for the last 12 years, doing a lot of cloud-native stuff with Azure, AWS, and of course Kubernetes. KEY TAKE-AWAYS FROM THE TALK - Now I can turn my Kubernetes cluster into my own Internal Developer Platform in minutes - With Otomi, developers can deploy containers without having to write any YAML manifests - With Otomi, you can provide developers with self-service and onboard new development teams onto the platform within minutes - Use Otomi to build your Enterprise-grade Container Platform - Otomi is completely FREE and open-source - And Otomi runs on any Kubernetes cluster in any cloud…
D
Data on Kubernetes Community

1 DoK Talks #82- What we learned from 100 Kubernetes postmortems and how to prevent them // Shimon Tolts 52:25
52:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי52:25
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK A smart person learns from their own mistakes, but a truly wise person learns from the mistakes of others. When launching our product, we wanted to learn as much as possible about typical pains in our ecosystem, and did so by reviewing many post-mortems (100+!) to discover the recurring patterns, anti-patterns, and root causes of typical outages in Kubernetes-based systems. In this talk we have aggregated for you the insights we gathered, and in particular will review the most obvious DON’Ts and some less obvious ones, that may help you prevent your next production outage by learning from others' real world (horror) stories. BIO At 15 years old, Shimon started his first company, providing web hosting on top of Linux servers in the pre-cloud era. He established and managed the Software Engineering Infrastructure department for 400 engineers at ironSource. Also as an AWS Community Hero, Shimon runs the largest AWS user-group worldwide and is an avid speaker at conferences. Shimon co-leads the Israeli CNCF chapter Today, Shimon is the CEO and Co-Founder at Datree, which prevents Kubernetes misconfiguration from reaching production. KEY TAKE-AWAYS FROM THE TALK How to prevent misconfiguration in Kubernetes workloads…
D
Data on Kubernetes Community

1 DoK Talks #81- How to Alert on SLOs using Error Budget Burn Rate // Yuri Grinshteyn 56:15
56:15
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי56:15
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Join this session to learn: - why reliability is a key feature of your service - why SLOs are the best way to measure it and set reliability targets - how error budgets increase your velocity - what an error budget burn rate is - how to set up alert using burn rates to optimize your response and operational load BIO Yuri works with Google Cloud Platform customers to help them design, architect, build, and operate reliable applications and services and advocates for SRE principles and practices. He occasionally writes code in Node.js, Python, and Go and likes to play video games on the train ride to work.…
D
Data on Kubernetes Community

1 DoK Talks #80- Introduction To Keptn // Brad McCoy & Adam Gardner 1:03:25
1:03:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:03:25
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Keptn is an event-driven orchestration platform with SRE best practices (declarative pipelines, quality gates and automated remediation) available "out of the box". This talk will be an introduction to Keptn, the core usecases and Keptn in the wider IT ecosystem BIO Brad started his career as a Software Engineer at TripAdvisor and then got into DevOps and moved into Fintech where he led the Cloud Engineering Innovation team at Temenos. Brad is an advocate for cloud-native and open source technologies. He contributes to the CNCF Security Technical Advisory Group. He also runs the GitOps working group for Ortelius a project of the CD Foundation. Adam is an Automation Architect and Keptn contributor with a hint of Java development and Search Engine Optimisation. SCUBA diver and craft beer drinker. Not all at the same time.…
D
Data on Kubernetes Community

1 DoK Talks #79- Running Apache Pulsar in Kubernetes // Chris Bartholomew 1:06:45
1:06:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:06:45
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK When I founded Kesque in 2019, my goal was to Kubernetes as the base platform for all our software. Because Kesque was a cloud-based SaaS powered by Apache Pulsar, that meant getting Apache Pulsar, a high-performance streaming solution and Kafka alternative, up and running in Kubernetes. In this talk, I will give an overview of Apache Pulsar and describe how we got Pulsar up and running in Kubernetes. We will cover some of the features of Pulsar that make it "cloud-native" and easy to run in Kubernetes as well as some of the challenges we faced and how we solved them. Kesque was acquired by DataStax, which is a strong supporter of Kubernetes. I will also cover how we continue to use Kubernetes as the foundation for the work we are doing at DataStax around Apache Pulsar. BIO Chris Bartholomew is a Streaming Engineering Leader at DataStax. He has been working with high-performance pub–sub systems for over a decade. He has tested, supported, and operated messaging systems that are deployed in banking, capital markets, and transportation industries. He was the founder and CEO of Kesque, a cloud-based managed service built around Apache Pulsar that was acquired by DataStax.…
D
Data on Kubernetes Community

1 DoK Talks #78- Leverage AI, Big Data workload on Kubernetes with Volcano // Kevin (Zefeng) Wang 1:10:25
1:10:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:10:25
ABSTRACT OF THE TALK Volcano is a system for running high-performance workloads on Kubernetes. It features powerful batch scheduling capability that Kubernetes cannot provide but is commonly required by many classes of high-performance workloads, including Machine Learning, Deep Learning, Big Data, Bioinformatics Computing, etc. These types of workloads typically run on generalized domain frameworks like TensorFlow, Spark, PyTorch, MPI, etc. In this talk, Kevin will introduce how Volcano is integrated with these frameworks to allow users to run their applications without extra adaptation efforts while enjoying remarkable batch scheduling. BIO Kevin Wang is a contributor in the CNCF community since its beginning, leader of the cloud native open source team at Huawei, and co-founder of the KubeEdge and Volcano projects. Kevin has contributed to Kubenretes upstream for years and now spends 100% of his work and focus on wider open source cloud native community development.…
D
Data on Kubernetes Community

1 DoK Talks #77- Kubernetes Essential Tools // Javier Ramos 1:08:59
1:08:59
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:08:59
ABSTRACT OF THE TALK In this talk I will try to summarize my favorite tools for Kubernetes with special emphasis on the newest and lesser known tools which I think will become very popular. BIO Javier has over 15 years of experience in the IT industry as an enterprise architect with especial emphasis on Big Data and DevOps. He has worked in small startups and also large multinationals such Oracle, IBM or Cisco. He holds several certifications in AWS, GCP and Azure and writes regularly on Medium.com. Certified Java Architect/AWS/GCP/Azure/Kubernetes: Microservices/Kubernetes/Serverless, AWS/GCP Kafka/Akka/Flink/Spark, AI, JS/React/Angular/PWA…
D
Data on Kubernetes Community

1 DoK Talks #71 Introducing Kubestr: A new way to benchmark your Kubernetes storage // Michael Cade 1:01:44
1:01:44
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:01:44
Abstract of the talk… Benchmarking storage is not a new concept, this has been happening on storage for a long time. But have we overlooked the benchmarking capabilities or at least the ease in which to achieve this in a cloud-native, container-based, Kubernetes landscape? There has been a rise in stateful workloads and support around persistent storage in Kubernetes is improving. Now we can take our traditional workloads such as SQL Server, Oracle and SAP alongside our data stores for microservices with the same storage system for MongoDB, Cassandra, Redis, MySQL and PostgreSQL. With each of these stateful applications having different performance requirements, it becomes necessary to benchmark the storage backing these Persistent volumes. The CSI (Container Storage Interface) is the standard for creating custom components to work with data storage. This has enabled many more storage vendors to adopt their platforms to the cloud-native approach and offerings. All of this is great, but how do we ensure that the right datastore is used to achieve the performance required for our microservices running these stateful workloads? Bio… A community first technologist for Kasten by Veeam Software. Based in the UK with over 16 years of industry experience with a key focus on technologies such as cloud-native, automation & data management. His role at Kasten is to act as a technical thought leader, community champion and project owner to engage with the community to enable influencers and customers to overcome the challenges of Cloud-Native Data Management and be successful, speaking at events sharing the technical vision and corporate strategy whilst providing ongoing feedback from the field into product management to shape the future success. Key take-aways from the talk… Kubestr can assist here in three ways: · Identify the various storage options present in a cluster. · Validate if the storage options are configured correctly. · Evaluate the storage using common benchmarking tools like FIO.…
D
Data on Kubernetes Community

1 DoK Student Sessions- Why you should learn the Julia Programming Language // Logan Kilpatrick 1:03:40
1:03:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:03:40
Abstract of the talk… In this talk, we will go over what Julia is, why you would want to learn it, and how to contribute to the ecosystem. Bio… Logan is the Community Manager for the Julia Programming Language, a member of the Board of Directors at NumFOCUS, and a Machine Learning Engineer. Outside of work, Logan is a graduate student at Harvard University and Northwestern Universities Pritzker School of Law. Key take-aways from the talk… Viewers will leave with a high-level understanding of the Julia ecosystem, the benefits the language provides, how to contribute to it, and more.…
D
Data on Kubernetes Community

1 DoK Talks #76 AHOY - open source release manager for Kubernetes // Neil White 1:08:15
1:08:15
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:08:15
Abstract of the talk… AHOY is a release manager for Kubernetes, fully open source. Bio… I thought, this is easier explained, in, interpretive rap... Look, if you had one shot, one opportunity To fix everything that was ever broke In one deployment Would you rebuild it, or just let it fail? Yo His palms are sweaty, config weak, server load is heavy There's vomit on this keyboard already, Kubernetes He's nervous, but the control plane says Nodes are ready Fork Bomb, but he keeps on forgetting Commands he wrote down, to restart all the pods He opens up Wikis, but the page won't load out Server's chokin now, everybody's phoning now. Release window runs out, times up, over bloaw! Roll back to the previous, oh this is grievous Oh there goes docker, it's borked! He's mad cause K8 don't, downgrade that easily? No He won't have it, he knows, manual deploys back’s on the ropes It don’t’ matter He's dope, he knows that, but he broke He's whole Openstack So when he goes back to his mobile app,that's when it's Back to the Trello Board yo, this whole Agile thing sux He better go capture the screen logs next time using tmux You better lose yourself in this DevOps, the SecOps You own it, you better never let it go Ho! You only get one shot, do not miss your change to Window This opportunity comes once in a weeks time Yo! So that is what I do on a typical day. For those who want the more traditional summary.... Open source consultant specializing in enterprise grade solutions based on open source software and open source integration. Worked with multiple DevOps teams inside numerous South Africa companies, to help bring these worlds together. Then watching the worlds explode. Sometimes for the good, sometimes for the bad. Currently a strong focus on Kubernetes. And then all the usual toppings, GitOps, CI/CD tooling, automation, test-driven infrastructure, and everything cloud native. Key take-aways from the talk… It automatically incorporates GitOps, auto-generation of Helm charts, all stored on Git.…
D
Data on Kubernetes Community

1 DoK Talks #75 Run production-grade Databases on Kubernetes using KubeDB // Tamal Saha 1:08:55
1:08:55
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:08:55
Abstract of the talk… Kubernetes has emerged as the de-facto way to deploy modern containerized apps on cloud or on-premises. Despite all that growth on the application layer, it remains challenging to run a database in a distributed environment like Kubernetes. However, many developers want to treat data infrastructure the same as application stacks. Operators want to use the same tools for databases and applications and get the same benefits as the application layer in the data layer: rapid spin-up and repeatability across environments. This is where KubeDB by AppsCode comes as a solution. KubeDB by AppsCode is a production-grade cloud-native database management solution for Kubernetes. KubeDB simplifies and automates routine database tasks such as provisioning, patching, backup, recovery, failure detection, and repair for various popular databases on private and public clouds. Bio… Tamal Saha is the Founder/CEO of AppsCode. Tamal was born and raised in Bangladesh. He completed his BSc in CSE from the top engineering school in Bangladesh. He came to the USA as a Ph.D. student at the University of Virginia in 2009. After finishing MSc, he dropped out of the PhD program and spent the next 5 years building core infrastructure software that powers Amazon retail and Google Ads business. While at Google in 2015, he became an early adopter and contributor to Kubernetes. Since he has developed well known projects in the Kubernetes space like KubeDB, Stash, Kubeform, KubeVault, Voyager, etc. Key take-aways from the talk… - Kubernetes is a viable platform for running production-grade databases. - Helm charts are not enough for Day 2 life cycle management of databases. - Users really need an operator based solution for running databases in Kubernetes.…
D
Data on Kubernetes Community

1 DoK Talks #74 Introducing LitmusChaos 2.0 // Prithvi Raj & Sayan Mondal 1:17:05
1:17:05
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:17:05
Abstract of the talk… Chaos Engineering in the past few years has evolved with the development of the Cloud-Native paradigm and with the introduction of the LitmusChaos tool which helps curate Cloud-Native Chaos Engineering. Belonging to the CNCF ecosystem, Litmus stresses upon applying Chaos Engineering to modern architectures and building resilient applications simultaneously. The features and ideas majorly surround Community Collaboration, Observability, GitOps, Open APIs & much more. Chaos testing is seen as a must for all Cloud-Native practitioners emphasizing reliability as a container challenge. This talk stresses the evolution of Chaos Engineering with the introduction of the LitmusChaos 2.0 Bio… Prithvi Raj has been an active community manager in not only building LitmusChaos, the CNCF project based on Cloud-Native Chaos Engineering but also the overall Chaos Engineering community by working extensively as a co-organizer of the global conference Chaos Carnival as well as hosting monthly meetups at the CNCF Kubernetes Chaos Engineering Meetup group. His love for Chaos Engineering started in 2020 and has been growing ever since leading to the active adoption of chaos testing as a means of testing. He has also worked as a co-organizer for KCD Bengaluru 2021 and has delivered talks at various forums including KCD Africa, Cloud-Native Students, etc. Other than this Prithvi loves listening to Indie pop music and is an ardent fan of Liverpool FC. Watching cricket and wrestling are also his leisure activities. He loves collecting currencies as a hobby! Sayan currently works at ChaosNative as a Chaos Engineer and contributor to Litmus. He helps design, architect, and execute the Litmus Web with precision. He is not only passionate about development but also curious about breaking stuff with the practice of Chaos Engineering.…
D
Data on Kubernetes Community

1 DoK Talks #73- Build Reproducible Experiments with Kubeflow and lakeFS // Barak Amar 1:00:45
1:00:45
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:00:45
Abstract of the talk… Kubeflow is a cloud-native ML platform that simplifies the training and deployment of machine learning pipelines on Kubernetes. lakeFS is wrapper layer around an object store that enables git-like operations such as branching and committing over datasets. Learn how to build ML workflows that are portable, scalable, and reproducible by integrating lakeFS operations into your Kubeflow pipeline components Bio… Barak is a software developer of the open source lakeFS project at Treeverse. He has more than 15 years of experience in a wide range of systems, from scheduling to security. He’s passionate about three things: programming languages, distributed systems, and long-distance running.…
D
Data on Kubernetes Community

1 DoK Talks #72- Highly available, pluggable and long term storage metrics for everyone. Extending Prometheus with Thanos // Wiard van Rij 1:09:21
1:09:21
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:09:21
Abstract of the talk… Prometheus was initially made for short metric retention to answer questions on “what is happening ‘now’”. It is a strong project that solves certain problems really well, but still as a monolith when doing so. Thanos has been made to enable scaling, highly available setups and long term (cheap) storage for Prometheus. Everyone could leverage Thanos for these features. It does not stop there; Thanos has multiple components that could be used for multi-cluster telemetry, remote writes, and multi-tenancy. We want to introduce everyone to Thanos. Explaining the use-cases and how it could benefit your stack now observability becomes such an important factor in tech. Bio… Wiard van Rij’s main focus is in the field of observability at Fullstaq. As a consultant he is helping people, teams, and organizations with various cloud-native challenges with a strong focus on Kubernetes and Observability. Wiard is a Thanos team member, open source enthusiast and has extra fun with security and hacking. Key take-aways from the talk… - Introduction to Prometheus and Thanos - How to extend your stack(s) for highly available and long term metrics - By leveraging the right tools and services, one can have a rich set of features which are also cost effective…
D
Data on Kubernetes Community

1 Intro to Persistence in Kubernetes 57:30
57:30
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי57:30
Abstract of the talk… So you want want to run stateful containers? A brief intro to persistence in Kubernetes. Kubernetes and persistent storage go together like oil and water. Kubernetes is inherently an ephemeral system and persistent storage by definition must survive. After his talk, you should have a clear understanding of the how to get started on the path to successfully manage a persistent data storage solution on your Kubernetes cluster Bio… Kunal - Junior pursuing Computer Science & Engineering. Co-founder at Code for Cause. CNCF Intern 2020. MLH Coach. Google Summer of Code Mentor. YouTuber. Gold Microsoft Learn Student Ambassador. Eric has worked with distributed systems for the better part of a decade. From databases to containerization. He has been an architect, developer, and network engineer. Currently he is in developer relations trying to bring his enthusiasm for tech to the rest of the world.…
D
Data on Kubernetes Community

1 DoK Talks #70 - YugabyteDB - Distributed SQL Database on Kubernetes // Amey Banarse 1:12:25
1:12:25
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:12:25
Abstract of the talk… Kubernetes has hit a home run for stateless workloads, but can it do the same for stateful services such as distributed databases? Before we can answer that question, we need to understand the challenges of running stateful workloads on, well anything. In this talk, we will first look at which stateful workloads, specifically databases, are ideal for running inside Kubernetes. Secondly, we will explore the various concerns around running databases in Kubernetes for production environments, such as: - The production-readiness of Kubernetes for stateful workloads in general - The pros and cons of the various deployment architectures - The failure characteristics of a distributed database inside containers In this session, we will demonstrate what Kubernetes brings to the table for stateful workloads and what database servers must provide to fit the Kubernetes model. This talk will also highlight some of the modern databases that take full advantage of Kubernetes and offer a peek into what’s possible if stateful services can meet Kubernetes halfway. We will go into the details of deployment choices, how the different cloud-vendor managed container offerings differ in what they offer, as well as compare performance and failure characteristics of a Kubernetes-based deployment with an equivalent VM-based deployment. Bio… Amey is a VP of Data Engineering at Yugabyte with a deep passion for Data Analytics and Cloud-Native technologies. In his current role, he collaborates with Fortune 500 enterprises to architect their business applications with scalable microservices and geo-distributed, fault-tolerant data backend using YugabyteDB. Prior to joining Yugabyte, he spent 5 years at Pivotal as Platform Data Architect and has helped enterprise customers across multiple industry verticals to extend their analytical capabilities using Pivotal & OSS Big Data platforms. He is originally from Mumbai, India, and has a Master's degree in Computer Science from the University of Pennsylvania(UPenn), Philadelphia. Twitter: @ameybanarse LinkedIn: linkedin.com/in/ameybanarse/…
D
Data on Kubernetes Community

1 DoK Talks #67- Run Apache APISIX in Kubernetes // Jintao Zhang 43:29
43:29
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי43:29
Abstract of the talk… Apache APISIX is a dynamic, real-time, high-performance API gateway. You can use Apache APISIX to handle traditional north-south traffic, as well as east-west traffic between services. It can also be used as a k8s ingress controller. In this talk, Jintao Zhang will introduce how to run Apache APISIX on k8s and how to use Apache APISIX as an ingress controller. Bio… Apache APISIX committer, Kubernetes ingress-nginx reviewer…
D
Data on Kubernetes Community

1 DoK #69- To Certify or Not to Certify, is Kubernetes Certification Worth it? // Keith McClellan 1:09:15
1:09:15
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:09:15
Abstract of the talk… As an engineer, should I consider getting a certification? What makes a certification valuable to me or my employer? How do I pick which one to get? Will these really help me build stateful applications on Kubernetes? In this talk, we will discuss the relative value of certifying on different technologies, with a specific focus on CNCF certifications for administration of k8s and developing Kubernetes-native applications. In this session we will discuss: - The pros and cons of getting certified - Why your current and future employers might care about your certifications - What are other things you can do to make yourself a more attractive candidate in this cloud-native landscape And of course, since Keith is a long-time database geek, we'll talk about how these might help you (or not) build stateful applications on Kubernetes. Bio… Keith McClellan is the Director of Partner Solutions Engineering at Cockroach Labs. He is responsible for building CockroachDB-based solutions with our largest technology partners, including Kubernetes and the broader open-source ecosystem. He spearheaded Cockroach Labs' Kubernetes operator project, acting as the technical lead on the project and being a primary contributor to making that the best way to run CockroachDB on Kubernetes. Prior to Cockroach Labs, Keith has held technical leadership positions in cloud-native and big data companies including DataStax, Mesosphere (now D2IQ), and Platfora.…
D
Data on Kubernetes Community

1 DoK Talks #68- The Kubernetes-native way to providing database services to developers // Adam Sandor 58:10
58:10
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי58:10
Bio… Adam is a Solutions Architect at Styra, helping companies adopt Cloud Native tech. Coming from a Java-dev background he is most excited about the space where software development and operations meet. When he is not flying in DCS World he is hacking on CI/CD, GitOps and Kubernetes Operators. Abstract of the talk… As Kubernetes is becoming the infrastructure platform of choice in many companies, database teams are struggling with the question whether to run databases on the cluster or outside of it. This talk will not answer that age-old question. I would rather focus everyone's attention to the developer experience. Modern CI/CD processes need development teams to be flexible and able to deliver without having to ask other teams for resources. So how can a database team provide reliable service while upping their game in DevEx? The answer in Kubernetes-land is providing Custom Resources backed by operators which handle database provisioning on or off the cluster in a way that fits into a GitOps CI/CD workflow. In this talk I will explain the concepts and dive into how you can make your own operator to provide a self-service interface for developers.…
D
Data on Kubernetes Community

1 DoK #66 Crossplane Packages as a Distribution Mechanism // Daniel Mangum 1:05:40
1:05:40
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:05:40
Abstract of the talk… A typical user's journey with Crossplane starts with provisioning infrastructure using the Kubernetes API, then evolves to composing infrastructure into higher level abstractions, and culminates with building a complete platform using packages. Crossplane packages are distributed as OCI images, meaning that a platform API can easily be reproduced in any cluster, and they can declare dependencies, which specify the lower level services that support the higher level abstractions. This functionality allows for companies to distribute their product in an infrastructure provider-agnostic manner, and for infrastructure admins to build internal platforms made up of both generic and organization-specific components. Bio… Daniel Mangum is a senior software engineer at Upbound where he is a maintainer of Crossplane, an open source CNCF project. He has held leadership positions in the Kubernetes community, and is an active participant in multiple other open source efforts. When not working in the Cloud Native space, Daniel spends his time writing, speaking, and building tooling for the RISC-V ISA. Key take-aways from the talk… This talk will be useful for folks building an internal infrastructure platform, as well as folks that build a product that depends on some form of infrastructure (databases, caches, blob storage, etc.). We will cover how to both build and consume packages, paving the way for advanced usage of Crossplane.…
D
Data on Kubernetes Community

1 DoK #65 Using Kubernetes and ClickHouse to enable high performance app analytics // Robert Hodges 1:06:30
1:06:30
הפעל מאוחר יותר
הפעל מאוחר יותר
רשימות
לייק
אהבתי1:06:30
Abstract of the talk… Embedded analytics are a major source of value to application users. Virtually every SaaS offering has them or is adding them now. This talk shows how to build low latency analytic applications on Kubernetes with ClickHouse, a popular, open source data warehouse. We'll start with the ClickHouse Kubernetes Operator to manage data warehouses, then cover ingest and visualization options to build a complete apps. Since this is a K8s talk, we'll of course geek out on the underlying plumbing as well. Bio… Robert is database geek. He started with M204 in the early 1980s and is now working on ClickHouse 20 years later, with digressions into virtualization, SaaS, and security topics. He met up with Kubernetes in 2018 at VMware. Robert is CEO of Altinity, which enables enterprises to use ClickHouse successfully. Key take-aways from the talk… 1. What are embedded analytics? 2. How can you build them with Kubernetes? 3. What are some of the design issues you'll need to consider?…
ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.