Enabling end-to-end machine learning pipelines in real-world applications
In this episode of the Data Show, I spoke with Nick Pentreath, principal engineer at IBM. Pentreath was an early and avid user of Apache Spark, and he subsequently became a Spark committer and PMC member. Most recently his focus has been on machine learning, particularly deep learning, and he is part of a group within IBM focused on building open source tools that enable end-to-end machine learning pipelines.
We had a great conversation spanning many topics, including:
- AI Fairness 360 (AIF360), a set of fairness metrics for data sets and machine learning models
- Adversarial Robustness Toolbox (ART), a Python library for adversarial attacks and defenses
- Model Asset eXchange (MAX), a curated and standardized collection of free and open source deep learning models
- Tools for model development, governance, and operations, including MLflow, Seldon Core, and Fabric for Deep Learning (FfDL)
- Reinforcement learning in the enterprise, and the emergence of relevant open source tools like Ray
Related resources:
- “Modern Deep Learning: Tools and Techniques”—a new tutorial at the Artificial Intelligence conference in San Jose
- Harish Doddi on “Simplifying machine learning lifecycle management”
- Sharad Goel and Sam Corbett-Davies on “Why it’s hard to design fair machine learning models”
- “Managing risk in machine learning”: considerations for a world where ML models are becoming mission critical
- “The evolution and expanding utility of Ray”
- “Local Interpretable Model-Agnostic Explanations (LIME): An Introduction”
- Forough Poursabzi Sangdeh on why “It’s time for data scientists to collaborate with researchers in other disciplines”
15 episodes
All episodes
- Machine learning for operational analytics and business intelligence (51:38)
- Machine learning and analytics for time series data (40:31)
- Understanding deep neural networks (39:31)
- Becoming a machine learning practitioner (33:22)
- Labeling, transforming, and structuring training data sets for machine learning (40:51)
- Acquiring and sharing high-quality data (39:20)
- Tools for machine learning development (39:24)
- Enabling end-to-end machine learning pipelines in real-world applications (42:53)
- Bringing scalable real-time analytics to the enterprise (37:12)
- Applications of data science and machine learning in financial services (42:32)
- Real-time entity resolution made accessible (27:09)
- Why companies are in need of data lineage solutions (34:29)
- What data scientists and data engineers can do with current generation serverless technologies (36:32)
- It’s time for data scientists to collaborate with researchers in other disciplines (36:08)