22 subscribers
התחל במצב לא מקוון עם האפליקציה Player FM !
177: AI-Based Data Cleaning, Data Labelling, and Data Enrichment with LLMs Featuring Rishabh Bhargava of refuel
Manage episode 400954497 series 3264623
Highlights from this week’s conversation include:
- The overview of refuel (0:33)
- The evolution of AI and LLMs (3:51)
- Types of LLM models (12:31)
- Implementing LLM use cases and cost considerations (00:15:52)
- User experience and fine-tuning LLM models (21:49)
- Categorizing search queries (22:44)
- Creating internal benchmark framework (29:50)
- Benchmarking and evaluation (35:35)
- Using refuel for documentation (44:18)
- The challenges of analytics (46:45)
- Using customer support ticket data (48:17)
- The tagging process (50:18)
- Understanding confidence scores (59:22)
- Training the model with human feedback (1:02:37)
- Final thoughts and takeaways (1:05:48)
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
441 פרקים
Manage episode 400954497 series 3264623
Highlights from this week’s conversation include:
- The overview of refuel (0:33)
- The evolution of AI and LLMs (3:51)
- Types of LLM models (12:31)
- Implementing LLM use cases and cost considerations (00:15:52)
- User experience and fine-tuning LLM models (21:49)
- Categorizing search queries (22:44)
- Creating internal benchmark framework (29:50)
- Benchmarking and evaluation (35:35)
- Using refuel for documentation (44:18)
- The challenges of analytics (46:45)
- Using customer support ticket data (48:17)
- The tagging process (50:18)
- Understanding confidence scores (59:22)
- Training the model with human feedback (1:02:37)
- Final thoughts and takeaways (1:05:48)
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
441 פרקים
כל הפרקים
×
1 238: What Every Developer Needs to Know About Microservices in 2025 with Mark Fussell, Founder & CEO at Diagrid 52:21


1 237: Startups, Sales, and Spreadsheets: How a Real Estate Developer Built an AI Company 59:17

1 236: Ringing Out the Old: AI's Role in Redefining Data Teams, Tools, and Business Models 53:39


1 235: Pete Soderling on the Evolution of Data Engineering 43:23

1 The PRQL: What AI Founders Need to Know About Data (Before It’s Too Late) with Pete Soderling of Zero Prime Ventures 3:27

1 234: The Cynical Data Guy on AI, Data Tools, and the Future of Coding 35:42


1 233: The Power of a Triple Threat in Data: Business, Engineering, and Strategy with Solomon Kahn of Delivery Layer and Top Data People 57:35

1 The PRQL: From Data Engineer to Data Entrepreneur with Solomon Kahn of Delivery Layer and Top Data People 2:08

1 232: Building a Business Solo: Streaming Data, Synthetic Testing, and Startup Lessons with Michael Drogalis of ShadowTraffic.io 48:22

1 The PRQL: Solopreneurship, Streaming Data, and Synthetic Testing with Michael Drogalis of ShadowTraffic.io 2:27

1 231: From Pre-Med to Product Strategy: Eric Dodds’ Journey in Data and Startups 47:37

1 230: The Cynical Data Guy: Data Tech Debt, Data Mesh, and Dashboard Directives 25:20

1 229: The Future of AI: Superhuman Intelligence, Autonomous Coding, and the Path to AGI with Misha Laskin of ReflectionAI 52:25

1 The PRQL: From Theoretical Physics to AI: Misha Laskin on AGI, Superhuman Intelligence, and Autonomous Coding 4:02

1 228: The Machine Learning Reality Check: When AI Makes Sense for Marketing Attribution with Lew Dawson of Momentum Consulting 41:46

1 227: The Art & Science of Marketing Attribution: From UTMs to Machine Learning with Lew Dawson of Momentum Consulting 1:02:45

1 226: Building Trust in Marketing Data: An Engineer's Guide to Attribution Architecture with Lew Dawson of Momentum Consulting 1:02:52

1 The PRQL: From Data Chaos to Marketing Truth: An Engineer's Guide to Attribution with Lew Dawson of Momentum Consulting 3:38

1 225: The Stone Cold Truth About Data: False Hopes and Hard Truths with The Cynical Data Guy 33:30


1 224: Bridging Gaps: DevRel, Marketing Synergies, and the Future of Data with Pedram Navid of Dagster Labs 53:24

1 The PRQL: Developer Relations, Marketing Synergies, and the Future of Data Platforms with Pedram Navid of Dagster Labs 1:52

1 223: End-of-Year Product Trends: The Cost of Rushing Features with The Cynical Data Guy 22:42


1 222: The Future of Data Modeling: Breaking Free from Tables with Best-Selling Author, Joe Reis of Ternary Data 1:00:48

1 The PRQL: From Tables to AI: The Future of Data Modeling with Best-Selling Author, Joe Reis of Ternary Data 4:29
ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.