התחל במצב לא מקוון עם האפליקציה Player FM !
177: AI-Based Data Cleaning, Data Labelling, and Data Enrichment with LLMs Featuring Rishabh Bhargava of refuel
Manage episode 400954497 series 3264623
Highlights from this week’s conversation include:
- The overview of refuel (0:33)
- The evolution of AI and LLMs (3:51)
- Types of LLM models (12:31)
- Implementing LLM use cases and cost considerations (00:15:52)
- User experience and fine-tuning LLM models (21:49)
- Categorizing search queries (22:44)
- Creating internal benchmark framework (29:50)
- Benchmarking and evaluation (35:35)
- Using refuel for documentation (44:18)
- The challenges of analytics (46:45)
- Using customer support ticket data (48:17)
- The tagging process (50:18)
- Understanding confidence scores (59:22)
- Training the model with human feedback (1:02:37)
- Final thoughts and takeaways (1:05:48)
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
465 פרקים
Manage episode 400954497 series 3264623
Highlights from this week’s conversation include:
- The overview of refuel (0:33)
- The evolution of AI and LLMs (3:51)
- Types of LLM models (12:31)
- Implementing LLM use cases and cost considerations (00:15:52)
- User experience and fine-tuning LLM models (21:49)
- Categorizing search queries (22:44)
- Creating internal benchmark framework (29:50)
- Benchmarking and evaluation (35:35)
- Using refuel for documentation (44:18)
- The challenges of analytics (46:45)
- Using customer support ticket data (48:17)
- The tagging process (50:18)
- Understanding confidence scores (59:22)
- Training the model with human feedback (1:02:37)
- Final thoughts and takeaways (1:05:48)
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
465 פרקים
כל הפרקים
×ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.