התחל במצב לא מקוון עם האפליקציה Player FM !
Data Augmentation in Natural Language Processing
Manage episode 481575886 series 2570898
This week’s guests are Steven Feng, Graduate Student and Ed Hovy, Research Professor, both from the Language Technologies Institute of Carnegie Mellon University. We discussed their recent survey paper on Data Augmentation Approaches in NLP (GitHub), an active field of research on techniques for increasing the diversity of training examples without explicitly collecting new data. One key reason why such strategies are important is that augmented data can act as a regularizer to reduce overfitting when training models.
Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.
Detailed show notes can be found on The Data Exchange web site.
Subscribe to The Gradient Flow Newsletter.
294 פרקים
Manage episode 481575886 series 2570898
This week’s guests are Steven Feng, Graduate Student and Ed Hovy, Research Professor, both from the Language Technologies Institute of Carnegie Mellon University. We discussed their recent survey paper on Data Augmentation Approaches in NLP (GitHub), an active field of research on techniques for increasing the diversity of training examples without explicitly collecting new data. One key reason why such strategies are important is that augmented data can act as a regularizer to reduce overfitting when training models.
Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.
Detailed show notes can be found on The Data Exchange web site.
Subscribe to The Gradient Flow Newsletter.
294 פרקים
Tất cả các tập
×
1 The Quantum Advantage Is Real—But Where's the Infrastructure? 45:53

1 From Human-Readable to Machine-Usable: The New API Stack 38:23

1 Why Voice Security Is Your Next Big Problem 41:37

1 Unlocking Unstructured Data with LLMs 27:46

1 Building Production-Grade RAG at Scale 31:24

1 Unlocking AI Superpowers in Your Terminal 44:59

1 From Vibe Coding to Autonomous Agents 51:16

1 How a Public-Benefit Startup Plans to Make Open Source the Default for Serious AI 48:45

1 The Highly Uncertain Future of OpenAI’s Dominance 54:07

1 Beyond Guardrails: Defending LLMs Against Sophisticated Attacks 44:31

1 Navigating the Generative AI Maze in Business 49:35

1 The Practical Realities of AI Development 37:30

1 Beyond the Demo: Building AI Systems That Actually Work 27:36

1 Vibe Coding and the Rise of AI Agents: The Future of Software Development is Here 36:35

1 2025 Artificial Intelligence Index 51:44
ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.