15 subscribers
התחל במצב לא מקוון עם האפליקציה Player FM !
פודקאסטים ששווה להאזין
בחסות


1 Dave Ramsey: 5 Stages to Build and Scale a Business That Lasts | Entrepreneurship | E344 1:03:38
#218 How Large Language Models Replace Neural Machine Translation with Unbabel’s João Graca
Manage episode 428499235 series 2975363
João Graça, Co-founder and CTO of language operations platform Unbabel, joins SlatorPod to talk about the present and future of large language models (LLMs) and their broad impact across all things translation and localization.
First, the CTO explains how Unbabel was founded to address language barriers for people using services like Airbnb, combining MT with human validation to improve translation quality.
João believes that LLMs are quickly replacing neural MT models as much more R&D is going into LLMs vs NMT. He highlights that LLMs can handle more complex tasks like automatic post-editing, source correction, and cultural adaptation, which were previously difficult to achieve with traditional models.
He also tells the backstory of the company's decision to develop TowerLLM. João shares how Unbabel's approach involves using open-source LLMs, fine-tuning them with multilingual data, and applying techniques like retrieval-augmented generation to improve translation quality in production settings.
Despite the advancements, João acknowledges that human intervention is still necessary for high-stakes translation tasks.
The podcast concludes with the hiring environment for AI talent and the future directions for LLM development, with João expressing optimism about the continued progress and potential of these models.
פרקים
1. Intro (00:00:00)
2. Background and Motivation Behind Unbabel (00:00:34)
3. Research Contributions (00:04:13)
4. NLP and LLM Impact (00:07:10)
5. RAG Approach (00:09:12)
6. Adapting Production Processes (00:11:04)
7. Evaluating Model Usage (00:12:42)
8. Evolution from Neural MT to LLMs (00:13:56)
9. Comparing Price (00:15:44)
10. Why Unbabel Decided to Build TowerLLM (00:16:43)
11. TowerLLM Development Process (00:18:49)
12. Multilingual Model Performance (00:23:07)
13. Model Usage and Commercial Restrictions (00:25:25)
14. Quality Testing Process (00:26:24)
15. TowerLLM Challenges (00:29:20)
16. Future of Translation Technology (00:30:19)
17. Areas of Application for LLMs (00:32:09)
18. Understanding xTOWER (00:34:49)
19. AI Pipelines (00:37:23)
20. Language Coverage (00:38:59)
21. Hiring Environment (00:40:43)
22. Acceleration of LLMs and AI Progress (00:41:51)
247 פרקים
Manage episode 428499235 series 2975363
João Graça, Co-founder and CTO of language operations platform Unbabel, joins SlatorPod to talk about the present and future of large language models (LLMs) and their broad impact across all things translation and localization.
First, the CTO explains how Unbabel was founded to address language barriers for people using services like Airbnb, combining MT with human validation to improve translation quality.
João believes that LLMs are quickly replacing neural MT models as much more R&D is going into LLMs vs NMT. He highlights that LLMs can handle more complex tasks like automatic post-editing, source correction, and cultural adaptation, which were previously difficult to achieve with traditional models.
He also tells the backstory of the company's decision to develop TowerLLM. João shares how Unbabel's approach involves using open-source LLMs, fine-tuning them with multilingual data, and applying techniques like retrieval-augmented generation to improve translation quality in production settings.
Despite the advancements, João acknowledges that human intervention is still necessary for high-stakes translation tasks.
The podcast concludes with the hiring environment for AI talent and the future directions for LLM development, with João expressing optimism about the continued progress and potential of these models.
פרקים
1. Intro (00:00:00)
2. Background and Motivation Behind Unbabel (00:00:34)
3. Research Contributions (00:04:13)
4. NLP and LLM Impact (00:07:10)
5. RAG Approach (00:09:12)
6. Adapting Production Processes (00:11:04)
7. Evaluating Model Usage (00:12:42)
8. Evolution from Neural MT to LLMs (00:13:56)
9. Comparing Price (00:15:44)
10. Why Unbabel Decided to Build TowerLLM (00:16:43)
11. TowerLLM Development Process (00:18:49)
12. Multilingual Model Performance (00:23:07)
13. Model Usage and Commercial Restrictions (00:25:25)
14. Quality Testing Process (00:26:24)
15. TowerLLM Challenges (00:29:20)
16. Future of Translation Technology (00:30:19)
17. Areas of Application for LLMs (00:32:09)
18. Understanding xTOWER (00:34:49)
19. AI Pipelines (00:37:23)
20. Language Coverage (00:38:59)
21. Hiring Environment (00:40:43)
22. Acceleration of LLMs and AI Progress (00:41:51)
247 פרקים
כל הפרקים
×
1 #247 CIOL CEO John Worne on How AI Is Impacting the Language Profession 46:17

1 #246 AI and the Future of Transcreation with Creative Translation CEO Luke Innes 36:56

1 #245 Dealmaker Wows with 10M Translation Education Gift 31:35

1 #244 AI in Multilingual Patient Care with Jaide Health CEO Joe Corkery, MD 35:42

1 #243 Gridly CEO Anna Albinsson on Efficient Localization 29:02

1 #242 CEOs React as Trump Declares English the Sole Official Language of the US 30:06

1 #241 memoQ CEO Peter Reynolds on Adaptive Generative Translation and AI 48:12

1 #240 Anthropic Stuns with Fact About Claude Use for AI Translation 34:34

1 #239 DeepSeek Translation, ElevenLabs AI Dubbing, Sorensen M&A 26:13

1 #238 Why Interpreting Remains a Growth Market with Boostlingo CEO Bryan Forrester 45:18

1 #237 DeepL Voice Launch and Interpreter Job Surprise 30:20

1 #236 The Year in Review and 2025 Predictions! 44:45

1 #235 The Interpreting SAFE AI Task Force with Katharine Allen and Dr. Bill Rivers 47:03

1 #234 RWS and BLEND Appoint New CEOs, Microsoft and Amazon Speech AI 28:14

1 #233 Charles Campbell on Juntos and the Future of Localization in Latin America 32:56

1 #232 Reddit Rises on AI Translation, TransPerfect Buys Studio 27:00

1 #231 AI Impact on Media Localization with ZOO Digital CEO Stuart Green 58:35

1 #230 Christian Elongue on Advancing the Language Industry in Africa 41:54

1 #229 Leading Localization from Asia with EC Innovations’ Sijie Wei 38:16

1 #228 OpenAI’s Advanced Voice Mode and Interpreting, AI Dubs for Instagram 29:26

1 #227 Leading a Boutique LSP with GORR Co-founders Gregor and Olivera Rosulnik 50:32

1 #226 MTPE Use, Pricing, AI Adoption, and Other Findings from the 2024 ALC Survey 29:36

1 #225 Smartcat's Series C, YouTube Dubs, Viva Translate Shuts Down 23:15

1 #224 On AI for Deep Content Understanding with Zetta Cloud Chief Strategist George Bara 43:18

1 #223 Evaluating Translations Objectively with ISO 5060 Lead Christopher Kurz 46:00

1 #222 Hacks, Strikes, Layoffs, Rankings, and Localization Jobs 32:57

1 #221 Localization Is a Top ROI Use Case for GenAI with Lilt CEO Spence Green 45:39

1 #220 Building an AI Video Localization Startup with Supernative Co-founder Paul Ghio 44:40

1 #219 PR Wars over LLM Translation Quality 32:27

1 #218 How Large Language Models Replace Neural Machine Translation with Unbabel’s João Graca 44:59

1 #217 Legal Battles, Bankruptcies, Acquisitions, and AI Interpreting Guidelines 30:10

1 #216 Josh Goldsmith on How Language Experts Become Successful 44:12

1 #215 Humanless LSP as a Fun Weekend Project 34:23

1 #214 Terra Translations CEO Marina Ilari on Why Game Localization Is Her Passion 34:13

1 #213 Phrase CEO Georg Ell on the Arms Race in Language Technology 47:48

1 #212 TRSB CEO Mary Kazamias on Leading Canada’s Largest Language Service Provider 29:09

1 #211 How Many Translation and Interpreting Jobs Will There Be in the Future? 31:10

1 #210 The Future of Language Jobs with tbo.'s Charles Campbell 45:17

1 #209 Sourcing Language Data from the Four Corners of the Earth with XRI Global’s Daniel Wilson 34:56

1 #208 The Normalization of Language AI 32:21

1 #207 The Art and Business of Literary Translation with Dr. B.J. Woodstein 42:42

1 #206 Video Highlight’s Founder on How LLMs Enable New Ways to Engage with Video Content 46:26

1 #205 Deluxe’s Chris Reynolds on Dubbing, Subtitling, and Localization for Hollywood 46:23

1 #204 Language Network CEO Jordan Evans on Sales Tactics and Growth Through M&A 44:06

1 #203 The Top 50 Language AI Companies to Watch in 2024 39:17
ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.