Discover with us how to lead QA teams and do web and software QA better. We’re a QA community exploring the latest and best quality assurance approaches to help you with DevOps, Agile, Automation, Performance, API, Mobile, Analytics, RPA, AIOps, Shift-Left and right. Find the tips, tricks, how-to guides, and tools you need to build your QA team, and banish the bugs! Hosted by CTO of Digital Assured, President of Vivit-Worldwide, and Tedx Speaker Jonathon Wright.
…
continue reading
This is The Here's Waldo Podcast where we sit down with top visionaries and creatives in the video game industry. Together, we'll unravel their journeys and learn more about the path they're forging ahead.
…
continue reading
Professional advice, information and inspiration on getting a job making video games. I’m Jason, your career mentor at Game Industry Career Guide. I’ve spent over 15 years in the game industry, doing lots of different jobs: QA Tester, 3D Modeler, Game Designer, Lead Programmer, Studio Technical Director, Director of Studio Operations, and General Purpose Entrepreneur. (Whew!) I’ve got years of experience hiring and managing game people from many backgrounds. I’ve also written for industry ma ...
…
continue reading
Software Testing and related topics from Software Development Expert Alan Richardson. Covering topics like: Software Testing, Exploratory Testing, Test Automation, Test Management, Software Development and Programming. Show notes at https://eviltester.com/show
…
continue reading
Test IO Open Mic Podcast Series is about QA testing, manual testing, and the life of Test IO Community. It discusses testing issues and gets the most value from real people, real cases, including technicalities and career tips.
…
continue reading
Welcome to THE TESTING ACADEMY Where we discuss the latest trends in software Test Automation and help you grow in your Test automation Career and Now Its your Host Promode.
…
continue reading
Join a group of life long friends where we talk about gaming news, technology news, and nerd culture. New episodes every Thursday.
…
continue reading
A podcast by Many Cats Studios, hosted by Chris Goodyear. DNA of Games stands for the Disability, Neurodiversity and Accessibility of Games. Chatting with disabled and neurodiverse developers / content creators that make games and play them we explore how they got into games, how their disability or neurodiversity has effected their life and those awesome gaming moments that stand out to them. Support us! - https://ko-fi.com/manycats
…
continue reading
The ARKeologist's Podcast is a weekly show in which the crew is made up of players from Official PvP Servers on the PC platform. We cover the latest patch notes, threads, and tweets to keep you up-to-date on what is happening and how it effects the Official PvP Server scene for ARK: Survival Evolved. Redbubble Shop - https://www.redbubble.co Join our Discord Channel! - discord.gg/FKqPUc5 Follow me on Twitter - twitter.com/SeanDKnight Follow me on Facebook - https://www.facebook.com/seandknig ...
…
continue reading
Doubting if going to university is the only option to have a successful life? You've come to the right podcast. How Real Life Works is a podcast that prepares young people for life after high school. We discuss education topics and interview professionals who provide tips on what you can do now to get your dream job. Don't wait until after high school to think about your life. Learn How Real Life Works, starting now. If you are a student who is passionate about content creation, join us! Sig ...
…
continue reading
Software testing Podcast is a one-stop marketplace where you can listen to what is happening in the software QA testing industry. Subscribe us now for the best and most popular testing podcasts. Happy Podcasting!!
…
continue reading
Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers Support this podcast: https://podcasters.spotify.com/pod/s ...
…
continue reading
1
[QA] Automated Red Teaming with GOAT: the Generative Offensive Agent Tester
7:19
7:19
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:19
https://arxiv.org/abs//2410.01606 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
1
[QA] EnsemW2S: Can an Ensemble of LLMs be Leveraged to Obtain a Stronger LLM?
7:42
7:42
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:42
This research proposes an innovative ensemble method for weak-to-strong generalization in AI, enhancing LLM performance through collaborative supervision, achieving significant improvements on challenging tasks. https://arxiv.org/abs//2410.04571 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcast…
…
continue reading
1
EnsemW2S: Can an Ensemble of LLMs be Leveraged to Obtain a Stronger LLM?
18:39
18:39
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
18:39
This research proposes an innovative ensemble method for weak-to-strong generalization in AI, enhancing LLM performance through collaborative supervision, achieving significant improvements on challenging tasks. https://arxiv.org/abs//2410.04571 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcast…
…
continue reading
1
[QA] Density estimation with LLMs: a geometric investigation of in-context learning trajectories
8:13
8:13
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
8:13
This study explores LLaMA-2's in-context learning for probability density estimation, revealing unique learning trajectories and interpreting its behavior as adaptive kernel density estimation. https://arxiv.org/abs//2410.05218 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcast…
…
continue reading
1
Density estimation with LLMs: a geometric investigation of in-context learning trajectories
12:31
12:31
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
12:31
This study explores LLaMA-2's in-context learning for probability density estimation, revealing unique learning trajectories and interpreting its behavior as adaptive kernel density estimation. https://arxiv.org/abs//2410.05218 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcast…
…
continue reading
1
Empowering Parents Foster a Positive Gaming Space for Kids with Heidi Vogel of GuardianGamer AI
34:43
34:43
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
34:43
Heidi Vogel is a seasoned entrepreneur with over five companies under her belt, and today she shares her incredible journey from tech pioneer to gaming innovator. As a mother of four, she faced challenges managing her children's digital playground, which inspired her to develop GuardianGamer's monitoring solution. Her latest venture, GuardianGamer …
…
continue reading
1
[QA] Teaching Transformers Modular Arithmetic at Scale
8:29
8:29
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
8:29
This paper enhances modular addition in machine learning by introducing diverse training data, angular embedding, and a custom loss function, improving performance for cryptographic applications and other modular arithmetic problems. https://arxiv.org/abs//2410.03569 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxi…
…
continue reading
1
Teaching Transformers Modular Arithmetic at Scale
13:01
13:01
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
13:01
This paper enhances modular addition in machine learning by introducing diverse training data, angular embedding, and a custom loss function, improving performance for cryptographic applications and other modular arithmetic problems. https://arxiv.org/abs//2410.03569 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxi…
…
continue reading
1
[QA] What Matters for Model Merging at Scale?
7:57
7:57
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:57
This study evaluates model merging at scale, revealing insights on expert model quality, size, and merging methods, ultimately enhancing generalization and performance in large-scale applications. https://arxiv.org/abs//2410.03617 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podc…
…
continue reading
1
What Matters for Model Merging at Scale?
24:32
24:32
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
24:32
This study evaluates model merging at scale, revealing insights on expert model quality, size, and merging methods, ultimately enhancing generalization and performance in large-scale applications. https://arxiv.org/abs//2410.03617 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podc…
…
continue reading
1
[QA] Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
7:17
7:17
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:17
Depth Pro is a fast foundation model for zero-shot monocular depth estimation, producing high-resolution, metric depth maps without metadata, outperforming previous methods in accuracy and detail. https://arxiv.org/abs//2410.02073 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podc…
…
continue reading
1
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
14:08
14:08
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
14:08
Depth Pro is a fast foundation model for zero-shot monocular depth estimation, producing high-resolution, metric depth maps without metadata, outperforming previous methods in accuracy and detail. https://arxiv.org/abs//2410.02073 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podc…
…
continue reading
This work revisits LSTMs and GRUs, introducing minimal versions that eliminate hidden state dependencies, enabling efficient parallel training while matching the performance of recent sequence models. https://arxiv.org/abs//2410.01201 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://…
…
continue reading
This work revisits LSTMs and GRUs, introducing minimal versions that eliminate hidden state dependencies, enabling efficient parallel training while matching the performance of recent sequence models. https://arxiv.org/abs//2410.01201 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://…
…
continue reading
1
[QA] OOD-CHAMELEON: Is Algorithm Selection for OOD Generalization Learnable?
8:08
8:08
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
8:08
The paper introduces OOD-CHAMELEON, a method for selecting algorithms for out-of-distribution generalization by predicting performance based on dataset characteristics, outperforming individual algorithms and heuristics. https://arxiv.org/abs//2410.02735 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Appl…
…
continue reading
1
OOD-CHAMELEON: Is Algorithm Selection for OOD Generalization Learnable?
21:51
21:51
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
21:51
The paper introduces OOD-CHAMELEON, a method for selecting algorithms for out-of-distribution generalization by predicting performance based on dataset characteristics, outperforming individual algorithms and heuristics. https://arxiv.org/abs//2410.02735 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Appl…
…
continue reading
1
[QA] Training Language Models on Synthetic Edit Sequences Improves Code Synthesis
7:53
7:53
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:53
The paper presents LintSeq, a synthetic data generation algorithm that refactors code into edit sequences, improving LLM performance in code synthesis and achieving state-of-the-art results with smaller models. https://arxiv.org/abs//2410.02749 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts…
…
continue reading
1
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis
18:46
18:46
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
18:46
The paper presents LintSeq, a synthetic data generation algorithm that refactors code into edit sequences, improving LLM performance in code synthesis and achieving state-of-the-art results with smaller models. https://arxiv.org/abs//2410.02749 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts…
…
continue reading
1
From Sales to Software Testing - A Vietnamese Tester's Journey | Ep. [2]
4:41
4:41
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
4:41
Join host Charlie as he interviews Duy (Paul), a successful software tester from Vietnam 🇻🇳, in this episode of #TestIOOpenMic. Discover how Paul transitioned from an Area Sales Manager to a top performer on the Test IO platform. 🎬 Episode Highlights: - Paul's love for plot-twist movies like 'Shutter Island' - The journey from IT graduate to sales …
…
continue reading
1
Automated Red Teaming with GOAT: the Generative Offensive Agent Tester
13:38
13:38
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
13:38
https://arxiv.org/abs//2410.01606 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
1
[QA] Not All LLM Reasoners Are Created Equal
7:23
7:23
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:23
https://arxiv.org/abs//2410.01748 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
1
Not All LLM Reasoners Are Created Equal
9:40
9:40
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
9:40
https://arxiv.org/abs//2410.01748 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
1
[QA] Law of the Weakest Link: Cross Capabilities of Large Language Models
7:32
7:32
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:32
https://arxiv.org/abs//2409.19951 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
1
Law of the Weakest Link: Cross Capabilities of Large Language Models
16:18
16:18
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
16:18
https://arxiv.org/abs//2409.19951 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
1
[QA] Realistic Evaluation of Model Merging for Compositional Generalization
8:29
8:29
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
8:29
This paper evaluates various model merging methods for compositional generalization in image classification, generation, and NLP, clarifying their merits, requirements, and computational costs in a shared experimental setting. https://arxiv.org/abs//2409.18314 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_paper…
…
continue reading
1
Realistic Evaluation of Model Merging for Compositional Generalization
21:13
21:13
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
21:13
This paper evaluates various model merging methods for compositional generalization in image classification, generation, and NLP, clarifying their merits, requirements, and computational costs in a shared experimental setting. https://arxiv.org/abs//2409.18314 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_paper…
…
continue reading
1
The Rise of Unity Asia and the Evolution of Cloud Gaming with John Goodale of Jam.gg
49:35
49:35
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
49:35
John Goodale was employee #60 at Unity in 2010, where he launched and rapidly expanded Unity Asia. Under his leadership, he transformed Unity into the preferred platform for game developers across the region, driving exceptional growth and establishing the company as a dominant force in the gaming industry. Having also held key positions at Sega an…
…
continue reading
1
[QA] Emu3: Next-Token Prediction is All You Need
7:43
7:43
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:43
Emu3 introduces a next-token prediction model for multimodal tasks, outperforming existing models and simplifying design by focusing on tokenization of images, text, and videos. https://arxiv.org/abs//2409.18869 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/p…
…
continue reading
1
Emu3: Next-Token Prediction is All You Need
17:28
17:28
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
17:28
Emu3 introduces a next-token prediction model for multimodal tasks, outperforming existing models and simplifying design by focusing on tokenization of images, text, and videos. https://arxiv.org/abs//2409.18869 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/p…
…
continue reading
1
[QA] MIO: A Foundation Model on Multimodal Tokens
8:38
8:38
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
8:38
MIO is a novel multimodal foundation model that excels in understanding and generating speech, text, images, and videos, outperforming existing models in any-to-any capabilities and diverse tasks. https://arxiv.org/abs//2409.17692 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podc…
…
continue reading
1
MIO: A Foundation Model on Multimodal Tokens
19:09
19:09
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
19:09
MIO is a novel multimodal foundation model that excels in understanding and generating speech, text, images, and videos, outperforming existing models in any-to-any capabilities and diverse tasks. https://arxiv.org/abs//2409.17692 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podc…
…
continue reading
1
[QA] A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor ?
7:52
7:52
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:52
The paper evaluates OpenAI's o1 model in medical scenarios, highlighting its enhanced reasoning and accuracy over GPT-4, while also identifying weaknesses and releasing data for further research. https://arxiv.org/abs//2409.15277 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podca…
…
continue reading
1
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor ?
8:52
8:52
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
8:52
The paper evaluates OpenAI's o1 model in medical scenarios, highlighting its enhanced reasoning and accuracy over GPT-4, while also identifying weaknesses and releasing data for further research. https://arxiv.org/abs//2409.15277 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podca…
…
continue reading
1
[QA] Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models
8:44
8:44
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
8:44
The Logic-of-Thought (LoT) prompting method enhances logical reasoning in Large Language Models by integrating propositional logic, significantly improving performance across various reasoning tasks. https://arxiv.org/abs//2409.17539 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://p…
…
continue reading
1
Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models
16:05
16:05
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
16:05
The Logic-of-Thought (LoT) prompting method enhances logical reasoning in Large Language Models by integrating propositional logic, significantly improving performance across various reasoning tasks. https://arxiv.org/abs//2409.17539 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://p…
…
continue reading
1
[QA] Making Text Embedders Few-Shot Learners
7:45
7:45
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:45
We propose bge-en-icl, a model leveraging in-context learning in LLMs for high-quality text embeddings, achieving state-of-the-art performance on MTEB and AIR-Bench benchmarks. https://arxiv.org/abs//2409.15700 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/po…
…
continue reading
1
Making Text Embedders Few-Shot Learners
16:11
16:11
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
16:11
We propose bge-en-icl, a model leveraging in-context learning in LLMs for high-quality text embeddings, achieving state-of-the-art performance on MTEB and AIR-Bench benchmarks. https://arxiv.org/abs//2409.15700 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/po…
…
continue reading
1
[QA] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
6:45
6:45
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
6:45
The paper introduces PROX, a framework enabling small language models to refine data effectively, outperforming human-crafted methods and enhancing efficiency in LLM pre-training across various benchmarks. https://arxiv.org/abs//2409.17115 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
1
Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
8:57
8:57
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
8:57
The paper introduces PROX, a framework enabling small language models to refine data effectively, outperforming human-crafted methods and enhancing efficiency in LLM pre-training across various benchmarks. https://arxiv.org/abs//2409.17115 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
1
[QA] Infer Human's Intentions Before Following Natural Language Instruction
8:18
8:18
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
8:18
The FISER framework enhances AI's ability to follow ambiguous human instructions by inferring intentions, outperforming traditional methods in collaborative tasks, particularly on the HandMeThat benchmark. https://arxiv.org/abs//2409.18073 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
1
Infer Human's Intentions Before Following Natural Language Instruction
27:36
27:36
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
27:36
The FISER framework enhances AI's ability to follow ambiguous human instructions by inferring intentions, outperforming traditional methods in collaborative tasks, particularly on the HandMeThat benchmark. https://arxiv.org/abs//2409.18073 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
1
[QA] MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
7:05
7:05
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:05
This paper presents a learnable pruning method for Large Language Models, achieving efficient N:M sparsity, improved mask quality, and transferability across tasks, outperforming existing techniques in empirical evaluations. https://arxiv.org/abs//2409.17481 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers …
…
continue reading
1
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
15:10
15:10
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
15:10
This paper presents a learnable pruning method for Large Language Models, achieving efficient N:M sparsity, improved mask quality, and transferability across tasks, outperforming existing techniques in empirical evaluations. https://arxiv.org/abs//2409.17481 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers …
…
continue reading
1
[QA] Counterfactual Token Generation in Large Language Models
7:53
7:53
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:53
This paper presents a method to enable large language models to perform counterfactual token generation, enhancing their capabilities without fine-tuning, and applying it for bias detection. https://arxiv.org/abs//2409.17027 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.a…
…
continue reading
1
Counterfactual Token Generation in Large Language Models
14:52
14:52
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
14:52
This paper presents a method to enable large language models to perform counterfactual token generation, enhancing their capabilities without fine-tuning, and applying it for bias detection. https://arxiv.org/abs//2409.17027 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.a…
…
continue reading
1
[QA] Characterizing stable regions in the residual stream of LLMs
7:45
7:45
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:45
The paper identifies stable regions in Transformers' residual streams, showing insensitivity to small changes but high sensitivity at boundaries, aligning with semantic distinctions and clustering similar prompts. https://arxiv.org/abs//2409.17113 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podca…
…
continue reading
1
Characterizing stable regions in the residual stream of LLMs
5:26
5:26
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
5:26
The paper identifies stable regions in Transformers' residual streams, showing insensitivity to small changes but high sensitivity at boundaries, aligning with semantic distinctions and clustering similar prompts. https://arxiv.org/abs//2409.17113 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podca…
…
continue reading
1
[QA] Watch Your Steps: Observable and Modular Chains of Thought
7:30
7:30
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:30
We introduce Program Trace Prompting, enhancing chain of thought explanations with formal syntax, improving observability, and enabling analysis of reasoning errors across diverse tasks in the BIG-Bench Hard benchmark. https://arxiv.org/abs//2409.15359 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple …
…
continue reading
1
Watch Your Steps: Observable and Modular Chains of Thought
29:35
29:35
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
29:35
We introduce Program Trace Prompting, enhancing chain of thought explanations with formal syntax, improving observability, and enabling analysis of reasoning errors across diverse tasks in the BIG-Bench Hard benchmark. https://arxiv.org/abs//2409.15359 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple …
…
continue reading
1
[QA] Seeing Faces in Things: A Model and Dataset for Pareidolia
7:38
7:38
נגן מאוחר יותר
נגן מאוחר יותר
רשימות
לייק
אהבתי
7:38
This paper explores face pareidolia in computer vision, presenting a dataset of annotated images and analyzing the differences in face detection between humans and machines. https://arxiv.org/abs//2409.16143 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podca…
…
continue reading