התחל במצב לא מקוון עם האפליקציה Player FM !
GPT-5 has Arrived
Manage episode 498916367 series 3611272
GPT-5 will change how hundreds of millions of people use AI. Yes, you might have to forgive the chart crimes, the underwhelming livestream and Altman hype… But it’s a good model. I have read the 50 page system card in full, have the benchmark scores, coding tests, and things you might have missed.
https://app.grayswan.ai/ai-explained
Announcement: https://openai.com/index/introducing-gpt-5/
System Card: https://cdn.openai.com/pdf/8124a3ce-ab78-4f06-96eb-49ea29ffb52f/gpt5-system-card-aug7.pdf
Extra Paper: https://cdn.openai.com/pdf/be60c07b-6bc2-4f54-bcee-4141e1d6c69a/gpt-5-safe_completions.pdf
Altman tweet: https://x.com/sama/status/1953551377873117369
Livestream: https://www.youtube.com/watch?v=0Uu_VJeVVfo
METR Report: https://metr.github.io/autonomy-evals-guide/gpt-5-report/
ARC-AGI-2: https://x.com/fchollet/status/1953511631054680085
Claude Opus 4.1: https://www.anthropic.com/news/claude-opus-4-1
MMMU: https://mmmu-benchmark.github.io/
Cursor Praise: https://x.com/ryolu_/status/1953531724895596669
36 פרקים
Manage episode 498916367 series 3611272
GPT-5 will change how hundreds of millions of people use AI. Yes, you might have to forgive the chart crimes, the underwhelming livestream and Altman hype… But it’s a good model. I have read the 50 page system card in full, have the benchmark scores, coding tests, and things you might have missed.
https://app.grayswan.ai/ai-explained
Announcement: https://openai.com/index/introducing-gpt-5/
System Card: https://cdn.openai.com/pdf/8124a3ce-ab78-4f06-96eb-49ea29ffb52f/gpt5-system-card-aug7.pdf
Extra Paper: https://cdn.openai.com/pdf/be60c07b-6bc2-4f54-bcee-4141e1d6c69a/gpt-5-safe_completions.pdf
Altman tweet: https://x.com/sama/status/1953551377873117369
Livestream: https://www.youtube.com/watch?v=0Uu_VJeVVfo
METR Report: https://metr.github.io/autonomy-evals-guide/gpt-5-report/
ARC-AGI-2: https://x.com/fchollet/status/1953511631054680085
Claude Opus 4.1: https://www.anthropic.com/news/claude-opus-4-1
MMMU: https://mmmu-benchmark.github.io/
Cursor Praise: https://x.com/ryolu_/status/1953531724895596669
36 פרקים
כל הפרקים
×ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.