התחל במצב לא מקוון עם האפליקציה Player FM !
פודקאסטים ששווה להאזין
בחסות
Big Data Quality, Then and Now
Manage episode 326421750 series 3331732
A decade ago, just before the beginning of the data science hype cycle was the big data hype cycle. At that time I had the privilege of sitting down with Ph.D. Statistician Dr. Thomas C. Redman (aka the “Data Doc”).
We discussed whether data quality matters less in larger data sets, if statistical outliers represent business insights or data quality issues, statistical sampling errors versus measurement calibration errors, mistaking signal for noise (i.e., good data for bad data), and whether or not the principles and practices of true “data scientists” will truly be embraced by an organization’s business leaders.
This episode is an edited and slightly shortened version of that discussion, which even though it is from ten years ago, I think it still provides good insight into big data quality, then and now.
Extended Show Notes: ocdqblog.com/dbp
Follow Jim Harris on Twitter: @ocdqblog
Email Jim Harris: ocdqblog.com/contact
Other ways to listen: bit.ly/listen-dbp
10 פרקים
Manage episode 326421750 series 3331732
A decade ago, just before the beginning of the data science hype cycle was the big data hype cycle. At that time I had the privilege of sitting down with Ph.D. Statistician Dr. Thomas C. Redman (aka the “Data Doc”).
We discussed whether data quality matters less in larger data sets, if statistical outliers represent business insights or data quality issues, statistical sampling errors versus measurement calibration errors, mistaking signal for noise (i.e., good data for bad data), and whether or not the principles and practices of true “data scientists” will truly be embraced by an organization’s business leaders.
This episode is an edited and slightly shortened version of that discussion, which even though it is from ten years ago, I think it still provides good insight into big data quality, then and now.
Extended Show Notes: ocdqblog.com/dbp
Follow Jim Harris on Twitter: @ocdqblog
Email Jim Harris: ocdqblog.com/contact
Other ways to listen: bit.ly/listen-dbp
10 פרקים
כל הפרקים
×

1 Machine Learning is Label Making 15:00

1 Cloudy with a Chance of Data Analytics 27:36


1 Three Questions for Data Analytics 12:44


1 Home Schooling your Machine Learning Model 11:39


1 Defining Data Analytics, Machine Learning, and Data Science 24:42

ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.