big data is not about the data! - harvard...

79
Big Data is Not About the Data! Gary King 1 Institute for Quantitative Social Science Harvard University (Gov2001: Advanced Quantitative Research Methodology, 4/28/2013) 1 GaryKing.org 1/8

Upload: others

Post on 19-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Big Data is Not About the Data!

Gary King1

Institute for Quantitative Social ScienceHarvard University

(Gov2001: Advanced Quantitative Research Methodology, 4/28/2013)

1GaryKing.org1 / 8

Page 2: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 3: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 4: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 5: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 6: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 7: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 8: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 9: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 10: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 11: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 12: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 13: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data In Big Data (about people)

The Last 50 Years:

� Survey research

� Aggregate government statistics

� One off studies of individual places, people, or events

The Next 50 Years: Fast increases in new data sources, due to. . .

� Much more of the above — improved, expanded, and applied

� Shrinking computers & the growing Internet: data everywhere

� The replication movement: data sharing (e.g., Dataverse)

� Governments encouraging data collection & experimentation

� Advances in statistical methods, informatics, & software

� You are part of a tectonic movement: The march ofquantification through academia, professions, government, &commerce (SuperCrunchers, The Numerati, MoneyBall)

2 / 8

Page 14: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples

1. Unstructured text: emails, speeches, reports, social mediaupdates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year

3 / 8

Page 15: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples1. Unstructured text: emails, speeches, reports, social media

updates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year

3 / 8

Page 16: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples1. Unstructured text: emails, speeches, reports, social media

updates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year

3 / 8

Page 17: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples1. Unstructured text: emails, speeches, reports, social media

updates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year

3 / 8

Page 18: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples1. Unstructured text: emails, speeches, reports, social media

updates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year

3 / 8

Page 19: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples1. Unstructured text: emails, speeches, reports, social media

updates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year

3 / 8

Page 20: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples1. Unstructured text: emails, speeches, reports, social media

updates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year

3 / 8

Page 21: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples1. Unstructured text: emails, speeches, reports, social media

updates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year

3 / 8

Page 22: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples1. Unstructured text: emails, speeches, reports, social media

updates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year

3 / 8

Page 23: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Data in Big Data: Examples1. Unstructured text: emails, speeches, reports, social media

updates, web pages, newspapers, scholarly literature, productreviews

2. Commerce: credit cards, sales, real estate transactions, RFIDs

3. Geographic location: cell phones, Fastlane, garage cameras

4. Health information: digital medical records, hospitaladmittances, accelerometers & other devices in cell phones

5. Biological sciences: genomics, proteomics, metabolomics,imaging producing numerous person-level variables

6. Satellite imagery: increasing in scope & resolution

7. Electoral activity: ballot images, precinct-level results,individual-level registration, primary participation, campaigncontributions

8. Web surfing artifacts: clicks, searches, and advertisingclickthroughs, multiplayer games, virtual worlds

9. > 90% of all data ever created was created last year3 / 8

Page 24: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:

� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics

� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 25: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:

� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics

� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 26: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements

� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics

� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 27: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements� becoming commoditized

� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics

� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 28: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year

� With a bit of effort: huge data production increases

� Where the Value is: the Analytics

� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 29: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics

� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 30: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics

� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 31: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics� Output can be highly customized

� Moore’s law (doubling speed/power every 18 months) v. 1000xincrease with one algorithm

� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 32: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm

� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 33: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design

� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 34: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed

� Innovative analytics: enormously better than off-the-shelfapproaches

4 / 8

Page 35: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Value in Big Data: the Analytics

� Data:� easy to come by; often a free byproduct of IT improvements� becoming commoditized� Ignore it & every institution will still have more every year� With a bit of effort: huge data production increases

� Where the Value is: the Analytics� Output can be highly customized� Moore’s law (doubling speed/power every 18 months) v. 1000x

increase with one algorithm� $2M computer v. 2 hours of algorithm design� Low cost; little infrastructure; mostly human capital needed� Innovative analytics: enormously better than off-the-shelf

approaches

4 / 8

Page 36: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists:

A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise:

A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts:

A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 37: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists:

A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise:

A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts:

A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 38: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews

billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise:

A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts:

A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 39: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise:

A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts:

A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 40: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise:

A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts:

A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 41: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek?

500K people carrying cell phones withaccelerometers

� Social contacts:

A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 42: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts:

A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 43: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts:

A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 44: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts: A survey: “Please tell me your 5 bestfriends”

continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 45: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts: A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 46: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts: A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries:

Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 47: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts: A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries: Dubious ornonexistent governmental statistics

satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 48: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts: A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries: Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 49: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts: A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries: Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 50: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

Examples of what’s now possible

� Opinions of activists: A few thousand interviews billions ofpolitical opinions in social media posts (1B every 2 Days)

� Exercise: A survey: “How many times did you exercise lastweek? 500K people carrying cell phones withaccelerometers

� Social contacts: A survey: “Please tell me your 5 bestfriends” continuous record of phone calls, emails, textmessages, bluetooth, social media connections, address books

� Economic development in developing countries: Dubious ornonexistent governmental statistics satellite images ofhuman-generated light at night, road networks, otherinfrastructure

� Many, many, more. . .

� In each: without new analytics, the data are useless

5 / 8

Page 51: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The End of The Quantitative-Qualitative Divide

� Qualitative researchers: overwhelmed by information; needhelp

� Quantitative researchers: recognize the huge amounts ofinformation in qualitative analyses, starting to analyzeunstructured text, video, audio as data

� Expert-vs-analytics contests: Whenever enough information isquantified, a right answer exists, and good analytics areapplied: analytics wins

6 / 8

Page 52: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The End of The Quantitative-Qualitative Divide

�� Qualitative researchers: overwhelmed by information; needhelp

� Quantitative researchers: recognize the huge amounts ofinformation in qualitative analyses, starting to analyzeunstructured text, video, audio as data

� Expert-vs-analytics contests: Whenever enough information isquantified, a right answer exists, and good analytics areapplied: analytics wins

6 / 8

Page 53: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The End of The Quantitative-Qualitative Divide

�� Qualitative researchers: overwhelmed by information; needhelp

� Quantitative researchers: recognize the huge amounts ofinformation in qualitative analyses, starting to analyzeunstructured text, video, audio as data

� Expert-vs-analytics contests: Whenever enough information isquantified, a right answer exists, and good analytics areapplied: analytics wins

6 / 8

Page 54: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The End of The Quantitative-Qualitative Divide

�� Qualitative researchers: overwhelmed by information; needhelp

� Quantitative researchers: recognize the huge amounts ofinformation in qualitative analyses, starting to analyzeunstructured text, video, audio as data

� Expert-vs-analytics contests: Whenever enough information isquantified, a right answer exists, and good analytics areapplied: analytics wins

6 / 8

Page 55: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The End of The Quantitative-Qualitative Divide

�� Qualitative researchers: overwhelmed by information; needhelp

� Quantitative researchers: recognize the huge amounts ofinformation in qualitative analyses, starting to analyzeunstructured text, video, audio as data

� Expert-vs-analytics contests: Whenever enough information isquantified, a right answer exists, and good analytics areapplied: analytics wins

6 / 8

Page 56: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

�� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 57: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 58: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 59: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 60: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 61: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?

...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 62: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 63: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 64: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science

(aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 65: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”):

transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 66: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms

;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 67: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries

; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 68: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks

;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 69: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media)

; changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 70: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns

; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 71: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health

; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 72: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis

; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 73: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing

; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 74: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics

;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 75: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports

; set standards for evaluating public policy;etc.; etc., etc.

7 / 8

Page 76: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy

;etc.; etc., etc.

7 / 8

Page 77: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.

; etc., etc.

7 / 8

Page 78: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc.

, etc.

7 / 8

Page 79: Big Data is Not About the Data! - Harvard Universityprojects.iq.harvard.edu/files/gov2001/files/evbase-gov2001.pdf · The Data In Big Data (about people) The Last 50 Years: Survey

The Spectacular Success of Quantitative Social Science

What university research has had the biggest impact on you?

� The genetics revolution?

� The Higgs-like particle?

� Exoplanets? The Mars rovers?

� Doubling the human life span in the last century?...

� Quantitative social science (aka “big data,” “data analytics,”“data science”): transformed most Fortune 500 firms;established new industries; altered friendship networks;increased human expressive capacity (social media); changedpolitical campaigns; transformed public health; changed legalanalysis; impacted crime and policing; reinvented economics;transformed sports; set standards for evaluating public policy;etc.; etc., etc.

7 / 8