Big Data

by Alicia Reinhardt on 7 January 2014

Transcript of Big Data

Big Data
What Is Big Data?
Data sets too vast for common software tools to handle.
A movement to efficiently mine and analyze all of this data to find patterns, spot trends, and make effective decisions.
The result of our expanding ability to create and capture data and our ever-expanding ability to store all of that data.
It is as much current data that has been born digital as it is past information that has been digitized.
Massive amounts of digital data being created and accumulated consciously and unconsciously in the digital world we live in.
Sources of Big Data
Businesses

Electronic Records
Social Media
Digital Collections
Internet of Things
Google
Pros
What is our big data?
Impact on Libraries
cultural heritage collections
digital books
digital music
digitized historical records
born digital collections
metadata
transactional data, e.g. catalog searches and circulation records
Topics for Library Professionals to Consider
The need for straightforward tools that allow individual scholars to perform computational research (The Signal: "One Culture" by Trevor Owens)

Data Seal of Approval developed by the Data Archiving & Network Services

Can our infrastructure support the downloading of large quantities of data? Can it support real-time querying of thousands, millions, or billions of full-text items?

If we provide access to data, do we leave it in its native format or provide services to transpose the data to a different format? (The Signal: "Big Data and the Dawn of the Super Researcher" by Mike Ashenfelder)

What kind of training and support do we provide to the public, and how much?
Pros
Nowcasting ("Now" + "Forecasting")
Google Flu Trends (www.google.org/flutrends)
Increased productivity comparable to the uptick seen during the PC/Internet revolutions
Possibility of an "open source branch of medicine" (New York Times: Big Data in Your Blood by Quentin Hardy)
Unknown: the sky could be the limit, and only time will tell
Cons
Biased fact finding

False discoveries or apophenia

Privacy Concerns

"Catastrophic miscalculations" (Pew Research Center Report on Big Data)

Extreme targeted advertising

"You loop" (Pew Report)
Remember, "big" is relative. Our "big" could be somebody else's "small."

The treatment of a collection can redefine it as big data.
Big Data Movement
Software
Hadoop
Google Dremel
Infochimps
MapReduce
Factual
Organizations
Google
University of Rochester
Walmart
Humanities/Education
Digging into Data Challenge
Creating a new field, Master of Science in Business Analytics
The Cloud
Scientific Research
Digital Devices
The Rabbit Hole
A movement to look at these large data sets from a holistic viewpoint rather than as individual pieces of information.
http://prezi.com/ymrgyy160obr