Loading presentation...

Present Remotely

Send the link below via email or IM

Copy

Present to your audience

Start remote presentation

  • Invited audience members will follow you as you navigate and present
  • People invited to a presentation do not need a Prezi account
  • This link expires 10 minutes after you close the presentation
  • A maximum of 30 users can follow your presentation
  • Learn more about this feature in our knowledge base article

Do you really want to delete this prezi?

Neither you, nor the coeditors you shared it with will be able to recover it again.

DeleteCancel

Make your likes visible on Facebook?

Connect your Facebook account to Prezi and let your likes appear on your timeline.
You can change this under Settings & Account at any time.

No, thanks

BIG DATA

No description
by

Lori Katlin

on 7 October 2014

Comments (0)

Please log in to add your comment.

Report abuse

Transcript of BIG DATA

BIG DATA & THE CLOUD
What is Big Data?
A massive volume of structured and unstructured data that is so large that it is difficult to process using traditional database techniques.
Big Data
Pros:
Future cost and time reductions, minimal risks, and smarter business decision making
Unlock significant value by making information transparent

Cons:
Data mining threatens privacy
Risk analysis for new employees
Focuses more on big institutions and away from individuals
What is the Cloud?
Flexible & Versatile Platforms
Storing big data requires processors that are fast and capable of storing petabytes of data

eBay
90PB data warehouse
two 40PB Hadoop clusters

It is important to realize and manage information with this magnitude


Jessica Hayes, Lori Katlin, & David Wilson
"We're undergoing a revolution in the way that information about our purchases, our conversations, our social networks, our movements and even our physical identities are collected, stored, analyzed, and used."

"So we need to find a way to balance the law that protects our privacy with the ability to have technology."

-John Podesta, Counselor to the President
Random Facts about Data
Every hour enough information is consumed by internet traffic to fill 7 million DVDs
The world's 500,000 plus data centers are large enough to fill 5,955 football fields
By 2020 one third of all data will be stored or have passed through the cloud which will have created 35,000,000,000,000,000,000,000 bytes of data
Cloud computing is internet-based computing in which large groups of remote servers are networked in real time to allow the centralized data storage, and online access to computer services or resources
On demand self-service
Network access anywhere, any time, any device
Rapid Elasticity
Big Data Uses
***from 2011
Many organizations are using big data to compute machine-learning algorithms to...
Identify which burglaries likely were committed by the same offender
Detect signs of gambling addictions through online gaming
Analyze customer shopping patterns to improve the layout of their stores
Use predictive modeling to identify college students who are at risk of dropping out
One of most controversial big data implementations was used by Target
Privacy Issues
Conflict between privacy and technology

Personal data is easily access through social media, as well as our GPS locator on our phones

Organizations use this data for marketing, advertising, and other business activities

SnapChat example

The future shows that once data is created, data is permanent
References
http://www.whitehouse.gov/sites/default/files/microsites/ostp/PCAST/pcast_big_data_and_privacy_-_may_2014.pdf

http://iveybusinessjournal.com/topics/strategy/why-big-data-is-the-new-competitive-advantage#.VDIExUsonM5

https://www.aclu.org/blog/technology-and-liberty/eight-problems-big-data

http://searchsoa.techtarget.com/essentialguide/An-architects-guide-How-to-use-big-data#guideSection1

http://www.computerweekly.com/feature/Big-data-applications-bring-new-database-choices-challenges

Cloud- based processing engines
Amazon AWS, Microsoft Azure, and Google Compute Engine


Small and medium sized businesses have access to multimillion dollar hardware platforms without excessive maintenance fees


Modern Data Processing
For the past decade, big data has only been available to large corporations
costly and required high TCO
competitive advantage

With new cloud technologies smaller business and web services have access to versatile and low cost solutions
Big Data and Databases
Majority of data is unstructured and user-generated
Need to find a database that can store, analyze, and compute data efficiently
Need speed and flexibility


Businesses need to adjust their traditional relational databases, or entire IT departments, to support their big data driven systems


Cloud database platforms will enable new industries to realize the benefits of big data and could force them to change existing business processes to accommodate these new tools
Full transcript