Transcript of BIG DATA
Thanks for your
A large data set,
mostly generated on internet
It also refers to a
method of data analysis
Big Data (mass data) is difficult to manage with conventional storage and processing solutions.
DATABASE DESIGN AND MODELLING
DATABASE MANAGEMENT SYSTEM
Store unstructured data
Document oriented with key values storage
Increase reliability and availability
Examples: MongoDB, Cassandra (Facebook), Couchbase, HBase
(Not only SQL)
Non-relational DBMS
The logical unit is no longer the table
Suggest alternative solutions to traditional databases and analysis
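As an illustration of document-oriented, key-value storage, here is a minimal in-memory sketch. It is not how MongoDB, Couchbase or any product listed above is implemented; the class `DocumentStore` and its methods are hypothetical names chosen for this example. It only shows the idea that each key maps to a free-form document rather than to a row in a fixed table.

```python
import json


class DocumentStore:
    """Toy in-memory store (hypothetical): each key maps to a schema-free document."""

    def __init__(self):
        self._docs = {}

    def put(self, key, document):
        # Store a JSON copy so later changes to the caller's dict do not leak in.
        self._docs[key] = json.dumps(document)

    def get(self, key):
        raw = self._docs.get(key)
        return None if raw is None else json.loads(raw)

    def find(self, field, value):
        """Return the keys of documents whose top-level field equals value."""
        return [
            key
            for key, raw in self._docs.items()
            if json.loads(raw).get(field) == value
        ]


store = DocumentStore()
# The two documents do not share the same fields: there is no table schema.
store.put("user:1", {"name": "Ada", "tags": ["admin"]})
store.put("user:2", {"name": "Bob", "city": "Lyon"})
print(store.get("user:2"))
print(store.find("name", "Ada"))
```

Real NoSQL systems add replication and partitioning on top of this model, which is where the reliability and availability gains come from.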
A threefold challenge (the 3V rule):
a large volume of data to be processed
a wide variety of information types and data sources
a high level of velocity to achieve:
Project of the Apache Software Foundation
Composed of several modules, including
Two steps for MapReduce
The MAP step
The REDUCE step
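The two steps can be sketched on a single machine with a word count, the usual teaching example. This is an assumption-laden illustration, not the API of any framework: the function names `map_step`, `shuffle`, `reduce_step` and `word_count` are invented here, and the grouping between the two steps is written out by hand, whereas a distributed framework would normally handle it.

```python
from collections import defaultdict
from itertools import chain


def map_step(document):
    """MAP: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group the emitted values by key between the two steps."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_step(key, values):
    """REDUCE: combine all the values of one key into a single result."""
    return key, sum(values)


def word_count(documents):
    # Each document could be mapped on a different machine; here it is a loop.
    mapped = chain.from_iterable(map_step(doc) for doc in documents)
    return dict(reduce_step(key, values) for key, values in shuffle(mapped).items())


counts = word_count(["big data is big", "data is everywhere"])
print(counts)
```

Because each call to `map_step` and `reduce_step` depends only on its own input, the work can in principle be spread over many machines.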
Increase of data produced
New computing tools and technologies :
Lower cost of storage with cloud computing
Storage architecture should be redesigned to fit with large volumes of data.
Cloud Computing : distributed computing over a network
Parallelized database systems : database load is balanced among servers
Improve processing speeds.
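One simple way to balance database load among servers is to hash each record key and use the result to pick a server. The sketch below assumes this hash-partitioning scheme for illustration only; the function `pick_server` and the server names are hypothetical, and production systems often use more elaborate schemes.

```python
import hashlib


def pick_server(key, servers):
    """Map a record key to one server, deterministically, via a hash of the key."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an integer, then wrap around.
    return servers[int.from_bytes(digest[:8], "big") % len(servers)]


servers = ["db-0", "db-1", "db-2"]  # hypothetical server names
placement = {
    key: pick_server(key, servers)
    for key in ["user:1", "user:2", "order:17", "order:18"]
}
print(placement)
```

The same key always lands on the same server, so reads and writes for a record can be routed without a central lookup table.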
Many data sources :
Enterprise Resource Planning (ERP), social networks, web pages ...
Structured and unstructured data :
(80% of information is unstructured)
90% of information is not exploited...
Real-time analysis :
Data Stream Mining
Increasing the frequency at which data are generated, captured and shared.
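Data stream mining processes each value once, as it arrives, without storing the whole stream. As a minimal sketch of that idea, the class below keeps a running mean and variance using Welford's online algorithm. The class name `RunningStats` and the sample readings are made up for this example.

```python
class RunningStats:
    """Running mean and population variance over a stream, in constant memory."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x):
        # Welford's update: adjust the mean, then accumulate the deviation.
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self):
        return self._m2 / self.count if self.count else 0.0


stats = RunningStats()
for reading in [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]:  # made-up readings
    stats.update(reading)
print(stats.count, stats.mean, stats.variance)
```

Only three numbers are kept in memory, however long the stream runs, which is what makes real-time analysis of fast-arriving data feasible.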
A SUPPORT for industry and scientific research
Provides a previously unreached degree of accuracy and flexibility
Big data requires human interpretation to process information
Beware of pollution effects! More data does not necessarily mean better data
Change information landscapes
Boost the traditional BI
Huge opportunity for processing new data formats (audios, videos and pictures)
Gather more specific information on customer consumption patterns
Reach into all kinds of application areas and companies
Rafaèle BONDAZ / Séverine BOUCHET / Inès SENHADJI
90% of the data in the world were created in the past two years
Big-data computing is considered the biggest innovation in computing in the last decade.
University teachers noticed that the number of books was growing exponentially.
“We call this the problem of big data”
Big data can be found everywhere
Computer scientists thought about a way to gather and group all the important information into data storage.
Big data analysis played a large role in Barack Obama's successful 2012 re-election campaign.
The United States Federal Government owns six of the ten most powerful supercomputers in the world.
Data science is the study of the generalizable extraction of knowledge from data, yet the key word is science.
Analysts estimate that enterprises will spend
$34 billion on big data investments in 2013.
Big brother is watching you ....
Any Questions ?