What is data?
Applied case: CrIP
Specialised in official EC and Member State web sources from the construction domain
of documents in relevant summarizing categories (not pre-defined)
User-friendly, intuitive visualization of results
for retrieval of most relevant documents.
What is big data?
Why does data matter?
Relational data / databases
Text / websites / PDF
Spatial / temporal data
patterns or associations between data
Knowledge / Semantics:
information which is structured as a representative model in our brain.
Examples: historical patterns, future trends, sentiment in a text, meaning of a text...
How do we extract knowledge?
Visualization and extraction of patterns in data
Categorization of data
Construction Information Platform from DG ENTR
Support for decision making
Example: Distribution of grants across Europe
The power of SEMANTICS
Past weather when playing tennis
How do we decide?
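One way to read the tennis example is as a rule learned from past observations. The sketch below is only an illustration: the records in `history` are invented, and `learn_rule` is a hypothetical helper that takes the majority decision per outlook. It is not the method shown in the presentation.

```python
from collections import defaultdict

# Hypothetical past observations: (outlook, played_tennis).
# These records are invented for illustration only.
history = [
    ("sunny", True), ("sunny", True), ("sunny", False),
    ("overcast", True), ("overcast", True),
    ("rainy", False), ("rainy", False), ("rainy", True),
]

def learn_rule(records):
    """Map each outlook to the majority decision seen in the past."""
    # outlook -> [times not played, times played]
    counts = defaultdict(lambda: [0, 0])
    for outlook, played in records:
        counts[outlook][int(played)] += 1
    return {outlook: c[1] > c[0] for outlook, c in counts.items()}

rule = learn_rule(history)
print(rule)
```

A decision tree generalises this idea by choosing, at each step, the attribute that best separates the past "play" and "don't play" days.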
Used in audit and risk management processes
Applied in different sectors such as finance, telecom, and government
Traditional methods of data analysis are complex and time-consuming
Data mining techniques are used to identify the characteristics of fraud and the patterns that govern it.
Example uses: fiscal control detection, telecom line fraud, document plagiarism, etc.
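As a rough illustration of spotting unusual records, the sketch below flags values that lie far from the average, using a simple z-score check. This is a stand-in technique chosen for brevity, not the one used in the presentation. The amounts, the threshold of 2.0 and the name `flag_outliers` are all assumptions.

```python
import statistics

def flag_outliers(values, threshold=2.0):
    """Return the values whose z-score exceeds the threshold in absolute terms."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)  # population standard deviation
    if std == 0:
        return []  # all values identical, nothing stands out
    return [v for v in values if abs(v - mean) / std > threshold]

# Invented transaction amounts; one is far larger than the rest.
amounts = [20, 25, 22, 19, 24, 21, 23, 500]
flagged = flag_outliers(amounts)
print(flagged)
```

Real fraud detection usually combines many attributes per record; a single-variable check like this only shows the basic idea of measuring deviation from normal behaviour.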
Select the most important sentences in the document(s) and display them.
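A minimal sketch of this kind of extractive summarization, assuming a simple word-frequency score: each sentence is scored by how often its words occur in the whole text, and the top sentences are kept. The stop-word list, the function names and the sample text are illustrative assumptions, not part of the presentation.

```python
import re
from collections import Counter

# Tiny, hypothetical stop-word list for illustration.
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "in", "to", "are", "it", "that"}

def split_sentences(text):
    """Split text after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def tokenize(sentence):
    """Return lower-cased words with stop words removed."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower()) if w not in STOP_WORDS]

def summarize(text, n=1):
    """Return the n highest-scoring sentences, in their original order."""
    sentences = split_sentences(text)
    freq = Counter(w for s in sentences for w in tokenize(s))
    # Score = sum of document-wide frequencies of the sentence's words.
    scores = [sum(freq[w] for w in tokenize(s)) for s in sentences]
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:n]
    return [sentences[i] for i in sorted(top)]

text = ("Data mining finds patterns in data. "
        "The weather was nice. "
        "Patterns in data support decisions.")
summary = summarize(text, n=2)
print(summary)
```

This scoring favours longer sentences, so practical systems often normalise by sentence length or use more refined weights.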
What do other people think?
Sentiment analysis of data collected from social media
Used for analysing citizens' opinions on policies, political parties, trademarks, products, etc.
Health reform in the US
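The idea of scoring opinions can be sketched with a word-list approach: count positive and negative words in each message and compare. The word lists and the sample messages below are invented for illustration. Real sentiment analysis relies on much larger lexicons or trained models.

```python
import re

# Hypothetical mini-lexicon, for illustration only.
POSITIVE = {"good", "great", "support", "love", "helpful"}
NEGATIVE = {"bad", "terrible", "oppose", "hate", "costly"}

def sentiment_score(message):
    """Positive word count minus negative word count."""
    words = re.findall(r"[a-z']+", message.lower())
    return sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)

def label(score):
    """Turn a numeric score into a coarse sentiment label."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

# Made-up messages standing in for collected social media posts.
posts = [
    "I support the reform, it is a great step",
    "The reform is costly and bad for us",
    "The vote is on Tuesday",
]
labels = [label(sentiment_score(p)) for p in posts]
print(labels)
```

Aggregating such labels over many messages gives a rough picture of how opinion on a topic is distributed.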
Evidence-based decision making process for the elaboration of legislation
The amount of raw data stored in corporate databases is exploding.
"Data rich and information poor"
"The nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data." (Fayyad et al. 1996)