Loading presentation...

Present Remotely

Send the link below via email or IM


Present to your audience

Start remote presentation

  • Invited audience members will follow you as you navigate and present
  • People invited to a presentation do not need a Prezi account
  • This link expires 10 minutes after you close the presentation
  • A maximum of 30 users can follow your presentation
  • Learn more about this feature in our knowledge base article

Do you really want to delete this prezi?

Neither you, nor the coeditors you shared it with will be able to recover it again.


Make your likes visible on Facebook?

Connect your Facebook account to Prezi and let your likes appear on your timeline.
You can change this under Settings & Account at any time.

No, thanks

Introducing in Data Science topic

No description

Janis Hermanis

on 26 September 2013

Comments (0)

Please log in to add your comment.

Report abuse

Transcript of Introducing in Data Science topic

Introducing in Data Science topic
Porto, September 26th

Data Science
Necessary knowledge base

But first, who I am and from where? :)
Definitions and terms
What kind of knowledge You need?
Data Scientist is man or women, why understand programming more than regular statistician and understand statistics more than regular programmer :)
Some key tools
Name - Janis
Surname - Hermanis
I'm from BA
BA = BA School of Business and Finance (SBF)
I'm teacher in SBF since...
... I can't remember :)
I have...
One wife
One daughter
One home
One cat
One car
One workplace
One good leg :)
Two hands
Lots of ideas how to use my knowledge
About SBF
Funded - 1992
We are Public
# of students ~1400
Study programs
2 directions
4 levels
1st level
Few key facts about Latvia
Founded as Republic of Latvia on 1918
Occupied by Soviets - 1940
Restored independence - 1990/1991
Join to EU - 2004
Some figures
Area - 64 589 km2
Population ~ 2 000 000
Latvians are 62%
Russians - 27%
GDP per capita - 13,899 USD
Currency - Latvian Lats (LVL)
1 LVL = 1.422872 EUR
1 EUR = 0.702804 LVL

From 1st January 2014 we are join to EURO zone
Other official thinks
Location on Europe
Flag of Latvia
Arm of Latvia
Capital city - Riga
We are famous with...
Latvian Song and Dance Festival
Winter sports, like...
Microtik routers
canned Fish
Second greenest country in the world
Data Science
Data science seeks to use all available and relevant data to effectively tell a story that can be easily understood by everybody
From historical POV
Data Science = Statistic
But from 2001 Data Science is independet discipline in the science

Data Scientist
People who are working in Data Science area are called Data Scientist
Rising alongside the relatively new technology of big data is the new job title data scientist
Big Data
Big data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications
Every day 2.5 quintillion (2.5E18) bytes of data were created
Big data sizes are a constantly moving target, as of 2012 ranging from a few dozen terabytes to many petabytes of data in a single data set
Need examples, please...
Statistics is the study of the collection, organization, analysis, interpretation and presentation of data
Descriptive Statistics
Average and Mean
Normal Distribution
Shoping basket
Data Mining
The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.

And getting new knowledge...
Few main steps on using Data Mining methods:

(1) Selection
(2) Pre-processing
(3) Transformation
(4) Data Mining
(5) Interpretation/Evaluation
Data Scientist is man or women, why understand programming more than regular statistician and understand statistics more than regular programmer :)
Everything what you understand by this term, programming languages, web developing, developing mobile apps, etc.....
Data Base
For Big Data needs we need other kinds of Data Bases

Good explanation you can find here:

Data Warehouse
A data warehouse is a database used for reporting and data analysis
In our case one more nice article about this topic

Data Visualization
Main goal of data visualization is to communicate information clearly and effectively through graphical means
Data visualization or data visualisation is the study of the visual representation of data, meaning "information that has been abstracted in some schematic form, including attributes or variables for the units of information"
Business Intelligence
Business intelligence is a set of theories, methodologies, processes, architectures, and technologies that transform raw data into meaningful and useful information for business purposes
Business Intelligence is a set of methodologies, processes, architectures, and technologies that transform raw data into meaningful and useful information used to enable more effective strategic, tactical, and operational insights and decision-making
Programming languages
Understanding business processes

And a bit spirit of art
From Microsoft
From IBM
Please........tell me!!!!!!!!
You can follow me
Full transcript