Send the link below via email or IMCopy
Present to your audienceStart remote presentation
- Invited audience members will follow you as you navigate and present
- People invited to a presentation do not need a Prezi account
- This link expires 10 minutes after you close the presentation
- A maximum of 30 users can follow your presentation
- Learn more about this feature in our knowledge base article
Do you really want to delete this prezi?
Neither you, nor the coeditors you shared it with will be able to recover it again.
Make your likes visible on Facebook?
You can change this under Settings & Account at any time.
Data Bootcamp - Team Intro, working with data
Transcript of Data Bootcamp - Team Intro, working with data
accurate business metrics
to transform data
from logs to charts
tools for analyzing data on s3
store logs on s3
framework for automated
Write your own job, test it, deploy it
Keep it healthy
Create charts, reports, etc.
Help your team understanding them!
Produce nice and tidy (structured) logs
Data @ Prezi
Working with data
unstructured*, textual data
sorted, cleaned, 'structured'
Amazon's data warehouse solution
Quite fast, but expensive
create charts, dashboards*
*Tamas Imre (MX) can help
SQL-like language for creating MapReduce programs for Hadoop
slow, but deals with any data
hourly or daily jobs
old data pipeline framework
new jobs here!
proper dependency handling
You can reach us:
core-data @ HipChat
s3cat, catlog, s3cmd, s3tac, piggrep
Read our projects' tutorial
(ask if something is unclear)
How to start it?
write your job,
test it on etl.prezi.com
if jenkins is green, deploy with
but you can use it as documentation
"Prezilians can get answers to quantifiable question within an hour."
(mainly Data Infrastructure)
Director of Data
Importance of being data-driven: