Importing Data into Neo4j

by Rik Van Bruggen on 30 November 2015



Source Data
Importing Data into Neo4j
Graph data model is richer
you don’t just dump data
you create a (complex) structure
you want to fill the right model
no 1:1 copy of your existing database-model


Different Import Mechanisms
transactional - live updates
batch - high volume initial load

Cypher - all APIs
Java - embedded database
Potential issues
“expensive” index reads
duplicate checking
huge transaction sizes
high volumes via Cypher
transactional Mode
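One common way to keep transaction sizes bounded during a transactional import is to split the input into fixed-size batches and commit each batch separately. The following is a minimal sketch in Python, assuming the rows are already parsed. The name `batched_rows` and the batch size are illustrative and not part of any Neo4j API.

```python
from itertools import islice
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def batched_rows(rows: Iterable[T], batch_size: int) -> Iterator[List[T]]:
    """Yield successive lists of at most batch_size rows (hypothetical helper)."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    iterator = iter(rows)
    while True:
        # Pull the next slice lazily so the whole input never sits in memory.
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch
```

Each yielded batch would then be written in its own transaction, so a failure only rolls back one batch rather than the whole load.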
Why is Data Import even a topic for Neo4j users?

YOUR IMPORT TOOLSET
YOUR DATABASE
Your Unique Import Process

Your Specific Import Requirement
YOUR DATA MODEL
YOUR IMPORT SCENARIO
YOUR PROCESS
Existing database
Relational, NoSQL, Mainframe, ...
Existing File
.CSV, GraphML, Gephi, Geoff, Spreadsheet, ...
Existing Service
Web Service, REST API, .NET, ...
Dataset Format
Dataset Size DOES matter
1000s
1000000s
10000000s
Initial Bulk Load
Incremental Load
Initial Bulk + Incremental Load
Spreadsheet
Neo4j Shell
Batch Importer
3rd Party Tools
Custom Software
Map Import to model to tooling
DEMO TIME
Scenario 1
Small, quick and dirty import of a couple hundred / thousand nodes and relationships

Using spreadsheets and Cypher statements to do so
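The spreadsheet approach amounts to concatenating cell values into one Cypher statement per row. The same idea can be sketched in Python, assuming a small sheet exported as CSV with hypothetical `name` and `country` columns and a hypothetical `Person` label. None of these names come from the talk.

```python
import csv
import io


def cypher_string(value: str) -> str:
    """Quote a value as a single-quoted Cypher string literal."""
    escaped = value.replace("\\", "\\\\").replace("'", "\\'")
    return "'" + escaped + "'"


def create_statements(csv_text: str) -> list:
    """Build one CREATE statement per row, as a spreadsheet formula would."""
    statements = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        statements.append(
            "CREATE (:Person {name: %s, country: %s});"
            % (cypher_string(row["name"]), cypher_string(row["country"]))
        )
    return statements


# Illustrative data only.
SAMPLE = "name,country\nAlice,Belgium\nO'Brien,Ireland\n"

for statement in create_statements(SAMPLE):
    print(statement)
```

The generated statements can be pasted into the Neo4j Shell. Escaping quotes matters here, because a stray apostrophe in a cell would otherwise break the statement.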
Scenario 2
Mid-sized import of up to a couple hundred thousand nodes and relationships

Using Cypher's LOAD CSV capability to do so
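A LOAD CSV import is usually a single Cypher statement that reads the file row by row and commits periodically. The sketch below builds such a statement in Python. It assumes the `USING PERIODIC COMMIT` syntax of the Neo4j 2.x releases current at the time of the talk (later versions express batched commits differently), and the file URL, label and key property are hypothetical.

```python
def load_csv_statement(url: str, label: str, key: str,
                       commit_every: int = 1000) -> str:
    """Return a LOAD CSV statement that MERGEs one node per CSV row."""
    return (
        f"USING PERIODIC COMMIT {commit_every}\n"
        f"LOAD CSV WITH HEADERS FROM '{url}' AS row\n"
        # MERGE avoids duplicates; it is far cheaper when an index or
        # uniqueness constraint exists on the key property.
        f"MERGE (n:{label} {{{key}: row.{key}}})"
    )


print(load_csv_statement("file:///people.csv", "Person", "name"))
```

Periodic commits address the huge-transaction problem, and an index on the key property addresses the expensive lookups that MERGE would otherwise perform for duplicate checking.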
Scenario 3
Large import of millions of nodes and relationships

Using the Batch Import capability to do so
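Batch import tools typically read separate node and relationship files rather than Cypher. The sketch below writes two such files in Python, assuming the CSV header conventions of the `neo4j-import` command-line tool (`:ID`, `:LABEL`, `:START_ID`, `:END_ID`, `:TYPE`). The tool used in the talk may expect a different layout, and the `Person` and `KNOWS` names are hypothetical.

```python
import csv
import io


def nodes_csv(people) -> str:
    """Render (id, name) pairs as a node file with an ID and a label column."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["personId:ID", "name", ":LABEL"])
    for person_id, name in people:
        writer.writerow([person_id, name, "Person"])
    return buffer.getvalue()


def relationships_csv(pairs) -> str:
    """Render (start_id, end_id) pairs as a relationship file."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow([":START_ID", ":END_ID", ":TYPE"])
    for start_id, end_id in pairs:
        writer.writerow([start_id, end_id, "KNOWS"])
    return buffer.getvalue()


print(nodes_csv([(1, "Alice"), (2, "Bob")]))
print(relationships_csv([(1, 2)]))
```

Because the batch tool writes store files directly and skips transactions, the IDs in the relationship file must match the IDs in the node file exactly.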
Tweak the tooling
Java Heap Size
Memory mapping of store files
Caching config
Neo4j
Test data!
http://www.wired.com/2013/07/mapping-stereotypes/
Things to keep in mind!
Cypher's LOAD CSV