Wikidata. What is it... (long)

by

Adam Shorland

on 10 April 2015


Transcript of Wikidata. What is it... (long)

Terms
Namespaces
- Items (main space), properties (Property:), queries (Query:)

Entity
- Every entity is identified by a unique entity Id
It is also identified by a unique combination of label and description per language

Item
- A page that represents a real-life topic, concept, or subject

Property
- The descriptor for a data value using a data type

Query
- (future feature) a predefined search across items
Entities, Items, Properties and Queries
Language Attributes
- Language specific labels, descriptions and aliases that are assigned to entities

Label
- Primary means to distinguish entries

Description
- Provides context for the label

Aliases
- Alternate names that can be used for lookup the same way as labels


Language Fallback
- Methods to systematically replace missing language attributes with strings from alternate languages
Language Attributes
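Language fallback can be pictured as walking an ordered list of languages until one of them has the attribute. The sketch below is an illustration only, not Wikibase's implementation: the function name `pick_label`, the sample labels and the sample chain are all assumptions.

```python
from typing import Dict, List, Optional, Tuple


def pick_label(labels: Dict[str, str], chain: List[str]) -> Optional[Tuple[str, str]]:
    """Return (language, label) for the first language in the chain that has a label."""
    for lang in chain:
        if lang in labels:
            return lang, labels[lang]
    return None  # no language in the chain has a label


# Hypothetical item with labels in German and English only.
labels = {"de": "Berlin", "en": "Berlin (city)"}
# Hypothetical fallback chain: preferred language first, then alternates.
chain = ["gsw", "de", "en"]

result = pick_label(labels, chain)
print(result)
```

Returning the language together with the label lets a caller show that the string came from a fallback language rather than the requested one.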
Claim
- a piece of data about the entity on whose page the claim appears
A claim consists of a property (such as "Location") and a value (e.g., "Germany")
A claim can have qualifiers, such as a qualifier saying that the claim is valid within a specific time frame

Statement
- a piece of data about an item, recorded on the item's page
A statement consists of a property (such as "Location") and a value (e.g. "Germany")
A statement can have qualifiers, (e.g. the claim is valid within a specific time frame)
A statement can have references (e.g. giving the source for the claim)
A statement can have a rank (used to distinguish between several claims containing the same property)
Claims and Statements
Value
- (or datavalues) the information pieces embedded in each claim
NoValue - a marker when there is no known value
UnknownValue - a marker when there is some value but the exact value is not known

Qualifier
- part of the claim that says something about the specific claim, often in a descriptive way

Rank
- a quality factor used for simple selection/filtering in cases where there are many statements for a given property

Reference
- describes the origin of a statement in Wikidata
Claims and Statements cont.
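The terms above fit together roughly as in this sketch. The class and field names are assumptions chosen for illustration, not the Wikibase data model classes. The snak types `value`, `novalue` and `somevalue` and the ranks `preferred`, `normal` and `deprecated` are assumed here to match the strings used in the serialized form; the property ID `P999` is hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, List, Optional


@dataclass
class Snak:
    property_id: str
    snaktype: str = "value"      # "value", "novalue" or "somevalue"
    value: Optional[Any] = None  # only meaningful when snaktype == "value"


@dataclass
class Statement:
    mainsnak: Snak                                               # the claim: property + value
    qualifiers: List[Snak] = field(default_factory=list)         # say something about the claim
    references: List[List[Snak]] = field(default_factory=list)   # origin of the statement
    rank: str = "normal"                                         # "preferred", "normal" or "deprecated"


def best_statements(statements: List[Statement]) -> List[Statement]:
    """Use preferred statements if any exist, otherwise normal ones; skip deprecated."""
    preferred = [s for s in statements if s.rank == "preferred"]
    if preferred:
        return preferred
    return [s for s in statements if s.rank == "normal"]


# Three hypothetical statements for the same property.
statements = [
    Statement(Snak("P999", value="old figure"), rank="deprecated"),
    Statement(Snak("P999", value="earlier figure")),
    Statement(Snak("P999", value="current figure"), rank="preferred"),
]
best = best_statements(statements)
print([s.mainsnak.value for s in best])
```

This is the kind of simple selection that rank is meant to support when a property has many statements.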
Sitelink
- an identification of a linked page or article on another Wikimedia site such as a Wikipedia language version
Consists of a site identifier (e.g. "dewiki") and a title (e.g. "Berlin")

Badge
- a marker attached to a sitelink, which could, for example, identify that the article is a "featured article" on a specific site
Sitelinks
Head to the test site
http://test.wikidata.org/
Create some items
http://test.wikidata.org/wiki/Special:NewItem
Create some properties
http://test.wikidata.org/wiki/Special:NewProperty
Play with language attributes
Play with claims and statements

test.wikidata properties:
CommonsName (P6), instance of (P7), Coordinate (P10), Date of birth (P18), URL (P20)


also see the wikidata sandbox!
https://www.wikidata.org/wiki/Q4115189
time to have a play!
Possible data types: Item, Commons media, String, Time, Globe Coordinate, URL
How is this all used?
http://dl.dropboxusercontent.com/u/172199972/map/map.html
Wikidata Map
http://goo.gl/oCQsmB
The Wiki Atlas
http://4thmain.github.io/projects/hacks/wiki-atlas.html
Geneawiki
https://toolserver.org/~magnus/ts2/geneawiki/
The Basics
Community Overview
The Talk Page
http://toolserver.org/~magnus/thetalkpage/
3900 active users
85 flagged bots
~100 administrators (sysops)
4 bureaucrats
2 oversighters
37+ task forces
database is 106.6GB (as of 08/01/13)

Useful Places
Learn about the basics of Wikidata:
Wikidata:Introduction

Find answers to frequently asked questions:
Wikidata:FAQ

Chat with other community members:
Wikidata:Project chat

Read the latest news around Wikidata:
Wikidata:News

Get a weekly summary of what is happening:
Wikidata:Status updates

Find out about gadgets and tools:
Wikidata:Tools
Policies & Guidelines
Wikidata:Blocking policy
Wikidata:Notability
Wikidata:Use common sense
Wikidata:Page protection policy
Help:Description
Wikidata:IP block exemption
Wikidata:Oversight
Help:Sources
The Guidelines
Wikidata:Administrators
Wikidata:Bureaucrats
The Policies
Wikidata:Bots
Help:Label
Help:Aliases
Help:Statements
Help:Sitelinks
Help:Merge
Wikidata:Verifiability
Wikidata:Deletion policy
Wikidata:No personal attacks
Wikidata:Assume good faith
The Proposed
Wikidata in its first phases has two main goals: to centralize interlanguage links across Wikimedia projects and to serve as a general knowledge base for the world at large.
An item is acceptable if and only if it fulfils at least one of these two goals, that is, if it meets at least one of the criteria below:

It contains at least one valid sitelink to a Wikipedia, Wikivoyage, or Wikimedia Commons page.
To be valid, a link must not be a talk page, MediaWiki page, special page, user page, subtemplate, or file. Note that a single Wikimedia page cannot have more than one sitelink in Wikidata and that a sitelink cannot point to a redirect.

It refers to an instance of a clearly identifiable conceptual or material entity. The entity must be notable, in the sense that it can be described using serious and publicly available references. If there is no item about you yet, you are probably not notable.

It fulfills some structural need, for example: it is needed to make statements made in other items more useful.
Notability!
https://www.wikidata.org/wiki/Wikidata:List_of_policies_and_guidelines
Some History
October 26, 2012: The Wikidata database is created but is not available for editing yet.
October 29, 2012: Item Q1 ("Universe") is created.
October 30, 2012: Wikidata is launched.

November 3, 2012: Item ID Q10000 is created, four days after opening the wiki.
November 8, 2012: The first set of local temporary administrators are elected.
November 10, 2012: Item ID Q20000 is created.
November 14, 2012: Victory becomes item ID Q50000, fifteen days after Wikidata's launch.
November 24, 2012: Item ID Q100000 is created.
November 29, 2012: Human and bot collaboration helps Wikidata reach item ID Q200000

December 1, 2012: Item ID Q250000 is created.
December 7, 2012: Item ID Q500000 is created, within 55 hours of Q400000's creation.
December 15, 2012: Finnish television series Matkaoppaat is added as item ID one million.
December 29, 2012: Item ID two million is created within sixty days of Wikidata's launch.

January 14, 2013: Wikidata phase 1 goes live on the Hungarian Wikipedia
January 30, 2013: Wikidata phase 1 goes live on the Hebrew and Italian Wikipedias

February 2, 2013: Item 4,000,000 is created, just over three months after launch.
February 4, 2013: Phase 2 of Wikidata commences
February 13, 2013: Wikidata phase 1 goes live on the English Wikipedia
February 20, 2013: The five-millionth item is created

March 6, 2013: Phase 1 of Wikidata goes live on the remaining 282 editions of Wikipedia
March 27, 2013: Phase 2 of Wikidata is deployed on 11 editions of Wikipedia

April 4, 2013: The ten-millionth item is created
April 12, 2013: The first 2 Wikidata bureaucrats are elected.
April 24, 2013: Phase 2 of Wikidata is deployed across all editions of Wikipedia

May 8, 2013: The thirteen-millionth item is created
May 29, 2013: The long-awaited time datatype is now available

June 9, 2013: Wikidata is edited for the 50,000,000th time
June 10, 2013: The coordinates datatype is now available

July 17, 2013: The fourteen-millionth item is created
July 23, 2013: Phase 1 of Wikidata has gone live on Wikivoyage

August 26, 2013: Phase 2 of Wikidata has gone live on Wikivoyage

September 10, 2013: URL datatype is now available
September 16, 2013: The first 2 Wikidata oversighters are elected
September 23, 2013: Phase 1 of Wikidata has launched on Wikimedia Commons
Wikidata is 1 this weekend!
Wikidata is a free, collaborative, multilingual, structured knowledge base that can be read and edited by humans and machines alike.
Free
- The data in Wikidata is published under a free license
Multilingual
- Editing, consuming, browsing and reusing the data is fully multilingual
Structured
- Data is stored in a structured form allowing easy access and reuse
Collaborative
- The data is maintained by Wikidata editors (volunteers)
What does this mean?
Wikidata vs Wikibase
Wikidata
- A Wikimedia project that runs an instance of MediaWiki with the Wikibase extensions

Wikibase
- The software behind Wikidata. It consists of three MediaWiki extensions:

Wikibase repo
- enables MediaWiki to deal with the collection and maintenance of the structured data

Wikibase client
- enables MediaWiki installations such as Wikipedia to query and display information from a repo

Wikibase lib
- common libraries for the above extensions
REST API for reading and writing
Full dumps of all data stored
Data access in many formats, JSON, XML, RDF
Creative Commons 0 licence
Stable Identifiers
What Wikibase gives us
Wikidata and Wikibase
1
'interwiki' support
- Items, Sitelinks

2
'infobox' support
- Properties, Claims, Statements

3
'list' support
- Queries
Wikidata deployment phases
The Team
Gadgets and tools
https://www.wikidata.org/wiki/Wikidata:Tools

SlurpInterwiki
This script automatically imports all interwiki links from one Wikipedia article.

The gadget can then be launched by clicking on the "Import interwiki" link in the toolbox. After that, a dialog appears where a language for import can be selected.
LabelLister
This script shows a list of all labels, in all supported languages, for the specific item.

It also lets you edit a label without having to change your main language.
EnumItems
This script adds one or two tabs next to the watchlist star tabs, to jump to the previous or the next item, respectively
autoEdit
This script automatically adds all labels with the help of the interwiki links and adds descriptions from an editable list.
Main Language First
This script highlights the language used at the moment, and shows it on the top. It is also possible to do this for a list of languages.
Deletion Request
This script adds a small link to the top of the page of each item with a link for quickly requesting a deletion at Wikidata:Requests for deletion.
AuthorityControl
For items with 'Authority Control' properties, set the corresponding codes as links to the online Authority Control tool.

Moreover, if the user clicks 'edit', the link disappears instantly.
Move
Adds a move link to the edit tools when editing a sitelink.

It looks like [save|remove|move|cancel]
CommonsMedia
Lets you view a Wikimedia Commons image from Wikidata.
Preview
Shows a "preview" button next to each linked article to get a preview of the article.
Merge
Adds a "Merge it with ..." button in a submenu next to the "Watch" button that, when clicked, lets you merge items and then automatically delete the extra items or request their deletion.
more...
Sitenames in user-language
Makes language names in sitelinks appear in your language.
importScript( 'User:Ricordisamoa/SitenamesInUserLanguage.js' );

DMY dates
Makes all dates in claims appear as "dd month-name-in-your-language yyyy"
importScript( 'User:Ricordisamoa/DMYdates.js' );

Consistency check
A tool to check the consistency of statements: for example, if B is a "child" of A, then A should be the "father" or "mother" of B.
importScript( 'User:Xmlizer/consistency_check.js' );

Wikidata useful
Adds a box of quick "add statement" links to the right side of the page. Entity and phylogeny types, countries (location and citizenship). More to come.
importScript( 'User:Magnus Manske/wikidata_useful.js' );
see https://www.wikidata.org/wiki/Wikidata:Tools
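The rule behind the consistency check gadget listed above (if B is a "child" of A, then A should be a "father" or "mother" of B) can be sketched as follows. This is not the gadget's code: the function name, the dictionary layout and the item IDs are assumptions made for illustration.

```python
from typing import Dict, List, Set, Tuple


def find_missing_parents(
    child_claims: Dict[str, Set[str]],
    parent_claims: Dict[str, Set[str]],
) -> List[Tuple[str, str]]:
    """Return (parent, child) pairs where the child does not point back at the parent.

    child_claims maps an item to the items it lists as "child".
    parent_claims maps an item to the items it lists as "father" or "mother".
    """
    problems = []
    for parent, children in child_claims.items():
        for child in sorted(children):
            if parent not in parent_claims.get(child, set()):
                problems.append((parent, child))
    return problems


# Hypothetical data: Q1 lists Q2 and Q3 as children, but only Q2 points back.
child_claims = {"Q1": {"Q2", "Q3"}}
parent_claims = {"Q2": {"Q1"}}

problems = find_missing_parents(child_claims, parent_claims)
print(problems)
```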
To come!
DataTypes
Numbers / Quantities
Geographic Shapes
Media
Multilingual texts
Monolingual texts
Features
Sorting of Statements
Fallback Languages
Ajax free editing
Redirects
Accessing data from arbitrary items
Sister Projects
Support meta info for Commons
Wikiquote
Wiktionary
Wikispecies
etc.
Processes
Wikidata:RFC (Request For Comment)
Wikidata:RFP (Request For Permissions)
Wikidata:RFD (Request For Deletion)
Wikidata:AN (Administrators' Noticeboard)
Wikidata:BN (Bureaucrats' Noticeboard)
Category:Wikidata protected edit requests
Wikidata:Bot requests
Wikidata:Property proposal
Wikidata:Properties for deletion
Property Proposal
Before a new property is created, it has to be discussed.
If after some time there are some supporters, but no or very few opponents, the property is created by a property creator or an administrator.
Request For Deletion
Before deleting items, check to ensure that they are not in use.
This can be easily done with the "links" link below the header of each request.

Please merge items before requesting their deletion.
Regular users and IPs are able to merge items. If you do not know how to merge, please read Help:Merge.

Do not try to pre-emptively delete an item because its page is up for deletion on a site.
The link will be removed by bots and reported here in the future if a deletion takes place.

Please use Wikidata:Properties for deletion if you want to nominate a property for deletion.
Getting used to Wikidata
Go to:
https://en.wikipedia.org/wiki/Special:UnconnectedPages

Find A Page!

Make an Item For It!

Special Pages
Special:DispatchStats
This page provides statistics about dispatching changes to the clients.
A client wiki is notified of a change on Wikidata once the change has been dispatched on that wiki, and the job queue on that wiki has processed that notification.
Special:NewItem
Special:Set*
This form allows you to set the label of an entity.

Such pages exist for descriptions, aliases and sitelinks
Special:(Entities|Items)Without*
Special:MyLanguageFallbackChain
The language fallback chain used to display entity labels for you is shown below.
Item by title
Item Disambiguation
Special:NewProperty
Creates a new property
(Special:NewItem, listed above, creates a new item)
Wikibase Client
Using Information From Repo
Interwiki Links
240,000,000 language links removed

2013 analysis: http://simia.net/languagelinks/2013.html
2012 analysis: http://simia.net/languagelinks/index.html
The inclusion syntax presented here is meant to work for very simple cases, and only for them.

Complicated cases are to be realized using Lua.
www.mediawiki.org/wiki/Extension:WikibaseClient/Lua
Properties
{{#property:P36}}
{{#property:capital}}
Properties from other items (not yet implemented)
{{#property:capital|of=Q183}}
{{#property:capital|of=Germany}}
Lua
mw.wikibase.getEntity()
gets entity data of the Wikidata item connected with the current page.

mw.wikibase.label( id )
takes an item ID and returns the label in the language of the local wiki.

mw.wikibase.sitelink( id )
takes an item ID and returns the title of the corresponding page on the local wiki.
https://www.mediawiki.org/wiki/Extension:WikibaseClient/Lua
The Wikibase Extensions and Modules
mediawiki/extensions/Wikibase
mediawiki/extensions/WikibaseDataModel
mediawiki/extensions/WikibaseDatabase
mediawiki/extensions/WikibaseQuery
mediawiki/extensions/WikibaseQueryEngine
mediawiki/extensions/Wikidata
mediawiki/extensions/DataTypes
mediawiki/extensions/DataValues
mediawiki/extensions/Diff

https://www.mediawiki.org/wiki/Extension:Wikibase_Repo
Api modules
Serialized entity form
getentities
setlabel
setdescription
setaliases
setclaimvalue
setclaim
createclaim
getclaims
editentity
removeclaims
setreference
removereferences
setqualifier
removequalifiers
setsitelink
linktitles
setstatementrank
"Q2": {
  "id": "Q2",
  "type": "item",
  "aliases": {
    "en": [
      {
        "language": "en",
        "value": "alias1"
      },
      {
        "language": "en",
        "value": "alias2"
      }
    ]
  },
  "labels": {
    "en": {
      "language": "en",
      "value": "I am a label"
    }
  },
  "descriptions": {
    "en": {
      "language": "en",
      "value": "I am a description!"
    }
  },
  "claims": {
    "P5": [
      {
        "id": "Q2$0457f6a4-45b3-46db-cecf-1933df515bb7",
        "mainsnak": {
          "snaktype": "value",
          "property": "P5",
          "datavalue": {
            "value": {
              "time": "+00000002013-06-12T00:00:00Z",
              "timezone": 0,
              "before": 0,
              "after": 0,
              "precision": 11,
              "calendarmodel": "http://www.wikidata.org/entity/Q1985727"
            },
            "type": "time"
          }
        },
        "type": "statement",
        "rank": "normal"
      }
    ]
  },
  "sitelinks": {
    "enwiki": {
      "site": "enwiki",
      "title": "User:Addshore"
    }
  }
}

"Q8": {
  "pageid": 13,
  "ns": 120,
  "title": "Item:Q8",
  "lastrevid": 54,
  "modified": "2013-09-28T07:28:56Z",
  "id": "Q8",
  "type": "item"
}
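A short sketch of reading the serialized entity form shown above, using only the standard library. The embedded JSON is a trimmed copy of the Q2 sample, wrapped in outer braces so that it parses; the variable names are illustrative.

```python
import json

# Trimmed copy of the Q2 sample, wrapped in {} to make it valid JSON.
raw = """
{"Q2": {
  "id": "Q2",
  "type": "item",
  "aliases": {"en": [{"language": "en", "value": "alias1"},
                     {"language": "en", "value": "alias2"}]},
  "labels": {"en": {"language": "en", "value": "I am a label"}},
  "claims": {"P5": [{
    "mainsnak": {"snaktype": "value", "property": "P5",
                 "datavalue": {"value": {"time": "+00000002013-06-12T00:00:00Z",
                                         "precision": 11},
                               "type": "time"}},
    "type": "statement", "rank": "normal"}]},
  "sitelinks": {"enwiki": {"site": "enwiki", "title": "User:Addshore"}}
}}
"""

entity = json.loads(raw)["Q2"]

label = entity["labels"]["en"]["value"]
aliases = [alias["value"] for alias in entity["aliases"]["en"]]
# Only snaks of type "value" carry a datavalue, so filter before reading it.
times = [
    claim["mainsnak"]["datavalue"]["value"]["time"]
    for claim in entity["claims"]["P5"]
    if claim["mainsnak"]["snaktype"] == "value"
]
title = entity["sitelinks"]["enwiki"]["title"]

print(label, aliases, times, title)
```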
action=wbgetentities *
API module to get the data for multiple Wikibase entities.

ids
sites
titles
props
sort
dir
languages
languagefallback
normalize
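As a sketch of how the parameters above combine into a request, the snippet below builds a wbgetentities URL without sending it. It assumes the usual MediaWiki API conventions (endpoint at `/w/api.php`, multiple values joined with `|`, `format=json`); the helper name `getentities_url` and its defaults are illustrative.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

API_URL = "https://www.wikidata.org/w/api.php"


def getentities_url(ids, props=("labels", "descriptions"), languages=("en",)):
    """Build a wbgetentities request URL; multi-value parameters are pipe-separated."""
    params = {
        "action": "wbgetentities",
        "ids": "|".join(ids),
        "props": "|".join(props),
        "languages": "|".join(languages),
        "format": "json",
    }
    return API_URL + "?" + urlencode(params)


url = getentities_url(["Q2", "Q183"])
# Decode the query string again to check what would be sent.
query = parse_qs(urlsplit(url).query)
print(url)
```

The URL can be fetched with any HTTP client. Reading needs no token, unlike the write modules that follow.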
action=wbsetlabel *
API module to set a label for a single Wikibase entity.

id
site
title
baserevid
summary
token
bot
language
value
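The write modules share most of their parameters. The sketch below assembles the form fields for a wbsetlabel call without sending anything. It assumes the request is sent as an HTTP POST and that an edit token has already been obtained; the helper name and the placeholder token are hypothetical.

```python
def setlabel_params(entity_id, language, value, token,
                    summary=None, baserevid=None, bot=False):
    """Assemble POST fields for wbsetlabel; optional fields are only added when given."""
    params = {
        "action": "wbsetlabel",
        "id": entity_id,
        "language": language,
        "value": value,
        "token": token,
        "format": "json",
    }
    if summary:
        params["summary"] = summary
    if baserevid is not None:
        # Base revision the edit builds on, used for edit conflict detection.
        params["baserevid"] = str(baserevid)
    if bot:
        params["bot"] = "1"
    return params


# Placeholder token: a real one has to be requested from the API first.
params = setlabel_params("Q2", "en", "I am a label", token="PLACEHOLDER-TOKEN",
                         summary="set English label", baserevid=54)
print(params)
```

The same pattern would apply to wbsetdescription, with only the `action` changed.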
action=wbsetdescription *
API module to set a description for a single Wikibase entity.

id
site
title
baserevid
summary
token
bot
language
value
action=wbsetaliases *
API module to set the aliases for a Wikibase entity.

id
site
title
baserevid
summary
token
bot
add
set
remove
language
action=wbsetsitelink *
API module to associate an article on a wiki with a Wikibase item, or to remove an existing association.

id
site
title
baserevid
summary
token
bot
linksite
linktitle
action=wblinktitles *
API module to associate two articles on two different wikis with a Wikibase item.

tosite
totitle
fromsite
fromtitle
token
bot
action=wbeditentity *
API module to create a single new Wikibase entity and modify it with serialised information.

id
site
title
baserevid
summary
token
bot
data
clear
new
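A sketch of creating an item with wbeditentity: the `data` parameter carries a JSON string shaped like the serialized entity form shown earlier, and `new=item` asks for a new item. The helper name and the placeholder token are hypothetical, and nothing is sent over the network.

```python
import json


def new_item_params(labels, descriptions, token):
    """Assemble POST fields that would create a new item with labels and descriptions."""
    data = {
        "labels": {
            lang: {"language": lang, "value": text} for lang, text in labels.items()
        },
        "descriptions": {
            lang: {"language": lang, "value": text} for lang, text in descriptions.items()
        },
    }
    return {
        "action": "wbeditentity",
        "new": "item",
        "data": json.dumps(data),  # the API expects the entity data as a JSON string
        "token": token,
        "format": "json",
    }


item_params = new_item_params(
    labels={"en": "I am a label"},
    descriptions={"en": "I am a description!"},
    token="PLACEHOLDER-TOKEN",  # placeholder, not a real token
)
decoded = json.loads(item_params["data"])
print(item_params["data"])
```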
action=wbgetclaims *
API module for getting Wikibase claims.

entity
property
claim
rank
props
action=wbcreateclaim *
API module for creating Wikibase claims.

summary
token
baserevid
bot
entity
snaktype
property
value
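For wbcreateclaim, the `value` parameter is JSON-encoded and is only needed when `snaktype` is `value`. In the sketch below, an item value is assumed to use the `entity-type` / `numeric-id` form; the property comes from the test site list earlier in the deck (instance of, P7). The helper name and the token are placeholders.

```python
import json

SNAK_TYPES = ("value", "novalue", "somevalue")


def createclaim_params(entity, prop, token, value=None, snaktype="value"):
    """Assemble POST fields for wbcreateclaim; the value is sent as a JSON string."""
    if snaktype not in SNAK_TYPES:
        raise ValueError("unknown snaktype: %r" % (snaktype,))
    params = {
        "action": "wbcreateclaim",
        "entity": entity,
        "property": prop,
        "snaktype": snaktype,
        "token": token,
        "format": "json",
    }
    if snaktype == "value":
        if value is None:
            raise ValueError("a value is required when snaktype is 'value'")
        params["value"] = json.dumps(value)
    return params


TOKEN = "PLACEHOLDER-TOKEN"  # placeholder, not a real token

# "Q2 is an instance of Q8" on the test site (item-typed value).
item_claim = createclaim_params("Q2", "P7", TOKEN,
                                value={"entity-type": "item", "numeric-id": 8})
# A claim stating that there is no value for the property.
novalue_claim = createclaim_params("Q2", "P7", TOKEN, snaktype="novalue")
print(item_claim["value"])
```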
action=wbsetclaim *
API module for creating or updating an entire Claim.

claim
token
baserevid
bot
action=wbsetclaimvalue *
API module for setting the value of a Wikibase claim.

summary
token
baserevid
bot
claim
value
snaktype
action=wbremoveclaims *
API module for removing Wikibase claims.

summary
token
baserevid
bot
claim
action=wbsetreference *
API module for creating a reference or setting the value of an existing one.

summary
token
baserevid
bot
statement
snaks
snaks-order
reference
action=wbremovereferences *
API module for removing one or more references of the same statement.

summary
token
baserevid
bot
statement
references
action=wbsetqualifier *
API module for creating a qualifier or setting the value of an existing one.

summary
token
baserevid
bot
claim
property
value
snaktype
snakhash
action=wbremovequalifiers *
API module for removing a qualifier from a claim.

summary
token
baserevid
bot
claim
qualifiers
action=wbsetstatementrank *
API module for setting the rank of a Wikibase statement.

summary
token
baserevid
bot
statement
rank
https://www.wikidata.org