Send the link below via email or IMCopy
Present to your audienceStart remote presentation
- Invited audience members will follow you as you navigate and present
- People invited to a presentation do not need a Prezi account
- This link expires 10 minutes after you close the presentation
- A maximum of 30 users can follow your presentation
- Learn more about this feature in our knowledge base article
Transcript of Stylometric Evolution
in 878 books by 24 authors in 10 languages
(in 45 minutes)
Jan Rybicki, Jagiellonian University
puts together texts by the same authors, but traces of other "signals" are also visible: genre, theme, or
...even though two risky assumptions have to be made:
language is a "bag of words",
word frequencies are independent of each other.
thors exhibit stylometric e
be almost linear (Galdós, J
Other factors are
It is a truth universally acknowledged, that a single man in possession of a good fortune, must be in want of a wife. However…
I wish either my father or my mother, or indeed both of them, as they were in duty both equally bound to it, had minded what they were about when they begot me…
'NOW, what I want is, Facts. Teach these boys and girls nothing but Facts. Facts alone are wanted in life. Plant nothing else, and root out everything else…
I have just returned from a visit to my landlord - the solitary neighbour that I shall be troubled with. This is certainly a beautiful country!...
Dear Father and Mother,
I have great trouble, and some comfort, to acquaint you with. The trouble is, that my good lady died of the illness I mentioned to you, and left us all much grieved for the loss of her...
Bag of texts, bag of words
A lot of Deltas
In a set of texts by several authors, we can group the texts by their authors by comparing frequencies of 50, 100, 500...
most frequent words.
Let's face it...
Looking at title pages works too.
What is it that we see?
stic change over
evolution in individu