Introducing 

Prezi AI.

Your new presentation assistant.

Refine, enhance, and tailor your content, source relevant images, and edit visuals quicker than ever before.

Loading…
Transcript

Personality Cloning

Shivani Poddar

Personality Cloning

What we have today?

  • Sentiment140-Tool -> http://help.sentiment140.com/
  • Opinion Crawler -> http://www.opinioncrawl.com/
  • SAS -> commercialized website - http://www.sas.com/text-analytics/sentiment-analysis/

What is the problem?

  • Give positives and negatives to the parsed words.
  • No policy of inhering qualities

http://www.scienceforseo.com

Check out this beautiful visual sentiment system <3 

Tools for coders

SentiWordNet

LingPipe sentiment analysis

Long list of tools at CodeSpeak

The Toolkit for Advanced Discriminative Modeling (TADM)

RapidMiner

The process part 3

  • No analysis of intentions

The average Sentiment orientation of all the phrases we gathered is computed.

This allows the machine to say something like:

"Generally people like the new iphone"

--> They recommend it

or

"Generally people hate the new iphone"

--> They don't recommend it

Openness

Things to read

The process...

Sentiment analysis in Text (SFS)

Opinion mining and sentiment analysis (Bo Pang, Lillian Lee)

Opinion Extraction, Summarization and Tracking in News and Blog Corpora (Ku, Liang, Chen)

Sentiment analysis and subjectivity (Bing Liu)

International sentiment analysis for news and blogs (Bautin)

Sentiment analysis: does coreference matter? (Nikolov)

CIKM Workshop on sentiment analysis

1 - Part-of-speech tagging (but also position and more):

The word in the text (or the sentence) at tagged using a POS-tagger so that it assigns a label to each word, allowing the machine to do something with it. It looks something like this:

Then we extract defined patterns like [Det] + [NN] for example

Classifying sentiments

S = subject

VP = Verb Phrase

V = Verb

N = Noun

NP = Noun Phrase

PP = Preposition

Det = Determiner

This is very difficult but experiments have been done using:

  - Naive Bayes (probabilistic classifier using Bayes theorem)

                        

  - Maximum Entropy (Uses probability distributions on the basis of partial knowledge)

  - Support vector machine (Data is set as 2 vectors in an n-dimensional space)

Pang et al. found the SVM to be the most accurate classifier (around 80%).

There are other methods being explored as well.

The process part 2

In the press

We look at sentiment orientation (SO) of the patterns we extracted. For example we may have extracted:

Amazing + Phone

which is: 

[JJ] + [NN] (or adjective followed by noun in human)

The opposite might be "Terrible" for example. In this stage, the machines tries to situate the words on an emotive scale (so to speak).

Google and sentiment analysis (SeoByTheSea)

5 ways sentiment analysis is ramping up in 2009 (RWW)

Mining the web for feelings not facts (NY Times)

Is sentiment analysis reliable? (Marketing Pilgrim)

Sentimental searching (Watching the watchers)

So does it work?

The wider your throw the net and the more complex the language, the less accurate the system will be. This is simply due to the level of complexity it has to deal with.

If you want to classifiy sentiments into +/- groups, then you are more likely to get a good result than if you are trying to classify into more exact groups (Excellent, incredible, good...). More granularity requires more accuracy and this in turn requires a deeper understanding of human language. 

There are commercial systems in place at this time and also systems like NaCTeM in the research space.

Language is ambiguous

Conscientiousness

Consider:

"The watch isn't water resistant" - In a product review this could be negative.

"As much use as a trapdoor on a lifeboat" - negative but not obvious to the machine.

"The canon camera is better than the Fisher Price one" - comparisons are hard to classify.

"imo the ice cream is luuurrrrrrvely" - slang and the way we communicate in general needs to be processed.

Why would we want to do this?

The Quintuple is hard to resolve

It allows business to track:

        - Flame detection (bad rants)

        - New product perception

        - Brand perception

        - Reputation management

        

It allows individuals to get:

    - An opinion on something (reviews) on a global scale

    

Oj = Named Entity Extraction

f jk = Information extraction

SO ijkl = Sentiment analysis

hi = Information extraction

Ti = Data extraction

All of these problems are as yet unsolved in computer science

{See Bing Liu}

Several fields of computing merge

Openness is a general appreciation for art, emotion, adventure, unusual ideas,

Natural language processing

It deals with the actual text element. It transforms it into a format that the machine can use.

Artificial intelligence

It uses the information given by the NLP and uses a lot of maths to determine whether

something is negative or positive: it is used for clustering.

What is it?

It's not always easy to

differentiate between fact

and opinion.

Extraversion

It's software for automatically extracting opinions, emotions and sentiments in text.

It allows us to track attitudes and feelings on the web. People write blog posts, comments, reviews and tweets about all sorts of different topics. 

We can track products, brands and people for example and determine whether they are viewed positively or negatively on the web. 

The problem has several dimensions

1 - How does a machine define subjectivity & sentiment?

2 - How does a machine analyse polaraity (negative/positive)?

3 - How does a machine deal with subjective word senses?

4 - How does a machine assign an opinion rating?

5 - How does a machine know about sentiment intensity?

We can analyze...

What is an opinion to a machine?

                      acts:

"The painting was more expensive than a Monet"

                    pinions:

"I honestly don't like Monet, Pollock is the better artist"

It is a "quintuple", an object made up of 5 different things:

Oj = The thing in question (i.e product) 

f jk = a feature of Oj

SO ijkl = the sentiment value of the opinion of the opinion holder hi on feature fjk of object oj at time tl

These 5 elements have to be identified by the machine

What is an opinion?

"a personal belief or judgment that is not founded on proof or certainty" (WordNet)

But:

“The fact that an opinion has been widely held is no evidence whatever that it is not utterly absurd.”

(Bertrand Russell)

How Sentiment Analysis

works in machines

(an introduction)

Agreeableness

Conscientiousness is a tendency to show self-discipline, act dutifully, and aim for achievement against measures or outside expectations.

Marie-Claire Jenkins                     

Extraversion is characterized by positive emotions, surgency, and the tendency to seek out stimulation and the company of others.

Neuroticism

Agreeableness is a tendency to be compassionate and cooperative rather than suspicious and antagonistic towards others.

Understanding Personality!

Neuroticism is the tendency to experience negative emotions, such as anger, anxiety, or depression

http://www.scienceforseo.com

Personality Cloning

Aesthetics,Narrative &

Design.

Intensions might be associated with the illustration of each of these traits

Lets Solve it!

Pleasing others != agreeableness

  • Do not go on face value !

Society

  • Look for intentions for they lead to personality

Peer Pressure

Family

  • Formulate a graph structure wherein the relationship between different intentions and projected sentiments can be mapped from

Sympathy seeking != neuroticism

Can I make a personality character based on these 5 traits ??!

No!

We still need intentions in play here !

How to figure out??!!

Reaction to reactions!

If I want sympathy I will react positively to it

else not... and so on

Learn more about creating dynamic, engaging presentations with Prezi