Loading presentation...

Present Remotely

Send the link below via email or IM

Copy

Present to your audience

Start remote presentation

  • Invited audience members will follow you as you navigate and present
  • People invited to a presentation do not need a Prezi account
  • This link expires 10 minutes after you close the presentation
  • A maximum of 30 users can follow your presentation
  • Learn more about this feature in our knowledge base article

Do you really want to delete this prezi?

Neither you, nor the coeditors you shared it with will be able to recover it again.

DeleteCancel

Make your likes visible on Facebook?

Connect your Facebook account to Prezi and let your likes appear on your timeline.
You can change this under Settings & Account at any time.

No, thanks

The Kaggle Project

No description
by

shruti tripathi

on 30 June 2014

Comments (0)

Please log in to add your comment.

Report abuse

Transcript of The Kaggle Project

Regression Techniques
So far we have run logistic regression on the data in SAS and have achieved a concordance of 62%
Problem Description
Companies often offer discounts on products to attract customers.
This challenge asks the participants to predict those customers who will return to purchase the same product post an initial incentive.
The data provided includes a basket level pre-offer transaction history which dates to approximately a year prior to the date on which the offer was made is provided and a sample prediction of the post incentive behavior of a subset of the total number of customers is made.
Phases of the project
Understanding the data

Variable creation

Running regression on data

Improving prediction model
Understanding the variables
We are provided with four relational files:

transactions.csv - contains transaction history for all customers for a period of at least 1 year prior to their offered incentive

trainHistory.csv - contains the incentive offered to each customer and information about the behavioral response to the offer

testHistory.csv - contains the incentive offered to each customer but does not include their response (we are predicting the repeater column for each id in this file)

offers.csv - contains information about the offers

Challenge Name :
Acquire Valued Shoppers Challenge
Objective :
To predict which shoppers will become repeat buyers
The Kaggle Project
Variable Creation
We have analyzed all these files and used the existing variables to further create more derived variables using the RFM model and our own ideas
Improving prediction model
We are currently working on improving the prediction model and are testing techniques such as Random Forest for better variable creation and Poisson Distribution for more accurate prediction
Thank You!
Full transcript