top of page

The InvitAItion 

Eleven Strategy - DSBA Hackathon

How can we generate the best guest list for a luxe event to increase revenues? 

This was the hackathon objective presented to half of my MSc class by Eleven Strategy, a leading consulting firm based in Paris. After a random group allocation, we had 5 days to present the most convincing data-driven consulting proposal.

1280px-Logo_CentraleSupélec-1.svg.png
1280px-Logo_CentraleSupélec-1 2.svg.png

Context and Data provided

We were given around 55K transactions belonging to more than 13K different customers. Moreover, we had access to 10K event invitations, some of which were successful while others were not.

image.png

≈ 55K Transaction

image.png

≈ 13K Clients

image.png

10K Invitations

Our solution was tailored to a potential luxury business that wants to optimise the guests invited to luxury events. Optimal guests are those who are:

  • more likely to purchase a luxury item after having attended an event (classification model)

  • expected to spend the largest amount (regression model)

Current figures

After an initial data exploration, we noticed that only about 50% of the invitations sent were successful (the client attended the event). Therefore, the client's current invitation strategy is no better than flipping a coin. Moreover, we saw that most of the clients (93%) who accepted an invitation attended a single event. To have an unbiased estimate of the current average event gain we:

  • excluded the clients who attended multiple events

  • set a fixed window of 90 days to estimate the sales before and after a given event (gain)

image.png

≈ 50% Invitation Success

image.png

≈ 28K Mean Event Gain ($)

Our solution KPIs

Our solution will be evaluated on the following 3 KPIs:

  • Increase in average event gain → Business KPI

  • Increase in invitation success rate → Business KPI

  • Model performance/accuracy → Technical KPI

In particular, regarding model performance we have focused on recall for the classification model and MSE for the regression model. 

image.png

Proposed methodology to assess the true impact of an event

To assess the true impact of an event, we first identified customers with similar purchasing behaviour before they attended an event. To do so, we used a simple K-means unsupervised clustering algorithm. This step is crucial to obtain a base line. We then compared the average change in purchasing behaviour between clients that belong to the same cluster, given than only some of them attended a certain event. The difference between these two deltas gave us the true event gain, as shown by the diagram below:

background.jpg

Clustering

For each cluster

image.png

For each event

image.png
image_edited.png
image.png
image_edited.png

AVG sales

before

the event

AVG sales

after

the event

image.png
image.png

Did not attend 

event

customer-satisfaction copy.png
image.png
customer-satisfaction copy.png

Attended event

AVG ATTENDEES

GAIN

image.png
image_edited.png

AVG sales

before

the event

AVG sales

after

the event

image.png

AVG BASE

GAIN

image.png

TRUE

EVENT 

GAIN

Clustering

Following our clustering analysis, we identified 6 clusters:

image.png

Loyal Shoppers

image.png

Women's Style Enthusiasts

image.png

Men's Fashion Explorers

image.png

Rare Shoppers

image.png

Tiny Trends

image.png

Timepieces Lovers 

# Clients

297 (2%)

8719 (67%)

2364 (18%)

1286 (10%)

257 (2%)

163 (1%)

Sales (€ M)

31 (30%)

58 (56%)

7 (8%)

3 (4%)

0.5 (0.5%)

2 (1%)

# Purchases

30

3

2

1

2

2

Customer life (Month)

16

3

2

1

2

3

To cluster the existing clients into different groups, we mainly focused on their purchasing behaviour. Variables such as:

  • purchasing amount 

  • number of purchases 

  • categories bought

  • customer life time (month) 

​are some of the indicators we used to cluster existing customers. Such variable are very strong indicators of future consumer behaviour, and should therefore be prioritised. On the other hand, demographic variables can be used to reach and find customers. 

Successful event 

How can we understand and visualise the true impact on sales of a given event? Let's follow the steps explained in the proposed methodology to visually assess the monetary average gain of a given event.

AVG

Sales ($)

Cluster 1

time

AVG sales from people belonging to a specific cluster and who attended a given a even

AVG sales from people belonging to a specific cluster and who did not attend a given a even

Cluster 2

Event

Gain due to event

Gain due to external factors

As we can see from the illustration above, for both clusters we notice that the average purchasing amount changed after the event took place. For example, if we focus just on cluster 1, both subgroups (attending and not attending people) increased their spendings. However, the magnitude of such increases is higher for those who attended the event than for those who did not. As both subgroups belong to the same cluster, we can assume that they had similar purchasing behaviours before the event took place. Therefore, the difference between the attendees gain and the base gain can be attributed to the event itself, and is highlighted by the blue rectangle. If we repeat the same procedure for all clusters and sum up all the correspondent event gains we obtain the total true event gain. We can categorise the event above as successful because there is a gain due to the event. 

Unsuccessful event 

On the contrary, below is the representation of an event that is not successful, as both subgroups of customers belonging to the same cluster experienced the same increase in spendings. Such an increase can be attributed to external factors such as improved ecomic conditions or seasonal events (e.g., Christmas).

AVG

Sales ($)

Cluster 1

time

AVG sales from people belonging to a specific cluster and who attended a given a even

AVG sales from people belonging to a specific cluster and who did not attend a given a even

Cluster 2

Event

Gain due to event (not present)

Gain due to external factors

Scoring Methodology

Now that we have understood how to classify an event as successful, let's focus on finding the optimal guests to invite. To quantify guests "optimality" we will use a simple scoring metric. We can obtain such metric by multiplying the following two figures:

  • client's expected purchasing amount after attending the event (regression model)

  • client's likelihood of attending an event after receiving an invitation (classification model)

Expected Sales of customer after attending an event

image.png

Monetary value

Likelihood

of attending an event after being invited

image.png

Probability

=

SCORE

Customers ranked in descending order

image.png

Monetary value

Automated platform

We can now focus on the product we would implement if the client were to choose us as its consultants. We prototyped a fully working platform that takes as inputs the characteristics of a new event. It then outputs the optimal guests list for such event. Among the input variables the client could choose:

  • type of event

  • product to promote

  • invitation channel

  • number of invitees

  • cost per attending invitee

  • other product-related variables 

dash.png

The optimal guests list given as output includes the clients with the highest score. The total gain is computed as the sum of the individual scores minus the total expected costs to organise the event (number of invitees x cost per invitee).

Additional Services offered

Being a data-driven consulting company, we also had to come up with additional services to sell to our client. Therefore, we thought of providing:

  • weekly training for an easy onboarding of the solution

  • maintenance of the model and dashboard

  • improved data collection process

  • continuous model retraining with new data

image.png

Trainings

image.png

Maintenance

image.png

Data Collection

image.png

Continuos Learning

Our Results

Compared to the initial situation, we were able to substantially improve important metrics. While the initial invitation rate of success was around 50%, we were able to take it all the way up to 89% accuracy, with a recall for successful invitation (category 1) of 91%. Regarding the average expected gain per event, our model predicted around €49K, a significant improvement over the initial 29K. 

image.png

≈ 89% Invitation Success

image.png

≈ 49K Mean Event Gain ($)

However, it is important to notice that the competition's scope was not providing a perfect model/solution, as we only had 5 days to work on the project. The goal of the hackathon was crafting the most convincing data-consulting proposal, with the goal of being chosen by a potential client. In real life, providing a solid solution for such a project would require months of work and thorough analysis. 

Skills acquired/improved and tools used in the project

Taking part on this hackathon was very demanding and rewarding at the same time. Random group allocations was another challenging factor, which is meant to replicate even more a real work environment. I was lucky enough to work with equally driven and qualified people, without whom it would have been much harder to win.

Python-Logo-PNG-Image-1.png
image.png

SKlearn

image.png

Pandas

mkt.png
image.png

Clustering

image.png

Client scoring

image.png
image.png
image.png

Team leading

motivation.png
image.png

Team building

I took the role of project leader, with the objective to motivate everyone, work together towards our common goal, and present our final solution. I believe that a good leader is someone able to:

  1. listen to every team member 

  2. identify each member's strength 

  3. have a deep technical understanding 

  4. be humble enough to trust and delegate 

  5. be a strong believer and motivator 

Every member of the team needs an environment where they feel valued, listened, and motivated.

This is why I truly believe that great leaders are great environment creators. 

image.png
bottom of page