Department of Mathematics and Computer Science, Adam Mickiewicz University, 2023
Author: Paweł Łączkowski
- Project description
- Data preprocessing
- User features (for hybrid models)
- Item features (for hybrid models)
- Recommender creation
- Non-hybrid models (architectures)
- Hybrid models (architectures)
- Recommender tuning
- Recommender evaluation
- Conclusions
- Requirements
The main goal of the project was to create a neural network recommender for hotel data (room recommendations) whose HR@10 metric score would be higher than that of the Amazon recommender.
To complete this task, the following subtasks had to be accomplished:
- Preparation of data (data preprocessing for hybrid models).
- Preparation of user features (for hybrid models).
- Preparation of item features (for hybrid models).
- Creating a recommender (prepare network architectures).
- Tuning the recommender.
- Evaluation of the recommender.
The entire process of preparing the data can be found in the file `project_1_data_preparation`.
It covers the creation of new features based on the available data, such as `length_of_stay` or `weekend_stay`, and the bucketing of important features.
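A minimal pandas sketch of how such features can be derived (the column names `date_from`/`date_to`, the bucket edges and the exact weekend definition are my assumptions, not taken from the notebook):

```python
import pandas as pd

def add_engineered_features(df: pd.DataFrame) -> pd.DataFrame:
    """Add length_of_stay, weekend_stay and an example bucketed feature."""
    df = df.copy()
    df["date_from"] = pd.to_datetime(df["date_from"])
    df["date_to"] = pd.to_datetime(df["date_to"])
    # length_of_stay: number of nights between check-in and check-out
    df["length_of_stay"] = (df["date_to"] - df["date_from"]).dt.days
    # weekend_stay: does any night of the stay fall on a Friday or Saturday?
    df["weekend_stay"] = df.apply(
        lambda row: any(
            night.weekday() in (4, 5)  # 4 = Friday, 5 = Saturday
            for night in pd.date_range(row["date_from"], row["date_to"], freq="D")[:-1]),
        axis=1)
    # Bucketing an important numeric feature into a small number of categories
    df["length_of_stay_bucket"] = pd.cut(
        df["length_of_stay"], bins=[0, 1, 3, 7, 1000],
        labels=["[0-1]", "[2-3]", "[4-7]", "[8-inf]"], include_lowest=True)
    return df
```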
The task was to extract numerical features for users from the data that could be used to train the recommendation model.
User features are distributions of individual item features over all items with which that user interacted.
In other words, they show what proportion of all of a user's interactions were, for example, `Christmas term` or `Weekend Stay` interactions.
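A minimal sketch of this idea, assuming `interactions` has `user_id` and `item_id` columns and `item_features_df` is the one-hot item-feature matrix described in the next section:

```python
import pandas as pd

def prepare_users_features(interactions: pd.DataFrame,
                           item_features_df: pd.DataFrame) -> pd.DataFrame:
    """User features = per-user mean of the one-hot item features of interacted items."""
    merged = interactions[["user_id", "item_id"]].merge(item_features_df, on="item_id")
    feature_cols = [col for col in item_features_df.columns if col != "item_id"]
    # Each value is the fraction of the user's interactions that had a given item feature,
    # e.g. what share of their bookings were weekend stays or Christmas-term stays.
    return merged.groupby("user_id")[feature_cols].mean()
```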
The task was to extract numerical features for items from the data that could be used to train the recommendation model.
Item features were prepared in a similar way to the user features, because the scalar product of user and item features is later used to train the recommender.
Item features are one-hot encoded.
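A minimal illustration of the encoding step (the item attributes below are invented for the example):

```python
import pandas as pd

# Invented item attributes, just to show the encoding step
items_df = pd.DataFrame({
    "item_id": [0, 1, 2],
    "term": ["Christmas", "Easter", "OffSeason"],
    "weekend_stay": [True, False, True],
})
# One-hot encode the categorical columns into numeric item features
item_features_df = pd.get_dummies(items_df, columns=["term", "weekend_stay"], dtype=int)
```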
The recommenders are included in the file `project_2_recommender_and_evaluation - final`.
The recommender fitting works as follows (a sketch of the negative-sampling step follows the list):
- Prepare users and items features (for hybrid models).
- Generate negative interactions.
- Prepare input for training machine learning model.
- Train machine learning model.
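The negative-sampling step is the least obvious one, so here is a minimal sketch (column names and the `n_neg_per_pos` default are illustrative, not the exact notebook code):

```python
import numpy as np
import pandas as pd

def generate_negative_interactions(interactions_df: pd.DataFrame,
                                   all_item_ids: np.ndarray,
                                   n_neg_per_pos: int = 5,
                                   seed: int = 0) -> pd.DataFrame:
    """Return positives (interacted = 1) plus sampled negatives (interacted = 0)."""
    rng = np.random.default_rng(seed)
    positives = interactions_df[["user_id", "item_id"]].drop_duplicates().assign(interacted=1)
    negative_frames = []
    for user_id, user_items in positives.groupby("user_id"):
        # Items the user has never interacted with are negative candidates
        unseen = np.setdiff1d(all_item_ids, user_items["item_id"].values)
        n_samples = min(len(unseen), n_neg_per_pos * len(user_items))
        sampled = rng.choice(unseen, size=n_samples, replace=False)
        negative_frames.append(
            pd.DataFrame({"user_id": user_id, "item_id": sampled, "interacted": 0}))
    return pd.concat([positives] + negative_frames, ignore_index=True)
```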
The recommendations work as follows (a sketch follows the list):
- Prepare users and items features (for hybrid models).
- Calculate score (predict values) for input data.
- Get highest scores from list.
- Return n items with the highest scores.
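A minimal sketch of this scoring-and-ranking step (the `model.predict(user_ids, item_ids)` interface is an assumption, not the exact notebook API):

```python
import numpy as np
import pandas as pd

def recommend_top_n(model, user_id, candidate_item_ids: np.ndarray, n: int = 10) -> pd.DataFrame:
    """Score every candidate item for one user and return the n highest-scoring items."""
    user_ids = np.full(len(candidate_item_ids), user_id)
    # Assumed interface: the fitted model returns one score per (user, item) pair
    scores = np.asarray(model.predict(user_ids, candidate_item_ids))
    top_idx = np.argsort(scores)[::-1][:n]
    return pd.DataFrame({"user_id": user_id,
                         "item_id": candidate_item_ids[top_idx],
                         "score": scores[top_idx]})
```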
My experiments consisted of testing recommenders based on several different architectures.
Throughout the workflow, I prepared 12 different architectures: 7 based solely on embedding layers and 5 on embedding layers with the addition of the content-based features that I prepared earlier for Project 1.
```
(item_embedding): Embedding(763, 6)
(user_embedding): Embedding(14503, 6)
(fc): Linear(in_features=6, out_features=1, bias=False)
```
Attention! The number of neurons in the architecture depends on the embedding dimension (in the example it is set to 6).
This architecture multiplies the user and item embeddings element-by-element and then passes the result to a fully-connected layer. Finally, a sigmoid function is applied.
The best result (with additional manual tuning): 0.248812
This neural network is really simple, but it also gives pretty good results.
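A minimal PyTorch sketch of this architecture; the class and argument names are my own, only the layer shapes follow the printout above:

```python
import torch
import torch.nn as nn

class GMFNet(nn.Module):
    """Element-wise product of user and item embeddings followed by one linear layer."""

    def __init__(self, n_users=14503, n_items=763, embedding_dim=6):
        super().__init__()
        self.user_embedding = nn.Embedding(n_users, embedding_dim)
        self.item_embedding = nn.Embedding(n_items, embedding_dim)
        self.fc = nn.Linear(embedding_dim, 1, bias=False)

    def forward(self, user_ids, item_ids):
        # Element-by-element product of the two embeddings
        product = self.user_embedding(user_ids) * self.item_embedding(item_ids)
        # Fully-connected layer + sigmoid => interaction score in (0, 1)
        return torch.sigmoid(self.fc(product)).squeeze(-1)
```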
```
(item_embedding): Embedding(763, 6)
(user_embedding): Embedding(14503, 6)
(fc1): Linear(in_features=6, out_features=8, bias=False)
(fc2): Linear(in_features=8, out_features=1, bias=False)
```
Attention! The number of neurons in the architecture depends on the embedding dimension (in the example it is set to 6).
The best result: 0.179009
This neural network performs significantly worse than its simpler counterpart, so I decided to give up on this architecture after 50+ evaluations.
These architectures perform significantly worse than their simpler counterparts, so I decided to give up on all of them.
```
(item_embedding): Embedding(763, 6)
(user_embedding): Embedding(14503, 6)
(gmf_user_embedding): Embedding(14503, 6)
(gmf_item_embedding): Embedding(763, 6)
(mlp_user_embedding): Embedding(14503, 6)
(mlp_item_embedding): Embedding(763, 6)
(mlp_fc1): Linear(in_features=12, out_features=16, bias=True)
(mlp_fc2): Linear(in_features=16, out_features=8, bias=True)
(fc): Linear(in_features=14, out_features=1, bias=False)
```
Attention! The number of neurons in the architecture depends on the embedding dimension (in the example it is set to 6).
This architecture uses two separate pairs of embeddings for users and items.
The first pair (`gmf_user_embedding` and `gmf_item_embedding`) is multiplied element-by-element.
The second pair (`mlp_user_embedding` and `mlp_item_embedding`) is concatenated into one vector and then passed through two fully-connected layers.
The two results are then concatenated and passed through a final fully-connected layer. Finally, a sigmoid function is applied.
This architecture performs really well, but in the end it achieved a final result slightly worse than the first one.
The best result: 0.240665
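A minimal PyTorch sketch of this architecture (the plain `user_embedding`/`item_embedding` pair from the printout, which is not mentioned in the description, is omitted, and the ReLU activations in the MLP branch are an assumption):

```python
import torch
import torch.nn as nn

class GMFMLPNet(nn.Module):
    """GMF branch (element-wise product) + MLP branch (concatenation) combined by a final linear layer."""

    def __init__(self, n_users=14503, n_items=763, embedding_dim=6):
        super().__init__()
        self.gmf_user_embedding = nn.Embedding(n_users, embedding_dim)
        self.gmf_item_embedding = nn.Embedding(n_items, embedding_dim)
        self.mlp_user_embedding = nn.Embedding(n_users, embedding_dim)
        self.mlp_item_embedding = nn.Embedding(n_items, embedding_dim)
        self.mlp_fc1 = nn.Linear(2 * embedding_dim, 16)
        self.mlp_fc2 = nn.Linear(16, 8)
        self.fc = nn.Linear(embedding_dim + 8, 1, bias=False)

    def forward(self, user_ids, item_ids):
        # GMF branch: element-by-element product of embeddings
        gmf_out = self.gmf_user_embedding(user_ids) * self.gmf_item_embedding(item_ids)
        # MLP branch: concatenated embeddings passed through two fully-connected layers
        mlp_in = torch.cat([self.mlp_user_embedding(user_ids),
                            self.mlp_item_embedding(item_ids)], dim=-1)
        mlp_out = torch.relu(self.mlp_fc2(torch.relu(self.mlp_fc1(mlp_in))))
        # Concatenate both branches and apply the final layer + sigmoid
        combined = torch.cat([gmf_out, mlp_out], dim=-1)
        return torch.sigmoid(self.fc(combined)).squeeze(-1)
```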
These architectures perform really badly, and their results are below any reasonable level, so I decided to give up on them.
```
(item_embedding): Embedding(763, 6)
(user_embedding): Embedding(14503, 6)
(gmf_user_embedding): Embedding(14503, 6)
(gmf_item_embedding): Embedding(763, 6)
(mlp_user_embedding): Embedding(14503, 6)
(mlp_item_embedding): Embedding(763, 6)
(mlp_fc1): Linear(in_features=12, out_features=16, bias=True)
(mlp_fc2): Linear(in_features=16, out_features=8, bias=True)
(content_based): Linear(in_features=24, out_features=8, bias=True)
(cb_fc1): Linear(in_features=48, out_features=16, bias=True)
(cb_fc2): Linear(in_features=16, out_features=8, bias=True)
(fc1): Linear(in_features=30, out_features=16, bias=True)
(fc2): Linear(in_features=16, out_features=1, bias=True)
```
Attention! The number of neurons in the architecture depends on the embedding dimension (in the example it is set to 6).
This architecture combines the GMF+MLP architecture with content-based features.
Content-based features are provided in two ways:
- as element-by-element multiplied vectors of user and item features,
- as concatenated vectors of user and item features.
Multiplied feature vectors are passed through one fully-connected layer without any activation function.
Concatenated vectors are passed through two fully-connected layers with ReLU activation functions.
In the end, all results are concatenated and passed through two final fully-connected layers. Finally, a sigmoid function is applied.
This architecture works really well and gives overall better results than the first (non-hybrid) neural network.
The best result: 0.255601
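A minimal PyTorch sketch of this hybrid architecture (the 24-dimensional content-based feature vectors and the ReLU between the two final layers are assumptions on my part):

```python
import torch
import torch.nn as nn

class HybridGMFMLPNet(nn.Module):
    """GMF + MLP branches extended with content-based user/item feature vectors."""

    def __init__(self, n_users=14503, n_items=763, embedding_dim=6, n_features=24):
        super().__init__()
        self.gmf_user_embedding = nn.Embedding(n_users, embedding_dim)
        self.gmf_item_embedding = nn.Embedding(n_items, embedding_dim)
        self.mlp_user_embedding = nn.Embedding(n_users, embedding_dim)
        self.mlp_item_embedding = nn.Embedding(n_items, embedding_dim)
        self.mlp_fc1 = nn.Linear(2 * embedding_dim, 16)
        self.mlp_fc2 = nn.Linear(16, 8)
        self.content_based = nn.Linear(n_features, 8)   # multiplied feature vectors
        self.cb_fc1 = nn.Linear(2 * n_features, 16)      # concatenated feature vectors
        self.cb_fc2 = nn.Linear(16, 8)
        self.fc1 = nn.Linear(embedding_dim + 8 + 8 + 8, 16)
        self.fc2 = nn.Linear(16, 1)

    def forward(self, user_ids, item_ids, user_features, item_features):
        # GMF and MLP branches, as in the non-hybrid architecture
        gmf_out = self.gmf_user_embedding(user_ids) * self.gmf_item_embedding(item_ids)
        mlp_in = torch.cat([self.mlp_user_embedding(user_ids),
                            self.mlp_item_embedding(item_ids)], dim=-1)
        mlp_out = torch.relu(self.mlp_fc2(torch.relu(self.mlp_fc1(mlp_in))))
        # Content-based path 1: element-by-element product, one layer, no activation
        cb_mul = self.content_based(user_features * item_features)
        # Content-based path 2: concatenation, two layers with ReLU
        cb_cat = torch.relu(self.cb_fc2(torch.relu(self.cb_fc1(
            torch.cat([user_features, item_features], dim=-1)))))
        # Concatenate everything and apply the two final layers + sigmoid
        combined = torch.cat([gmf_out, mlp_out, cb_mul, cb_cat], dim=-1)
        return torch.sigmoid(self.fc2(torch.relu(self.fc1(combined)))).squeeze(-1)
```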
```
(item_embedding): Embedding(763, 6)
(user_embedding): Embedding(14503, 6)
(gmf_user_embedding): Embedding(14503, 6)
(gmf_item_embedding): Embedding(763, 6)
(mlp_user_embedding): Embedding(14503, 6)
(mlp_item_embedding): Embedding(763, 6)
(mlp_fc1): Linear(in_features=12, out_features=16, bias=True)
(mlp_fc2): Linear(in_features=16, out_features=8, bias=True)
(content_based): Linear(in_features=24, out_features=8, bias=True)
(fc1): Linear(in_features=22, out_features=16, bias=True)
(fc2): Linear(in_features=16, out_features=1, bias=True)
```
Attention! The number of neurons in the architecture depends on the embedding dimension (in the example it is set to 6).
This architecture combines the GMF+MLP architecture with content-based features.
The difference between GMF + MLP + Content-based #2 and this architecture is that this architecture uses only the multiplied vectors of user and item features.
This architecture works really well and in the end got the best score among all architectures.
The best result (with additional manual tuning): 0.265105
```
(item_embedding): Embedding(763, 6)
(user_embedding): Embedding(14503, 6)
(gmf_user_embedding): Embedding(14503, 6)
(gmf_item_embedding): Embedding(763, 6)
(mlp_user_embedding): Embedding(14503, 6)
(mlp_item_embedding): Embedding(763, 6)
(mlp_fc1): Linear(in_features=12, out_features=16, bias=True)
(mlp_fc2): Linear(in_features=16, out_features=8, bias=True)
(content_based): Linear(in_features=24, out_features=8, bias=True)
(content_based_sep): Linear(in_features=48, out_features=8, bias=True)
(fc1): Linear(in_features=30, out_features=16, bias=True)
(fc2): Linear(in_features=16, out_features=1, bias=True)
```
Attention! The number of neurons in the architecture depends on the embedding dimension (in the example it is set to 6).
This architecture combines the GMF+MLP architecture with content-based features.
It is similar to the GMF + MLP + Content-based #1 architecture.
The difference is that this architecture passes the concatenated feature vectors through a single fully-connected layer without an activation function, instead of two layers with ReLU.
This architecture works really well and in the end got the second best score among all architectures.
The best result (with additional manual tuning): 0.262729
Recommender tuning involves properly training the machine learning models used to predict scores.
Many tuning runs were performed for all of the models mentioned above (only a small part of the tuning process is included in the files).
Most of the tuning results are in the `results_csv` directory.
I also hand-tuned the best models for maximum performance and the highest scores. The tuned hyperparameters and their ranges were as follows (a search sketch follows the list):
- `n_neg_per_pos` - the number of negative interactions per positive one (`1-10`).
- `n_epochs` - the number of training epochs (`1-100`).
- `batch_size` - the batch size. I used values that are powers of two: `32`, `64`, `128`, `256`, `512` and `1024`.
- `embedding_dim` - the embedding size. At first I used values that are powers of two: `8`, `16`, `32` and `64`. Later, I also decided to include other values: `3`, `4`, `6`, `10`, `12`, `20`, `24`, `48` and `56`.
- `learning_rate` - the learning rate (`0.001-0.03`). I tested various ranges and eventually chose the above one, as it gave the most stable results.
- `weight_decay` - the L2 regularization weight (`0.001-0.03`). I tested various ranges and finally chose the same range as for `learning_rate`, because it gave the most stable results.
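A hedged sketch of a simple random search over these ranges; the actual tuning in the notebooks may use a different search procedure, and `evaluate_fn` stands in for fitting a recommender and measuring its HR@10:

```python
import random
from typing import Callable, Dict

def random_search(evaluate_fn: Callable[[Dict], float], n_trials: int = 100, seed: int = 0):
    """Sample hyperparameters from the ranges described above and keep the best HR@10."""
    rng = random.Random(seed)
    space = {
        "n_neg_per_pos": lambda: rng.randint(1, 10),
        "n_epochs": lambda: rng.randint(1, 100),
        "batch_size": lambda: rng.choice([32, 64, 128, 256, 512, 1024]),
        "embedding_dim": lambda: rng.choice([3, 4, 6, 8, 10, 12, 16, 20, 24, 32, 48, 56, 64]),
        "learning_rate": lambda: rng.uniform(0.001, 0.03),
        "weight_decay": lambda: rng.uniform(0.001, 0.03),
    }
    best_params, best_hr10 = None, float("-inf")
    for _ in range(n_trials):
        params = {name: sample() for name, sample in space.items()}
        hr10 = evaluate_fn(params)   # caller fits a recommender and returns its HR@10
        if hr10 > best_hr10:
            best_params, best_hr10 = params, hr10
    return best_params, best_hr10
```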
The final results are as follows (models whose results were unsatisfactory were omitted):
| Recommender | HR@10 |
|---|---|
| NNRecommender1 | 0.248812 |
| NNRecommender7 | 0.240665 |

| Recommender | HR@10 |
|---|---|
| NNRecommender3Hybrid | 0.255601 |
| NNRecommender4Hybrid | 0.265105 |
| NNRecommender5Hybrid | 0.262729 |

| Recommender | HR@10 |
|---|---|
| LinearRegressionCBUIRecommender | 0.243720 |
| RandomForestCBUIRecommender | 0.233876 |
| XGBoostCBUIRecommender | 0.252206 |
| CatBoostRegressorCBUIRecommender | 0.244060 |

| Recommender | HR@10 |
|---|---|
| AmazonRecommender | 0.223693 |

| Recommender | HR@10 |
|---|---|
| NNRecommender1 | 0.248812 |
| NNRecommender7 | 0.240665 |
| NNRecommender3Hybrid | 0.255601 |
| NNRecommender4Hybrid | 0.265105 |
| NNRecommender5Hybrid | 0.262729 |
| LinearRegressionCBUIRecommender | 0.243720 |
| RandomForestCBUIRecommender | 0.233876 |
| XGBoostCBUIRecommender | 0.252206 |
| CatBoostRegressorCBUIRecommender | 0.244060 |
| AmazonRecommender | 0.223693 |
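For reference, the HR@10 metric used in all of the tables can be computed roughly as follows (a sketch; the exact evaluation procedure is in the notebook):

```python
def hit_ratio_at_k(recommended_items: dict, test_interactions: dict, k: int = 10) -> float:
    """HR@k: fraction of test users whose held-out item appears in their top-k recommendations.

    recommended_items: user_id -> ordered list of recommended item_ids
    test_interactions: user_id -> held-out item_id (or a set of item_ids)
    """
    hits = 0
    for user_id, relevant in test_interactions.items():
        relevant_set = relevant if isinstance(relevant, set) else {relevant}
        top_k = recommended_items.get(user_id, [])[:k]
        if relevant_set & set(top_k):
            hits += 1
    return hits / len(test_interactions) if test_interactions else 0.0
```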
As I mentioned above, all architectures based on GMF+MLP worked really well. On average, they obtained much better results than the other models. Hybrid models based on GMF+MLP were especially good, and they gave me the highest results, above 0.26.
A great failure were the models with many fully-connected layers and a large number of neurons. They had trouble learning anything, and even when they did learn something, the results were much worse than those obtained by Amazon's recommender.
In the end, I managed to get a score of `0.265105`, which beat Amazon's recommender by `0.041412` points.
The project is written in Python.
All the necessary dependencies can be found in the files `requirements.txt` and `environment.yml`.
To run the project:
- Fork this repository.
- Clone your repository: `git clone your_repository_address`
- Create the environment: `conda env create --name name_of_environment -f environment.yml`
- Activate your environment: `conda activate name_of_environment`
- Launch Jupyter Notebook: `jupyter notebook`
- Enjoy!