Skip to content

Commit

Permalink
Merge branch 'master' of github.com:PierreGe/RL-movie-recommender
Browse files Browse the repository at this point in the history
  • Loading branch information
Pierre Gérard committed Jan 5, 2017
2 parents 2cca6e0 + 70b7411 commit bc30baa
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion report/recommendation-rl.tex
Original file line number Diff line number Diff line change
Expand Up @@ -259,7 +259,7 @@ \subsection{Adaptation of \cite{main} to movie recommendation}
\end{center}
\end{figure}

This graph tells us that on the long term or mid term, the recommender will give us better result than a random selection of movie. Also it underlines the importance of balancing exploration and exploitation. Indeed, the result difference between different greedy approach is quite significant.
This graph tells us that on the long term or mid term, the recommender will give us better result than a random selection of movie. Also it underlines the importance of balancing exploration and exploitation. Indeed, the result difference between different greedy approach is quite significant. We can see that greedy UCB does not seem to work well.

\begin{figure}[H]
\begin{center}
Expand Down Expand Up @@ -377,6 +377,8 @@ \section{Discussion}
%On se rend compte que le greedy est meilleur que le random etc etc et ca on s'en fout on s'en doutait. Mais ce qui est fort c'est que le CF est meilleur que le CB. Pourtant dans le papier original il a implémenté que le CB. Donc pq ? est-ce qu'il s'en est juste pas douté ? ou pcq c'etait de la musique et pas des films. Si je ne m'abuse il dit qu'il choisit du CB pcq y a plus d'exploration et moins d'exploitation que du CF. Simplement pcq le CF propose plus facilement des films tres connus que peu connus, de part sa nature.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

The reason why the greedy-UCB algorithm does not seem to work well is that the UCB is very difficult to compute. Bayes-UCB should solve this problem by representing the rating as a random variable.

%\todo{would the result be correct recommendation, is it a viable way to do recommendation ? (expected result vs reality, time for a recommendation, accuracy, ....)}

%However, one could argues that the computation could be parallelized on multi-core or on a big data cluster. \todo{dernier phrase in discussion}
Expand Down

0 comments on commit bc30baa

Please sign in to comment.