-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathc139.txt
60 lines (46 loc) · 1.45 KB
/
c139.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Demographic Filtering- They offer generalized recommendations to every
user, based on movie popularity and/or genre.
We need a metric or score to
rate movie
● We then need to calculate
the score for every movie
● We finally need to sort the
scores and recommend the
best rated movie to the users
We first need a metric or a score to
rate the movie. Average rating, but nope.
The weighted rating formula:
((v/(v+m))*R)+((m/(v+m))*C)
Here,
● v - The number of votes for the
movies (or number of
ratings/reviews in case of an
amazon product)
● m - The minimum votes
required to be listed in the
chart
● R - Average rating of the movie
● C - Mean votes across the
whole report
If we look at our data, we already
have v(vote count) and
R(vote_average)! We can calculate
the C by calculating the mean of all
the vote averages.
Next, we want to determine an
appropriate value for m, the
minimum votes required to be listed
in the chart. Do you know how we can
do that?
We can take the 90th Percentile as
our cutoff. In other words, for a movie
to feature in the charts, it must have
more votes than at least 90% of the
rest of the movies in the list.
This is called a quantile. In future you
will be writing competitive exams
where this quantile concept will come
to place. For instance, if you get 90%
quantile then it means, you were
better than 90% of the other students
who appeared for the exam.