Request: | Word |
Response: | WordID |
Request: | WordID |
Response: | DocID1, PageRank, number of hits, DocID2, PageRank, number of hits, ... |
Request: | WordID |
Response: | DocID1, PageRank, hit-list, DocID2, PageRank, hit-list, ... |
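The three request/response pairs above can be pictured as simple lookups. The sketch below is only an illustration with in-memory dictionaries: the names LEXICON, INDEX, lookup_word_id, lookup_hit_counts and lookup_hit_lists, the sample data, and the choice to represent a hit as a word position are all assumptions, not part of the original design. The flat response lists are grouped into tuples for readability.

```python
from typing import Dict, List, Tuple

# Hypothetical stand-in for the word -> WordID mapping (sample data only).
LEXICON: Dict[str, int] = {"search": 1, "engine": 2}

# Hypothetical stand-in for the index: WordID -> [(DocID, PageRank, hit-list)].
# A hit is assumed here to be a word position within the document.
INDEX: Dict[int, List[Tuple[int, float, List[int]]]] = {
    1: [(10, 0.8, [3, 17, 42]), (11, 0.5, [5])],
    2: [(10, 0.8, [4])],
}


def lookup_word_id(word: str) -> int:
    """Request: Word. Response: WordID."""
    return LEXICON[word]


def lookup_hit_counts(word_id: int) -> List[Tuple[int, float, int]]:
    """Request: WordID. Response: (DocID, PageRank, number of hits) per document."""
    return [(doc_id, pagerank, len(hits)) for doc_id, pagerank, hits in INDEX[word_id]]


def lookup_hit_lists(word_id: int) -> List[Tuple[int, float, List[int]]]:
    """Request: WordID. Response: (DocID, PageRank, hit-list) per document."""
    return list(INDEX[word_id])
```

With this sample data, lookup_hit_counts(lookup_word_id("search")) returns one entry for each of the two documents that contain the word, carrying only the hit count rather than the full hit-list.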
Suppose the user queries for two words, WordA and WordB. Doc1 contains 500 instances of WordA but only 1 instance of WordB. Doc2, on the other hand, contains 200 instances each of WordA and WordB. Doc2 is clearly more relevant to the user's query, because it covers both words well. The function described above ensures that the total points of Doc2 are higher than those of Doc1.
To calculate the Total-Rank of a page, we add the points for each queried word to the normalized PageRank. The PageRank is normalized by multiplying it by some weight. This allows us to control the effect that the PageRank has on the Total-Rank. The formula for calculating the Total-Rank is given below:
Total-Rank = (Wt x PageRank) + Pts(Word1) + Pts(Word2) + ...
Pts(Word1) is the number of points awarded for the occurrences of Word1 in the page.
Wt is the weight by which the PageRank is multiplied to normalize it.
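The Total-Rank formula can be tried out on the WordA/WordB example with a short sketch. The points function is not reproduced in this section, so the code below assumes a hypothetical diminishing-returns function, pts(count) = ln(1 + count). This is only one choice that makes Doc2 outscore Doc1 and is not necessarily the function the system uses. The names pts and total_rank, the default weight of 1.0, and the PageRank value of 0.5 are likewise illustrative.

```python
import math
from typing import Iterable


def pts(count: int) -> float:
    """Assumed points function: grows with the hit count, with diminishing returns."""
    return math.log1p(count)


def total_rank(pagerank: float, hit_counts: Iterable[int], wt: float = 1.0) -> float:
    """Total-Rank = (Wt x PageRank) + Pts(Word1) + Pts(Word2) + ..."""
    return wt * pagerank + sum(pts(count) for count in hit_counts)


# Both documents are given the same (made-up) PageRank so that only the
# word points differ.
doc1 = total_rank(0.5, [500, 1])    # 500 hits for WordA, 1 hit for WordB
doc2 = total_rank(0.5, [200, 200])  # 200 hits each for WordA and WordB
```

Under this assumed function, doc2 comes out higher than doc1, because the 500 hits for WordA add little beyond what 200 hits already earn, while the single hit for WordB earns very little. Raising wt makes the PageRank term count for more relative to the word points.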
Total-Rank = (Wt x PageRank) + Pts. from proximity + Pts. due to font size, title/anchor, etc.