Tf-Idf and Cosine similarity

Seeking Wisdom

In the year 1998 Google handled 9800 average search queries every day. In 2012 this number shot up to 5.13 billion average searches per day. The graph given below shows this astronomical growth.

YearAverage Search per day
19989800
200060000000
20071200000000
20081745000000
20092610000000
20103627000000
20114717000000
20125134000000

googlesearchperday

The major reason for google’s success is because of its pageRank algorithm. PageRank determines how trustworthy and reputable a given website is. But there is also another part. The input query entered by the user should be used to match the relevant documents and score them. In this post I will focus on the second part. I have created 3 documents to show how this works. Take some time to go through them.

Document 1: The game of life is a game of everlasting learning
Document 2: The unexamined life is not worth living

View original post 1,226 more words