HN Books @HNBooksMonth

The best books of Hacker News.

Hacker News Comments on
Introduction to Information Retrieval

Christopher D. Manning, Prabhakar Raghavan, Hinrich Schütze · 5 HN comments
HN Books has aggregated all Hacker News stories and comments that mention "Introduction to Information Retrieval" by Christopher D. Manning, Prabhakar Raghavan, Hinrich Schütze.
View on Amazon [↗]
HN Books may receive an affiliate commission when you make purchases on sites after clicking through links on this page.
Amazon Summary
Class-tested and coherent, this groundbreaking new textbook teaches web-era information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. Written from a computer science perspective by three leading experts in the field, it gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Although originally designed as the primary text for a graduate or advanced undergraduate course in information retrieval, the book will also create a buzz for researchers and professionals alike.
HN Books Rankings

Hacker News Stories and Comments

All the comments and stories posted to Hacker News that reference this book.
I'm also very interested in search engines this are two books I would recommend:

Introduction to Information Retrieval: https://www.amazon.com/Introduction-Information-Retrieval-Ch...

Search User Interfaces: https://www.amazon.com/Search-User-Interfaces-Marti-Hearst-e...

I don't even know if anybody has written a book specifically about search at "web scale" (no MongoDB jokes here, please). But about the closest things I know of would be something like:

https://www.amazon.com/Managing-Gigabytes-Compressing-Multim...

https://www.amazon.com/Information-Retrieval-Implementing-Ev...

https://www.amazon.com/Introduction-Information-Retrieval-Ch...

^ Great answer. So far, this is the only correct one in the thread.

I took Information Retrieval 101 in grad school and it was an interesting course. If you're curious to learn more, term frequency–inverse document frequency (tf–idf) is a good place to start. The underlying idea is surprisingly simple.

https://en.wikipedia.org/wiki/Tf–idf

Likewise with the core of Google's (original) ranking algorithm, PageRank, which is inspired by ideas like h-index.

https://en.wikipedia.org/wiki/PageRank

Also, the "standard" book which we used is quite readable: Introduction to Information Retrieval by Manning, et al.

https://www.amazon.com/Introduction-Information-Retrieval-Ch...

I heartily recommend "Introduction to Information Retrieval": http://www.amazon.com/Introduction-Information-Retrieval-Chr...

Skim it once to collect vocabulary, then use it as a reference for IR algorithms.

Introduction to Information Retrieval: http://www.amazon.com/dp/0521865719/

Convex Optimization: http://www.amazon.com/dp/0521833787/

Foundations of Statistical Natural Language Processing: http://www.amazon.com/dp/0262133601/

Read Peter Norvig's review for Foundations of ....http://www.amazon.com/review/R3GSYXSKRU8V17/

I haven't read any of these books, yet, highly recommended by some friends.

jules
Here are video lectures by the author of the convex optimization book (Stephen Boyd):

http://see.stanford.edu/see/courseinfo.aspx?coll=2db7ced4-39...

http://see.stanford.edu/see/courseinfo.aspx?coll=523bbab2-dc...

And video lectures by the author of the NLP book (Christopher D. Manning):

http://see.stanford.edu/see/courseinfo.aspx?coll=63480b48-88...

HN Books is an independent project and is not operated by Y Combinator or Amazon.com.
~ yaj@
;laksdfhjdhksalkfj more things
yahnd.com ~ Privacy Policy ~
Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.