Skip to the content
- Search Engines and Infromation Retrieval
- Architecutre of a Search Engine
- Crawls and Feeds
- Processing Text
- Ranking with Indexes
- Queries and Interfaces
- Retrieval Models
- Search Engines and Infromation Retrieval
- What is Infroamtion Rereival?
- The Big Issues
- Search Engines
- Search Engineers
- Architecutre of a Search Engine
- what is an architecture?
- basic building blocks
- breaking it down
- text acquisition
- text transformation
- index creation
- user interaction
- ranking
- evaluation
- how does is really work?
- Crawls and Feeds
- deciding what to search
- crawling the web
- retrieving web pages
- the web crawler
- freshness
- focused crawling
- deep web
- sitemaps
- distributed crawling
- crawling documents and email
- document feeds
- the conversion problem
- character encodings
- scoring the documents
- using a database system
- random access
- compression and large files
- update
- bigtable
- detecting duplicates
- removing noise
- Processing Text
- from words to terms
- text statistics
- document parsing
- document structure and markup
- link analysis
- information extraction
- internationalization
- Ranking with Indexes
- overview
- abstract model of ranking
- inverted indexes
- compression
- auxilary structures
- index construction
- query processing
- Queries and Interfaces
- infromation needs and queries
- query transformation and refinement
- showing the results
- result pages and snippets
- advertising and serach
- clustering the results
- cross-language search
- Retrieval Models
- overview of retreival models
- boolean retrieval
- the vector space model
- probabilistic models
- information retreival as classification
- the bm25 ranking algorithm
- ranking based on language models
- query likelihood ranking
- relevance models and pseudo-relevance feedback
- complex queries and combining evidence
- web search
- machine learning and information retreival
- application-based models
Comment is the energy for a writer, thanks!