Sponsorised links
This year
Lucid Imagination » Solr’s New Clustering Capabilities
One of the new things in Solr 1.4 that I am particularly excited about is the new document and search results clustering capabilities. This is an optional module that lives in Solr’s contrib/clustering directory and was added via SOLR-769.
Searchable: Annotation-Driven Indexing and Searching with Lucene :: Drive-by Digressions
Searchable is a toolkit for Lucene that harnesses the power of annotations to specify what properties to index and how to treat them.
Phlat (Prototype for Helpful Lookup and Tagging) - Microsoft Research
Phlat is a new interface for Windows Desktop Search, enabling search through a user's own e-mail, files, and viewed Web pages. Phlat makes it easy for users to specify queries and filters, attempting to integrate search and browsing in one intuitive interface. In addition, Phlat supports a unified tagging scheme for organizing personal content across storage systems, such as files and e-mail.
Sig.ma - Live views on the Web of Data - Sindice Blog
Pages exposing RDF, RDFa or Microformats will appear. If you or your company want information to be found on the web of data, it is very simple to mark up your HTML using RDFa, then submit it to Sindice. You will find it returned by Sig.ma within 10-15 minutes.
Read It: Search User Interfaces
Read the Book
To make this book available to as many readers as possible, the author, with permission of Cambridge University Press, has placed the full text online free of charge. See the terms of service on the right.
Search is an integral part of peoples' online lives; people turn to search engines for help with a wide range of needs and desires, from satisfying idle curiousity to finding life-saving health remedies, from learning about medieval art history to finding video game solutions and pop music lyrics. Web search engines are now the second most frequently used online computer application, after email. Not long ago, most software applications did not contain a search module. Today, search is fully integrated into operating systems and is viewed as an essential part of most information systems.
Documentation for the Combine (focused) crawling system
The Combine system is an open, free, and highly configurable system for focused crawling of Internet resources. It aims at providing a robust and efficient tool for creating topic-specific moderate sized databases (up to a few million records). Crawling speed is around 200 URLs per minute and a complete structured record takes up an average of 25 kilobytes disk-space.
SourceForge.net: Local Lucene
Geographical search extension to the java lucene search engine
Sponsorised links
2008
DoodleBuzz:Typographic News Explorer
2007
Christian Fauré » Blog Archive » Le Magic Quadrant 2007 du Search par Gartner
Search For Cash
2006
The Semantic Indexing Project
Krugle
Windows Live Academic Home Page
EEVL : the internet guide to engineering, mathematics and computing
