skip to main | skip to sidebar

Building A Search Engine

This blog is about the experience of trying to put the pieces together to get a search engine that scales and actually can yield correct query results on a reasonable number of web pages

2008/02/18

Close to three million

Despite being quite busy with other projects lately, the last index update now has about 3 million pages, and response time is ok when doing a query (about half a second), but far from great. Multiple words search is now available.

A lot of work is still ahead.
Posted by buildingasearchengine at 12:46 AM 0 comments
06.2008 10.2007 Home
Subscribe to: Posts (Atom)

Blog Archive

  • ►  2009 (4)
    • ►  October (1)
      • Design patterns
    • ►  August (1)
      • Ubuntu and "large files" (files greater than 2 G)
    • ►  January (2)
      • Error
      • Security concepts and an open source search engine...
  • ▼  2008 (6)
    • ►  November (3)
      • Encoding
      • Thank you !
      • OsO @ Ignite Paris #3
    • ►  July (1)
      • By the Book (of law)
    • ►  June (1)
      • Over five million pages
    • ▼  February (1)
      • Close to three million
  • ►  2007 (12)
    • ►  October (1)
      • Update
    • ►  September (4)
      • Optimizing: things that matter
      • Optimizing: bzero/memset, loops and beyond
      • Reading
      • Being out of url
    • ►  August (7)
      • building a spider in python
      • Even fortune said so
      • Garbage collectors and memory
      • Memory leaks
      • Choosing a library
      • Code coverage
      • Prototype 1

About Me

Denis Chatelain
View my complete profile