June 16, 2008 – 4:25 pm
To understand more about how a search engine performs, it helps to know something about its architecture and the mechanisms behind it. The main components are described below.
Spider
A spider is a program that downloads the web pages it finds, much like a browser does. The difference is that a browser displays the whole page (text, pictures, etc.) directly to a human who needs it at that moment, while a spider displays nothing at all. Because it serves a machine rather than a human, a spider runs automatically. Its job is to fetch each visited web page so that it can be stored in the search engine's database.
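As a rough illustration of what a spider does, here is a minimal sketch in Python using only the standard library. The function name fetch_page, the User-Agent string, and the timeout value are assumptions made for this example, not part of any real search engine.

```python
import urllib.request


def fetch_page(url, timeout=10.0):
    """Download a page and return its text. Nothing is rendered for a human."""
    # The User-Agent below is a made-up name for this example.
    request = urllib.request.Request(
        url, headers={"User-Agent": "example-spider/0.1"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # Use the charset announced by the server, or assume UTF-8.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def store_page(database, url):
    """Fetch a page and keep it in a dict that stands in for the database."""
    database[url] = fetch_page(url)
    return database
```

A real spider would also respect robots.txt, limit its request rate, and handle network errors. Those parts are left out here to keep the sketch short.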
Crawler
A crawler is a program owned by the search engine that tracks and finds the links on each page that has been found. Its duty is to tell the spider where to go next and to evaluate links, starting from a base address chosen in advance. The crawler follows links and tries to find documents that the search engine does not know about yet.
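The crawler's loop can be sketched roughly as follows. This is only an assumption about how such a loop might look: the names LinkExtractor and crawl are invented for this example, and fetch is any function that returns the HTML of a URL or None when the page is unavailable.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkExtractor(HTMLParser):
    """Collect the absolute URL of every <a href="..."> on one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links and drop the "#fragment" part.
                absolute, _fragment = urldefrag(urljoin(self.base_url, value))
                self.links.append(absolute)


def crawl(seed_url, fetch, max_pages=100):
    """Visit pages breadth-first from seed_url; return the visited URLs."""
    frontier = deque([seed_url])  # where the spider must go next
    seen = {seed_url}             # addresses already known
    visited = []
    while frontier and len(visited) < max_pages:
        url = frontier.popleft()
        page = fetch(url)
        if page is None:
            continue  # page could not be downloaded
        visited.append(url)
        extractor = LinkExtractor(url)
        extractor.feed(page)
        for link in extractor.links:
            if link not in seen:
                seen.add(link)
                frontier.append(link)
    return visited
```

For a quick try without a network, fetch can simply be the get method of a dict that maps URLs to HTML strings.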
Indexer
This component parses each page and examines its various elements, such as text, headers, structure, stylistic features of the writing, HTML tags, and so on.
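To make the idea more concrete, here is a small sketch of how an indexer might pull a few of these elements out of a page. The class PageAnalyzer, the function analyze_page, and the choice of elements (title, headers, words, tag counts) are assumptions for this example. A real indexer looks at far more than this.

```python
import re
from collections import Counter
from html.parser import HTMLParser

HEADER_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class PageAnalyzer(HTMLParser):
    """Record the title, headers, words, and tag usage of one page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.headers = []
        self.words = []
        self.tag_counts = Counter()
        self._context = None  # "title", a header tag, or None

    def handle_starttag(self, tag, attrs):
        self.tag_counts[tag] += 1
        if tag == "title" or tag in HEADER_TAGS:
            self._context = tag

    def handle_endtag(self, tag):
        if tag == self._context:
            self._context = None

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        if self._context == "title":
            self.title += text
        elif self._context in HEADER_TAGS:
            self.headers.append(text)
        # Every piece of visible text also contributes lowercase words.
        self.words.extend(re.findall(r"[a-z0-9]+", text.lower()))


def analyze_page(html_text):
    """Return a plain dict describing the page."""
    analyzer = PageAnalyzer()
    analyzer.feed(html_text)
    return {
        "title": analyzer.title,
        "headers": analyzer.headers,
        "words": analyzer.words,
        "tag_counts": dict(analyzer.tag_counts),
    }
```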
Database
The database is the standard place to store data from the pages that have been visited, downloaded, and analyzed. It is sometimes called the search engine's index.
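One common way to organize such an index is an inverted index, which maps each word to the pages that contain it. The sketch below is an in-memory toy under that assumption. The class SearchIndex and its methods are invented for this example, and a real search engine stores its index on disk in a much more compact form.

```python
import re


class SearchIndex:
    """Toy in-memory store: raw pages plus a word -> {url: count} map."""

    def __init__(self):
        self.pages = {}     # url -> page text
        self.postings = {}  # word -> {url: number of occurrences}

    def remove_page(self, url):
        """Forget a page and every posting that points at it."""
        self.pages.pop(url, None)
        for word in list(self.postings):
            self.postings[word].pop(url, None)
            if not self.postings[word]:
                del self.postings[word]

    def add_page(self, url, text):
        """Store a page; adding the same URL again replaces the old copy."""
        if url in self.pages:
            self.remove_page(url)
        self.pages[url] = text
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            counts = self.postings.setdefault(word, {})
            counts[url] = counts.get(url, 0) + 1

    def lookup(self, word):
        """Return {url: count} for a word, or an empty dict if unknown."""
        return dict(self.postings.get(word.lower(), {}))
```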
Result engine
This engine groups pages and determines their ranking in the search results. It decides which pages best match the user's request and how the results will be displayed. This process follows the search engine's ranking algorithm. The exact ranking method each search engine uses is its own business and is kept private, so researchers instead try to identify the general characteristics of the system, mainly in order to get better search results.
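Since real ranking algorithms are not public, the following is only a stand-in: a simple TF-IDF style score, which is a textbook technique and not the method of any particular search engine. The function names tokenize and rank and the smoothing in the idf formula are choices made for this sketch.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def rank(query, pages, top_k=10):
    """Score pages (url -> text) against a query; best matches first."""
    doc_terms = {url: Counter(tokenize(text)) for url, text in pages.items()}
    n_docs = len(doc_terms)
    scores = {}
    for term in set(tokenize(query)):
        # Number of pages that contain this term.
        df = sum(1 for counts in doc_terms.values() if term in counts)
        if df == 0:
            continue
        # Rare terms weigh more than common ones (smoothed idf).
        idf = math.log((1 + n_docs) / (1 + df)) + 1.0
        for url, counts in doc_terms.items():
            if term in counts:
                tf = counts[term] / sum(counts.values())
                scores[url] = scores.get(url, 0.0) + tf * idf
    # Highest score first; ties are broken by URL for a stable order.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_k]
```

The function returns a list of (url, score) pairs. How those results are then displayed is a separate step.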
Web server
The web server is the computer that serves requests and returns the requested responses. It usually delivers information and documents in HTML format. The page it serves provides a field where the user can type in keywords. The web server is also responsible for delivering the search results to the computer that asked for the information.
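A very small version of such a server can be sketched with Python's standard http.server module. Everything specific here is an assumption for the example: the names render_search_page, find_results, SearchHandler, and serve, the q parameter, and the hard-coded result list that stands in for a real result engine.

```python
import html
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import parse_qs, urlparse


def render_search_page(query, results):
    """Build the HTML page: a keyword form followed by the result list."""
    items = "".join(
        '<li><a href="{0}">{0}</a></li>'.format(html.escape(url, quote=True))
        for url in results
    )
    return (
        "<!DOCTYPE html><html><body>"
        '<form method="get" action="/">'
        '<input type="text" name="q" value="{query}">'
        '<input type="submit" value="Search">'
        "</form><ol>{items}</ol></body></html>"
    ).format(query=html.escape(query, quote=True), items=items)


def find_results(query):
    """Hypothetical stand-in for the result engine."""
    known = {"spider": ["http://example.test/spider"]}
    return known.get(query.strip().lower(), [])


class SearchHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Read the keyword from the query string, e.g. /?q=spider
        params = parse_qs(urlparse(self.path).query)
        query = params.get("q", [""])[0]
        body = render_search_page(query, find_results(query)).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(host="127.0.0.1", port=8000):
    """Run the server until interrupted."""
    HTTPServer((host, port), SearchHandler).serve_forever()
```

Calling serve() and opening http://127.0.0.1:8000/?q=spider in a browser should show the form with one result under it. This is fine for experiments, but http.server is not meant for production use.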