For a web search engine, the retrieval of data is the combined activity of the crawler, the database and the search algorithm. These three elements work in concert to retrieve web pages that are related to the word or phrase that a user enters into the search engine’s user interface.
Commercial search engines are a key access point to the Web and have the difficult task of finding the most useful of the billions of web pages for each user query.
The really tricky part is ranking the results. Ranking is also what the user will spend the most time and effort trying to affect. Google’s PageRank was an attempt to resolve this dilemma, based on the assumptions that:
*More useful pages will have more links to them
*Links from well-linked-to pages are better indicators of quality
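To make the two assumptions concrete, here is a minimal sketch of a PageRank-style calculation. It is an illustration only, not Google's production method: the function name `pagerank`, the damping factor of 0.85, the fixed iteration count and the tiny example link graph are all assumptions chosen for this example.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Estimate a rank for every page in a small link graph.

    links maps each page to the list of pages it links to.
    All names and parameter values here are illustrative assumptions.
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)

    # Start with every page equally ranked.
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page receives a small baseline share.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its rank on to the pages it links to, so a
                # link from a highly ranked page is worth more.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank evenly.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical link graph: "c" is linked to by three pages, "d" by two.
example_links = {
    "a": ["c"],
    "b": ["c"],
    "c": ["d"],
    "d": ["c"],
    "e": ["d"],
}
example_ranks = pagerank(example_links)
best_page = max(example_ranks, key=example_ranks.get)
```

In this small graph, page "c" ends up with the highest rank because it has the most incoming links, and page "d" ranks well above "a", "b" and "e" because one of its links comes from the highly ranked "c".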
Many queries on real search engines have hundreds, thousands or even millions of hits. Users of search engines generally prefer to look through only a handful of results, perhaps five or ten at the most.
Therefore, a search engine must be capable of picking the best few from a very large number of hits.
A good search engine will not only pick out the best few hits, but also display them in the most useful order. The task of picking out the best few hits in the right order is called ‘ranking’.
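Once every hit has a relevance score, picking the best few in order is a top-k selection. The sketch below assumes the scores have already been computed by some other means. The function name `top_hits`, the page names and the score values are hypothetical.

```python
import heapq


def top_hits(scores, k=10):
    """Return the k highest-scoring (page, score) pairs, best first.

    scores maps each matching page to a relevance score (assumed given).
    """
    # nlargest avoids sorting every hit when only a few are wanted.
    return heapq.nlargest(k, scores.items(), key=lambda item: item[1])


# Hypothetical scores for five matching pages.
example_scores = {
    "page1": 0.12,
    "page2": 0.87,
    "page3": 0.45,
    "page4": 0.91,
    "page5": 0.30,
}
best_three = top_hits(example_scores, k=3)
```

Here `best_three` holds "page4", "page2" and "page3" in that order. With millions of hits, selecting only the top few this way is cheaper than fully sorting the whole result set.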