-
Terms
Hi All
Ive got a few terms I need explaining.
I came across a diagram of an opensource metasearch engine but require some terms explaining.
The terms Are
Stemming , Stopping .Cleaning of Titles and Snippets.
Thus Ive found some information on Stemming I cant find anything on cleaning and stopping of Titles and snippets.
If anyone can provide url or can an explanation that would be great.
thanks
-
Re: Terms
Quote:
Originally posted here by esi1
Hi All
Ive got a few terms I need explaining.
I came across a diagram of an opensource metasearch engine but require some terms explaining.
The terms Are
Stemming , Stopping .Cleaning of Titles and Snippets.
Thus Ive found some information on Stemming I cant find anything on cleaning and stopping of Titles and snippets.
If anyone can provide url or can an explanation that would be great.
thanks
Dude use google.
http://www.google.com/search?hl=en&q...=Google+Search :rolleyes:
-
Dude I tried , As I said Ive found out what stemming was but couldnt find anything on stopping and cleaning.
If I could find it on google I wouldnt be asking.
-
Is this what you are looking for...
Search Engine Result Summarization
* Clustering Search Result (Leouski and Croft, 1996, Zamir and Etzioni, 1997):
Categorizes documents using phrases in titles and snippets
Data Cleaning
Our system parses the log files to produce the sequence of pages that have been downloaded. Unfortunately some of these pages are just advertisements, as many web pages will launch a pop-up ad window when they are loaded. As few of these advertisement pages will contribute to the subject's information needs, leaving them in the training data might confuse the learner. We therefore assembled a list of advertisement domain names, such as: ads.orbitz.com, ads.realcities.com, etc. We compare each URL's domain name with the ad server list and ignore a URL if it is in the list.
Source
or
Info
Lots of reading, but may help you figure out the words meanings and where they apply...
Luck