http://drupal.org/node/2092
There has been a problem with search.module for some time.
I investigated what the problem is.
I think the cause is in search.module itself. Please see the following code.
// $Id: search.module,v 1.87 2004/09/21 18:35:30 dries Exp $
// Remove punctuation/special characters.
//$wordlist = preg_replace("'(\xBB|\xAB|!|\xA1|%|,|:|;|\(|\)|\&|\"|\'|\.|-|\/|\?|\\\)'", '', $wordlist);
// Strip out (now mangled) http and tags.
$wordlist = preg_replace("'http\w+'", '', $wordlist);
$wordlist = preg_replace("'www\w+'", '', $wordlist);
// Remove all newlines of any type.
$wordlist = preg_replace("'([\r\n]|[\r]|[\n])'", ' ', $wordlist);
// Lower case the whole thing.
$wordlist = strtolower($wordlist);
// Remove "noise words".
$noise = explode(',', variable_get('noisewords', ''));
foreach ($noise as $word) {
  $word = trim($word);
  $wordlist = trim(preg_replace("' $word '", ' ', ' ' .$wordlist. ' '));
}
// Remove punctuation/special characters.
In this step, special characters are deleted together with the punctuation.
I think this is meant for putting English words into the index. The punctuation itself does not seem to be the problem, because it does not occur inside multi-byte characters.
However, if the special characters (\xBB, \xAB, \xA1) are deleted, multi-byte characters break, because those byte values also occur inside multi-byte characters.
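
The effect can be reproduced with a minimal sketch, assuming the indexed text is UTF-8. The pattern is reduced from the search.module line above to the three special characters. The helper is_valid_utf8() and the code-point pattern with the /u modifier are my own assumptions for illustration, not part of search.module and not a tested patch.

```php
<?php
// "に" (U+306B) is the three bytes E3 81 AB in UTF-8; its last byte is \xAB.
$word = "\xE3\x81\xAB";

// Byte-oriented pattern, reduced from the search.module line quoted above.
$stripped = preg_replace("'(\xBB|\xAB|\xA1)'", '', $word);

// Hypothetical helper: an empty pattern with the /u modifier
// matches only when the subject is valid UTF-8.
function is_valid_utf8($text) {
  return preg_match('//u', $text) === 1;
}

// Assumed alternative: match whole code points (U+00BB, U+00AB, U+00A1)
// instead of single bytes, so multi-byte characters stay intact.
$unicode_pattern = '/[\x{BB}\x{AB}\x{A1}]/u';
$kept = preg_replace($unicode_pattern, '', $word);
$unquoted = preg_replace($unicode_pattern, '', "\xC2\xABdrupal\xC2\xBB");

echo bin2hex($stripped), "\n";                             // e381 (trailing byte lost)
echo is_valid_utf8($stripped) ? "valid" : "broken", "\n";  // broken
echo bin2hex($kept), "\n";                                 // e381ab (unchanged)
echo $unquoted, "\n";                                      // drupal
```

With the byte-oriented pattern the character loses its last byte and is no longer valid UTF-8, which would explain the broken index entries. The code-point pattern still strips «, » and ¡ but leaves other multi-byte characters alone.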