Index: modules/search/search.module
===================================================================
RCS file: /cvs/drupal/drupal/modules/search/search.module,v
retrieving revision 1.335
diff --unified -r1.335 search.module
--- modules/search/search.module 10 Jan 2010 21:46:16 -0000 1.335
+++ modules/search/search.module 12 Jan 2010 02:54:16 -0000
@@ -13,16 +13,16 @@
  *
  * Characters with the following General_category (gc) property values are
  * excluded from the search index. Also, they are used as word boundaries.
- * While this does not fully conform to the Word Boundaries algorithm
- * described in http://unicode.org/reports/tr29, as PCRE does not contain the
- * Word_Break property table, this simpler algorithm has to do.
+ * While this does not fully conform to the Word Boundaries algorithm described
+ * in http://unicode.org/reports/tr29, as PCRE does not contain the Word_Break
+ * property table, this simpler algorithm has to do.
  * - Cc, Cf, Cn, Co, Cs: Other.
  * - Pc, Pd, Pe, Pf, Pi, Po, Ps: Punctuation.
  * - Sc, Sk, Sm, So: Symbols.
  * - Zl, Zp, Zs: Separators.
  *
  * Consequently, the index only contains characters with the following
- * General_category (gc) property values:
+ * General_Category (gc) property values:
  * - Ll, Lm, Lo, Lt, Lu: Letters.
  * - Mc, Me, Mn: Combining Marks.
  * - Nd, Nl, No: Numbers.
@@ -406,7 +406,7 @@
   $text = preg_replace('/([' . PREG_CLASS_NUMBERS . ']+)[' . PREG_CLASS_PUNCTUATION . ']+(?=[' . PREG_CLASS_NUMBERS . '])/u', '\1', $text);
 
   // The dot, underscore and dash are simply removed. This allows meaningful
-  // search behavior with acronyms and URLs. No need to use the unicode modifer
+  // search behavior with acronyms and URLs. No need to use the Unicode modifier
   // here because 0-127 ASCII characters can't match higher UTF-8 characters as
   // the leftmost bit of those are 1.
   $text = preg_replace('/[._-]+/', '', $text);
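
The two replacements in the second hunk can be tried in isolation with the sketch below. It is not code from search.module: the `example_*` function names are hypothetical, and `\p{N}` and `\p{P}` are assumed stand-ins for PREG_CLASS_NUMBERS and PREG_CLASS_PUNCTUATION, whose definitions are not part of this patch. It shows that the byte-oriented `/[._-]+/` pattern leaves multi-byte UTF-8 sequences intact, since every byte of such a sequence has its leftmost bit set, and that digit groups separated only by punctuation collapse into one token.

```php
<?php
// Hypothetical helpers for illustration; none of these names exist in
// search.module.

// Byte-oriented removal of dot, underscore and dash. No /u modifier is used,
// mirroring the pattern shown in the patch context.
function example_strip_separators($text) {
  return preg_replace('/[._-]+/', '', $text);
}

// Assumption: \p{N} and \p{P} stand in for PREG_CLASS_NUMBERS and
// PREG_CLASS_PUNCTUATION, which are defined elsewhere in the module.
function example_join_numbers($text) {
  return preg_replace('/(\p{N}+)\p{P}+(?=\p{N})/u', '\1', $text);
}

// Returns TRUE if any byte of the given string is in the ASCII range 0-127.
function example_has_ascii_byte($char) {
  foreach (str_split($char) as $byte) {
    if (ord($byte) < 0x80) {
      return TRUE;
    }
  }
  return FALSE;
}

// Sample text with an accented letter and two CJK characters, written with
// explicit UTF-8 byte escapes so the file encoding does not matter.
$input = "U.S.A. caf\xC3\xA9" . "_au-lait \xE6\x97\xA5\xE6\x9C\xAC";
$stripped = example_strip_separators($input);

// preg_match() with the /u modifier returns 1 only for valid UTF-8 subjects,
// so this checks that no multi-byte sequence was damaged.
$still_valid = preg_match('//u', $stripped) === 1;

// Both spellings of the same date reduce to the same token.
$date_slash = example_join_numbers('20/03/1984');
$date_dash = example_join_numbers('20-03-1984');

echo $stripped, "\n", $date_slash, "\n", $date_dash, "\n";
```

Because neither the accented letter nor the CJK characters contain a byte below 0x80, the ASCII-only character class cannot match inside them, which is why the patch comment says the Unicode modifier is unnecessary for that replacement.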