[Zope-dev] Re: Spitter.c Hack

Jason Spisak 444@hiretechs.com
Mon, 08 Jan 2001 15:54:38 GMT


Tres:

Okay, I uploaded it to my member folder.

http://www.zope.org/Members/jspisak/Splitter/

I wasn't usinga sandbox for this, I just downloaded the source for 2.2.4

Here's the diff -u though:

--- Zope-2.2.4-src/lib/python/SearchIndex/Splitter.c    Thu Jan  4 10:41:15
2001
+++ Zope-2.2.4-src/lib/python/SearchIndex/Splitter_Old.c        Fri Jan  5
17:29:43 2001
@@ -169,24 +169,8 @@
     len = PyString_Size(word) - 1;
 
     len = PyString_Size(word);
-    /*if(len < 2)       Single-letter words are stop words!
-    {
-      Py_INCREF(Py_None);
-      return Py_None;
-    }     */
-
-    /*************************************************************
-      Test whether a word has any letters.                       */
 
     for (; --len >= 0 && ! isalpha((unsigned char)cword[len]); );
-    /*if (len < 0)
-    {
-        Py_INCREF(Py_None);
-        return Py_None;
-    }
-
-     * If no letters, treat it as a stop word.
-     *************************************************************/
 
     Py_INCREF(word);
 


Let me know what else I can do.  Did you see my other mails regarding
stats?

> > From: "Jason Spisak" <444@hiretechs.com> wrote:
> > 
> > Zopists,
> > 
> > I finally got Splitter.c to let me index numbers and 'C++' in a TextIndex. 
> > I have about 50,000 objects in that index, and search performance is nearly
> > instantaneous still.  I am running on a big machine though.  If anyone
> > wants those changes there's really easy.  Just mail me directly, since it's
> > a long file to post.
> 
> Can you post a patch, or upload it to your Zope.org member folder
> and post the link?
> 
>   cvs -q diff -u lib/python/SearchIndex/Splitter.c
> 
> would do it, if you were working in a CVS sandbox for Zope.
> 
> Tres.
> -- 
> ===============================================================
> Tres Seaver                                tseaver@digicool.com
> Digital Creations     "Zope Dealers"       http://www.zope.org

All my best,


Jason Spisak
CIO
    __ ___       ______        __
   / // (_)_____/_  __/__ ____/ /  ___  _______  __ _
  / _  / / __/ -_) / / -_) __/ _ \(_-<_/ __/ _ \/  ' \
 /_//_/_/_/  \__/_/  \__/\__/_//_/___(_)__/\___/_/_/_/

6151 West Century Boulevard
Suite 900
Los Angeles, CA 90045
P. 310.665.3444
F. 310.665.3544

Under US Code Title 47, Sec.227(b)(1)(C), Sec.227(a)(2)(B) This email
address may not be added to any commercial mail list with out my
permission.  Violation of my privacy with advertising or SPAM will
result in a suit for a MINIMUM of $500 damages/incident, $1500 for
repeats.