arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL Server 2008/2012/2014/2016/CE
An Open Source C# web crawler with Lucene.NET search using MongoDB/RavenDB/Hadoop

Completely Open Source @ GitHub

Extreme gratitude for NClassifier...

If the terms 'PorterStemmer', 'BayesianClassifier' and 'C#' get you out of bed early in the morning, then you owe it to yourself and the world to check out NClassifier.

I haven't been this excited about a body of C# work since I don't know when.

My good friend found one bug in the PorterStemmer.cs class.  The function below takes a parameter, 'i', but was using a private member variable, '_i', instead.  An easy fix, and the PorterStemmer is working like a champ!

This is the correct function:

        private bool Cons(int i)
        {
            switch (_wordToStem[i])
            {
                case 'a':
                case 'e':
                case 'i':
                case 'o':
                case 'u':
                    return false;
                case 'y':
                    return (i == 0) ? true : !Cons(i - 1);
                default:
                    return true;
            }
        }
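
To see why the parameter matters, here is a minimal sketch that puts the fixed function next to a guess at the broken one. The class name ConsDemo, the constructor, and ConsBuggy are hypothetical and exist only for illustration. The post does not show the original buggy line, so the sketch assumes the field '_i' was used to index the word inside the switch.

```csharp
using System;

// Hypothetical demo class; only Cons mirrors the function shown in the post.
public class ConsDemo
{
    private readonly char[] _wordToStem;

    // Stands in for the stemmer's private cursor (an assumption for this sketch).
    private readonly int _i;

    public ConsDemo(string word, int cursor)
    {
        _wordToStem = word.ToCharArray();
        _i = cursor;
    }

    // Fixed version: every lookup uses the parameter 'i'.
    public bool Cons(int i)
    {
        switch (_wordToStem[i])
        {
            case 'a':
            case 'e':
            case 'i':
            case 'o':
            case 'u':
                return false;
            case 'y':
                // 'y' is a consonant at the start of the word,
                // or when the letter before it is not a consonant.
                return (i == 0) ? true : !Cons(i - 1);
            default:
                return true;
        }
    }

    // Assumed shape of the bug: the switch reads the field '_i',
    // so the answer depends on the cursor, not on the index asked about.
    public bool ConsBuggy(int i)
    {
        switch (_wordToStem[_i])
        {
            case 'a':
            case 'e':
            case 'i':
            case 'o':
            case 'u':
                return false;
            case 'y':
                return (i == 0) ? true : !ConsBuggy(i - 1);
            default:
                return true;
        }
    }
}
```

With new ConsDemo("toy", 1), Cons(0) returns true because 't' is a consonant, while ConsBuggy(0) returns false because it looks at the 'o' under the cursor. Cons(2) returns true: the 'y' in "toy" follows a vowel, so it counts as a consonant.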


Posted Thu, Dec 25 2008 10:28 AM by arachnode.net

copyright 2004-2017, arachnode.net LLC