arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL Server 2008/2012/2014/2016/CE An Open Source C# web crawler with Lucene.NET search using MongoDB/RavenDB/Hadoop

Completely Open Source @ GitHub


Error: The value of the date string in the header is invalid

bigdavelamb posted on Sun, May 6 2012 7:56 AM

Hi,

I am evaluating the product, and the first domain I have attempted to crawl is generating an error. Here is the line from the Exceptions table:

2012-05-06 15:46:13.553 1 http://www.sbobet.com/ http://www.sbobet.com/ NULL The value of the date string in the header is invalid. System
   at System.Net.HttpProtocolUtils.string2date(String S)
   at System.Net.HttpWebResponse.get_LastModified()
   at (Object )
   at #i.#k.ProcessCrawlRequest(CrawlRequest crawlRequest, Boolean obeyCrawlRules, Boolean executeCrawlActions)
   at Arachnode.SiteCrawler.Components.Crawl.ProcessCrawlRequest(CrawlRequest crawlRequest, Boolean obeyCrawlRules, Boolean executeCrawlActions)

Am I doing something wrong? Thanks for any help.

David.

Verified Answer

I'll take a look... BBIAB.

...This is a bug in the .NET Framework. I have a workaround.

Sadly, it's not easy or quick to update the demo.

If you'd like to purchase, I would be happy to extend the demo period to 90 days.

Thanks,
Mike

[EDIT: Certain web servers send back nonsense date values in the Last-Modified header, which is why this exception pops up. It has been fixed in the release code.]
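
For anyone hitting the same exception in their own crawling code, here is a minimal sketch of one possible guard. It is not necessarily the fix used in the release code. The class and method names (SafeHeaders, ParseHttpDate, GetLastModified) are hypothetical and not part of arachnode.net. The sketch assumes that a missing or unparseable Last-Modified value can simply be treated as "unknown" (null).

```csharp
using System;
using System.Globalization;
using System.Net;

// Hypothetical helper for illustration; not part of arachnode.net.
public static class SafeHeaders
{
    // Parses a raw HTTP date header value without throwing.
    // Returns null when the value is missing or is not a valid date.
    public static DateTime? ParseHttpDate(string rawValue)
    {
        if (string.IsNullOrWhiteSpace(rawValue))
        {
            return null;
        }

        DateTime parsed;
        bool ok = DateTime.TryParse(
            rawValue,
            CultureInfo.InvariantCulture,
            DateTimeStyles.AdjustToUniversal | DateTimeStyles.AssumeUniversal,
            out parsed);

        return ok ? parsed : (DateTime?)null;
    }

    // Reads Last-Modified from a response. The LastModified getter throws
    // ProtocolViolationException on a malformed header, so catch it and
    // fall back to the lenient parser above (assumption: null means "unknown").
    public static DateTime? GetLastModified(HttpWebResponse response)
    {
        try
        {
            return response.LastModified.ToUniversalTime();
        }
        catch (ProtocolViolationException)
        {
            return ParseHttpDate(response.Headers["Last-Modified"]);
        }
    }
}
```

Calling SafeHeaders.GetLastModified(response) instead of reading response.LastModified directly lets the crawl continue when a server sends a bad date, at the cost of losing the last-modified information for that page.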

For best service when you require assistance:

  1. Check the DisallowedAbsoluteUris and Exceptions tables first.
  2. Cut and paste actual exceptions from the Exceptions table.
  3. Include screenshots.

Skype: arachnodedotnet


copyright 2004-2017, arachnode.net LLC