Eldy赶在新年前把AWStats 6.9发布了: 主要的蜘蛛定义和搜索引擎定义修改以及本站的配置样例我已经打包在这里;AWStats虽然是perl写的,但是基本上要用起来不需要对perl熟悉,主要是配置的修改,并且可以适用于于大部分网站的流量,结合Google Analytic统计作为网站状况的轻量级基础统计解决方案;
相关的更新也已经提交: 欢迎各位补充,争取在下一个版本中发布
AWStats - Patches - 4 items
1569229 Simplified Chinese language file update
1569201 top Chinese browser and robot update
1569151 TOP Chinese local search engines update
2499455 robots.txt: clfmerged log files maybe not start with /
AWStats - Feature Requests - 2 items
2498163 configurable $LIMITFLUSH and increase default value to 50000
706297 IIS timezone:change the timeline instead of change time
完整的diff附后: 包含了awstats.pl本身的2个小修改;
1 针对大量URL:增大$LIMITFLUSH减少临时文件I/O;
2 针对泛域名型应用的修改:使用clfmerge -b合并后的日志无法匹配"GET /robots.txt" (因为被clfmerge拼上域名,变成了 "GET http://foo.example.com/robots.txt");
cvs diff: Diffing . Index: awstats.pl =================================================================== RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/awstats.pl,v retrieving revision 1.926 diff -r1.926 awstats.pl 46c46 < 5000; # Nb of records in data arrays after how we need to flush data on disk --- > 50000; # Nb of records in data arrays after how we need to flush data on disk 10385c10385 < my $regrobot = qr/^\/robots\.txt$/i; --- > my $regrobot = qr/\/robots\.txt$/i; cvs diff: Diffing lang cvs diff: Diffing lang/tooltips_f cvs diff: Diffing lang/tooltips_m cvs diff: Diffing lang/tooltips_w cvs diff: Diffing lib Index: lib/robots.pm =================================================================== RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/lib/robots.pm,v retrieving revision 1.53 diff -r1.53 robots.pm 356a357 > 'yahoo!\sslurp\schina', 476a478 > 'soso', 677a680,688 > 'blogbus', > 'QBikSpider', > 'Zwl.Rss.Spider', > 'BBReaderFeedSpider', > 'yeti', > 'doubanbot', > 'moreoverbot', > 'xianguo', > 'blogvibebot', 998a1010 > 'yahoo!\sslurp\schina','<a href="http://misc.yahoo.com.cn/help.html" title="Bot home page [new window]" target="_blank">Yahoo! Slurp China</a>', 1115a1128 > 'soso','<a href="http://help.soso.com/webspider.htm" target="_blank">Soso Spider</a>', 1319a1333,1341 > 'blogbus','<a href="http://www.blogbus.com" title="BlogBus Home [ new window ]" target="_balnk">BlogBus</a>', > 'QBikSpider','QBikSpider', > 'Zwl.Rss.Spider','Zwl.Rss.Spider', > 'BBReaderFeedSpider','BBReaderFeedSpider', > 'yeti','<a href="http://help.naver.com/robots/" target="_blank">Yeti</a>', > 'doubanbot','<a href="http://www.douban.com" target="_blank">doubanbot</a>', > 'moreoverbot','<a href="http://www.moreover.com" target="_blank">Moreoverbot</a>', > 'xianguo','<a href="http://www.xianguo.com/" target="_blank">XianGuo</a>', > 'blogvibebot','BlogVibeBot', Index: lib/search_engines.pm =================================================================== RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/lib/search_engines.pm,v retrieving revision 1.46 diff -r1.46 search_engines.pm 288a289 > 'image\.baidu\.com', # baidu image search 296a298 > '\.youdao\.com', # powered by NetEase 503a506 > 'image\.baidu\.com','baidu_image', 508a512 > '\.youdao\.com','yodao', 768a773 > 'baidu_image','(wd|word)=', 774c779,780 < 'soso','q=', --- > 'yodao','q=', > 'soso','w=', 996a1003 > 'baidu_image','<a href="http://image.baidu.com/" target="_blank">Baidu Image</a>', 1001a1009 > 'yodao', '<a href="http://www.youdao.com/" target="_blank">YoDao</a>',作者:车东 发表于:2009-01-12 12:01 最后更新于:2009-02-11 18:02
版权声明:可以转载,转载时请务必以超链接形式标明文章 AWStats 6.9发布: 补充中文搜索引擎定义和配置样例下载 的原始出处和作者信息及本版权声明。
http://www.chedong.com/blog/archives/001460.html
Comments
大大你好,在awstats的
參考資訊:
來源網址
... 由那些搜尋引擎轉介
會看到 eg
1)Yahoo
2)Google,
這是否包括從
com.cn or .com or .hk的yahoo,
還是可分出由什麼地區的Yahoo 轉介?
謝謝
由: Yuki 发表于 2009年03月11日 下午12时47分
WIN平台下用GNUWIN32打PATCH打不上,怎么办呢?
youdao的蜘蛛手工怎么加呢?
谢谢
Index: search_engines.pm
===================================================================
RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/lib/search_engines.pm,v
retrieving revision 1.46
diff -r1.46 search_engines.pm
288a289
> 'image\.baidu\.com', # baidu image search
296a298
> '\.youdao\.com', # powered by NetEase
503a506
> 'image\.baidu\.com','baidu_image',
508a512
> '\.youdao\.com','yodao',
768a773
> 'baidu_image','(wd|word)=',
774c779,780
'yodao','q=',
> 'soso','w=',
996a1003
> 'baidu_image','Baidu Image',
1001a1009
> 'yodao', 'YoDao',
由: 你好 发表于 2009年08月04日 下午06时00分