Supplementary AWStats definitions: distinguishing Baidu image search and the newly appeared 360 search engine spider


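The updated lib directory for the latest AWStats version is available as a packaged download (latest: 2012-08-23). The robot definitions now distinguish the Yahoo! China, Soso, Douban, Xianguo, and 360 spiders, among others; the remaining additions are a few foreign RSS readers. The search engine definitions now distinguish Baidu image search, Youdao search, Soso search, and 360 search. The diff follows: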

cvs diff: Diffing .
Index: robots.pm
===================================================================
RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/lib/robots.pm,v
retrieving revision 1.64
diff -r1.64 robots.pm
359a360,364
> '360spider',
> 'sosospider',
> 'youdaobot',
> 'doubanbot',
> 'friendfeedbot',
850c855
< 'qihoobot',
---
> 'jikespider',
1072d1076
< 'youdao',
1129a1134,1138
> 'sosospider','Soso Spider',
> 'doubanbot','DoubanBot',
> 'youdaobot','YoudaoBot',
> '360spider','360Spider',
> 'friendfeedbot','FriendFeedBot',
1616c1625
< 'qihoobot','QihooBot',
---
> 'jikespider','JikeSpider',
1855d1863
< 'youdao', 'youdao', 
Index: search_engines.pm
===================================================================
RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/lib/search_engines.pm,v
retrieving revision 1.53
diff -r1.53 search_engines.pm
175d174
< 'live\.com',
290,298c289,295
< '\.baidu\.com',     # baidu search portal
< '\.vnet\.cn',       # powered by MSN
< '\.soso\.com',      # powered by Google
< '\.sogou\.com',     # powered by Sohu
< '\.3721\.com',      # powered by Yahoo!
< 'iask\.com',        # powered by Sina
< '\.accoona\.com',   # Accoona
< '\.163\.com',       # powered by Google
< '\.zhongsou\.com',  # zhongsou search portal
---
> 'baidu\.',     # baidu search portal
> '118114\.cn',       # powered by Bing
> 'vnet\.cn',       # powered by Bing
> 'soso\.com',      # powered by TenCent
> 'sogou\.com',     # powered by Sohu
> 'youdao\.com',       # powered by NetEase
> '360\.cn',         # powered by QIHU
368c365,368
< 'yandex\.'=>'direct\.yandex\.'
---
> 'yandex\.'=>'direct\.yandex\.',
> 'baidu\.'=>'hi\.baidu\.',
> 'baidu\.'=>'zhidao\.baidu\.',
> 'baidu\.'=>'tieba\.baidu\.'
394d393
< 'live\.com','live',
508,516c507,513
< '\.baidu\.com','baidu',
< 'iask\.com','iask',
< '\.accoona\.com','accoona',
< '\.3721\.com','3721',
< '\.163\.com','netease',
< '\.soso\.com','soso',
< '\.zhongsou\.com','zhongsou',
< '\.vnet\.cn','vnet',
< '\.sogou\.com','sogou',
---
> 'baidu\.','baidu',
> 'youdao\.com','youdao',
> '360\.cn','360',
> 'soso\.com','soso',
> '118114\.cn','vnet',
> 'vnet\.cn','vnet',
> 'sogou\.com','sogou',
672d668
< 'live','q=',
777,782c773,775
< 'iask','(w|k)=',
< 'accoona','qt=',
< '3721','(p|name)=',
< 'netease','q=',
< 'soso','q=',
< 'zhongsou','(word|w)=',
---
> 'youdao','q=',
> '360','q=',
> 'soso','w=',
903d895
< 'live','Microsoft Windows Live',
1007,1015c999,1003
< 'baidu','Baidu',
< 'iask','Iask',
< 'accoona','Accoona',
< '3721','3721',
< 'netease', 'NetEase',
< 'soso','SoSo',
< 'zhongsou','ZhongSou',
< 'sogou', 'SoGou',
< 'vnet','VNet',
---
> 'baidu','Baidu',
> 'youdao', 'YouDao',
> 'soso','SoSo',
> 'sogou', 'SoGou',
> 'vnet','VNet',
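
To make the effect of the search_engines.pm and robots.pm changes easier to see, here is a minimal Python sketch of the lookup they feed. It is an illustration only, not AWStats code: the names `SEARCH_ENGINES`, `NOT_SEARCH`, `KEYWORD_PARAM`, `ROBOTS`, `classify_referrer`, and `classify_user_agent` are hypothetical, and it assumes patterns are tried against the referrer host name, whereas AWStats does its own matching in Perl. The patterns and keyword parameters are copied from the `>` lines of the diff. One caveat: if the exclusion table is a Perl hash, the three entries that repeat the key `'baidu\.'` would leave only the last one in effect, so the sketch folds them into a single alternation.

```python
import re
from urllib.parse import urlsplit, parse_qs

# (host pattern, engine id) pairs copied from the '>' lines of the diff.
SEARCH_ENGINES = [
    (r"baidu\.", "baidu"),
    (r"youdao\.com", "youdao"),
    (r"360\.cn", "360"),
    (r"soso\.com", "soso"),
    (r"118114\.cn", "vnet"),
    (r"vnet\.cn", "vnet"),
    (r"sogou\.com", "sogou"),
]

# Hosts that match an engine pattern but are not search result pages.
# Assumption: the three baidu exclusions are merged into one regex.
NOT_SEARCH = {"baidu": r"(hi|zhidao|tieba)\.baidu\."}

# Query parameter that carries the search phrase (only those shown in the diff).
KEYWORD_PARAM = {"youdao": "q", "360": "q", "soso": "w"}

# Robot ids added by the robots.pm part of the diff, with display names.
ROBOTS = {
    "sosospider": "Soso Spider",
    "doubanbot": "DoubanBot",
    "youdaobot": "YoudaoBot",
    "360spider": "360Spider",
    "friendfeedbot": "FriendFeedBot",
    "jikespider": "JikeSpider",
}


def classify_referrer(url):
    """Return (engine id, search phrase or None), or None if not a search hit."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    for pattern, engine in SEARCH_ENGINES:
        if not re.search(pattern, host):
            continue
        exclude = NOT_SEARCH.get(engine)
        if exclude and re.search(exclude, host):
            return None  # e.g. zhidao.baidu.com is a Q&A site, not a search
        param = KEYWORD_PARAM.get(engine)
        values = parse_qs(parts.query).get(param, []) if param else []
        return engine, (values[0] if values else None)
    return None


def classify_user_agent(user_agent):
    """Return the display name of the first robot id found in the UA string."""
    ua = user_agent.lower()
    for robot_id, name in ROBOTS.items():
        if robot_id in ua:
            return name
    return None
```

For example, `classify_referrer("http://www.soso.com/q?w=awstats")` yields `("soso", "awstats")`, while a referrer on `zhidao.baidu.com` is rejected by the exclusion rule rather than being counted as a Baidu search.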

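Author: Che Dong (车东). Posted: 2008-10-03 13:10. Last updated: 2012-08-23 14:08
Copyright notice: reposting is permitted, provided that you credit the original source of the article, the author, and this copyright notice in the form of a hyperlink.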

Comments

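Thanks, Che Dong. I'm already using it.

After unpacking this, can I just upload it straight into the lib directory?

Have you looked into AWStats's analysis of page download time?
Both IIS and squid can log a page download time field; it would be great if AWStats could analyze it. Very practical.

Strange, the 6.8 I downloaded already includes these. Perhaps it has already been updated.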
