2006-9-30
2006-9-29
2006-9-28
2006-9-26
2006-9-24
很久没有使用bloglines了,最近Google新发布了RSS Reader。我才想起BlogLines,去上面转了一圈,Google学BlogLines真实学得很到家啊。但是Google在大规模数据得存取速度上,明显高出BlogLines一筹。BlogLines后台应该还是延续数据库思维:一定要告诉你,你订阅的FEED上有873条新内容,而Google就告诉你(100+):因为能翻10页以后的用户不超过2%。
再仔细看看:BlogLines的新功能,看来,社群化是BlogLines努力的方向之一。另外就是发现了一个claim功能:认领你自己发布的Feed。
认领流程如下:
0 输入你的blog地址:bloglines会分别生成2个key,一个ckey(用于证明你能发表内容),一个ukey(用于证明你能拥有网站);
1 发一篇blog:里面包含feedkey:<!-- ckey="#######" -->
2 修改首页模板:包含一个:<!-- ukey="#######" -->
结果:如果bloglines在首页上同时发现了ckey+ukey,并且在feed中只发现了ckey就成功了。
从站点留言中看到的这个消息。Dreamhost 在搞大优惠。按照他们时区的时间还有几个小时了。只可惜我已经有帐户在上面了,多申请一个也没有什么用途。有需要的朋友可以去立刻拿一个。更优惠的价格,更大的空间与带宽,很划算。
To celebrate nine years in the hosting biz, we're having an absolutely CRAZY one-day-only sale! TODAY! This offer is good for accounts that sign up on October 3rd, 2006 before 11:59 PM PDT only! Don't miss the boat!Sign up for any of our hosting plans TODAY using either the one or two-year prepay option and use the promo code "9999". You'll get an INSTANT discount of $99.99 off your bill!
But that's not all!
We've also upped our plan limits like crazy! All our plans now have at least DOUBLE the amount of bandwidth and up to TEN TIMES the amount of disk space they had yesterday! You'll get to keep that extra disk space and bandwidth for as long as you keep your account active!The Fine Print:
This offer is for new customers only - those who do not have an active account with DreamHost.
If you sign up and forget to use the "9999" promotional code, you WILL NOT receive the sale pricing. No amount of complaining will change this!
Domains and accounts may not be transferred from an existing DreamHost Web Hosting account to a "9999" hosting plan.
谢谢带来这个消息的朋友。
--EOF--
从后台日志上观察到有大量来自 114.com.cn 的搜索。最开始没有注意,还以为是 VNet 过来的--都有个 114 嘛。这两天查询突然暴增,仔细一看,还真不是一回事:
$ grep 114.com.cn access.log |awk '{print substr($11,1,80)}' |head "http://so.114.com.cn/usearchp?keyword=\xd4\xbd\xd3\xfc\xb5\xda\xb6\xfe\xbc\xbe& "http://so.114.com.cn/usearchp?keyword=\xd4\xbd\xd3\xfc\xb5\xda\xb6\xfe\xbc\xbe& "http://so.114.com.cn/usearchp?keyword=\xd4\xbd\xd3\xfc\xb5\xda\xb6\xfe\xbc\xbe& "http://so.114.com.cn/usearchp?keyword=\xd4\xbd\xd3\xfc\xb5\xda\xb6\xfe\xbc\xbe& "http://so.114.com.cn/usearchp?keyword=\xd4\xbd\xd3\xfc\xb5\xda\xb6\xfe\xbc\xbe& "http://so.114.com.cn/usearchp?logo=1&keyword=\xd4\xbd\xd3\xfc\xb5\xda\xd2\xbb\x "http://so.114.com.cn/usearchp?logo=1&keyword=\xd4\xbd\xd3\xfc\xb5\xda\xd2\xbb\x "http://so.114.com.cn/usearchp?keyword=\xd4\xbd\xd3\xfc\xb5\xda\xb6\xfe\xbc\xbe& "http://so.114.com.cn/usearchp?keyword=\xd4\xbd\xd3\xfc\xb5\xda\xb6\xfe\xbc\xbe& "http://so.114.com.cn/usearchp?keyword=\xd4\xbd\xd3\xfc\xb5\xda\xb6\xfe\xbc\xbe&
为了节省空间,没有把 URL 都打出来,所用的参数极为诡异,后面还有几个奇怪的参数,AWstats 也根本不能探测到引用的关键词是什么。
搜索了一下,这个站点叫"中国网上黄页", 是厦门的一家叫什么"中资源"的公司做的。
添加了一下这个搜索引擎的定义,暂时叫他 '114' 吧. 添加定义挺简单的,我做的修改:
$ diff search_engines.pm search_engines.pm.backup1003
192d191
< '114\.com\.cn',
366d364
< '114\.com\.cn','114',
578d575
< '114','keyword=',
754d750
< '114','114',
观察到的效果:
来自搜索引擎 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
感觉国内的个别搜索引擎根本不关心什么规范之类的事情,Bot 随便爬,爱咋咋地的态度。
--EOF--