今天看了一下FeedBurner的同步策略: 他们在带宽的节省方面还是下了很大的工夫的。
首先最近3天的日志中:只有少量的是 真正产生流量的200访问,大部分都向服务器发送了缓存校验,服务器返回是304(未更新) 只有当有新条目生成的时 才返回新的内容。而收到新条目后,FeedBurner还会发送HEAD校验一下新条目URL是否存在。
grep http://www.FeedBurner.com chedong_access_log.200601*|awk '{print $0}'|grep -v 304
chedong_access_log.20060117:66.150.96.109 - - [17/Jan/2006:06:00:52 +0800] "HEAD /blog/archives/001065.html HTTP/1.1" 200 0 "-" "FeedBurner/1.0 (http://www.FeedBurner.com)" 66.150.96.109.36261137448852425
chedong_access_log.20060118:66.150.96.109 - - [18/Jan/2006:11:47:43 +0800] "GET /blog/index.rdf HTTP/1.1" 200 29845 "-" "FeedBurner/1.0 (http://www.FeedBurner.com)" 66.150.96.109.312341137556063122
chedong_access_log.20060118:66.150.96.109 - - [18/Jan/2006:12:49:59 +0800] "GET /blog/index.rdf HTTP/1.1" 200 32033 "-" "FeedBurner/1.0 (http://www.FeedBurner.com)" 66.150.96.109.234351137559798399
chedong_access_log.20060119:66.150.96.109 - - [19/Jan/2006:13:03:27 +0800] "HEAD /blog/archives/001111.html HTTP/1.1" 200 0 "-" "FeedBurner/1.0 (http://www.FeedBurner.com)" 66.150.96.109.127431137647007148
原始日志:大约30分钟左右同步一次: 虽然访问频度很高,但是产生流量的只有少数的几次。
grep http://www.FeedBurner.com chedong_access_log.200601*|awk '{print $4,$6,$7,$9,$10}'
[17/Jan/2006:00:09:39 "GET /blog/index.rdf 304 -
[17/Jan/2006:00:41:21 "GET /blog/index.rdf 304 -
[17/Jan/2006:01:13:06 "GET /blog/index.rdf 304 -
[17/Jan/2006:01:43:14 "GET /blog/index.rdf 304 -
[17/Jan/2006:02:15:51 "GET /blog/index.rdf 304 -
[17/Jan/2006:02:49:20 "GET /blog/index.rdf 304 -
[17/Jan/2006:03:19:39 "GET /blog/index.rdf 304 -
[17/Jan/2006:03:50:27 "GET /blog/index.rdf 304 -
[17/Jan/2006:04:20:53 "GET /blog/index.rdf 304 -
[17/Jan/2006:04:51:48 "GET /blog/index.rdf 304 -
[17/Jan/2006:05:25:30 "GET /blog/index.rdf 304 -
[17/Jan/2006:05:56:12 "GET /blog/index.rdf 304 -
[17/Jan/2006:06:00:52 "HEAD /blog/archives/001065.html 200 0
[17/Jan/2006:06:26:21 "GET /blog/index.rdf 304 -
[17/Jan/2006:06:59:27 "GET /blog/index.rdf 304 -
[17/Jan/2006:07:30:10 "GET /blog/index.rdf 304 -
[17/Jan/2006:08:01:09 "GET /blog/index.rdf 304 -
[17/Jan/2006:08:33:45 "GET /blog/index.rdf 304 -
[17/Jan/2006:09:04:12 "GET /blog/index.rdf 304 -
[17/Jan/2006:09:34:14 "GET /blog/index.rdf 304 -
[17/Jan/2006:10:05:36 "GET /blog/index.rdf 304 -
[17/Jan/2006:10:35:32 "GET /blog/index.rdf 304 -
[17/Jan/2006:11:06:13 "GET /blog/index.rdf 304 -
[17/Jan/2006:11:36:39 "GET /blog/index.rdf 304 -
[17/Jan/2006:12:06:58 "GET /blog/index.rdf 304 -
[17/Jan/2006:12:36:58 "GET /blog/index.rdf 304 -
[17/Jan/2006:13:08:36 "GET /blog/index.rdf 304 -
[17/Jan/2006:13:39:37 "GET /blog/index.rdf 304 -
[17/Jan/2006:14:09:46 "GET /blog/index.rdf 304 -
[17/Jan/2006:14:39:48 "GET /blog/index.rdf 304 -
[17/Jan/2006:15:11:06 "GET /blog/index.rdf 304 -
[17/Jan/2006:15:41:12 "GET /blog/index.rdf 304 -
[17/Jan/2006:16:11:29 "GET /blog/index.rdf 304 -
[17/Jan/2006:16:41:46 "GET /blog/index.rdf 304 -
[17/Jan/2006:17:12:00 "GET /blog/index.rdf 304 -
[17/Jan/2006:17:42:27 "GET /blog/index.rdf 304 -
[17/Jan/2006:18:13:37 "GET /blog/index.rdf 304 -
[17/Jan/2006:18:45:23 "GET /blog/index.rdf 304 -
[17/Jan/2006:19:18:56 "GET /blog/index.rdf 304 -
[17/Jan/2006:19:49:45 "GET /blog/index.rdf 304 -
[17/Jan/2006:20:19:49 "GET /blog/index.rdf 304 -
[17/Jan/2006:20:50:18 "GET /blog/index.rdf 304 -
[17/Jan/2006:21:20:57 "GET /blog/index.rdf 304 -
[17/Jan/2006:21:51:02 "GET /blog/index.rdf 304 -
[17/Jan/2006:22:21:36 "GET /blog/index.rdf 304 -
[17/Jan/2006:22:51:54 "GET /blog/index.rdf 304 -
[17/Jan/2006:23:22:00 "GET /blog/index.rdf 304 -
[17/Jan/2006:23:52:05 "GET /blog/index.rdf 304 -
[18/Jan/2006:00:22:13 "GET /blog/index.rdf 304 -
[18/Jan/2006:00:57:30 "GET /blog/index.rdf 304 -
[18/Jan/2006:01:27:58 "GET /blog/index.rdf 304 -
[18/Jan/2006:01:59:56 "GET /blog/index.rdf 304 -
[18/Jan/2006:02:29:56 "GET /blog/index.rdf 304 -
[18/Jan/2006:03:01:31 "GET /blog/index.rdf 304 -
[18/Jan/2006:03:33:44 "GET /blog/index.rdf 304 -
[18/Jan/2006:04:07:14 "GET /blog/index.rdf 304 -
[18/Jan/2006:04:37:03 "GET /blog/index.rdf 304 -
[18/Jan/2006:05:08:38 "GET /blog/index.rdf 304 -
[18/Jan/2006:05:39:33 "GET /blog/index.rdf 304 -
[18/Jan/2006:06:09:44 "GET /blog/index.rdf 304 -
[18/Jan/2006:06:41:03 "GET /blog/index.rdf 304 -
[18/Jan/2006:07:11:17 "GET /blog/index.rdf 304 -
[18/Jan/2006:07:45:12 "GET /blog/index.rdf 304 -
[18/Jan/2006:08:16:06 "GET /blog/index.rdf 304 -
[18/Jan/2006:08:48:01 "GET /blog/index.rdf 304 -
[18/Jan/2006:09:19:47 "GET /blog/index.rdf 304 -
[18/Jan/2006:09:49:48 "GET /blog/index.rdf 304 -
[18/Jan/2006:10:19:54 "GET /blog/index.rdf 304 -
[18/Jan/2006:10:50:07 "GET /blog/index.rdf 304 -
[18/Jan/2006:11:20:56 "GET /blog/index.rdf 304 -
[18/Jan/2006:11:47:43 "GET /blog/index.rdf 200 29845
[18/Jan/2006:12:19:52 "GET /blog/index.rdf 304 -
[18/Jan/2006:12:49:59 "GET /blog/index.rdf 200 32033
[18/Jan/2006:13:20:02 "GET /blog/index.rdf 304 -
[18/Jan/2006:13:50:12 "GET /blog/index.rdf 304 -
[18/Jan/2006:14:20:19 "GET /blog/index.rdf 304 -
[18/Jan/2006:14:50:30 "GET /blog/index.rdf 304 -
[18/Jan/2006:15:21:00 "GET /blog/index.rdf 304 -
[18/Jan/2006:15:51:05 "GET /blog/index.rdf 304 -
[18/Jan/2006:16:21:40 "GET /blog/index.rdf 304 -
[18/Jan/2006:16:51:56 "GET /blog/index.rdf 304 -
[18/Jan/2006:17:22:31 "GET /blog/index.rdf 304 -
[18/Jan/2006:17:53:29 "GET /blog/index.rdf 304 -
[18/Jan/2006:18:25:48 "GET /blog/index.rdf 304 -
[18/Jan/2006:18:56:50 "GET /blog/index.rdf 304 -
[18/Jan/2006:19:27:35 "GET /blog/index.rdf 304 -
[18/Jan/2006:19:58:42 "GET /blog/index.rdf 304 -
[18/Jan/2006:20:29:44 "GET /blog/index.rdf 304 -
[18/Jan/2006:21:00:06 "GET /blog/index.rdf 304 -
[18/Jan/2006:21:30:26 "GET /blog/index.rdf 304 -
[18/Jan/2006:22:01:13 "GET /blog/index.rdf 304 -
[18/Jan/2006:22:33:47 "GET /blog/index.rdf 304 -
[18/Jan/2006:23:05:14 "GET /blog/index.rdf 304 -
[18/Jan/2006:23:36:47 "GET /blog/index.rdf 304 -
[19/Jan/2006:00:07:05 "GET /blog/index.rdf 304 -
[19/Jan/2006:00:37:56 "GET /blog/index.rdf 304 -
[19/Jan/2006:01:10:10 "GET /blog/index.rdf 304 -
[19/Jan/2006:01:41:16 "GET /blog/index.rdf 304 -
[19/Jan/2006:02:11:46 "GET /blog/index.rdf 304 -
[19/Jan/2006:02:46:42 "GET /blog/index.rdf 304 -
[19/Jan/2006:03:20:13 "GET /blog/index.rdf 304 -
[19/Jan/2006:03:50:16 "GET /blog/index.rdf 304 -
[19/Jan/2006:04:21:03 "GET /blog/index.rdf 304 -
[19/Jan/2006:04:51:33 "GET /blog/index.rdf 304 -
[19/Jan/2006:05:21:36 "GET /blog/index.rdf 304 -
[19/Jan/2006:05:51:47 "GET /blog/index.rdf 304 -
[19/Jan/2006:06:21:55 "GET /blog/index.rdf 304 -
[19/Jan/2006:06:51:59 "GET /blog/index.rdf 304 -
[19/Jan/2006:07:22:05 "GET /blog/index.rdf 304 -
[19/Jan/2006:07:52:15 "GET /blog/index.rdf 304 -
[19/Jan/2006:08:26:11 "GET /blog/index.rdf 304 -
[19/Jan/2006:08:57:10 "GET /blog/index.rdf 304 -
[19/Jan/2006:09:28:29 "GET /blog/index.rdf 304 -
[19/Jan/2006:10:00:04 "GET /blog/index.rdf 304 -
[19/Jan/2006:10:30:05 "GET /blog/index.rdf 304 -
[19/Jan/2006:11:01:01 "GET /blog/index.rdf 304 -
[19/Jan/2006:11:31:20 "GET /blog/index.rdf 304 -
[19/Jan/2006:12:01:51 "GET /blog/index.rdf 304 -
[19/Jan/2006:12:32:41 "GET /blog/index.rdf 304 -
[19/Jan/2006:13:03:16 "GET /blog/index.rdf 304 -
[19/Jan/2006:13:03:27 "HEAD /blog/archives/001111.html 200 0
[19/Jan/2006:13:34:50 "GET /blog/index.rdf 304 -
[19/Jan/2006:14:06:07 "GET /blog/index.rdf 304 -
[19/Jan/2006:14:36:38 "GET /blog/index.rdf 304 -
[19/Jan/2006:15:06:41 "GET /blog/index.rdf 304 -
[19/Jan/2006:15:37:11 "GET /blog/index.rdf 304 -
[19/Jan/2006:16:08:19 "GET /blog/index.rdf 304 -
[19/Jan/2006:16:40:24 "GET /blog/index.rdf 304 -
[19/Jan/2006:17:10:53 "GET /blog/index.rdf 304 -
[19/Jan/2006:17:46:15 "GET /blog/index.rdf 304 -
[19/Jan/2006:18:17:42 "GET /blog/index.rdf 304 -
[19/Jan/2006:18:47:59 "GET /blog/index.rdf 304 -
[19/Jan/2006:19:19:48 "GET /blog/index.rdf 304 -
[19/Jan/2006:19:50:22 "GET /blog/index.rdf 304 -
[19/Jan/2006:20:20:27 "GET /blog/index.rdf 304 -
[19/Jan/2006:20:50:57 "GET /blog/index.rdf 304 -
[19/Jan/2006:21:21:38 "GET /blog/index.rdf 304 -
[19/Jan/2006:21:52:13 "GET /blog/index.rdf 304 -
[19/Jan/2006:22:22:37 "GET /blog/index.rdf 304 -
[19/Jan/2006:22:55:25 "GET /blog/index.rdf 304 -
[19/Jan/2006:23:26:48 "GET /blog/index.rdf 304 -
[20/Jan/2006:00:00:23 "GET /blog/index.rdf 304 -
[20/Jan/2006:00:30:40 "GET /blog/index.rdf 304 -
BlogLines的策略类似:30分钟同步一次,但是对于index.xml index.rdf atom.xml这几个feed还是会重取index.xml一次,其他2个校验是否有更新。
[19/Jan/2006:21:38:37 "GET /blog/index.xml 200 12831 "Bloglines/3.0-rho
[19/Jan/2006:21:38:37 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:21:38:37 "GET /blog/atom.xml 304 - "Bloglines/3.0-rho
[19/Jan/2006:21:39:47 "GET /grassland/index.xml 200 19744 "Bloglines/3.0-rho
[19/Jan/2006:21:43:29 "GET /cache/opml.xml 200 5958 "Bloglines/3.0-rho
[19/Jan/2006:21:43:40 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:03:09 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:03:09 "GET /blog/atom.xml 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:03:09 "GET /blog/index.xml 200 12831 "Bloglines/3.0-rho
[19/Jan/2006:22:04:41 "GET /grassland/index.xml 200 19744 "Bloglines/3.0-rho
[19/Jan/2006:22:09:09 "GET /cache/opml.xml 200 5958 "Bloglines/3.0-rho
[19/Jan/2006:22:09:22 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:32:44 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:32:44 "GET /blog/atom.xml 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:32:45 "GET /blog/index.xml 200 12831 "Bloglines/3.0-rho
[19/Jan/2006:22:34:00 "GET /grassland/index.xml 200 19744 "Bloglines/3.0-rho
[19/Jan/2006:22:38:00 "GET /cache/opml.xml 200 5958 "Bloglines/3.0-rho
[19/Jan/2006:22:38:13 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:23:02:56 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:23:02:56 "GET /blog/atom.xml 304 - "Bloglines/3.0-rho
[19/Jan/2006:23:02:58 "GET /blog/index.xml 200 12831 "Bloglines/3.0-rho
参考资料: http返回状态码
http://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html
版权声明:可以转载,转载时请务必以超链接形式标明文章 FeedBurner的更新频度: 30分钟同步一次 的原始出处和作者信息及本版权声明。
http://www.chedong.com/blog/archives/001113.html
Comments
请教个问题,我用feedburner烧成的种子,在订阅器中常常会出现乱码,但有时又是正常的,如果我使用WP自己的种子,则不会出现这种问题,这个问题是出在feedburner还是我自己身上?如何解决呢?谢谢!
由: ChamPioNeer 发表于 2006年01月20日 上午09时10分
我怀疑FB可能还会分析RSS更新频度,并动态调整抓取频度。
由: Andy 发表于 2006年01月20日 上午09时56分
:) I love feedburener
由: pica 发表于 2006年01月22日 夜间03时35分
这30分钟我几乎感觉不到,感觉就是“完全同步”的
由: woobc 发表于 2006年01月22日 上午11时42分
Yup, but you can manually ping your blog from feedburner to trigger the update.
由: CetaMac 发表于 2006年02月16日 下午04时56分