FeedBurner的更新频度: 30分钟同步一次


今天看了一下FeedBurner的同步策略: 他们在带宽的节省方面还是下了很大的工夫的。

首先最近3天的日志中:只有少量的是 真正产生流量的200访问,大部分都向服务器发送了缓存校验,服务器返回是304(未更新) 只有当有新条目生成的时 才返回新的内容。而收到新条目后,FeedBurner还会发送HEAD校验一下新条目URL是否存在。

grep http://www.FeedBurner.com chedong_access_log.200601*|awk '{print $0}'|grep -v 304
chedong_access_log.20060117:66.150.96.109 - - [17/Jan/2006:06:00:52 +0800] "HEAD /blog/archives/001065.html HTTP/1.1" 200 0 "-" "FeedBurner/1.0 (http://www.FeedBurner.com)" 66.150.96.109.36261137448852425
chedong_access_log.20060118:66.150.96.109 - - [18/Jan/2006:11:47:43 +0800] "GET /blog/index.rdf HTTP/1.1" 200 29845 "-" "FeedBurner/1.0 (http://www.FeedBurner.com)" 66.150.96.109.312341137556063122
chedong_access_log.20060118:66.150.96.109 - - [18/Jan/2006:12:49:59 +0800] "GET /blog/index.rdf HTTP/1.1" 200 32033 "-" "FeedBurner/1.0 (http://www.FeedBurner.com)" 66.150.96.109.234351137559798399
chedong_access_log.20060119:66.150.96.109 - - [19/Jan/2006:13:03:27 +0800] "HEAD /blog/archives/001111.html HTTP/1.1" 200 0 "-" "FeedBurner/1.0 (http://www.FeedBurner.com)" 66.150.96.109.127431137647007148

原始日志:大约30分钟左右同步一次: 虽然访问频度很高,但是产生流量的只有少数的几次。

grep http://www.FeedBurner.com chedong_access_log.200601*|awk '{print $4,$6,$7,$9,$10}'
[17/Jan/2006:00:09:39 "GET /blog/index.rdf 304 -
[17/Jan/2006:00:41:21 "GET /blog/index.rdf 304 -
[17/Jan/2006:01:13:06 "GET /blog/index.rdf 304 -
[17/Jan/2006:01:43:14 "GET /blog/index.rdf 304 -
[17/Jan/2006:02:15:51 "GET /blog/index.rdf 304 -
[17/Jan/2006:02:49:20 "GET /blog/index.rdf 304 -
[17/Jan/2006:03:19:39 "GET /blog/index.rdf 304 -
[17/Jan/2006:03:50:27 "GET /blog/index.rdf 304 -
[17/Jan/2006:04:20:53 "GET /blog/index.rdf 304 -
[17/Jan/2006:04:51:48 "GET /blog/index.rdf 304 -
[17/Jan/2006:05:25:30 "GET /blog/index.rdf 304 -
[17/Jan/2006:05:56:12 "GET /blog/index.rdf 304 -
[17/Jan/2006:06:00:52 "HEAD /blog/archives/001065.html 200 0
[17/Jan/2006:06:26:21 "GET /blog/index.rdf 304 -
[17/Jan/2006:06:59:27 "GET /blog/index.rdf 304 -
[17/Jan/2006:07:30:10 "GET /blog/index.rdf 304 -
[17/Jan/2006:08:01:09 "GET /blog/index.rdf 304 -
[17/Jan/2006:08:33:45 "GET /blog/index.rdf 304 -
[17/Jan/2006:09:04:12 "GET /blog/index.rdf 304 -
[17/Jan/2006:09:34:14 "GET /blog/index.rdf 304 -
[17/Jan/2006:10:05:36 "GET /blog/index.rdf 304 -
[17/Jan/2006:10:35:32 "GET /blog/index.rdf 304 -
[17/Jan/2006:11:06:13 "GET /blog/index.rdf 304 -
[17/Jan/2006:11:36:39 "GET /blog/index.rdf 304 -
[17/Jan/2006:12:06:58 "GET /blog/index.rdf 304 -
[17/Jan/2006:12:36:58 "GET /blog/index.rdf 304 -
[17/Jan/2006:13:08:36 "GET /blog/index.rdf 304 -
[17/Jan/2006:13:39:37 "GET /blog/index.rdf 304 -
[17/Jan/2006:14:09:46 "GET /blog/index.rdf 304 -
[17/Jan/2006:14:39:48 "GET /blog/index.rdf 304 -
[17/Jan/2006:15:11:06 "GET /blog/index.rdf 304 -
[17/Jan/2006:15:41:12 "GET /blog/index.rdf 304 -
[17/Jan/2006:16:11:29 "GET /blog/index.rdf 304 -
[17/Jan/2006:16:41:46 "GET /blog/index.rdf 304 -
[17/Jan/2006:17:12:00 "GET /blog/index.rdf 304 -
[17/Jan/2006:17:42:27 "GET /blog/index.rdf 304 -
[17/Jan/2006:18:13:37 "GET /blog/index.rdf 304 -
[17/Jan/2006:18:45:23 "GET /blog/index.rdf 304 -
[17/Jan/2006:19:18:56 "GET /blog/index.rdf 304 -
[17/Jan/2006:19:49:45 "GET /blog/index.rdf 304 -
[17/Jan/2006:20:19:49 "GET /blog/index.rdf 304 -
[17/Jan/2006:20:50:18 "GET /blog/index.rdf 304 -
[17/Jan/2006:21:20:57 "GET /blog/index.rdf 304 -
[17/Jan/2006:21:51:02 "GET /blog/index.rdf 304 -
[17/Jan/2006:22:21:36 "GET /blog/index.rdf 304 -
[17/Jan/2006:22:51:54 "GET /blog/index.rdf 304 -
[17/Jan/2006:23:22:00 "GET /blog/index.rdf 304 -
[17/Jan/2006:23:52:05 "GET /blog/index.rdf 304 -
[18/Jan/2006:00:22:13 "GET /blog/index.rdf 304 -
[18/Jan/2006:00:57:30 "GET /blog/index.rdf 304 -
[18/Jan/2006:01:27:58 "GET /blog/index.rdf 304 -
[18/Jan/2006:01:59:56 "GET /blog/index.rdf 304 -
[18/Jan/2006:02:29:56 "GET /blog/index.rdf 304 -
[18/Jan/2006:03:01:31 "GET /blog/index.rdf 304 -
[18/Jan/2006:03:33:44 "GET /blog/index.rdf 304 -
[18/Jan/2006:04:07:14 "GET /blog/index.rdf 304 -
[18/Jan/2006:04:37:03 "GET /blog/index.rdf 304 -
[18/Jan/2006:05:08:38 "GET /blog/index.rdf 304 -
[18/Jan/2006:05:39:33 "GET /blog/index.rdf 304 -
[18/Jan/2006:06:09:44 "GET /blog/index.rdf 304 -
[18/Jan/2006:06:41:03 "GET /blog/index.rdf 304 -
[18/Jan/2006:07:11:17 "GET /blog/index.rdf 304 -
[18/Jan/2006:07:45:12 "GET /blog/index.rdf 304 -
[18/Jan/2006:08:16:06 "GET /blog/index.rdf 304 -
[18/Jan/2006:08:48:01 "GET /blog/index.rdf 304 -
[18/Jan/2006:09:19:47 "GET /blog/index.rdf 304 -
[18/Jan/2006:09:49:48 "GET /blog/index.rdf 304 -
[18/Jan/2006:10:19:54 "GET /blog/index.rdf 304 -
[18/Jan/2006:10:50:07 "GET /blog/index.rdf 304 -
[18/Jan/2006:11:20:56 "GET /blog/index.rdf 304 -
[18/Jan/2006:11:47:43 "GET /blog/index.rdf 200 29845
[18/Jan/2006:12:19:52 "GET /blog/index.rdf 304 -
[18/Jan/2006:12:49:59 "GET /blog/index.rdf 200 32033
[18/Jan/2006:13:20:02 "GET /blog/index.rdf 304 -
[18/Jan/2006:13:50:12 "GET /blog/index.rdf 304 -
[18/Jan/2006:14:20:19 "GET /blog/index.rdf 304 -
[18/Jan/2006:14:50:30 "GET /blog/index.rdf 304 -
[18/Jan/2006:15:21:00 "GET /blog/index.rdf 304 -
[18/Jan/2006:15:51:05 "GET /blog/index.rdf 304 -
[18/Jan/2006:16:21:40 "GET /blog/index.rdf 304 -
[18/Jan/2006:16:51:56 "GET /blog/index.rdf 304 -
[18/Jan/2006:17:22:31 "GET /blog/index.rdf 304 -
[18/Jan/2006:17:53:29 "GET /blog/index.rdf 304 -
[18/Jan/2006:18:25:48 "GET /blog/index.rdf 304 -
[18/Jan/2006:18:56:50 "GET /blog/index.rdf 304 -
[18/Jan/2006:19:27:35 "GET /blog/index.rdf 304 -
[18/Jan/2006:19:58:42 "GET /blog/index.rdf 304 -
[18/Jan/2006:20:29:44 "GET /blog/index.rdf 304 -
[18/Jan/2006:21:00:06 "GET /blog/index.rdf 304 -
[18/Jan/2006:21:30:26 "GET /blog/index.rdf 304 -
[18/Jan/2006:22:01:13 "GET /blog/index.rdf 304 -
[18/Jan/2006:22:33:47 "GET /blog/index.rdf 304 -
[18/Jan/2006:23:05:14 "GET /blog/index.rdf 304 -
[18/Jan/2006:23:36:47 "GET /blog/index.rdf 304 -
[19/Jan/2006:00:07:05 "GET /blog/index.rdf 304 -
[19/Jan/2006:00:37:56 "GET /blog/index.rdf 304 -
[19/Jan/2006:01:10:10 "GET /blog/index.rdf 304 -
[19/Jan/2006:01:41:16 "GET /blog/index.rdf 304 -
[19/Jan/2006:02:11:46 "GET /blog/index.rdf 304 -
[19/Jan/2006:02:46:42 "GET /blog/index.rdf 304 -
[19/Jan/2006:03:20:13 "GET /blog/index.rdf 304 -
[19/Jan/2006:03:50:16 "GET /blog/index.rdf 304 -
[19/Jan/2006:04:21:03 "GET /blog/index.rdf 304 -
[19/Jan/2006:04:51:33 "GET /blog/index.rdf 304 -
[19/Jan/2006:05:21:36 "GET /blog/index.rdf 304 -
[19/Jan/2006:05:51:47 "GET /blog/index.rdf 304 -
[19/Jan/2006:06:21:55 "GET /blog/index.rdf 304 -
[19/Jan/2006:06:51:59 "GET /blog/index.rdf 304 -
[19/Jan/2006:07:22:05 "GET /blog/index.rdf 304 -
[19/Jan/2006:07:52:15 "GET /blog/index.rdf 304 -
[19/Jan/2006:08:26:11 "GET /blog/index.rdf 304 -
[19/Jan/2006:08:57:10 "GET /blog/index.rdf 304 -
[19/Jan/2006:09:28:29 "GET /blog/index.rdf 304 -
[19/Jan/2006:10:00:04 "GET /blog/index.rdf 304 -
[19/Jan/2006:10:30:05 "GET /blog/index.rdf 304 -
[19/Jan/2006:11:01:01 "GET /blog/index.rdf 304 -
[19/Jan/2006:11:31:20 "GET /blog/index.rdf 304 -
[19/Jan/2006:12:01:51 "GET /blog/index.rdf 304 -
[19/Jan/2006:12:32:41 "GET /blog/index.rdf 304 -
[19/Jan/2006:13:03:16 "GET /blog/index.rdf 304 -
[19/Jan/2006:13:03:27 "HEAD /blog/archives/001111.html 200 0
[19/Jan/2006:13:34:50 "GET /blog/index.rdf 304 -
[19/Jan/2006:14:06:07 "GET /blog/index.rdf 304 -
[19/Jan/2006:14:36:38 "GET /blog/index.rdf 304 -
[19/Jan/2006:15:06:41 "GET /blog/index.rdf 304 -
[19/Jan/2006:15:37:11 "GET /blog/index.rdf 304 -
[19/Jan/2006:16:08:19 "GET /blog/index.rdf 304 -
[19/Jan/2006:16:40:24 "GET /blog/index.rdf 304 -
[19/Jan/2006:17:10:53 "GET /blog/index.rdf 304 -
[19/Jan/2006:17:46:15 "GET /blog/index.rdf 304 -
[19/Jan/2006:18:17:42 "GET /blog/index.rdf 304 -
[19/Jan/2006:18:47:59 "GET /blog/index.rdf 304 -
[19/Jan/2006:19:19:48 "GET /blog/index.rdf 304 -
[19/Jan/2006:19:50:22 "GET /blog/index.rdf 304 -
[19/Jan/2006:20:20:27 "GET /blog/index.rdf 304 -
[19/Jan/2006:20:50:57 "GET /blog/index.rdf 304 -
[19/Jan/2006:21:21:38 "GET /blog/index.rdf 304 -
[19/Jan/2006:21:52:13 "GET /blog/index.rdf 304 -
[19/Jan/2006:22:22:37 "GET /blog/index.rdf 304 -
[19/Jan/2006:22:55:25 "GET /blog/index.rdf 304 -
[19/Jan/2006:23:26:48 "GET /blog/index.rdf 304 -
[20/Jan/2006:00:00:23 "GET /blog/index.rdf 304 -
[20/Jan/2006:00:30:40 "GET /blog/index.rdf 304 -


BlogLines的策略类似:30分钟同步一次,但是对于index.xml index.rdf atom.xml这几个feed还是会重取index.xml一次,其他2个校验是否有更新。
[19/Jan/2006:21:38:37 "GET /blog/index.xml 200 12831 "Bloglines/3.0-rho
[19/Jan/2006:21:38:37 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:21:38:37 "GET /blog/atom.xml 304 - "Bloglines/3.0-rho
[19/Jan/2006:21:39:47 "GET /grassland/index.xml 200 19744 "Bloglines/3.0-rho
[19/Jan/2006:21:43:29 "GET /cache/opml.xml 200 5958 "Bloglines/3.0-rho
[19/Jan/2006:21:43:40 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:03:09 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:03:09 "GET /blog/atom.xml 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:03:09 "GET /blog/index.xml 200 12831 "Bloglines/3.0-rho
[19/Jan/2006:22:04:41 "GET /grassland/index.xml 200 19744 "Bloglines/3.0-rho
[19/Jan/2006:22:09:09 "GET /cache/opml.xml 200 5958 "Bloglines/3.0-rho
[19/Jan/2006:22:09:22 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:32:44 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:32:44 "GET /blog/atom.xml 304 - "Bloglines/3.0-rho
[19/Jan/2006:22:32:45 "GET /blog/index.xml 200 12831 "Bloglines/3.0-rho
[19/Jan/2006:22:34:00 "GET /grassland/index.xml 200 19744 "Bloglines/3.0-rho
[19/Jan/2006:22:38:00 "GET /cache/opml.xml 200 5958 "Bloglines/3.0-rho
[19/Jan/2006:22:38:13 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:23:02:56 "GET /blog/index.rdf 304 - "Bloglines/3.0-rho
[19/Jan/2006:23:02:56 "GET /blog/atom.xml 304 - "Bloglines/3.0-rho
[19/Jan/2006:23:02:58 "GET /blog/index.xml 200 12831 "Bloglines/3.0-rho


参考资料: http返回状态码

http://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html

作者:车东 发表于:2006-01-20 00:01 最后更新于:2007-04-15 19:04
版权声明:可以转载,转载时请务必以超链接形式标明文章 的原始出处和作者信息及本版权声明

引用通告

以下是前来引用的链接: FeedBurner的更新频度: 30分钟同步一次:

» 配合搜索引擎将页面静态化 来自 Blog^Bug=BlogBug;
对于一般的海量型搜索引擎,大多由机器人在一定时间内循环抓取内容源,这势必会造成重复访问部分页面。其重复的频度一般由搜索引擎自己决定,但大多是根据网页的优先级来考虑。如果... [阅读更多细节]

Comments

请教个问题,我用feedburner烧成的种子,在订阅器中常常会出现乱码,但有时又是正常的,如果我使用WP自己的种子,则不会出现这种问题,这个问题是出在feedburner还是我自己身上?如何解决呢?谢谢!

我怀疑FB可能还会分析RSS更新频度,并动态调整抓取频度。

:) I love feedburener

这30分钟我几乎感觉不到,感觉就是“完全同步”的

Yup, but you can manually ping your blog from feedburner to trigger the update.

发表一个评论

(如果你此前从未在此 Blog 上发表过评论,则你的评论必须在 Blog 主人验证后才能显示,请你耐心等候。)

相关文章

关于

此页面包含了发表于2006年01月20日 夜间12时17分的 Blog 上的单篇日记。

此 Blog 的前一篇日记是 分布式应用日志的集中化存储

此 Blog 的后一篇日记是 AJAX技术如何节省应用的带宽:多次交互,每次少量更新

更多信息可在 主索引 页和 归档 页看到。

Creative Commons License
此 Blog 中的日记遵循以下授权 Creative Commons(创作共用)授权.
Powered by
Movable Type 3.36