08 Jun 2005 - Digest by CheDong.com

23:08 Secrets of Product Development and What Journalists Write » Jeremy Zawodny's blog

Before I came out to California to work at Yahoo, I watched the business and culture of Silicon Valley from a distance. I read lots of the trade rags, tech web sites, and books about early Internet companies (the Netscape era). One of the things that amazed me about Internet companies (usually the portals) was how quickly they built things and how they were able to react to each other's moves with frightening speed. Company X would do something amazing and new...

19:35 GNU Toolbox: Using the Command Line in Place of SQL » 车东[Blog^2]

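Winter recently taught me a file comparison command: comm, a simpler way than diff to get the intersection or complement of two files. I used to think this needed a join of two tables; now a few parameters are enough.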
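A minimal sketch of the comm usage described above. The file names a.txt and b.txt and their contents are made up for illustration; the one firm requirement is that comm expects both inputs to be sorted.

```shell
# Throwaway directory with two small, already-sorted sample files (hypothetical data).
workdir=$(mktemp -d)
printf '%s\n' apple banana cherry > "$workdir/a.txt"
printf '%s\n' banana cherry date > "$workdir/b.txt"

# comm prints three columns: lines only in file 1, lines only in file 2, lines in both.
# The options -1, -2 and -3 suppress the corresponding column.
both=$(comm -12 "$workdir/a.txt" "$workdir/b.txt")     # intersection
only_a=$(comm -23 "$workdir/a.txt" "$workdir/b.txt")   # in a.txt but not in b.txt
only_b=$(comm -13 "$workdir/a.txt" "$workdir/b.txt")   # in b.txt but not in a.txt

printf 'both:\n%s\nonly in a:\n%s\nonly in b:\n%s\n' "$both" "$only_a" "$only_b"
rm -rf "$workdir"
```

If the inputs are not sorted, running each through sort first is the usual fix.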

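Over time I have found that much of the sorting, filtering, and column output I used to think needed a database can be done in the shell instead, and more efficiently. I am now organizing the one-liner toolset I have accumulated. The tools I use most often are:
awk: works like select, controlling which columns are output; it also has simple built-ins such as length() and modulo, can express more complex logic through if conditions, and is easier to read than perl
sed: restricts output to a given range of lines, equivalent to limit 30,40
perl: regular-expression filtering and substitution; very powerful, and many ready-made one-liners can be found online, though they are somewhat hard to read;
sort: equivalent to order by
uniq: equivalent to distinct
grep: equivalent to like, not like
wc: equivalent to count()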

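Add paging tools such as more and head, and combine the output with reporting tools such as GNUPlot and R-Project, and you can produce good-looking reports. It hardly counts as data mining, but for collecting some simple actionable data it is very fast and effective.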
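The SQL equivalents listed above can be sketched roughly as follows. The tab-separated file access.tsv, its three columns (user, page, bytes), and its rows are hypothetical sample data, and the SQL in the comments is only an approximate reading of each pipeline.

```shell
# Hypothetical sample "table": user, page, bytes (tab-separated, no header row).
workdir=$(mktemp -d)
log="$workdir/access.tsv"
printf '%s\t%s\t%s\n' \
  alice /index.html 512 \
  bob /about.html 2048 \
  alice /index.html 512 \
  carol /index.html 4096 \
  bob /contact.html 128 > "$log"

# SELECT user, bytes FROM log WHERE bytes > 500
big=$(awk -F'\t' '$3 > 500 { print $1, $3 }' "$log")

# SELECT DISTINCT user FROM log ORDER BY user
users=$(awk -F'\t' '{ print $1 }' "$log" | sort | uniq)

# SELECT COUNT(*) FROM log WHERE page LIKE '%index%'
# (tr strips the padding some wc implementations add)
index_hits=$(grep 'index' "$log" | wc -l | tr -d ' ')

# Rows 2 to 3 of the file; in MySQL terms roughly LIMIT 1,2 (skip 1 row, return 2)
slice=$(sed -n '2,3p' "$log")

# SELECT page, COUNT(*) FROM log GROUP BY page ORDER BY COUNT(*) DESC
by_page=$(awk -F'\t' '{ print $2 }' "$log" | sort | uniq -c | sort -rn)

printf '%s\n' "$by_page" | head -n 3
rm -rf "$workdir"
```

One detail worth noting: uniq only collapses adjacent duplicate lines, so it behaves like distinct only when the input has been sorted first.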

14:22 2GB AIM Mail, So What? » Jan's Tech Blog

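AOL has been rolling out new services one after another lately. After the odd pairing of AIM with a browser, this time it is something more conventional: AIM Mail. ...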

13:20 sunset [Flickr] » 互联教育体系-博录(CES Blog)

Isaac Mao posted a photo:

del.icio.us/url/b745c8fd0a5bb2cc2145bdca27d63689

13:00 Links for 2005-06-07 [del.icio.us] » 互联教育体系-博录(CES Blog)

09:19 The Periodicity of Blog Streams » 桑林志

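As its own description says, BlogPulse is an automated system for discovering trends in the blogosphere. I play around with it from time to time. Today I ran some trend searches on a few keywords. I found that for keywords like “science” and “physics”, the results are clearly periodic. The period is one week: the number of posts is lowest on Sunday and highest midweek. Other keywords, such as “blog”, show no such periodicity. Isn't that interesting? Could it be that bloggers who often write about “science”, “physics”, and the like lead more regular lives, where a weekend is a weekend and they stay off the net? It reminds me that the anti-Japanese demonstrations two months ago also ran on a weekly cycle, though they all took place on weekends.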

07:00 2005/06/08 07:00:00 TraCQ洽谈通 Search Power Index Rankings » TraCQ洽谈通 Search Power Index

Search engine	Search power index	Rank change	Share
1. Baidu	62917730		42.98%
2. 3721	39306702		26.85%
3. Google	27858074		19.03%
4. 163	2930334		2.00%
5. Sohu	2840330		1.94%
6. Sina	2427758		1.66%
7. Yisou	2105082		1.44%
8. Yahoo	1740102		1.19%
9. Sogou	1533762		1.05%
10. QQ	1438470		0.98%
11. Tom	973018		0.66%
12. Zhongsou	169134		0.12%
13. China	153162		0.10%
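The share column appears to be each engine's index divided by the sum of all thirteen indexes. That is an inference from the numbers, not something the ranking states. A small awk sketch, using the index values copied from the table above, recomputes the shares under that assumption:

```shell
# Recompute each share as index / sum of all indexes (assumed formula).
# Input: engine name and index value, copied from the ranking table.
shares=$(awk '{ name[NR] = $1; idx[NR] = $2; total += $2 }
  END { for (i = 1; i <= NR; i++) printf "%s %.2f%%\n", name[i], 100 * idx[i] / total }' <<'EOF'
Baidu 62917730
3721 39306702
Google 27858074
163 2930334
Sohu 2840330
Sina 2427758
Yisou 2105082
Yahoo 1740102
Sogou 1533762
QQ 1438470
Tom 973018
Zhongsou 169134
China 153162
EOF
)

printf '%s\n' "$shares"
```

The recomputed percentages can then be compared line by line with the share column.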

05:34 Google Sitemap vs. Ping Servers » Jeremy Zawodny's blog

Sometimes I'm a little surprised by how long some ideas take to bubble up. Other times I'm surprised by the form they take. I'm doubly surprised this time. Google Sitemaps (BETA, of course) has me scratching my head a bit. Rather than build on existing work, it seems that Google wants people to build up and submit sitemaps to them so they can increase the freshness and coverage (or comprehensiveness) of their web search index. Of course, those are two...

05:17 Flickr Schwag 1.0, baby! » FlickrBlog

One of the most often requested Flickr features is schwag. We're pleased to announce the arrival of Flickr Schwag 1.0. These three buttons and two stickers can be yours by sending a Self Addressed Envelope* to: Flickr P.O.Box 3816 Sunnyvale,...