Source favicon23:31 [sandbox]Lucene中文分词的2个试验模块 » gRaSSland开发日记

目前有2个在Lucene中实现简体中文分词的尝试公开源代码出来:
小叮咚分词模块 下载 - 感谢田春峰
Xerdoc的分词模块 下载 - 感谢XerDoc团队
基于Apache License发布

Source favicon23:11 Picture Blog » Not isaacmao.com
Source favicon20:05 MSN Spaces Podcast » Jan's Tech Blog
Sorry,並不是MSN Spaces推出新服務,而是今天看了Eric Rice的Spaces,發現原來藉Feedburner,就可以在MSN Spaces的Blog功能上加入Podcast。基本上十分容易,只要在Blog Entry加入以下的Tag,再用Feedburner的Smartcast便行: [a href="mp3 url" rel=enclosure] 事實上,這不單只可以用在MSN Spaces,甚至可以用在任何一個Blog Service / Engine當中。有Feedburner,要Podcast原來真是很簡單。...
Source favicon19:09 Le Parkour » Easy Librarian
David Belle redefine the free running in the city with parkour, a game played to challenge the limitation of yourself. To amuse yourself with art of free moving, jumping and climbing, one harvest kind of freedom in his crowded space and routine life paradigm. Parkour was created as a way of being free in one's environment, a method of flowing movement over whatever obstacles one encounters. One may find the limitation is himself when he practice parkour more and more, finally find his own way in the street.
Source favicon17:32 Footie stars in Beijing: money for 'Real' » Danwei RSS 2.0
Chinese football (soccer) is quite uninspiring. The national Super League is marred by a rather low technical level, match fixing, and it is poorly managed by the bureaucrats of the Chinese Football Federation. Chinese fans are dismissive about the depressing state of their football league: '不想看!' (I don't want to watch it!), '实在太差了!' (it's really bad!) are the most common comments on the streets. For the national team the situation is even more depressing, considering that China did not manage to qualify for the 2006 World Cup in Germany. As a matter of fact, Chinese football fans don't really have much to look forward to. This notwithstanding, in China the passion for football is massive and keeps on growing. Fans wildly relish watching Italian, British and Spanish league games on TV, broadcast live on CCTV 5 (China Central TV sports channel) and on many other local TV stations across China. Unlike many publications in China, sports newspapers and specialised magazines sell extremely well. In terms of coverage, national and foreign football rule the scene with their in-depth features, fresh news, good quality photographs, and competent commentary.
Source favicon15:25 Beijing Media Top Stories: high temperature vacation, typhoon and fossil... » Danwei RSS 2.0
1. Almost one thousand companies in Beijing allow their staff for vacation during the high temperature days 2. Typhoon Haitang churning toward the Chinese mainland after battering Taiwan 3. The fossil of the teeth of carnivorous animals found in Shi Du, Beijing 4. A Chinese father and son get robbed in South Africa, and the son was shot to death 5. The Palace Museum starts using its new Logo The pictured front page is from Beijing Morning Post. It features a photo of an upcoming rainstorm which caused by Typhoon Haitang in Hangzhou.
Source favicon15:16 奇怪的Bloglines故障 » 未完成 - Incomplete
下午像往常一样往bloglines里添加一个blog的Feed,准备随后再同步到我的GreatNews中。到其他页面浏览一番再回到bloglines的这个页面,发现大家很熟悉的页面无法显示的提示。由于最近连接到bloglines经常很慢,所以我也没在意,就想再次打开bloglines看看刚才的feed到底加进去了没有,一看吓一跳,我在bloglines中的400多个feed,居然全都不见了,难道我那么好运气,和keso遭遇了同样的数据库故障? 不过幸好我基本上都是使用GreatNews来阅读RSS,因此绝大部分的Feed都已经有了备份,只有最近两天新加而还没同步的几个Feed丢失了。用GreatNews的同步功能来进入bloglines,同样也是没有任何的Feed,看来Feed真的不见了。尝试再次添加其他Feed,发现我在bloglines设置的Feed目录还在,说明数据并没有完全丢失?但只要提交增加Feed,就会出现页面无法访问的情况。本来已经不抱什么希望了,不过在更新GreatNews中的bloglines同步频道的时候,居然发现可以更新到新的内容,那说明我的feed并没有真正丢失,但在我的用户界面却完全看不到这些Feed的踪影。 看来Bloglines的稳定性的确越来越值得怀疑了,我只能庆幸自己还拥有一个桌面的备份,不至于欲哭无泪。决定继续观察一天,看看我的Feed是否又会神奇地恢复。 Update:昨天的确是Bloglines的故障,不久后就恢复了,不过我好像倒是没有遇到其他人的全部标成已读的问题。只是在GreatNews中新增同步频道出了问题,提示非法字符,又要花时间排除故障了。
Source favicon13:03 网志写作对于个人职业发展的好处 Kingsley.Tagbo : Reasons Why Blogging May Be Good For Your Career » del.icio.us/chedong
个人市场营销,磨砺技能,表现的你非功利性的一面,提高你的生产力,从其他人那里获得反馈,零发布成本,世界范围内的读者/市场(须英文),展现个人专长
Source favicon13:01 隐私问题 » Blog on 27th Floor
自从Google变得为人所知之后,就经常有人要质疑它一下子,说它会带来隐私问题。我们当然没有任何理由无条件地相信一家公司,所以必须自己注意保护自己的隐私,对谁都不例外。

以前看电视,上面说一个问题,就是有人生孩子之后,家里就天天接到电话,全是推销婴儿用品的,不胜其烦。想来想去,生孩子这件事加上家庭电话除了自己人和派出所知道,也就是生孩子的那家医院了。很可能地,就是这家医院把这些信息卖给了婴儿用品公司。

在Google注册账号,使用Gmail,用Google搜索,都一样存在这个危险,和在任何网站上填写注册信息的风险是一样的。但一般来讲,收集信息的公司都会在用户条款里说明不会泄露用户个人信息。不清楚是否有这方面的法律,但最保险的作法就是,除非必要,一概不写真实信息,尤其是姓名,住址和家庭电话。

但实际上除了上述婴儿用品公司这样做直销的,一个个人的信息是没有意义的,有意义的是统计信息。也就是从成千上万的用户群中统计出来一些信息是有实际的价值的。在信用卡应用广泛的地方,信用卡公司实际上了解你的所有行为,什么时候在什么地方吃饭,在什么住旅店,在什么地方买面包,它一清二楚。但它一般不会出售某人喜欢在某让买某品牌面包的信息,它只会把它上亿记的用户统计一下,说原味面包的销量正在下降,其他品味卖得正好,这样的信息对食品公司的价值不言而喻。据说,美国的那几家信用卡公司就在做这样的生意。

Googl也一样可以做这样的生意,它非常清楚网民关注什么东西,它放出来的信息比如美国人最爱搜索的词是britney spears,这个名字已经出现了490种错误拼写。这是大众化的信息,唱片公司也用不着去买。但某个旅游目的地,某个品牌的笔记本电脑这样的搜索统计信息,甚至和对手的比较数据,是不是就极具价值呢?不过似乎没听说Google做这个,倒是有许多所谓市场研究公司在做。

Google采取的作法是在信件中或搜索中发现某些关键词的时候,就在屏幕右边写上几条文字加链接的广告,希望这些广告对用户有用--说实话,有时确实有用--这比打电话到你家里要好多了。而对Google来说,由机器扫描内容并显示文字广告,比之于卖这些信息给其他公司,要有利可图得多,更不用说需要这些信息的公司其实正是Google的竞争对手或是目标客户了。

危险确实存在,不管它出现的机会有多小。但也不必过于担心,因为与直接卖掉用户个人信息这个手段相比,这些公司还有更好的利用这些信息的赢利模式。当然也有公司只收集信息,却没有相应的赢利模式,那才是我们真正要小心的。
Source favicon12:31 PSPad支持Unicode和换行(WordWrap )了! » Andy's blog

下载这个文件:http://pspad.cincura.net/beta/pspad450b2109.cab (1.1 MB) ,然后解压缩并替换掉PSPad安装目录下对应的PSPad.EXE以及语言文件。

消息来自:http://forum.pspad.com/read.php?f=6&i=2092&t=2092

我前一篇PSPad的介绍文章:我最喜欢的免费文本编辑工具-PSPad

Source favicon11:03 Attempting to Interview Ask Jeeves » Jeremy Zawodny's blog
This is the funniest thing I've seen today, aside from CNet blogging about parking at Yahoo. The folks at SatireWire decided to interview Ask Jeeves and ended up with amusing results. It seems that we've haven't come that far since Eliza, have we? ;-)...
Source favicon09:09 TypePad Booster Package for Power Blogging » ProNet
LivingDot, popular hosting service and one of our Movable Type Hosting Partners, has just announced the LivingDot TypePad Booster Package. This collection of services lets you add a domain name, email accounts, and additional disk storage to your TypePad account,...
Source favicon09:06 Joe Gregorio on Secure Syndication » ProNet
Joe Gregorio's posted a new article on XML.com, called Secure RSS Syndication, and the story covers just what the title suggests. Using a regular XML feed, some Greasemonkey magic, and a private key, Joe's able to syndicate data without his...
Source favicon08:46 Macromedia Blog Authoring Survey » ProNet
Deeje Cooley's posted a link to a Macromedia Blog Authoring Survey. If you maintain a blog regularly, they're looking for your feedback, and survey participants who finish all 30 questions are eligible to win an iPod mini....
Source favicon08:07 An Email Blacklist of Technology PR Agencies? » Jeremy Zawodny's blog
Does anyone know of a published list of Public Relations companies--or at least those involved in Technology PR? I get so damned much spam (I mean "pitches") that I'm starting to think that life would be better if I just blocked email from all the big names in Tech PR. Have you seen such a beast? I'd be glad to host it if others are willing to contribute. It could even form the basis of a nice set of add-on...
Source favicon02:38 Finding great ramen nearby » Google Blog




You may have seen our Local and Maps products that help you find businesses and maps throughout the US, Canada and the UK. We've just added Local and Maps services in Japan.



You think Tokyo is expensive? Maybe, but you can still find a number of venues serving a 1000-yen combo lunch near Shinjuku.



Of course, Local and Maps Japan are designed to work with Japanese language, but even if you don't know Japanese, I'm sure you can appreciate why we developed it with some particulars in mind for Japanese users. Many Japanese live and work around train stations, for example, and refer to neighborhoods defined by their proximity to them. So we made sure they can search for businesses easily and refer to location by station names. After all, no one wants to walk 10 miles from a station, just to grab a $10 lunch!
Source favicon00:59 Tagging Feedback at MSN Search » MSN Search's WebLog

We find customer feedback quite delicious at MSN Search.  To help the team digest the massive flow of feedback we receive, we designed a tagging and viewing system inspired by the faceted browsing system, Flamenco, developed at UC Berkeley. We’re very happy to have addressed our top feature request in the latest release – yellow page results, called out in the image below as “ypResults” with the introduction of local search.

 

Tagging is particularly appropriate for feedback as users rarely talk just about one issue or neatly constrain their comments to a single feature team. In the picture above, we’ve masked the actual issue names with fruits, vegetables, and trees – we’re not trying to air our dirty laundry! It shows just one view of the feedback, which segments by the type of issue.  User raised issues are dealt with in every stage of development, while feature requests feed into planning, and usability sits somewhere in between.  Another essential set of views slices this data by the feature. This allows our image team to see requests for larger images without hearing about Search Builder discovery issues.

 

Use the “help us improve” links on every page. We’re listening (and tagging).

 

Andy Edmonds, Relevance Measurement PM

 

 


^==Back Home: www.chedong.com

<== 2005-07-18

==> 2005-07-20