A write up on what LinkedIn is doing to integrate large offline Hadoop data processing jobs with a fast, distributed online key-value storage system, Project Voldemort.
It is a known fact that ext3 is not the most efficient file system out there and for example file removals can be painfully slow and cause a lot of random I/O. However, as it turns out, it can sometimes have a much more severe impact on the MySQL performance that it would seem. When or why?
When you run DROP TABLE
there are several things that need to happen – write lock on a table so it cannot be used by any other thread, the data file(s) removal by the storage engine and of course in the end MySQL has to destroy the definition file (.frm). That's not all that happens, there is one other thing:
The entire code which removes a table is surrounded by LOCK_open
mutex. The mutex is used in various places inside MySQL, but primarily when tables are being opened or closed. It means that when LOCK_open
is locked, no query can be executed because they are stopped from accessing any table.
Here is when the slow file removal on the ext3 file system starts to be a pain. Deleting a 10GB file can take a few seconds and if that is a MySQL table, the mutex remains locked for all that time stalling all queries:
I tried some alternative approaches to trick MySQL into removing smaller files with DROP TABLE
to minimize the effect, such as:
Unfortunately as it turned out each of the administrative commands like ALTER TABLE
or OPTIMIZE TABLE
one way or another uses LOCK_open
mutex when the old table files are deleted:
The only alternative seems to be changing the file system. To XFS for example, which handles file removals much more efficiently:
EXT3
XFS
A better solution through MySQL internals could be to simulate the table drop by renaming the corresponding data file or files and physically removing them outside of the mutex lock. However it may not be that simple, because the actual removal is performed by the storage engine, so it's not something MySQL code can control.
This is certainly not a common situation, but may become a problem to anyone when it's the least expected (e.g. dropping old unused tables).
Entry posted by Maciej Dobrzanski | One comment
工作职责:
1 根据产品需求制定交互方案,输出产品保真原型和设计文档;
2 对现有产品和功能的优化,提升易用性;
3 设计规范的维护和建设。
职位要求:
1 至少独立完成过一个完整项目的交互流程设计;
2 对新鲜事物充满好奇心,资深网虫,喜欢体验各种网站和软件;
3 具备一定的视觉设计和html相关的知识和技能;
4 工业设计、计算机、视觉传达、心理学、信息学相关背景,一年工作经验以上;
5 工作认真细致,良好的沟通能力。
工作地点位于北京。有兴趣或者问题请email至 team(a)douban.com。请注明“UE”。
六月 2009 | ||||||
一 | 二 | 三 | 四 | 五 | 六 | 日 |
1 | 2 | 3 | 4 | 5 | 6 | 7 |
8 | 9 | 10 | 11 | 12 | 13 | 14 |
15 | 16 | 17 | 18 | 19 | 20 | 21 |
22 | 23 | 24 | 25 | 26 | 27 | 28 |
29 | 30 |