#ceph IRC Log

IRC Log for 2011-12-03

Timestamps are in GMT/BST.

[0:26] * joshd (~joshd@aon.hq.newdream.net) Quit (Remote host closed the connection)
[0:28] * joshd (~joshd@aon.hq.newdream.net) has joined #ceph
[0:45] <Tv> mwahahah
[0:45] <Tv> GPTHeader(signature='EFI PART', revision='\x00\x00\x01\x00', header_size=92, crc32='LBr\xd2', current_lba=1, backup_lba=20000001, first_usable_lba=34, last_usable_lba=19999968, disk_guid='\x11\xaau\x9fv*bA\xa1\x06U\xe3v\x08\xde\x06', part_entry_start_lba=2, num_part_entries=128, part_entry_size=128, crc32_part_array='\x7f\xa7\xff\xb1\x00\x00\x00\x00')
[0:45] <Tv> GPTPartition(type='\xa2\xa0\xd0\xeb\xe5\xb93D\x87\xc0h\xb6\xb7&\x99\xc7', unique='\xfb\xf9%\xb5I\xceKD\x93\x93s\xc4\x93v\xa7\xbd', first_lba=2048, last_lba=19998719, flags=0, name='\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x0
[0:45] <Tv> 0\x00\x00\x00')
[0:45] <Tv> need to pretty-print guids and add a couple of safety checks, but then it's done
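The GUID fields in the two dumps above are raw 16-byte strings. The following is a minimal sketch of the pretty-printing step Tv mentions, assuming Python 3 and the standard `uuid` module. It is not Tv's code, and `format_gpt_guid` is a hypothetical helper name. GPT stores the first three GUID fields little-endian on disk, which `uuid.UUID(bytes_le=...)` accounts for.

```python
import uuid


def format_gpt_guid(raw: bytes) -> str:
    """Render a 16-byte on-disk GPT GUID in the usual 8-4-4-4-12 text form."""
    if len(raw) != 16:
        raise ValueError("GPT GUIDs are exactly 16 bytes, got %d" % len(raw))
    # bytes_le byte-swaps the first three fields, matching the GPT on-disk layout.
    return str(uuid.UUID(bytes_le=raw)).upper()


# Partition type bytes copied from the GPTPartition line above.
part_type = b'\xa2\xa0\xd0\xeb\xe5\xb93D\x87\xc0h\xb6\xb7&\x99\xc7'
print(format_gpt_guid(part_type))  # EBD0A0A2-B9E5-4433-87C0-68B6B72699C7
```

The printed value is the well-known "basic data partition" type GUID. The length check is one example of the kind of safety check a parser like this might want.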
[0:52] * The_Bishop (~bishop@port-92-206-183-175.dynamic.qsc.de) Quit (Ping timeout: 480 seconds)
[1:04] * Tv (~Tv|work@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[1:12] <nwatkins> I'm getting the following speeds from the ceph bench on the troubleshooting page. 2011-12-02 16:09:01.682124 log 2011-12-02 16:09:00.661823 osd.0 192.168.141.124:6800/2419 193 : [INF] bench: wrote 1024 MB in blocks of 4096 KB in 79.555075 sec at 13180 KB/sec
[1:13] <nwatkins> This speed is going through the user-space client, but I can achieve full line rate (~100MB/s) using the kernel client
[1:14] <gregaf> nwatkins: the bench command tells each OSD to benchmark the disk it uses for storing data
[1:14] <gregaf> it's not related to the clients at all
[1:14] <gregaf> but you've got a hell of a slow disk on osd 0 it looks like
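As a quick check on the numbers in the bench line nwatkins quoted, the reported rate is simply the total written divided by the elapsed time. This is a small arithmetic sketch using only the figures from that log line; the variable names are illustrative.

```python
# Figures copied from the osd.0 bench line quoted above.
total_mb = 1024
elapsed_sec = 79.555075

rate_kb_per_sec = total_mb * 1024 / elapsed_sec
rate_mb_per_sec = rate_kb_per_sec / 1024

print("%d KB/sec" % rate_kb_per_sec)    # integer part, matches the 13180 in the log
print("%.1f MB/sec" % rate_mb_per_sec)  # roughly 12.9 MB/sec
```

That is roughly an eighth of the ~100 MB/s nwatkins reports elsewhere in the conversation.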
[1:15] <nwatkins> hmm
[1:17] <nwatkins> gregaf: on osd.0 i'm seeing 100 MB/s writes to the ceph disk with dd
[1:18] <gregaf> hrm, were you doing other things with it at the time you ran bench?
[1:18] <nwatkins> no
[1:18] <gregaf> huh
[1:18] <gregaf> let me check the code again, but as I recall it just tells the OSDs to write 1GB of data to disk and report back how long it took...sagekw?
[1:18] <gregaf> sagewk?
[1:19] <sagewk> yeah
[1:20] <sagewk> the bench is doing 4k ios.. probably change that to something larger and you'll see the 100mb/sec
[1:20] <sagewk> -b <bytes> to bench command, iirc
[1:21] <gregaf> looks like it's just by order, so bench <bytes_per> <total_bytes>
[1:23] * adjohn (~adjohn@70-36-139-247.dsl.dynamic.sonic.net) Quit (Quit: adjohn)
[1:24] <gregaf> and then it loops through dispatching transactions that the OSD processes normally, then does a sync_and_flush() at the end
[1:24] <gregaf> it should be pretty close to what you pull off the disk normally
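The shape gregaf describes (write a fixed total in fixed-size blocks, sync once at the end, report the elapsed time) can be sketched as below. This is only an illustration under stated assumptions: it writes to a plain temporary file rather than dispatching OSD transactions, `bench_write` is a hypothetical name, and it is not the Ceph implementation. The two parameters mirror the `<bytes_per> <total_bytes>` order mentioned above.

```python
import os
import tempfile
import time


def bench_write(path, bytes_per, total_bytes):
    """Write total_bytes to path in bytes_per chunks, sync once, return (bytes, seconds)."""
    block = b"\0" * bytes_per
    written = 0
    start = time.monotonic()
    with open(path, "wb") as f:
        while written < total_bytes:
            # The last chunk may be shorter than bytes_per.
            chunk = block[: total_bytes - written]
            f.write(chunk)
            written += len(chunk)
        # Single flush and fsync at the end, so the timing includes the disk write.
        f.flush()
        os.fsync(f.fileno())
    return written, time.monotonic() - start


with tempfile.TemporaryDirectory() as tmp:
    target = os.path.join(tmp, "bench.dat")
    # Small sizes for illustration: 4 KB blocks, 1 MB total.
    nbytes, secs = bench_write(target, 4096, 1024 * 1024)
    rate_kb = nbytes / 1024 / max(secs, 1e-9)
    print("wrote %d bytes in %.6f sec at %d KB/sec" % (nbytes, secs, rate_kb))
```

Running it with a small and then a large `bytes_per` against the same disk is one way to see how much the block size alone changes the reported rate.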
[1:26] <nwatkins> gregaf: kinda weird behavior... simple hadoop job writing a couple files, after several minutes only a few KB have made it out to the file system.
[1:27] * verwilst (~verwilst@dD576F10D.access.telenet.be) Quit (Ping timeout: 480 seconds)
[1:31] <nwatkins> gregaf: here's the client log. basically some files are being opened, but the client just gets stuck indefinitely. http://pastebin.com/jnh7xrRc
[1:33] <gregaf> nwatkins: can you give me a little more context?
[1:34] <nwatkins> gregaf: sure
[1:34] <gregaf> the job is writing files but when you look at ceph -s it's only got a few KB of stuff added?
[1:36] <nwatkins> the client trace i just posted is from a job that basically creates a directory and writes a few KB into a couple files. i stopped the client after several minutes, and the directory had not even been created, but the behavior isn't consistent. earlier the same setup began writing its data files, but an ls revealed only a few KB had been written.
[1:36] <gregaf> in that log it does look like there are only 4 writes of a few hundred k going out to the OSDs
[1:37] <nwatkins> it seems like something is hanging
[1:39] <nwatkins> gregaf: that's about all i know. i have to run, but i'll try to narrow this down later
[1:39] <gregaf> okay
[1:39] <gregaf> I'll see if there's something I can get out of this log
[1:39] <gregaf> unfortunately I might be less accessible than usual later — my power's out at home :(
[1:39] <nwatkins> gregaf: btw, i may just try to revert back to ceph version from a month ago when things were working fine. is that localized reads patch expected to be easily cherrypicked that far back?
[1:40] <gregaf> nwatkins: yeah, the localized reads stuff is just a few lines in the Objecter
[1:40] <gregaf> as long as it's not what's causing the problems :/
[1:40] <nwatkins> hmm
[1:40] <nwatkins> i'll test that first
[1:57] <gregaf> nwatkins: hmm, I'm not getting much out of this log — it sends out 4 OSD requests, which are replied to; it sends out a bunch of MDS requests which are all replied to; it's not waiting for anything that I can find...
[2:00] * fronlius (~fronlius@e176052065.adsl.alicedsl.de) Quit (Quit: fronlius)
[2:10] * gregaf (~Adium@aon.hq.newdream.net) has left #ceph
[2:12] * gregaf (~Adium@aon.hq.newdream.net) has joined #ceph
[2:13] * The_Bishop (~bishop@port-92-206-183-175.dynamic.qsc.de) has joined #ceph
[2:35] * bchrisman (~Adium@108.60.121.114) Quit (Quit: Leaving.)
[2:46] * MarkDude (~MT@c-71-198-138-155.hsd1.ca.comcast.net) has joined #ceph
[3:36] * MK_FG (~MK_FG@188.226.51.71) has joined #ceph
[3:53] * joshd (~joshd@aon.hq.newdream.net) Quit (Quit: Leaving.)
[4:28] * aa (~aa@r186-51-251-1.dialup.mobile.ancel.net.uy) has joined #ceph
[5:26] <darkfader> Tv: wanna dig through that in a query tomorrow?
[6:15] * darkfaded (~floh@188.40.175.2) has joined #ceph
[6:17] * darkfader (~floh@188.40.175.2) Quit (Read error: Operation timed out)
[6:23] * MarkDude (~MT@c-71-198-138-155.hsd1.ca.comcast.net) Quit (synthon.oftc.net oxygen.oftc.net)
[6:23] * grape (~grape@216.24.166.226) Quit (synthon.oftc.net oxygen.oftc.net)
[6:23] * lxo (~aoliva@lxo.user.oftc.net) Quit (synthon.oftc.net oxygen.oftc.net)
[6:23] * kirkland (~kirkland@74.126.19.140.static.a2webhosting.com) Quit (synthon.oftc.net oxygen.oftc.net)
[6:23] * psomas (~psomas@inferno.cc.ece.ntua.gr) Quit (synthon.oftc.net oxygen.oftc.net)
[6:27] * MarkDude (~MT@c-71-198-138-155.hsd1.ca.comcast.net) has joined #ceph
[6:27] * grape (~grape@216.24.166.226) has joined #ceph
[6:27] * lxo (~aoliva@lxo.user.oftc.net) has joined #ceph
[6:27] * kirkland (~kirkland@74.126.19.140.static.a2webhosting.com) has joined #ceph
[6:27] * psomas (~psomas@inferno.cc.ece.ntua.gr) has joined #ceph
[6:28] * MarkDude (~MT@c-71-198-138-155.hsd1.ca.comcast.net) Quit (Read error: Connection reset by peer)
[6:31] * MarkDude (~MT@c-71-198-138-155.hsd1.ca.comcast.net) has joined #ceph
[6:31] * izdubar (~MT@c-71-198-138-155.hsd1.ca.comcast.net) has joined #ceph
[6:33] * izdubar (~MT@c-71-198-138-155.hsd1.ca.comcast.net) Quit ()
[6:39] * bchrisman (~Adium@c-76-103-130-94.hsd1.ca.comcast.net) has joined #ceph
[9:28] -solenoid.oftc.net- *** Looking up your hostname...
[9:28] -solenoid.oftc.net- *** Checking Ident
[9:28] -solenoid.oftc.net- *** No Ident response
[9:28] -solenoid.oftc.net- *** Found your hostname
[9:28] * CephLogBot (~PircBot@rockbox.widodh.nl) has joined #ceph
[9:28] * wido (~wido@rockbox.widodh.nl) has joined #ceph
[9:59] * NightDog (~karl@52.84-48-58.nextgentel.com) has joined #ceph
[11:17] * fronlius (~fronlius@e176052065.adsl.alicedsl.de) has joined #ceph
[11:36] * fronlius (~fronlius@e176052065.adsl.alicedsl.de) Quit (Quit: fronlius)
[11:44] * donut (~construct@exit-01c.noisetor.net) has joined #ceph
[11:45] * fronlius (~fronlius@e176052065.adsl.alicedsl.de) has joined #ceph
[11:47] * fronlius (~fronlius@e176052065.adsl.alicedsl.de) Quit ()
[11:54] * donut (~construct@83TAABTZA.tor-irc.dnsbl.oftc.net) Quit (Quit: Leaving)
[12:25] * morse (~morse@supercomputing.univpm.it) Quit (Remote host closed the connection)
[13:00] * morse (~morse@supercomputing.univpm.it) has joined #ceph
[14:43] * fronlius (~fronlius@g231136124.adsl.alicedsl.de) has joined #ceph
[15:22] * fronlius (~fronlius@g231136124.adsl.alicedsl.de) Quit (Quit: fronlius)
[15:39] * NightDog (~karl@52.84-48-58.nextgentel.com) Quit (Quit: Leaving)
[15:42] * NightDog (~karl@52.84-48-58.nextgentel.com) has joined #ceph
[15:53] * fronlius (~fronlius@g231136124.adsl.alicedsl.de) has joined #ceph
[16:11] * aa (~aa@r186-51-251-1.dialup.mobile.ancel.net.uy) Quit (Ping timeout: 480 seconds)
[16:15] * bchrisman (~Adium@c-76-103-130-94.hsd1.ca.comcast.net) has joined #ceph
[16:21] * aa (~aa@r186-51-129-4.static.adinet.com.uy) has joined #ceph
[17:04] * andresambrois (~aa@r190-64-71-154.dialup.adsl.anteldata.net.uy) has joined #ceph
[17:05] * aa (~aa@r186-51-129-4.static.adinet.com.uy) Quit (Ping timeout: 480 seconds)
[17:19] * fronlius (~fronlius@g231136124.adsl.alicedsl.de) Quit (Quit: fronlius)
[17:22] * andresambrois (~aa@r190-64-71-154.dialup.adsl.anteldata.net.uy) Quit (Ping timeout: 480 seconds)
[18:07] * aa (~aa@r186-48-202-160.dialup.adsl.anteldata.net.uy) has joined #ceph
[18:27] * The_Bishop (~bishop@port-92-206-183-175.dynamic.qsc.de) Quit (Quit: Who the hell is this Peer? If I catch him, I'll reset his connection!)
[18:48] * NightDog (~karl@52.84-48-58.nextgentel.com) Quit (Remote host closed the connection)
[19:09] * MarkDude (~MT@c-71-198-138-155.hsd1.ca.comcast.net) Quit (Quit: Leaving)
[19:47] * MarkDude (~MT@c-71-198-138-155.hsd1.ca.comcast.net) has joined #ceph
[20:00] * adjohn (~adjohn@70-36-139-247.dsl.dynamic.sonic.net) has joined #ceph
[20:01] * adjohn (~adjohn@70-36-139-247.dsl.dynamic.sonic.net) Quit ()
[20:38] * andresambrois (~aa@r186-48-210-168.dialup.adsl.anteldata.net.uy) has joined #ceph
[20:45] * aa (~aa@r186-48-202-160.dialup.adsl.anteldata.net.uy) Quit (Ping timeout: 480 seconds)
[22:28] * NightDog (~karl@52.84-48-58.nextgentel.com) has joined #ceph
[23:37] * colonD (~colonD@173-165-224-105-minnesota.hfc.comcastbusiness.net) has joined #ceph
[23:42] * colon_D (~colon_D@173-165-224-105-minnesota.hfc.comcastbusiness.net) Quit (Ping timeout: 480 seconds)
[23:44] * acaos_ (~zac@209-99-103-42.fwd.datafoundry.com) has joined #ceph
[23:44] * colonD (~colonD@173-165-224-105-minnesota.hfc.comcastbusiness.net) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * MK_FG (~MK_FG@188.226.51.71) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * Meths (rift@2.27.73.221) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * acaos (~zac@209-99-103-42.fwd.datafoundry.com) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * jjchen (~jjchen@lo4.cfw-a-gci.greatamerica.corp.yahoo.com) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * mfoemmel (~mfoemmel@chml01.drwholdings.com) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * sagewk (~sage@aon.hq.newdream.net) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * jpieper (~josh@209-6-86-62.c3-0.smr-ubr2.sbo-smr.ma.cable.rcn.com) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * yehudasa_ (~yehudasa@aon.hq.newdream.net) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * ajm (adam@adam.gs) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * eternaleye_ (~eternaley@195.215.30.181) Quit (resistance.oftc.net graviton.oftc.net)
[23:44] * svenx_ (92744@diamant.ifi.uio.no) Quit (resistance.oftc.net graviton.oftc.net)
[23:45] * colonD (~colonD@173-165-224-105-minnesota.hfc.comcastbusiness.net) has joined #ceph
[23:45] * MK_FG (~MK_FG@188.226.51.71) has joined #ceph
[23:45] * Meths (rift@2.27.73.221) has joined #ceph
[23:45] * eternaleye_ (~eternaley@195.215.30.181) has joined #ceph
[23:45] * svenx_ (92744@diamant.ifi.uio.no) has joined #ceph
[23:45] * ajm (adam@adam.gs) has joined #ceph
[23:45] * yehudasa_ (~yehudasa@aon.hq.newdream.net) has joined #ceph
[23:45] * jpieper (~josh@209-6-86-62.c3-0.smr-ubr2.sbo-smr.ma.cable.rcn.com) has joined #ceph
[23:45] * sagewk (~sage@aon.hq.newdream.net) has joined #ceph
[23:45] * mfoemmel (~mfoemmel@chml01.drwholdings.com) has joined #ceph
[23:45] * jjchen (~jjchen@lo4.cfw-a-gci.greatamerica.corp.yahoo.com) has joined #ceph
[23:45] * acaos (~zac@209-99-103-42.fwd.datafoundry.com) has joined #ceph
[23:46] * acaos (~zac@209-99-103-42.fwd.datafoundry.com) Quit (Ping timeout: 480 seconds)
[23:49] * yehudasa_ (~yehudasa@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[23:49] * yehudasa_ (~yehudasa@aon.hq.newdream.net) has joined #ceph
[23:55] * NightDog (~karl@52.84-48-58.nextgentel.com) Quit (Read error: Connection reset by peer)

These logs were automatically created by CephLogBot on irc.oftc.net using the Java IRC LogBot.