#ceph IRC Log

Index

IRC Log for 2013-03-02

Timestamps are in GMT/BST.

[0:00] <rturk> nobody in the participants list with @gnu.org
[0:00] <rturk> good to know that our polling tool is picky about js :-/
[0:02] <lxo> I don't recall what kind of page I got in response to submitting the form... it surely didn't seem like an error :-(
[0:06] * miroslav (~miroslav@c-67-169-138-140.hsd1.ca.comcast.net) has joined #ceph
[0:09] * mcclurmc_laptop (~mcclurmc@cpc10-cmbg15-2-0-cust205.5-4.cable.virginmedia.com) has joined #ceph
[0:13] * miroslav (~miroslav@c-67-169-138-140.hsd1.ca.comcast.net) Quit (Quit: Leaving.)
[0:20] * xmltok_ (~xmltok@pool101.bizrate.com) has joined #ceph
[0:20] * xmltok_ (~xmltok@pool101.bizrate.com) Quit ()
[0:22] * jjgalvez1 (~jjgalvez@12.248.40.138) has joined #ceph
[0:25] * drokita1 (~drokita@199.255.228.128) has joined #ceph
[0:26] * xmltok (~xmltok@pool101.bizrate.com) Quit (Ping timeout: 480 seconds)
[0:27] * jjgalvez (~jjgalvez@12.248.40.138) Quit (Ping timeout: 480 seconds)
[0:29] * drokita (~drokita@199.255.228.128) Quit (Ping timeout: 480 seconds)
[0:31] * jjgalvez1 (~jjgalvez@12.248.40.138) Quit (Quit: Leaving.)
[0:33] * drokita1 (~drokita@199.255.228.128) Quit (Ping timeout: 480 seconds)
[0:49] * terje (~joey@63-154-140-190.mpls.qwest.net) has joined #ceph
[0:49] * noob21 (~cjh@173.252.71.7) Quit (Quit: Leaving.)
[0:51] * BillK (~BillK@124-169-167-246.dyn.iinet.net.au) Quit (Ping timeout: 480 seconds)
[0:55] * noob21 (~cjh@173.252.71.7) has joined #ceph
[0:57] * terje (~joey@63-154-140-190.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[0:59] * aliguori (~anthony@cpe-70-112-157-87.austin.res.rr.com) Quit (Remote host closed the connection)
[1:01] * BillK (~BillK@58-7-153-234.dyn.iinet.net.au) has joined #ceph
[1:07] * ScOut3R (~ScOut3R@catv-89-133-43-117.catv.broadband.hu) Quit (Remote host closed the connection)
[1:12] * Cube1 (~Cube@12.248.40.138) Quit (Ping timeout: 480 seconds)
[1:13] * noob21 (~cjh@173.252.71.7) Quit (Quit: Leaving.)
[1:15] * xmltok (~xmltok@205.209.7.111) has joined #ceph
[1:16] * xmltok (~xmltok@205.209.7.111) Quit (Remote host closed the connection)
[1:16] * xmltok (~xmltok@pool101.bizrate.com) has joined #ceph
[1:17] * The_Bishop (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) has joined #ceph
[1:24] <ShaunR> Is the default crush map not smart enough out of the box to realize which OSD's are on the same server? It looks to me like it is but i keep getting (and seeing) people always say you need to modify your crush map for ceph to perform best.... I get that i should when it comes down to a production cluster in difffernt racks, rows, rooms, etc but out of the box it seams like the crush map should
[1:24] <ShaunR> be smarty enough to realize which OSD's are on the same host.
[1:29] * lx0 (~aoliva@lxo.user.oftc.net) has joined #ceph
[1:34] * xmltok (~xmltok@pool101.bizrate.com) Quit (Quit: Bye!)
[1:36] * lxo (~aoliva@lxo.user.oftc.net) Quit (Ping timeout: 480 seconds)
[1:36] <gregaf> it depends on which version of the code and what setup mechanism you used whether 1) the map includes the necessary data, and 2) whether the CRUSH rules choose to take it into account
[1:38] * tryggvil (~tryggvil@17-80-126-149.ftth.simafelagid.is) Quit (Quit: tryggvil)
[1:39] * tryggvil (~tryggvil@17-80-126-149.ftth.simafelagid.is) has joined #ceph
[1:40] * tryggvil (~tryggvil@17-80-126-149.ftth.simafelagid.is) Quit ()
[1:42] * mikedawson (~chatzilla@c-98-220-189-67.hsd1.in.comcast.net) has joined #ceph
[1:44] * terje_ (~joey@63-154-140-190.mpls.qwest.net) has joined #ceph
[1:46] * miroslav (~miroslav@c-67-169-138-140.hsd1.ca.comcast.net) has joined #ceph
[1:49] * tziOm (~bjornar@ti0099a340-dhcp0628.bb.online.no) Quit (Remote host closed the connection)
[1:50] * terje (~joey@63-154-140-190.mpls.qwest.net) has joined #ceph
[1:50] * wer_ (~wer@wer.youfarted.net) has joined #ceph
[1:51] * jlogan1 (~Thunderbi@2600:c00:3010:1:217f:2c08:a1d4:e762) Quit (Ping timeout: 480 seconds)
[1:51] <MrNPP> so i have a crash and i'm stuck
[1:51] <MrNPP> http://paste.scurvynet.com/?bb764dcfa66528cb#0H2rBokp9o15p91XuoG9CfeFHztpIPrn78dP/HqcfGo=
[1:51] <MrNPP> i can create the image perfectly, but info doesn't work
[1:51] <MrNPP> segment fault
[1:51] <MrNPP> tried recompiling everything
[1:51] <MrNPP> running the latest release
[1:52] * noob21 (~cjh@173.252.71.7) has joined #ceph
[1:53] * terje_ (~joey@63-154-140-190.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[1:55] * wer (~wer@211.sub-70-192-207.myvzw.com) Quit (Ping timeout: 480 seconds)
[1:58] * terje (~joey@63-154-140-190.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[2:00] * noob21 (~cjh@173.252.71.7) Quit (Quit: Leaving.)
[2:03] * xmltok (~xmltok@pool101.bizrate.com) has joined #ceph
[2:05] * BManojlovic (~steki@85.222.180.248) Quit (Ping timeout: 480 seconds)
[2:12] <joshd> MrNPP: not sure exactly why that's happening, but if you leave out -f rbd it works
[2:15] <MrNPP> really?
[2:15] <MrNPP> right you are
[2:15] <MrNPP> haha
[2:18] * noob21 (~cjh@173.252.71.7) has joined #ceph
[2:20] * noob21 (~cjh@173.252.71.7) Quit ()
[2:22] * yanzheng (~zhyan@jfdmzpr01-ext.jf.intel.com) Quit (Remote host closed the connection)
[2:24] * alram (~alram@38.122.20.226) Quit (Quit: leaving)
[2:25] <ShaunR> gregaf: I havnt touched the crush map, it's in whatever default form it would be in from the centos 6 56.3 rpms
[2:29] * miroslav (~miroslav@c-67-169-138-140.hsd1.ca.comcast.net) Quit (Quit: Leaving.)
[2:37] * diegows (~diegows@190.188.190.11) Quit (Ping timeout: 480 seconds)
[2:45] * tryggvil (~tryggvil@17-80-126-149.ftth.simafelagid.is) has joined #ceph
[2:54] * xmltok (~xmltok@pool101.bizrate.com) Quit (Quit: Leaving...)
[3:14] <janos> if i'm starting a new cluster from the ashes of the old, do i need to wipe the journals? (they are partitions on SSD's)
[3:15] <janos> i wipe the /var/lib/ceph/mon/* dirs on the mons
[3:15] * rturk is now known as rturk-away
[3:15] <janos> and use --mkfs with mkcephfs
[3:19] * themgt (~themgt@24-177-232-181.dhcp.gnvl.sc.charter.com) Quit (Quit: Pogoapp - http://www.pogoapp.com)
[3:30] * zK4k7g (~zK4k7g@digilicious.com) Quit (Quit: Leaving.)
[3:32] * sstan_ (~chatzilla@modemcable016.164-202-24.mc.videotron.ca) has joined #ceph
[3:37] * KevinPerks (~Adium@cpe-066-026-239-136.triad.res.rr.com) Quit (Quit: Leaving.)
[3:58] * chutzpah (~chutz@199.21.234.7) Quit (Quit: Leaving)
[4:04] * BillK (~BillK@58-7-153-234.dyn.iinet.net.au) Quit (Ping timeout: 480 seconds)
[4:08] * KevinPerks (~Adium@cpe-066-026-239-136.triad.res.rr.com) has joined #ceph
[4:13] * BillK (~BillK@124-148-75-89.dyn.iinet.net.au) has joined #ceph
[4:20] * KevinPerks (~Adium@cpe-066-026-239-136.triad.res.rr.com) Quit (Ping timeout: 480 seconds)
[4:28] <sstan_> where's everyone?
[4:31] * jjgalvez (~jjgalvez@cpe-76-175-30-67.socal.res.rr.com) has joined #ceph
[4:55] <dmick> I'm here. I'm always here. Whereever I am, that's here.
[4:57] * yanzheng (~zhyan@jfdmzpr03-ext.jf.intel.com) has joined #ceph
[5:00] * KevinPerks (~Adium@cpe-066-026-239-136.triad.res.rr.com) has joined #ceph
[5:06] <nhm> sstan_: I was going to say that would be hanging out on IRC on a friday night, but then, dmick responded, and, well, then I did.
[5:07] <nhm> but now I'm going to bed. Good night!
[5:08] * KevinPerks (~Adium@cpe-066-026-239-136.triad.res.rr.com) has left #ceph
[5:08] * wer_ is now known as wer
[5:08] <wer> nite
[5:30] * jjgalvez (~jjgalvez@cpe-76-175-30-67.socal.res.rr.com) Quit (Quit: Leaving.)
[5:30] * terje (~joey@63-154-134-82.mpls.qwest.net) has joined #ceph
[5:38] * terje (~joey@63-154-134-82.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[6:03] * noob21 (~cjh@pool-96-249-204-90.snfcca.dsl-w.verizon.net) has joined #ceph
[6:22] <ShaunR> hmm, adding a new/additional server was kind of painful.
[6:22] <ShaunR> how should one normally go about adding say 24 additional OSD's? Would you bring them all up at once or add one at a time?
[6:41] * terje (~joey@63-154-134-82.mpls.qwest.net) has joined #ceph
[6:49] * terje (~joey@63-154-134-82.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[7:04] * noob21 (~cjh@pool-96-249-204-90.snfcca.dsl-w.verizon.net) Quit (Quit: Leaving.)
[7:11] * terje (~joey@63-154-159-234.mpls.qwest.net) has joined #ceph
[7:19] * terje (~joey@63-154-159-234.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[7:40] <phantomcircuit> ShaunR, i would do it one at a time
[7:40] * The_Bishop (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) Quit (Ping timeout: 480 seconds)
[7:40] <phantomcircuit> iirc the rate limiting for backfill is per peer
[7:40] <phantomcircuit> (that might be wrong)
[7:41] <phantomcircuit> ShaunR, it will increase the total amount of work but should decrease the overhead at any given moment
[7:41] * The_Bishop (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) has joined #ceph
[7:42] * The_Bishop (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) Quit ()
[7:44] * The_Bishop (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) has joined #ceph
[7:48] * yanzheng (~zhyan@jfdmzpr03-ext.jf.intel.com) Quit (Remote host closed the connection)
[7:51] <BillK> question about journals: used the formula and worked out I needed 1G, so reduced them down to to that from 4G
[7:51] <BillK> performance sucked, so after some variations, I am up to 5G per journal
[7:52] <BillK> seems the bigger the better, and they are basicly a queue in the way they work
[7:52] <BillK> Is that a valid observation?
[8:20] <phantomcircuit> BillK, sort of
[8:20] <phantomcircuit> there are a lot of parameters which change the performance characteristics of ceph
[8:20] <phantomcircuit> they depend largely on the workload from clients
[8:21] <BillK> Yes, I ralise its an (over) simplificatiom, but I am trying to understand why its wrking like it does in the real world
[8:21] <BillK> current I had a platter with the journals failed, replaced with ssd and recovery with
[8:22] <BillK> the reccomended size journals wasnt going well ... now seems to be sorting itself out
[8:33] * jjgalvez (~jjgalvez@cpe-76-175-30-67.socal.res.rr.com) has joined #ceph
[9:03] * Philip_ (~Philip@hnvr-4d079d7d.pool.mediaWays.net) has joined #ceph
[9:03] * eschnou (~eschnou@29.89-201-80.adsl-dyn.isp.belgacom.be) has joined #ceph
[9:05] * The_Bishop_ (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) has joined #ceph
[9:05] * The_Bishop (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) Quit (Read error: Connection reset by peer)
[9:18] * yanzheng (~zhyan@jfdmzpr01-ext.jf.intel.com) has joined #ceph
[9:20] * loicd (~loic@2a01:e35:2eba:db10:120b:a9ff:feb7:cce0) Quit (Quit: Leaving.)
[9:20] * loicd (~loic@magenta.dachary.org) has joined #ceph
[9:21] * lx0 is now known as lxo
[9:27] * The_Bishop__ (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) has joined #ceph
[9:27] * The_Bishop_ (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) Quit (Read error: Connection reset by peer)
[9:36] * Vjarjadian (~IceChat77@5ad6d005.bb.sky.com) Quit (Quit: Now if you will excuse me, I have a giant ball of oil to throw out my window)
[9:38] * eschnou (~eschnou@29.89-201-80.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[10:08] * BManojlovic (~steki@85.222.222.132) has joined #ceph
[10:09] * jjgalvez (~jjgalvez@cpe-76-175-30-67.socal.res.rr.com) Quit (Quit: Leaving.)
[10:13] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[10:13] * loicd (~loic@magenta.dachary.org) has joined #ceph
[10:13] * jtang1 (~jtang@79.97.135.214) has joined #ceph
[10:33] * The_Bishop__ (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) Quit (Quit: Wer zum Teufel ist dieser Peer? Wenn ich den erwische dann werde ich ihm mal die Verbindung resetten!)
[10:33] * The_Bishop (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) has joined #ceph
[11:06] * Philip__ (~Philip@hnvr-4d07bddd.pool.mediaWays.net) has joined #ceph
[11:09] * verwilst (~verwilst@dD5769628.access.telenet.be) has joined #ceph
[11:13] * Philip_ (~Philip@hnvr-4d079d7d.pool.mediaWays.net) Quit (Ping timeout: 480 seconds)
[11:17] * terje_ (~joey@63-154-143-198.mpls.qwest.net) has joined #ceph
[11:21] * eschnou (~eschnou@29.89-201-80.adsl-dyn.isp.belgacom.be) has joined #ceph
[11:25] * terje_ (~joey@63-154-143-198.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[11:34] * tziOm (~bjornar@ti0099a340-dhcp0628.bb.online.no) has joined #ceph
[11:47] * verwilst (~verwilst@dD5769628.access.telenet.be) Quit (Quit: Ex-Chat)
[11:51] * eschnou (~eschnou@29.89-201-80.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[12:06] * The_Bishop (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) Quit (Ping timeout: 480 seconds)
[12:14] * The_Bishop (~bishop@2001:470:50b6:0:8835:6a09:37fc:fef1) has joined #ceph
[12:43] * yanzheng (~zhyan@jfdmzpr01-ext.jf.intel.com) Quit (Remote host closed the connection)
[12:47] * terje_ (~joey@63-154-159-218.mpls.qwest.net) has joined #ceph
[12:54] * yanzheng (~zhyan@jfdmzpr01-ext.jf.intel.com) has joined #ceph
[12:55] * terje_ (~joey@63-154-159-218.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[13:11] * ScOut3R (~scout3r@BC24BBCE.dsl.pool.telekom.hu) has joined #ceph
[13:16] * xiaoxi (~xiaoxiche@jfdmzpr06-ext.jf.intel.com) has joined #ceph
[13:21] * yanzheng (~zhyan@jfdmzpr01-ext.jf.intel.com) Quit (Remote host closed the connection)
[13:33] * yanzheng (~zhyan@134.134.139.72) has joined #ceph
[13:40] * mikedawson (~chatzilla@c-98-220-189-67.hsd1.in.comcast.net) Quit (Quit: ChatZilla 0.9.90 [Firefox 19.0/20130215130331])
[13:44] * diegows (~diegows@190.188.190.11) has joined #ceph
[13:57] * yanzheng (~zhyan@134.134.139.72) Quit (Remote host closed the connection)
[14:03] * eschnou (~eschnou@29.89-201-80.adsl-dyn.isp.belgacom.be) has joined #ceph
[14:15] * diegows (~diegows@190.188.190.11) Quit (Ping timeout: 480 seconds)
[14:28] * The_Bishop (~bishop@2001:470:50b6:0:8835:6a09:37fc:fef1) Quit (Ping timeout: 480 seconds)
[14:34] * eschnou (~eschnou@29.89-201-80.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[14:36] * The_Bishop (~bishop@2001:470:50b6:0:f142:bed2:bc77:71bc) has joined #ceph
[15:23] * disarone (~disa@xdsl-81-173-232-158.netcologne.de) has joined #ceph
[15:32] * KindOne (~KindOne@h1.42.28.71.dynamic.ip.windstream.net) Quit (Ping timeout: 480 seconds)
[15:33] * lxo (~aoliva@lxo.user.oftc.net) Quit (Ping timeout: 480 seconds)
[15:35] * KindOne (KindOne@h1.42.28.71.dynamic.ip.windstream.net) has joined #ceph
[15:35] * xiaoxi (~xiaoxiche@jfdmzpr06-ext.jf.intel.com) Quit (Remote host closed the connection)
[15:43] * lxo (~aoliva@lxo.user.oftc.net) has joined #ceph
[15:47] <jtang1> there many people using juju?
[15:47] <jtang1> many/any?
[16:00] * Rocky (~r.nap@188.205.52.204) has joined #ceph
[16:20] * LeaChim (~LeaChim@b01bd511.bb.sky.com) Quit (Ping timeout: 480 seconds)
[16:22] * terje (~joey@63-154-159-219.mpls.qwest.net) has joined #ceph
[16:29] * LeaChim (~LeaChim@b0faa0c8.bb.sky.com) has joined #ceph
[16:31] * terje (~joey@63-154-159-219.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[16:32] * nwat (~nwatkins@c-50-131-197-174.hsd1.ca.comcast.net) has joined #ceph
[16:33] * nwat (~nwatkins@c-50-131-197-174.hsd1.ca.comcast.net) Quit ()
[16:33] * nwat (~nwatkins@c-50-131-197-174.hsd1.ca.comcast.net) has joined #ceph
[16:34] * nwat (~nwatkins@c-50-131-197-174.hsd1.ca.comcast.net) has left #ceph
[16:35] * nwat (~nwatkins@c-50-131-197-174.hsd1.ca.comcast.net) has joined #ceph
[16:37] * verwilst (~verwilst@dD5769628.access.telenet.be) has joined #ceph
[16:50] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[16:50] * loicd (~loic@magenta.dachary.org) has joined #ceph
[17:03] * terje (~joey@63-154-158-54.mpls.qwest.net) has joined #ceph
[17:03] * BillK (~BillK@124-148-75-89.dyn.iinet.net.au) Quit (Read error: Connection reset by peer)
[17:11] * terje (~joey@63-154-158-54.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[17:17] * noob21 (~cjh@pool-96-249-204-90.snfcca.dsl-w.verizon.net) has joined #ceph
[17:41] * mtk (~mtk@ool-44c35983.dyn.optonline.net) Quit (Remote host closed the connection)
[17:45] * mtk (~mtk@ool-44c35983.dyn.optonline.net) has joined #ceph
[17:50] * Vjarjadian (~IceChat77@5ad6d005.bb.sky.com) has joined #ceph
[18:02] <jluis> :wq
[18:03] <jluis> I wondered where it had gone; wrong window
[18:04] <SteveB> Yesterday I did a mkcephfs --mkfs on a new two-node cluster, each node with 36 disks/OSDs. It's still dragging along, not active+clean yet. The systems don't seem to be doing a whole lot so I'm not sure what's taking so long. Is this normal? All OSDs are in and up but I see a lot of them complaining about slow requests, oldest > 10000 seconds in some cases. It does seem to be making progress, just very slowly.
[18:11] * nwat (~nwatkins@c-50-131-197-174.hsd1.ca.comcast.net) Quit (Quit: nwat)
[18:21] * xmltok (~xmltok@cpe-76-170-26-114.socal.res.rr.com) has joined #ceph
[18:22] * xmltok (~xmltok@cpe-76-170-26-114.socal.res.rr.com) Quit ()
[18:26] * nwat (~nwatkins@soenat3.cse.ucsc.edu) has joined #ceph
[18:30] * Vjarjadian (~IceChat77@5ad6d005.bb.sky.com) Quit (Quit: Depression is merely anger without enthusiasm)
[18:33] * terje (~joey@63-154-158-54.mpls.qwest.net) has joined #ceph
[18:41] * terje (~joey@63-154-158-54.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[18:42] <gucki> SteveB: i think it's not normal. what does iostat -x -d 1 show?
[18:46] * noob21 (~cjh@pool-96-249-204-90.snfcca.dsl-w.verizon.net) Quit (Quit: Leaving.)
[18:55] * BillK (~BillK@124-148-91-65.dyn.iinet.net.au) has joined #ceph
[19:03] * diegows (~diegows@190.188.190.11) has joined #ceph
[19:05] * ScOut3R (~scout3r@BC24BBCE.dsl.pool.telekom.hu) Quit (Remote host closed the connection)
[19:07] <jmlowe> SteveB: Sounds like the problem I had for a week and a half, finally figured out one of my mons was choking, try restarting all of your mons
[19:07] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[19:07] * loicd (~loic@magenta.dachary.org) has joined #ceph
[19:08] <jmlowe> especially if you go almost exactly 15 minutes between recovery operations and you have mostly default settings
[19:14] <jmlowe> gucki: how many threads do your osd's use?
[19:14] <jmlowe> something like ps -eLf |grep osd.1|wc -l
[19:14] <gucki> jmlowe: guess you mean SteveB, right?
[19:15] <jmlowe> gucki: I'm asking you, mine are between 106 and 135 threads each, seems a little high to me
[19:16] <gucki> jmlowe: on a new bobtail cluster which is currently idle i see 55-67 threads per osd
[19:17] <gucki> jmlowe: mon uses 16 threads
[19:17] <jmlowe> gucki: ok, I guess the low 100's isn't that far off for a active bobtail cluster
[19:17] <gucki> jmlowe: i'll be able to tell you in a few days ;-)
[19:19] <gucki> jmlowe: on my an old argonat cluster, which i tuned for many small ops, i see 330 threads/ osd
[19:19] <jmlowe> gucki: I was expecting < 10, I don't know what is normal
[19:20] <gucki> jmlowe: not sure if ceph is evented or threaded...but it seems threaded. so probably a new thread for each neighbor osd, mon, etc..
[19:20] <gucki> jmlowe: and then threads for ops, filestore, etc...
[19:28] <MrNPP> not sure how to debug qemu with ceph. http://paste.scurvynet.com/?f2e63c644f67c694#9zKvdtVeFvTFzxPPONTC86s9LjnH7PoFfA62lFrXBFk=
[19:28] <MrNPP> qemu-img info rbd:libvirt-pool/gentoo-vm - works
[19:29] <jmlowe> what does virsh dumpxml vader look like?
[19:31] <MrNPP> jmlowe: http://paste.scurvynet.com/?470ecd6f77ca0c05#UjUtR/C9d4hwaHH0mLj2oh89GVFabEPvUiigrsx5XC8=
[19:31] <MrNPP> tried with auth, and without auth
[19:32] <jmlowe> hmm, so I rely on /etc/ceph/ceph.conf being there
[19:33] <jmlowe> http://pastebin.com/hdXiGbNa
[19:33] <iggy> those paste urls are almost as long as the paste
[19:34] <jmlowe> If I had to guess I'd say it wasn't parsing the extra ceph options or wasn't happy with the host name
[19:35] <MrNPP> iggy: i like encryption :)
[19:35] * slang (~slang@207-229-177-80.c3-0.drb-ubr1.chi-drb.il.cable.rcn.com) Quit (Ping timeout: 480 seconds)
[19:36] <janos> hrm. i have a heavily degraded cluster. and one of the osd's i just changed from btfrs to xfs - i added in with a weight of zero
[19:36] <janos> i've been reweighting it, but ceph osd tree always shows 0
[19:37] <janos> is it waiting for the degraded state to clear?
[19:37] * danieagle (~Daniel@186.214.56.35) has joined #ceph
[19:37] <MrNPP> jmlowe: i tried adding driver, error: unsupported configuration: unknown driver format value 'rbd'
[19:39] <MrNPP> i left out the hose, and still have trouble connecting to monitor
[19:42] <MrNPP> s/hose/host?
[19:42] <MrNPP> haha
[19:43] <janos> hahaha, i thought "left out the hose" was some saying i didn't get
[19:43] <janos> sounds like a real old-timer saying
[19:43] * terje (~joey@63-154-142-11.mpls.qwest.net) has joined #ceph
[19:44] <janos> "damint, that boy left out the hose again"
[19:45] <jmlowe> it might be, from what I recall the docs say to wait until degrades passes before changing the weight without saying why
[19:45] <jmlowe> that was for janos
[19:45] <janos> ah sorry, i dind't see that in the docs
[19:46] <janos> damnit, i left out the hose again
[19:46] <janos> i'm transferring large files to a new small home cluster (btrfs) and it seems fine until i come back later and 2 osd's are dead
[19:46] <janos> so i'm changing them to xfs for now
[19:47] <janos> trying to anyway - before getting too many dirty looks from my wife for neglecting kids, but playing with a cluster
[19:47] * BillK (~BillK@124-148-91-65.dyn.iinet.net.au) Quit (Ping timeout: 480 seconds)
[19:48] * slang1 (~slang@207-229-177-80.c3-0.drb-ubr1.chi-drb.il.cable.rcn.com) has joined #ceph
[19:49] <jmlowe> MrNPP: all I say is my xml works, here is what is actually generated, you might check /var/log/libvirt/qemu/vader.log
[19:50] <jmlowe> /usr/bin/kvm -name gw4 -S -M pc-1.2 -enable-kvm -m 8192 -smp 2,sockets=2,cores=1,threads=1 -uuid 1dd45f6d-81d5-ab45-8729-1f4ecd205eff -no-user-config -nodefaults -chardev socket,id=charmonitor,path=/var/lib/libvirt/qemu/gw4.monitor,server,nowait -mon chardev=charmonitor,id=monitor,mode=control -rtc base=utc -no-shutdown -device virtio-serial-pci,id=virtio-serial0,bus=pci.0,addr=0x6 -device piix3-usb-uhci,id=usb,bus=pci.0,addr=0x1.0x2 -driv
[19:51] <MrNPP> so for testing i commented out the auth supported
[19:51] * terje (~joey@63-154-142-11.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[19:52] <MrNPP> 2013-03-02 10:51:48.750785 7f9ffd27d700 10 monclient(hunting): none of our auth protocols are supported by the server
[19:52] <MrNPP> i'm guessing thats my problem
[19:53] <jmlowe> yeah, that doesn't sound like it would work
[19:54] <MrNPP> hmmm, not sure how to solve it
[19:54] <MrNPP> why would it complain if i don't have anything set
[19:55] <jmlowe> I'm hunting for the libvirt xml that would generate this argument to qemu "discard_granularity=512"
[20:05] <MrNPP> i might be an idiot
[20:05] <MrNPP> do all osd's have to have a copy of the key?
[20:06] <jmlowe> no, the osd's each have their own unique key
[20:06] <MrNPP> http://paste.scurvynet.com/?d942dd46a259cea6#it491FlLJRLRZOCeUjkRjkSL4LDBN+GcZ09bHHh/lZo=
[20:06] <MrNPP> i'm getting closer
[20:06] <MrNPP> osd_op_reply(1 gentoo-vm.rbd [stat] = -1 (Operation not permitted))
[20:15] * joao (~JL@89.181.153.32) has joined #ceph
[20:15] * ChanServ sets mode +o joao
[20:21] * jluis (~JL@89-181-155-140.net.novis.pt) Quit (Ping timeout: 480 seconds)
[20:32] * danieagle (~Daniel@186.214.56.35) Quit (Quit: Inte+ :-) e Muito Obrigado Por Tudo!!! ^^)
[20:39] * Philip__ (~Philip@hnvr-4d07bddd.pool.mediaWays.net) Quit (Ping timeout: 480 seconds)
[21:04] * sage (~sage@76.89.177.113) has joined #ceph
[21:14] * terje (~joey@63-154-150-38.mpls.qwest.net) has joined #ceph
[21:16] * davidz (~Adium@ip68-96-75-123.oc.oc.cox.net) Quit (Ping timeout: 480 seconds)
[21:19] * janeUbuntu (~jane@2001:3c8:c103:a001:54ed:2bf5:26ad:2af8) has joined #ceph
[21:22] * terje (~joey@63-154-150-38.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[21:37] * mtk0 (~mtk@ool-44c35983.dyn.optonline.net) has joined #ceph
[21:38] * mtk0 (~mtk@ool-44c35983.dyn.optonline.net) Quit (Remote host closed the connection)
[22:20] * disarone (~disa@xdsl-81-173-232-158.netcologne.de) Quit (Quit: This computer has made booboo)
[22:43] * slang1 (~slang@207-229-177-80.c3-0.drb-ubr1.chi-drb.il.cable.rcn.com) Quit (Ping timeout: 480 seconds)
[22:49] * nwat (~nwatkins@soenat3.cse.ucsc.edu) Quit (Quit: nwat)
[22:56] * leseb (~leseb@78.250.73.79) has joined #ceph
[22:58] * slang1 (~slang@207-229-177-80.c3-0.drb-ubr1.chi-drb.il.cable.rcn.com) has joined #ceph
[23:00] * lightspeed (~lightspee@81.187.0.153) Quit (Ping timeout: 480 seconds)
[23:02] * BillK (~BillK@124-169-212-175.dyn.iinet.net.au) has joined #ceph
[23:04] * terje (~joey@63-154-150-151.mpls.qwest.net) has joined #ceph
[23:12] * terje (~joey@63-154-150-151.mpls.qwest.net) Quit (Ping timeout: 480 seconds)
[23:16] * mdxi (~mdxi@74-95-29-182-Atlanta.hfc.comcastbusiness.net) Quit (Quit: leaving)
[23:16] * verwilst (~verwilst@dD5769628.access.telenet.be) Quit (Quit: Ex-Chat)
[23:21] * leseb (~leseb@78.250.73.79) Quit (Remote host closed the connection)
[23:27] * mdxi (~mdxi@74-95-29-182-Atlanta.hfc.comcastbusiness.net) has joined #ceph

These logs were automatically created by CephLogBot on irc.oftc.net using the Java IRC LogBot.