#ceph IRC Log

Index

IRC Log for 2013-05-12

Timestamps are in GMT/BST.

[0:01] * rustam (~rustam@94.15.91.30) has joined #ceph
[0:07] * BManojlovic (~steki@fo-d-130.180.254.37.targo.rs) Quit (Ping timeout: 480 seconds)
[0:10] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) Quit (Ping timeout: 480 seconds)
[0:10] * DarkAceZ (~BillyMays@50.107.54.92) has joined #ceph
[0:11] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) has joined #ceph
[0:15] * ay (~ay@91.247.228.48) has joined #ceph
[0:15] * ay says hi
[0:16] <ay> I have just set up a cluster of there machines. Everything seems fine. Except that osd does not start on node 3.
[0:16] <ay> Starting Ceph osd.3 on node03...
[0:16] <ay> global_init: unable to open config file from search list
[0:16] <ay> I can't for the love of some god understand why
[0:16] <Kioob> great, when PG are in "incomplete" state, you can't remove data (rbd rm), so... I don't know how to fix that state.
[0:17] <ay> ...search list /temp/ceph.conf.(somehashnotontmp)
[0:23] * BManojlovic (~steki@fo-d-130.180.254.37.targo.rs) has joined #ceph
[0:26] * leseb (~Adium@bea13-1-82-228-104-16.fbx.proxad.net) has joined #ceph
[0:27] <Kioob> sage : I seen that report http://tracker.ceph.com/issues/4672 ; I also see some "slow request ... currently reached pg" on 0.61.1. Is it same kind of problem ?
[0:35] * leseb (~Adium@bea13-1-82-228-104-16.fbx.proxad.net) Quit (Ping timeout: 480 seconds)
[0:43] * BillK (~BillK@124-169-231-135.dyn.iinet.net.au) has joined #ceph
[0:56] * Kdecherf (~kdecherf@shaolan.kdecherf.com) Quit (Ping timeout: 480 seconds)
[0:57] * Kdecherf (~kdecherf@shaolan.kdecherf.com) has joined #ceph
[0:58] * BManojlovic (~steki@fo-d-130.180.254.37.targo.rs) Quit (Quit: Ja odoh a vi sta 'ocete...)
[0:58] * BManojlovic (~steki@fo-d-130.180.254.37.targo.rs) has joined #ceph
[0:59] * portante` (~user@66.187.233.206) has joined #ceph
[1:00] * buck (~buck@c-24-6-91-4.hsd1.ca.comcast.net) has left #ceph
[1:00] * dosaboy (~dosaboy@host86-161-206-107.range86-161.btcentralplus.com) has joined #ceph
[1:01] * alexxy[home] (~alexxy@2001:470:1f14:106::2) has joined #ceph
[1:02] * Zethrok_ (~martin@95.154.26.34) has joined #ceph
[1:02] * treaki_ (504f231e82@p4FF4A89B.dip0.t-ipconnect.de) has joined #ceph
[1:03] * mistur_ (~yoann@kewl.mistur.org) has joined #ceph
[1:03] * BillK (~BillK@124-169-231-135.dyn.iinet.net.au) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * ggreg_ (~ggreg@int.0x80.net) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * Meths (rift@2.25.193.124) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * portante|afk (~user@66.187.233.206) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * dosaboy_ (~dosaboy@host86-161-206-107.range86-161.btcentralplus.com) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * LeaChim (~LeaChim@176.250.188.136) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * alexxy (~alexxy@2001:470:1f14:106::2) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * tjikkun (~tjikkun@2001:7b8:356:0:225:22ff:fed2:9f1f) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * rtek (~sjaak@rxj.nl) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * mynameisbruce (~mynameisb@tjure.netzquadrat.de) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * asadpanda (~asadpanda@2001:470:c09d:0:20c:29ff:fe4e:a66) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * mistur (~yoann@kewl.mistur.org) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * infernix (nix@5ED33947.cm-7-4a.dynamic.ziggo.nl) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * nyerup (irc@jespernyerup.dk) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * ivoks (~ivoks@jupiter.init.hr) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * wonko_be (bernard@november.openminds.be) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * Zethrok (~martin@95.154.26.34) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * soren (~soren@hydrogen.linux2go.dk) Quit (charon.oftc.net solenoid.oftc.net)
[1:03] * sbadia (~sbadia@yasaw.net) Quit (charon.oftc.net solenoid.oftc.net)
[1:04] * maswan (maswan@kennedy.acc.umu.se) Quit (Remote host closed the connection)
[1:04] * maswan (maswan@kennedy.acc.umu.se) has joined #ceph
[1:04] * mynameisbruce (~mynameisb@tjure.netzquadrat.de) has joined #ceph
[1:07] * BillK (~BillK@124-169-231-135.dyn.iinet.net.au) has joined #ceph
[1:07] * ggreg_ (~ggreg@int.0x80.net) has joined #ceph
[1:07] * Meths (rift@2.25.193.124) has joined #ceph
[1:07] * LeaChim (~LeaChim@176.250.188.136) has joined #ceph
[1:07] * nyerup (irc@jespernyerup.dk) has joined #ceph
[1:07] * tjikkun (~tjikkun@2001:7b8:356:0:225:22ff:fed2:9f1f) has joined #ceph
[1:07] * rtek (~sjaak@rxj.nl) has joined #ceph
[1:07] * asadpanda (~asadpanda@2001:470:c09d:0:20c:29ff:fe4e:a66) has joined #ceph
[1:07] * infernix (nix@5ED33947.cm-7-4a.dynamic.ziggo.nl) has joined #ceph
[1:07] * soren (~soren@hydrogen.linux2go.dk) has joined #ceph
[1:07] * ivoks (~ivoks@jupiter.init.hr) has joined #ceph
[1:07] * sbadia (~sbadia@yasaw.net) has joined #ceph
[1:07] * wonko_be (bernard@november.openminds.be) has joined #ceph
[1:18] * drokita1 (~drokita@24-107-180-86.dhcp.stls.mo.charter.com) has joined #ceph
[1:22] * drokita (~drokita@24-107-180-86.dhcp.stls.mo.charter.com) Quit (Ping timeout: 482 seconds)
[1:49] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) Quit (Ping timeout: 480 seconds)
[1:50] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) has joined #ceph
[1:52] * diegows (~diegows@190.190.2.126) has joined #ceph
[2:26] * rustam (~rustam@94.15.91.30) Quit (Remote host closed the connection)
[2:27] * rustam (~rustam@94.15.91.30) has joined #ceph
[2:28] * brambles_ is now known as brambles
[2:30] * eternaleye (~eternaley@cl-43.lax-02.us.sixxs.net) Quit (Ping timeout: 480 seconds)
[2:30] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) Quit (Ping timeout: 480 seconds)
[2:31] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) has joined #ceph
[2:35] * rustam (~rustam@94.15.91.30) Quit (Ping timeout: 480 seconds)
[2:39] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) Quit (Ping timeout: 480 seconds)
[2:41] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) has joined #ceph
[2:45] * eternaleye (~eternaley@2607:f878:fe00:802a::1) has joined #ceph
[2:45] * BManojlovic (~steki@fo-d-130.180.254.37.targo.rs) Quit (Quit: Ja odoh a vi sta 'ocete...)
[3:28] <lx0> woah, what happened in 0.61.1 vs 0.60 that the VIRT size of the osd processes went down from 3-4G to 600-800M? mon shrank from 2-3G to 300M, too!
[3:31] * lx0 is now known as lxo
[3:34] * esammy (~esamuels@host-2-103-103-135.as13285.net) has left #ceph
[3:59] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[3:59] * loicd (~loic@magenta.dachary.org) has joined #ceph
[4:03] * treaki__ (db70d26d90@p4FDF6EBA.dip0.t-ipconnect.de) has joined #ceph
[4:03] * diegows (~diegows@190.190.2.126) Quit (Ping timeout: 480 seconds)
[4:07] * treaki_ (504f231e82@p4FF4A89B.dip0.t-ipconnect.de) Quit (Ping timeout: 480 seconds)
[4:24] <themgt> are there any details on the new rgw CORS support?
[4:25] <themgt> ahh nm I see github issue
[4:28] * lxo (~aoliva@lxo.user.oftc.net) Quit (Quit: later)
[4:32] * rustam (~rustam@94.15.91.30) has joined #ceph
[4:37] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[4:37] * loicd (~loic@magenta.dachary.org) has joined #ceph
[4:38] * lxo (~aoliva@lxo.user.oftc.net) has joined #ceph
[4:49] <saras> boo
[4:59] <saras> - dot <<<< what is it
[5:05] <saras> is their inktanks guys here
[5:11] <saras> in the githut read one depense is dot what package it that their not pack in ubuntu of debian
[5:13] <phantomcircuit> 2013-05-12 05:18:15.838735 2afcf846700 1 heartbeat_map is_healthy 'OSD::op_tp thread 0x2afbf025700' had timed out after 15
[5:13] <phantomcircuit> this eventually ends with a fatal timeout
[5:13] <phantomcircuit> 0.56.3
[5:14] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[5:14] * loicd (~loic@magenta.dachary.org) has joined #ceph
[5:16] <phantomcircuit> common/HeartbeatMap.cc: 79: FAILED assert(0 == "hit suicide timeout")
[5:17] <phantomcircuit> damn of course i had to hit this at 8pm on a saturday
[5:26] <saras> if you make commit on line githut if fires a email or message or some thing right
[5:27] * themgt (~themgt@24-177-232-33.dhcp.gnvl.sc.charter.com) Quit (Quit: themgt)
[5:50] * [cave] (~quassel@boxacle.net) has joined #ceph
[5:51] <phantomcircuit> hmm i wonder if 0.56.3 is incompatible with 0.56.6
[5:52] <saras> sounds like possible
[5:52] <saras> sounds possible
[6:09] <phantomcircuit> 2013-05-12 06:09:09.656867 2e8cd3c4700 1 journal check_for_full at 84467712 : JOURNAL FULL 84467712 >= 4095 (max_size 994050048 start 84471808)
[6:09] <phantomcircuit> both are in this state
[6:09] <phantomcircuit> and dont appear to be actively doing anything
[6:15] <saras> will I think your right their don't like each other
[6:22] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) Quit (Ping timeout: 480 seconds)
[6:23] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[6:23] * loicd (~loic@magenta.dachary.org) has joined #ceph
[6:23] * mnash (~chatzilla@66-194-114-178.static.twtelecom.net) Quit (Ping timeout: 480 seconds)
[6:23] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) has joined #ceph
[6:26] <saras> Oh 2 five task strated not bad right
[6:26] <saras> phantomcircuit: good luck i wish could help
[6:29] <saras> https://github.com/sarasfox/ceph/blob/master/README.pi i got something done
[6:29] <saras> nite all
[6:45] <phantomcircuit> this is bizarre
[6:45] <phantomcircuit> the osds are running
[6:45] <phantomcircuit> and appear to be fine
[6:45] <phantomcircuit> except they wont connect to the monitors
[7:02] * vipr_ (~vipr@78-23-112-130.access.telenet.be) has joined #ceph
[7:09] * vipr (~vipr@78-23-113-244.access.telenet.be) Quit (Ping timeout: 480 seconds)
[7:57] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[7:58] * loicd (~loic@magenta.dachary.org) has joined #ceph
[8:32] * tnt (~tnt@91.178-201-80.adsl-dyn.isp.belgacom.be) has joined #ceph
[9:25] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) Quit (Ping timeout: 480 seconds)
[9:29] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) has joined #ceph
[9:44] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) has joined #ceph
[9:58] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[10:21] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) has joined #ceph
[10:50] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[10:50] * loicd (~loic@magenta.dachary.org) has joined #ceph
[11:05] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[11:14] * rustam (~rustam@94.15.91.30) Quit (Remote host closed the connection)
[11:14] * rustam (~rustam@94.15.91.30) has joined #ceph
[11:22] * rustam (~rustam@94.15.91.30) Quit (Ping timeout: 480 seconds)
[11:30] * andreask (~andreas@h081217068225.dyn.cm.kabsi.at) has joined #ceph
[11:30] * andreask (~andreas@h081217068225.dyn.cm.kabsi.at) Quit ()
[11:55] * jamespage (~jamespage@culvain.gromper.net) has joined #ceph
[12:32] * rustam (~rustam@94.15.91.30) has joined #ceph
[13:44] * athrift (~nz_monkey@123.255.47.222) Quit (Ping timeout: 480 seconds)
[13:54] * diegows (~diegows@190.190.2.126) has joined #ceph
[14:10] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) has joined #ceph
[14:16] * bergerx (~bergerx@194.27.149.32) has joined #ceph
[14:18] * athrift (~nz_monkey@222.47.255.123.static.snap.net.nz) has joined #ceph
[14:50] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[14:57] * brian_appscale (~brian@2600:1008:b100:3c43:8490:a223:6e86:25ea) has joined #ceph
[14:57] * bergerx (~bergerx@194.27.149.32) Quit (Ping timeout: 480 seconds)
[15:14] <BillK> after recreating a 3 osd setup on a single host, I can create an image but get this error when I try and map it ... why? rbd: add failed: (6) No such device or address
[15:15] <BillK> used .56.3, .56.5, 0.60, 0.61 and 0.61.1 which is what destroyed the original system (took out the OS when it crashed :(
[15:19] * bergerx (~bergerx@194.27.149.32) has joined #ceph
[15:20] * tnt (~tnt@91.178-201-80.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[15:25] * brian_appscale (~brian@2600:1008:b100:3c43:8490:a223:6e86:25ea) Quit (Ping timeout: 480 seconds)
[15:34] * leseb (~Adium@bea13-1-82-228-104-16.fbx.proxad.net) has joined #ceph
[15:41] * diegows (~diegows@190.190.2.126) Quit (Ping timeout: 480 seconds)
[16:04] * Rorik_ (~rorik@199.182.216.68) has joined #ceph
[16:04] * kyle__ (~kyle@216.183.64.10) has joined #ceph
[16:04] * capri_wk (~capri@212.218.127.222) has joined #ceph
[16:04] * eegiks (~quassel@2a01:e35:8a2c:b230:98d5:8632:b533:6675) Quit (Ping timeout: 480 seconds)
[16:04] * leseb (~Adium@bea13-1-82-228-104-16.fbx.proxad.net) Quit (Quit: Leaving.)
[16:04] * ShaunR- (~ShaunR@staff.ndchost.com) has joined #ceph
[16:04] * scuttlemonkey_ (~scuttlemo@c-69-244-181-5.hsd1.mi.comcast.net) has joined #ceph
[16:05] * john_barbee__ (~jbarbee@c-98-226-73-253.hsd1.in.comcast.net) Quit (Read error: Connection reset by peer)
[16:05] * john_barbee_ (~jbarbee@c-98-226-73-253.hsd1.in.comcast.net) has joined #ceph
[16:05] * Romeo_ (~romeo@198.144.195.85) has joined #ceph
[16:05] * Romeo (~romeo@198.144.195.85) Quit (Read error: Connection reset by peer)
[16:08] * eegiks (~quassel@2a01:e35:8a2c:b230:b981:9397:6cc3:f108) has joined #ceph
[16:09] * Rorik (~rorik@199.182.216.68) Quit (Ping timeout: 481 seconds)
[16:10] * ShaunR (ShaunR@ip68-96-89-159.oc.oc.cox.net) Quit (Ping timeout: 480 seconds)
[16:10] * coyo (~unf@pool-71-170-191-140.dllstx.fios.verizon.net) has joined #ceph
[16:11] * kyle_ (~kyle@216.183.64.10) Quit (Ping timeout: 480 seconds)
[16:11] * scuttlemonkey (~scuttlemo@c-69-244-181-5.hsd1.mi.comcast.net) Quit (Ping timeout: 480 seconds)
[16:11] * capri_on (~capri@212.218.127.222) Quit (Ping timeout: 480 seconds)
[16:18] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) has joined #ceph
[16:19] * KindTwo (~KindOne@h184.178.130.174.dynamic.ip.windstream.net) has joined #ceph
[16:21] * john_barbee__ (~jbarbee@c-98-226-73-253.hsd1.in.comcast.net) has joined #ceph
[16:21] * KindOne (KindOne@0001a7db.user.oftc.net) Quit (Ping timeout: 480 seconds)
[16:21] * KindTwo is now known as KindOne
[16:21] * john_barbee_ (~jbarbee@c-98-226-73-253.hsd1.in.comcast.net) Quit (Ping timeout: 480 seconds)
[16:30] * john_barbee__ (~jbarbee@c-98-226-73-253.hsd1.in.comcast.net) Quit (Ping timeout: 480 seconds)
[16:58] * bergerx (~bergerx@194.27.149.32) Quit (Quit: Leaving)
[17:06] * john_barbee_ (~jbarbee@c-98-226-73-253.hsd1.in.comcast.net) has joined #ceph
[17:17] * saras (~kvirc@74-61-8-52.war.clearwire-wmx.net) Quit (Quit: KVIrc 4.1.3 Equilibrium http://www.kvirc.net/)
[17:29] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[17:30] * yehuda_hm (~yehuda@2602:306:330b:1410:9c43:730e:bbf0:a165) Quit (Ping timeout: 480 seconds)
[17:30] * yehuda_hm (~yehuda@2602:306:330b:1410:942f:17b1:c111:4865) has joined #ceph
[17:48] * newbie (~kvirc@74-61-8-52.war.clearwire-wmx.net) has joined #ceph
[17:49] <newbie> anyone how ceph uses libatmoic-ops around
[17:49] * newbie is now known as saras
[18:06] * loicd (~loic@magenta.dachary.org) Quit (Quit: Leaving.)
[18:06] * loicd (~loic@magenta.dachary.org) has joined #ceph
[18:08] * BillK (~BillK@124-169-231-135.dyn.iinet.net.au) Quit (Ping timeout: 480 seconds)
[18:09] <lxo> I think I know what reduced the VIRT size so much after I upgraded to 0.61.1: I started using pre-built packages rather than rolling out my own, and for some reason mine didn't use tcmalloc whereas the pre-built ones do. I'm surprised it makes such a difference!
[18:26] * mnash (~chatzilla@vpn.expressionanalysis.com) has joined #ceph
[18:35] * The_Bishop (~bishop@93.182.144.2) has joined #ceph
[18:44] * The_Bishop (~bishop@93.182.144.2) Quit (Ping timeout: 480 seconds)
[18:53] * dcasier (~dcasier@223.103.120.78.rev.sfr.net) has joined #ceph
[18:57] * gregaf1 (~Adium@2607:f298:a:607:c8b1:de29:9e01:d804) has joined #ceph
[18:59] * The_Bishop (~bishop@2001:470:50b6:0:d59f:b451:2b64:16b6) has joined #ceph
[19:04] * gregaf (~Adium@2607:f298:a:607:10e3:a393:f44c:3d3) Quit (Ping timeout: 480 seconds)
[19:05] * dcasier (~dcasier@223.103.120.78.rev.sfr.net) Quit (Read error: No route to host)
[19:06] <via> joao: ping
[19:13] * tnt (~tnt@91.177.214.32) has joined #ceph
[19:22] <joao> via, here, but barely
[19:22] <joao> what's up?
[19:23] <via> joao: well, i've been with a completely nonfunctioning cluster for many days now...and was really wondering if i could safely downgrade back to bobtail
[19:23] <via> so that i can use it
[19:24] <joao> I don't think you can downgrade the osds; iirc there was a format change of sorts
[19:25] <via> well, i never restarted the osds with the new version
[19:25] <via> only the mons
[19:25] <joao> ah
[19:25] <joao> well
[19:25] <joao> have the monitors ever formed quorum?
[19:25] <via> the monitors won't start because of the crash
[19:26] <via> i tried updating all three at the same time
[19:26] <via> as per the release notes
[19:26] <joao> that crash being the one during store conversion, right?
[19:26] <via> yeah
[19:26] <joao> we have an idea on how to fix that, but haven't gotten around to implement it though
[19:26] <joao> will have to wait for tomorrow
[19:26] <via> on all nodes
[19:26] <via> to fix the conversion?
[19:27] <joao> well, the bug is not in the conversion; do you recall which bobtail you have used so far?
[19:27] <via> .56.6
[19:27] <via> was what i was using immediately prior
[19:27] <joao> did you used any prior versions to that at some point?
[19:27] <via> technically the osd's and mds's are still running with that
[19:27] <via> yes
[19:28] <via> i started with argonaut and have moved up over the last 6 months
[19:28] <joao> we believe you're suffering from fallout from a previously fixed bug during bobtail, in which Global Versions (required for the new monitor format) would sometimes incur in duplicate versions
[19:29] <joao> so the fix would be to make conversion 'smarter'
[19:29] <joao> in order to work around that
[19:30] <joao> and I think that the reason why you're hitting that with the mdsmonitor is that we don't trim mdsmaps, so those old versions are still there
[19:31] <joao> anyway, if you never reached quorum and still have all the mon data dir contents, then yes, it is safe to downgrade the monitors
[19:31] <via> okay cool
[19:31] <via> thanks for working on this, i'm glad to hear there's a solution in the works
[19:32] <via> i suppose mainly just wondering...there's nothing about the conversion process starting that and failing that would make the downgrade screw up?
[19:32] <via> i'll go ahead and try it now
[19:37] <joao> no
[19:37] <joao> the conversion process is read-only wrt the old store
[19:41] <via> awesome
[19:41] <via> joao: it worked falwlessly
[19:41] <via> flawlessly even
[19:42] <via> anyway, if/when you all get a fix, i'm willing to test
[19:43] <joao> cool
[19:43] <joao> glad it worked
[19:43] <joao> and thanks! :)
[19:52] <phantomcircuit> i have an osd that's failing with common/HeartbeatMap.cc: 79: FAILED assert(0 == "hit suicide timeout")
[19:53] <phantomcircuit> i recently took updated the monitors from 0.56.3 to 0.56.6 while the osd is still 0.56.3
[19:57] <saras> any core dev's around
[20:00] * lxo (~aoliva@lxo.user.oftc.net) Quit (Quit: later)
[20:00] * lxo (~aoliva@lxo.user.oftc.net) has joined #ceph
[20:54] * jksM (~jks@3e6b5724.rev.stofanet.dk) Quit (Quit: jksM)
[20:54] * Cube (~Cube@cpe-76-95-217-129.socal.res.rr.com) has joined #ceph
[20:56] * BManojlovic (~steki@fo-d-130.180.254.37.targo.rs) has joined #ceph
[20:57] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) has joined #ceph
[20:59] * uli (~uli@p57BA4A5E.dip0.t-ipconnect.de) has joined #ceph
[21:00] <uli> hey there, i get a strange error, and have no idea whats missing, if i do ceph-deploy disk zap sba-ceph01:sdb no error is reported
[21:01] <uli> on host sba-ceph01 i get in syslog: sd 0:0:1:0: [sdb] Cache data unavailable sd 0:0:1:0: [sdb] Assuming drive cache: write through sdb: unknown partition table
[21:02] <uli> why the partitiontable is unknown?
[21:05] <uli> when i then try to do a ceph-deploy osd create sba-ceph01:sdb on host sba-ceph01 in osd-logfile i get : mkfs in /var/lib/ceph/tmp/mnt.9paj1W mkfs fsid is already set to 872e63af-e60f-4a9e-9423-fc7f122bb46 leveldb db exists/created mkjournal error creating journal on /var/lib/ceph/tmp/mnt.9paj1W/journal: (2) No such file or directory
[21:05] <uli> error ceating bla
[21:06] <uli> has someone an idea what could be missing
[21:06] <uli> ceph version 0.61.1
[21:06] <uli> os: debian wheezy
[21:08] <Cube> sdb1 instead of sdb maybe?
[21:09] * mistur_ is now known as mistur
[21:11] <phantomcircuit> 2013-05-12 21:16:45.260430 2a4e90d9700 1 heartbeat_map is_healthy 'OSD::op_tp thread 0x2a4db0bd700' had timed out after 15
[21:12] <phantomcircuit> this eventually times out completely
[21:12] <phantomcircuit> joao, any idea why heartbeats would fail?
[21:13] <phantomcircuit> common/HeartbeatMap.cc: 79: FAILED assert(0 == "hit suicide timeout")
[21:17] <phantomcircuit> 2013-05-12 21:18:03.962227 2a4d6fb4700 0 -- 0.0.0.0:6802/20603 >> x.x.x.x:6803/29454 pipe(0x79600dcc80 sd=29 :54946 s=1 pgs=0 cs=1 l=0).connect claims to be 0.0.0.0:6803/20603 not x.x.x.x:6803/29454 - wrong node!
[21:17] <phantomcircuit> wat
[21:24] * Cube (~Cube@cpe-76-95-217-129.socal.res.rr.com) Quit (Quit: Leaving.)
[21:30] * jks (~jks@3e6b5724.rev.stofanet.dk) has joined #ceph
[21:54] * Cube (~Cube@cpe-76-95-217-129.socal.res.rr.com) has joined #ceph
[22:00] <saras> uli: give me few see if works for me
[22:06] * Cube (~Cube@cpe-76-95-217-129.socal.res.rr.com) Quit (Ping timeout: 480 seconds)
[22:09] <joao> phantomcircuit, iirc, one possible cause are ops-in-flight delaying stuff
[22:10] <joao> don't really have the time to dive into it now though, sorry
[22:10] <joao> trying to finish a presentation on btrfs to give a class tomorrow (and by finish I mean start)
[22:14] <saras> joao: lol
[22:18] <uli> saras, used sdb1 instead sdb, cube found the typo....
[22:18] <saras> kool
[22:18] <saras> then i was going to have issue then
[22:18] <saras> i don
[22:19] <saras> i don't think what in sda has any filesystem on it
[22:24] <saras> so i was going to take some time
[22:31] <saras> https://www.coursera.org/course/malsoftware is one taking this class
[22:31] <saras> https://www.coursera.org/course/malsoftware is anyone taking this class
[22:37] * b1tbkt_ (~Peekaboo@24-216-67-250.dhcp.stls.mo.charter.com) has joined #ceph
[22:37] <saras> have fun guys i got run
[22:37] * saras (~kvirc@74-61-8-52.war.clearwire-wmx.net) Quit (Quit: KVIrc 4.1.3 Equilibrium http://www.kvirc.net/)
[22:38] * themgt (~themgt@24-177-232-33.dhcp.gnvl.sc.charter.com) has joined #ceph
[22:39] * rustam (~rustam@94.15.91.30) Quit (Remote host closed the connection)
[22:41] * alo (~al.o@host73-111-dynamic.48-82-r.retail.telecomitalia.it) has joined #ceph
[22:41] * b1tbkt (~Peekaboo@24-216-67-250.dhcp.stls.mo.charter.com) Quit (Ping timeout: 480 seconds)
[22:53] * BillK (~BillK@124-169-231-135.dyn.iinet.net.au) has joined #ceph
[22:54] * eschnou (~eschnou@175.93-201-80.adsl-dyn.isp.belgacom.be) Quit (Remote host closed the connection)
[23:04] * uli (~uli@p57BA4A5E.dip0.t-ipconnect.de) Quit (Ping timeout: 480 seconds)
[23:05] * rustam (~rustam@94.15.91.30) has joined #ceph
[23:11] * BillK (~BillK@124-169-231-135.dyn.iinet.net.au) Quit (Remote host closed the connection)
[23:13] * consus (~Unknown@ppp91-122-142-36.pppoe.avangarddsl.ru) has joined #ceph
[23:20] * BillK (~BillK@124-169-231-135.dyn.iinet.net.au) has joined #ceph
[23:23] <nigwil> is it possible that a MDS service restart loses log contents? I just had to restart mine since it was wedged and now the logs are gone
[23:24] * consus (~Unknown@ppp91-122-142-36.pppoe.avangarddsl.ru) Quit (Quit: no)
[23:54] * alo (~al.o@host73-111-dynamic.48-82-r.retail.telecomitalia.it) Quit (Ping timeout: 480 seconds)
[23:59] * Cube (~Cube@cpe-76-95-217-129.socal.res.rr.com) has joined #ceph

These logs were automatically created by CephLogBot on irc.oftc.net using the Java IRC LogBot.