#ceph IRC Log

IRC Log for 2014-03-03

Timestamps are in GMT/BST.

[0:14] * jks (~jks@3e6b5724.rev.stofanet.dk) has joined #ceph
[0:16] * Discovery (~Discovery@109.235.55.205) has joined #ceph
[0:23] * davidzlap (~Adium@cpe-23-242-31-175.socal.res.rr.com) Quit (Quit: Leaving.)
[0:25] * yanzheng (~zhyan@134.134.139.72) has joined #ceph
[0:33] * BManojlovic (~steki@fo-d-130.180.254.37.targo.rs) Quit (Quit: Ja odoh a vi sta 'ocete...)
[0:33] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) has joined #ceph
[0:37] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) Quit ()
[0:39] * houkouonchi-home (~linux@houkouonchi-1-pt.tunnel.tserv15.lax1.ipv6.he.net) Quit (Ping timeout: 480 seconds)
[0:44] * ivotron (~ivotron@c-50-150-124-250.hsd1.ca.comcast.net) Quit (Remote host closed the connection)
[0:44] * ivotron (~ivotron@2601:9:2700:178:686e:53ce:b49:248e) has joined #ceph
[0:52] * KevinPerks (~Adium@cpe-066-026-252-218.triad.res.rr.com) Quit (Quit: Leaving.)
[0:52] * ivotron (~ivotron@2601:9:2700:178:686e:53ce:b49:248e) Quit (Ping timeout: 480 seconds)
[0:58] * yanzheng (~zhyan@134.134.139.72) Quit (Ping timeout: 480 seconds)
[1:01] * KevinPerks (~Adium@cpe-066-026-252-218.triad.res.rr.com) has joined #ceph
[1:05] * root__ (~root@90.201.113.214) has joined #ceph
[1:06] * root__ (~root@90.201.113.214) Quit ()
[1:14] * LeaChim (~LeaChim@host86-166-182-74.range86-166.btcentralplus.com) Quit (Ping timeout: 480 seconds)
[1:17] * ivotron (~ivotron@2601:9:2700:178:2d2e:7143:5260:2622) has joined #ceph
[1:20] * wschulze (~wschulze@cpe-72-229-37-201.nyc.res.rr.com) Quit (Quit: Leaving.)
[1:27] * Midnightmyth_ (~quassel@93-167-84-102-static.dk.customer.tdc.net) Quit (Ping timeout: 480 seconds)
[1:35] * lightspeed (~lightspee@2001:8b0:16e:1:216:eaff:fe59:4a3c) Quit (Quit: Leaving)
[1:40] * kitz_ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) has joined #ceph
[1:41] * xmltok (~xmltok@cpe-76-90-130-65.socal.res.rr.com) Quit (Quit: Bye!)
[1:43] * xmltok (~xmltok@cpe-76-90-130-65.socal.res.rr.com) has joined #ceph
[1:46] * diegows (~diegows@190.190.5.238) has joined #ceph
[1:52] * elder (~elder@z88l218.static.ctm.net) has joined #ceph
[1:52] * ChanServ sets mode +o elder
[1:59] * geekmush (~Adium@cpe-66-68-198-33.rgv.res.rr.com) Quit (Quit: Leaving.)
[2:00] * geekmush (~Adium@cpe-66-68-198-33.rgv.res.rr.com) has joined #ceph
[2:11] * raso (~raso@deb-multimedia.org) Quit (Quit: WeeChat 0.4.3)
[2:12] * raso (~raso@deb-multimedia.org) has joined #ceph
[2:17] * kitz_ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) Quit (Remote host closed the connection)
[2:17] * kitz_ (~kitz@admin163-72.hampshire.edu) has joined #ceph
[2:18] * nerdtron (~oftc-webi@202.60.8.250) has joined #ceph
[2:19] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) has joined #ceph
[2:20] <nerdtron> hi all
[2:21] <nerdtron> health HEALTH_WARN mds cluster is degraded; mds ceph-node1 is laggy
[2:21] <nerdtron> any idea how to handle this?
[2:22] * cjh973 (~cjh973@ps123903.dreamhost.com) Quit (Ping timeout: 480 seconds)
[2:22] * cjh973 (~cjh973@ps123903.dreamhost.com) has joined #ceph
[2:29] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has joined #ceph
[2:36] * yguang11 (~yguang11@2406:2000:ef96:e:c8d9:2df3:ef14:1ab8) has joined #ceph
[2:37] * xmltok (~xmltok@cpe-76-90-130-65.socal.res.rr.com) Quit (Quit: Bye!)
[2:38] * kitz__ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) has joined #ceph
[2:39] * kitz_ (~kitz@admin163-72.hampshire.edu) Quit (Ping timeout: 480 seconds)
[2:43] * glzhao (~glzhao@220.181.11.232) has joined #ceph
[2:44] <kitz__> I created journal buckets and moved my osds into them grouped by their ssd journal drive using `ceph osd crush add-bucket` and `ceph osd crush move`. When I rebooted one of my nodes all of the osds moved out of the journal buckets and are now just in the host bucket. How do I get my CRUSH map to stick?
[2:51] * elder (~elder@z88l218.static.ctm.net) Quit (Quit: This computer has gone to sleep)
[2:57] * lightspeed (~lightspee@2001:8b0:16e:1:216:eaff:fe59:4a3c) has joined #ceph
[3:04] * yguang11 (~yguang11@2406:2000:ef96:e:c8d9:2df3:ef14:1ab8) Quit (Remote host closed the connection)
[3:04] * yguang11 (~yguang11@2406:2000:ef96:e:c8d9:2df3:ef14:1ab8) has joined #ceph
[3:07] * Discovery (~Discovery@109.235.55.205) Quit (Read error: Connection reset by peer)
[3:09] * valeech (~valeech@pool-71-171-123-210.clppva.fios.verizon.net) has joined #ceph
[3:15] * erkules_ (~erkules@port-92-193-46-105.dynamic.qsc.de) has joined #ceph
[3:20] * dlan__ (~dennis@116.228.88.131) has joined #ceph
[3:22] * haomaiwa_ (~haomaiwan@118.186.133.131) Quit (Remote host closed the connection)
[3:22] * erkules (~erkules@port-92-193-74-66.dynamic.qsc.de) Quit (Ping timeout: 480 seconds)
[3:22] <kitz__> uhg. rebooted another node and got the same behavior. Only I also had one of my OSDs come up as /dev/sdae which had previously been where I was keeping journals.
[3:22] * dlan__ (~dennis@116.228.88.131) Quit ()
[3:23] * dlan__ (~dennis@116.228.88.131) has joined #ceph
[3:23] * haomaiwang (~haomaiwan@219-87-173-15.static.tfn.net.tw) has joined #ceph
[3:24] <kitz__> Which brings me to: Can I `ceph-deploy osd prepare host.domain:sdx:/dev/disk/by-partuuid/<GUID>`?
[3:26] * dlan__ (~dennis@116.228.88.131) Quit ()
[3:36] * cofol1986 (~xwrj@110.90.119.113) has joined #ceph
[3:37] <cofol1986> Hey guys, has anyone compiled ceph-osd for ARM?
[3:40] * wrale-josh (~oftc-webi@cpe-107-9-20-3.woh.res.rr.com) Quit (Remote host closed the connection)
[3:40] * yguang11 (~yguang11@2406:2000:ef96:e:c8d9:2df3:ef14:1ab8) Quit ()
[3:41] * diegows (~diegows@190.190.5.238) Quit (Ping timeout: 480 seconds)
[3:43] <kitz__> Removing my bad journal symlinks on the OSDs and replacing them with new symlinks to /dev/disk/by-partuuid/<GUID> seems to be working fine so far.
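The fix kitz__ describes works because /dev/disk/by-partuuid/<GUID> names stay the same across reboots, while kernel names such as /dev/sdae can be reassigned. A minimal Python sketch of re-pointing a journal symlink follows. It assumes the journal is a symlink called `journal` inside the OSD data directory and that the OSD is stopped first; the helper name `repoint_journal` is hypothetical and not part of Ceph.

```python
import os


def repoint_journal(osd_dir, stable_target):
    """Replace osd_dir/journal with a symlink to stable_target.

    Hypothetical helper: assumes the journal is a symlink named
    'journal' in the OSD data directory and that the OSD is stopped.
    """
    link = os.path.join(osd_dir, "journal")
    tmp = link + ".new"
    # Create the new link next to the old one, then rename it into
    # place so the directory never lacks a journal entry.
    if os.path.lexists(tmp):
        os.remove(tmp)
    os.symlink(stable_target, tmp)
    os.replace(tmp, link)
    return os.readlink(link)
```

The target passed in would be the /dev/disk/by-partuuid/<GUID> path of the journal partition; the GUID itself has to be looked up on the host.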
[3:44] * kitz__ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) Quit (Remote host closed the connection)
[3:45] * kitz_ (~kitz@admin162-254.hampshire.edu) has joined #ceph
[3:55] * ircolle (~Adium@2601:1:8380:2d9:f8ac:374b:8663:f102) has joined #ceph
[3:57] * yanzheng (~zhyan@134.134.139.72) has joined #ceph
[4:02] * kitz__ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) has joined #ceph
[4:06] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has left #ceph
[4:09] * kitz_ (~kitz@admin162-254.hampshire.edu) Quit (Ping timeout: 480 seconds)
[4:11] * valeech (~valeech@pool-71-171-123-210.clppva.fios.verizon.net) Quit (Quit: valeech)
[4:12] <lurbs> Anyone else had trouble with Ceph monitors failing to be able to bind to IPv6 addresses on boot? Starting after boot is fine.
[4:13] <lurbs> I can even get netcat to bind to the same IP/port inside the pre-start script block of the Upstart init script (/etc/init/ceph-mon.conf), so the address is definitely on the machine.
[4:13] <lurbs> A netstat inside the same block verifies nothing else is listening there.
[4:23] * kitz__ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) Quit (Quit: kitz__)
[5:03] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has joined #ceph
[5:27] * KevinPerks (~Adium@cpe-066-026-252-218.triad.res.rr.com) Quit (Quit: Leaving.)
[5:27] * Vacum_ (~vovo@88.130.216.34) has joined #ceph
[5:34] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) Quit (Quit: My MacBook has gone to sleep. ZZZzzz…)
[5:34] * Vacum (~vovo@88.130.216.149) Quit (Ping timeout: 480 seconds)
[5:35] * pvh_sa (~pvh@41-133-202-127.dsl.mweb.co.za) Quit (Ping timeout: 480 seconds)
[5:46] * ircolle (~Adium@2601:1:8380:2d9:f8ac:374b:8663:f102) Quit (Quit: Leaving.)
[5:58] * KevinPerks (~Adium@cpe-066-026-252-218.triad.res.rr.com) has joined #ceph
[5:59] * KevinPerks1 (~Adium@cpe-066-026-252-218.triad.res.rr.com) has joined #ceph
[5:59] * KevinPerks (~Adium@cpe-066-026-252-218.triad.res.rr.com) Quit (Read error: Connection reset by peer)
[6:02] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has left #ceph
[6:10] * KevinPerks1 (~Adium@cpe-066-026-252-218.triad.res.rr.com) Quit (Ping timeout: 480 seconds)
[6:14] * warrenSusui (~Warren@2607:f298:a:607:38fc:445b:1848:70d4) Quit (Read error: Connection reset by peer)
[6:14] * warrenSusui (~Warren@2607:f298:a:607:38fc:445b:1848:70d4) has joined #ceph
[6:15] * sprachgenerator (~sprachgen@c-67-167-211-254.hsd1.il.comcast.net) Quit (Quit: sprachgenerator)
[6:37] * sputnik13 (~sputnik13@wsip-68-105-248-60.sd.sd.cox.net) has joined #ceph
[6:44] * mattt (~textual@cpc9-rdng20-2-0-cust565.15-3.cable.virginm.net) has joined #ceph
[6:46] * mattt (~textual@cpc9-rdng20-2-0-cust565.15-3.cable.virginm.net) Quit ()
[6:48] * haomaiwa_ (~haomaiwan@219-87-173-15.static.tfn.net.tw) has joined #ceph
[6:48] * haomaiwang (~haomaiwan@219-87-173-15.static.tfn.net.tw) Quit (Read error: Connection reset by peer)
[6:59] * sprachgenerator (~sprachgen@c-67-167-211-254.hsd1.il.comcast.net) has joined #ceph
[7:00] * wschulze (~wschulze@cpe-72-229-37-201.nyc.res.rr.com) has joined #ceph
[7:11] * elder (~elder@z88l218.static.ctm.net) has joined #ceph
[7:11] * ChanServ sets mode +o elder
[7:14] * capri (~capri@212.218.127.222) has joined #ceph
[7:19] * calcifer (~calcifer@hudrydum.cz) has joined #ceph
[7:28] * hasues (~hazuez@108-236-232-243.lightspeed.knvltn.sbcglobal.net) Quit (Quit: Leaving.)
[7:37] * rotbeard (~redbeard@2a02:908:df19:7a80:76f0:6dff:fe3b:994d) Quit (Quit: Verlassend)
[7:41] <winston-d> a n00b question about ceph-mon: how does it save (persistently) the state of whole cluster when ceph-mon service is stopped?
[7:42] * calcifer (~calcifer@hudrydum.cz) Quit (Remote host closed the connection)
[7:43] * AfC (~andrew@2407:7800:400:1011:2ad2:44ff:fe08:a4c) Quit (Quit: Leaving.)
[7:48] * elder (~elder@z88l218.static.ctm.net) Quit (Quit: This computer has gone to sleep)
[7:56] * mattt (~textual@94.236.7.190) has joined #ceph
[7:56] * JeffK (~JeffK@38.99.52.10) Quit (Read error: Connection reset by peer)
[7:56] * wschulze (~wschulze@cpe-72-229-37-201.nyc.res.rr.com) Quit (Quit: Leaving.)
[7:57] * JeffK (~JeffK@38.99.52.10) has joined #ceph
[8:00] * sprachgenerator (~sprachgen@c-67-167-211-254.hsd1.il.comcast.net) Quit (Quit: sprachgenerator)
[8:02] * elder (~elder@n182z4l226.static.ctm.net) has joined #ceph
[8:02] * ChanServ sets mode +o elder
[8:05] <joshd1> winston-d: each ceph-mon has its own on-disk store (using leveldb these days)
[8:06] * mattt_ (~textual@94.236.7.190) has joined #ceph
[8:07] <winston-d> joshd1: thx, Josh.
[8:08] * mattt (~textual@94.236.7.190) Quit (Ping timeout: 480 seconds)
[8:08] * mattt_ is now known as mattt
[8:09] * dvanders (~dvanders@46.227.20.178) Quit (Quit: dvanders)
[8:12] * haomaiwang (~haomaiwan@106.38.255.123) has joined #ceph
[8:17] * dvanders (~dvanders@46.227.20.178) has joined #ceph
[8:19] * haomaiwa_ (~haomaiwan@219-87-173-15.static.tfn.net.tw) Quit (Ping timeout: 480 seconds)
[8:22] * schmee (~quassel@phobos.isoho.st) Quit (Remote host closed the connection)
[8:25] * dvanders (~dvanders@46.227.20.178) Quit (Ping timeout: 480 seconds)
[8:28] * rotbeard (~redbeard@b2b-94-79-138-170.unitymedia.biz) has joined #ceph
[8:28] * i_m (~ivan.miro@nat-5-carp.hcn-strela.ru) has joined #ceph
[8:30] * pvh_sa (~pvh@41.164.8.114) has joined #ceph
[8:37] * penguinRaider (~KiKo@14.139.82.6) Quit (Ping timeout: 480 seconds)
[8:40] * JCL (~JCL@2601:9:5980:39b:6ca3:31c6:1b2e:d7ea) has joined #ceph
[8:41] * JCL (~JCL@2601:9:5980:39b:6ca3:31c6:1b2e:d7ea) Quit ()
[8:44] <winston-d> is Sebastien around?
[8:44] <winston-d> sorry I forgot your irc handle.
[8:48] <winston-d> it seems ceph-deploy disk prepare/ osd activate doesn't save osd config to my ceph.conf, is that expected?
[8:48] * schmee (~quassel@phobos.isoho.st) has joined #ceph
[8:49] * _sileht (~sileht@gizmo.sileht.net) Quit (Quit: WeeChat 0.4.3)
[8:49] * sileht (~sileht@2a01:6600:8081:d6ff::feed:cafe) has joined #ceph
[8:49] * elder (~elder@n182z4l226.static.ctm.net) Quit (Quit: Leaving)
[8:50] <singler> winston-d: yes
[8:51] <singler> (I am not Sebastien)
[8:52] * ksingh (~Adium@2001:708:10:10:4027:d828:83c4:1c50) has joined #ceph
[8:52] <winston-d> singler: :) thx. so how can I dump the configuration of OSDs out of running OSD daemon? Or there is no need to save OSD configuration?
[8:54] * sputnik13 (~sputnik13@wsip-68-105-248-60.sd.sd.cox.net) Quit (Quit: Textual IRC Client: www.textualapp.com)
[8:57] <singler> I believe there is no need to save it. (unless for backups or disaster recovery)
[9:00] * dvanders (~dvanders@pb-d-128-141-237-118.cern.ch) has joined #ceph
[9:00] <winston-d> singler: that's new to me. haven't touched Ceph since Bobtail. thx.
[9:02] <singler> np
[9:05] * sputnik13 (~sputnik13@wsip-68-105-248-60.sd.sd.cox.net) has joined #ceph
[9:10] <winston-d> singler: i'm wondering if ceph-deploy is able to read config file as an input for adding OSDs, so that I don't have to write a script to do the same.
[9:10] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) has joined #ceph
[9:11] * steki (~steki@91.195.39.5) has joined #ceph
[9:17] * Sysadmin88 (~IceChat77@176.254.32.31) Quit (Read error: Connection reset by peer)
[9:17] <singler> winston-d: sorry, I do not know that
[9:22] <oblu-> hello. no builds for i386 from gitbuilder? http://gitbuilder.ceph.com/ceph-deb-wheezy-x86_64-basic/ref/master/dists/wheezy/main/binary-i386/Packages
[9:26] <oblu-> sad day for my archaic p3
[9:28] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) Quit (Quit: My MacBook has gone to sleep. ZZZzzz…)
[9:35] * haomaiwang (~haomaiwan@106.38.255.123) Quit (Remote host closed the connection)
[9:35] * haomaiwang (~haomaiwan@219-87-173-15.static.tfn.net.tw) has joined #ceph
[9:37] * elmo (~james@faun.canonical.com) Quit (Read error: Operation timed out)
[9:41] * fghaas (~florian@91-119-140-244.dynamic.xdsl-line.inode.at) has joined #ceph
[9:47] * andreask (~andreask@h081217067008.dyn.cm.kabsi.at) has joined #ceph
[9:47] * ChanServ sets mode +v andreask
[9:58] * ksingh1 (~Adium@2001:708:10:91:c10a:644c:5042:5e5d) has joined #ceph
[9:59] * yanzheng (~zhyan@134.134.139.72) Quit (Quit: Leaving)
[10:00] * shang (~ShangWu@219.250.81.130) has joined #ceph
[10:02] * wido__ (~wido@2a00:f10:121:100:4a5:76ff:fe00:199) Quit (Quit: No Ping reply in 180 seconds.)
[10:02] * morse (~morse@supercomputing.univpm.it) has joined #ceph
[10:02] * ksingh (~Adium@2001:708:10:10:4027:d828:83c4:1c50) Quit (Ping timeout: 480 seconds)
[10:02] * ksingh (~Adium@a-v6-0008.vpn.csc.fi) has joined #ceph
[10:08] * Cataglottism (~Cataglott@dsl-087-195-030-170.solcon.nl) has joined #ceph
[10:08] * wido (~wido@2a00:f10:121:100:4a5:76ff:fe00:199) has joined #ceph
[10:09] * ksingh1 (~Adium@2001:708:10:91:c10a:644c:5042:5e5d) Quit (Ping timeout: 480 seconds)
[10:15] * b0e (~aledermue@juniper1.netways.de) has joined #ceph
[10:22] * thb (~me@2a02:2028:281:6b60:6267:20ff:fec9:4e40) has joined #ceph
[10:23] * dvanders_ (~dvanders@dvanders-air.cern.ch) has joined #ceph
[10:25] * dvanders (~dvanders@pb-d-128-141-237-118.cern.ch) Quit (Ping timeout: 480 seconds)
[10:25] * dvanders_ is now known as dvanders
[10:36] * shang (~ShangWu@219.250.81.130) Quit (Read error: Connection reset by peer)
[10:36] * TMM (~hp@sams-office-nat.tomtomgroup.com) has joined #ceph
[10:39] * penguinRaider (~KiKo@14.139.82.6) has joined #ceph
[10:40] * gstaicu (~oftc-webi@remote-munich.teradata.com) has joined #ceph
[10:43] * nerdtron (~oftc-webi@202.60.8.250) Quit (Quit: Page closed)
[10:48] * LeaChim (~LeaChim@host86-166-182-74.range86-166.btcentralplus.com) has joined #ceph
[10:52] * Cataglottism (~Cataglott@dsl-087-195-030-170.solcon.nl) Quit (Quit: My Mac Pro has gone to sleep. ZZZzzz…)
[10:56] * fghaas (~florian@91-119-140-244.dynamic.xdsl-line.inode.at) Quit (Quit: Leaving.)
[11:01] * sputnik13 (~sputnik13@wsip-68-105-248-60.sd.sd.cox.net) Quit (Quit: My MacBook has gone to sleep. ZZZzzz…)
[11:03] * penguinRaider (~KiKo@14.139.82.6) Quit (Ping timeout: 480 seconds)
[11:05] * Midnightmyth (~quassel@93-167-84-102-static.dk.customer.tdc.net) has joined #ceph
[11:28] * BillK (~BillK-OFT@106-68-65-143.dyn.iinet.net.au) Quit (Ping timeout: 480 seconds)
[11:30] * BillK (~BillK-OFT@106-68-42-3.dyn.iinet.net.au) has joined #ceph
[11:30] * dvanders (~dvanders@dvanders-air.cern.ch) Quit (Ping timeout: 480 seconds)
[11:38] * BillK (~BillK-OFT@106-68-42-3.dyn.iinet.net.au) Quit (Ping timeout: 480 seconds)
[11:39] * BillK (~BillK-OFT@58-7-75-57.dyn.iinet.net.au) has joined #ceph
[11:49] * Kioob`Taff (~plug-oliv@local.plusdinfo.com) has joined #ceph
[11:52] * allsystemsarego (~allsystem@188.25.129.255) has joined #ceph
[11:58] * fdmanana (~fdmanana@bl13-134-213.dsl.telepac.pt) has joined #ceph
[12:07] * jbd_ (~jbd_@2001:41d0:52:a00::77) has joined #ceph
[12:08] * jbd_ (~jbd_@2001:41d0:52:a00::77) has left #ceph
[12:26] * ksingh1 (~Adium@2001:708:10:91:d8cf:e0d0:249:1d45) has joined #ceph
[12:27] * jbd_ (~jbd_@2001:41d0:52:a00::77) has joined #ceph
[12:28] * i_m (~ivan.miro@nat-5-carp.hcn-strela.ru) Quit (Quit: Leaving.)
[12:29] * glzhao (~glzhao@220.181.11.232) Quit (Quit: leaving)
[12:29] * ksingh (~Adium@a-v6-0008.vpn.csc.fi) Quit (Ping timeout: 480 seconds)
[12:36] * kitz_ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) has joined #ceph
[12:53] * Cataglottism (~Cataglott@dsl-087-195-030-170.solcon.nl) has joined #ceph
[12:56] * i_m (~ivan.miro@deibp9eh1--blueice2n2.emea.ibm.com) has joined #ceph
[12:56] * i_m (~ivan.miro@deibp9eh1--blueice2n2.emea.ibm.com) Quit ()
[12:56] * i_m (~ivan.miro@deibp9eh1--blueice4n2.emea.ibm.com) has joined #ceph
[13:12] * ircuser-1 (~ircuser-1@35.222-62-69.ftth.swbr.surewest.net) Quit (Ping timeout: 480 seconds)
[13:16] * ksingh1 (~Adium@2001:708:10:91:d8cf:e0d0:249:1d45) Quit (Quit: Leaving.)
[13:16] * ksingh (~Adium@2001:708:10:10:a0ca:8de1:b211:5dd1) has joined #ceph
[13:20] * rendar (~s@host83-181-dynamic.20-87-r.retail.telecomitalia.it) has joined #ceph
[13:27] * fghaas (~florian@91-119-140-244.dynamic.xdsl-line.inode.at) has joined #ceph
[13:30] * kitz_ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) Quit (Quit: kitz_)
[13:31] * penguinRaider (~KiKo@14.139.82.6) has joined #ceph
[13:32] * KevinPerks (~Adium@cpe-066-026-252-218.triad.res.rr.com) has joined #ceph
[13:34] * b0e1 (~aledermue@juniper1.netways.de) has joined #ceph
[13:38] * b0e (~aledermue@juniper1.netways.de) Quit (Ping timeout: 480 seconds)
[13:40] * penguinRaider (~KiKo@14.139.82.6) Quit (Ping timeout: 480 seconds)
[13:43] * kitz_ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) has joined #ceph
[13:46] * kitz_ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) Quit ()
[13:57] * ircuser-1 (~ircuser-1@35.222-62-69.ftth.swbr.surewest.net) has joined #ceph
[14:11] * hasues (~hazuez@108-236-232-243.lightspeed.knvltn.sbcglobal.net) has joined #ceph
[14:11] * sekas (~oftc-webi@213.244.168.131) has joined #ceph
[14:15] <sekas> Yo guys. I created a 2-node cluster initially (72.2) with 2 monitors and 28 OSDs and this is all working well, but as i added the 3rd monitor following the guide, this monitor does not come back up after rebooting it. Everything else still works. Following your troubleshooting guide to restart the monitor, i get this error:
[14:15] <sekas> etc/init.d/ceph restart mon.bauta2 /etc/init.d/ceph: mon.bauta2 not found (/etc/ceph/ceph.conf defines mon.bauta osd.1 osd.27 osd.16 osd.3 osd.7 osd.18 osd.14 osd.0 osd.19 osd.12 osd.11 osd.20 osd.23 osd.2 osd.10 osd.13 osd.4 osd.24 osd.21 osd.25 osd.22 osd.9 osd.17 osd.26 osd.5 osd.6 osd.15 osd.8 , /var/lib/ceph defines mon.bauta osd.1 osd.27 osd.16 osd.3 osd.7 osd.18 osd.14 osd.0 osd.19 osd.12 osd.11 osd.20 osd.23 osd.2 osd.10 osd.13 osd.4 osd.24 osd.21 o
[14:16] <sekas> Everything else works fine. I run 2 monitors on different ports on the same host.
[14:16] <sekas> It worked fine when i imported a monmap, the keyring and so forth, it was up and in quorum
[14:18] <sekas> Seems to me as if the monitor is somehow missing from the ceph.conf, but with 72.2 almost nothing is in the config anymore
[14:19] * wschulze (~wschulze@cpe-72-229-37-201.nyc.res.rr.com) has joined #ceph
[14:21] <sekas> i.e with 72.2, if a monitor is not added while the cluster is created, it will not be added to the sysv scripts?
[14:22] * sekon (~harish@li291-152.members.linode.com) has left #ceph
[14:26] * valeech (~valeech@pool-71-171-123-210.clppva.fios.verizon.net) has joined #ceph
[14:26] * JeffK (~JeffK@38.99.52.10) Quit (Read error: Connection reset by peer)
[14:26] * JeffK (~JeffK@38.99.52.10) has joined #ceph
[14:27] <Gugge-47527> the upstart scripts will start the monitor with the same name as the host
[14:27] <Gugge-47527> if it exists in /var/lib/ceph
[14:27] <Gugge-47527> as far as i remember
[14:31] <sekas> I only have two hosts, and this worked well on 62.x? There must be some way to add a monitor to the configs manually, right?
[14:35] <sekas> I see both mons in /var/lib/ceph/mon
[14:35] <sekas> ceph-bauta/ ceph-bauta2/
[14:35] <alfredodeza> there are a few steps to get a new monitor in sekas
[14:35] <alfredodeza> have you gone through the docs?
[14:36] <alfredodeza> sekas: http://ceph.com/docs/master/rados/operations/add-or-rm-mons/#adding-a-monitor-manual
[14:36] <sekas> It's not a new monitor actually. i created the cluster with 2 monitors initially, then during the same day i added the 3rd, which worked fine until the system was rebooted
[14:36] <alfredodeza> ah ok
[14:37] <alfredodeza> you need to make sure it is part of the monmap
[14:37] <alfredodeza> have you checked if that is (or is not) the case?
[14:37] <sekas> I did do all that :) And for sure i can dump the monmap and keyrings again and end up in this situation on next reboot once more, but there must be a better solution.
[14:37] * sroy (~sroy@207.96.182.162) has joined #ceph
[14:37] * b0e1 (~aledermue@juniper1.netways.de) Quit (Quit: Leaving.)
[14:38] <sekas> It was in the monmap for 2 weeks, and ceph status claims the monitor is down, so i would say so..
[14:39] <sekas> It's just the init script that doesn't seem to be aware.
[14:43] * b0e (~aledermue@juniper1.netways.de) has joined #ceph
[14:43] * BillK (~BillK-OFT@58-7-75-57.dyn.iinet.net.au) Quit (Ping timeout: 480 seconds)
[14:43] * dlq84 (~dlq84@h-199-142.a137.corp.bahnhof.se) has joined #ceph
[14:46] * BillK (~BillK-OFT@58-7-51-109.dyn.iinet.net.au) has joined #ceph
[14:48] <alfredodeza> I see
[14:50] <dlq84> Hey people! I have a question. When we add OSDs, we encounter performance loss (which is expected), but objects also seem to be "unreachable": RBD reads block for long enough that it affects our service and requests time out. Any ideas on how to lower the performance impact of adding more OSDs (remapped+backfilling)? Would I gain from having more PGs, maybe?
[14:51] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has joined #ceph
[14:53] * acaos_ (~zac@209.99.103.42) has joined #ceph
[14:53] * acaos (~zac@209.99.103.42) Quit (Read error: Connection reset by peer)
[14:53] * t0rn (~ssullivan@2607:fad0:32:a02:d227:88ff:fe02:9896) has joined #ceph
[14:56] * marrusl (~mark@209-150-43-182.c3-0.wsd-ubr2.qens-wsd.ny.cable.rcn.com) has joined #ceph
[14:56] * Elbandi (~ea333@elbandi.net) Quit (Ping timeout: 480 seconds)
[14:59] * Elbandi (~ea333@elbandi.net) has joined #ceph
[15:01] <sekas> dlq84: Im not part of the ops here but i might be able to help you.. How many nodes do you have and how many replicas do you keep?
[15:03] <dlq84> When I started I had 6 nodes, and expanding to 9
[15:03] <dlq84> I added them one by one
[15:04] <dlq84> 3 replicas (min_size 2 though)
[15:04] <sekas> Slow drives?
[15:04] * jcsp (~Adium@0001bf3a.user.oftc.net) has joined #ceph
[15:04] <dlq84> Yeah, EC2 ephemeral
[15:06] <sekas> Okay, large drives or smaller? Whats the throughput you get on a dd to any of the drives?
[15:06] <dlq84> The disks are 410GB each, I have to get back with throughput, hold on
[15:06] <sekas> I use 4tb/7200rpm drives and adding a drive can take quite a lot of time when it starts replicating.
[15:07] <sekas> Especially if you already have a load on the cluster.
[15:08] <sekas> Numbers are not important, just a guess is fine for me to try to compare it to what i see..
[15:10] <sekas> But ec2 isnt fast, high access time could result in very slow replication
[15:10] <dlq84> ran dd if=/dev/zero of=/dev/xvdb2 bs=1M count=500 conv=fdatasync
[15:10] <dlq84> three times and gave me around 70MB/s
[15:10] <sekas> Fair enough
[15:10] <sekas> Not bad, so that's basically what my drives give me.
[15:11] <Gugge-47527> sekas: what happens if you start the extra monitor manually with /usr/bin/ceph-mon --cluster=ceph -i bauta2 -f
[15:11] <dlq84> yeah, do you know if reads/writes block on a PG while it's being remapped?
[15:11] <sekas> Gugge-47527: IO error: lock /var/lib/ceph/mon/ceph-bauta2/store.db/LOCK: Resource temporarily unavailable 2014-03-03 15:11:24.851640 7fcf476647c0 -1 failed to create new leveldb store
[15:12] <Gugge-47527> strange :)
[15:12] <dlq84> adding more PGs will make it block less amount of time maybe?
[15:12] <sekas> Should i just drop the monitor and recreate it, have a feeling i might have f-ed it up when testing to recreate the monitor today
[15:12] <Gugge-47527> dlq84: the pg is not blocked while its backfilling
[15:13] <dlq84> ok good
[15:13] <Gugge-47527> sekas: i would try that, and then i would start it manually on boot, because the init scripts don't automatically start a mon with that name
[15:14] <sekas> Okay, but as i have 2 monitors on one host that will have to happen on every reboot?
[15:14] <Gugge-47527> yes
[15:15] <Gugge-47527> you are aware its a bad idea having two mons on one host right? :)
[15:15] <dlq84> (also having only 2 mons is bad, since they can't reach quorum)
[15:15] <Gugge-47527> sure they can
[15:15] <Gugge-47527> but only when both are up
[15:16] * marrusl (~mark@209-150-43-182.c3-0.wsd-ubr2.qens-wsd.ny.cable.rcn.com) Quit (Remote host closed the connection)
[15:16] <Gugge-47527> having two on one host leaves you with a single surviving mon, which cannot form quorum, when that host crashes
[15:16] <dlq84> ok, I interpreted the docs wrong then ;)
[15:16] <Gugge-47527> yes
[15:16] <Gugge-47527> quorum requires over half the monitors up
[15:16] <Gugge-47527> 2 of 2 is over half
[15:16] <dlq84> yes
[15:17] <dlq84> true
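The rule Gugge-47527 states above (quorum needs more than half of the monitors up) can be sketched in a few lines of Python. The function names are illustrative only and are not part of Ceph.

```python
def has_quorum(mons_up, mons_total):
    """True if strictly more than half of the monitors are up."""
    return mons_up * 2 > mons_total


def tolerable_failures(mons_total):
    """How many monitors can be down while a majority still remains."""
    return (mons_total - 1) // 2
```

With 2 monitors both must be up, so a 2-monitor cluster tolerates no failures; 3 monitors tolerate one failure, which is why odd monitor counts are the usual choice.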
[15:17] <sekas> Gugge-47527: Painfully aware :) Im buying a few more nodes the next week so it will have to work until then.
[15:17] * marrusl (~mark@209-150-43-182.c3-0.wsd-ubr2.qens-wsd.ny.cable.rcn.com) has joined #ceph
[15:20] * japuzzo (~japuzzo@pok2.bluebird.ibm.com) has joined #ceph
[15:20] <sekas> This is interesting.. Before i deleted the old monitor i deleted the lock file and restarted it, and lo and behold: monmap e2: 3 mons at {bauta=10.139.24.1:6789/0,bauta2=10.139.24.1:6790/0,megalith=10.139.24.2:6789/0}, election epoch 36, quorum 0,1,2 bauta,bauta2,megalith
[15:21] <Gugge-47527> great :)
[15:21] <sekas> Mysterious and great :)
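The monmap line sekas pasted shows two monitors sharing one IP address. A small Python sketch, assuming the `{name=ip:port/nonce,...}` layout seen in that line, groups monitor names by address to make co-located monitors easy to spot. The function name `mons_by_host` is hypothetical.

```python
from collections import defaultdict


def mons_by_host(monmap_summary):
    """Group monitor names by IP from a '{name=ip:port/nonce,...}' string."""
    inner = monmap_summary.strip().strip("{}")
    hosts = defaultdict(list)
    for entry in inner.split(","):
        name, addr = entry.split("=", 1)
        # Drop the trailing ':port/nonce' part, keeping only the address.
        ip = addr.rsplit(":", 1)[0]
        hosts[ip].append(name)
    return dict(hosts)
```

Any address that maps to more than one name is a host whose failure takes several monitors down at once.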
[15:22] <sekas> Doing a ceph status looks great but if i do a /etc/init.d/ceph status it only lists the first monitor still.
[15:23] <Gugge-47527> yes
[15:23] <Gugge-47527> the init script only works with the mon named the same as your host
[15:23] <Gugge-47527> that is why i said you should start it manually on boot
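Gugge-47527's explanation can be illustrated with a short Python sketch. It takes his description at face value (only the monitor whose id equals the hostname is started automatically) and assumes the data directories are named `<cluster>-<mon id>`, as in the `ceph-bauta/ ceph-bauta2/` listing earlier in the log. The hostname `bauta` and both function names are assumptions; the manual start command is the one quoted at 15:11.

```python
def split_mons(mon_dirs, hostname, cluster="ceph"):
    """Return (auto_started, manual) monitor ids for one host.

    Assumes the init script only picks up the monitor whose id equals
    the hostname, as described in the log; not verified against Ceph.
    """
    prefix = cluster + "-"
    ids = [d[len(prefix):] for d in mon_dirs if d.startswith(prefix)]
    auto = [i for i in ids if i == hostname]
    manual = [i for i in ids if i != hostname]
    return auto, manual


def manual_start_command(mon_id, cluster="ceph"):
    """Build the foreground ceph-mon command line quoted in the log."""
    return ["/usr/bin/ceph-mon", "--cluster=" + cluster, "-i", mon_id, "-f"]
```

For the two directories from the log and a host called `bauta`, `bauta2` lands in the manual list, which matches the behaviour sekas reports.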
[15:24] <sekas> can i fool it by using a server alias or something? Not that i think i will have to reboot the host again before i get the new pods, but..
[15:24] <Gugge-47527> you could rewrite the script to support more monitors
[15:25] <sekas> Cool, ill have a little look and see if i can ugly hack it. Thanks mate
[15:26] * andreask (~andreask@h081217067008.dyn.cm.kabsi.at) has left #ceph
[15:30] <sekas> Gugge-47527: just did a who is on you and saw your realname.. You didnt happen to work for cybercity in Norreport around 1999/2000?
[15:30] * pvh_sa (~pvh@41.164.8.114) Quit (Ping timeout: 480 seconds)
[15:31] * markbby (~Adium@168.94.245.2) has joined #ceph
[15:32] <sekas> I worked there too, hence me askin.
[15:32] * capri (~capri@212.218.127.222) Quit (Read error: Connection reset by peer)
[15:34] * fghaas (~florian@91-119-140-244.dynamic.xdsl-line.inode.at) has left #ceph
[15:36] <sekas> Anyhow... later
[15:36] * sekas (~oftc-webi@213.244.168.131) Quit ()
[15:43] * markbby (~Adium@168.94.245.2) Quit (Quit: Leaving.)
[15:46] * markbby (~Adium@168.94.245.3) has joined #ceph
[15:58] * thb (~me@0001bd58.user.oftc.net) Quit (Quit: Leaving.)
[16:02] * ismell (~ismell@host-64-17-89-79.beyondbb.com) has joined #ceph
[16:03] * Cataglottism (~Cataglott@dsl-087-195-030-170.solcon.nl) Quit (Quit: My Mac Pro has gone to sleep. ZZZzzz…)
[16:05] * i_m (~ivan.miro@deibp9eh1--blueice4n2.emea.ibm.com) Quit (Quit: Leaving.)
[16:07] * dmsimard (~Adium@108.163.152.2) has joined #ceph
[16:12] * hasues1 (~hasues@kwfw01.scrippsnetworksinteractive.com) has joined #ceph
[16:15] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) has joined #ceph
[16:17] * Cataglottism (~Cataglott@dsl-087-195-030-170.solcon.nl) has joined #ceph
[16:23] * glambert (~glambert@37.157.50.80) has joined #ceph
[16:23] <glambert> Hi, I've got this randomly on ceph health detail:
[16:23] <glambert> pg 16.57 is active+clean+inconsistent, acting [1,2]
[16:23] <glambert> 1 scrub errors
[16:23] <glambert> root@st003:~#
[16:24] <glambert> I've tried a scrub and deep-scrub on 16.57
[16:24] <glambert> neither fixed the problem
[16:25] <glambert> any ideas?
[16:26] <glambert> just issued a repair, see if that sorts it
[16:27] <glambert> repair fixed it
[16:27] <glambert> not sure what the problem was there
[16:33] * i_m (~ivan.miro@nat-5-carp.hcn-strela.ru) has joined #ceph
[16:34] * i_m (~ivan.miro@nat-5-carp.hcn-strela.ru) Quit ()
[16:34] * i_m (~ivan.miro@nat-5-carp.hcn-strela.ru) has joined #ceph
[16:44] * zerick (~eocrospom@190.187.21.53) has joined #ceph
[16:49] * dgbaley27 (~matt@c-76-120-64-12.hsd1.co.comcast.net) has joined #ceph
[16:54] <Gugge-47527> glambert: somehow it did not write the same to osd 1 and 2
[16:54] <Gugge-47527> glambert: bad hardware is my guess
[16:54] <glambert> Gugge-47527, just the one error in months, must be a one off I guess
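The `ceph health detail` lines glambert pasted have a regular shape. A minimal Python sketch, assuming exactly the line format shown in the log, pulls out the placement groups flagged as inconsistent so that each id can be handed to a repair (glambert's repair would be `ceph pg repair 16.57`). The function name `inconsistent_pgs` is hypothetical.

```python
import re

# Matches lines such as: pg 16.57 is active+clean+inconsistent, acting [1,2]
PG_LINE = re.compile(r"^pg (\S+) is (\S+), acting \[([\d,]*)\]")


def inconsistent_pgs(health_detail):
    """Return (pgid, acting_osds) for each PG whose state includes 'inconsistent'."""
    found = []
    for line in health_detail.splitlines():
        m = PG_LINE.match(line.strip())
        if not m:
            continue
        pgid, state, acting = m.groups()
        if "inconsistent" in state.split("+"):
            osds = [int(x) for x in acting.split(",") if x]
            found.append((pgid, osds))
    return found
```

The acting set identifies the OSDs holding the replicas that disagree, which is where a hardware check like the one Gugge-47527 suggests would start.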
[16:57] * ivotron (~ivotron@2601:9:2700:178:2d2e:7143:5260:2622) Quit (Remote host closed the connection)
[16:58] * Discovery (~Discovery@109.235.55.213) has joined #ceph
[17:00] * toutour (~toutour@causses.idest.org) Quit (Ping timeout: 480 seconds)
[17:04] * joao (~joao@a79-168-11-205.cpe.netcabo.pt) Quit (Quit: Leaving)
[17:05] * Cataglottism (~Cataglott@dsl-087-195-030-170.solcon.nl) Quit (Ping timeout: 480 seconds)
[17:06] * dmsimard1 (~Adium@70.38.0.246) has joined #ceph
[17:07] * Cataglottism (~Cataglott@dsl-087-195-030-184.solcon.nl) has joined #ceph
[17:07] * bens (~ben@c-71-231-52-111.hsd1.wa.comcast.net) Quit (Quit: Changing server)
[17:08] * sprachgenerator (~sprachgen@130.202.135.203) has joined #ceph
[17:09] * joao (~joao@a79-168-11-205.cpe.netcabo.pt) has joined #ceph
[17:09] * ChanServ sets mode +o joao
[17:11] * dmsimard2 (~Adium@108.163.152.66) has joined #ceph
[17:12] <kitz> I created journal buckets in crush and placed my OSDs in them. When I rebooted one of my hosts all of the OSDs were moved out of the journal buckets and back to the host buckets. Any idea why?
[17:13] * dmsimard (~Adium@108.163.152.2) Quit (Ping timeout: 480 seconds)
[17:13] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) Quit (Ping timeout: 480 seconds)
[17:16] * dmsimard1 (~Adium@70.38.0.246) Quit (Ping timeout: 480 seconds)
[17:20] * joef (~Adium@2620:79:0:131:c860:358a:ef42:4b77) Quit (Quit: Leaving.)
[17:20] * steki (~steki@91.195.39.5) Quit (Ping timeout: 480 seconds)
[17:20] * joef (~Adium@2620:79:0:131:fcd5:b939:26a9:9fbb) has joined #ceph
[17:22] * ksingh (~Adium@2001:708:10:10:a0ca:8de1:b211:5dd1) has left #ceph
[17:22] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has joined #ceph
[17:22] * dgbaley27 (~matt@c-76-120-64-12.hsd1.co.comcast.net) Quit (Quit: Leaving.)
[17:25] <jerker> anyone been testing netatalk to cephfs?
[17:26] <jerker> timemachine in particular
[17:26] <jerker> it would be such a sweet distributed time machine host, especially together with btrfs/zfs for compression.
[17:27] <jerker> i have read ppl running a mac server on the block device but running native cephfs would be cooler.
[17:30] * Cataglottism (~Cataglott@dsl-087-195-030-184.solcon.nl) Quit (Quit: My Mac Pro has gone to sleep. ZZZzzz…)
[17:35] * simulx (~simulx@66-194-114-178.static.twtelecom.net) Quit (Ping timeout: 480 seconds)
[17:36] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) Quit (Quit: My MacBook has gone to sleep. ZZZzzz…)
[17:36] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) has joined #ceph
[17:36] * nwat (~textual@c-50-131-197-174.hsd1.ca.comcast.net) Quit ()
[17:37] * mnash (~chatzilla@vpn.expressionanalysis.com) Quit (Ping timeout: 480 seconds)
[17:38] * simulx (~simulx@vpn.expressionanalysis.com) has joined #ceph
[17:46] * glambert (~glambert@37.157.50.80) Quit (Quit: <?php exit(); ?>)
[17:47] * mnash (~chatzilla@vpn.expressionanalysis.com) has joined #ceph
[17:47] * alram (~alram@38.122.20.226) has joined #ceph
[17:58] * mattt (~textual@94.236.7.190) Quit (Read error: Operation timed out)
[17:58] * markbby (~Adium@168.94.245.3) Quit (Remote host closed the connection)
[18:00] * xarses_ (~andreww@c-24-23-183-44.hsd1.ca.comcast.net) Quit (Ping timeout: 480 seconds)
[18:02] * b0e (~aledermue@juniper1.netways.de) Quit (Remote host closed the connection)
[18:04] * ircolle (~Adium@2601:1:8380:2d9:cd3e:120f:5693:553d) has joined #ceph
[18:06] * ivotron (~ivotron@dhcp-59-219.cse.ucsc.edu) has joined #ceph
[18:13] * nwat (~textual@eduroam-241-123.ucsc.edu) has joined #ceph
[18:18] * thb (~me@port-32521.pppoe.wtnet.de) has joined #ceph
[18:18] * thb is now known as Guest2059
[18:19] * Guest2059 is now known as thb
[18:19] * bandrus (~Adium@adsl-75-5-250-121.dsl.scrm01.sbcglobal.net) has joined #ceph
[18:22] * yuriw1 is now known as yuriw
[18:30] * JCL (~JCL@2601:9:5980:39b:5dd1:36e8:d869:d82a) has joined #ceph
[18:52] * toutour (~toutour@causses.idest.org) has joined #ceph
[18:53] * xarses_ (~andreww@12.164.168.117) has joined #ceph
[18:53] * rotbeard (~redbeard@b2b-94-79-138-170.unitymedia.biz) Quit (Quit: Leaving)
[18:53] * kaizh (~kaizh@128-107-239-234.cisco.com) has joined #ceph
[18:54] * davidzlap (~Adium@cpe-23-242-31-175.socal.res.rr.com) has joined #ceph
[18:56] * bdonnahue2 (~James@24-148-64-18.c3-0.mart-ubr2.chi-mart.il.cable.rcn.com) has left #ceph
[18:59] * bandrus (~Adium@adsl-75-5-250-121.dsl.scrm01.sbcglobal.net) Quit (Ping timeout: 480 seconds)
[18:59] * xmltok (~xmltok@216.103.134.250) has joined #ceph
[19:03] * Sysadmin88 (~IceChat77@176.254.32.31) has joined #ceph
[19:03] * TMM (~hp@sams-office-nat.tomtomgroup.com) Quit (Quit: Ex-Chat)
[19:07] * bandrus (~Adium@adsl-75-5-250-121.dsl.scrm01.sbcglobal.net) has joined #ceph
[19:08] * odyssey4me (~odyssey4m@41-132-198-203.dsl.mweb.co.za) has joined #ceph
[19:08] * mtanski (~mtanski@69.193.178.202) has joined #ceph
[19:08] * dmsimard (~Adium@108.163.152.2) has joined #ceph
[19:13] * pvh_sa (~pvh@41-133-202-127.dsl.mweb.co.za) has joined #ceph
[19:13] * marrusl (~mark@209-150-43-182.c3-0.wsd-ubr2.qens-wsd.ny.cable.rcn.com) Quit (Remote host closed the connection)
[19:14] * WarrenUsui (~Warren@2607:f298:a:607:c5a4:40bb:68ee:c717) has joined #ceph
[19:14] * dmsimard2 (~Adium@108.163.152.66) Quit (Ping timeout: 480 seconds)
[19:20] * mtanski (~mtanski@69.193.178.202) Quit (Quit: mtanski)
[19:20] * Sysadmin88 (~IceChat77@176.254.32.31) Quit (Quit: When the chips are down, well, the buffalo is empty)
[19:21] * warrenSusui (~Warren@2607:f298:a:607:38fc:445b:1848:70d4) Quit (Ping timeout: 480 seconds)
[19:21] <winston-d> how can i start/stop service that uses custom cluster name (instead of 'ceph')?
[19:22] <winston-d> /etc/init.d/ceph has 'ceph' as cluster name hard coded.
[19:23] <winston-d> btw, i'm using 0.67.7 on ubuntu 12.04
[19:25] <joshd1> upstart scripts don't hardcode it (sysv init were just fixed recently)
[19:28] <joshd1> winston-d: something like 'service ceph start cluster=name' to use upstart
[19:28] * dmsimard1 (~Adium@108.163.152.66) has joined #ceph
[19:29] * dmsimard (~Adium@108.163.152.2) Quit (Read error: Operation timed out)
[19:31] <winston-d> joshd1: well, look at: http://paste.openstack.org/show/71841/ line 5
[19:32] * andreask (~andreask@h081217067008.dyn.cm.kabsi.at) has joined #ceph
[19:32] * ChanServ sets mode +v andreask
[19:32] * jbd_ (~jbd_@2001:41d0:52:a00::77) has left #ceph
[19:32] * andreask (~andreask@h081217067008.dyn.cm.kabsi.at) has left #ceph
[19:33] <winston-d> joshd1: but 'stop ceph-osd cluster=XXX' does work. thx
[19:33] * houkouonchi-home (~linux@2001:470:c:c69::2) has joined #ceph
[19:33] <joshd1> winston-d: sorry, that's sysv init again. I meant to say initctl start ceph cluster=xxx
[19:34] * xmltok (~xmltok@216.103.134.250) Quit (Quit: Bye!)
[19:35] * xmltok (~xmltok@216.103.134.250) has joined #ceph
[19:36] * dmsimard (~Adium@108.163.152.2) has joined #ceph
[19:36] * mtanski (~mtanski@69.193.178.202) has joined #ceph
[19:36] * nwat (~textual@eduroam-241-123.ucsc.edu) Quit (Quit: My MacBook has gone to sleep. ZZZzzz…)
[19:36] <winston-d> joshd1: is there a '-a' argument for upstart script like sysv init that I can use to apply same action to all services on all hosts?
[19:37] <joshd1> I think it's just local
[19:39] <joshd1> it's a small patch to make the sysv init scripts behave better if you want that behavior still: https://github.com/ceph/ceph/pull/1292/files
[19:39] <winston-d> hmm, sysv init fix will be in next minor version upgrade to Dumpling? and when will that be?
[19:40] * Cataglottism (~Cataglott@dsl-087-195-030-184.solcon.nl) has joined #ceph
[19:40] * dmsimard1 (~Adium@108.163.152.66) Quit (Ping timeout: 480 seconds)
[19:40] <joshd1> not sure exactly, but so far there aren't a lot of new fixes for dumpling that would merit another point release
[19:41] <joshd1> I'd guess another couple months
[19:42] * stewiem20001 (~stewiem20@195.10.250.233) Quit (Quit: Leaving.)
[19:42] * dmsimard1 (~Adium@70.38.0.246) has joined #ceph
[19:43] * Pedras (~Adium@216.207.42.132) has joined #ceph
[19:44] * dmsimard (~Adium@108.163.152.2) Quit (Read error: Connection reset by peer)
[19:44] * dpippenger1 (~riven@cpe-198-72-157-189.socal.res.rr.com) Quit (Quit: Leaving.)
[19:48] * dereky (~derek@pool-71-114-104-38.washdc.fios.verizon.net) Quit (Quit: dereky)
[19:48] * gregsfortytwo1 (~Adium@2607:f298:a:607:7d47:afc1:51fa:6088) Quit (Quit: Leaving.)
[19:49] * dereky (~derek@pool-71-114-104-38.washdc.fios.verizon.net) has joined #ceph
[19:49] * melodous (~melodous@31.49.11.37.dynamic.jazztel.es) Quit (Read error: No route to host)
[19:50] * melodous (~melodous@31.49.11.37.dynamic.jazztel.es) has joined #ceph
[19:51] * rotbeard (~redbeard@2a02:908:df10:6f00:76f0:6dff:fe3b:994d) has joined #ceph
[19:52] * gregsfortytwo (~Adium@2607:f298:a:607:3d79:585d:5557:ffbc) has joined #ceph
[19:55] * dereky (~derek@pool-71-114-104-38.washdc.fios.verizon.net) Quit (Quit: dereky)
[19:55] <winston-d> joshd1: thx, i'll take that patch and fix it myself for now.
[19:56] * dereky (~derek@pool-71-114-104-38.washdc.fios.verizon.net) has joined #ceph
[19:57] * angdraug (~angdraug@12.164.168.117) has joined #ceph
[20:00] * JC (~JC@2601:9:5980:39b:adfe:7250:8bfb:b776) has joined #ceph
[20:00] * nwat (~textual@eduroam-241-123.ucsc.edu) has joined #ceph
[20:00] * diegows (~diegows@190.190.5.238) has joined #ceph
[20:02] * pvh_sa (~pvh@41-133-202-127.dsl.mweb.co.za) Quit (Ping timeout: 480 seconds)
[20:03] * dlq84 (~dlq84@h-199-142.a137.corp.bahnhof.se) Quit (Quit: Leaving)
[20:03] * pvh_sa (~pvh@41-133-202-127.dsl.mweb.co.za) has joined #ceph
[20:05] * Cube (~Cube@12.248.40.138) has joined #ceph
[20:10] * dmsimard (~Adium@108.163.152.2) has joined #ceph
[20:10] * dmsimard1 (~Adium@70.38.0.246) Quit (Read error: Connection reset by peer)
[20:19] * JC (~JC@2601:9:5980:39b:adfe:7250:8bfb:b776) Quit (Quit: Leaving.)
[20:22] * dereky (~derek@pool-71-114-104-38.washdc.fios.verizon.net) Quit (Quit: dereky)
[20:22] * dereky (~derek@pool-71-114-104-38.washdc.fios.verizon.net) has joined #ceph
[20:24] * zidarsk8 (~zidar@89-212-142-10.dynamic.t-2.net) has joined #ceph
[20:25] * zidarsk8 (~zidar@89-212-142-10.dynamic.t-2.net) has left #ceph
[20:25] * mnash_ (~chatzilla@vpn.expressionanalysis.com) has joined #ceph
[20:26] * simulx2 (~simulx@66-194-114-178.static.twtelecom.net) has joined #ceph
[20:29] * simulx (~simulx@vpn.expressionanalysis.com) Quit (Ping timeout: 480 seconds)
[20:31] * mnash (~chatzilla@vpn.expressionanalysis.com) Quit (Ping timeout: 480 seconds)
[20:31] * mnash_ is now known as mnash
[20:32] * marrusl (~mark@209-150-43-182.c3-0.wsd-ubr2.qens-wsd.ny.cable.rcn.com) has joined #ceph
[20:42] * koleosfuscus (~koleosfus@77.47.66.235.dynamic.cablesurf.de) has joined #ceph
[20:56] * JCL (~JCL@2601:9:5980:39b:5dd1:36e8:d869:d82a) Quit (Quit: Leaving.)
[20:56] * JCL (~JCL@2601:9:5980:39b:5dd1:36e8:d869:d82a) has joined #ceph
[21:01] * markbby (~Adium@168.94.245.1) has joined #ceph
[21:05] * bens (~ben@c-71-231-52-111.hsd1.wa.comcast.net) has joined #ceph
[21:05] <bens> February is over, no more sir-mix-alot lyrics. this month is bon jovi
[21:05] * Gamekiller77 (~Gamekille@128-107-239-235.cisco.com) has joined #ceph
[21:06] <bens> ( ???? ???? ????)
[21:06] * kaizh (~kaizh@128-107-239-234.cisco.com) Quit (Read error: Connection reset by peer)
[21:06] * kaizh (~kaizh@128-107-239-233.cisco.com) has joined #ceph
[21:10] * scuttlemonkey (~scuttlemo@c-107-5-193-244.hsd1.mi.comcast.net) has joined #ceph
[21:10] * ChanServ sets mode +o scuttlemonkey
[21:13] * kaizh (~kaizh@128-107-239-233.cisco.com) Quit (Remote host closed the connection)
[21:22] * dpippenger (~riven@66-192-9-78.static.twtelecom.net) has joined #ceph
[21:24] * mtanski (~mtanski@69.193.178.202) Quit (Quit: mtanski)
[21:26] * kaizh (~kaizh@128-107-239-234.cisco.com) has joined #ceph
[21:38] * mattt (~textual@cpc9-rdng20-2-0-cust565.15-3.cable.virginm.net) has joined #ceph
[21:43] * rotbeard (~redbeard@2a02:908:df10:6f00:76f0:6dff:fe3b:994d) Quit (Quit: Verlassend)
[21:44] * mtanski (~mtanski@69.193.178.202) has joined #ceph
[21:46] * BManojlovic (~steki@fo-d-130.180.254.37.targo.rs) has joined #ceph
[21:52] * mattt_ (~textual@92.52.76.140) has joined #ceph
[21:55] * mattt (~textual@cpc9-rdng20-2-0-cust565.15-3.cable.virginm.net) Quit (Ping timeout: 480 seconds)
[21:55] * mattt_ is now known as mattt
[21:55] * markbby (~Adium@168.94.245.1) Quit (Remote host closed the connection)
[21:56] * andreask (~andreask@h081217067008.dyn.cm.kabsi.at) has joined #ceph
[21:56] * ChanServ sets mode +v andreask
[21:56] * andreask (~andreask@h081217067008.dyn.cm.kabsi.at) has left #ceph
[22:01] * Cataglottism (~Cataglott@dsl-087-195-030-184.solcon.nl) Quit (Quit: My Mac Pro has gone to sleep. ZZZzzz…)
[22:07] * Sysadmin88 (~IceChat77@176.254.32.31) has joined #ceph
[22:17] * MarkN (~nathan@142.208.70.115.static.exetel.com.au) has joined #ceph
[22:18] * MarkN (~nathan@142.208.70.115.static.exetel.com.au) has left #ceph
[22:24] * ptone (~ptone@ip98-171-189-30.sb.sd.cox.net) has joined #ceph
[22:26] * mtanski (~mtanski@69.193.178.202) Quit (Quit: mtanski)
[22:27] <ptone> just getting started with ceph - any ideas on how to get a more helpful error than this? http://bpaste.net/show/pGWtMZPumOnd2hREoCUI/
[22:29] * allsystemsarego (~allsystem@188.25.129.255) Quit (Quit: Leaving)
[22:29] <dmick> ptone: my gut feeling is that you can ignore that error; alfredodeza ? ^
[22:29] <ptone> yeah - rerunning it seemed to work
[22:29] <alfredodeza> ptone: dmick is right, that is safe to ignore
[22:29] <ptone> the ceph.conf got created on the node
[22:30] <alfredodeza> that would not create a ceph.conf on the remote node though
[22:30] <ptone> OK - I guess it was created by ceph-deploy tool install then
[22:30] <alfredodeza> it is possible, yes
[22:31] <alfredodeza> I was not aware you had run that beforehand
[22:31] <ptone> I'm just working through: http://ceph.com/docs/master/start/quick-start-preflight/#ceph-node-setup
[22:31] <alfredodeza> right right I was just not sure what you had run :)
[22:31] <ptone> Oh - I thought you needed to run that on each of the base nodes - but I can see now it is sort of ansible like in design - just through SSH
[22:32] <alfredodeza> yes it is all through SSH
[22:32] <ptone> so to clarify - the heading section "For other Ceph Nodes (and for initial monitors prior to ceph-deploy v1.1.3) perform the following steps:"
[22:32] <ptone> applies to all nodes beyond the first one, even with the current ceph deploy?
[22:33] <alfredodeza> the current ceph-deploy does them all at once
[22:33] <alfredodeza> that is, when you are using 'mon create-initial'
[22:33] <alfredodeza> so it will look at ceph.conf and deploy all of them for you
[22:33] <alfredodeza> then it will wait for them to form quorum
[22:34] <alfredodeza> and finally if they have formed quorum it will gatherkeys
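The 'mon create-initial' sequence alfredodeza describes (deploy every monitor listed in ceph.conf, wait for quorum, then gather keys) can be sketched roughly as below. This is only an illustration of the control flow, not ceph-deploy's code: `create_initial`, `deploy`, `in_quorum` and `gather_keys` are hypothetical names, and the monitor list is assumed to have been read from ceph.conf already.

```python
import time


def create_initial(monitors, deploy, in_quorum, gather_keys,
                   attempts=5, delay=1.0, sleep=time.sleep):
    """Deploy all monitors, wait for quorum, then gather keys.

    Every callable here is a hypothetical stand-in supplied by the caller.
    Returns True if keys were gathered, False if quorum never formed.
    """
    # Step 1: deploy every monitor in one pass.
    for mon in monitors:
        deploy(mon)

    # Step 2: poll until every monitor reports that it is in quorum.
    for _ in range(attempts):
        if all(in_quorum(mon) for mon in monitors):
            # Step 3: keys are only gathered once quorum has formed.
            gather_keys()
            return True
        sleep(delay)
    return False
```

Passing `sleep` as a parameter is just a convenience so the loop can be exercised without real delays.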
[22:34] <ptone> OK - thanks, just getting started here, so I will spend some time digging and working with the docs - thanks
[22:37] * sjm1 (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has joined #ceph
[22:37] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) Quit (Quit: Leaving.)
[22:39] * rmoe (~quassel@12.164.168.117) has joined #ceph
[22:43] * mtanski (~mtanski@69.193.178.202) has joined #ceph
[22:44] * japuzzo (~japuzzo@pok2.bluebird.ibm.com) Quit (Quit: Leaving)
[22:47] * joshd1 (~jdurgin@2602:306:c5db:310:2dbe:ad8e:8ea1:4c71) Quit (Ping timeout: 480 seconds)
[22:51] * sroy (~sroy@207.96.182.162) Quit (Quit: Quitte)
[22:53] <ptone> does ceph-deploy not play well with ssh-agent from the admin node?
[22:54] <alfredodeza> it may have issues, but I am not entirely sure because we don't test with ssh-agent
[22:54] <ptone> I'm connecting local -> admin instance with key, then using ssh-forwarding to reach node instances with ssh
[22:55] * rendar (~s@host83-181-dynamic.20-87-r.retail.telecomitalia.it) Quit ()
[22:55] * diegows (~diegows@190.190.5.238) Quit (Ping timeout: 480 seconds)
[22:55] <ptone> But this is the result: http://bpaste.net/show/Xp9OTdVLAofzRmr3Mtwi/
[22:56] * fatih (~fatih@78.186.36.182) has joined #ceph
[22:57] <dmick> sudo: sorry, you must have a tty to run sudo
[22:57] <dmick> that's a sudoers setting, requiretty
[22:57] <alfredodeza> what dmick said
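For anyone hitting the same "you must have a tty to run sudo" message, a rough way to spot the setting dmick mentions is to scan the sudoers text for a global `Defaults requiretty`. The sketch below is an assumption-laden simplification: `requiretty_enabled` is a hypothetical helper, it only looks at plain `Defaults` lines, and it ignores per-user or per-host Defaults and any included files.

```python
def requiretty_enabled(sudoers_text):
    """Return True if a global 'Defaults requiretty' is in effect.

    Simplified on purpose: only bare 'Defaults' lines are considered,
    and the last matching option wins.
    """
    enabled = False  # off unless a Defaults line turns it on
    for raw in sudoers_text.splitlines():
        # Drop comments; this also skips '#include' style lines.
        line = raw.split("#", 1)[0].strip()
        fields = line.split(None, 1)
        if len(fields) != 2 or fields[0] != "Defaults":
            continue
        for option in fields[1].split(","):
            option = option.strip()
            if option == "requiretty":
                enabled = True
            elif option == "!requiretty":
                enabled = False
    return enabled
```

The function takes the file contents as a string, so the caller decides how to read the file on the node (it is normally readable by root only).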
[22:57] <ptone> OK - so it looks like I'll have to go in and at least do some things on each node manually first then
[22:57] <alfredodeza> errr also ceph-deploy should handle that
[22:57] * sjm1 (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) Quit (Ping timeout: 480 seconds)
[22:57] <alfredodeza> no need to spit that traceback
[22:58] * markbby (~Adium@168.94.245.3) has joined #ceph
[22:58] <mo-> I'm seeing a mon abort its syncing process after exactly 30 seconds. could there be some sort of timeout going on? full log section is on the mailing list. was hoping somebody had seen that before
[22:59] <alfredodeza> issue 7585
[22:59] <kraken> alfredodeza might be talking about http://tracker.ceph.com/issues/7585 [ceph-deploy should handle requiretty failures]
[22:59] <alfredodeza> ptone: ^ ^
[22:59] <alfredodeza> thanks for reporting that
[22:59] * mattt (~textual@92.52.76.140) Quit (Read error: Connection reset by peer)
[22:59] <ptone> nice - thanks
[22:59] <ptone> at least it is being tracked
[22:59] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has joined #ceph
[22:59] <ptone> never nice to spit the python traceback to users ;-)
[22:59] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has left #ceph
[23:00] <alfredodeza> well, it is OK if we don't know. That is better than eating them all up
[23:00] * sjm (~sjm@pool-108-53-56-179.nwrknj.fios.verizon.net) has joined #ceph
[23:00] <alfredodeza> :D
[23:00] * joshd1 (~jdurgin@2602:306:c5db:310:19ef:5fb1:726b:fa9e) has joined #ceph
[23:01] <ptone> sure - but in a CLI, it is nice to wrap the message in some red terminal coloring and making it a bit more friendly in the CLI context - though I had verbose flags, so I guess you would want it unadulterated in that context
[23:03] * i_m (~ivan.miro@nat-5-carp.hcn-strela.ru) Quit (Quit: Leaving.)
[23:07] * Cube (~Cube@12.248.40.138) Quit (Quit: Leaving.)
[23:10] <ptone> where/how are you doing platform checks? "[ceph_deploy][ERROR ] UnsupportedPlatform: Platform is not supported: "
[23:10] <ptone> this is amazon-linux - more or less RHEL
[23:11] <darkfader> i heard there's an alternative to backtraces called error handling. dunno if python has it :)
[23:11] * AfC (~andrew@2407:7800:400:1011:2ad2:44ff:fe08:a4c) has joined #ceph
[23:12] <alfredodeza> darkfader: of course there is no such thing as error handling in Python! it is a scripting language!
[23:12] <alfredodeza> ptone: https://github.com/ceph/ceph-deploy/blob/master/ceph_deploy/hosts/remotes.py#L9
[23:13] <darkfader> heh
[23:14] <alfredodeza> can you share the output of python -c "import platform; print platform.linux_distribution()"
[23:14] <alfredodeza> I bet it is all ''
[23:14] <ptone> yup - exactly ;-)
[23:14] <alfredodeza> ptone: ^ ^
[23:14] <alfredodeza> yeah
[23:15] <alfredodeza> so the reason why that is important is because we need to know about your package manager and other distro-specific stuff
[23:15] <alfredodeza> so custom distros that are 'almost like $DISTRO' will fail
[23:15] <ptone> which is the most tested distro?
[23:16] <ptone> IOW - which is my happy-path?
[23:19] <dmick> alfredodeza: maybe you could implement a Psychic Distribution Genie
[23:19] <dmick> just figure out how things work on this completely-non-mainstream thing. we could include kernel unit tests
[23:19] <dmick> ;)
[23:21] <ptone> I can't find/remember now - but there is some /proc/platform like place where other tools check platform, and I think amazon-linux returns rhel
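One place that tools commonly check is the os-release file, whose `ID_LIKE` field lets a derivative distro name its parents. The sketch below shows how a lookup could fall back from `ID` to `ID_LIKE` so that an "almost like $DISTRO" system still maps to a package manager. It is not what ceph-deploy does; `PACKAGE_MANAGERS`, both function names, and the sample text are illustrative assumptions.

```python
import shlex

# Hypothetical mapping from distro ID to package manager family.
PACKAGE_MANAGERS = {
    "debian": "apt",
    "ubuntu": "apt",
    "rhel": "yum",
    "centos": "yum",
    "fedora": "yum",
}


def parse_os_release(text):
    """Parse KEY=value lines (os-release format) into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, raw = line.partition("=")
        # shlex removes the optional shell-style quoting around the value.
        parts = shlex.split(raw)
        fields[key] = parts[0] if parts else ""
    return fields


def guess_package_manager(fields):
    """Try ID first, then each ID_LIKE entry; None if nothing matches."""
    candidates = [fields.get("ID", "")] + fields.get("ID_LIKE", "").split()
    for name in candidates:
        if name in PACKAGE_MANAGERS:
            return PACKAGE_MANAGERS[name]
    return None


# Illustrative sample only; real contents vary by release.
SAMPLE = '''NAME="Amazon Linux AMI"
ID="amzn"
ID_LIKE="rhel fedora"
'''
```

With the sample above, `guess_package_manager(parse_os_release(SAMPLE))` falls through the unknown `amzn` ID and matches `rhel` from `ID_LIKE`. Returning None for an unknown distro keeps the "unsupported platform" decision with the caller.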
[23:23] * kitz_ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) has joined #ceph
[23:26] * nwat (~textual@eduroam-241-123.ucsc.edu) Quit (Quit: My MacBook has gone to sleep. ZZZzzz…)
[23:28] * kitz_ (~kitz@c-71-192-30-36.hsd1.ma.comcast.net) Quit (Remote host closed the connection)
[23:29] * kitz_ (~kitz@admin161-255.hampshire.edu) has joined #ceph
[23:39] * fatih (~fatih@78.186.36.182) Quit (Remote host closed the connection)
[23:39] * fatih (~fatih@78.186.36.182) has joined #ceph
[23:39] * markbby (~Adium@168.94.245.3) Quit (Remote host closed the connection)
[23:39] * gstaicu (~oftc-webi@remote-munich.teradata.com) Quit (Remote host closed the connection)
[23:41] * hasues1 (~hasues@kwfw01.scrippsnetworksinteractive.com) Quit (Quit: Leaving.)
[23:45] * pvh_sa (~pvh@41-133-202-127.dsl.mweb.co.za) Quit (Ping timeout: 480 seconds)
[23:45] * marrusl (~mark@209-150-43-182.c3-0.wsd-ubr2.qens-wsd.ny.cable.rcn.com) Quit (Quit: sync && halt)
[23:53] * koleosfuscus (~koleosfus@77.47.66.235.dynamic.cablesurf.de) has left #ceph
[23:55] * marrusl (~mark@209-150-43-182.c3-0.wsd-ubr2.qens-wsd.ny.cable.rcn.com) has joined #ceph
[23:59] * flaxy (~afx@78.130.174.164) Quit (Quit: WeeChat 0.4.2)

These logs were automatically created by CephLogBot on irc.oftc.net using the Java IRC LogBot.