#ceph IRC Log

Index

IRC Log for 2011-05-26

Timestamps are in GMT/BST.

[0:00] <Tv> bchrisman: i have a way to run a python scripts with ssh connections to all necessary nodes etc; once i figure out ssh connection re-establishment, installing the right kernel for the test will be just a utility function that sleeps until it has gotten the right kernel in place (noop if it's already good)
[0:01] <bchrisman> ahh
[0:03] <Tv> and i already have a gitbuilder for our kernel repo that builds the debs for every commit
[0:03] <Tv> (btw others newdreamers, http://gitbuilder-kernel-amd64.ceph.newdream.net/output/ref/origin_master/ )
[0:03] <Tv> that's probably not visible to outsiders yet
[0:04] <Tv> as in, that url most definitely isn't
[0:04] <Tv> and i don't know how to add the proxy rules
[0:05] <Tv> i still need a nicer mechanism for identifying the kernel version in use.. git describe is neat but impossible to reproduce, especially without the repo
[0:05] <Tv> but even just "upgrade kernel once a day" will be a joy
[0:05] * alexxy (~alexxy@79.173.81.171) has joined #ceph
[0:09] <bchrisman> hmm??? makes sense.. using rpm packaging here otherwise we'd lean on you guys to open that up.. :)
[0:10] <Tv> the builder stuff is open
[0:10] <Tv> the controlling thingie will be open once i actually have commits ;)
[0:11] <Tv> bchrisman: http://ceph.newdream.net/git/?p=autobuild-ceph.git;a=summary for the builder script
[0:11] <bchrisman> kernel packages getting built in there?
[0:11] <Tv> example web output: http://ceph.newdream.net/gitbuilder/
[0:12] <Tv> e.g. http://ceph.newdream.net/git/?p=autobuild-ceph.git;a=blob;f=build-kernel.sh;h=7f2206da637434465ac3e80994e0f3f834e1c0cc;hb=HEAD
[0:15] <Tv> bchrisman: oh and README might be out of date, fabfile.py is the real deploy mechanism
[0:16] <bchrisman> ahh??? so that's the hook into gitbuilder to rebuild yer kernel??? that's building the full kernel pkg?
[0:16] <Tv> 47 ionice -c3 nice -n20 make O=build~/out LOCALVERSION=-ceph KDEB_PKGVERSION=ceph deb-pkg -j16 "$@" || exit 4
[0:16] <Tv> that's the magic line
[0:19] <cmccabe> tv: perhaps enabling /proc/kconfig.gz would help
[0:20] <Tv> cmccabe: i already have that in /boot
[0:20] <cmccabe> tv: that might help identify the running kernel I mean
[0:20] <Tv> the problem is, it contains the "pretty" version, not the raw sha1
[0:20] <cmccabe> tv: maybe try setting LOCALVERSION to the SHA1?
[0:21] <Tv> cmccabe: horribly long
[0:21] <cmccabe> heh
[0:21] <cmccabe> SHA1 tends to be that way
[0:21] <Tv> i think something like echo SHA1="$(git rev-parse HEAD)" >>.config should work
[0:22] * aliguori (~anthony@32.97.110.59) Quit (Quit: Ex-Chat)
[0:42] * sjustlaptop (~sam@ip-66-33-206-8.dreamhost.com) Quit (Quit: Leaving.)
[0:56] * Yulya__ (~Yu1ya_@ip-95-220-130-125.bb.netbynet.ru) has joined #ceph
[1:03] * Yulya_ (~Yu1ya_@ip-95-220-235-151.bb.netbynet.ru) Quit (Ping timeout: 480 seconds)
[1:08] * Meths (rift@2.25.212.21) Quit (Ping timeout: 480 seconds)
[1:36] * yehuda_hm (~yehuda@99-48-179-68.lightspeed.irvnca.sbcglobal.net) has joined #ceph
[1:39] * Tv (~Tv|work@ip-66-33-206-8.dreamhost.com) Quit (Ping timeout: 480 seconds)
[1:53] * verwilst (~verwilst@dD57693C8.access.telenet.be) Quit (Quit: Ex-Chat)
[2:03] * Yulya_ (~Yu1ya_@ip-95-220-254-85.bb.netbynet.ru) has joined #ceph
[2:06] * Yulya__ (~Yu1ya_@ip-95-220-130-125.bb.netbynet.ru) Quit (Ping timeout: 480 seconds)
[2:24] * Meths (rift@2.25.212.84) has joined #ceph
[2:27] * bchrisman (~Adium@70-35-37-146.static.wiline.com) Quit (Quit: Leaving.)
[2:59] * cmccabe (~cmccabe@208.80.64.174) has left #ceph
[3:03] * gregorg (~Greg@78.155.152.6) Quit (Read error: Connection reset by peer)
[3:04] * gregorg (~Greg@78.155.152.6) has joined #ceph
[3:06] * aliguori (~anthony@cpe-70-123-132-139.austin.res.rr.com) has joined #ceph
[3:08] * sjustlaptop (~sam@adsl-76-208-176-239.dsl.lsan03.sbcglobal.net) has joined #ceph
[3:23] * bchrisman (~Adium@c-98-207-207-62.hsd1.ca.comcast.net) has joined #ceph
[3:31] * aliguori (~anthony@cpe-70-123-132-139.austin.res.rr.com) Quit (Ping timeout: 480 seconds)
[3:54] * ghaskins (~ghaskins@66-189-113-47.dhcp.oxfr.ma.charter.com) Quit (Quit: Leaving)
[3:59] * ghaskins (~ghaskins@66-189-113-47.dhcp.oxfr.ma.charter.com) has joined #ceph
[4:03] * joshd (~joshd@ip-66-33-206-8.dreamhost.com) Quit (Quit: Leaving.)
[4:12] * sjustlaptop (~sam@adsl-76-208-176-239.dsl.lsan03.sbcglobal.net) Quit (Ping timeout: 480 seconds)
[4:20] * sjustlaptop (~sam@adsl-76-208-176-239.dsl.lsan03.sbcglobal.net) has joined #ceph
[5:24] * sjustlaptop (~sam@adsl-76-208-176-239.dsl.lsan03.sbcglobal.net) Quit (Ping timeout: 480 seconds)
[6:11] * MarkN (~nathan@59.167.240.178) has left #ceph
[7:11] * tjikkun (~tjikkun@195-240-187-63.ip.telfort.nl) has joined #ceph
[7:37] * mtk (~mtk@ool-182c8e6c.dyn.optonline.net) Quit (Ping timeout: 480 seconds)
[7:42] * mtk (~mtk@ool-182c8e6c.dyn.optonline.net) has joined #ceph
[7:49] * lidongyang (~lidongyan@222.126.194.154) Quit (Remote host closed the connection)
[7:53] * lidongyang (~lidongyan@222.126.194.154) has joined #ceph
[7:56] * neurodrone (~neurodron@cpe-76-180-162-12.buffalo.res.rr.com) Quit (Quit: zzZZZZzz)
[8:33] * Jiaju (~jjzhang@222.126.194.154) has joined #ceph
[9:11] * sjustlaptop (~sam@adsl-76-208-176-239.dsl.lsan03.sbcglobal.net) has joined #ceph
[9:15] * allsystemsarego (~allsystem@188.27.167.240) has joined #ceph
[9:24] * sjustlaptop (~sam@adsl-76-208-176-239.dsl.lsan03.sbcglobal.net) Quit (Ping timeout: 480 seconds)
[9:37] * gregorg_taf (~Greg@78.155.152.6) has joined #ceph
[9:39] * damien1 (~damien@94-23-154-182.kimsufi.com) has joined #ceph
[9:39] * lx0 (~aoliva@186.214.48.236) has joined #ceph
[9:39] * gregorg (~Greg@78.155.152.6) Quit (reticulum.oftc.net kilo.oftc.net)
[9:39] * jbd (~jbd@ks305592.kimsufi.com) Quit (reticulum.oftc.net kilo.oftc.net)
[9:39] * andret (~andre@pcandre.nine.ch) Quit (reticulum.oftc.net kilo.oftc.net)
[9:39] * lxo (~aoliva@186.214.48.236) Quit (reticulum.oftc.net kilo.oftc.net)
[9:39] * nms (martin@sexyba.be) Quit (reticulum.oftc.net kilo.oftc.net)
[9:39] * damoxc (~damien@94-23-154-182.kimsufi.com) Quit (reticulum.oftc.net kilo.oftc.net)
[9:40] * andret (~andre@pcandre.nine.ch) has joined #ceph
[9:40] * nms (martin@sexyba.be) has joined #ceph
[9:51] * yehuda_hm (~yehuda@99-48-179-68.lightspeed.irvnca.sbcglobal.net) Quit (Read error: Operation timed out)
[10:04] * yehuda_hm (~yehuda@99-48-179-68.lightspeed.irvnca.sbcglobal.net) has joined #ceph
[11:12] * jbd (~jbd@ks305592.kimsufi.com) has joined #ceph
[14:06] * Yulya__ (~Yu1ya_@ip-95-220-177-101.bb.netbynet.ru) has joined #ceph
[14:09] * Yulya_ (~Yu1ya_@ip-95-220-254-85.bb.netbynet.ru) Quit (Ping timeout: 480 seconds)
[14:38] * MK_FG (~MK_FG@188.226.51.71) Quit (Quit: o//)
[14:41] * MK_FG (~MK_FG@188.226.51.71) has joined #ceph
[14:51] * aliguori (~anthony@cpe-70-123-132-139.austin.res.rr.com) has joined #ceph
[15:35] * bchrisman (~Adium@c-98-207-207-62.hsd1.ca.comcast.net) Quit (Quit: Leaving.)
[15:37] * Yulya_ (~Yu1ya_@ip-95-220-240-97.bb.netbynet.ru) has joined #ceph
[15:44] * Yulya__ (~Yu1ya_@ip-95-220-177-101.bb.netbynet.ru) Quit (Ping timeout: 480 seconds)
[15:53] * MKFG (~MK_FG@188.226.51.71) has joined #ceph
[15:57] * MK_FG (~MK_FG@188.226.51.71) Quit (Ping timeout: 480 seconds)
[15:57] * MKFG is now known as MK_FG
[15:58] * MK_FG (~MK_FG@188.226.51.71) Quit (Quit: o//)
[16:01] * MK_FG (~MK_FG@188.226.51.71) has joined #ceph
[16:30] * neurodrone (~neurodron@cpe-76-180-162-12.buffalo.res.rr.com) has joined #ceph
[17:04] * yehuda_hm (~yehuda@99-48-179-68.lightspeed.irvnca.sbcglobal.net) Quit (Ping timeout: 480 seconds)
[17:21] * Tv (~Tv|work@ip-66-33-206-8.dreamhost.com) has joined #ceph
[17:58] * aliguori (~anthony@cpe-70-123-132-139.austin.res.rr.com) Quit (Ping timeout: 480 seconds)
[18:09] <wonko_be> anyone an idea on this: common_init: unable to open config file.
[18:09] <wonko_be> failed: '/sbin/mkcephfs -d /tmp/mkfs.ceph.806 --init-daemon mon.0'
[18:10] <Tv> wonko_be: do you have a ceph.conf somewhere?
[18:11] <Tv> wonko_be: try running it with sh -x to see what exactly fails
[18:11] <wonko_be> i do have a ceph.conf
[18:11] <wonko_be> it is actually restarting a existing installation
[18:13] <Tv> sage: this is part of why i'm not happy with mkcephfs.. 500 lines of shell is hard to debug :(
[18:13] <wonko_be> ack, I was browsing through it, and I quickly though that someone else might have encountered this
[18:14] <wonko_be> + bash -c /sbin/mkcephfs -d /tmp/mkfs.ceph.2351 --init-daemon mon.0
[18:14] <wonko_be> this fails
[18:14] * bchrisman (~Adium@70-35-37-146.static.wiline.com) has joined #ceph
[18:14] <wonko_be> let me see if it creates this somewhere
[18:14] <Tv> wido: sh -x /sbin/mkcephfs ....
[18:14] <wonko_be> i assume it looks for /tmp/mkfs.ceph.2351/conf
[18:14] <wonko_be> Tv: this is with the sh -x
[18:14] <Tv> it should
[18:15] <Tv> but i'd like to see the cmon mkfs call
[18:16] <Tv> oh that bash -c came out of the sh -x run, odd
[18:16] <wonko_be> hmmmz, seems that the /tmp/... is created on the remote hosts, but not on the local one
[18:16] <Tv> err, i don't see how
[18:16] <wonko_be> there it is named differently
[18:16] <wonko_be> let me check my config
[18:21] * darkfaded (~floh@188.40.175.2) has joined #ceph
[18:28] * aliguori (~anthony@32.97.110.64) has joined #ceph
[18:29] <wonko_be> there we go
[18:29] <wonko_be> now, lets check if it is a bug
[18:34] <wido> Tv: what?
[18:35] <wido> why should I mkcephfs?
[18:35] <Tv> wido: sorry, bad tab completion
[18:35] <wonko_be> wido: it was me he needed i assume
[18:35] <wonko_be> ah, see
[18:35] <wido> ah, ok :)
[18:36] <wido> But, now I'm here, how 'stable
[18:36] <wido> ' is the new peering code considered?
[18:36] <wido> I upgraded to v0.28.1 yesterday, but I'm still seeing various crashes in the peering
[18:36] <Tv> wido: i overheard a few bugs being found i think, so it might need time to settle
[18:36] <Tv> but it was done because the old code was problematic too, so..
[18:37] * Tv waves at sjust
[18:37] <sjust> wido: there was a fix pushed to stable yesterday at around 10:30 our time
[18:37] <sjust> wido: it should fix at least one common bug
[18:40] * aliguori (~anthony@32.97.110.64) Quit (Ping timeout: 480 seconds)
[18:46] * neurodrone (~neurodron@cpe-76-180-162-12.buffalo.res.rr.com) Quit (Quit: zzZZZZzz)
[18:46] <wonko_be> so fixed it
[18:48] * joshd (~joshd@ip-66-33-206-8.dreamhost.com) has joined #ceph
[18:59] * joshd (~joshd@ip-66-33-206-8.dreamhost.com) Quit (Quit: Leaving.)
[19:05] * Tv (~Tv|work@ip-66-33-206-8.dreamhost.com) Quit (Ping timeout: 480 seconds)
[19:07] * cmccabe (~cmccabe@c-24-23-254-199.hsd1.ca.comcast.net) has joined #ceph
[19:12] * sagelap (~sage@12.248.40.138) has joined #ceph
[19:26] <wido> sjust: Tnx, I'll take a look at it
[19:26] * neurodrone (~neurodron@cpe-76-180-162-12.buffalo.res.rr.com) has joined #ceph
[19:59] * sjustlaptop (~sam@12.248.40.138) has joined #ceph
[20:02] * sjustlaptop (~sam@12.248.40.138) Quit ()
[20:10] * aliguori (~anthony@32.97.110.64) has joined #ceph
[20:21] * allsystemsarego (~allsystem@188.27.167.240) Quit (Quit: Leaving)
[20:39] * sjustlaptop (~sam@206.29.188.187) has joined #ceph
[20:39] * sjustlaptop (~sam@206.29.188.187) Quit ()
[21:00] * Yulya__ (~Yu1ya_@ip-95-220-154-105.bb.netbynet.ru) has joined #ceph
[21:07] * Yulya_ (~Yu1ya_@ip-95-220-240-97.bb.netbynet.ru) Quit (Ping timeout: 480 seconds)
[21:16] * sjustlaptop (~sam@12.248.40.138) has joined #ceph
[21:17] * aliguori (~anthony@32.97.110.64) Quit (Ping timeout: 480 seconds)
[21:20] * darkfader (~floh@188.40.175.2) Quit (Quit: leaving)
[21:22] * sjustlaptop (~sam@12.248.40.138) Quit (Quit: Leaving.)
[21:32] * Yulya_ (~Yu1ya_@ip-95-220-147-60.bb.netbynet.ru) has joined #ceph
[21:38] * Yulya__ (~Yu1ya_@ip-95-220-154-105.bb.netbynet.ru) Quit (Ping timeout: 480 seconds)
[21:40] * sagelap1 (~sage@12.248.40.138) has joined #ceph
[21:40] * sagelap (~sage@12.248.40.138) Quit (Read error: Connection reset by peer)
[21:41] * aliguori (~anthony@32.97.110.64) has joined #ceph
[21:46] * Yulya__ (~Yu1ya_@ip-95-220-181-38.bb.netbynet.ru) has joined #ceph
[21:53] * aliguori_ (~anthony@32.97.110.64) has joined #ceph
[21:53] * Yulya_ (~Yu1ya_@ip-95-220-147-60.bb.netbynet.ru) Quit (Ping timeout: 480 seconds)
[21:56] * aliguori (~anthony@32.97.110.64) Quit (Ping timeout: 480 seconds)
[22:04] * Yulya_ (~Yu1ya_@ip-95-220-162-121.bb.netbynet.ru) has joined #ceph
[22:06] * verwilst (~verwilst@dD57693C8.access.telenet.be) has joined #ceph
[22:08] * alexxy (~alexxy@79.173.81.171) Quit (Ping timeout: 480 seconds)
[22:11] * Yulya__ (~Yu1ya_@ip-95-220-181-38.bb.netbynet.ru) Quit (Ping timeout: 480 seconds)
[22:22] * Yulya_ (~Yu1ya_@ip-95-220-162-121.bb.netbynet.ru) Quit (Ping timeout: 480 seconds)
[22:36] * rsharpe (~Adium@70-35-37-146.static.wiline.com) Quit (Quit: Leaving.)
[22:44] <cmccabe> looks like I somehow created a defunct cosd on rgw-cmccabe
[22:44] <cmccabe> it seems like I'm getting btrfs errors
[22:48] <cmccabe> well, I'm not on the latest kernel, so I guess debugging this would be fruitless
[23:03] * mtk (~mtk@ool-182c8e6c.dyn.optonline.net) Quit (Ping timeout: 480 seconds)
[23:07] * slang (~slang@chml01.drwholdings.com) Quit (Read error: Connection reset by peer)
[23:10] * mtk (~mtk@ool-182c8e6c.dyn.optonline.net) has joined #ceph
[23:30] <bchrisman> if I wanted to turn off osd scrubbing for testing purposes??? how can I set a variable like: osd_scrub_load_threshold
[23:30] <bchrisman> I was thinking of setting that to 0.0
[23:31] <cmccabe> add this to your config
[23:31] <cmccabe> osd_scrub_load_threshold = 0.0
[23:31] <cmccabe> in either the global or the osd section
[23:31] <bchrisman> with the '_'s then? or does that matter?
[23:31] <cmccabe> spaces or underscores; either will work
[23:32] <bchrisman> okay.. thought I tried that particular incantation and it didn't work.. but I removed it from the conf file.. maybe I typoed.. will try again
[23:33] <bchrisman> hmm.. parses this time.
[23:36] * Yulya_ (~Yu1ya_@ip-95-220-172-33.bb.netbynet.ru) has joined #ceph
[23:39] * aliguori (~anthony@32.97.110.64) has joined #ceph
[23:40] * aliguori_ (~anthony@32.97.110.64) Quit (Read error: Connection reset by peer)
[23:51] * Tv (~Tv|work@ip-66-33-206-8.dreamhost.com) has joined #ceph
[23:54] * joshd (~joshd@ip-66-33-206-8.dreamhost.com) has joined #ceph

These logs were automatically created by CephLogBot on irc.oftc.net using the Java IRC LogBot.