#ceph IRC Log

Index

IRC Log for 2012-03-17

Timestamps are in GMT/BST.

[0:00] * Jaykra (~Jamie@64-126-89-248.dyn.everestkc.net) Quit (Quit: Leaving.)
[0:02] * softcrack (de8084f0@ircip3.mibbit.com) has joined #ceph
[0:03] * aa (~aa@r200-40-114-26.ae-static.anteldata.net.uy) Quit (Remote host closed the connection)
[0:09] * softcrack (de8084f0@ircip3.mibbit.com) Quit (Quit: http://www.mibbit.com ajax IRC Client)
[0:10] * softcrack (~samuel@202.85.210.57) has joined #ceph
[0:10] * aliguori (~anthony@cpe-70-123-132-139.austin.res.rr.com) Quit (Read error: Operation timed out)
[0:17] * ArtemGr (~chatzilla@94.188.74.88) Quit (Quit: ChatZilla 0.9.88.1 [Firefox 11.0/20120312181643])
[0:24] * ArtemGr (~chatzilla@94.188.74.88) has joined #ceph
[0:25] * aliguori (~anthony@cpe-70-123-132-139.austin.res.rr.com) has joined #ceph
[0:27] <ArtemGr> "rados lspools" sometimes aborts: http://pastebin.com/WVJR4xGu
[0:27] * aliguori (~anthony@cpe-70-123-132-139.austin.res.rr.com) Quit (Remote host closed the connection)
[0:29] * ArtemGr (~chatzilla@94.188.74.88) Quit ()
[0:38] * Oliver1 (~oliver1@p5483D33E.dip.t-dialin.net) has left #ceph
[0:38] * stxShadow (~jens@ip-88-153-224-220.unitymediagroup.de) Quit (Read error: Connection reset by peer)
[0:44] * softcrack (~samuel@202.85.210.57) has left #ceph
[0:46] <sagewk> artemgr: which version is that?
[0:48] <joshd> sagewk: backtrace says 0.43 (9fa8781c0147d66fcef7c2dd0e09cd3c69747d37)
[0:49] <sagewk> 702f09ea74dd134f34b84cd9d80dbfd79573b644 fixes this, i believe
[0:50] <joshd> yeah, that sounds like the same problem
[1:07] * Tv|work (~Tv_@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[1:13] <absynth> hm
[1:13] <absynth> i'm seeing this issue
[1:13] <absynth> http://ceph.newdream.net/gitbuilder-amd64/log.cgi?log=b327b28faf2ae5168662c763ed60f8a0e2075550
[1:14] <absynth> on a build that sage prepared for us
[1:14] <absynth> wip-omap-decode
[1:14] <joshd> absynth: does 'git submodule init && git submodule update' help?
[1:14] <absynth> Makefile.am:182: `lib/libgtest.a' is not a standard libtool library name
[1:14] <absynth> i think the actual error is
[1:14] <absynth> autoreconf: `configure.ac' or `configure.in' is required
[1:15] <absynth> sec
[1:15] <absynth> snoring cat next to me
[1:15] <absynth> how idyllic
[1:16] <absynth> yes, looks better
[1:16] <absynth> what does that do?
[1:17] <joshd> we have submodules for some test data and for leveldb, so we don't need to include them directly in the ceph repo
[1:19] <absynth> ah, okay
[1:28] <absynth> hm, can you tell me which commit and version i should be seeing after install?
[1:28] <absynth> i am seeing ceph version 0.43-287-g3a6c085 (commit:3a6c085e8207877fbaac38c9f6055f54bad2a2c9)
[1:28] <absynth> does that correspond to wip-omap-decode?
[1:33] <dmick> wip-omap-decode is currently at cffe0caecdeba57c97e2bd3f74f679c16d4a4e0a, if that helps
[1:33] <absynth> hm, i think someone checked out the wrong branch
[1:33] <absynth> unfortunately, that someone is in bed now
[1:34] <absynth> could you help me checking out the right one?
[1:34] <absynth> i'm not into git at all
[1:36] <dmick> you have a ceph source tree and are trying to specifically build the most-current rev of the wip-omap-decode branch?
[1:36] <absynth> exactly
[1:36] <dmick> in the tree, you should be able to do git checkout wip-omap-decode
[1:36] <dmick> it's probably worth verifying first
[1:36] <dmick> that git status shows nothing unexpected
[1:37] <absynth> hmm
[1:37] <absynth> root@fcmsnode0:/usr/src/ceph# git status
[1:37] <absynth> # On branch wip-omap-decode
[1:37] <absynth> # Untracked files:
[1:37] <absynth> looks like the branch is correct
[1:37] <absynth> still, sage says the version doesn't match
[1:38] <dmick> "on the branch" doesn't mean "up to date"
[1:38] <dmick> checking some state here
[1:39] <joshd> absynth: do 'git fetch; git reset --hard origin/wip-omap-decode; git clean -f -d -x; git submodule init; git submodule update'
[1:39] <absynth> what's git syntax for "update current tree"?
[1:39] <joshd> it's overkill, but it will get you in the correct state
[1:40] <dmick> yeah, that's weird. I don't even see that SHA1 in the tree
[1:40] <dmick> I'd go with joshd's suggestion for sure, he's got a lot more git-fu than me
[1:40] <absynth> root@fcmsnode0:/usr/src/ceph# git reset --hard origin/wip-imap-decodefatal: ambiguous argument 'origin/wip-imap-decode': unknown revision or path not in the working tree.
[1:41] <dmick> omap
[1:41] <absynth> doh
[1:41] <absynth> i work with mail servers a lot...
[1:41] <dmick> :)
[1:42] * bchrisman (~Adium@108.60.121.114) Quit (Quit: Leaving.)
[1:42] <absynth> ok... next try
[1:52] * lofejndif (~lsqavnbok@28IAADDRG.tor-irc.dnsbl.oftc.net) Quit (Quit: Leaving)
[1:54] * Tv__ (~tv@cpe-24-24-131-250.socal.res.rr.com) has joined #ceph
[1:57] <absynth> compiling went fine, starting up went fine, now i gotta wait thru 13% degradation, sigh
[2:00] <sage> absynth: can you install the ceph-dbg package?
[2:02] * ^conner (~conner@leo.tuc.noao.edu) Quit (Read error: Operation timed out)
[2:02] <absynth> yeah, wait
[2:02] <absynth> i installed it, i also have a corefile too
[2:02] <absynth> sent you the crashdump via mail to the same address as jabber
[2:03] <absynth> i'm now at 3% degradation on the other 3 osds so i gotta hope they'll recover
[2:03] <sage> mind if i try starting osd.0 one last time?
[2:04] <absynth> sure. i guess if anyone knows what they're doing, it's you :)
[2:04] <absynth> i just hope the other osds won't crash on the magic .79% barrier again
[2:05] <sage> no dice. ok, i'll see what the scan_range thing is about, since it's clearly reproducible.
[2:06] <absynth> great. you wanna grab a couple more hints from the core file?
[2:08] <sage> did that. going to do one more attempt and capture an strace
[2:08] <absynth> ok... i'll cover my eyes
[2:09] * tnt_ (~tnt@148.47-67-87.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[2:11] <absynth> ok, i think i'll wait thru the 4 something percent to make sure the osd.2 doesn't crash on .79 again
[2:12] <absynth> and then i'll go to bed. it's 2am here
[2:16] <sage> absynth: ok. keeping an eye on it.
[2:19] * DLange (~DLange@dlange.user.oftc.net) Quit (Quit: rebooooooo. . . . . . .)
[2:22] * DLange (~DLange@dlange.user.oftc.net) has joined #ceph
[2:37] * joshd (~joshd@aon.hq.newdream.net) Quit (Quit: Leaving.)
[2:58] <joao> sage, still around?
[3:13] * BManojlovic (~steki@212.200.240.216) Quit (Remote host closed the connection)
[3:24] * joao (~JL@89.181.145.28) Quit (Quit: Leaving)
[3:38] * chutzpah (~chutz@216.174.109.254) Quit (Quit: Leaving)
[5:03] <sage> joao: am now
[5:14] * d405 (~nobody@un.interestingsh.it) Quit (Quit: leaving)
[5:17] * bchrisman (~Adium@c-76-103-130-94.hsd1.ca.comcast.net) has joined #ceph
[5:19] * gregaf1 (~Adium@aon.hq.newdream.net) has joined #ceph
[5:19] * yehudasa_ (~yehudasa@aon.hq.newdream.net) has joined #ceph
[5:20] * sagewk (~sage@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[5:20] * sjust (~sam@aon.hq.newdream.net) Quit (Write error: connection closed)
[5:20] * mkampe1 (~markk@aon.hq.newdream.net) has joined #ceph
[5:21] * sagewk (~sage@aon.hq.newdream.net) has joined #ceph
[5:21] * sjust (~sam@aon.hq.newdream.net) has joined #ceph
[5:22] * dmick1 (~dmick@aon.hq.newdream.net) has joined #ceph
[5:26] * gregaf (~Adium@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:26] * mkampe (~markk@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:26] * dmick (~dmick@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:26] * yehudasa (~yehudasa@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:27] * yehudasa__ (~yehudasa@aon.hq.newdream.net) has joined #ceph
[5:28] * gregaf (~Adium@aon.hq.newdream.net) has joined #ceph
[5:28] * dmick (~dmick@aon.hq.newdream.net) has joined #ceph
[5:29] * gregaf1 (~Adium@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[5:29] * yehudasa_ (~yehudasa@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[5:29] * sagewk1 (~sage@aon.hq.newdream.net) has joined #ceph
[5:30] * mkampe (~markk@aon.hq.newdream.net) has joined #ceph
[5:30] * yehudasa_ (~yehudasa@aon.hq.newdream.net) has joined #ceph
[5:31] * groovious (~Adium@64-126-49-62.dyn.everestkc.net) Quit (Quit: Leaving.)
[5:31] * sagewk2 (~sage@aon.hq.newdream.net) has joined #ceph
[5:32] * yehudasa (~yehudasa@aon.hq.newdream.net) has joined #ceph
[5:32] * dmick2 (~dmick@aon.hq.newdream.net) has joined #ceph
[5:32] * sjust1 (~sam@aon.hq.newdream.net) has joined #ceph
[5:32] * mkampe2 (~markk@aon.hq.newdream.net) has joined #ceph
[5:33] * mkampe1 (~markk@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:33] * dmick1 (~dmick@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:34] * sjust (~sam@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:35] * sagewk (~sage@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:36] * gregaf (~Adium@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:36] * dmick (~dmick@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:37] * yehudasa__ (~yehudasa@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:37] * sagewk1 (~sage@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:38] * mkampe (~markk@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:39] * yehudasa_ (~yehudasa@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[5:39] * dmick2 (~dmick@aon.hq.newdream.net) Quit (Quit: Leaving.)
[6:21] * cattelan is now known as cattelan_away
[7:18] * andreask (~andreas@chello062178013131.5.11.vie.surfer.at) has joined #ceph
[7:18] * andreask (~andreas@chello062178013131.5.11.vie.surfer.at) has left #ceph
[7:47] * bchrisman1 (~Adium@c-76-103-130-94.hsd1.ca.comcast.net) has joined #ceph
[7:54] * bchrisman (~Adium@c-76-103-130-94.hsd1.ca.comcast.net) Quit (Ping timeout: 480 seconds)
[8:17] * Tv__ (~tv@cpe-24-24-131-250.socal.res.rr.com) Quit (Ping timeout: 480 seconds)
[8:19] * tnt_ (~tnt@148.47-67-87.adsl-dyn.isp.belgacom.be) has joined #ceph
[8:39] <Qten> So with the way ceph works would i be better off with more servers with say 4 disks and 16gb of ram or less server with 32gig of ram and 12 disks when running virtual machines?
[9:14] * Dieter_be (~Dieterbe@dieter2.plaetinck.be) has joined #ceph
[11:18] * tjikkun_ (~tjikkun@82-169-255-84.ip.telfort.nl) has joined #ceph
[13:16] <NaioN> Qten: true but also think about the networking
[13:16] <NaioN> even 4 disks can do more than 1Gbit/s
[13:16] <NaioN> and use something like ssd as journal
[13:36] <darkfader> "true"?
[13:36] <darkfader> he had an "or" modifier in there ;)
[13:46] <dwm__> I suspect the distribution of resources is less significant than the total amount. If you get more individual spindles / capacity with one or t'other..
[13:46] <dwm__> (All other things being equal.)
[14:04] * tjikkun (~tjikkun@82-169-255-84.ip.telfort.nl) Quit (Remote host closed the connection)
[14:07] <NaioN> darkfader: sorry, but 4disks per system with more mem per disk is better
[14:07] <NaioN> also because you have less overcommitment on the network side
[14:08] <NaioN> but I also wanted to point out that you have to look at more things
[14:08] <darkfader> NaioN: ah i didnt catch on, you're right 4:16 or 12:32
[14:08] <NaioN> you also have to look at the network interfaces
[14:08] <NaioN> and the jorunal device
[14:09] <darkfader> yup
[14:09] <darkfader> i just wanted to help clarify because otherwise you'll never know what he thinks you recommended
[14:09] <darkfader> aside from we're talking to a phantom anyway
[14:09] <NaioN> at the moment I'm using 24disks per system with 1 ssd (with partitions)
[14:09] <darkfader> i'll go outside get more sun
[14:10] <NaioN> k :)
[14:10] <darkfader> oh and the 1 ssd keeps up throughput wise?
[14:10] <darkfader> interesting!
[14:10] <NaioN> well I have only 1gbit/s network
[14:10] <NaioN> so that's the bottleneck
[14:10] <darkfader> hehe ok yes
[14:10] <NaioN> I plan to use bonding (2x1gbit/s)
[14:10] <darkfader> but you could say:
[14:11] <NaioN> but that will also be the bottleneck I think
[14:11] <darkfader> "the new setup has proven to sustain line rate for prolonged periods" *snicker*
[14:11] <NaioN> yeps :)
[14:40] * lofejndif (~lsqavnbok@28IAADD2S.tor-irc.dnsbl.oftc.net) has joined #ceph
[14:46] * tjikkun (~tjikkun@82-169-255-84.ip.telfort.nl) has joined #ceph
[14:48] * sage (~sage@cpe-76-94-40-34.socal.res.rr.com) Quit (Read error: Operation timed out)
[14:51] <Qten> I was going to suggest 20gbit IPoIB
[14:52] <Qten> for networking side of things
[14:53] <Qten> i was thinking of the simular technology as to what scale computing uses they only use 4 disks per node with 16gb of ram it seems
[14:53] <Qten> with optional 10gbit ethernet
[14:54] <Qten> just trying to work out why they would only use 4 disks per node and not 8
[14:54] <Qten> or 12/16/24 etc
[14:55] <Qten> apart from costs the drives are at this stage the most expensive bit of hardware ha, but that said they deisgned it well before the price rise on hard disks so its interesting to know why they may have picked 4 instead of a higher number
[14:56] <Qten> http://scalecomputing.com/ who havent heard of it before
[15:10] <NaioN> well with only 4 disks per node i think the rest of the node is more expensive than the 4 disks
[16:01] * gohko_ (~gohko@natter.interq.or.jp) has joined #ceph
[16:08] * gohko (~gohko@natter.interq.or.jp) Quit (Ping timeout: 480 seconds)
[16:29] * groovious (~Adium@64-126-49-62.dyn.everestkc.net) has joined #ceph
[17:05] * tnt_ (~tnt@148.47-67-87.adsl-dyn.isp.belgacom.be) Quit (Ping timeout: 480 seconds)
[18:06] * andreask (~andreas@chello062178013131.5.11.vie.surfer.at) has joined #ceph
[18:06] * andreask (~andreas@chello062178013131.5.11.vie.surfer.at) has left #ceph
[18:15] * tjikkun_ (~tjikkun@82-169-255-84.ip.telfort.nl) Quit (Ping timeout: 480 seconds)
[18:17] * lofejndif (~lsqavnbok@28IAADD2S.tor-irc.dnsbl.oftc.net) Quit (Quit: Leaving)
[18:23] * tnt_ (~tnt@38.cust-D00.waldc.net) has joined #ceph
[19:00] * LarsFronius (~LarsFroni@testing78.jimdo-server.com) Quit (Quit: LarsFronius)
[19:23] * lxo (~aoliva@lxo.user.oftc.net) Quit (Ping timeout: 480 seconds)
[19:32] * lxo (~aoliva@lxo.user.oftc.net) has joined #ceph
[19:39] * LarsFronius (~LarsFroni@f054104029.adsl.alicedsl.de) has joined #ceph
[19:47] * stass (stas@ssh.deglitch.com) Quit (Read error: Connection reset by peer)
[19:51] * stass (stas@ssh.deglitch.com) has joined #ceph
[20:02] * jefferai (~quassel@quassel.jefferai.org) Quit (Ping timeout: 480 seconds)
[20:07] <absynth> morning everyone
[20:08] * jefferai (~quassel@quassel.jefferai.org) has joined #ceph
[20:14] * Tv__ (~tv@cpe-24-24-131-250.socal.res.rr.com) has joined #ceph
[20:15] * tjikkun_ (~tjikkun@82-169-255-84.ip.telfort.nl) has joined #ceph
[20:15] * yehudasa_ (~yehudasa@aon.hq.newdream.net) has joined #ceph
[20:16] * sagewk (~sage@aon.hq.newdream.net) has joined #ceph
[20:16] * yehudasa__ (~yehudasa@aon.hq.newdream.net) has joined #ceph
[20:16] * mkampe2 (~markk@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[20:16] * sjust1 (~sam@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[20:16] * sjust (~sam@aon.hq.newdream.net) has joined #ceph
[20:18] * sagewk1 (~sage@aon.hq.newdream.net) has joined #ceph
[20:18] * mkampe (~markk@aon.hq.newdream.net) has joined #ceph
[20:18] * sjust1 (~sam@aon.hq.newdream.net) has joined #ceph
[20:19] * yehudasa__ (~yehudasa@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[20:19] <absynth> sagewk1: awake yet?
[20:20] * sagewk3 (~sage@aon.hq.newdream.net) has joined #ceph
[20:21] * sjust (~sam@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[20:22] * yehudasa (~yehudasa@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[20:22] * sagewk2 (~sage@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[20:23] * yehudasa_ (~yehudasa@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[20:23] * sjust (~sam@aon.hq.newdream.net) has joined #ceph
[20:24] * mkampe1 (~markk@aon.hq.newdream.net) has joined #ceph
[20:24] * sjust1 (~sam@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[20:24] * sagewk (~sage@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[20:25] * sagewk3 (~sage@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[20:26] * sagewk1 (~sage@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[20:28] * sjust1 (~sam@aon.hq.newdream.net) has joined #ceph
[20:28] * mkampe2 (~markk@aon.hq.newdream.net) has joined #ceph
[20:28] * sjust (~sam@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[20:28] * mkampe (~markk@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[20:28] * mkampe1 (~markk@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[20:31] * lofejndif (~lsqavnbok@659AAAPXG.tor-irc.dnsbl.oftc.net) has joined #ceph
[20:32] * sagewk (~sage@aon.hq.newdream.net) has joined #ceph
[20:38] * BManojlovic (~steki@212.200.240.216) has joined #ceph
[20:43] * sagewk1 (~sage@aon.hq.newdream.net) has joined #ceph
[20:43] * mkampe (~markk@aon.hq.newdream.net) has joined #ceph
[20:44] * sjust (~sam@aon.hq.newdream.net) has joined #ceph
[20:45] * sagewk (~sage@aon.hq.newdream.net) Quit (Read error: Operation timed out)
[20:48] * mkampe2 (~markk@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[20:49] * sjust1 (~sam@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[20:54] * joao (~JL@89-181-145-28.net.novis.pt) has joined #ceph
[21:00] * stxShadow (~jens@ip-88-153-224-220.unitymediagroup.de) has joined #ceph
[21:03] * sjust1 (~sam@aon.hq.newdream.net) has joined #ceph
[21:03] * mkampe1 (~markk@aon.hq.newdream.net) has joined #ceph
[21:04] * sagewk (~sage@aon.hq.newdream.net) has joined #ceph
[21:09] * sjust (~sam@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[21:09] * mkampe (~markk@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[21:09] * sagewk1 (~sage@aon.hq.newdream.net) Quit (Ping timeout: 480 seconds)
[21:14] * stxShadow (~jens@ip-88-153-224-220.unitymediagroup.de) Quit (Read error: Connection reset by peer)
[21:28] * sage (~sage@dsl092-035-022.lax1.dsl.speakeasy.net) has joined #ceph
[21:31] * verwilst (~verwilst@dD5769628.access.telenet.be) has joined #ceph
[21:38] * tjikkun_ (~tjikkun@82-169-255-84.ip.telfort.nl) Quit (Quit: Ex-Chat)
[21:41] * stingray (~stingray@stingr.net) Quit (Quit: fuck this shit)
[21:54] * bchrisman (~Adium@c-76-103-130-94.hsd1.ca.comcast.net) has joined #ceph
[22:00] * bchrisman1 (~Adium@c-76-103-130-94.hsd1.ca.comcast.net) Quit (Ping timeout: 480 seconds)
[22:02] * stxShadow (~jens@ip-88-153-224-220.unitymediagroup.de) has joined #ceph
[22:08] * stxShadow (~jens@ip-88-153-224-220.unitymediagroup.de) Quit (Quit: bye bye !! )
[22:11] * groovious (~Adium@64-126-49-62.dyn.everestkc.net) Quit (Quit: Leaving.)
[22:18] * yehudasa (~yehudasa@aon.hq.newdream.net) has joined #ceph
[22:48] * verwilst (~verwilst@dD5769628.access.telenet.be) Quit (Quit: Ex-Chat)
[23:37] * tnt_ (~tnt@38.cust-D00.waldc.net) Quit (Ping timeout: 480 seconds)

These logs were automatically created by CephLogBot on irc.oftc.net using the Java IRC LogBot.