<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Tools for Storage Monitoring: iostat</title>
	<atom:link href="http://www.linux-mag.com/id/7906/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.linux-mag.com/id/7906/</link>
	<description>Open Source, Open Standards</description>
	<lastBuildDate>Sat, 05 Oct 2013 13:48:18 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
	<item>
		<title>By: eavedrop44</title>
		<link>http://www.linux-mag.com/id/7906/#comment-631299</link>
		<dc:creator>eavedrop44</dc:creator>
		<pubDate>Fri, 07 Dec 2012 17:12:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/id/7906/#comment-631299</guid>
		<description>Linux Sys Admin jobs for almost the entire last 8 years. Mostly, thses have beeen as thin on the ground as thin ice, and now there is absolutely nothing. So yes, I too am having a VERY tough time. &lt;a href=&quot;http://www.adventureeducationinc.com&quot; rel=&quot;nofollow&quot;&gt;adventureeducationinc&lt;/a&gt;
&lt;a href=&quot;http://www.backtoschoolkarate.com&quot; rel=&quot;nofollow&quot;&gt;backtoschoolkarate&lt;/a&gt;
&lt;a href=&quot;http://www.biggereducation.com&quot; rel=&quot;nofollow&quot;&gt;biggereducation&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>Linux Sys Admin jobs for almost the entire last 8 years. Mostly, thses have beeen as thin on the ground as thin ice, and now there is absolutely nothing. So yes, I too am having a VERY tough time. <a href="http://www.adventureeducationinc.com" rel="nofollow">adventureeducationinc</a><br />
<a href="http://www.backtoschoolkarate.com" rel="nofollow">backtoschoolkarate</a><br />
<a href="http://www.biggereducation.com" rel="nofollow">biggereducation</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: eavedrop44</title>
		<link>http://www.linux-mag.com/id/7906/#comment-525199</link>
		<dc:creator>eavedrop44</dc:creator>
		<pubDate>Thu, 01 Nov 2012 08:08:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/id/7906/#comment-525199</guid>
		<description>Well, this can become a good basic course on parallel programming if the author continues and goes deeper talking about real issues that makes.&lt;a href=&quot;http://www.traveling-inn.com&quot; rel=&quot;nofollow&quot;&gt;traveling-inn&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>Well, this can become a good basic course on parallel programming if the author continues and goes deeper talking about real issues that makes.<a href="http://www.traveling-inn.com" rel="nofollow">traveling-inn</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pascoff</title>
		<link>http://www.linux-mag.com/id/7906/#comment-8798</link>
		<dc:creator>pascoff</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/id/7906/#comment-8798</guid>
		<description>&lt;p&gt;Hi there,&lt;/p&gt;
&lt;p&gt;one real problem I see with tools for storage monitoring is lack of them for finding the real &quot;abuser&quot; in your system when you have a IO bottleneck. For example, with iostat you can see the whole performance goes to hell on a busy system but you can not see which exactly process for example is doing some nasty IO. Of course there are some other tools like atop that can pin down the real &quot;abuser&quot; but again there are some limitations like you need a recent kernel version with some patches to be able to use this utility.&lt;/p&gt;
&lt;p&gt;I wonder, as the author does, how come there are no good and proven tools for IO monitoring like ps, top, etc. for process monitoring ...
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Hi there,</p>
<p>one real problem I see with tools for storage monitoring is lack of them for finding the real &#8220;abuser&#8221; in your system when you have a IO bottleneck. For example, with iostat you can see the whole performance goes to hell on a busy system but you can not see which exactly process for example is doing some nasty IO. Of course there are some other tools like atop that can pin down the real &#8220;abuser&#8221; but again there are some limitations like you need a recent kernel version with some patches to be able to use this utility.</p>
<p>I wonder, as the author does, how come there are no good and proven tools for IO monitoring like ps, top, etc. for process monitoring &#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: laytonjb</title>
		<link>http://www.linux-mag.com/id/7906/#comment-8799</link>
		<dc:creator>laytonjb</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/id/7906/#comment-8799</guid>
		<description>&lt;p&gt;@pascoff,&lt;/p&gt;
&lt;p&gt;Sounds like we belong to the same tribe - the tribe of &quot;we need better tools!&quot;.&lt;/p&gt;
&lt;p&gt;While I&#039;m not a &quot;coder&quot; in the literal sense of the word (all my coding was around my research area with is aeronautics and astronautics), I have started down the path of trying to put something together like you mentioned - finding the &quot;abuser&quot;. I think with a combination of SystemTap and some other things, a simple tool can be created to monitor the system and when certain conditions happen, such as a high load, the tool can find the process that is driving the problems. I&#039;m just beginning some research into what the pieces that go into the tool and how they get glued together.&lt;/p&gt;
&lt;p&gt;I&#039;m not sure what to do once you identify the rogue process. I thought about just killing the rogue process but that&#039;s way to draconian. I also thought about trying to throttle the rogue process so that the load (or whatever the measurement is), doesn&#039;t go crazy. But I think for a while the default will be just to inform the admin of the problem and let them determine the course of action.&lt;/p&gt;
&lt;p&gt;But this tool will be slow in coming since my day job interferes with real work time.&lt;/p&gt;
&lt;p&gt;Thanks for the post and comments!&lt;/p&gt;
&lt;p&gt;Jeff
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>@pascoff,</p>
<p>Sounds like we belong to the same tribe &#8211; the tribe of &#8220;we need better tools!&#8221;.</p>
<p>While I&#8217;m not a &#8220;coder&#8221; in the literal sense of the word (all my coding was around my research area with is aeronautics and astronautics), I have started down the path of trying to put something together like you mentioned &#8211; finding the &#8220;abuser&#8221;. I think with a combination of SystemTap and some other things, a simple tool can be created to monitor the system and when certain conditions happen, such as a high load, the tool can find the process that is driving the problems. I&#8217;m just beginning some research into what the pieces that go into the tool and how they get glued together.</p>
<p>I&#8217;m not sure what to do once you identify the rogue process. I thought about just killing the rogue process but that&#8217;s way to draconian. I also thought about trying to throttle the rogue process so that the load (or whatever the measurement is), doesn&#8217;t go crazy. But I think for a while the default will be just to inform the admin of the problem and let them determine the course of action.</p>
<p>But this tool will be slow in coming since my day job interferes with real work time.</p>
<p>Thanks for the post and comments!</p>
<p>Jeff</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mossholderm</title>
		<link>http://www.linux-mag.com/id/7906/#comment-8800</link>
		<dc:creator>mossholderm</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/id/7906/#comment-8800</guid>
		<description>&lt;p&gt;@pascoff , One tool that helps is iotop. &lt;/p&gt;
&lt;p&gt;http://guichaz.free.fr/iotop/&lt;br /&gt;
=====&lt;br /&gt;
iotop does for I/O usage what top(1) does for CPU usage. It watches I/O usage information output by the Linux kernel and displays a table of current I/O usage by processes on the system. It is handy for answering the question &quot;Why is the disk churning so much?&quot;.&lt;br /&gt;
=====
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>@pascoff , One tool that helps is iotop. </p>
<p><a href="http://guichaz.free.fr/iotop/" rel="nofollow">http://guichaz.free.fr/iotop/</a><br />
=====<br />
iotop does for I/O usage what top(1) does for CPU usage. It watches I/O usage information output by the Linux kernel and displays a table of current I/O usage by processes on the system. It is handy for answering the question &#8220;Why is the disk churning so much?&#8221;.<br />
=====</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: wcorey</title>
		<link>http://www.linux-mag.com/id/7906/#comment-8801</link>
		<dc:creator>wcorey</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/id/7906/#comment-8801</guid>
		<description>&lt;p&gt;I think the real challenge is not so much expressing the raw data but expressing information. In other words, depicting when a channel is saturated, perhaps when a drive is failing, when the queue depth is outside of normal and service time is outside of normal and io volume in req/sec and bytes/sec are outside of normal. Implicit in this is for the tool/system to determine what is &#039;normal&#039;. To put this simply, we are awash with data but starved for information.
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>I think the real challenge is not so much expressing the raw data but expressing information. In other words, depicting when a channel is saturated, perhaps when a drive is failing, when the queue depth is outside of normal and service time is outside of normal and io volume in req/sec and bytes/sec are outside of normal. Implicit in this is for the tool/system to determine what is &#8216;normal&#8217;. To put this simply, we are awash with data but starved for information.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: laytonjb</title>
		<link>http://www.linux-mag.com/id/7906/#comment-8802</link>
		<dc:creator>laytonjb</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/id/7906/#comment-8802</guid>
		<description>&lt;p&gt;@wcorey,&lt;/p&gt;
&lt;p&gt;Great comment and I agree with you. Determining what is &quot;normal&quot; is one thing but we must also determine what is &quot;acceptable&quot;. In addition, I think we can also include what is &quot;maximum&quot;. So is normal some percentage of the maximum? Can that normal fluctuate? &lt;/p&gt;
&lt;p&gt;I personally think we also need to focus on what is acceptable. For example, is it OK for an NFS server to have a very high peak load where the system apparently freezes for a few seconds but then returns to &quot;normal&quot;. This behavior might be acceptable. That is, going beyond the normal levels for a small period of time. But then again, if it happens too often it may not be acceptable.&lt;/p&gt;
&lt;p&gt;Sounds like a machine learning or AI kind of problem to me :)  Maybe this is a way for AI to make a come-back! :)&lt;/p&gt;
&lt;p&gt;Again - thanks for the insightful comments.&lt;/p&gt;
&lt;p&gt;Jeff
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>@wcorey,</p>
<p>Great comment and I agree with you. Determining what is &#8220;normal&#8221; is one thing but we must also determine what is &#8220;acceptable&#8221;. In addition, I think we can also include what is &#8220;maximum&#8221;. So is normal some percentage of the maximum? Can that normal fluctuate? </p>
<p>I personally think we also need to focus on what is acceptable. For example, is it OK for an NFS server to have a very high peak load where the system apparently freezes for a few seconds but then returns to &#8220;normal&#8221;. This behavior might be acceptable. That is, going beyond the normal levels for a small period of time. But then again, if it happens too often it may not be acceptable.</p>
<p>Sounds like a machine learning or AI kind of problem to me :)  Maybe this is a way for AI to make a come-back! :)</p>
<p>Again &#8211; thanks for the insightful comments.</p>
<p>Jeff</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: shankar_k_e</title>
		<link>http://www.linux-mag.com/id/7906/#comment-8803</link>
		<dc:creator>shankar_k_e</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/id/7906/#comment-8803</guid>
		<description>&lt;p&gt;I feel that you can hook onto SystemTap Api&#039;s to figure out whats going on in the system or to use iotop to figure out which user is hogging the machine.
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>I feel that you can hook onto SystemTap Api&#8217;s to figure out whats going on in the system or to use iotop to figure out which user is hogging the machine.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: kappaj</title>
		<link>http://www.linux-mag.com/id/7906/#comment-8804</link>
		<dc:creator>kappaj</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/id/7906/#comment-8804</guid>
		<description>&lt;p&gt;Part of my gripe is the &quot;unknown&quot; when starting out.&lt;br /&gt;
I have Sun X4100 boxes with 2 CPU (dual core) giving me 4 cpu&#039;s with total of 8Gig memory. 4 disk in raid 0-1 config.&lt;br /&gt;
During heavy loads I have major MySQL transaction issues.&lt;br /&gt;
Looking at SAR and IOStat, I see %iowait values between 15 and 23.&lt;br /&gt;
Now is that good or bad - how do I determine what a good/acceptable/bad value for %iowait is...&lt;br /&gt;
Anybody with some pointers?
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Part of my gripe is the &#8220;unknown&#8221; when starting out.<br />
I have Sun X4100 boxes with 2 CPU (dual core) giving me 4 cpu&#8217;s with total of 8Gig memory. 4 disk in raid 0-1 config.<br />
During heavy loads I have major MySQL transaction issues.<br />
Looking at SAR and IOStat, I see %iowait values between 15 and 23.<br />
Now is that good or bad &#8211; how do I determine what a good/acceptable/bad value for %iowait is&#8230;<br />
Anybody with some pointers?</p>
]]></content:encoded>
	</item>
</channel>
</rss>