<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Spread it and forget</title>
	<atom:link href="http://blog.nigelpoulton.com/spread-it-and-forget/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.nigelpoulton.com/spread-it-and-forget/#utm_source=feed&amp;utm_medium=feed&amp;utm_campaign=feed</link>
	<description>with nigel poulton</description>
	<lastBuildDate>Tue, 20 Dec 2011 19:37:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Cleanur</title>
		<link>http://blog.nigelpoulton.com/spread-it-and-forget/comment-page-1/#comment-105</link>
		<dc:creator>Cleanur</dc:creator>
		<pubDate>Mon, 08 Jan 2007 12:32:52 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.rupturedmonkey.com/?p=56#comment-105</guid>
		<description>On the EVA chunk size is fixed at 128KB (256 blocks of 512 bytes each). This is often confused with the minimum PSEG size of 2MB. A PSEG is the minumu space the EVA can allocate internally when creating VDisks. Each PSeg being equal to 16 chunks (16 X 128KB = 2MB).</description>
		<content:encoded><![CDATA[<p>On the EVA chunk size is fixed at 128KB (256 blocks of 512 bytes each). This is often confused with the minimum PSEG size of 2MB. A PSEG is the minumu space the EVA can allocate internally when creating VDisks. Each PSeg being equal to 16 chunks (16 X 128KB = 2MB).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jesse (SanGod)</title>
		<link>http://blog.nigelpoulton.com/spread-it-and-forget/comment-page-1/#comment-104</link>
		<dc:creator>Jesse (SanGod)</dc:creator>
		<pubDate>Sat, 09 Dec 2006 04:18:02 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.rupturedmonkey.com/?p=56#comment-104</guid>
		<description>You know it&#039;s hilarious - I&#039;ve heard this story time again and I think it all comes down to one basic premise.

People just *LOVE* to overmanage their disks.

In the EMC world, people talk about wanting to know what physical disk each hypervolume is on, then when you start working on 8 and 12-way metavolumes, you&#039;re talking 16-24 back end disks for each lun presented forward.  (In a Raid-1 configuration that is, actually more if you&#039;re talking Raid-5)

Either way, in a cached array the only time you&#039;re dependant on back-end performance is

A&gt; Random Reads - unfortunately there is no way around a random-read.  It&#039;s going to wait for the disk.  But any intelligent array can pick and choose which disks are being read from based on activity level of the mirror pairs.

B&gt; Cache Saturation.   Yes, it does happen, even in boxes with 48-64G of cache.  (Though I don&#039;t see it happen that often)

In the EMC world, and I&#039;m guessing this applies to all high-end arrays, the write has been acknowledged back to the host long before the physical drive enters into the picture.  *THIS* is what actually makes a cached array faster than JBOD.  When write speeds are measured in nano-seconds and not miliseconds, all is good.

Most arrays also have some form of predictive read-ahead, based on odds that once you&#039;ve read X number of tracks, you&#039;re more likely to read the next Y tracks in series.

But yet, (and back to the point of the response)  I still see people trying desparately to micro-manage their disks.  Even in my company I have daily fights with our DBA because he wants to see two dozen 8G volumes presented instead of the 200G volume spread across many spindles on the back-end, despite the fact that the 8G will end up being slower because you can&#039;t use a &#039;scatter-gather&#039; read on a single volume.

In the EMC world the rule is to use sequential symmetrix device ID&#039;s for volumes which, by virtue of the way the Enginuity software lays out the disks by default, guarantees that you won&#039;t step on the same physicals.</description>
		<content:encoded><![CDATA[<p>You know it&#8217;s hilarious &#8211; I&#8217;ve heard this story time again and I think it all comes down to one basic premise.</p>
<p>People just *LOVE* to overmanage their disks.</p>
<p>In the EMC world, people talk about wanting to know what physical disk each hypervolume is on, then when you start working on 8 and 12-way metavolumes, you&#8217;re talking 16-24 back end disks for each lun presented forward.  (In a Raid-1 configuration that is, actually more if you&#8217;re talking Raid-5)</p>
<p>Either way, in a cached array the only time you&#8217;re dependant on back-end performance is</p>
<p>A&gt; Random Reads &#8211; unfortunately there is no way around a random-read.  It&#8217;s going to wait for the disk.  But any intelligent array can pick and choose which disks are being read from based on activity level of the mirror pairs.</p>
<p>B&gt; Cache Saturation.   Yes, it does happen, even in boxes with 48-64G of cache.  (Though I don&#8217;t see it happen that often)</p>
<p>In the EMC world, and I&#8217;m guessing this applies to all high-end arrays, the write has been acknowledged back to the host long before the physical drive enters into the picture.  *THIS* is what actually makes a cached array faster than JBOD.  When write speeds are measured in nano-seconds and not miliseconds, all is good.</p>
<p>Most arrays also have some form of predictive read-ahead, based on odds that once you&#8217;ve read X number of tracks, you&#8217;re more likely to read the next Y tracks in series.</p>
<p>But yet, (and back to the point of the response)  I still see people trying desparately to micro-manage their disks.  Even in my company I have daily fights with our DBA because he wants to see two dozen 8G volumes presented instead of the 200G volume spread across many spindles on the back-end, despite the fact that the 8G will end up being slower because you can&#8217;t use a &#8217;scatter-gather&#8217; read on a single volume.</p>
<p>In the EMC world the rule is to use sequential symmetrix device ID&#8217;s for volumes which, by virtue of the way the Enginuity software lays out the disks by default, guarantees that you won&#8217;t step on the same physicals.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nigel Poulton</title>
		<link>http://blog.nigelpoulton.com/spread-it-and-forget/comment-page-1/#comment-103</link>
		<dc:creator>Nigel Poulton</dc:creator>
		<pubDate>Thu, 07 Dec 2006 20:35:32 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.rupturedmonkey.com/?p=56#comment-103</guid>
		<description>Thanks for the great comments guys.

JM I agree that the isolation game is a non starter in large environments.  Many of the environments Ive worked in recently are very large and one of the commonalities in these environments is that nobody seems to have a clue what storage requirements are around the corner â€“ you can turn up at work one day and find a request for adding 10 new server with 800GB each that needed to be done yesterday ;-)  Thatâ€™s one of the things I was getting at when I said â€œthis approach makes it much simpler to manage storage especially in dynamic environmentsâ€.

Ive also worked quite a lot with EVAs in the past and initially did try to separate workloads to different spindles â€“ one big disk group for random and a few smaller 8 disk (single RSS) disk groups for sequential.  But because of the limited architecture on the controllers I soon stopped doing this and just went for the single large disk group approach.

I remember first using a Compaq EVA and thinking that it was a breath of fresh air and ahead of its time with some of its technology.</description>
		<content:encoded><![CDATA[<p>Thanks for the great comments guys.</p>
<p>JM I agree that the isolation game is a non starter in large environments.  Many of the environments Ive worked in recently are very large and one of the commonalities in these environments is that nobody seems to have a clue what storage requirements are around the corner â€“ you can turn up at work one day and find a request for adding 10 new server with 800GB each that needed to be done yesterday <img src='http://blog.nigelpoulton.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />   Thatâ€™s one of the things I was getting at when I said â€œthis approach makes it much simpler to manage storage especially in dynamic environmentsâ€.</p>
<p>Ive also worked quite a lot with EVAs in the past and initially did try to separate workloads to different spindles â€“ one big disk group for random and a few smaller 8 disk (single RSS) disk groups for sequential.  But because of the limited architecture on the controllers I soon stopped doing this and just went for the single large disk group approach.</p>
<p>I remember first using a Compaq EVA and thinking that it was a breath of fresh air and ahead of its time with some of its technology.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Liho</title>
		<link>http://blog.nigelpoulton.com/spread-it-and-forget/comment-page-1/#comment-102</link>
		<dc:creator>Liho</dc:creator>
		<pubDate>Thu, 07 Dec 2006 18:11:49 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.rupturedmonkey.com/?p=56#comment-102</guid>
		<description>Itâ€™s possible to create disk groups in EVAs to spread data over all disks in dedicated disk group.  Minimal number of disks in such group is 8. HP engineer told me that customers are using one or two groups only. Thus itâ€™s possible to isolate workloads but nobody does it.

I also like this idea to spread similar workload over common disks, but here is one more issue. Chunk size is 2MB in case of EVAs. Thus such striping is good for Oracle and similar databases only. Remember Oracleâ€™s SAME strategy (Stripe And Mirror Everything), where recommended chuck size was 1MB... Iâ€™ve heard that other applications became slower after migration to EVA.

EVAs as another modular storages donâ€™t have a lot of opportunities to isolate workloads: use different controllers and different disks groups only.</description>
		<content:encoded><![CDATA[<p>Itâ€™s possible to create disk groups in EVAs to spread data over all disks in dedicated disk group.  Minimal number of disks in such group is 8. HP engineer told me that customers are using one or two groups only. Thus itâ€™s possible to isolate workloads but nobody does it.</p>
<p>I also like this idea to spread similar workload over common disks, but here is one more issue. Chunk size is 2MB in case of EVAs. Thus such striping is good for Oracle and similar databases only. Remember Oracleâ€™s SAME strategy (Stripe And Mirror Everything), where recommended chuck size was 1MB&#8230; Iâ€™ve heard that other applications became slower after migration to EVA.</p>
<p>EVAs as another modular storages donâ€™t have a lot of opportunities to isolate workloads: use different controllers and different disks groups only.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JM</title>
		<link>http://blog.nigelpoulton.com/spread-it-and-forget/comment-page-1/#comment-101</link>
		<dc:creator>JM</dc:creator>
		<pubDate>Thu, 07 Dec 2006 16:10:19 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.rupturedmonkey.com/?p=56#comment-101</guid>
		<description>I agree that ideally there&#039;s a balance to be struck between spreading and isolating workload.   However, I would argue that at some point, when you end up managing enough storage it becomes impractical (and expensive!) to play the isolation game.  Everyone thinks their application is more critical than the next and should be isolated, DBAs want separate spindles for everything, etc.  If you&#039;re dealing with a single application or just a handful of applications and one or two storage arrays, fine tuning is probably worth your time, but when you move into larger environments with hundreds of applications the &quot;remove your brain&quot; method of managing storage becomes a lot more attractive.

Array vendors also seem to be heading this direction.  Look at 3Par who spreads LUNs of differing RAID protection across the same spindles.  HP EVA spreads all LUNs of the same RAID level across as many disks as you ask it to.  Pillar will not only spread your IO across as many disks as it can, but it&#039;ll try to optimize your IO by placing high QoS LUNs on outer tracks of disks.  They&#039;re all trying to do this work for us, and I for one appreciate it.  The geek in me wants to tune every app by hand, but it can be a monumental task.   There will always be exceptions for highly critical or IO intensive apps, but in general I&#039;m a proponent of spreading IO as far and wide as possible and letting your arrays do the best job they can.</description>
		<content:encoded><![CDATA[<p>I agree that ideally there&#8217;s a balance to be struck between spreading and isolating workload.   However, I would argue that at some point, when you end up managing enough storage it becomes impractical (and expensive!) to play the isolation game.  Everyone thinks their application is more critical than the next and should be isolated, DBAs want separate spindles for everything, etc.  If you&#8217;re dealing with a single application or just a handful of applications and one or two storage arrays, fine tuning is probably worth your time, but when you move into larger environments with hundreds of applications the &#8220;remove your brain&#8221; method of managing storage becomes a lot more attractive.</p>
<p>Array vendors also seem to be heading this direction.  Look at 3Par who spreads LUNs of differing RAID protection across the same spindles.  HP EVA spreads all LUNs of the same RAID level across as many disks as you ask it to.  Pillar will not only spread your IO across as many disks as it can, but it&#8217;ll try to optimize your IO by placing high QoS LUNs on outer tracks of disks.  They&#8217;re all trying to do this work for us, and I for one appreciate it.  The geek in me wants to tune every app by hand, but it can be a monumental task.   There will always be exceptions for highly critical or IO intensive apps, but in general I&#8217;m a proponent of spreading IO as far and wide as possible and letting your arrays do the best job they can.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

