<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Checksumming Files to Find Bit-Rot</title>
	<atom:link href="http://www.linux-mag.com/id/8794/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.linux-mag.com/id/8794/</link>
	<description>Open Source, Open Standards</description>
	<lastBuildDate>Sat, 05 Oct 2013 13:48:18 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
	<item>
		<title>By: how to card count</title>
		<link>http://www.linux-mag.com/id/8794/#comment-1116675</link>
		<dc:creator>how to card count</dc:creator>
		<pubDate>Mon, 12 Aug 2013 04:52:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-1116675</guid>
		<description>I really learn a lot of good things in this &lt;a href=&quot;http://www.cardcountingtrainer.com/how-to-count-cards&quot; rel=&quot;nofollow&quot;&gt;how to card count&lt;/a&gt; blog and I am sharing it with my friends who are searching similar type of information from a lot of days. Hope so this post will be very helpful for everyone.</description>
		<content:encoded><![CDATA[<p>I really learn a lot of good things in this <a href="http://www.cardcountingtrainer.com/how-to-count-cards" rel="nofollow">how to card count</a> blog and I am sharing it with my friends who are searching similar type of information from a lot of days. Hope so this post will be very helpful for everyone.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: how to card count</title>
		<link>http://www.linux-mag.com/id/8794/#comment-1116669</link>
		<dc:creator>how to card count</dc:creator>
		<pubDate>Mon, 12 Aug 2013 04:49:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-1116669</guid>
		<description>What I want to know is why I should care? I mean, not to say that what you’ve got to say isn’t important, but I mean, it’s so generic. Everyone is just talking about this man. Give us something more, something that we can get behind so we can feel as passionately about it as you do.</description>
		<content:encoded><![CDATA[<p>What I want to know is why I should care? I mean, not to say that what you’ve got to say isn’t important, but I mean, it’s so generic. Everyone is just talking about this man. Give us something more, something that we can get behind so we can feel as passionately about it as you do.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: cars</title>
		<link>http://www.linux-mag.com/id/8794/#comment-988567</link>
		<dc:creator>cars</dc:creator>
		<pubDate>Sat, 08 Jun 2013 07:43:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-988567</guid>
		<description>Car shopping is stressful. Now that there are hundreds of makes and models 
to choose from, not to mention promotions and payment options, it&#039;s easy to become frustrated and stressed out. The information here will help make buying a car as easy and stress-free as possible.</description>
		<content:encoded><![CDATA[<p>Car shopping is stressful. Now that there are hundreds of makes and models<br />
to choose from, not to mention promotions and payment options, it&#8217;s easy to become frustrated and stressed out. The information here will help make buying a car as easy and stress-free as possible.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: cars</title>
		<link>http://www.linux-mag.com/id/8794/#comment-973389</link>
		<dc:creator>cars</dc:creator>
		<pubDate>Fri, 31 May 2013 02:14:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-973389</guid>
		<description>A little bit of knowledge goes a long way in all situations in life.
Buying a car is no different! That means you need to read advice from experts, 
as detailed below, to ensure that when you shop for that car, you really know 
what you&#039;re doing and how to get the best deal.</description>
		<content:encoded><![CDATA[<p>A little bit of knowledge goes a long way in all situations in life.<br />
Buying a car is no different! That means you need to read advice from experts,<br />
as detailed below, to ensure that when you shop for that car, you really know<br />
what you&#8217;re doing and how to get the best deal.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: GS test demo</title>
		<link>http://www.linux-mag.com/id/8794/#comment-839963</link>
		<dc:creator>GS test demo</dc:creator>
		<pubDate>Mon, 01 Apr 2013 06:39:24 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-839963</guid>
		<description>Checksumming Files to Find Bit-Rot &#124; Linux Magazine</description>
		<content:encoded><![CDATA[<p>Checksumming Files to Find Bit-Rot | Linux Magazine</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gaja Parvin</title>
		<link>http://www.linux-mag.com/id/8794/#comment-749025</link>
		<dc:creator>Gaja Parvin</dc:creator>
		<pubDate>Tue, 29 Jan 2013 10:42:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-749025</guid>
		<description>What I want to know is why I should care? I mean, not to say that what &lt;a href=&quot;http://www.4hairstyles.com&quot; rel=&quot;nofollow&quot;&gt;4hairstyles&lt;/a&gt; you’ve got to say isn’t important, but I mean, it’s so generic. Everyone is just talking about this man. Give us something more, something that we can get behind so we can feel as passionately about it as you do.</description>
		<content:encoded><![CDATA[<p>What I want to know is why I should care? I mean, not to say that what <a href="http://www.4hairstyles.com" rel="nofollow">4hairstyles</a> you’ve got to say isn’t important, but I mean, it’s so generic. Everyone is just talking about this man. Give us something more, something that we can get behind so we can feel as passionately about it as you do.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: igqmxkgsoo</title>
		<link>http://www.linux-mag.com/id/8794/#comment-144745</link>
		<dc:creator>igqmxkgsoo</dc:creator>
		<pubDate>Thu, 09 Feb 2012 13:41:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-144745</guid>
		<description>lL5c0g  &lt;a href=&quot;http://ziqlipwztbpz.com/&quot; rel=&quot;nofollow&quot;&gt;ziqlipwztbpz&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>lL5c0g  <a href="http://ziqlipwztbpz.com/" rel="nofollow">ziqlipwztbpz</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Aisyah</title>
		<link>http://www.linux-mag.com/id/8794/#comment-144021</link>
		<dc:creator>Aisyah</dc:creator>
		<pubDate>Wed, 08 Feb 2012 15:17:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-144021</guid>
		<description>The only rsaeon I&#8217;m not signing up for VaultPress is because the security scanning is only in the $40/m package.  If it was in the $15/m package I would sign up in a second.</description>
		<content:encoded><![CDATA[<p>The only rsaeon I&#8217;m not signing up for VaultPress is because the security scanning is only in the $40/m package.  If it was in the $15/m package I would sign up in a second.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Wittkamper</title>
		<link>http://www.linux-mag.com/id/8794/#comment-125865</link>
		<dc:creator>John Wittkamper</dc:creator>
		<pubDate>Fri, 20 Jan 2012 12:53:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-125865</guid>
		<description>Although the statistics part of the article are a little questionable, the &quot;fix&quot; is very though provoking. In addition as an individual who is learning python I really appreciate the way you described your thought processes in developing your python program. I also really appreciated the &quot;new&quot; 2.7 update.

These are the type of articles and responses I really like. Thanks and I look forward to more.</description>
		<content:encoded><![CDATA[<p>Although the statistics part of the article are a little questionable, the &#8220;fix&#8221; is very though provoking. In addition as an individual who is learning python I really appreciate the way you described your thought processes in developing your python program. I also really appreciated the &#8220;new&#8221; 2.7 update.</p>
<p>These are the type of articles and responses I really like. Thanks and I look forward to more.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Parrotlover77</title>
		<link>http://www.linux-mag.com/id/8794/#comment-123733</link>
		<dc:creator>Parrotlover77</dc:creator>
		<pubDate>Tue, 17 Jan 2012 21:19:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-123733</guid>
		<description>I want to first admit upfront that I am not a statistician and I make no claims about the statistics one way or the other by the author or any of its commentors. :-)  I&#039;m sure it is all correct!

That said, is bit rot, as described by the author, even a real world problem any more?  I was under the impression that disks were so advanced now that even a single rotted bit can be detected upon a read and marked as such, telling the operating system that the drive had a &quot;read error&quot; at which point you are notified of the defect and can take action.  So all it takes is that you just run a disk scan or defrag or do something else that causes a read across your entire drive and you will know.  It doesn&#039;t stop the bit from rotting, of course, but it is far from a silent killer.</description>
		<content:encoded><![CDATA[<p>I want to first admit upfront that I am not a statistician and I make no claims about the statistics one way or the other by the author or any of its commentors. :-)  I&#8217;m sure it is all correct!</p>
<p>That said, is bit rot, as described by the author, even a real world problem any more?  I was under the impression that disks were so advanced now that even a single rotted bit can be detected upon a read and marked as such, telling the operating system that the drive had a &#8220;read error&#8221; at which point you are notified of the defect and can take action.  So all it takes is that you just run a disk scan or defrag or do something else that causes a read across your entire drive and you will know.  It doesn&#8217;t stop the bit from rotting, of course, but it is far from a silent killer.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom</title>
		<link>http://www.linux-mag.com/id/8794/#comment-123369</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Tue, 17 Jan 2012 11:41:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-123369</guid>
		<description>find . -type f -exec md5sum {} \;</description>
		<content:encoded><![CDATA[<p>find . -type f -exec md5sum {} \;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom</title>
		<link>http://www.linux-mag.com/id/8794/#comment-123361</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Tue, 17 Jan 2012 11:37:01 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-123361</guid>
		<description>Please please please go find out something about statistics and then correct this article.

A UBER of 1 in 1E14 is a UBER of &lt;strong&gt;1E-14&lt;/strong&gt;, not 1E14.

If the UBER for one drive is 1E-14 then the UBER for two drives is &lt;strong&gt;still 1E-14&lt;/strong&gt;.  It doesn&#039;t matter how many disks you have or how big they are - you still expect to get one unrecoverable bit error for every 1E14 bits read from it/them.  The bit error rate is not in terms of how much data you have &lt;i&gt;stored&lt;/i&gt;, but how much data you &lt;i&gt;read&lt;/i&gt;.

The UBER is an average error rate, not a deterministic process.  That is, you &lt;strong&gt;expect&lt;/strong&gt; to get one bit error for every 1E14 bits you read from the disk.  If you read 1E15 bits from the disk, you expect ten bit errors.  It might be none, it might be ten, it might be &lt;i&gt;every single one of those 1E15 bits&lt;/i&gt; that are wrong, but if you do it over and over again you expect the average to be one out of every 1E14 bits.

To calculate the probability of k bit errors after reading N bits you have to use the &lt;a href=&quot;http://en.wikipedia.org/wiki/Binomial_distribution&quot; rel=&quot;nofollow&quot;&gt;binomial distribution&lt;/a&gt;.  GNU R is the only thing I can find that can calculate it at these sorts of probabilities and number of trials.  It shows that after reading 1E14 bits, the probability of at least one bit error is 63%, at lesat two is 26%, at least three is 8%, at least four is 2% and so on.  Alternatively, the probability of &lt;strong&gt;no&lt;/strong&gt; errors after reading 1E14 bits is 37%, &lt;strong&gt;not 0&lt;/strong&gt;.

No matter how much data you read, the probability of having encountered and error &lt;strong&gt;never reaches 1&lt;/strong&gt;.</description>
		<content:encoded><![CDATA[<p>Please please please go find out something about statistics and then correct this article.</p>
<p>A UBER of 1 in 1E14 is a UBER of <strong>1E-14</strong>, not 1E14.</p>
<p>If the UBER for one drive is 1E-14 then the UBER for two drives is <strong>still 1E-14</strong>.  It doesn&#8217;t matter how many disks you have or how big they are &#8211; you still expect to get one unrecoverable bit error for every 1E14 bits read from it/them.  The bit error rate is not in terms of how much data you have <i>stored</i>, but how much data you <i>read</i>.</p>
<p>The UBER is an average error rate, not a deterministic process.  That is, you <strong>expect</strong> to get one bit error for every 1E14 bits you read from the disk.  If you read 1E15 bits from the disk, you expect ten bit errors.  It might be none, it might be ten, it might be <i>every single one of those 1E15 bits</i> that are wrong, but if you do it over and over again you expect the average to be one out of every 1E14 bits.</p>
<p>To calculate the probability of k bit errors after reading N bits you have to use the <a href="http://en.wikipedia.org/wiki/Binomial_distribution" rel="nofollow">binomial distribution</a>.  GNU R is the only thing I can find that can calculate it at these sorts of probabilities and number of trials.  It shows that after reading 1E14 bits, the probability of at least one bit error is 63%, at lesat two is 26%, at least three is 8%, at least four is 2% and so on.  Alternatively, the probability of <strong>no</strong> errors after reading 1E14 bits is 37%, <strong>not 0</strong>.</p>
<p>No matter how much data you read, the probability of having encountered and error <strong>never reaches 1</strong>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mikeg2</title>
		<link>http://www.linux-mag.com/id/8794/#comment-87997</link>
		<dc:creator>mikeg2</dc:creator>
		<pubDate>Sun, 18 Dec 2011 15:05:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-87997</guid>
		<description>While your calculations about bit errors/TB are correct your interpretations are a bit off.  The expectation is 1 error per 12TB, but since bit errors are a stochastic process the number you will actually observe will follow a poisson distribution with the rate of 1 error/12TB.  The probability the you will get one or more bad bits out of 12TB (1-pr(0 bad bits) = 0.63.</description>
		<content:encoded><![CDATA[<p>While your calculations about bit errors/TB are correct your interpretations are a bit off.  The expectation is 1 error per 12TB, but since bit errors are a stochastic process the number you will actually observe will follow a poisson distribution with the rate of 1 error/12TB.  The probability the you will get one or more bad bits out of 12TB (1-pr(0 bad bits) = 0.63.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: sona</title>
		<link>http://www.linux-mag.com/id/8794/#comment-82225</link>
		<dc:creator>sona</dc:creator>
		<pubDate>Wed, 14 Dec 2011 10:17:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-82225</guid>
		<description>Then you compare the checksums of these three files before you read any data. If two of the three values are the same then you can assume that those two files are correct and &lt;a href=&quot;http://www.nexiumvsprilosec.net&quot; rel=&quot;nofollow&quot;&gt;nexium vs prilosec&lt;/a&gt; that the third file is incorrect resulting in it being replaced from one of the other copies.”</description>
		<content:encoded><![CDATA[<p>Then you compare the checksums of these three files before you read any data. If two of the three values are the same then you can assume that those two files are correct and <a href="http://www.nexiumvsprilosec.net" rel="nofollow">nexium vs prilosec</a> that the third file is incorrect resulting in it being replaced from one of the other copies.”</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joseph</title>
		<link>http://www.linux-mag.com/id/8794/#comment-62063</link>
		<dc:creator>Joseph</dc:creator>
		<pubDate>Tue, 29 Nov 2011 19:13:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-62063</guid>
		<description>This is a really interesting article and I plan on reading it in more detail later (when I get to reading the Extended file attributes article!) :)

I have one comment on the URE issue.
I don&#039;t claim to be an expert on bit rot, URE, or computer hardware. I do, however, know a little bit about BER (bit error rate) as it relates to my line of work. 
An error rate of 1 in 10^14 is usually an average! 
To interpret it, we imagine having a very large number of bits and counting all occurrences of URE in them. The error rate is then the number of URE divided by the (large number) of bits we&#039;ve read. The key word here is &lt;em&gt;large&lt;/em&gt;! For this measure to have any meaning, you must count a large number of bits. 
What I want to say is, in theory, if your drive consists of 3 bits (bear with me), and the manufacturer has reported an error rate of 1 in 10^14, there&#039;s nothing to prevent all 3 bits from having a URE. It&#039;s very unlikely but possible. 
The complete opposite is also possible: if you have a 12 TB hard drive and you read it bit by bit, you may find absolutely no URE&#039;s! If you&#039;ve read all of the 12 TB except one bit and found no URE&#039;s does NOT make you more likely to find the last bit being in error. What makes us usually think this way is a cognitive bias known as the &lt;a href=&quot;http://en.wikipedia.org/wiki/Gambler%27s_fallacy&quot; rel=&quot;nofollow&quot;&gt;Gambler&#039;s fallacy&lt;/a&gt;.

I&#039;m sorry if this seems like nitpicking or arrogant presumption on my part. I&#039;m not undermining your expertise in any way. Thank you for the great article!</description>
		<content:encoded><![CDATA[<p>This is a really interesting article and I plan on reading it in more detail later (when I get to reading the Extended file attributes article!) :)</p>
<p>I have one comment on the URE issue.<br />
I don&#8217;t claim to be an expert on bit rot, URE, or computer hardware. I do, however, know a little bit about BER (bit error rate) as it relates to my line of work.<br />
An error rate of 1 in 10^14 is usually an average!<br />
To interpret it, we imagine having a very large number of bits and counting all occurrences of URE in them. The error rate is then the number of URE divided by the (large number) of bits we&#8217;ve read. The key word here is <em>large</em>! For this measure to have any meaning, you must count a large number of bits.<br />
What I want to say is, in theory, if your drive consists of 3 bits (bear with me), and the manufacturer has reported an error rate of 1 in 10^14, there&#8217;s nothing to prevent all 3 bits from having a URE. It&#8217;s very unlikely but possible.<br />
The complete opposite is also possible: if you have a 12 TB hard drive and you read it bit by bit, you may find absolutely no URE&#8217;s! If you&#8217;ve read all of the 12 TB except one bit and found no URE&#8217;s does NOT make you more likely to find the last bit being in error. What makes us usually think this way is a cognitive bias known as the <a href="http://en.wikipedia.org/wiki/Gambler%27s_fallacy" rel="nofollow">Gambler&#8217;s fallacy</a>.</p>
<p>I&#8217;m sorry if this seems like nitpicking or arrogant presumption on my part. I&#8217;m not undermining your expertise in any way. Thank you for the great article!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: qoung</title>
		<link>http://www.linux-mag.com/id/8794/#comment-44291</link>
		<dc:creator>qoung</dc:creator>
		<pubDate>Fri, 18 Nov 2011 02:34:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-44291</guid>
		<description>I were a little bit familiar of this your broadcast offered brilliant clear idea &lt;a href=&quot;http://www.pelletmillguide.com/&quot; rel=&quot;nofollow&quot;&gt;wood pellet&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>I were a little bit familiar of this your broadcast offered brilliant clear idea <a href="http://www.pelletmillguide.com/" rel="nofollow">wood pellet</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark</title>
		<link>http://www.linux-mag.com/id/8794/#comment-12215</link>
		<dc:creator>Mark</dc:creator>
		<pubDate>Wed, 26 Oct 2011 13:21:01 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-12215</guid>
		<description>Or, you could spent some money on a decent card enabling you to run a RAID 5 *with* error correction.

Take a look on PAR2 (GNU) or ICE ECC (Windows).</description>
		<content:encoded><![CDATA[<p>Or, you could spent some money on a decent card enabling you to run a RAID 5 *with* error correction.</p>
<p>Take a look on PAR2 (GNU) or ICE ECC (Windows).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: frank1985</title>
		<link>http://www.linux-mag.com/id/8794/#comment-10037</link>
		<dc:creator>frank1985</dc:creator>
		<pubDate>Mon, 03 Oct 2011 05:32:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-10037</guid>
		<description>Huh, that never occured to me, but the 1e14 chance of a URE would be just for one device - the odds of a URE would compound the more drives you pile on.  So the URE odds for 2 drives = 1E14 x 1E14 = 1E28 following proper algebraic method.  One very important reason for having multiple, smaller drives in a RAID - you almost assure that you would never hit a URE, but of course you back up just in case, even if it&#039;s just to another SAN in another location.

In fact I believe the chance of an error would never hit one, since the odds of an error would be (1E14)^(infinity) or 1E(infinity), at which point the odds of an error would be as close to zero as to be a non-event for all intents and purposes.</description>
		<content:encoded><![CDATA[<p>Huh, that never occured to me, but the 1e14 chance of a URE would be just for one device &#8211; the odds of a URE would compound the more drives you pile on.  So the URE odds for 2 drives = 1E14 x 1E14 = 1E28 following proper algebraic method.  One very important reason for having multiple, smaller drives in a RAID &#8211; you almost assure that you would never hit a URE, but of course you back up just in case, even if it&#8217;s just to another SAN in another location.</p>
<p>In fact I believe the chance of an error would never hit one, since the odds of an error would be (1E14)^(infinity) or 1E(infinity), at which point the odds of an error would be as close to zero as to be a non-event for all intents and purposes.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: user31416</title>
		<link>http://www.linux-mag.com/id/8794/#comment-9666</link>
		<dc:creator>user31416</dc:creator>
		<pubDate>Wed, 06 Jul 2011 21:13:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-9666</guid>
		<description>@anandv
* does not work with subdirectories or space in filenames.
find . -type f -print0 &#124; xargs -0 md5sum</description>
		<content:encoded><![CDATA[<p>@anandv<br />
* does not work with subdirectories or space in filenames.<br />
find . -type f -print0 | xargs -0 md5sum</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lary Tobos</title>
		<link>http://www.linux-mag.com/id/8794/#comment-9662</link>
		<dc:creator>Lary Tobos</dc:creator>
		<pubDate>Tue, 05 Jul 2011 17:35:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.linux-mag.com/?p=8794#comment-9662</guid>
		<description>&quot;For example, you could make three copies of the checksum data and compute the checksum of these files. Then you compare the checksums of these three files before you read any data. If two of the three values are the same then you can assume that those two files are correct and that the third file is incorrect resulting in it being replaced from one of the other copies.&quot;

This is exactly what google does more or less with their data- couple of years ago, the idea was to take  a &quot;majority vote&quot; among 3, 5 and so on replicas.

PS: running scripts as &quot;root&quot; is NEVER a good ideea - you may screw up all the permissions on the files you act and they may end up owned by root and therefore &quot;invisible/untouchables&quot; to all the other developers.</description>
		<content:encoded><![CDATA[<p>&#8220;For example, you could make three copies of the checksum data and compute the checksum of these files. Then you compare the checksums of these three files before you read any data. If two of the three values are the same then you can assume that those two files are correct and that the third file is incorrect resulting in it being replaced from one of the other copies.&#8221;</p>
<p>This is exactly what google does more or less with their data- couple of years ago, the idea was to take  a &#8220;majority vote&#8221; among 3, 5 and so on replicas.</p>
<p>PS: running scripts as &#8220;root&#8221; is NEVER a good ideea &#8211; you may screw up all the permissions on the files you act and they may end up owned by root and therefore &#8220;invisible/untouchables&#8221; to all the other developers.</p>
]]></content:encoded>
	</item>
</channel>
</rss>