
On Thu, 5 Jul 2012, Rick Moen <rick@linuxmafia.com> wrote:
One of my Xen DomUs is getting memory corruption. I'm not sure why. I've replaced all the RAM in the system and run memtest86+.
memtest86+ (or memtest86), even if left running overnight, won't necessarily always find a bad stick of RAM.
True, but the fact that the problem appeared after installing a new kernel and hypervisor seems relevant. Also I've done things like stopping all DomUs and then just starting the problem one and got the same result. I presume that if it was a physical RAM issue then the mapping of DomU to RAM would be based on startup order and thus changing it would give the crashes to a different DomU. As for a physical RAM problem, if that's the case then I'll have to replace the system to avoid down-time. I've got an almost identical spare system so I just need to upgrade the RAM and run Memtest86+ for a day or two before swapping the disks. -- My Main Blog http://etbe.coker.com.au/ My Documents Blog http://doc.coker.com.au/