I had an interesting meeting today.
We had moved from Oracle/Solaris 8/Veritas/Xyratex to Oracle/Linux 2.4/ext3/Clariion.
Went from quad CPU Sparc 450Mhz to Qaud CPU Opteron 2.2Ghz.
I considered the ext3 the iffiest part of the move, but this is RH AS3 , no decent file systems available, at least for large data.
So the programmers started complaining about incredibly poor performance a month or so after the move.
Note: I was involved in NONE of their code, other than the foundation of the project 5 years ago.
And all the coders who worked on it until 6 months ago are gone.
So anyway, the new guy, who is a brilliant Perl programmer, but knows NOTHING of large data, is bitching about the terrible performance.
None of my tests show anything wrong, so I just watch his processing.
When they moved from Sparc to Opteron, they figured that had a bunch of CPU, so they ALSO killed 6 dual Xeon compute servers and centralized their CPU intensive runs on the single box.
At the same time they were doing the Oracle work.
Driving their load average to 20.
Blaming the system
SMACK!
Today was the day the manager of that group said he had no worries about performance anymore, that the system I designed and gave to him to use work great, and that he was sure there would be no performance problems in the future now that his people realized they were being silly. They've been running just fine for a couple of weeks now.