IWETHEY v. 0.3.0 | TODO
1,095 registered users | 0 active users | 0 LpH | Statistics
Login | Create New User
IWETHEY Banner

Welcome to IWETHEY!

New Merge/Purge On Linux
MP on Unix VS MP on the IBM mainframe is an apples to
oranges process with apples to apples results.

MF programming tries to limit the amount of memory used.
Most processes try to work through data sequentially
since tape is cheaper than disk and VERY large files
often are tape only. Very large in this case is greater than
44 GB because that is the native mainframe single file
limit. You can dance around and create much larger files,
but it can be painful. Something about max number of extents
VS size of disk packs.

We have software on the MF that makes disk pretend to be
tape. So it it much faster than tape but still forces
certain sequential programming practices.

In our case, we have a process that we run twice a month.
If the MF is empty, it takes about 30 hours, plus time to
move the files to and from, another 12 hours or so.

But the MF is almost never empty. So it usually takes 3 to
5 DAYS.

We bought another MP package that run under NT, Unix, and
Linux. The reason I was so anxious to get the Opterons
working is that this process is VERY cpu and disk intensive.
It also will consume all the memory I can give it for sort
buffer.

To achieve approximately the same results on Linux it takes
8.5 hours to run. With no transfer overhead since the files
started on the Unix side.

I say approximately because it finds some matches the old
software missed, and misses some the old software found.
This is an art, with knobs to adjust depending on the type
of matching you want.

So NOW I can buy 2 dual Opteron IBM 335s to install for real
production now that I've proved it. Wheeeeeeee!

If I want, I can pay a bunch more and parallize some steps
of the process and knock it down to about 4 hours. But it
seems like overkill and not worth paying for it.

I originally tested under NT. It worked but was painful
to control / monitor. Good thing they offered it on Linux
as well.

Hey Lincoln: Feel like telling me I'm full of shit again?
New " Gleefully participating in the heat death of the Universe!
New Not Done Yet are you (Now with EIDE and SCSI)
EIDE eServer 325

 
Base Model Features and SpecificationsPart NoUnit PriceQuantityPrice
eServer 325
Form factorRack
Processor typeOpteron
Processor speed2200 MHz
Processor cache1024 KB
Hard drive typeEIDE
Memory1024 MB
Maximum memory12288 MB
Optical drive24X CD-ROM
883562X$3,389.001$3,389.00
 
Accessories and Options SelectedPart NoUnit PriceQuantityPrice
80GB 7200rpm EIDE ATA/100 HDD09N4226$189.001$189.00
eServer 325 Opteron processor model 24813N0701$1,499.001$1,499.00
No Internal RAID32P9669$0.001$0.00
1GB PC2700 CL2.5 ECC DDR SDRAM RDIMM73P2267$619.004$2,476.00
Warranty service upgrade; 3 year onsite repair 24x7x4 hour69P9507$599.001$599.00
  Total:$8152.00


That is without any taxes or shipping. These prices are a bit out of date, as IBM hasn't updated the AMD stuff for the 325 is a bit. SO it'll prolly be cheaper... and you can prolly get the 250GB drives as well.
SCSI eServer 325 
Base Model Features and SpecificationsPart NoUnit PriceQuantityPrice
eServer 325
Form factorRack
Processor typeOpteron
Processor speed2200 MHz
Processor cache1024 KB
Hot swap HDDYes
Memory1024 MB
Maximum memory12288 MB
Optical drive24X CD-ROM
883561X$3,399.001$3,399.00
 
Accessories and Options SelectedPart NoUnit PriceQuantityPrice
Integrated Mirroring - 2 HDD's required01R1356$0.001$0.00
eServer 325 Opteron processor model 24813N0701$1,499.001$1,499.00
IBM 73.4 GB 10K rpm Ultra320 SCSI Hot-Swap HDD32P0727$439.002$878.00
Warranty service upgrade; 3 year onsite repair 24x7x4 hour69P9507$599.001$599.00
1GB PC2700 CL2.5 ECC DDR SDRAM RDIMM73P2267$619.004$2,476.00
  Total:$8851.00


Personally iffn it is only $700, I'd go with the u320 Mirrored Drives meself.
--
[link|mailto:greg@gregfolkert.net|greg],
[link|http://www.iwethey.org/ed_curry|REMEMBER ED CURRY!] @ iwethey

"I told my doctor that everyone hates me. He said I was being ridiculous, everyone hasn't met me yet." -Rodney Dangerfield
Expand Edited by folkert April 5, 2004, 09:27:58 AM EDT
Expand Edited by folkert April 5, 2004, 09:46:25 AM EDT
Expand Edited by folkert April 5, 2004, 10:15:18 AM EDT
Expand Edited by folkert April 5, 2004, 10:17:33 AM EDT
New The disks are too small
I figure I'll get a 3Ware with an external case.
I need about a TB of usable space for MP tmp files.
And the 3Ware / SATA is FAST!!!

Note: I just discovered that our current IBM 345 with the
optional RAID controller and 6 disks doing a RAID 10 is
HORRIBLE! I don't know if it is the setup or an intrinsic
limitation (yet), but my Bonnie++ tests show it to be about
5 time slower than the nearest machine, and 8 times slower than
my 3Ware SATA setup.

We have 3 of these that were setup at the same time. I've tested
2 of them and they performance the same.

I didn't set it up. The guy who did is gone.

You are going to hate me for this one.

These are the systems I was testing SAP-FB on!!!!!!!

This means I consider the test invalid and I was forced
into NT SQL/Server for no good reason. ARRRGGGGG!!!!

6 months of pain and suffering because I was told to leave
the systems group alone when they set stuff up so I didn't
QC / performance test then. The weren't to be my systems
at all. I had 3 days to install and test SAP-DB then they got
handed to the final users.

Never again.
New Try the SAP-DB stuff on
The Opterons now. Good a time as any.
--
[link|mailto:greg@gregfolkert.net|greg],
[link|http://www.iwethey.org/ed_curry|REMEMBER ED CURRY!] @ iwethey

"I told my doctor that everyone hates me. He said I was being ridiculous, everyone hasn't met me yet." -Rodney Dangerfield
New RAID 10 == Nice for Data Protection...
But very sucky for performance.

I'd have done something else, but then I always do thing differently.

No biggy either. I knew something was up.

--
[link|mailto:greg@gregfolkert.net|greg],
[link|http://www.iwethey.org/ed_curry|REMEMBER ED CURRY!] @ iwethey

"I told my doctor that everyone hates me. He said I was being ridiculous, everyone hasn't met me yet." -Rodney Dangerfield
New Explain
This should be the BEST of both worlds.

Many stripes across multiple disks while simulatenous mirroring
so reads can be satisfied by alternating disk. No RAID5 calcluation
or parity read/write penalty.

There should be NOTHING better than RAID 0, which means no
protection.
New RAID 1 can beat RAID 0
At least with some implementations if you plan to read much more often than write, it will.

RAID 0 has faster writes because you stripe the writes between 2 disks. RAID 1 on writing behaviour is pretty close to just having a single disk because it writes to both.

But RAID 1 can choose to stripe reads as well, and can ignore whatever is on the other disk. This lets it dynamically distribute the reading loads between the two disks, giving it better read latency than RAID 0 which has no choice about what disk to access a particular piece of data from.

Whether RAID 1 outperforms RAID 0 in practice depends heavily on your workload and RAID implementation. I know that it can because my boss at a previous job tested both on Linux 2.2 with software RAID and found that RAID 1 was faster, couldn't believe the result, and figured out how it is possible by reading the source-code.

Cheers,
Ben
To deny the indirect purchaser, who in this case is the ultimate purchaser, the right to seek relief from unlawful conduct, would essentially remove the word consumer from the Consumer Protection Act
- [link|http://www.techworld.com/opsys/news/index.cfm?NewsID=1246&Page=1&pagePos=20|Nebraska Supreme Court]
New Not always so.
With RAID 10, you need to have FULL redundancy. 2 Controllers, 4 Disk (2 on each) for it to be fast. Basically duplicate data paths to make it faster.

Why would you want 1 controller with 4 disk? I am assuming you have an LX IBM ServeRAID card right? Meaning one channel. You have the data going to 2 disks first and then the other 2 second. On a single channel this makes it SLOW SLOW on Writes. No matter if you were on Raid 0 or not. Reads, different story, the controller will just ask the drives with the heads closest to get the data.

BTW, I am not talking about vpaths either. Seperate Bus for seperate instance of data. RAID-5 is different, reads and writes are nominal, but seperate arrays on seperate controllers (maybe just channels) in RAID-50, would beat RAID-10 in both reads and writes.

Of course this all changes when you only have one drive per channel, the 3ware stuff IOW.

Think about this, u320 is actually u80 with auto-matic hardware encoding (u160 uses the same type of blackmagic, but fewer tokens) the clocking is still the same. That is why u320 drives work on u160 and u80 channels. When it comes down to it, the scsi channel gets saturated quite easily in the different configs, unless they are designed specifically for the purpose at hand.

Whereas the Seperate Channel for each drive methodology is only limited by the controllers fabric and main IO capability.

Now, if this was SSA, things would be different, from a stand point of stacking rather than saturation, the SSA controller has two ways to access any device at a time... but still serial is still serial.
--
[link|mailto:greg@gregfolkert.net|greg],
[link|http://www.iwethey.org/ed_curry|REMEMBER ED CURRY!] @ iwethey

"I told my doctor that everyone hates me. He said I was being ridiculous, everyone hasn't met me yet." -Rodney Dangerfield
New Funny you should say something about Disk-as-Tape
[link|http://www.internetnews.com/storage/article.php/3335661|EMC introduces Clariion ATA] and [link|http://www.emc.com/news/press_releases/viewUS.jsp?id=2160|The real PR from EMC] and [link|http://www.emc.com/products/systems/clariion_disk/|the product homepage]

Clearly We could have seen this a while ago. As the prices of ATA Storage GO DOWN DOWN DOWN and the size goes UP UP UP.

Then you can mirror it to an offsite storage too while you are at it.


--
[link|mailto:greg@gregfolkert.net|greg],
[link|http://www.iwethey.org/ed_curry|REMEMBER ED CURRY!] @ iwethey

"I told my doctor that everyone hates me. He said I was being ridiculous, everyone hasn't met me yet." -Rodney Dangerfield
New Could of seen it? DID!
[link|http://z.iwethey.org/forums/render/content/show?contentid=122426|http://z.iwethey.org...?contentid=122426]
New Speaking of optimization ...
[link|http://www.computerworld.com/departments/opinions/sharktank/0,4885,91868,00.html| Now THAT'S good programming]
===

Implicitly condoning stupidity since 2001.
New Re: Merge/Purge On Linux

Hey Lincoln: Feel like telling me I'm full of shit again?

I will at every opportunity you give me.
lincoln
"Windows XP has so many holes in its security that any reasonable user will conclude it was designed by the same German officer who created the prison compound in "Hogan's Heroes." - Andy Ihnatko, Chicago Sun-Times
[link|mailto:bconnors@ev1.net|contact me]
New I'm trying!
New And notice that I didn't here?
Because you started off admitting that you were doing an "apples to apples" versus "apples to oranges" comparison.

I'd still like to see a true "apples to apples" comparison between the boxes: ONLY the same job running ( so the cpu(s) aren't giving cycles to other apps ), break the files up to equivalent sizes ( to overcome the limitations on a single file size on the mainframe - guess IBM shoulda known several years ago that people want to have files over 50 gig in size ), the code as identical as possible, hardware optimized in as similar a configuration, and then drop the starter flag and let 'em go.

Remember: [link|http://z.iwethey.org/forums/render/content/show?contentid=150144|if it weren't for this significant event], we probably wouldn't be here right now.
lincoln
"Windows XP has so many holes in its security that any reasonable user will conclude it was designed by the same German officer who created the prison compound in "Hogan's Heroes." - Andy Ihnatko, Chicago Sun-Times
[link|mailto:bconnors@ev1.net|contact me]
New Pure apples to apples?
I did it a while back, at least as best I could.
I had the same software running on the MF and Linux.
I seem to recall I was about 5 times faster.

But now we are going to different software. So it
will always be apples to oranges. No point in me
installing the Linux version of the software we
will not buy.

Also, I just realized that about 80% of the time of
the MP process is creating reports. Reports that
we may not have been getting from the previous
vendor. I can probably cut another hour or 2
for reports we weren't getting.

Also, the IO wait time can be helped. I am going
against RAID 10 disk for temp files. I should just
have a pure RAID 0 stripe.

Gotta reconfigure the disk for that one.

Those type of optimizations will NEVER be done on
the mainframe.

RAID 0? Not on the big EMC array!
New Cut off an hour
9.5 hour process went to 8.5.
Used RAID 0 disks for temp space.
My bonnie++ write rate went from 80MB a second to 300MB per second. But my read read went down a bit, proving alternative reads on mirrored disks work.
I won't keep it like this, too scary.
It is pure temp, but I still don't want to have to recover from a failure.
     Merge/Purge On Linux - (broomberg) - (16)
         " Gleefully participating in the heat death of the Universe! -NT - (Ashton)
         Not Done Yet are you (Now with EIDE and SCSI) - (folkert) - (6)
             The disks are too small - (broomberg) - (5)
                 Try the SAP-DB stuff on - (folkert)
                 RAID 10 == Nice for Data Protection... - (folkert) - (3)
                     Explain - (broomberg) - (2)
                         RAID 1 can beat RAID 0 - (ben_tilly)
                         Not always so. - (folkert)
         Funny you should say something about Disk-as-Tape - (folkert) - (1)
             Could of seen it? DID! - (broomberg)
         Speaking of optimization ... - (drewk)
         Re: Merge/Purge On Linux - (lincoln) - (4)
             I'm trying! -NT - (broomberg) - (3)
                 And notice that I didn't here? - (lincoln) - (2)
                     Pure apples to apples? - (broomberg) - (1)
                         Cut off an hour - (broomberg)

Sex.
63 ms