Post #157,181
5/27/04 9:16:20 AM
|
Database migrations
For migration? Just copy it over. As far as revision tracking, let's assume your config table looks something like this:
Table: clientConfig ------------------ clientRef paramID paramValue
You can set a trigger(s) such that every time a record is added, changed, or deleted, the change is copied to a log that looks something like:
Table: clientConfig ------------------ clientRef paramID paramValue changeType // add, change, or delete changedWhen who // login ID of changer (if avail.)
With this info, one can recreate any time period and study all changes. (I here some RDBMS have such "delta-log" features already built in.) What does "just copy it over" mean? Type it in again? Run some imaginary "copy it over" tool for every table that has changes? Build your own? Export it to a file first? And if some of the changes need to go and not others? How do you track that? Here's another scenario: your dev database is rebuilt every night in order to verify that the entire build is repeatable. Where does the data go? How do you prime the database in the first place with all the changes you've made for the past 8 years? Or another one. Someone wants to create a stand-alone developer machine. What process do you have to follow? And when they have this machine at home, what do they need to do to refresh their copy?
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,223
5/27/04 1:08:33 PM
|
re: Database migrations
What does "just copy it over" mean? Type it in again? Run some imaginary "copy it over" tool for every table that has changes? Build your own? Export it to a file first? I don't know about Oracle, but MySQL and MS-SQL you can just zip the data directory contents and then unzip it at the target. (Yes, I know it is files, but that is the convention in style these days for bad or bad.) If you have just one or two tables, and you only want to copy over selected records, then you can put something like this in a script: \ndelete from targetTable\ninsert into targetTable from masterDB::templateTable where updateCode = 'foo'\n Further, most RDBMS allow one to obtain schema info one way or another. Thus, if you cannot just move files over per above, then one can iterate through every table to copy. I don't know specific vendor commands off the top of my head. Generally the steps would be 1. drop existing table if any, 2. Create a new table with the given schema, 3. Populated with data. Once you figure out the syntax for one table you can use it for the rest. Here's another scenario: your dev database is rebuilt every night in order to verify that the entire build is repeatable. Where does the data go? How do you prime the database in the first place with all the changes you've made for the past 8 years? There should be some kind of "master" that you copy from using one of the techniques from above. Or another one. Someone wants to create a stand-alone developer machine. What process do you have to follow? And when they have this machine at home, what do they need to do to refresh their copy? See first paragraph.
________________ oop.ismad.com
|
Post #157,245
5/27/04 2:35:43 PM
|
re: Database migrations
If you have just one or two tables, and you only want to copy over selected records, then you can put something like this in a script:
delete from targetTable insert into targetTable from masterDB::templateTable where updateCode = 'foo'
Further, most RDBMS allow one to obtain schema info one way or another. Thus, if you cannot just move files over per above, then one can iterate through every table to copy. I don't know specific vendor commands off the top of my head. Generally the steps would be 1. drop existing table if any, 2. Create a new table with the given schema, 3. Populated with data. Once you figure out the syntax for one table you can use it for the rest. Missing the point. You have one or more tables. People are constantly adding changes to it for their configuration information. Some of it needs to go, some of it doesn't. How do you mark the data that needs to go? While you're changing it? Go back later to mark it once you decide it should be migrated? There won't be any one single query you can run, because the changes are always different. There's an 8 week migration cycle. You can't just blow away test and insert dev, because not all of dev should go. You can't just drop one table and copy from dev, because not all of the changes in that table should go. You can't just run a single query and copy that, because the changes are different every time. There will never be a single set of criteria. Therefore (to answer for you, since you're dragging this out) you would need to build into your tool a way of tracking revisions, tagging revisions, and copying tagged records only. This is what CVS and a dozen other revision control systems do already. But, you're not going to find a pre-built tool to do all that against a database, so you need to build your own. So now: 1) Write up a design for this table-copying tool. 2) Pass it through a review process. This is going to affect dev, test, qa, and prod, so it needs to be right. 3) Code it up. Table definitions, data entry screens, etc. These all need to be in revision control too, since this tool will undoubtedly go through improvements. 4) Unit tests (see 2) 5) Now you have to maintain it, because it's an in-house proprietary tool. All of the above is a significant investment, all for the nebulous benefits of being able to search for a particular entry in TOAD instead of an editor. Or you could use all the existing tools that are just as effective and be up and running in an hour. Plus, you need them anyway to manage your source code. Which brings us back to my original assertion: this is just all overhead. There should be some kind of "master" that you copy from using one of the techniques from above. Ah, a "master" config database. More overhead, more licensing, one more machine to take care of. And you still think this is better?
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,251
5/27/04 2:59:15 PM
|
You Win! I give up! go with files
You have this weird list of 600 config parameters that requires complex revision tracking that CVS can allegedly do well out-of-the-box. If it works, go for it!
Sometimes tables are the best solution, sometimes files are the best solution, sometimes OO is the best solution, sometimes functional programming is the best solution, etc.
I don't claim that TOP is the best solution ALL the time. Maybe in the future when there is CVS for databases, things may change. Until then, you're the "Filizer".
________________ oop.ismad.com
|
Post #157,254
5/27/04 3:11:24 PM
|
bookmarked :-)
Time for Lord Stanley to get a Tan questions, help? [link|mailto:pappas@catholic.org|email pappas at catholic.org]
|
Post #157,256
5/27/04 3:15:44 PM
|
Most of the time...
Sometimes tables are the best solution, sometimes files are the best solution, sometimes OO is the best solution, sometimes functional programming is the best solution, etc. Glad to some you come to your senses. Most of the time, we don't have the luxury to just pick one - we have to build solutions that represent a compromise betwee performance and abstraction. You don't want to design all your code based on performance alone, but neither can you afford to ignore it totally. Most of the business applications I deal with, will use files, tables, objects, and, more recently, first class functions. Each has their usage and the secret to success is usually getting them to work together.
|
Post #157,260
5/27/04 3:31:37 PM
|
Finally.
But how is it a "weird list"? This is your preferred method: using control tables to modify program flow (and actually as I just posted above it's more like 4000 parameters). Your techniques simply do not scale, which you have finally admitted.
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,264
5/27/04 3:41:19 PM
|
I was not there
But how is it a "weird list"? This is your preferred method: using control tables to modify program flow I don't understand why you need all that revision-control beurocracy for config info and why it is 600 items. It is one of those things in which one probably has to be there to understand the needs. Your techniques simply do not scale, which you have finally admitted. Perhaps if you used Postgre instead of Oracle, you could just slap on more power as needed rather than worrying about licenses and rework your code/DB to avoid them licenses. And, why couldn't that config table be entirely cached in RAM to avoid disk I/O?
________________ oop.ismad.com
|
Post #157,267
5/27/04 3:55:10 PM
|
Re: I was not there
I don't understand why you need all that revision-control beurocracy for config info and why it is 600 items. Because that's how many decision points there are. Stuff that would be handled by polymorphism and composition in OO code, you say should be handled by control tables in databases. I've just demonstrated why control tables fail from a maintenance and performance standpoint at large scales. If you don't understand a particular argument, bring it up again. But the change patterns I've described occur here on a daily basis on a large P-R system. Perhaps if you used Postgre instead of Oracle, you could just slap on more power as needed rather than worrying about licenses and rework your code/DB to avoid them licenses. Postgres has a piss-poor procedural language compared to Oracle. And are you seriously suggesting that we port 1M lines of code to something else? Where's the business benefit? And do you know of any large scale Postgres implementations? The same thing would happen in DB2, Sybase, Informix, whatever. And, why couldn't that config table be entirely cached in RAM to avoid disk IO? Because there are a million other things, just as important, fighting for that RAM. But the other things are actual dynamic data that need to be in the database, while the parms are just a poor programming practice. Guess which loses when we need to improve performance?
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,268
5/27/04 4:07:54 PM
|
disk I/O confusion
Because there are a million other things, just as important, fighting for that RAM. But if you put it in OOP code, then it is *still* in RAM, no? If it is cached back and forth to disk, then you are doing what DB's do also. Or are only a small percentage of clients active at any given time? I agree that the layer between code and the DB may slow things a bit because there is a translation cost between the language and the DB, but I don't see why I/O itself would be a problem. Caching your config data in objects shouldn't take up any more RAM than caching a table it would seem.
________________ oop.ismad.com
|
Post #157,272
5/27/04 4:13:32 PM
|
Different machine.
And there aren't control parms at that point, because it's expressed as a connection of polymorphic or composited objects. There's no decision to be made any long because it's made at compile time.
Again, the hacky solution in this case (because it's still in PL/SQL) was to create a binary search tree in IF/THEN code that is called instead of selecting from the table. Added maintenance and coding overhead, but there you go.
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,299
5/27/04 5:30:45 PM
|
parameter view and flex questions
And there aren't control parms at that point, because it's expressed as a connection of polymorphic or composited objects. There's no decision to be made any long because it's made at compile time. What if somebody wanted to study all those "parameters" such as doing queries and reports on them? Usually in a mega-size operation like you claim to be, somebody starts wanting such info. It may even be useful for debugging. Grep again? Or, wrote new sections in another language that also needed access to the same parameters? Or, change them during run-time instead of having to recompile?
________________ oop.ismad.com
|
Post #157,316
5/27/04 6:47:52 PM
|
Re: parameter view and flex questions
What if somebody wanted to study all those "parameters" such as doing queries and reports on them? Grep would be sufficient. Nothing more complex is needed. Typically the division of code is such that no search is needed, as it's fairly obvious where to go to find the information. Any other configuration isn't worth the trip to the database. Or, wrote new sections in another language that also needed access to the same parameters? We'll deal with that when it happens. Clue: it isn't likely. This is a complete subsystem; there's no reason to introduce other languages. We have 4 languages here: PL/SQL, C++, Java, and Perl. PL/SQL is for database internal operations and UI (previously), C++ for communications, Java for UI (now, not previously), Perl for build system. Language independence simply isn't important, nor is it in general. Decision points aren't shared; data is. Or, change them during run-time instead of having to recompile? Doesn't happen. These sorts of changes happen at most nightly. During production, not at all. During development, doing a recompile is no more burdensome than reloading the parms, given the procedure we have to go through to convert them to the binary search stored procedure. Adding the ability to change at runtime is only useful during development, and it's such a minor gain that it's not worth the other hassles. This isn't binary, Bryce. "Ah ha! I found one reason it's better!" Tabled parms have a few advantages, but in toto are outweighed heavily by the disadvantages.
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,470
5/28/04 3:52:55 PM
|
perfect un-storm
This isn't binary, Bryce. "Ah ha! I found one reason it's better!" Tabled parms have a few advantages, but in toto are outweighed heavily by the disadvantages. It just seems like your shop is a "perfect storm" situation where the problems of TOP are magnified and none of its benefits are desired by management and customers.
________________ oop.ismad.com
|
Post #157,477
5/28/04 4:06:00 PM
5/28/04 4:09:53 PM
|
Re: perfect un-storm
In reverse order:
The customers desire that the application works, and that it is stable, and that it is fast. "TOP" reduces performance and increases maintenance overhead (which causes bugs), and is hence undesireable.
Management desires that its programmers don't spend time writing needless utilities to do stuff that isn't necessary in the first place, like versioned database tables. Management desires that we do what works, which is contrary to spinning our wheels on self-inflicted maintenance headaches.
The problems of TOP are magnified because it is a large shop. The same problems are always there, but when writ large they become glaringly apparent. At some point of smallness, however, they become negligible enough that you don't notice them.
Given that you've already stated that TOP's benefits are those of scale (managing 1000s of rows as compared to 10s) and that OO and other techniques are fine or even better for small applications, and that now you've admitted that TOP doesn't scale to large applications (and in fact in order to give it any semblance of maintenance scalability, you have to adopt OO-like techniques like building your own jump tables), I'm not quite certain where your middle ground lies... Well, actually I am, but I'm not quite certain that you have realized it yet.
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
Edited by admin
May 28, 2004, 04:09:53 PM EDT
|
Post #157,494
5/28/04 6:44:05 PM
|
the rule or the exception
Management desires that its programmers don't spend time writing needless utilities to do stuff that isn't necessary in the first place, like versioned database tables. The fact that there are not very many existing products for such suggests that it is not a common need. (I suppose you will argue this is because most will move to files instead if they need such, but many may not want to make that leap for an existing system.) you have to adopt OO-like techniques like building your own jump tables I don't know why you keep saying this. The table already exists. I am just sticking function names in it. If not, it is no more effort than building a bunch of classes. You make it sound like I am doing something extra. The amount of finger work is not more. Given that you've already stated that TOP's benefits are those of scale (managing 1000s of rows as compared to 10s) and that OO and other techniques are fine or even better for small applications, and that now you've admitted that TOP doesn't scale to large applications.....I'm not quite certain where your middle ground lies If by chance your shop needed more feature-usage reporting/querying or needed faster configuration changes (rather than recompiling for each change), then you would be more open to such. Whether your current condition is the rule or the exception is probably another 500-message flame-war.
________________ oop.ismad.com
|
Post #157,509
5/28/04 9:55:07 PM
|
Re: the rule or the exception
The fact that there are not very many existing products for such suggests that it is not a common need. Because it's a poor idea. If you think otherwise, build the tool and have everyone beat a path to your door. I don't know why you keep saying this. The table already exists. And why does the table exist? So you can put function names in it. Then jump to them. That is, a "jump table". You make it sound like I am doing something extra. Managing databases is more labor intensive than managing code, so you are, in fact, doing something extra. Let's say you need a new one, which requires adding another column. Ever do that in a large shop? Or even a medium-sized shop? Hint: my company is on the smaller end of "medium sized enterprise". If by chance your shop needed more feature-usage reporting/querying or needed faster configuration changes (rather than recompiling for each change), then you would be more open to such. Feature-usage reporting/querying is a low-priority item no matter where you are. Configuration changes are by their very nature fairly rare. Set it up once and it's done. You are simply not filling a need. And what little need there may be for such a thing is over-balanced by the deficiencies of the technique. And given that I can do the same thing in OO by using a table as well, there really is no compelling reason to use the technique at all. Whether your current condition is the rule or the exception Come back to talk about it more when you've actually worked in something besides a small shop.
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,609
5/29/04 5:50:13 PM
|
shop size phalic symbols
Because it's a poor idea. Why is it a poor idea? People need to keep track of more things that just code. I think it is not common because it is relatively easy to add your own tracking. Managing databases is more labor intensive than managing code, so you are, in fact, doing something extra. Let's say you need a new one, which requires adding another column. Ever do that in a large shop? Or even a medium-sized shop? I have worked in medium-large companies that divide projects into smaller project rather than one-big-EXE. Perhaps you guys are not partitioning properly. Adding a new column in Microsoft SQL-Server was a snap. I can't really speak for Oracle. Oracle is not known for its RAD abilities. People buy it for speed and reliability, not for nimbleness. Feature-usage reporting/querying is a low-priority item no matter where you are. Tell that to the sales/marketing department. They always want to know what customers want so they can hype it and/or make yet more features related to existing favorite features.
________________ oop.ismad.com
|
Post #157,620
5/29/04 6:54:58 PM
|
Re: shop size phalic symbols
Why is it a poor idea? Did we, or did we not, just go through how many hundreds of posts on this very subject, at the end of which you said, "You Win!"? I have worked in medium-large companies that divide projects into smaller project rather than one-big-EXE. Perhaps you guys are not partitioning properly. What does this have to do with the price of tea in China? I said, "Managing databases is more labor intensive than managing code, so you are, in fact, doing something extra." This is why people hire full-time DBAs (we have two full-timers, and one part-timer), because it takes more effort to manage a database. What in Hades does this have to do with partitioning? An extra database is an extra database. Adding a new column in Microsoft SQL-Server was a snap. And how did you put that change under revision control? How did you promote that change to production? Or even QA? Of course adding a column is a snap if you aren't managing it properly. Tell that to the sales/marketing department. Oddly enough, our sales people know what the applications do without searching source code. They also know what the client's are requesting and using. It's called, "specifications" and "documentation". You ought to try it sometime. Concession noted for posterity (judging by your failure to respond): you admit that you are in fact creating a jump table as described.
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,660
5/30/04 1:50:20 AM
|
DBAs != "extra"
Did we, or did we not, just go through how many hundreds of posts on this very subject, at the end of which you said, "You Win!"? I meant the concept of verson control for DB's. "Managing databases is more labor intensive than managing code, so you are, in fact, doing something extra." This is why people hire full-time DBAs... No. Hiring DBA's is a "divide and conquer" strategy. DBA's focus on data stuff and app developers focus on providing algorithms for specific tasks that have to be accomplished based on the data. The idea of a hybrid DBA/developer, such as what XBase used to make common, was mostly rejected because developers tended to make messes such as poorly normalized schemas and no longer-term design strategies. Such apps may have worked fine in isolation, but biz often wants to share info with multiple apps. Concession noted for posterity (judging by your failure to respond): you admit that you are in fact creating a jump table as described. However you label it, it is not more effort than doing the same in a class.
________________ oop.ismad.com
|
Post #157,681
5/30/04 9:18:24 AM
|
Re: DBAs != "extra"
I meant the concept of verson control for DB's. Revision control for DBs is fine. We use it ourselves. Via files. And then when it came time for you to describe why doing it in the database was better, you said "You Win!" No. Hiring DBA's is a "divide and conquer" strategy. Because managing a database takes more knowledge and effort than managing a source code control repository, as you have apparently conceded by failing to respond to my questions about adding a new column. Additionally, adding an extra database involves even more work, unlike using a single source code repository. However you label it, it is not more effort than doing the same in a class. Nope, no back-tracking here, Bryce. I've already established and you've already conceded that maintaining control tables [link|/forums/render/content/show?contentid=157245|is more work]. Concession noted for posterity: reporting on code is useless for the sales/marketing people given specs and documentation.
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,737
5/30/04 4:19:53 PM
|
There is more to life than version control
And then when it came time for you to describe why doing it in the database was better, you said "You Win!" I meant if there would be a *market* for DB-centric version control. Because managing a database takes more knowledge and effort than managing a source code control repository, as you have apparently conceded by failing to respond to my questions about adding a new column. I meant in general. I conceed that existing DB-centric version control management tools are weak. But your claim seemed more general-scoped, not just about version control. Some shops don't care that much about version control (for good or bad, its their decision).
________________ oop.ismad.com
|
Post #157,743
5/30/04 5:13:25 PM
|
Bryce, come to Philly on 7/4
I will sell tickets to see you and SA go at it.
It's wonderful, minds that still function in IT.
-drl
|
Post #157,748
5/30/04 6:03:37 PM
|
Might be fun.
I'd rather just talk about shit, though, than go at it in person. We can talk shop all day here. Philly is for beer and chat, and I'd be more than happy to sit down with Bryce and a beer and shoot the shit.
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,801
5/31/04 12:02:37 AM
|
warning: alchohol makes me irratable
Most say it relaxes them, but it makes me want to bust OO fans in the choppers instead. Gotta find a different legal relaxor.
________________ oop.ismad.com
|
Post #157,747
5/30/04 6:03:23 PM
|
Re: There is more to life than version control
I meant if there would be a *market* for DB-centric version control. Something that snaps into CVS/PVCS/SourceSafe/whatever and makes it dead stupid simple to do, perhaps. But it would have to work with current source code repositories, which means file-based. People aren't going to want a separate tool with a separate repository, nor will they want something that requires maintaining an extra database instance. Some shops don't care that much about version control Such shops aren't likely to care about much of anything worth caring about either.
Regards,
-scott anderson
"Welcome to Rivendell, Mr. Anderson..."
|
Post #157,797
5/30/04 11:59:19 PM
|
Dilbert reigns
Such shops aren't likely to care about much of anything worth caring about either. Most shops are highly Dilbertian. That's life in Corporate American.
________________ oop.ismad.com
|