Object Prevalence: Get Rid of Your Database?

Slashdot is powered by your submissions, so send in your scoop

Object Prevalence: Get Rid of Your Database? 676

Posted by Hemos on Monday March 03, 2003 @09:45AM from the throwing-it-out dept.

A reader writes:" Persistence for object-oriented systems is an incredibly cumbersome task to deal with when building many kinds of applications: mapping objects to tables, XML, flat files or use some other non-OO way to represent data destroys encapsulation completely, and is generally slow, both at development and at runtime. The Object Prevalence concept, developed by the Prevayler team, and implemented in Java, C#, Smalltalk, Python, Perl, PHP, Ruby and Delphi, can be a great a solution to this mess. The concept is pretty simple: keep all the objects in RAM and serialize the commands that change those objects, optionally saving the whole system to disk every now and then (late at night, for example). This architecture results in query speeds that many people won't believe until they see for themselves: some benchmarks point out that it's 9000 times faster than a fully-cached-in-RAM Oracle database, for example. Good thing is: they can see it for themselves. Here's an article about it, in case you want to learn more."

This discussion has been archived. No new comments can be posted.

Object Prevalence: Get Rid of Your Database?

Load 500 More Comments

Search 676 Comments Log In/Create an Account

Comments Filter:

gigabytes? (Score:5, Insightful)

by qoncept ( 599709 ) writes: on Monday March 03, 2003 @09:49AM (#5423491) Homepage

At first, I had a problem understanding object oriented methodology because I kept thinking of objects in terms of a database -- they seemed so much alike. But...
Who uses a database small enough to fit in RAM?

Share
twitter facebook
- Re:gigabytes? (Score:2, Interesting)
  
  by REBloomfield ( 550182 ) writes:
  
  i think the idea is that the databases are running on servers such as the SunFire, which has a stupid amount of RAM (somewhere in the terabytes if i remember correctly)....
- Re:gigabytes? (Score:2)
  
  by a_n_d_e_r_s ( 136412 ) writes:
  
  Just buy a lot of RAM!
  
  I think many small and mid-sized e-commerce vendors
  whould benefit from this.
- Re:gigabytes? (Score:2)
  
  by Rommel ( 33210 ) writes:
  
  We're not talking about an entire OLTP system that runs a business -- we're talking about the object data used for the code itself. The article suggests a different way of managing the object data instead of using a flat file, XML, or a database.
- Re:gigabytes? (Score:5, Insightful)
  
  by bmongar ( 230600 ) writes: on Monday March 03, 2003 @09:56AM (#5423534)
  
  Who uses a database small enough to fit in RAM?
  Not every solution is for every problem. This isn't for huge data warehousing systems. My impression is that this is for smaller databases where there is a lot of interactions with fewer objects.
  I have also seen object databases used as the data entry point for huge projects, where the database is then periodicaly dumped into a large relational database for warehousing and reports.
  
  Parent Share
  twitter facebook
  - Re:gigabytes? (Score:2, Insightful)
    
    by qoncept ( 599709 ) writes:
    
    Very true, then again, though, if the database is that small anyway, you're probably not taking much of a performance hit unless you never should have been using a database to begin with.
    Offtopic though, I'd love to see a solid state revolution. With the amounts of RAM and flash memory available these days, I don't see why we couldn't run an OS off one. I'm not generally one to be anxious to jump in to new technologies (I used to hate games that used polygons instead of sprites), I think moving to solid state in an intelligent manner would be the biggest thing that could happen in the industry in the near future. ie, along with serial ata, introduce fast, ~2gb bootdrives that run your OS and favorite programs and store everything else on a conventional magnetic hard drive.
  - Re:gigabytes? (Score:5, Insightful)
    
    by juahonen ( 544369 ) writes: <jmz@iki.fi> on Monday March 03, 2003 @10:10AM (#5423650) Homepage
    
    And that goes for OO as well. Not every database (or a collection of data) needs to be accessed in Object-Oriented way. Most (or should I say all) data I store to small tables would not benefit from being objects.
    
    And how does this differ from storing non-object-oriented data structures in RAM? You'd still need to implement searches, and how do you search an collection of objects without placing them on the relational line.
    
    Parent Share
    twitter facebook
    - Re:gigabytes? (Score:3, Insightful)
      
      by Arandir ( 19206 ) writes:
      
      How many times has this been said before? "Use the right tool for the job!" If you have a large collection of objects all of the same class, then use a database. If you have a large collection of objects of differing classes, then use an OO method. For small collections of objects, or if you don't have any real objects at all, neither may be appropriate.
      
      What irks me to no end are database freaks who have to do everything with a database, OO freaks who have to do everything with OO, and GP freaks who have to do everthing as pure GP. They're like guys who only know how to use a screwdriver, so they end up using the screwdriver to hammer in nails and chisel wood.
- Re:gigabytes? (Score:2, Funny)
  
  by AKnightCowboy ( 608632 ) writes:
  
  Who uses a database small enough to fit in RAM?
  The Museum of 20th Century French Military Victories in Paris could make use of this technology on my old 8086 system.
  - - Re:gigabytes? (Score:3, Funny)
      
      by Anonymous Coward writes:
      
      Here's a quick history lesson for the morons here (yeah, that's you Daniel Dvorkin):
      Gallic Wars: The French not only lost ... they lost to an Italian.
      Hundred Years' War: Although they kinda/sorta mostly lost, they were saved by Joan of Arc (a female schizophrenic), who by accident created the First Rule of French Warfare: "France's armies are victorious only when not led by a Frenchman."
      Italian Wars: France became the first and only country in history to lose not just one but TWO wars against Italians.
      Wars of Religion: France was 0-5-4 against the Huguenots.
      Thirty Years' War: Although not technically a principal, they did manage to get invaded anyway. Amusingly, they claim a tie on the basis that eventually the other participants started ignoring them.
      War of Devolution: Tied.
      Dutch War: Tied.
      War of the Augsburg League: Lost, claimed tie.
      King William's War: Lost, claimed tie.
      French and Indian War: Lost, claimed tie.
      Three ties in a row caused some deluded folks to label the period as the height of French military power.
      War of the Spanish Succession: Lost.
      American In a Scribean foreshadow of the future, France claims a win even though the English colonists saw way more action.
      This is eventually known as "de Gaulle Syndrome."
      It also establishes the Second Rule of French Warfare: "France only wins when America does most of the fighting."
      French Revolution: Won, primarily due the fact that the opponent was also French.
      Napoleonic Wars: Lost.
      Franco-Prussian War: Lost.
      World War I: Tied and on the way to losing. France was saved by the United States.
      World War II: Lost. Conquered French liberated by the United States and Britain.
      War in Indochina: Lost.
      Algerian Rebellion: Lost. The first defeat of a Western army by a Non-Turkic Muslim force since the Crusades. It gave birth to the First Rule of Muslim Warfare: "We can always beat the French." This rule is identical to the First Rules of Italian, Russian, German, English, Dutch, Spanish, Vietnamese and Esquimaux Warfare.
      War on Terrorism: France has surrendered to Germans and Muslims just to be safe.
      So France's only military victory was against... the French. How quaint.
- Re:gigabytes? (Score:3, Insightful)
  
  by Ed Avis ( 5917 ) writes:
  
  Who uses a database small enough to fit in RAM?
  
  Even if your database doesn't fit in affordable RAM today, it probably will in a few years. RAM prices fall faster than database sizes increase. Already a couple of gigabytes of storage is more than enough for a big class of applications.
- Re:gigabytes? (Score:5, Insightful)
  
  by mbourgon ( 186257 ) writes: on Monday March 03, 2003 @11:18AM (#5424072) Homepage
  
  Who uses a database small enough to fit in RAM?
  
  I do, but I'll thank my SQL server for doing it for me. Most aggressively cache data and databases - if Database A is used constantly, it'll be kept in RAM, whereas less-frequently Databases will either stay on the hard disk, or certain tables of that database will be put in memory. It lets you make the most of your RAM.
  
  Parent Share
  twitter facebook
Very large? (Score:2, Interesting)

by psychotic_venom ( 521968 ) writes:

What about absolutely monsterous databases? What about huge queries? Or even querying across objects (like we would do joins in a table). I assome that while this can work, there will be some major shifts in thinking in order to get it to be accepted. People like their databases. And enterprise level software isn't going to go out and grab this up--until it does, it probably won't really take off.
Slashdotted (Score:5, Funny)

by Cubeman ( 530448 ) writes: on Monday March 03, 2003 @09:53AM (#5423520)

For a scalability test, it sure fails the Slashdotting Test.

It's about 9000 times slower right now :)

Share
twitter facebook
Neat concept... (Score:3, Interesting)

by Gortbusters.org ( 637314 ) writes: on Monday March 03, 2003 @09:56AM (#5423538) Homepage Journal

but it doesn't really provide any compelling reasons NOT to use a database. Besides the fact that their home page, Prevayler.org [prevayler.org] seems to be non-existant I think it's more of just a "neat idea" type thing rather than a compelling reason for any prodcut/project to drop relational DB support.

You can always have a caching system as the author states, but even then what systems use this? The countless PHP/MySQL sites out there seem to perform just fine. This may be desirable for some very strict real time communications systems, but for just about every other form of app, I don't see it.

What are you going to tell your 3rd party integrators? Drop their XML/ODBC report and surf on over to prevayler.org?

Share
twitter facebook
- Re:Neat concept... (Score:5, Informative)
  
  by truthsearch ( 249536 ) writes: on Monday March 03, 2003 @10:12AM (#5423661) Homepage Journal
  
  The countless PHP/MySQL sites out there seem to perform just fine.
  
  Object-oriented programming and data persistance is about a lot more than public web sites. Private, corporate data warehouses with terabytes of persisted objects squeeze every bit of processing power available. For example, I used to work on Mastercard's Oracle data warehouse. An average of 14 million Mastercard transactions occur per day. That's 14 million new records to one table each day, with reporting needing hundreds of other related tables to look up other information. To get something of that scale to run efficiently for a client app (internal to the company) costs millions of dollars. Object persistance on a large scale is tough to get right and is far from perfected, and there's a lot more going on that public web site development. Every new idea helps. Consider the article written on IBM's developerWorks. It's readers are mostly corporate developers.
  
  Parent Share
  twitter facebook
What about existing data ? (Score:4, Interesting)

by koh ( 124962 ) writes: on Monday March 03, 2003 @09:56AM (#5423541) Journal

Their solution really seems to rock, and may finally be the OO to DB paradigm everyone was waiting for.

That said, I wonder what their position is towards the import of existing data. Many projects would only benefit from the solution if and existing data (usually object-oriented but saved in a roughly flat database as the article points out) can be ported seemlessly to the new environment.

My point is, this solution solves a known problem by introducing a new technology, however this new techno will have to be bent towards the older systems in order to retrieve what was already saved. Same old story : in the database world existing data is paramount.

Share
twitter facebook
- FInally OO? I think and hope not! (Score:5, Informative)
  
  by Steeltoe ( 98226 ) writes: on Monday March 03, 2003 @11:10AM (#5424015) Homepage
  
  OODBMSes have been thoroughly and handily debunked. For the best opinions on relational database technology, visit these hardcore guys: http://www.dbdebunk.com/ [dbdebunk.com]
  
  The problems with OODBMSes can be summarized so (OTOMHRN - on top of my head right now):
  1) Proper relational technology can model OO-hierarchies, but the other way around is unnatural and cumbersome, if not impossible. Proper relational technology is a step up on the ladder in generalization from OO-technology. It's simply a generation or two ahead, while OODBMS is several steps backwards.
  
  2) Proper relational technology is proven concepts from mathematics and logics, while OODBMSes are just a hack to store application data "quick'n dirty". Everything can be modelled as general relations, while OO-technology lacks the fundamentals to model *ANYTHING* and is limited and impeded by having an obligatory and *meaningless* top-to-bottom hierarchy. (You cannot have *meaning* without relations of differing types to other entities.)
  
  3) Proper relational technology allows you to extract, convert and manipulate data in standardized methods (using query languages like SQL), in ways not thought of at the time of design. OODBMSes can only be used properly in the context of the OO-application layer, often relying on runtime data. If you need flexible solutions, you will have to spend extra time programming a specialized solution, instead of having the benefit of a fully relational query language (which unlike SQL, can express almost any problem to be solved).
  
  4) The future is relational. Current RDBMSes do not implement true relational technology, which if they did, nothing else would be needed. The matemathics in the theories behind it would be at the programmers disposal during programming, reducing time and potential errors. Yes, it requires understanding the theory, but wouldn't you like a true DBA to do that anyways?
  
  Don't buy into the hype, look into true relational technology and educating yourself. As for storing everything in RAM, and "saving it for the night", I wouldn't risk to have my bank-account in such a DB. Such solutions are only usable for storing non-volatile data. For non-commercial game-servers, it maybe perfect.
  
  Parent Share
  twitter facebook
  - Re:FInally OO? I think and hope not! (Score:3, Interesting)
    
    by johnnyb ( 4816 ) writes:
    
    Here here! Someone with their head on straight. Let me also add to this that when relational technology took hold, it was NOT because it was faster. In fact, at that time, relational databases were 50 TIMES SLOWER than the current hierarchical databases. The performance gap has narrowed, but the reasons for choosing relational remain the same. The industry at that time realized that the benefits of relational technology was much more important than speed, and hopefully we'll come to the same decision again. Those reasons include:
    
    * the separation between _logical_ and _physical_ layers of the database - the DBA controls physical record layout and indices, while the database designer and applications have access to the logical layers. This way they can do their roles independently of each other.
    
    * the ability for the data model to change without affecting the applications. Using VIEWS - you can do quite a bit of modification to the underlying data model, but applications using the older one will still run if the DBA sets up a view.
    
    * the ability to do arbitrary querys on the data
    
    * The ability to set up views to handle more complex interactions. For example, in a mail system I've written, we have a table for campaigns with a sent/not-sent flag, a list of addresses, and three layers of do-not-send lists. We then have a single view which puts all of this together and gets the list of addresses which need to be sent to. This is a view on top of several views.
    
    I'm sure I'm missing some others, too. Basically, a relational database system is a gigantic inference engine when designed appropriately.
- Re:What about existing data ? (Score:4, Insightful)
  
  by MisterFancypants ( 615129 ) writes: on Monday March 03, 2003 @12:44PM (#5424712)
  
  Their solution really seems to rock, and may finally be the OO to DB paradigm everyone was waiting for.
  Not likely. The REAL problem with OO databases isn't that RDBMs might be more mature or whatever else you might read, it is that the data is almost always more important to companies than the behaviors that operate on that data. For example, if the company has a database of customers, they might want to use that database in dozens of different ways, and they might want to grow it for years, if not decades. The OO-database view tends to look at things too much from the view of one single application of the data and the data gets entangled with code behavior based on that specific application. With a clean RDBMs you can hit the same database from many different applications (assuming the database has a well thought-out schema to begin with)... the data isn't so tightly wound up with a specific bit of application code.
  This 'solution' doesn't fix that aspect of OO databases. In fact, it makes it worse. I will grant that it is a neat technology, but I wouldn't expect to see it take over the place of RDBMs systems any more than OO-databases of the past have.
  
  Parent Share
  twitter facebook
OOP (Score:2, Interesting)

by NitsujTPU ( 19263 ) writes:

Couple things.

1) You COULD use an object-relational database if you wanted to keep an OOD aspect.
2) You COULD load non-object oriented data into RAM with lower overhead.
3) A couple gig's of data into RAM... not really a deployable solution for enterprise, don't you think?

Other than that, nifty idea and all.
Two words... (Score:4, Informative)

by Anonymous Coward writes: on Monday March 03, 2003 @10:00AM (#5423567)

Enterprise JavaBeans.

Here's the definition of an EJB from the http://java.sun.com [sun.com] site.

A component architecture for the development and deployment of object-oriented, distributed, enterprise-level applications. Applications written using the Enterprise JavaBeans architecture are scalable, transactional, and multi-user and secure.

And more specifically, here's the definition of an Entity EJB:

An enterprise bean that represents persistent data maintained in a database. An entity bean can manage its own persistence or it can delegate this function to its container. An entity bean is identified by a primary key. If the container in which an entity bean is hosted crashes, the entity bean, its primary key, and any remote references survive the crash.

Share
twitter facebook
- Re:Two words... (Score:5, Informative)
  
  by neurojab ( 15737 ) writes: on Monday March 03, 2003 @11:53AM (#5424340)
  
  Entity beans are all about transactions. You've got a transaction context that can propogate over several beans. The EJB container doesn't do this on it's own, however. It uses the ACID properties of the database along with the database's commitment control mechanisms to accomplish the properties you mentioned. Entity beans are usually mapped to tables, and could represent a join in the BMP case. That said I'm not sure if you're saying EJB will benefit from using this as a backend, or that EJB did this first? The latter is false, but the former... I'm not sure this technology will benefit entity beans, but may benefit STATEFUL SESSION beans because they're less RDBMS-centric.
  
  Parent Share
  twitter facebook
- - - Re:Don't go there (Score:3, Informative)
      
      by atomray ( 202327 ) writes:
      
      EJB certainly isn't the perfect technology (what is?), and the specification is lacking in some points (generation of primary keys and an incomplete query language), but it is certainly useful. Take a look at O'Reilley's "Building Enterprise Java Applications (vol 1)", it has a nice overview of enterprise java technologies and how they fit together. It discusses how to use session and entity beans effectively - in essense, you shouldn't be sending entity beans to your client (which it sounds like the guy in the original post was doing - this will kill performance due to the number of RMI calls that will be generated), your clients should typically interact with session beans that perform the business logic using the entity beans.
      
      With respect to cost, there's JBoss [jboss.org], that's free, and there are many other venders at a variety of prices and performance. I use JBoss - I did, for a time, see some nasty performance problems, but after reading some documentation quickly realized that it was my mistake.
    - Re:Don't go there (Score:3, Interesting)
      
      by schmaltz ( 70977 ) writes:
      
      A good, recent comparison of EJB containers and Java 2 servers is here--
      
      http://www.onjava.com/pub/a/onjava/2003/02/26/ejbi nherit5.html [onjava.com]
      
      A comparison of Tomcat, Orion, Resin and Weblogic is here:
      
      http://radio.weblogs.com/0107789/stories/2002/05/2 8/isTomcatCrap.html [weblogs.com]
Ever looked at object-oriented databases? (Score:5, Informative)

by carstenkuckuk ( 132629 ) writes: on Monday March 03, 2003 @10:00AM (#5423569)

Have you have looked at object-oriented databases? They give you ACID transactions, and also take care of mapping the data into your main memory so that you as a programmer only have to deal with in-memory objects. The leading OODBs are Objectstore (www.exln.com), Versant (www.versant.com) and Poet (www.poet.com).

Share
twitter facebook
- Re:Ever looked at object-oriented databases? (Score:3, Interesting)
  
  by yorick ( 4133 ) writes:
  
  Yes, I've looked at object-oriented databases. I worked on a project that used Objectstore for a year or two before we gave up and went to Oracle.
  
  Here is why:
  
  1. We realized that like _most_ projects there really wasn't anything that object-oriented about the data. The code, yes. But the data was just as easily represented with typical RDBMS relationships and it was much faster to do basic operations. We saw a several thousand time increase in performance when trying to query the database for a particular object and its associated data. A join or ten wasn't nearly as expensive as getting data out of what was essentially just a dump.
  
  2. Objectstore, at the time, had no concept of administration. It was up to the developer to handle things like when files got too big, or creating the OODBMS concept of indexes, or what have you. The "DBA" could stop and start it, and that's about it. So if we grew, or got new hardware, or changed platforms, it was time to dump the old data (because migrating it was an programming project in itself) and start over.
  
  3. People would ask us questions about the data we were storing that would have been absolutely trivial to find in a RDBMS (like "how many of these events occured last month when this device was in this state) that we'd have to write long slow-performing pieces of code to retrieve.
  
  4. Other people wanted to write applications that used our data. That wasn't too easy, because they wanted slightly different objects. We would have had to agree on a object for everything we shared, or store things twice. With an RDBMS we used could use views, or generate the objects differently from the same tables.
  
  5. There was no way to get a read-consistent hot backup across a couple of hundred files. Maybe there is now. This was just foolish.
  - Re:Ever looked at object-oriented databases? (Score:3, Informative)
    
    by greenrd ( 47933 ) writes:
    
    A join or ten wasn't nearly as expensive as getting data out of what was essentially just a dump.
    What do you mean by a "dump"? Sounds like you were using the OO db inappropriately, e.g. by querying for "SELECT * FROM extent1" and then linearly searching it, or something.
    3. People would ask us questions about the data we were storing that would have been absolutely trivial to find in a RDBMS (like "how many of these events occured last month when this device was in this state) that we'd have to write long slow-performing pieces of code to retrieve.
    Doesn't ObjectStore have a query language similar to SQL? The Object Data Standard defines OQL, but I know that the Object Data Standard is not exactly "industry standard" yet. As for speed, it's still possible to create indexes, optimise data structures and algorithms, etc.
    Other people wanted to write applications that used our data. That wasn't too easy, because they wanted slightly different objects. We would have had to agree on a object for everything we shared, or store things twice. With an RDBMS we used could use views, or generate the objects differently from the same tables.
    This is an odd complaint. In any decent OODBMS, creating views (even manually) should be fairly simple. Yes, you might have to write some repititive code - there's room for improvement there. This is where aspect-oriented programming (specifically, composition filters) comes in, I think.
3 issues I see (Score:4, Interesting)

by foyle ( 467523 ) writes: on Monday March 03, 2003 @10:00AM (#5423573)

First off, I like the concept, but speaking as a former Oracle DBA, I have several issues:

1) You're limited by how much RAM you have on your server, not how much disk space you have

2) If you're making a lot of data changes and have a crash or power outage, I'd imagine that it can take a while to replay the log to get things back to the most recent point in time (you can have the same problem with Oracle, but your checkpoints would be a lot closer together than "once a day")

3) There are millions of people that already know SQL and can write a decent query with it. How does this help them? Never underestimate the power of SQL.

On the other hand, for projects dealing with small amounts of data I can see how implementing this would be far easier than integrating with Mysql, Postgresql or Oracle.

Share
twitter facebook
- Re:3 issues I see (Score:5, Interesting)
  
  by jhines0042 ( 184217 ) writes: on Monday March 03, 2003 @10:23AM (#5423719) Journal
  
  1) What about Swapping? I know that you would by limited by physical ram (which IS getting cheaper) but couldn't you also get a really large virutal memory space and utilize that?
  
  2) You can probably set up your own checkpoints to be more than once a day.
  
  3) I agree. Lack of SQL would cause people to.... GASP.... learn a new system. SQL is very cool. And I admit that I have a system I am thinking of porting away from JDBC and into Prevalence just to see how it goes (No, it isn't mission critical) and one of the first things I realized is that I would have to design a new method of querying. But you know what... That can lead to new thinking and more powerful software in the future.
  
  Parent Share
  twitter facebook
- Buggy whips (Score:5, Insightful)
  
  by Camel Pilot ( 78781 ) writes: on Monday March 03, 2003 @10:49AM (#5423882) Homepage Journal
  
  3) There are millions of people that already know SQL and can write a decent query with it. How does this help them? Never underestimate the power of SQL.
  
  There are millions of people that already know how saddle and ride a horse. How do these new fangled automobile help them? Never underestimate the power of a horse.
  
  While I agree with your other points... number 3 is never a reason to keep from embracing something new. People are suprisingly trainable.
  
  Parent Share
  twitter facebook
- 3 More Issues for the Do-It-Yourself Database (Score:5, Informative)
  
  by JohnDenver ( 246743 ) writes: on Monday March 03, 2003 @11:23AM (#5424112) Homepage
  
  Personally, I still think it sounds a lot easier to just map objects to a database.
  
  4) Concurrency - If you haven't implemented locks for an object model, then you haven't lived. Seriously, I can see a lot of people screwing this up with deadlocks galore. Locking up concurrent systems can be a nightmare.
  
  5) Ad Hoc Support - Goodbye Crystal Reports, Goodbye English Query, Goodbye ANY Ad Hoc query support, because if you need anything different, you're going to have to write a lot more code to enumerate throughout your objects. Have fun.
  
  6) Indexing - I hope you have a good B-Tree library and are familiar with Indexing/Searching algorithms when implementing HARDCODED indexing. Oh yeah, have fun rewriting all of your query procedures when you decide to change your hardcoded indexing.
  
  Nothing says flexible like HARDCODING! Yay!
  
  In all seriousness, this is a bad idea for 99% of projects out there. It's inflexible, unscalable, severely error prone, and timely to implement.
  
  (sarcasm) All this just to avoid the "cumbersome" process of mapping objects to tables?
  
  Seriously people, it's not that hard (3 magnitudes easier than this) and there are a lot of tools that help doing it.
  
  If you're REALLY hung up on not using a relational database, try an Object Database, XML Database, or an Associative Model Database.
  
  Parent Share
  twitter facebook
  - Re:3 More Issues for the Do-It-Yourself Database (Score:3, Interesting)
    
    by dubl-u ( 51156 ) writes:
    
    If you haven't implemented locks for an object model, then you haven't lived. Seriously, I can see a lot of people screwing this up with deadlocks galore. Locking up concurrent systems can be a nightmare.
    
    Then just wrap all of your lock-sensitive stuff in Prevayler command objects. They've got that working fine, and it guarantees isolation.
    
    Goodbye Crystal Reports, Goodbye English Query, Goodbye ANY Ad Hoc query support, because if you need anything different, you're going to have to write a lot more code to enumerate throughout your objects. Have fun.
    
    Oh, please. If you really need SQL compatability, then dump the data occasionally to a data warehouse, which is where you should be doing unconstrained ad-hoc queries anyhow.
    
    Or if it's so the programmers can peek at the live system, then put in something like BeanShell, which will let you see a lot more than just the persistent data.
    
    Or you could drop an SQL interpreter into your system and present your objects as tables. Many of the pieces are already open sourced, so it would be pretty easy.
    
    Indexing - I hope you have a good B-Tree library and are familiar with Indexing/Searching algorithms when implementing HARDCODED indexing. Oh yeah, have fun rewriting all of your query procedures when you decide to change your hardcoded indexing.
    
    Can you really not think of ways to write these things in flexible ways? If that's the case, you could learn something about being a programmer. Pick up Martin Fowler's Patterns of Enterprise Application Architecture [martinfowler.com].
    
    In all seriousness, this is a bad idea for 99% of projects out there. It's inflexible, unscalable, severely error prone, and timely to implement.
    
    Perhaps you should try it before knocking it. As you are, in order, wrong, mostly wrong, wrong, and confused. It's no magic bullet, but it's a useful approach for some systems.
Interfacing (Score:3, Interesting)

by MSBob ( 307239 ) writes: on Monday March 03, 2003 @10:00AM (#5423576)

This may be a great way to snapshot the state of a Java application but how on earth would you query anything out of it with a non-Java/non-OO language?
A SOAP interface could go some ways towards accomplishing this but what about the traditional ACID properties of a DBMS? Durability is obviously guaranteed... Consistency? That would depend on programmers following the practices... Atomicity? Not sure about that one. For simple commands it seems to work. What about compound commands? If no rollback occurs how can I assert that I changed both objects not just one? Isolation? Not sura about this one either.

Share
twitter facebook
C++ soluton (Score:2, Funny)

by debrain ( 29228 ) writes:

I noticed the lack of C++ support, so I thought I'd throw my hat in. :)

template<typename O,typename T> O& operator <<(O&o,T&t) { o.write(t,sizeof(T)); }
Looks like journaling filesystem (Score:2)

by Reinout ( 4282 ) writes:

It looks very much like a journaling filesystem. That one basically also stores the commands executed in a log file. If you've had a crash with ReiserFS for instance, you can see messages like "replaying log for...." at startup.

Now they're doing the same for in-memory object data structures. Might be a nice idea.

On a different note: the objectdatabase behind zope has perhaps the same net effect. To the programmer, everything is in-memory. The object database reads stuff from disk if needed and keeps things in memory when much-requested. And also with a list of transactions which can be replayed or rolled back.

So: it looks nice, but I'm curious to the net results!
Something about this doesn't sit right with me (Score:3, Insightful)

by sielwolf ( 246764 ) writes: on Monday March 03, 2003 @10:04AM (#5423605) Homepage Journal

I think this would work well for most web-server DB backends as the data isn't changing on the fly that much. But what about even /. where the content of a discussion thread is changing possibly several times a second (with new posts and mods)? I'd think then you'd want to use the strong atomic operators of the DB to pull directly from the tables instead of relying on serial operators to try and refresh.

Since the benchmark page was slashdotted I might be speaking out of my ass. But I never trust "9000 times faster!". It sounds too "2 extra inches to your penis, guaranteed!"

Share
twitter facebook
It's not a simple question of speed (Score:4, Insightful)

by Ummite ( 195748 ) writes: on Monday March 03, 2003 @10:05AM (#5423608)

The advantage of putting data into a database isn't just speed! Just think about sharing data between application, between many computers, exporting data into another format, or simply making a query to change some values! You simply don't want to write code that will change value of data with some specific conditions : you prefer make a single query that any database manager or simply a sql newby could write, not just the 2-3 programmers that have done the work on that code some years ago! You also sometime need to visualise data, make reports, sort data. You simply don't want to code that. I think most serious database can also put data in RAM if you have enough, and is able to do some commit/rollback when it's necessary. So your point that RAM data with serialize in-out is ok, as long as you absolutly need 100% speed, don't need to do complex query on your data and is in used only on one computer.

Share
twitter facebook
Blazing fast (Score:4, Funny)

by Zayin ( 91850 ) writes: on Monday March 03, 2003 @10:05AM (#5423611)

This architecture results in query speeds that many people won't believe until they see for themselves: some benchmarks point out that it's 9000 times faster than a fully-cached-in-RAM Oracle database, for example. Good thing is: they can see it for themselves.

Yes, I've seen it. The page on www.prevayler.org only took about 30 seconds to load. Does that mean that a fully-cached-in-RAM Oracle database would spend 75 hours loading that page...?

Share
twitter facebook
no queries (Score:5, Insightful)

by The Pim ( 140414 ) writes: on Monday March 03, 2003 @10:06AM (#5423613)

Queries are run against pure Java language objects, giving developers all the flexibility of the Collections API and other APIs, such as the Jakarta Commons Collections and Jutil.org.

In other words, "it doesn't have queries". What real project doesn't (eventually) need queries? And even if writing your queries "by hand" in Java is good enough for now, what real project doesn't eventually need indices, transactions, or other features of a real database system?

Share
twitter facebook
- Re:no queries (Score:4, Insightful)
  
  by sql*kitten ( 1359 ) writes: on Monday March 03, 2003 @10:20AM (#5423700)
  
  In other words, "it doesn't have queries". What real project doesn't (eventually) need queries? And even if writing your queries "by hand" in Java is good enough for now, what real project doesn't eventually need indices, transactions, or other features of a real database system?
  
  Indeed. It looks like a high-level, language-neutral API for traversing linked lists of structs. Yes, you can rip through such a structure far faster than Oracle can process a relational table, but they are two different solutions to two different problems. I wouldn't use an RDBMS for storing vertex data for a scene rendering application, and I wouldn't use an in-memory linked list for storing bank transactions!
  
  Parent Share
  twitter facebook
- - Fine granularity for writes. (Score:3, Informative)
    
    by zipwow ( 1695 ) writes:
    
    I think you misread the article.
    
    Every time you issue a 'change command', it first makes the change in memory, then records just that command to disk, very much like a journaling file system as I understand it.
    
    Then, presumably, you also change your object in memory to match. If the whole system comes down, then when you start again, it loads its 'starting point', probably from yesterday, and then executes those recorded commands.
    
    Furthermore, future 'reads' on that data aren't blocked by the disk i/o. They wait for the object in memory to change (quick) and pretty much ignore the disk write.
    
    Where I think you're getting confused is that periodically, it goes through the system and makes a new 'starting point', presumably during a period of low utilization (like at night).
    
    I don't know what your comment about never joining tables would mean, this wouldn't have 'tables', but would have objects as you've designed them. Presumably you've designed your objects so that they're accessible in some natural and convenient way. If you haven't, you ought to fix that...
    
    -Zipwow
Get best of both worlds... (Score:5, Interesting)

by ChrisRijk ( 1818 ) writes: on Monday March 03, 2003 @10:07AM (#5423625)

If you need performance for persistant data, this "new" system doesn't seem to be much different at all to what you can do today. Using JDO (Java Data Objects) with a file-system backend would be about identical, though easier to use and have more features.

Of course, you can always write your own persistance layer. I've done this a few times - very easy in Java. Map a row in the DB to an object, and cache the object in memory. If need to fetch that data again, check the cache first. When doing a write, write to the DB and update/flush your cache as necessary.

That's just the basics - what's most optimal depends on how your data is accessed and changed (and also your programming language and capability as a programmer). Java has nice really nice stuff for caching built-in, like SoftReference wrapper objects, and of course threading and shared memory that you can use in production.

I'm currently working on a super optimised threaded message board system. Almost all pages (data fetch/change + HTML generation) complete in about 0.001s.

Share
twitter facebook
Umm what about multiple servers? (Score:3, Insightful)

by jj_johny ( 626460 ) writes: on Monday March 03, 2003 @10:11AM (#5423653)

Reading through the article it seems to lack a rather small but important item - multiple systems interacting read/write with the same database. This is not a very robust or scalable way of doing things. I wonder how this stacks up to one of the normal ways of improving performance by have one read/write database with lots of read only repicas.

Share
twitter facebook
- You're thinking too general case (Score:3, Insightful)
  
  by tkrotchko ( 124118 ) writes:
  
  This is a useful solution for a single-purpose web site for store session information typical in most stateful web site.
  
  Its not a general purpose DBMS solution, nor should you interpret it as such.
Sourceforge Link (Score:4, Informative)

by BoomerSooner ( 308737 ) writes: on Monday March 03, 2003 @10:17AM (#5423686) Homepage Journal

http://sourceforge.net/projects/prevayler [sourceforge.net]

Share
twitter facebook
OO databases are an evolutionary step...backward (Score:5, Interesting)

by GoldTeamRules ( 639624 ) writes: on Monday March 03, 2003 @10:34AM (#5423793)
In 1999, I worked for a company that used an OO database (ObjectStore) to develop an e-commerce shopping portal. It was a disaster.

OO advocates point to extremely fast (extremely special-case, in practice) queries, and natural persistent object mapping as reasons to why OO is superior.

However, this is very misleading.

Some of the MAJOR problems we ran into in using ObjectStore were:
- It is very difficult to "see" an OO database. By nature, the data isn't tabular. It's a persistent object heap. There's no "SELECT * FROM USERS". So tracking down data-related problems involves exporting data to an XML file and sifting through it.
- Reporting tools don't exist for OODB. Try hooking up Crystal or another reporting tool to this. You end up writing every report from scratch.
- DB Performance when querying outside the normal object hierarchy (aggregate queries grouping on object attributes, etc.) is orders of magnitude SLOWER on an OODB! This was a huge problem when allowing users to do a product search on an e-commerce portal.
- 32-bit memory limited our max customer size dramatically.
When developers first consider OO databases, their first assumption is that OODBMS is to RDBMS as OOP is to Procedural Programming. This is a FALSE analogy! Migrating to OODBMS offers precious little to support better software design while introducing significant maintenance and design issues that should be considered prior to using this technology.

Unless I had a product that had an extremely specialized use case that matched OODB strengths, I would NEVER develop on this kind of platform again.
Share
twitter facebook
- OODB are very different from RDBMS (Score:4, Interesting)
  
  by Juju ( 1688 ) writes: on Monday March 03, 2003 @11:13AM (#5424044)
  
  and like in every object system, it is very important you get the design (and objects) right before you start coding.
  
  If you are thinking of accessing your objects like you are doing with SQL, then you haven't understood how OODB work. As for accessing your objects and doing your queries, there are tools (like Inspector for ObjectStore) than enable you to do just that.
  
  In term of performance, Oracle and co are nowhere near what you can reach with ObjectStore, provided you designed your application well.
  The 2 main problems with OODB, are:
  - schema evolution
  - reporting
  But these can easily be solved by a good design of your application.
  
  OODB is a skill that needs time mastering. After 4 years, seeing ObjectStore application from various companies, I can tell the difference between the ones where people knew what they were doing, and those from people who didn't have a clue...
  
  Parent Share
  twitter facebook
  - Re:OODB are very different from RDBMS (Score:3, Insightful)
    
    by battjt ( 9342 ) writes:
    
    and like in every object system, it is very important you get the design (and objects) right before you start coding
    
    And, like in every programming project, your requirements are incomplete, so your model will be incomplete, so you need to allow for flexibility. OO DBMS that I have used don't allow for that flexibility (schema evolution), so we build layers on top of the OODB, just the same as we do for relational DBs. I don't see the advantage. By the time we are done optimizing a relational DB, it has all the same indexes that the OODB would have, but we were able to evolve the system, instead of designing it all up front.
    
    I suppose I could argue for an OO DBMS if the number of transactions was high enough and application had a static set of requirements (general ledger, trade system, etc.).
    
    Joe
  - Re:OODB are very different from RDBMS (Score:5, Funny)
    
    by Samrobb ( 12731 ) writes: on Monday March 03, 2003 @12:55PM (#5424773) Journal
    
    The 2 main problems with OODB, are:
    - schema evolution
    - reporting
    But these can easily be solved by a good design of your application.
    
    In other words, OODB technology is doomed.
    
    Parent Share
    twitter facebook
- Re:OO databases are an evolutionary step...backwar (Score:5, Interesting)
  
  by leomekenkamp ( 566309 ) writes: on Monday March 03, 2003 @11:35AM (#5424220)
  
  Although you certainly have a point, there are some remarks I have to make here:
  
  There's no "SELECT * FROM USERS".
  
  That's just like saying Latin is a bad language because it does not have equivalents for 'the', 'le/la', 'de/het', 'der/die/das', whatever. An rdbms is *fundamentally* different from an oodbms
  
  DB Performance when querying outside the normal object hierarchy (...) is orders of magnitude SLOWER on an OODB!
  
  That's right: you are trying to use a oodbms as a rdbms. Ever tried to drive a car like you ride a bicycle?
  
  Oodbms are relatively new, and they have their 'problems', just like rdbms-es have theirs. But the biggest problems arise when one approaches an oodbms like one would an rdbms. Just like you run into problems using an oo language when you have only used a proc. language
  
  Parent Share
  twitter facebook
  - Nods head (Score:3, Interesting)
    
    by abulafia ( 7826 ) writes:
    
    > DB Performance when querying outside the normal object hierarchy (...)
    > is orders of magnitude SLOWER on an OODB!
    That's right: you are trying to use a oodbms as a rdbms. Ever tried to drive a car like you ride a bicycle?
    I've still never heard a good answer to this problem, only that I'm using the wrong hammer.
    When performing activities against pure OO storage in which selectively collecting data from a (potentially large) number of objects is required, what is the OC (object-correct) way to do so? Asking each one via a method call is horrendously slow in comparison to a RDBMS. For instance, contrast "select last_activity, uid from users" to
    my %blarg;
    foreach ( my $user $users->$next() ) {
    $blarg{uid} = $user->{uid};
    $blarg{last_activity} = $user->{last_activity};
    }
    I suppose if one is building a product instead of managing an ongoing project, saying that lazy access to the hash will save a little time. I still don't see the performance win, and for ad hoc access, building the methods and accessors just takes too much time to be reasonable.
    Use the right tool for the right job, I say. And usually, for managing data, a RDBMS is the right tool. For interacting with that data, OO is frequently nice.
    Please correct my incorrect notions.
- Re:OO databases are an evolutionary step...backwar (Score:3, Insightful)
  
  by swillden ( 191260 ) writes:
  Some of the MAJOR problems we ran into in using ObjectStore were:
  No, the MAJOR problem you ran into was trying to get RDBMS guys to understand OODBMSs, and you clearly failed.
  It is very difficult to "see" an OO database. By nature, the data isn't tabular. It's a persistent object heap. There's no "SELECT * FROM USERS". So tracking down data-related problems involves exporting data to an XML file and sifting through it.
  Well, that would be the hard way to do it. I suppose the easy way would be to take two minutes and write a small program to scour through the DB looking for the problems, but my experience with Objectstore and other OODBMSs would lead me to ask a different question -- How did the "data-related problems" get created? Write your classes with strong invariants and tightly encapsulate your data and you won't really have many such issues.
  Reporting tools don't exist for OODB.
  Actually this isn't really true, but the point is still worth addressing because the available reporting tools aren't very good. This isn't the fault of the tools, it's just a fact that it's impossible to write a general-purpose tool that can intelligently traverse arbitrarily-structured data.
  Again, the solution is: write a small program to extract the data you want to report on.
  If you need to do lots of ad-hoc queries against the database, such that writing a program each time isn't reasonable, then your usage pattern suggests an RDBMS is more appropriate.
  DB Performance when querying outside the normal object hierarchy (aggregate queries grouping on object attributes, etc.) is orders of magnitude SLOWER on an OODB!
  Unless you create indexes for those queries, of course. Ad-hoc querying is a real weakness of OODBMSs. OTOH, queries that are planned for and for which good indexes exist are orders of magnitude FASTER on an OODB! Like, three orders of magnitude faster than an RDBMS.
  32-bit memory limited our max customer size dramatically
  That is a problem if you design your database badly, but Objectstore allows you to segment your DB so that the size of your address space isn't an issue. The segmentation is completely transparent to the programmer using the objects.
  Migrating to OODBMS offers precious little to support better software design while introducing significant maintenance and design issues that should be considered prior to using this technology.
  OODBMSs have advantages and disadvantages. The advantages are:
  
  Ease of initial development. No more figuring out how to map between objects and tables.
  
  Code can be more object-oriented. With an RDBMS, "tableitis" tends to infect your classes.
  
  Performance! Particularly with Objectstore/C++, the facts that (a) database representation is almost identical to in-memory representation and (b) client-side caching means that once an object has been retrieved from the persistent store there is *zero* overhead -- using a persistent object costs *exactly* the same as using a purely in-memory object -- mean that a well-structured Objectstore database is hugely faster than any RDBMS.
  
  The disadvantages vs. RDBMSs are:
  
  Ongoing development requires schema migrations and those can be difficult. Mind you they're not easy for an RDBMS situation, either, since you have to reswizzle all your object-relational mapping stuff.
  
  Ad-hoc queries are hard.
  
  Getting good performance requires more design effort, particularly with page-oriented OODBMSs like Objectstore (which really act more like a specialized virtual memory system than a database).
  
  Very few people understand them.
  
  Overall, OODBMSs shine when your primary need is for an "applicaiton working store", more than a "database" and when you need maximum performance and minimum time to market (assuming you have staff that knows the tool). If you need ad-hoc queries you can still use an OODBMS, but you will want to export the data to a relational DB for query purposes.
  Actually, that's a very nice solution to many problems, IMO. Use an OODBMS as your high-performance working store, and periodically export the the data to a relational "data warehouse" for ad-hoc queries and data mining. This means that you still have to implement and maintain an object-relational mapping, but it's much easier to manage a one-way mapping than a bi-directional mapping.
  The system described in the article is fine for some environments, I'm sure, but a high-quality OODBMS would be just as fast, more robust and would allow you to use databases that won't fit in RAM.
Speed is not the only factor (Score:4, Interesting)

by digerata ( 516939 ) writes: on Monday March 03, 2003 @10:37AM (#5423805) Homepage

The first problem I see with this method is the lack of a powerful and flexible querying method. One of the most powerful features of SQL databases is their capability for searching. No where in the article did I see anthing about advanced querying of the objects. Even if there is, I'm sure its no where near as fast as a MySQL or Oracle. The author states that it is several orders of magnitude faster, but I bet it is this much faster only on fetch routines where you already know what object you are looking for.
Here's the issue they are trying to solve: mapping object to records. That's it. Now the problem with removing the records / database is you lose all of the searching power that is inherit in relational databases. The author states that the codebase is 350 lines of code. How can any complex search engine be implemented in 350 lines of code that also covers the persistance?

Share
twitter facebook
- Re:Speed is not the only factor (Score:3, Insightful)
  
  by moncyb ( 456490 ) writes:
  
  The first problem I see with this method is the lack of a powerful and flexible querying method.
  
  Maybe I don't understand this well enough (the Prevaylor site is down), but if this is really a database based upon objects, and you can access them as normal objects, then any good programmer can make a "powerful and flexible querying method." You can write your own hashtables, searching functions, or whatever.
  One of the most powerful features of SQL databases is their capability for searching. No where in the article did I see anthing about advanced querying of the objects.
  
  Because they probably didn't put any searching routines into Prevayler. From the SourceForge page: "Ridiculously simple, Prevayler provides transparent persistence for PLAIN Java objects." You write the searching routines.
  Even if there is, I'm sure its no where near as fast as a MySQL or Oracle. The author states that it is several orders of magnitude faster, but I bet it is this much faster only on fetch routines where you already know what object you are looking for.
  
  Ever hear of hash codes and hash tables? You write the code yourself. How do you think MySQL and Oracle do it? They have code which does the searches. With this system you cut out the middleman. It'll have its own weaknesses and strengths, so every manager will have to decide if this system will fill their needs.
  At first glance, I see two weaknesses and two strengths to this system. Weaknesses: a) you'll have to be more of a programmer to implement a database. b) the database has to be small enough to fit in memory. Strengths: a) infinitely flexible. b) really fast for anything which will fit in RAM.
  Web hosting services won't want this. (they usually have many customers, and all their databases won't fit in RAM at once.) Big e-commerce sites won't want this for their customer databases. (again, probably won't fit in RAM) They may be able to use it for their product data, unless it's really huge--such as Barnes and Noble. I'm sure it'll be quite usable for most small businesses. The need for a programmer may seem like a huge obstacle, but I'm sure if Object Prevalence gets big, there'll be a book called "Object Prevalence in Java for Dummies" in no time.
Memory is CHEAP? (Score:3, Interesting)

by mpxcz ( 448928 ) writes: on Monday March 03, 2003 @10:39AM (#5423828)

how much is 25Gig Hard disk as opposed to 25gig RAM? is that you call cheap? :)

Share
twitter facebook
This concept is not new (Score:5, Informative)

by Tikiman ( 468059 ) writes: on Monday March 03, 2003 @10:44AM (#5423859)

In fact, this concept actually predates SQL-based databases! The first one I am aware of is MUMPS (Massachusetts General Hospital Utility Multi-Programming System) which goes back to 1966. One company that continues this legacy is Sanchez [sanchez-gtm.com]. Another commercial version is Caché [e-dbms.com]. This makes sense, really - the most obvious solution to serializing an object is to store all properties of a single object together (the OO solution), rather than store a single property of all objects togther (the RDBMS solution)

Share
twitter facebook
Old News: Main Memory Databases (Score:3, Insightful)

by mojorisin67_71 ( 238883 ) writes: on Monday March 03, 2003 @10:52AM (#5423904)

Main Memory Databases have been researched for nearly 10 years now and there are a number of commercial products. For details you can check out:
TimesTen [timesten.com]
Polyhedra [ployhedra.com]
DataBlitz [bell-labs.com]

etc..
The idea it to have enough RAM to be able to store all the database in memory. This gives higher performance than a fully cached Oracle for two primary reasons:
- there is no buffer manager so data can be directly accessed.
- the index structures use smart pointers to access the data in memory.

Typically the data is mapped using mmap or shared memory. Each application can have the databae directly mapped into its memory space.
For providing persistence, typically main memory databases provide transaction logging and checkingpoint to be able to recover the data. Various techniques have been developed to be able to do this without affecting performance.

Share
twitter facebook
The Electric Database ACID Test (Score:5, Insightful)

by kriegsman ( 55737 ) writes: on Monday March 03, 2003 @11:07AM (#5423994) Homepage

Things I want in a persistent datastore:
- Atomicity of transactions (commit/rollback),
- Consistency in the enforcement of my data integrity rules,
- Isolation of each transaction from other competing transactions (locking)
- Durable storage that can survive a crash without losing transactions (e.g., journaling)

My experience with RAM-centeric disk-backed object storage is that you, the developer, often have to implement the ACID fetures yourself, from scratch. And from-scratch implementations of complex data-integrity mechanisms tend to be time-consuming to develop and test and often take much, much longer than you think to "get right".

Call me old-fashioned, but I really like using data storage (database) engines that pass the ACID test and have already been debugged and debugged and debugged and debugged and debugged.

-Mark

Share
twitter facebook
- Re:The Electric Database ACID Test (Score:3, Informative)
  
  by jafac ( 1449 ) writes:
  
  At A Previous Employer Who Shall Remain Nameless:
  (product is still on the market)
  
  We had a product which did (we'll call it "X") and tracked all it's information in a "database" we built in-house. The primary architect, of course, was a pretty sharp guy. He had written a whitepaper for the company stating why he thought "unix was dead" and why we should not waste our time, as a company, developing "portable" products, and that we should take full advantage of Microsoft's technologies on Windows.
  
  As far as ACID test goes, NONE of those elements existed in this "database" we used. Nor were there any verification, export, import, or repair tools initially available.
  
  As soon as this product scaled to a reasonable level, (the field was always one step ahead of our test lab, as far as scaling the application goes), we started seeing weird crashes and corruption that we just could not reproduce or isolate in the lab. When the term "database corruption" was used, the architect would throw a fit, and blame some other component, denying that database corruption was even possible.
  
  The absence of tools meant that we could not troubleshoot in the field. Developing tools was the equvalent of admitting that there was a problem. As we scaled our lab, in response, we started to uncover these problems. This was when our architect resigned. His job had suddenly changed from "Technical Primadonna" to "beleaguered fixer of uncounted bugs".
  
  That's when we REALLY started to get into trouble.
  
  At some point, there was serious talk about ripping the whole database out and going to a "real" commercial database solution. Some third party thing. That was shortly before I left that job. But in the end - there was much suffering and pain, and the product lost a great deal of ground to it's competitors all due to a lack of Respect For Those Who Have Gone Before.
Not a "database" but a persistence mechanism (Score:3, Insightful)

by cushty ( 565359 ) writes: on Monday March 03, 2003 @11:17AM (#5424067)

Some people seem to be missing the point: this is not a "database" it is a persistence mechanism. What they are saying is that persisting objects is difficult (er, tend to disagree but I'll bite) and so they are solving this. Whether a RDBMS offers better searching is completely irrelevent as this, in their architecture, is handled by the application.

What they seemed to gloss over is that you need to take snapshots of the actual data. If you didn't you'd have to keep every single "log" in order to safely playback the actions and know you have the same data in the same state. Loose one log, say the very first one, and you're pretty much screwed.

Share
twitter facebook
Problems with Object databases (Score:3, Insightful)

by praetorian_x ( 610780 ) writes: on Monday March 03, 2003 @11:29AM (#5424159)

This is not a new idea. There are all sorts of object databases out there. (Versant springs to mind).

The main problems I see with object databases:

1) SQL is incredibly powerful. You give up *a lot* of power when you go from sql semantics to object semantics. Sub-selects, group bys and optimized stored procedures, to name just a few things. All the object language query constructs I've seen fall far short of these. (As a side note, most O/R tools make a hash of it as well.)

2) You immedately make a massive reduction in the number of database administrators who will be willing and/or capable of helping you out in your project.

3) Scaling is always a question. With oracle, it just isn't.

4) Backup, redundancy, monitoring, management, etc. Most mature relational databases have very good tools for doing these infrastructure activities. Developers often forget about banal things like this, but they are crucial for the long term health of IT systems.

Don't get me wrong. Every time I construct some nasty query and go through the mind-numbing process of moving the results into an object, I think to myself "There has to be a better way!", but I've looked at the O/R tools and the object database out there and, sadly, I don't feel they are worth the trade off.

Just my opinion,
prat

Share
twitter facebook
Interoperability, Scalability (Score:3, Interesting)

by Rob Riggs ( 6418 ) writes: on Monday March 03, 2003 @11:35AM (#5424217) Homepage Journal

The persistent store is quite language-specific. It doesn't allow for a Python application to access a Java store, for instance. It also doesn't seem to allow concurrent access to data, which would require significantly more than 350 lines of code.

Both of these issues make this solution unusable in an enterprise environment. The RAM size issue has already been mentioned by others and is another very real limitation.

In general, object caching mechanisms are not terribily difficult to create. This generic solution proves the point by only requiring 350 lines of Java code.

I am sure that there is something worthy in this project, I just cannot see it used for anything other than very small-scale development efforts.

Share
twitter facebook
Database System vs Database Management System (Score:4, Informative)

by Lucas Membrane ( 524640 ) writes: on Monday March 03, 2003 @11:38AM (#5424244)

This OO scheme is a database system, but it leaves out much of the management element. (1) Things like changing the database structure without bringing the whole company down probably won't work. (2) You lose all the enforcement of the rules of relational integrity that an RDBMS gives you right out of the box. (3) And you lose Crystal Reports. (1) and (2) kill it technically in many situations, and (3) kills it management-wise.
Gadfly, a Python package, gives you an in-memory DB and SQL. If you want to trade SQL for extra speed and do more programming, you can run the ISAM-like engines of Btrieve or Berkeley DB without the SQL layer on top. We have SQL RDBMS's because the conventional wisdom is that such a trade is not a good idea.

Share
twitter facebook
BS (Score:5, Insightful)

by bwt ( 68845 ) writes: on Monday March 03, 2003 @12:11PM (#5424452)

Sorry this just won't cut it in most enterprise systems.

1) Doesn't scale. Most enterprise databases don't fit in RAM. Data volumes grow with the capacities of hard disks which outpace RAM. If your database fits in memory now and you use this architecture, what do you do when it grows larger than your RAM capacity? You fire the guy that proposed this and switch to an RDBMS.

2) Performance claims are BS. Good databases already serialize net changes to redo logs via a sort of binary diff of the data block. Redo logs are usually the limiting factor on transaction throughput, since they require IO to disk. Serializing the actual commands is more inefficient than using a data block diff. You simply cannot minimize the space any better than an RDBMS does, therefore you cannot minimize the IO for this serialization any better, and therefore you cannot do it faster without sacrificing ACIDity. If your performance is too good to be true, then you gave up an essentail feature of the RDBMS.

3) Consistancy. If there is only one object in memory for each record, then you'll be writing a tremendous amount of custom thread-safety code and even then, either A) writers block readers and readers block writers or B) read consistacny isn't guaranteed. Either is usually unacceptable. One alternative is to clone objects at every write (sounds slow and horribly inefficient). Of course, this too has to be serialized, or you don't have ACIDity. If you are serializing these, then you aren't really different than an RDBMS which uses rollback/undo, except you are wasting disk IO and are slower.

4) Reliability. A hardware failure, software hang/crash, or system administration mistake would force recovery from the last full backup. Replaying a full day's transactions could take hours. Sure you could be continually making a disk image, except for read consistancy issues like above. Its not clear what you do even for a daily backup. Are all sessions simply blocked during backup? Ouch.

Every few years object fanatics try to come up with some way to get rid of RDMBS's. The methods invariably rely on sacrificing some of the core capabilities of the RDBMS: data integrity, performance, consistency, ACID, reliability etc... These "innovations" are really only of interest to OO fanatics. In the real world, OO gets sacrificed way before RDBMS's do. This is not going to change.

OO is a tool that is good for writing maintainable code. It is not good for performance critical uses like OS's, device drivers, and real time systems. It is not good for data intensive systems. These things are not likely to change. If all you can accept is OO, then you are a niche player.

Share
twitter facebook
- Re:BS (Score:3, Insightful)
  
  by p3d0 ( 42270 ) writes:
  
  I agree that OO is not so good for databases, but it works well in OSes, device drivers, and realtime systems. You just need to know how to get good abstractions without sacrificing any performance, and that's not an easy skill to master.
  OO is not all about classes and jump tables. For example, you can get polymorphism in C++ without using any virtual methods at all. If you disagree, then I think your view of what constitutes OO is quite limited, and I'm not surprised you think it's a "niche player".
This Won't Replace A Database (Score:5, Insightful)

by puppetman ( 131489 ) writes: on Monday March 03, 2003 @12:30PM (#5424603) Homepage

As has been mentioned, it fails the ACI portion of ACID (it's not Atomic - all or nothing, not Consistent - data is left in a consistent state, doesn't provide Isolation - you appear to be the only transaction running; other processes don't affect your data in mid-transaction). Passes Durable, I suppose.

I've read a few posts that say that the performance claims (vs a relational database) are not true. I think this will be much faster than a database. This is an in-memory cache. It will be very fast. Our Oracle databases have a cache-hit ratio of 98 and 99+ percent, but will be slower. Why?

First, databases (especially Oracle) do alot of stuff behind the scenes, logging all sorts of stuff from a user connecting to the SQL being run.

Second, this sort of thing offers nearly direct access to the data. SQL usually needs to be parsed before it is executed. The database needs to come up with the optimal query plan before it actually executes the statement. A database offers different ways of joining data, and accessing data. Find me all managers that make more than $50,000 per year and have a last name that start with K. You will have to decide the best way to get the data yourself. A database will do all the work for you.

This is a great, idea, though for a middle-tier cache. Say you want to do some fast searching on a small amount of data. You can use this in the middle tier to save yourself the trip to the database.

A good object oriented database that has not been mentioned yet is Matisse [fresher.com]

Share
twitter facebook
WOW! After twenty years... (Score:3, Funny)

by Master of Transhuman ( 597628 ) writes: on Monday March 03, 2003 @12:36PM (#5424649) Homepage

somebody has figured out that things in memory are faster than disk...

After twenty years, we finally get to...

the in-memory database!

Oh wait, didn't my Atari ST have that?

Share
twitter facebook
OK (Score:4, Interesting)

by anthony_dipierro ( 543308 ) writes: on Monday March 03, 2003 @12:42PM (#5424690) Journal

How is this any different from using a journaling filesystem and mmap?

Share
twitter facebook
MOO (Score:4, Interesting)

by zerOnIne ( 128186 ) writes: on Monday March 03, 2003 @01:16PM (#5424922) Homepage

MOO has been doing this very thing for years, and it actually draws a lot of criticism for it. Keeping a persistant image of objects around and making checkpoints at determined intervals doesn't really seem to be that big of a deal, though it is cool to have bindings to all of those languages. But really, what's the big deal? (an honest question, not a flame)

Share
twitter facebook
race conditions? (Score:4, Insightful)

by vsync64 ( 155958 ) writes: <vsync@quadium.net> on Monday March 03, 2003 @03:11PM (#5425725) Homepage

Is anyone else bothered by the complete lack of the synchronized keyword in his example code? So the ChangeUser Command can apparently be in between these 2 lines:

usersMap.remove(user.getLogin()); usersMap.put(user.getLogin(), user);

Meanwhile someone else can run an AddUser Command with the same username. Guess what happens when ChangeUser gets to that 2nd line?
Maybe when this radical new concept in databases can be presented in a way that avoids race conditions I'll pay a little more attention...

Share
twitter facebook
Congratulations! (Score:3, Informative)

by I Am The Owl ( 531076 ) writes: on Monday March 03, 2003 @07:18PM (#5428077) Homepage Journal

You've invented an Object-Oriented database! Wowee zowie! Wait, what's that? You say this is nothing new? Well, you're [sleepycat.com] right [odbmsfacts.com]. Of course it's faster than an Oracle database stored in RAM. Oracle is not designed for the purpose of storing objects. It's a relational database, which is something else entirely.

Share
twitter facebook
The bad old days (Score:3, Insightful)

by InnovATIONS ( 588225 ) writes: on Tuesday March 04, 2003 @12:37AM (#5430417)

Why really did DBMS come about? It was not because of a need for secure transactions or to store a lot of data, although obviously those are necessary qualities of a DBMS.
Before dbms applications stored their data in very efficient data stores designed just for that application but were worthless for anything else and hard to upgrade or extend without breaking or rewriting the existing application.
DBMS were developed so that data could be stored in an application independent store that could be used and extended for new applications without breaking everything that went before.
DBMS were never designed to be more efficient than the application specific data stores that they replaced, so that somebody saying that they can build a custom data store just for a particular application that is faster is missing the point entirely.

Share
twitter facebook
- One word ...er..acronym: (Score:2)
  
  by PFactor ( 135319 ) writes:
  
  UPS
  - Re:One word ...er..acronym: (Score:3)
    
    by timothy_m_smith ( 222047 ) writes:
    
    What about a hardware failure or an accidental power off. Depending upon how important the application is, you absolutely have to plan for some sort of catastrophic hardward failure.
- Re:RAM ? (Score:4, Insightful)
  
  by bmongar ( 230600 ) writes: on Monday March 03, 2003 @09:50AM (#5423496)
  
  No more than any other database. Perhapse you missed the part where they said they would serialize the commands that change the objects. In this context they are talking about saving the commands.
  
  Parent Share
  twitter facebook
  - Re:RAM ? (Score:3, Insightful)
    
    by sql*kitten ( 1359 ) writes:
    
    No more than any other database. Perhapse you missed the part where they said they would serialize the commands that change the objects. In this context they are talking about saving the commands.
    
    Checkpointing once per day? Re-applying 15 MINUTES worth of Oracle transaction logs takes too long for some failover requirements; you force a log switch every 2 minutes if you have to. Or you eat the performance hit of synchronous replication and spec your hardware to compensate.
    
    I'm guessing this DB was written by a bunch of smart CS graduates who overdosed on OO theory and haven't spent much time in the hard core of OLTP: banks, telcos, airlines, retail, etc.
    - Re:RAM ? (Score:3, Insightful)
      
      by angel'o'sphere ( 80593 ) writes:
      
      Funny, he gets modded up each hour by one point and my answer to it gets modded down by one point each hour.
      
      The modders should read the article as well as the poster should.
      
      The time interval of the snap shot is a configureable option.
      
      If you would read the article you knew that and you knew also that the writer is a 19 year old CS graduate, indeed.
      
      Probably you should get a smal dose of OO as well before freaking out like you did.
      
      angel'o'sphere
    - Re:RAM ? (Score:3, Interesting)
      
      by ADRA ( 37398 ) writes:
      
      I have been soaked on OO theory and I DO know how silly this solution is. You are basically requiring a fast synchronized failover parallel processing environement for this to be viable. The cluster would be faster because all the transactions could be native OO without the DB conversion, but to force such high standards on the environment could be disasterous.
      
      Nobody outside of a garage can loose 1 transaction, don't even imagine loosing minutes of them with a failure.
      
      "Carlos Eduardo Villela is a 19-year old Brazilian graduate in Information Systems." ...
  - Re:RAM ? (Score:4, Insightful)
    
    by Zaiff Urgulbunger ( 591514 ) writes: on Monday March 03, 2003 @10:39AM (#5423824)
    
    This all depends on what data you're trying to preserve. Some data, such as say a users UI preference changes might be deemed "not that important" and thus you can risk storing these in ram until convenient to commit it somewhere.
    
    Conversely, some data such as a financial transaction really needs to be commited straight away.
    
    But commited means *you must* write it out to non-volatile storage (i.e. a disk) otherwise the transaction may be lost. So (I believe) most DB's write the update out to their transaction log very quickly and deal with updating the DB tables/indexes at a latter stage. Obviously, this all depends on if you need to allow other processes to access this data immediatly or not.
    
    Personally, I don't think this represents anything new (**in true /. fashion, I have not read the article!!**) and I doubt this is faster than a well designed system anyway.
    
    What it might offer however is:
    1). A nicer interface for managing object persistence; 'cos it is ugly managing mapping objects to DB columns.
    2). A clear guide to help people manage which objects need persisting to disk and which are less important.
    But thats about all.
    
    ---
    I'll now go and read the article - you can catch me later contradicting myself!
    
    Parent Share
    twitter facebook
    - - Re:RAM ? (Score:4, Insightful)
        
        by Tassach ( 137772 ) writes: on Monday March 03, 2003 @01:35PM (#5425044)
        
        It's a good idea because relational databases, XML files, and flat files are not really all that similar to objects. [...]
        
        Sorry, you're wrong. With a little forethought it's pretty simple to map an object model to an RDBMS schema. The trick is to design the database schema first, then build your objects on top of it. Doing it in reverse is a bitch. The reason many OO programmers have problems using databases is that they treat the database as an afterthought instead of as the foundation of the application. Here's the basic method I use:
        
        Tables map to objects; fields map to the properties of that object on a 1:1 basis
        
        Object methods map to stored procedures where appropriate
        
        A row in a table is an instance of an object
        Foreign-keyed child tables map to collections within the parent object.
        
        using Oracle for storing objects means that you've already got a badly-designed system
        
        You illustrate my point perfectly about putting the cart before the horse. You don't build a database to store your objects -- you build objects to manipulate your database. A badly-designed system is one where the database was tacked on after the object model was complete. Your database schema should be the first thing you write, before you even start thinking about the classes.
        Unfortunately, Comp Sci cirriculums are heavy on OOP concepts but pathetically light on database theory, which is why you wind up with otherwise talented programmers who don't understand the basic fundamentals of designing solid client-server applications.
        
        Parent Share
        twitter facebook
        
        OO to RDBMS mapping IS hard (Score:3, Insightful)
        
        by spideyct ( 250045 ) writes:
        
        We're trying to make progress in the software development field. It hasn't all been worked out yet, but its good to see people taking stabs at it.
        
        One benefit of OO development is the abstraction away from the data store. I want to think about Widgets, and Customers, and Orders, not VARCHAR fields, foreign keys, arbitrary identifier INTEGERs, etc.
        
        So I would argue that the goal IS to build a database to store your objects, instead of building objects to manipulate your database. And I imagine that's what every OO developer would want. But it's hard.
        
        You're suggestion doesn't solve the problem, it just avoids it.
        
        Re:RAM ? (Score:3, Insightful)
        
        by iabervon ( 1971 ) writes:
        
        Designing your application around an RDBMS is great if what you want is a relation-oriented application, but it's terrible if you want an object-oriented application. I've actually done a relational application, and it was just the right thing; when you want to support complex queries spanning your database, there's nothing better than an RDBMS.
        
        On the other hand, there are some cases where you want an object-oriented design, and limiting yourself to what you can fit in an RDBMS schema is a bad idea. There are cases in which you really want a base class extended by multiple subclasses, and now you can't write your foriegn key constraints properly. Alternatively, you could duplicate your common code for each table, but that's even worse.
        
        Using Oracle for storing objects is basically bad; using it to store relations is good. Good design requires you to determine first what sort of data model you have, and then choose your programming paradigm appropriately; deciding to use a relational database just because you need persistence is foolish. You've done something wrong if your queries are mostly "SELECT * FROM tbl WHERE id=?".
        
        I do agree that comp sci doesn't teach enough relational design, because it's often an appropriate design. But sometimes OOP is the right tool for the job, and then you need an appropriate storage system. Relational databases are really their own thing, with a different set of efficient and simple operation, and are not really not that much like objects.
  - Re:RAM ? (Score:3)
    
    by Tassach ( 137772 ) writes:
    
    Insightful? I think not.
    You don't use an RDBMS because it's fast. You use it because it's reliable. Does this new toy support record locking, transactional isolation and integrity, or any of the other key features that an enterprise RDBMS provides? If the answer is no, then it's not a replacement for an RDBMS. If you don't care about the integrity of your data, something like this is fine. When you absoloutely can't have data errors, you take the time to make sure you do it right, which means using an enterprise-grade database server.
    - Re:RAM ? (Score:3, Insightful)
      
      by dubl-u ( 51156 ) writes:
      
      You don't use an RDBMS because it's fast. You use it because it's reliable.
      
      Prevayler can be just as reliable.
      
      Does this new toy support record locking, transactional isolation and integrity, or any of the other key features that an enterprise RDBMS provides? If the answer is no, then it's not a replacement for an RDBMS.
      
      Wrong.
      
      The question isn't the checklist of features, it's whether you can build equivalently reliable systems with Prevayler. The answer: You can.
      
      You'll recall that Prevayler uses the Command Pattern. Before data is changed, the Command object is serialized and written to disk, then executed. Naturally, this means the commands are run in strict order of arrival, yes?
      
      That's all you need to get transactional integrity. All writes are isolated. If you need to isolate the reads, you can use the same mechanism.
      
      The prevalent approach requires developers to do things a little differently, but you don't have to sacrifice reliability.
- Re:RAM ? (Score:2)
  
  by muyuubyou ( 621373 ) writes:
  
  Of course, if you care about your server's reliability in the slightest, an UPS (Uninterruptible Power Supply) system is in order.
  
  And that's regardless of where you store your data.
- Re:RAM ? (Score:5, Informative)
  
  by jmcnally ( 100849 ) writes: on Monday March 03, 2003 @09:58AM (#5423556)
  
  As someone else also posted, applying transactions that have occurred since the last time the db was saved to disk avoids this problem. A small company in WA years ago, Raima, had this transaction log concept implemented nicely to support their network database, dbVista (later called RDM). Basically a transaction log is started for every sequence of updates. All records and pointers are saved in a transaction file first. If any problems or system abends occured the entire sequence would be flushed, avoiding a half-updated sequence of records (for example an invoice is posted but the customer record is not updated). It worked pretty well. The big problem with the RAM scheme is that for very large databases the capacity of the computer or the times required to save to disk are prohibitive.
  
  Parent Share
  twitter facebook
- Re:RAM ? (Score:4, Interesting)
  
  by Lerxst Pratt ( 618277 ) writes: on Monday March 03, 2003 @09:59AM (#5423562)
  
  The client commands are immediately written to a log file for later execution. Even if the power fails, the system may be brought back up and the commands exectuted from the logfile regardless of the power failure. While the commands are written to the log, the command the user has decided to invoke is executed immediately in parallel on the live data in RAM. Pretty ingenious, if you ask me!
  
  Parent Share
  twitter facebook
- Why not try reading the article? (Score:3, Interesting)
  
  by ryan1234 ( 173313 ) writes:
  
  From the article:
  
  Before changes are applied to business objects, each command is serialized and written to a log file (Figure 1). Then, each command is executed immediately. Optionally, in a low-use period, the system can take a snapshot of the business objects, aggregating all the commands applied into one large file to save some reloading time.
  
  From what I've read, Oracle has something similar with their REDO log. If it's good enough for oracle, it can't be all that bad.
- Re:RAM ? (Score:5, Insightful)
  
  by hrieke ( 126185 ) writes: on Monday March 03, 2003 @10:09AM (#5423636) Homepage
  
  Reminds me of something that I heard about year ago- one of the DB players (I think IBM) built a fully OO DB in C. Used to store the relations in RAM.
  Blazing fast, and easy as hell to fuck up beyond replair- you could do both a read and a write to the same memory area at the same time, or something like that.
  
  This sounds just as bad.
  For example, let's say that we're doing a transaction of a few million dollars. In mid process the power dies and the machine goes dark. Outside of shouting 'redunant this that and the other', what state would the machine be in when it comes back online, were is the money, and could we back out of and rerun the transaction?
  
  Parent Share
  twitter facebook
  - Re:RAM ? (Score:5, Informative)
    
    by archeopterix ( 594938 ) writes: on Monday March 03, 2003 @10:43AM (#5423845) Journal
    
    Blazing fast, and easy as hell to fuck up beyond replair- you could do both a read and a write to the same memory area at the same time, or something like that.
    Well, the system is there to stop you from doing that - just like in the traditional DBMSes. Synchronizing memory access isn't harder than synchronizing disk access, it might even be easier if you decide to serialize all access.
    This sounds just as bad.
    
    For example, let's say that we're doing a transaction of a few million dollars. In mid process the power dies and the machine goes dark. Outside of shouting 'redunant this that and the other', what state would the machine be in when it comes back online, were is the money, and could we back out of and rerun the transaction?
    It can be implemented exactly like it is in traditional DB systems, after all they handle similar problems pretty well. After the failure, you have:
    1) The last full image dump
    2) all successful transactions (the DB meaning) serialized in the log, from the last dump to the power failure.
    Since your transaction (both DB & business meaning) hasn't been successful, it has not yet been written into the log, so the money stays in the ordering party account. Of course the power failure could have occured just after a transaction has been written to log and before the client software got the message that it was successful, but traditional DBs have this problem too. To sum it all up: the synchronization problems are there, but they are no worse than in traditional DBMSes.
    
    Parent Share
    twitter facebook
- - Re:RAM ? (Score:3, Interesting)
    
    by Sgs-Cruz ( 526085 ) writes:
    
    A use for 64-bit computing? Larger RAM spaces? I just picked up 256 MB of RAM for $59 Canadian... the stuff isn't exactly expensive right now...
    - - Re:RAM ? (Score:4, Informative)
        
        by tx_mgm ( 82188 ) writes: <notquiteoriginal AT gmail DOT com> on Monday March 03, 2003 @10:26AM (#5423728)
        
        What does 64-bit computing have anything to do with how much RAM one needs
        
        its not how much you need that he's talking about. only with 64 bit computing can one have more than the current limit of RAM (which i believe is 2GB right now). it has to do with the maximum possible number of 32 bit addresses can exist in the RAM. so with a 64 bit processor, you can have enough ram to hold that database all at once time.
        
        Parent Share
        twitter facebook
      - Re:RAM ? (Score:3, Informative)
        
        by moonbender ( 547943 ) writes:
        
        As far as I know - and for what it's worth (not much), I checked with Google - x86-compatible CPUs can address 4 GB. There are extensions in some Intel CPUs that allow programs to address a 36-bit address space, even.
        As for the 2 GB limit, there seems to be a feature in the Windows memory architecture - the upper 2 GB of a process' virtual address space is reserved for shared memory. Or something - I kind of stopped thinking at that point. ;) If you're interested, more info is available [windowsitlibrary.com].
- - Re:RAM ? (Score:2)
    
    by Gortbusters.org ( 637314 ) writes:
    
    Sounds just like Oracle, but without standard access APIs, redundancy features, etc etc etc..
    - - Re:RAM ? (Score:5, Insightful)
        
        by dubl-u ( 51156 ) writes: <2523987012@noSPAm.pota.to> on Monday March 03, 2003 @02:41PM (#5425539)
        
        To license Oracle with similar features to Prevalent, you would only be looking at a 5 figure pricetag.
        
        Don't forget the price tag for all the extra hardware; since a Prevaylent system is thousands of times faster, you can get by with a lot less hardware. And add in all the programmer time spent dealing with SQL. Oh, what about the DBA's salary?
        
        How well does Prevalent do on 30TB+ datasets?
        
        One doesn't use Prevayler for systems like that. Prevayler makes sense if your data can fit in RAM. If it doesn't, you should do something else.
        
        But note that "something else" doesn't have to mean some SQL thingy. Google has a metric shitload of data, and you can bet they don't keep it in an Oracle server.
        
        Parent Share
        twitter facebook
- Re:Obligatory Slashdotted reference (Score:2)
  
  by muyuubyou ( 621373 ) writes:
  
  Yeah, I guess it should be great for non-webserver implementations. For web servers it seems a bit secondary ;)
- Re:Data integrity? (Score:5, Insightful)
  
  by rycamor ( 194164 ) writes: on Monday March 03, 2003 @11:33AM (#5424183)
  
  Aha.
  
  Someone's been reading DBDebunk.com [dbdebunk.com] again.
  
  Yes, data integrity is one of the major considerations here. I'm willing to bet that by the time you implemented the equivalent of constraints, triggers, etc... in a system like this, you would be running no faster than a typical SQL DBMS, and you would have thousands of bugs as you reinvent the wheel. But there are even more considerations than integrity. This is language-specific, or application-specific. What do you do when you need to access your data from another application? Even if it is possible, that means you have to implement all your integrity checks again in that application.
  
  Essentially, what this looks like is just another OO method of heirarchical (or perhaps "multi-valued") data storage. This is nothing new. It will suffer all of the historical problems the industry has had with hierarchical storage (there is a reason the relational data model was invented: the problems IBM had with hierarchical data). For example, what happens to existing data when you need to change your logical schema or business rules? The cost of re-ordering or reformatting _every_ single stored object since the beginning of your application would be ridiculous, and in some cases even impossible. How do you track dependencies? In theory, these kinds of systems will work fine, if your application stays exactly as created, and if the nature of the data doesn't change, and if no other applications are involved. In other words, NOT in the real world.
  
  I have a nick-name for hierarchical data storaqe: "headache-ical".
  
  Parent Share
  twitter facebook
  - - Re:Data integrity? (Score:3, Interesting)
      
      by rycamor ( 194164 ) writes:
      
      Chalk up one more person who's missed the point.
      
      This isn't a relational database. Those techs you mention are used to make a denormalised SQL database act like a normalised SQL database; they aren't needed if your relational database is normalised or you're not using a relational database at all.
      
      I don't believe I have missed the point at all. I am painfully aware that the technology in question is not a relational DBMS. But, normalization (with foreign key constraints) is only the beginning of the constraints needed to create a real database. There are constraints that can be placed on columns, on tables, even on specific types with some systems, and there are *database* constraints (when the system is in condition X, don't allow Y, etc...). Triggers are not the optimal solution for database-level constraints, but that is how most SQL systems are limited.
      
      Yes, I agree that SQL systems are limited, by design. The world has not seen a truly relational DBMS, with the possible exception of Dataphor [alphora.com] (which I dislike for other reasons). The relational model is a logical model. It doesn't "care" which implementation you use. DBMS's are just applications. There is no reason you can't do one in Java. It's not about the language, but about the concept. The relational model offers logical advantages that cannot be found in ANY other system. If any other system achieved those advantages, it would become de facto relational. End of story.
      
      No, the point is being missed by a great many people. This whole thread here really is about physical storage optimization, and running in memory. There is no reason a relational DBMS can't run in memory. That is simply an implementation detail. If the Prevayler application framework achieved the level of logical relational operators and constraints of an RDBMS, then it would become an RDBMS. But it seems pretty obvious that this is not its purpose.
      
      I'm glad you like DBDebunk.com. It helped me, see these concepts in much more clarity. Go back and read the site more carefully, and you will see every argument in this Slashdot thread examined and reexamined. Look for the whole logical-physical confusion thing ;-).
      
      This is language-specific, or application-specific.
      
      Oh? Which one, then? Language, or application? Have you read the page?
      
      Yes, it has many languages, but if you try to access your Java Prevayler application data from PHP or Perl, you are going to have to re-implement all your integrity constraints in those languages. That's my point. No longer will you have a single point of control for your business rules, or the logical model of your data.
      
      It's NOT intrinsically language specific, although if the languages you're wanting to use together don't have any way of talking to each other you're out of luck (although I don't see how SQL could help you there).
      
      Huh? This is the most obvious one of all. With SQL, my VB, Java and Perl application could all interact with the same DBMS without needing to talk to each other. Thats a perfect solution in the real world.
      
      Your other problems are real, of course; but they're also very real for any real database, relational or otherwise.
      
      I understand that complex database design is always a difficult thing. And I recognize that there is nothing a relational DBMS can do that can't be re-implemented in application code. But that's my whole point. Separation of operations. Your data should have a single point of control for the logical constraints in the data, and that point of control should be a logical firewall between the data, and everything else that happens in the application.
      
      Also, in the end there is also nothing that an OO DBMS can do that a truly relational DBMS can't. This is what many people don't understand about the true relational model. With the right kind of RDBMS (not the typical SQL ones), there are many more things that are possible than just messing around with base types and simple 3NF normalization. And with updateable views, functions, and logical RULEs (check out PostgreSQL), you can tailor the external access to the application in any way you want, without changing your base table design. This is power.
  - - Re:OO and hierarhcial DB's (Score:3, Interesting)
      
      by rycamor ( 194164 ) writes:
      
      My good friend the anti-oop guy makes a good point. Yes, and OODBMS can also be network-oriented. But the network database was just an attempt to work around the logical limitations of hierarchical storage. The main advantage was that a child node is not limited to one parent. But once again, a relational approach can do that and much more.
      
      Like I said in another reply, NDB's and HDB's are too different in base philosophy from RDB's, and they will never fully get along (barring some big theorical breakthrough).
      
      We don't need a theoretical breakthrough. The relational data model is just straightforward logic, as applied to data storage. What we need is an implementation breakthrough. Even the best SQL systems only implement about 70% of the advantages of the relational model, and suffer from needless performance problems, due to being too tied to physical mapping of tables and views.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

gigabytes? (Score:5, Insightful)

Re:gigabytes? (Score:2, Interesting)

Re:gigabytes? (Score:2)

Re:gigabytes? (Score:2)

Re:gigabytes? (Score:5, Insightful)

Re:gigabytes? (Score:2, Insightful)

Re:gigabytes? (Score:5, Insightful)

Re:gigabytes? (Score:3, Insightful)

Re:gigabytes? (Score:2, Funny)

Re:gigabytes? (Score:3, Funny)

Re:gigabytes? (Score:3, Insightful)

Re:gigabytes? (Score:5, Insightful)

Very large? (Score:2, Interesting)

Slashdotted (Score:5, Funny)

Neat concept... (Score:3, Interesting)

Re:Neat concept... (Score:5, Informative)

What about existing data ? (Score:4, Interesting)

FInally OO? I think and hope not! (Score:5, Informative)

Re:FInally OO? I think and hope not! (Score:3, Interesting)

Re:What about existing data ? (Score:4, Insightful)

OOP (Score:2, Interesting)

Two words... (Score:4, Informative)

Re:Two words... (Score:5, Informative)

Re:Don't go there (Score:3, Informative)

Re:Don't go there (Score:3, Interesting)

Ever looked at object-oriented databases? (Score:5, Informative)

Re:Ever looked at object-oriented databases? (Score:3, Interesting)

Re:Ever looked at object-oriented databases? (Score:3, Informative)

3 issues I see (Score:4, Interesting)

Re:3 issues I see (Score:5, Interesting)

Buggy whips (Score:5, Insightful)

3 More Issues for the Do-It-Yourself Database (Score:5, Informative)

Re:3 More Issues for the Do-It-Yourself Database (Score:3, Interesting)

Interfacing (Score:3, Interesting)

C++ soluton (Score:2, Funny)

Looks like journaling filesystem (Score:2)

Something about this doesn't sit right with me (Score:3, Insightful)

It's not a simple question of speed (Score:4, Insightful)

Blazing fast (Score:4, Funny)

no queries (Score:5, Insightful)

Re:no queries (Score:4, Insightful)

Fine granularity for writes. (Score:3, Informative)

Get best of both worlds... (Score:5, Interesting)

Umm what about multiple servers? (Score:3, Insightful)

You're thinking too general case (Score:3, Insightful)

Sourceforge Link (Score:4, Informative)

OO databases are an evolutionary step...backward (Score:5, Interesting)

OODB are very different from RDBMS (Score:4, Interesting)

Re:OODB are very different from RDBMS (Score:3, Insightful)

Re:OODB are very different from RDBMS (Score:5, Funny)

Re:OO databases are an evolutionary step...backwar (Score:5, Interesting)

Nods head (Score:3, Interesting)

Re:OO databases are an evolutionary step...backwar (Score:3, Insightful)

Speed is not the only factor (Score:4, Interesting)

Re:Speed is not the only factor (Score:3, Insightful)

Memory is CHEAP? (Score:3, Interesting)

This concept is not new (Score:5, Informative)

Old News: Main Memory Databases (Score:3, Insightful)

The Electric Database ACID Test (Score:5, Insightful)

Re:The Electric Database ACID Test (Score:3, Informative)

Not a "database" but a persistence mechanism (Score:3, Insightful)

Problems with Object databases (Score:3, Insightful)

Interoperability, Scalability (Score:3, Interesting)

Database System vs Database Management System (Score:4, Informative)

BS (Score:5, Insightful)

Re:BS (Score:3, Insightful)

This Won't Replace A Database (Score:5, Insightful)

WOW! After twenty years... (Score:3, Funny)

OK (Score:4, Interesting)

MOO (Score:4, Interesting)

race conditions? (Score:4, Insightful)

Congratulations! (Score:3, Informative)

The bad old days (Score:3, Insightful)

One word ...er..acronym: (Score:2)

Re:One word ...er..acronym: (Score:3)

Re:RAM ? (Score:4, Insightful)

Re:RAM ? (Score:3, Insightful)

Re:RAM ? (Score:3, Insightful)

Re:RAM ? (Score:3, Interesting)

Re:RAM ? (Score:4, Insightful)